web-crawler

Introdution

A Web crawler, sometimes called a spider or spiderbot and often shortened to crawler, is an Internet bot that systematically browses the World Wide Web, typically for the purpose of Web indexing (web spidering). This is a java application which crawls through specific domain.

Getting Started

Follow below instructions, to get up and running application.

Prerequisites

Installed maven.
Installed JAVA 1.8 or higher version

How to run this application

Before any step, first take clone of this repository.
Import project in your IDE as a maven java project.
Run this below command from terminal by going to respective directory or use IDE feature maven clean install to make build.

Built With

Maven - Dependency Management

Add more feature in future

Add queue in case of Future Task.
Add UI interface in which you just need to type domain URL and you get beautiful HTMl output.
Add database to store results and retrieve on basis of some criterias

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
src		src
.gitignore		.gitignore
README.md		README.md
pom.xml		pom.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

web-crawler

Introdution

Getting Started

Prerequisites

How to run this application

Built With

Add more feature in future

Reference

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

shaileshpandey11/web-crawler

Folders and files

Latest commit

History

Repository files navigation

web-crawler

Introdution

Getting Started

Prerequisites

How to run this application

Built With

Add more feature in future

Reference

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages