Webster

Overview

Webster is a powerful and extensible web crawling framework for Node.js applications. You can use Webster to crawl websites and extract structured data from their pages.

What sets Webster apart from other crawling frameworks is that it can scrape content rendered in the browser by client-side JavaScript and Ajax requests.

Docker quick start

Pull the example Docker image and start a container:

docker pull zhuyingda/webster-demo
docker run -it zhuyingda/webster-demo

Inside the container, run this simple demo, which crawls this sample site (the same demo site used by the Scrapy framework):

node demo_producer.js
env MOD=debug node demo_consumer.js
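The demo is split into a producer script and a consumer script, and Redis is listed as a requirement, which suggests a producer/consumer layout around a shared queue. The following is a minimal in-memory sketch of that pattern only. It is not Webster's API: `TaskQueue`, `produce`, `consume`, and the `fetchPage` callback are hypothetical names introduced for illustration, and a plain array stands in for Redis.

```javascript
// Hypothetical in-memory stand-in for a shared task queue (not Webster's API).
class TaskQueue {
  constructor() {
    this.tasks = [];
  }
  push(task) {
    this.tasks.push(task);
  }
  // Returns the oldest task, or undefined when the queue is empty.
  pop() {
    return this.tasks.shift();
  }
  get size() {
    return this.tasks.length;
  }
}

// Producer side: enqueue one crawl task per URL and report the queue length.
function produce(queue, urls) {
  for (const url of urls) {
    queue.push({ url });
  }
  return queue.size;
}

// Consumer side: drain the queue, passing each URL to a caller-supplied
// async fetch function and collecting the results in order.
async function consume(queue, fetchPage) {
  const results = [];
  let task;
  while ((task = queue.pop()) !== undefined) {
    results.push({ url: task.url, body: await fetchPage(task.url) });
  }
  return results;
}
```

Keeping the two sides apart means tasks can be queued once and processed later, possibly by several consumers; in a real deployment the queue would live in an external store rather than in process memory.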

Requirements

Node.js 10.x or later, Redis
Works on Linux and macOS

Or you can deploy on Docker.
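To confirm the Node.js requirement before installing, a small check like the one below can help. This is a sketch, not part of Webster: `meetsMinimumNode` is a hypothetical helper, and it inspects only the major version number.

```javascript
// Returns true when a Node.js version string such as "v10.16.0"
// has a major version of at least minMajor.
function meetsMinimumNode(version, minMajor = 10) {
  const major = parseInt(version.replace(/^v/, '').split('.')[0], 10);
  return Number.isInteger(major) && major >= minMajor;
}

// Check the running interpreter and set a failing exit code if it is too old.
if (!meetsMinimumNode(process.version)) {
  console.error(`Node.js 10.x or later is required, found ${process.version}`);
  process.exitCode = 1;
}
```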

Install

npm install webster

Usage on Raspbian Platform

sudo apt install chromium-browser chromium-codecs-ffmpeg
env MOD=debug EXE_PATH=/usr/bin/chromium-browser node demo_consumer.js

Architecture overview

Documentation

See the documentation here for more details.

Contributors

Code Contributors

This project exists thanks to all the people who contribute. [Contribute].

Financial Contributors

Become a financial contributor and help us sustain our community. [Contribute]

Individuals

Organizations

Support this project with your organization. Your logo will show up here with a link to your website. [Contribute]

License

GPL-V3
