Selenium Twitter Scraper

This twitter scraper use selenium to crawl data from twitter without authentication.

Feature

Use Redis to deal with duplicate crawel
Use Mysql to store data
Use python database ORM SQLalchemy

Install

If you want to use latest version, install from source. To install twitter-scraper from source, follow these steps:

Linux and macOS:

git clone git@github.com:FmKnight/Selenium-Twitter-Scraper.git
cd Selenium-Twitter-Scraperhttps://github.com/FmKnight/Selenium-Twitter-Scraper
pip3 install -r requirements.txt

Run

1、crawel tweets

tweet_craweler.py : run this py file to get specific keywords tweets.Contain following fields:

user_name
user_id
date
content
reply
retweet
like

2、crawel user info

user_info_craweler.py: run this py file to get specific user's info.Contain following fields:

following
followers

Result

Change Log

v0.6.3(2021/04/22 15:50)

change tweet duplicate detection way from user_id+time to sha256 digest of tweet content
add logs to monitor running process

v0.6.2(2021/04/21 21:50)

change crawl way from one-time to time-span-based
refactor the running process,add more condition judge

v0.6.1(2021/04/20 21:50)

Crawel tweets of specific keywords
Crawel specific user's info

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
craweler		craweler
db_settings		db_settings
utils		utils
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Selenium Twitter Scraper

Feature

Install

Run

1、crawel tweets

2、crawel user info

Result

Change Log

v0.6.3(2021/04/22 15:50)

v0.6.2(2021/04/21 21:50)

v0.6.1(2021/04/20 21:50)

About

Releases

Packages

Languages

FmKnight/Selenium-Twitter-Scraper

Folders and files

Latest commit

History

Repository files navigation

Selenium Twitter Scraper

Feature

Install

Run

1、crawel tweets

2、crawel user info

Result

Change Log

v0.6.3(2021/04/22 15:50)

v0.6.2(2021/04/21 21:50)

v0.6.1(2021/04/20 21:50)

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages