Please read comments in each file to use.
-
To build a crawler to get job infos from rocketpuch.com, a startup community.
- because I am looking for job.
-
To practice my skill.
- a good practice to code often.
-
Just for fun with productive time.
- crawler.py
- jobsdao.py
- jobsdao_mongo.py
- config.py
- 308 job posts / 25 minutes on AWS t2.micro
- Python 3.x
- MySQL
- MongoDB
- chromedriver
- pyvirtualdisplay
- selenium
- requests
- pymysql
- pymongo
- json
- re
- speed is slower than supposed
- as infos remain in DB, suggest to make web app dashboard
- may good to put search ability in it