Skip to content

ajcerejeira/base.gov.pt

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

base.gov.pt

A crawler that fetches data from http://www.base.gov.pt/.

Spiders

Usage

Download the spider and its dependencies:

git clone 'https://github.com/ajcerejeira/base.gov.pt.git'
cd base.gov.pt/
pip install -r requirements.txt

And then run the desired spider:

scrapy crawl get_contracts

This will generate the following files:

  • contracts.csv - main table, containing the most important info regarding the contracts
  • contestants.csv
  • invitees.csv
  • documents.csv
  • places.csv

Please be patient, since it takes some hours before it completes (on my machine it took about 26 hours to finish gathering all data).

About

A crawler that fetches data from base.gov.pt

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages