This repo contains the data scrapers for the Police Data Accessibility Project. Thank you for being here!
Scrapers are one piece of the puzzle of making police data more accessible. For a broader look at the work we're doing, and where our attention is focused, start with the docs: https://docs.pdap.io/ and the roadmap: https://github.com/orgs/Police-Data-Accessibility-Project/projects/17
Right now, running a scraper requires some Python knowledge and patience. We're in the early stages: there's no automated scraper farm or fancy GUI yet.
- Install Python.
- Clone this repo.
- Find the scraper you wish to run. These are sorted geographically, so start by looking in /USA/...
- Run the scraper.py file with something like python3 <scraper path>, depending on how you installed Python.
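The steps above can be sketched in Python, for anyone who would rather list the available scrapers before picking one. This is only a rough sketch, not part of the repo: the helper names find_scrapers and run_scraper are made up for illustration, and it assumes every scraper lives in a file named scraper.py somewhere under the USA folder of your clone.

```python
import subprocess
import sys
from pathlib import Path


def find_scrapers(repo_root):
    """Return every scraper.py found under <repo_root>/USA, sorted by path.

    Hypothetical helper: assumes scrapers are files named scraper.py.
    """
    usa_dir = Path(repo_root, "USA")
    if not usa_dir.is_dir():
        return []
    return sorted(usa_dir.rglob("scraper.py"))


def run_scraper(scraper_path):
    """Run one scraper with the current Python interpreter.

    Equivalent to `python3 <scraper path>`; returns the process exit code.
    """
    result = subprocess.run([sys.executable, str(scraper_path)])
    return result.returncode
```

From the root of your clone, calling find_scrapers(".") lists the candidates, and passing one of the returned paths to run_scraper runs it. A non-zero exit code means the scraper failed. Some scrapers may need extra packages installed before they run.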
If it worked, discuss your findings in our Discord. If it didn't, make an issue in this repo or reach out in Discord.
To write a scraper, start with CONTRIBUTING.md. Be sure to check out the /common folder!
For everything else, start with docs.pdap.io.
The data sources listed here are our to-do list. If we should be targeting a new data type, suggest it in Discord or make a DoltHub PR!
Potentially useful tools. If you find something useful, or if one of these is out of date, make a PR!