Scraper for the Dublin City Council Planning Application website

dtuite/dcc-scraper

Dublin City Council Planning Application Scraper

This is a web scraper for the Dublin City Council planning website.

It's capable of scraping:

  1. All of the results from a Search of the DCC planning website
  2. In-depth details of individual planning applications
  3. The documents associated with each planning application

It is set up to scrape to a local sqlite3 database and create associations between planning applications and their documents.

It doesn't actually download any documents. It just records their details.

Using the DCC Search

  • To get a search URL that can be scraped, you must perform a search manually, then navigate to the second page and back to the first page. Paste the resulting URL into the src/searchResults/scrape.js file, with the startIndex URL parameter set from the value passed into the function.
  • The format of the Date Registered From field is dd-mmm-YYYY. For example: "01-jan-2019".
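The two points above can be sketched in code. This is only an illustration: the helper names formatDccDate and withStartIndex are hypothetical and not part of this repository, and the example URL is a placeholder rather than a real DCC search URL. It assumes only what is stated above, namely that the date field uses dd-mmm-YYYY and that the search URL carries a startIndex query parameter.

```javascript
// Hypothetical helpers for illustration; they are not part of this repository.
const MONTHS = ['jan', 'feb', 'mar', 'apr', 'may', 'jun',
                'jul', 'aug', 'sep', 'oct', 'nov', 'dec'];

// Format a Date as dd-mmm-YYYY, e.g. "01-jan-2019".
// UTC fields are used so the result does not shift with the local timezone.
function formatDccDate(date) {
  const day = String(date.getUTCDate()).padStart(2, '0');
  const month = MONTHS[date.getUTCMonth()];
  return `${day}-${month}-${date.getUTCFullYear()}`;
}

// Return a copy of a search URL with its startIndex query parameter replaced.
// All other query parameters are left untouched.
function withStartIndex(searchUrl, startIndex) {
  const url = new URL(searchUrl);
  url.searchParams.set('startIndex', String(startIndex));
  return url.toString();
}

// Placeholder URL: substitute the one copied from a manual search.
const example = withStartIndex('https://example.com/search?foo=bar&startIndex=0', 20);
console.log(formatDccDate(new Date(Date.UTC(2019, 0, 1))), example);
```

Building the URL with the standard URL class rather than string concatenation keeps the other search parameters intact, whatever order they appear in.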

Further work

  1. The scraper doesn't attempt to figure out how many pages of results there are in a particular planning application search. The number of pages to paginate is hardcoded.
  2. It doesn't yet record the planning decision.
  3. It's not capable of executing searches automatically. You have to manually run a search then feed the search URL into the code to scrape the results.
  4. Create a CLI with options and flags.
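As a possible starting point for item 1, the hardcoded page count could be replaced by a calculation once the total number of results is known. The sketch below is an assumption-laden illustration: pageStartIndexes is a hypothetical name, and it assumes startIndex is a zero-based offset that advances by one page size per page, which has not been checked against the DCC website.

```javascript
// Hypothetical helper; not part of this repository.
// Assumes startIndex is a zero-based offset that grows by pageSize per page.
function pageStartIndexes(totalResults, pageSize) {
  if (!Number.isInteger(pageSize) || pageSize <= 0) {
    throw new RangeError('pageSize must be a positive integer');
  }
  // Round up so a partially filled last page is still visited.
  const pageCount = Math.ceil(totalResults / pageSize);
  return Array.from({ length: pageCount }, (_, page) => page * pageSize);
}

// 45 results at 20 per page need three requests.
console.log(pageStartIndexes(45, 20)); // [ 0, 20, 40 ]
```

Getting totalResults would still require reading it from the first results page, which the scraper does not currently do.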

Instructions

There is no CLI yet. You have to manually edit the src/index.js file to tell the scraper what to do. Read the code and you'll get it pretty quickly.

Run it like this:

node src/index.js

Instructions for Lauren

Open terminal

cd /Users/laurennorton/Desktop/dcc-scraper-master

open the
