Project for Advanced Database Systems at AGH University (Kraków)

Jandrov/ads19-crawler

Advanced Database Systems

Project "Crawler"

Create a script that, for a given website, generates a graph in Neo4j of all (or a reasonable number of) its subpages and the links between them. Each node should contain the page's URI, its title, and a summary of its content (extracted from the h* tags, the title, etc.).
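The task above could be sketched roughly as follows. This is a minimal illustration using only the Python standard library, not the team's actual implementation: the names `PageParser`, `crawl`, and `to_cypher`, the `fetch` parameter, the `Page` label, and the `LINKS_TO` relationship type are all assumptions made for this example.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin, urlparse


class PageParser(HTMLParser):
    """Collects the title, h1-h6 headings, and href targets of one page."""

    HEADINGS = {"h1", "h2", "h3", "h4", "h5", "h6"}

    def __init__(self):
        super().__init__()
        self.title = ""
        self.headings = []
        self.links = []
        self._current = None  # tag whose text is currently being captured
        self._buffer = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)
        elif tag == "title" or tag in self.HEADINGS:
            self._current = tag
            self._buffer = []

    def handle_data(self, data):
        if self._current:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == self._current:
            text = " ".join("".join(self._buffer).split())
            if tag == "title":
                self.title = text
            elif text:
                self.headings.append(text)
            self._current = None


def crawl(start_url, fetch, max_pages=50):
    """Breadth-first crawl restricted to the host of start_url.

    `fetch` is a caller-supplied function (hypothetical) that maps a URL
    to its HTML text, or returns None when the page cannot be retrieved.
    Returns (pages, edges): node properties keyed by URI, and a sorted
    list of (source, target) links between fetched pages.
    """
    start_url = urldefrag(start_url).url
    host = urlparse(start_url).netloc
    pages, edges = {}, set()
    queue, seen = deque([start_url]), {start_url}
    while queue and len(pages) < max_pages:
        url = queue.popleft()
        html = fetch(url)
        if html is None:
            continue
        parser = PageParser()
        parser.feed(html)
        pages[url] = {
            "uri": url,
            "title": parser.title,
            "summary": " | ".join(parser.headings),
        }
        for href in parser.links:
            target = urldefrag(urljoin(url, href)).url
            if urlparse(target).netloc != host:
                continue  # skip external links, mailto:, etc.
            edges.add((url, target))
            if target not in seen:
                seen.add(target)
                queue.append(target)
    # Keep only links whose target was actually fetched.
    return pages, sorted((s, t) for s, t in edges if t in pages)


# Assumed graph model: (:Page {uri, title, summary})-[:LINKS_TO]->(:Page)
NODE_QUERY = "MERGE (p:Page {uri: $uri}) SET p.title = $title, p.summary = $summary"
EDGE_QUERY = (
    "MATCH (a:Page {uri: $source}), (b:Page {uri: $target}) "
    "MERGE (a)-[:LINKS_TO]->(b)"
)


def to_cypher(pages, edges):
    """Builds (query, parameters) pairs: all nodes first, then all links."""
    statements = [(NODE_QUERY, page) for page in pages.values()]
    statements += [(EDGE_QUERY, {"source": s, "target": t}) for s, t in edges]
    return statements
```

Each `(query, parameters)` pair could then be passed to a Neo4j session, for example with `session.run(query, parameters)` from the official Python driver. Using `MERGE` rather than `CREATE` means re-running the script on the same site should not duplicate nodes or links. The `max_pages` limit is one simple way to honour the "reasonable number of subpages" requirement.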

Team members

Szymon Majkut
Maciej Rajs
Bartosz Tyński
Alejandro Sanchez Sanz

Division of labour

  • Together we established the schema and structure of the project, and we divided labour
  • Szymon prepared the basic algorithm of the Crawler and implemented those cells
  • Alejandro prepared the algorithm of the Neo4j part and implemented those cells
  • Bartosz and Maciek helped improve the Crawler part after some tests and remarks
  • Together we wrote the wiki
