Scrabby

Scrabby is a Ruby-based tool that collects data from the Harry Potter Wiki, covering characters, potions, and spells. It powers the Potter DB by keeping its magical database up to date with reliable, well-structured information from the wizarding world.

How it works

Scrabby performs the following spellbinding tasks:

Data scraping: Scrabby collects data from the Harry Potter Wiki using Nokogiri. It focuses on characters, potions, and spells.
Data transformation: Scrabby then organizes the data into a clean and easy-to-use format, using CSV, so it's ready to use right away.
Data storage: Scrabby saves the transformed into separate CSV files in the data/ folder, making it easy to analyze or use in other projects.

Setup

You usually don’t need to do anything. Scrabby automatically scrapes and updates the data once a month using GitHub Actions. But if you want to run the scrapers yourself, just follow these simple steps:

1. Clone the repository

git clone https://github.com/danielschuster-muc/scrabby.git && cd scrabby

2. Install ruby

Ensure you have Ruby 3.4.5 installed on your system.

rbenv install 3.4.5

3. Install dependencies

bundle install

4. Run scrapers

Execute the following commands to manually trigger the scrapers for characters, potions, and spells:

bundle exec rake scrabby:characters
bundle exec rake scrabby:potions
bundle exec rake scrabby:spells

5. Output

The fresh scraped data will be saved to data/*.csv.

License

This project is licensed under the terms of the MIT license. See the LICENSE file.

Data is scraped from the Harry Potter Wiki and therefore licensed under CC-BY-SA unless otherwise stated. For specific details, please refer to the URLs linking to the corresponding wiki pages in the data files.

Name		Name	Last commit message	Last commit date
Latest commit History 225 Commits
.github/workflows		.github/workflows
data		data
scrapers		scrapers
.rubocop.yml		.rubocop.yml
.ruby-version		.ruby-version
Gemfile		Gemfile
Gemfile.lock		Gemfile.lock
LICENSE		LICENSE
README.md		README.md
Rakefile		Rakefile

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Scrabby

How it works

Setup

1. Clone the repository

2. Install ruby

3. Install dependencies

4. Run scrapers

5. Output

License

About

Uh oh!

Uh oh!

Contributors 2

Uh oh!

Languages

License

danielschuster-muc/scrabby

Folders and files

Latest commit

History

Repository files navigation

Scrabby

How it works

Setup

1. Clone the repository

2. Install ruby

3. Install dependencies

4. Run scrapers

5. Output

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Uh oh!

Contributors 2

Uh oh!

Languages