Text data of KonoSuba: God's Blessing on This Wonderful World! Light Novel Volume 1 to 17 + short stories (English fan translation).
Note:
Most of the unrelated metadata/TL note have been removed.
This might have accidentally removed some lines from the light novel, but the damage should be minimal.
Feel free to create an issue if there are some lines that have been accidentally removed.
KonoSuba: God's Blessing on This Wonderful World!, often referred to simply as KonoSuba, is a Japanese light novel series written by Natsume Akatsuki. The series follows Kazuma Satou, a boy who is sent to a fantasy world with MMORPG elements following his death, where he forms a dysfunctional adventuring party with a goddess, an archwizard, and a crusader.
Source: https://en.wikipedia.org/wiki/KonoSuba
Download the files below.
File | Lines | Size | Description |
---|---|---|---|
konosuba.txt |
47573 | 4.5MB | 17 volumes of KonoSuba light novel condensed into 1 file. Both dialogue and monologue are included. |
konosuba-dialogue.txt |
18689 | 2.3MB | Contains only dialogues in between quotes (“” ). Monologue is excluded. |
Shameless self-plug:
- Wanna make a Markov chain random sentence generator? Check out
aqua
. - Wanna make a AI chatbot? Check out
kazuma
.
If you want to manually generate the data yourself, I recommend using a proxy/VPN before running the webscraper.
Clone the project.
git clone https://github.com/MarsRon/konosuba-data
Create a Python virtual environment.
python3 -m venv venv
source venv/bin/activate
Install libraries.
pip install -r requirements.txt
Run the webscraper.
python scrape.py
This will create a ./data
directory which temporarily stores each chapter from Volume 1 to Volume 17 in text form.
Then, the script will merge all the posts into konosuba.txt
and also generate konosuba-dialogue.txt
only from speeches.
The data is scraped from cgtranslations.me and crimsonmagic.me.
Distributed under the MIT License.
See LICENSE.md
for more information.
MarsRon - marsron204@gmail.com - marsron.name.my