I made this repository to hold text files of The Adventure Zone's transcripts, putting the data into a more usable form for analysis. Mainly, I wanted to practice some Natural Language Processing techniques using the words from a show that I greatly enjoy.
Original transcript files were taken from https://tazscripts.tumblr.com/tazscripts and edited by me into the text file format found in this repository. Each file has been formatted so that every line contains a single colon (:). To the left of each colon is the name of the person/character/entity credited as speaking the line on the podcast. To the right of each colon is the text of what was said, along with staging notes, typically enclosed in square brackets ([]).
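As a rough illustration of that line format, here is a minimal Python sketch that splits one line into speaker, spoken text, and staging notes. The function name `parse_line` and the sample line are hypothetical, not taken from the repository, and the sketch assumes staging notes never nest brackets.

```python
import re


def parse_line(line):
    """Split a 'Speaker: text [note]' line into its parts.

    Hypothetical helper: assumes the first colon separates the speaker
    from the text, and that staging notes sit in un-nested [brackets].
    """
    speaker, sep, text = line.partition(":")
    if not sep:
        raise ValueError("expected a colon in the line")
    # Collect the contents of every [bracketed] staging note.
    notes = re.findall(r"\[([^\]]*)\]", text)
    # Remove the notes (and the whitespace around them) from the spoken text.
    spoken = re.sub(r"\s*\[[^\]]*\]\s*", " ", text).strip()
    return speaker.strip(), spoken, notes


# Made-up example line, for illustration only.
speaker, spoken, notes = parse_line("Griffin: Hello there [laughs] friends")
```

Splitting on the first colon with `str.partition` keeps the sketch safe even if a line ever turned out to contain a second colon inside the spoken text.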
Each file corresponds to a single episode of the podcast. Seasons of the podcast are split into separate folders.
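Given that layout, a season can be processed by looping over the files in its folder. The sketch below counts how many lines each speaker has across one folder. It assumes the episode files end in `.txt` and are UTF-8 encoded, neither of which is confirmed here, and `count_speaker_lines` is a hypothetical name.

```python
from collections import Counter
from pathlib import Path


def count_speaker_lines(folder):
    """Count lines per speaker across every episode file in a folder.

    Assumptions (not confirmed by the repository): episode files use
    the .txt extension and are UTF-8 encoded.
    """
    counts = Counter()
    for path in sorted(Path(folder).glob("*.txt")):
        with path.open(encoding="utf-8") as handle:
            for line in handle:
                # The speaker is whatever precedes the first colon.
                speaker, sep, _ = line.partition(":")
                if sep:  # skip any line without a colon, e.g. a blank line
                    counts[speaker.strip()] += 1
    return counts
```

For example, `count_speaker_lines("balance").most_common(5)` would list the five most frequent speakers in that folder, if the assumptions above hold.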
The repository is not yet complete. I will update it as I complete each arc of the show. I plan to start with just the Balance campaign, but will likely continue to update with Amnesty, Graduation, and Ethersea.
Stored in the "balance" folder. See the "taz_balance_eps.tsv" file for metadata related to each Balance episode, not including live shows.
- Here There Be Gerblins - Complete
- Moonlighting - Complete
- Murder on the Rockport Limited - Complete
- Lunar Interlude I: Carnival Chaos - Complete
- Petals to the Metal - Complete
- Lunar Interlude II - Complete
- Crystal Kingdom - Complete
- Lunar Interlude III - Complete
- The Eleventh Hour - Complete
- Lunar Interlude IV - Complete
- The Suffering Game - Complete
- Lunar Interlude V - Complete
- The Stolen Century - Complete
- Story and Song: Finale - Complete
Stored in the "amnesty" folder. See the "taz_amnesty_eps.tsv" file for metadata related to each Amnesty episode, not including live shows.
- Setup Episode - Complete
- The Beast Arc - Complete
- The Water Monster Arc - Complete
- Interlude 1 - Complete
- The Calamity Arc - Complete
- Interlude 2 - Complete
- The Countenance Arc - Complete
- Interlude 3 - Complete
- The Final Arc - Complete
I've provided this data with the hope that it will be fun for others. That said, if you use this data for any kind of presented work (a blog or social media post, for instance), please cite this repository and give credit to the original creators (The McElroys) and the MaximumFun podcast network. Also, if you do not already support MaximumFun, please consider doing so. You can find information on supporting the MaximumFun network here.
The text files were created by first scraping HTML files and then editing them by hand. As such, there is a chance of small errors. If I detect any, I will update the files with those errors fixed. Thanks for your understanding!
Thanks to Eliot McKinley, https://twitter.com/etmckinley, for his help identifying typos and inconsistencies!