The GregoBaseCorpus is a corpus of Gregorian chant for computational musicology.
It is a research-friendly version of GregoBase, a
large database of chant transcriptions in the
GABC format.
This format can be loaded into music21
, a Python
toolkit for computational musicology, using the library
chant21
.
> Download the latest release of the GregoBaseCorpus
> Check out chant21 and the CantusCorpus, a related plainchant corpus.
The GregoBaseCorpus (the collection of .gabc
and .csv
files)
is released under a CC0-1.0
license, just like GregoBase itself.
The Python code used to generate the corpus (that is, the code in the src/
directory) is released under an MIT license.
The GregoBase Corpus is created automatically from database dumps of GregoBase. If you just want to use the corpus, you don't have to regenerate it yourself: simply download the latest release and you're good to go. But if you want to regenerate the corpus yourself, you can of course do so: read on....