This repo contains a single book in different representations:
tei
: TEI (our source, from the DBNL)tf
: Text-Fabricpd
: Pandas
The conversion TEI to TF is done with programs/tfFromTei.py.
The conversion TF to Pandas is done with export.ipynb.
For an example of how to use the Pandas file, see programs/pandas.ipynb.
See docs for documentation about provenance and encoding.
If you want to reproduce or make your own computations, pip-install the following Python modules
pip install text-fabric pandas pyarrow
Also, clone this repo, preferably in your
~/github/annotation
directory.
See about for the authors/editors of the data.
Dirk Roorda is the author of the representation in Text-Fabric of the data, and the programs and docs.