Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML
-
Updated
Sep 12, 2025 - Python
Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML
Generation of bilingual dictionaries from Wiktionary/dbnary data for the WikDict project
PhiloLogic4
TEI-IIIF converts TEI-XML into conformant IIIF Annotation manifests.
Convert bibliographic meta data in MODS format to TEI headers
PhiloLogic5
Text Embeddings Inference (TEI)'s unofficial python wrapper library for batch processing with asyncio
convert a repurposed subset of markdown syntax to valid TEI XML designed to be compatible with the IGNTP specifications
High-quality congressional bill search with hybrid BM25+vector similarity using DuckDB, TEI embeddings, and GovInfo API. Local deployment with Docker.
Computational linguistics project for university. The file named readme is written in Italian.
A django project to deal with dictionaries which were once encoded in TEI
AnnotationTool for TEI XML
Add a description, image, and links to the tei topic page so that developers can more easily learn about it.
To associate your repository with the tei topic, visit your repo's landing page and select "manage topics."