Wiktionary dump file parser and multilingual data extractor
Updated Apr 17, 2025 - Python
The last online dictionary CLI framework you need.
A calibre plugin that generates Kindle Word Wise and X-Ray files for KFX, AZW3, MOBI and EPUB eBooks.
A list of ~100,000 German nouns and their grammatical properties, compiled from WiktionaryDE as a CSV file. Plus a module to look up the data and parse compound words.
Python package for Wikimedia dump processing (Wiktionary, Wikipedia, etc.). Wikitext parsing, template expansion, Lua module execution. For data extraction, bulk syntax checking, error detection, and offline formatting.
An open etymology dataset created using Wiktionary data. Contains 3.8M entries, 1.8M terms, 2900 languages, and 31 unique relationship types.
Generation of bilingual dictionaries from Wiktionary/dbnary data for the WikDict project
A Python toolkit for converting pronunciations in the enwiktionary XML dump to cmudict format
Extract data from German Wiktionary XML files.
Code to create a database with cleaned up Wiktionary data and then to create ebook dictionaries based on this data.
Web front end for WikDict dictionaries
Anki add-on to look up vocabulary using Wiktionary
Anki add-on to view and extract info from ZIM files
Code for the paper: Wikinflection: Massive semi-supervised generation of multilingual inflectional corpus from Wiktionary (Metheniti and Neumann, 2018)
A program for creating a searchable local language dictionary based (mainly) on dumped Wiktionary data. Allows the user to collect definitions, which can be exported as a machine-readable flashcard file. Currently supports Latin, Ancient Greek and Old English.
Language files for WordDumb
Scrapes Wiktionary to find cognates
Look up words and pronunciations in Wiktionary
Scraping grapheme-to-phoneme data from Wiktionary
A library for parsing the French Wiktionary