Add Arabic diacritics (tashkeel/harakat) using Rust/Python/C++/WASM and NLP models
Updated Oct 4, 2025 - Rust
A versatile library offering utility functions for processing and transforming Arabic text. Can be used in Node.js and the browser.
Rababa, the diacritization library for Arabic and Hebrew (Abjad scripts in general)
Uses natural language processing techniques to predict the diacritics of Arabic text.
A versatile library in Java offering utility functions for processing and transforming Arabic text.
The official implementation of CATT Arabic diacritization models.
ABGD converts Arabic text into a structured list of decimal numbers based on traditional Abjad gematrical values. It also supports encoding diacritics (e.g. shadda + fatha → 0.61) using a smart fractional system.
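As a rough illustration of the letter-to-number idea described for ABGD, the sketch below maps Arabic letters to their traditional (Mashriqi-order) Abjad values. It is not ABGD's code or API: the names `ABJAD_VALUES`, `abjad_values`, and `abjad_total` are hypothetical, and the fractional diacritic encoding (such as shadda + fatha → 0.61) is left out because its rules are not given here, so diacritics are simply skipped.

```python
# Traditional Abjad numeral values (Mashriqi order).
# Hypothetical helper for illustration only; not the ABGD library's API.
ABJAD_VALUES = {
    "ا": 1, "ب": 2, "ج": 3, "د": 4, "ه": 5, "و": 6, "ز": 7, "ح": 8, "ط": 9,
    "ي": 10, "ك": 20, "ل": 30, "م": 40, "ن": 50, "س": 60, "ع": 70, "ف": 80, "ص": 90,
    "ق": 100, "ر": 200, "ش": 300, "ت": 400, "ث": 500, "خ": 600, "ذ": 700, "ض": 800, "ظ": 900,
    "غ": 1000,
}


def abjad_values(text):
    """Return the Abjad value of each mapped letter, in order.

    Characters without an entry in ABJAD_VALUES (diacritics, spaces,
    punctuation, letter variants) are skipped in this sketch.
    """
    return [ABJAD_VALUES[ch] for ch in text if ch in ABJAD_VALUES]


def abjad_total(text):
    """Sum the Abjad values of all mapped letters in the text."""
    return sum(abjad_values(text))
```

For example, `abjad_values("\u0627\u0644\u0644\u0647")` (alif, lam, lam, heh) gives `[1, 30, 30, 5]`, which sums to 66.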