A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
-
Updated
Aug 12, 2025 - Python
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
The most accurate natural language detection library for Python, suitable for short text and mixed-language text
Simple embedding based text classifier inspired by fastText, implemented in tensorflow
Textpipe: clean and extract metadata from text
⚡️ 80x faster Fasttext language detection out of the box | Split text by language
Simple multilingual lemmatizer for Python, especially useful for speed and efficiency
💬 Language Identification with Support for More Than 2000 Labels -- EMNLP 2023
Code for the paper Language Identification Using Deep Convolutional Recurrent Neural Networks
Implementation of the paper "Spoken Language Recognition using X-vectors" in Pytorch
A TensorFlow-based spoken language identification
Faster, modernized fork of the language identification tool langid.py
This code provides word level language identification tool for identifying language for individual words in Code-Mixed text. e.g. The text that includes words from two languages such as Hindi written in roman script, mixed with English.
End to End Dialect Identification using Convolutional Neural Network
End-to-end spoken language identification out of the box.
fastlangid, the only language identification package that support cantonese (zh-yue), simplified (zh-hans) and traditional chinese (zh-hant)
Spoken Language Identification on Common Voice and AudioSet using Deep Learning
Babel Street Analytics Client Library for Python
Targetted language identifier, based on FastText and Hunspell.
AfroLID, a powerful neural toolkit for African languages identification which covers 517 African languages.
Multi-Langauge Identification
Add a description, image, and links to the language-identification topic page so that developers can more easily learn about it.
To associate your repository with the language-identification topic, visit your repo's landing page and select "manage topics."