The most accurate natural language detection library for Python, suitable for short text and mixed-language text
-
Updated
Nov 21, 2025 - Python
The most accurate natural language detection library for Python, suitable for short text and mixed-language text
💬 Language Identification with Support for More Than 2000 Labels -- EMNLP 2023
Implementation of the paper "Spoken Language Recognition using X-vectors" in Pytorch
A TensorFlow-based spoken language identification
Faster, modernized fork of the language identification tool langid.py
End-to-end spoken language identification out of the box.
📚 Сборник полезных штук из Natural Language Processing: Определение языка текста, Разделение текста на предложения, Получение основного содержимого из html документа
Simple, yet fast, Python scripts to read Kaldi NNet3 models and compute bottleneck features
Source code for: Efficient Self-supervised Learning Representations for Spoken Language Identification
🎏🎌 language recognition script implemented using basic algorithms and spaghetti code
Suite of Python modules to recognise the language of a file
A single-layer neural network written from scratch that predicts the language of the text.
Modify Markdown fenced code blocks to contain the language name by detecting it from the block contents.
The European Parliament Proceedings Parallel Corpus (1996-2011) (https://www.statmt.org/europarl/) is a well-known dataset in Natural Language Processing tasks, it contains proceedings of the European Parliament in 21 European languages. In this project we will only extract data from 6 languages (German, French, Spanish, Italian, Polish and Eng…
End to End Dialect Identification using Convolutional Neural Network
too Simple Language Recognition
Add a description, image, and links to the language-recognition topic page so that developers can more easily learn about it.
To associate your repository with the language-recognition topic, visit your repo's landing page and select "manage topics."