🌿 An easy-to-use Japanese Text Processing tool, which makes it possible to switch tokenizers with small changes of code.
Updated May 15, 2024 - Python
Aims to make JapaneseTokenizer as easy to use as possible
Korean NLP Python Library for Economic Analysis
Mozc for Python: Kana-Kanji converter
A Python script for adding furigana to Japanese epub books using MeCab and UniDic.
Korean text data preprocessing toolkit for NLP
Converting Mozc dictionary to MeCab dictionary for Kana-Kanji conversion (KKC)
BERT models with tokenization for Japanese texts.
Natural language processing for Japanese
Example usage of the Python wrappers for the MeCab Japanese parser on macOS.
Generates plain or tokenized text files from Aozora Bunko