Skip to content

Latest commit

 

History

History
2 lines (2 loc) · 154 Bytes

File metadata and controls

2 lines (2 loc) · 154 Bytes

Chinese-charectar-tokenizer-with-nltk-regexp-tokenizer

a nltk tool for tokenizing chinese sentence. the sentence is mixed with english words, numbers.