You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
- See more contributions here https://github.com/wannaphongcom/pythainlp/graphs/contributors
64
+
65
+
66
+
# References
67
+
68
+
-**[Maximum Matching]** -- Manabu Sassano. Deterministic Word Segmentation Using Maximum Matching with Fully Lexicalized Rules. Retrieved from http://www.aclweb.org/anthology/E14-4016
69
+
-**[MetaSound]** -- Snae & Brückner. (2009). Novel Phonetic Name Matching Algorithm with a Statistical Ontology for Analysing Names Given in Accordance with Thai Astrology. Retrieved from https://pdfs.semanticscholar.org/3983/963e87ddc6dfdbb291099aa3927a0e3e4ea6.pdf
70
+
-**[Thai Character Cluster]** -- T. Teeramunkong, V. Sornlertlamvanich, T. Tanhermhong and W. Chinnan, “Character cluster based Thai information retrieval,” in IRAL '00 Proceedings of the fifth international workshop on on Information retrieval with Asian languages, 2000.
71
+
- เพ็ญศิริ ลี้ตระกูล. การเลือกประโยคสำคัญในการสรุปความภาษาไทย โดยใช้แบบจำลองแบบลำดับชั้น (Selection of Important Sentences in Thai Text Summarization Using a Hierarchical Model). Retrieved from http://digi.library.tu.ac.th/thesis/st/0192/
PyThaiNLP is a Python library for natural language processing (NLP) of Thai language.
14
12
15
-
## What's new in PyThaiNLP 1.7 ?
13
+
PyThaiNLP features include Thai word and subword segmentations, soundex, romanization, part-of-speech taggers, and spelling corrections.
14
+
15
+
## What's new in version 1.7 ?
16
16
17
17
- Deprecate Python 2 support
18
18
- Refactor pythainlp.tokenize.pyicu for readability
@@ -22,22 +22,20 @@ PyThaiNLP is a Python library for natural language processing (NLP) of Thai lang
22
22
- Add ULMFit utility to PyThaiNLP
23
23
- Add Thai romanization model thai2rom
24
24
- Retrain POS-tagging model
25
-
-Improve word tokenize (newmm,mm) and dict_word_tokenize
25
+
-Improved word_tokenize (newmm,mm) and dict_word_tokenize
26
26
- Documentation added
27
27
28
28
## Install
29
29
30
-
```
30
+
```sh
31
31
pip install pythainlp
32
32
```
33
33
34
-
**Note for Windows**: marisa-trie wheels can be obtained from https://www.lfd.uci.edu/~gohlke/pythonlibs/#marisa-trie , then install it with pip, for example: `pip install marisa_trie‑0.7.5‑cp36‑cp36m‑win32.whl`
35
-
36
-
37
-
38
-
Docs : https://thainlp.org/pythainlp/docs/1.7/
39
-
40
-
GitHub : https://github.com/PyThaiNLP/pythainlp
34
+
**Note for Windows**: `marisa-trie` wheels can be obtained from https://www.lfd.uci.edu/~gohlke/pythonlibs/#marisa-trie
35
+
Install it with pip, for example: `pip install marisa_trie‑0.7.5‑cp36‑cp36m‑win32.whl`
0 commit comments