Fix the usage of nltk bug #1515

joey12300 · 2021-12-27T09:42:04Z

PR types

Bug fixes

PR changes

APIs

Description

Specify the correct version of nltk in README.md
Fix the nltk punkt download

ZeyuChen · 2021-12-27T09:43:49Z

paddlenlp/transformers/tokenizer_utils.py

@@ -1641,6 +1641,8 @@ def _tokenize(self, text, is_sentencepiece=True):
        text = convert_to_unicode(text)
        text = " ".join(text.split())  # remove duplicate whitespace
        nltk = try_import('nltk')


怎么可以再关键函数上反复try import呢，这些都得在初始化阶段去做

已将nltk imort放到__init__函数中

ZeyuChen · 2021-12-29T15:41:35Z

nltk 在下载模型的时候会很慢很卡，这个地方是否评估过了？@joey12300

joey12300 · 2021-12-30T02:43:30Z

nltk 在下载模型的时候会很慢很卡，这个地方是否评估过了？@joey12300

这里打开代理后下载就几秒，但是关了代理就要五六分钟，也没有输出进度条像是hang住一样。我把这条命令单独拿出来在README上说明一下

ZeyuChen · 2021-12-30T08:26:11Z

好的，争取今天内合入。

wawltor

LGTM

avoid import nltk==3.6.6 bug

a1d27c3

ZeyuChen reviewed Dec 27, 2021

View reviewed changes

move nltk initialization to __init__

c848db5

refine doc

05e80c0

joey12300 mentioned this pull request Dec 30, 2021

PaddleNLP 2.2.3 Release Note Candidate #1509

Closed

wawltor approved these changes Dec 30, 2021

View reviewed changes

wawltor merged commit 128a7e5 into PaddlePaddle:develop Dec 30, 2021

joey12300 mentioned this pull request Jan 20, 2022

PaddleNLP 2.2.4 Release Note Candidate #1614

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix the usage of nltk bug #1515

Fix the usage of nltk bug #1515

joey12300 commented Dec 27, 2021

ZeyuChen Dec 27, 2021

joey12300 Dec 27, 2021

ZeyuChen commented Dec 29, 2021

joey12300 commented Dec 30, 2021

ZeyuChen commented Dec 30, 2021

wawltor left a comment

Fix the usage of nltk bug #1515

Fix the usage of nltk bug #1515

Conversation

joey12300 commented Dec 27, 2021

PR types

PR changes

Description

ZeyuChen Dec 27, 2021

Choose a reason for hiding this comment

joey12300 Dec 27, 2021

Choose a reason for hiding this comment

ZeyuChen commented Dec 29, 2021

joey12300 commented Dec 30, 2021

ZeyuChen commented Dec 30, 2021

wawltor left a comment

Choose a reason for hiding this comment