Fix tokenizer bug (#893)

* fix unified transformer dtype problem * fix win dtype bug * Fix plato-2 and plato-mini dtype bug * Fix plato-2 tokenization * Refine some doc * Add general k support for topk sampling * fix seed * minor fix * Fix unitransformer readme * topk kernel optimization * add unimo model and fix generate api * add 3 datasets for unimo-text * fix tokenizer bug Co-authored-by: Jiaqi Liu <liujiaqi06@baidu.com> Co-authored-by: liu zhengxi <380185688@qq.com>
PaddlePaddle · Aug 17, 2021 · 7e098f1 · 7e098f1
1 parent 740f5e2
commit 7e098f1
Showing 1 changed file with 1 addition and 1 deletion.
diff --git a/paddlenlp/transformers/unimo/tokenizer.py b/paddlenlp/transformers/unimo/tokenizer.py
@@ -75,7 +75,7 @@ class UNIMOTokenizer(PretrainedTokenizer):
             "unimo-text-1.0":
             "https://paddlenlp.bj.bcebos.com/models/transformers/unimo/unimo-text-1.0-vocab.txt",
             "unimo-text-1.0-large":
-            "https://paddlenlp.bj.bcebos.com/models/transformers/unimo/unimo-text-1.0-vocab-large.txt",
+            "https://paddlenlp.bj.bcebos.com/models/transformers/unimo/unimo-text-1.0-large-vocab.txt",
         }
     }
     pretrained_init_configuration = {