Commit 2c82c66

first commit
0 parents  commit 2c82c66

24 files changed, +266466 -0 lines changed

README.md

+24
@@ -0,0 +1,24 @@
+Text matching with BERT's pre-trained Chinese language model
+
+Training script: train.sh
+
+Batch testing script: predict.sh
+
+Script for testing the model on a single sentence: intent.py
+
+chinese_L-12_H-768_A-12 contains the pre-trained model and its vocabulary
+
+The data folder contains the training, validation, and test corpora; the dataset is the official LCQMC data
+
+
+Parameters:
+max_seq_length    maximum sentence length (in characters)
+train_batch_size  batch size
+
+max_seq_length = 50
+eval_accuracy = 0.87207
+test_accuracy = 0.86272
+
+max_seq_length = 40
+eval_accuracy = 0.88093615
+test_accuracy = 0.86256
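
Note on max_seq_length: the README does not show how a sentence pair is fitted into this limit, so the following is only a minimal sketch. It is modeled on the pair-truncation heuristic in the upstream BERT run_classifier.py and may differ from this repository's code. The names `truncate_pair` and `build_pair_input` are hypothetical, and treating each character as one token is a simplifying assumption that stands in for the real WordPiece tokenizer.

```python
def truncate_pair(tokens_a, tokens_b, max_length):
    """Trim the longer list one token at a time until the pair fits."""
    while len(tokens_a) + len(tokens_b) > max_length:
        if len(tokens_a) > len(tokens_b):
            tokens_a.pop()
        else:
            tokens_b.pop()


def build_pair_input(text_a, text_b, max_seq_length):
    """Build padded tokens and segment ids for one sentence pair."""
    # Assumption: one token per character (stand-in for the real tokenizer).
    tokens_a = list(text_a)
    tokens_b = list(text_b)

    # Reserve three positions for [CLS], [SEP], [SEP].
    truncate_pair(tokens_a, tokens_b, max_seq_length - 3)

    tokens = ["[CLS]"] + tokens_a + ["[SEP]"] + tokens_b + ["[SEP]"]
    # Segment 0 covers [CLS] + sentence A + first [SEP]; segment 1 covers the rest.
    segment_ids = [0] * (len(tokens_a) + 2) + [1] * (len(tokens_b) + 1)

    # Pad both sequences up to the fixed length.
    padding = max_seq_length - len(tokens)
    tokens += ["[PAD]"] * padding
    segment_ids += [0] * padding
    return tokens, segment_ids
```

Under this scheme, max_seq_length bounds the combined length of both sentences plus three special tokens, so a value of 40 or 50 truncates only pairs whose combined length exceeds 37 or 47 characters.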

__init__.py

+15
@@ -0,0 +1,15 @@
+# coding=utf-8
+# Copyright 2018 The Google AI Language Team Authors.
+#
+# Licensed under the Apache License, Version 2.0 (the "License");
+# you may not use this file except in compliance with the License.
+# You may obtain a copy of the License at
+#
+#     http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+

__pycache__/modeling.cpython-36.pyc

25.2 KB
Binary file not shown.
3.63 KB
Binary file not shown.

__pycache__/predict.cpython-36.pyc

18.3 KB
Binary file not shown.
9.52 KB
Binary file not shown.
