funnel transformer #1419

zzz2010 · 2021-12-09T12:59:49Z

PR types

new transformer model

PR changes

主要是添加了funnel 模型在paddlenlp的transformer 下面，和修改了run_squad.py 在examples 下，可以测试funnel 模型

Description

复现论文比赛第四期， funnel 的复现，团队：小木桌

请在google drive 下载所有pretrain 模型放到 https://bj.bcebos.com/paddlenlp/models：
https://drive.google.com/drive/folders/1eo2Jq0xDd7qO_9-N5lf42WsqQSi7i9aS?usp=sharing

joey12300 · 2021-12-12T15:59:45Z

paddlenlp/transformers/funnel/modeling.py

+          different token than `bos`, the id of that token.
+        - **sep_token_id** (:obj:`int`, `optional`)) -- The id of the `separation` token.
+
+    PyTorch specific parameters


这里是不需要提供PyTorch和TF的特定的参数，可以去掉

是的，已经去掉

joey12300 · 2021-12-12T16:06:55Z

paddlenlp/transformers/funnel/modeling.py

+    return x
+
+def constant_(x, val):
+    temp_value=paddle.zeros_like(x)+val


可以使用paddle.full替代

是的，已经改为 paddle.full_like(x,fill_value=val)

joey12300 · 2021-12-12T16:07:50Z

paddlenlp/transformers/funnel/modeling.py

+    return x * paddle.sigmoid(x)
+
+
+if version.parse(paddle.__version__) < version.parse("1.7"):


请问这里检验版本的作用是？

没有作用，之前torch的版本检验，已经remove。保留 silu = nn.functional.silu

joey12300 · 2021-12-12T16:09:08Z

paddlenlp/transformers/funnel/modeling.py

+
+
+
+def expand(self, *sizes):


与上面重复了

已经remove

joey12300 · 2021-12-12T16:09:27Z

paddlenlp/transformers/funnel/modeling.py

+    return x
+
+
+def repeat_interleave(x, repeats, dim=None):


与上面重复了

已经remove

joey12300 · 2021-12-12T16:09:48Z

paddlenlp/transformers/funnel/modeling.py

+        return paddle.reshape(x, orig_shape)
+
+
+def gather(x, dim, index):


与上面重复，下同

已经remove

joey12300 · 2021-12-15T02:51:36Z

paddlenlp/transformers/funnel/modeling.py

+
+def is_tensor(x):
+    """
+    Tests if ``x`` is a :obj:`paddle.Tensor`, :obj:`tf.Tensor`, obj:`jaxlib.xla_extension.DeviceArray` or


应该不需要判断tf的Tensor吧？

是的，已经删除无关注释

joey12300 · 2021-12-15T02:53:52Z

paddlenlp/transformers/funnel/modeling.py

+        return_dict=None,
+    ):
+        r"""
+        labels (:obj:`paddle.LongTensor` of shape :obj:`(batch_size,)`, `optional`):


这里以及下面几个任务类都可以稍微整理一下注释，可以参考BertForSequenceClassification。还有就是paddle只有Tensor没有LongTensor类型，注意修改~

谢谢，已经修改

joey12300 · 2021-12-15T02:54:52Z

paddlenlp/transformers/funnel/modeling.py

+        return_dict=None,
+    ):
+        r"""
+        labels (``paddle.LongTensor`` of shape ``(batch_size, sequence_length)``, `optional`):


同下，注意修改

已经修改

joey12300 · 2021-12-15T02:56:03Z

paddlenlp/transformers/funnel/modeling.py

+        return self.linear_out(hidden)
+
+from collections import OrderedDict
+class FunnelForPreTrainingOutput(OrderedDict):


注释里涉及到FloatTenor,LongTensor类型的统一修改为Tensor

已经修改

joey12300 · 2021-12-15T03:00:21Z

paddlenlp/transformers/__init__.py

@@ -86,3 +86,5 @@
 from .reformer.tokenizer import *
 from .mobilebert.modeling import *
 from .mobilebert.tokenizer import *


这个地方冲突了，注意解决冲突~

已经修改

joey12300 · 2021-12-15T03:04:50Z

paddlenlp/transformers/funnel/modeling.py

+        return ACT2FN[activation_string]
+    else:
+        raise KeyError(f"function {activation_string} not found in ACT2FN mapping {list(ACT2FN.keys())}")
+


这里空行较多，是否没有安装代码检查工具呢？建议安装git hook,并重新提交代码

pre-commit install

不太会用，我尝试过window，和ubuntu安装， commit的时候都出现先配置错误。可否帮忙看看现在的版本是否还有空行问题？

joey12300 · 2021-12-15T03:07:36Z

paddlenlp/transformers/funnel/modeling.py

+
+    PyTorch specific parameters
+
+        - **torchscript** (:obj:`bool`, `optional`, defaults to :obj:`False`) -- Whether or not the model should be


貌似很多注释都出现torch、TensorFlow的字眼，目前PaddleNLP暂时不用支持torch和tensorflow的输入，可以把相关的信息删除掉

已经删除

joey12300

LGTM

zzz2010 and others added 6 commits December 8, 2021 20:48

first version

b7d585d

add funnel to init.py and run_squad.py

6bb8bdc

add attention_mask

8992aa4

Merge branch 'PaddlePaddle:develop' into develop

d40292b

rename FunnelTokenizerFast to FunnelTokenizer

2083803

Merge remote-tracking branch 'origin/develop' into develop

1d24844

ZeyuChen requested a review from joey12300 December 10, 2021 16:30

ZeyuChen added the contributions label Dec 10, 2021

joey12300 reviewed Dec 12, 2021

View reviewed changes

revised based on the PR comments

b36473a

joey12300 reviewed Dec 15, 2021

View reviewed changes

zzz2010 and others added 7 commits December 15, 2021 20:58

further clean up function description section

c6286bc

further clean up

896457d

pre-commit check

cddafbf

pre-commit check

8552d04

pre-commit check

0cd4b60

Merge branch 'develop' into develop

3ac832e

Merge branch 'develop' into develop

fba0b29

joey12300 approved these changes Dec 23, 2021

View reviewed changes

joey12300 merged commit 74f9fae into PaddlePaddle:develop Dec 23, 2021

joey12300 mentioned this pull request Dec 24, 2021

PaddleNLP 2.2.3 Release Note Candidate #1509

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

funnel transformer #1419

funnel transformer #1419

zzz2010 commented Dec 9, 2021

joey12300 Dec 12, 2021

zzz2010 Dec 15, 2021

joey12300 Dec 12, 2021

zzz2010 Dec 15, 2021

joey12300 Dec 12, 2021

zzz2010 Dec 15, 2021

joey12300 Dec 12, 2021

zzz2010 Dec 15, 2021

joey12300 Dec 12, 2021

zzz2010 Dec 15, 2021

joey12300 Dec 12, 2021

zzz2010 Dec 15, 2021

joey12300 Dec 15, 2021

zzz2010 Dec 16, 2021

joey12300 Dec 15, 2021

zzz2010 Dec 16, 2021

joey12300 Dec 15, 2021

zzz2010 Dec 16, 2021

joey12300 Dec 15, 2021

zzz2010 Dec 16, 2021

joey12300 Dec 15, 2021

zzz2010 Dec 16, 2021

joey12300 Dec 15, 2021

zzz2010 Dec 16, 2021

joey12300 Dec 15, 2021

zzz2010 Dec 16, 2021

joey12300 left a comment

		return x * paddle.sigmoid(x)


		if version.parse(paddle.__version__) < version.parse("1.7"):

		return paddle.reshape(x, orig_shape)


		def gather(x, dim, index):


		PyTorch specific parameters

		- torchscript (:obj:`bool`, `optional`, defaults to :obj:`False`) -- Whether or not the model should be

funnel transformer #1419

funnel transformer #1419

Conversation

zzz2010 commented Dec 9, 2021

PR types

PR changes

Description

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

joey12300 left a comment

Choose a reason for hiding this comment