Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
32 commits
Select commit Hold shift + click to select a range
047587f
improve names and tests longformer
patrickvonplaten Jul 13, 2020
9cfa7c4
more and better tests for longformer
patrickvonplaten Jul 13, 2020
ef04828
add first tf test
patrickvonplaten Jul 13, 2020
8884721
finalize tf basic op functions
patrickvonplaten Jul 13, 2020
c9d41b2
fix merge
patrickvonplaten Aug 3, 2020
4b24aa0
tf shape test passes
patrickvonplaten Jul 15, 2020
dca129a
narrow down discrepancies
patrickvonplaten Jul 16, 2020
3d20741
make longformer local attn tf work
patrickvonplaten Aug 3, 2020
9ddd897
correct tf longformer
patrickvonplaten Aug 3, 2020
067bcf2
add first global attn function
patrickvonplaten Aug 3, 2020
df33a43
add more global longformer func
patrickvonplaten Aug 4, 2020
f18a6a3
advance tf longformer
patrickvonplaten Aug 4, 2020
e100dc1
finish global attn
patrickvonplaten Aug 5, 2020
7a46495
upload big model
patrickvonplaten Aug 5, 2020
63dc6d1
finish all tests
patrickvonplaten Aug 5, 2020
8080f45
correct false any statement
patrickvonplaten Aug 5, 2020
032a80c
fix common tests
patrickvonplaten Aug 6, 2020
dae8026
make all tests pass except keras save load
patrickvonplaten Aug 6, 2020
0c67792
fix some tests
patrickvonplaten Aug 6, 2020
0a01aab
fix torch test import
patrickvonplaten Aug 6, 2020
75b5f76
finish tests
patrickvonplaten Aug 6, 2020
03cc447
fix test
patrickvonplaten Aug 6, 2020
e047d9c
fix torch tf tests
patrickvonplaten Aug 6, 2020
6a285aa
add docs
patrickvonplaten Aug 7, 2020
eb00e13
finish docs
patrickvonplaten Aug 7, 2020
bb41816
Update src/transformers/modeling_longformer.py
patrickvonplaten Aug 7, 2020
42574df
Update src/transformers/modeling_tf_longformer.py
patrickvonplaten Aug 7, 2020
484ea03
apply Lysandres suggestions
patrickvonplaten Aug 7, 2020
dfd729e
reverse to assert statement because function will fail otherwise
patrickvonplaten Aug 7, 2020
dabe6f1
applying sylvains recommendations
patrickvonplaten Aug 8, 2020
19d9b32
Update src/transformers/modeling_longformer.py
patrickvonplaten Aug 10, 2020
41cb64f
Update src/transformers/modeling_tf_longformer.py
patrickvonplaten Aug 10, 2020
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
22 changes: 22 additions & 0 deletions docs/source/model_doc/longformer.rst
Original file line number Diff line number Diff line change
Expand Up @@ -102,3 +102,25 @@ LongformerForQuestionAnswering

.. autoclass:: transformers.LongformerForQuestionAnswering
:members:


TFLongformerModel
~~~~~~~~~~~~~~~~~~~~

.. autoclass:: transformers.TFLongformerModel
:members:


TFLongformerForMaskedLM
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

.. autoclass:: transformers.TFLongformerForMaskedLM
:members:


TFLongformerForQuestionAnswering
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

.. autoclass:: transformers.TFLongformerForQuestionAnswering
:members:

9 changes: 9 additions & 0 deletions src/transformers/__init__.py
Original file line number Diff line number Diff line change
Expand Up @@ -399,6 +399,7 @@
LongformerForMultipleChoice,
LongformerForTokenClassification,
LongformerForQuestionAnswering,
LongformerSelfAttention,
LONGFORMER_PRETRAINED_MODEL_ARCHIVE_LIST,
)

Expand Down Expand Up @@ -568,6 +569,14 @@
TFGPT2PreTrainedModel,
)

from .modeling_tf_longformer import (
TF_LONGFORMER_PRETRAINED_MODEL_ARCHIVE_LIST,
TFLongformerModel,
TFLongformerForMaskedLM,
TFLongformerForQuestionAnswering,
TFLongformerSelfAttention,
)

from .modeling_tf_mobilebert import (
TF_MOBILEBERT_PRETRAINED_MODEL_ARCHIVE_LIST,
TFMobileBertModel,
Expand Down
392 changes: 207 additions & 185 deletions src/transformers/modeling_longformer.py

Large diffs are not rendered by default.

6 changes: 6 additions & 0 deletions src/transformers/modeling_tf_auto.py
Original file line number Diff line number Diff line change
Expand Up @@ -29,6 +29,7 @@
ElectraConfig,
FlaubertConfig,
GPT2Config,
LongformerConfig,
MobileBertConfig,
OpenAIGPTConfig,
RobertaConfig,
Expand Down Expand Up @@ -93,6 +94,7 @@
TFFlaubertWithLMHeadModel,
)
from .modeling_tf_gpt2 import TFGPT2LMHeadModel, TFGPT2Model
from .modeling_tf_longformer import TFLongformerForMaskedLM, TFLongformerForQuestionAnswering, TFLongformerModel
from .modeling_tf_mobilebert import (
TFMobileBertForMaskedLM,
TFMobileBertForMultipleChoice,
Expand Down Expand Up @@ -149,6 +151,7 @@
(AlbertConfig, TFAlbertModel),
(CamembertConfig, TFCamembertModel),
(XLMRobertaConfig, TFXLMRobertaModel),
(LongformerConfig, TFLongformerModel),
(RobertaConfig, TFRobertaModel),
(BertConfig, TFBertModel),
(OpenAIGPTConfig, TFOpenAIGPTModel),
Expand Down Expand Up @@ -191,6 +194,7 @@
(AlbertConfig, TFAlbertForMaskedLM),
(CamembertConfig, TFCamembertForMaskedLM),
(XLMRobertaConfig, TFXLMRobertaForMaskedLM),
(LongformerConfig, TFLongformerForMaskedLM),
(RobertaConfig, TFRobertaForMaskedLM),
(BertConfig, TFBertForMaskedLM),
(OpenAIGPTConfig, TFOpenAIGPTLMHeadModel),
Expand Down Expand Up @@ -226,6 +230,7 @@
(AlbertConfig, TFAlbertForMaskedLM),
(CamembertConfig, TFCamembertForMaskedLM),
(XLMRobertaConfig, TFXLMRobertaForMaskedLM),
(LongformerConfig, TFLongformerForMaskedLM),
(RobertaConfig, TFRobertaForMaskedLM),
(BertConfig, TFBertForMaskedLM),
(MobileBertConfig, TFMobileBertForMaskedLM),
Expand Down Expand Up @@ -259,6 +264,7 @@
(AlbertConfig, TFAlbertForQuestionAnswering),
(CamembertConfig, TFCamembertForQuestionAnswering),
(XLMRobertaConfig, TFXLMRobertaForQuestionAnswering),
(LongformerConfig, TFLongformerForQuestionAnswering),
(RobertaConfig, TFRobertaForQuestionAnswering),
(BertConfig, TFBertForQuestionAnswering),
(XLNetConfig, TFXLNetForQuestionAnsweringSimple),
Expand Down
Loading