Releases: ThilinaRajapakse/simpletransformers
New Classification Models
Added
- Added class weights support for Longformer classification
- Added new classification models (multilabel classification is not supported yet):
- DeBERTa
- MPNet
- SqueezeBert (no sliding window support)
Changed
- Updated ClassificationModel logic to make it easier to add new models
MT5, Adafactor optimizer, additional schedulers
Breaking change
- `T5Model` now has a required `model_type` parameter (`"t5"` or `"mt5"`)
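A rough sketch of the change (the checkpoint name and the `validate_t5_model_type` helper below are illustrative, not part of the library):

```python
# Before this release, T5Model took only a model name/path. Now the model
# type must be passed explicitly as the first argument.
def validate_t5_model_type(model_type):
    # Hypothetical helper mirroring the new constraint: only "t5" or "mt5".
    if model_type not in ("t5", "mt5"):
        raise ValueError(f"model_type must be 't5' or 'mt5', got {model_type!r}")
    return model_type

# from simpletransformers.t5 import T5Model
# model = T5Model(validate_t5_model_type("mt5"), "google/mt5-base")
```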
Added
- Added support for MT5
- Added support for Adafactor optimizer
- Added support for various schedulers:
- get_constant_schedule
- get_constant_schedule_with_warmup
- get_linear_schedule_with_warmup
- get_cosine_schedule_with_warmup
- get_cosine_with_hard_restarts_schedule_with_warmup
- get_polynomial_decay_schedule_with_warmup
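The schedulers above are the Hugging Face `transformers` helpers. As a rough sketch of the most common shape (linear warmup, then linear decay), the learning-rate multiplier can be written in plain Python; this is illustrative, not the library's internals:

```python
def linear_schedule_with_warmup(step, warmup_steps, total_steps):
    """Multiplier applied to the base learning rate at a given step:
    ramps linearly from 0 to 1 over warmup_steps, then decays linearly
    back to 0 by total_steps."""
    if step < warmup_steps:
        return step / max(1, warmup_steps)
    return max(0.0, (total_steps - step) / max(1, total_steps - warmup_steps))
```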
Changed
- `T5Model` now has a required `model_type` parameter (`"t5"` or `"mt5"`)
Fixed
- Fixed issue with class weights not working in `ClassificationModel` when using multi-GPU training
LayoutLM model for Classification. MBart. BERTweet for NER. Longformer for Multilabel Classification.
Added
- Added LayoutLM for Classification.
- Added MBart. @Zhylkaaa
- Added BERTweet for NER. @manueltonneau
- Added Longformer for Multilabel Classification. @manueltonneau
Fixed
- Fixed issue with `Seq2SeqModel` when the `model_name` contained backslashes.
- Fixed issue with saving args when a `dataset_class` is specified in `Seq2SeqModel`.
Changed
- The Electra implementation used with `ClassificationModel` is now fully compatible with Hugging Face.
LayoutLM model for NER
Added
- Added `layoutlm` model for NER (see docs)
Fixed
- Potential fix for inconsistent `eval_loss` calculation
Evaluation/Prediction Performance enhancements and ONNX support
Mixed Precision Support for evaluation and prediction
Mixed precision (fp16) inference is now supported for evaluation and prediction in the following models:
- ClassificationModel
- ConvAI
- MultiModalClassificationModel
- NERModel
- QuestionAnsweringModel
- Seq2Seq
- T5Model
You can disable fp16 by setting `fp16 = False` in the `model_args`.
Multi-GPU support for evaluation and prediction
Set the number of GPUs with `n_gpu` in `model_args`.
Currently supported in the following models:
- ClassificationModel
- ConvAI
- MultiModalClassificationModel
- NERModel
- QuestionAnsweringModel
- Seq2Seq
- T5Model
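Both fp16 and the GPU count are plain `model_args` settings. A minimal sketch, assuming the dict-style args interface; the model type and checkpoint name are placeholders, and the constructor line is commented out because it downloads weights:

```python
# fp16 inference is on by default for evaluation/prediction; n_gpu selects
# how many GPUs to use for those steps.
model_args = {
    "fp16": False,  # disable mixed-precision inference
    "n_gpu": 2,     # use 2 GPUs for evaluation/prediction
}

# from simpletransformers.classification import ClassificationModel
# model = ClassificationModel("roberta", "roberta-base", args=model_args)
```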
Native ONNX support for Classification and NER tasks (Beta)
Please note that ONNX support is still experimental.
See docs for details.
Dynamic Quantization Support
Added
- Added dynamic quantization support for all models.
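Under the hood this relies on PyTorch's dynamic quantization. A minimal standalone sketch of the underlying mechanism (plain PyTorch on a toy network, not the simpletransformers wrapper itself):

```python
import torch

# Dynamic quantization converts Linear (and similar) layers to int8 weights;
# activations are quantized on the fly at inference time, so no calibration
# pass is needed.
model = torch.nn.Sequential(
    torch.nn.Linear(8, 4),
    torch.nn.ReLU(),
    torch.nn.Linear(4, 2),
)
quantized = torch.quantization.quantize_dynamic(
    model, {torch.nn.Linear}, dtype=torch.qint8
)
output = quantized(torch.randn(1, 8))
```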
- Added ConvAI docs to documentation website. @pablonm3
Simple Viewer
Added
- Added support for testing models through a Streamlit app. Use the command `simple-viewer`. Currently supports:
- Classification (including multilabel)
- NER (design inspired by displaCy Named Entity Visualizer)
- QA
See docs for details.
dataloader_num_workers option for specifying the number of processes to be used with a PyTorch dataloader
Added
- Added `dataloader_num_workers` to `ModelArgs` for specifying the number of processes to be used with a PyTorch dataloader.
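This option presumably maps onto the `num_workers` parameter of a PyTorch `DataLoader`. A minimal standalone sketch of what that parameter controls (plain PyTorch, not the simpletransformers internals):

```python
import torch
from torch.utils.data import DataLoader, TensorDataset

dataset = TensorDataset(torch.arange(8, dtype=torch.float32).unsqueeze(1))
# num_workers=0 loads batches in the main process; values > 0 spawn that many
# worker subprocesses to prepare batches in parallel.
loader = DataLoader(dataset, batch_size=4, num_workers=0)
batches = list(loader)
```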
Changed
- Bumped required `transformers` version to 3.0.2
Text Representation Generation
Added
- Added Text Representation Generation (`RepresentationModel`). @pablonm3
Lazy Loading QA Datasets
Added
- Lazy loading support added for `QuestionAnsweringModel`.