Nlp-toolkit is a powerful toolkit for automatically applying model optimizations to Natural Language Processing (NLP) models. It leverages Intel® Neural Compressor to provide a variety of optimization methods: quantization, pruning, distillation, and so on. The toolkit supports multiple deep learning frameworks, such as PyTorch and TensorFlow. For PyTorch models, it also supports the NNCF provider for optimization.
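Since the toolkit drives quantization through Intel® Neural Compressor, here is a minimal sketch of what post-training dynamic quantization of a Hugging Face model can look like with that underlying library; the model checkpoint, output path, and configuration values are illustrative assumptions and are not taken from this repository.

```python
# Minimal sketch (assumed model name and output path) of post-training
# dynamic quantization with Intel® Neural Compressor, the library this
# toolkit builds on.
from transformers import AutoModelForSequenceClassification
from neural_compressor import PostTrainingQuantConfig
from neural_compressor.quantization import fit

# Any PyTorch transformer model works here; this checkpoint is just an example.
model = AutoModelForSequenceClassification.from_pretrained(
    "distilbert-base-uncased-finetuned-sst-2-english"
)

# Dynamic quantization needs no calibration data: weights are stored in INT8
# and activations are quantized on the fly at inference time.
conf = PostTrainingQuantConfig(approach="dynamic")
q_model = fit(model=model, conf=conf)

# Save the quantized model (weights plus quantization config) for deployment.
q_model.save("./quantized_model")
```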
Extending Hugging Face transformers APIs for Transformer-based models and improving the productivity of inference deployment. With extremely compressed models, the toolkit can greatly improve inference efficiency on Intel platforms.
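As a rough continuation of the sketch above (same assumed model name and path), the compressed model can be restored on top of the original FP32 architecture and used for inference; `neural_compressor.utils.pytorch.load`, the checkpoint directory, and the sample input are used here for illustration only.

```python
# Sketch continued: reload the saved INT8 model and run inference.
# The checkpoint directory, model name, and input text are assumptions.
import torch
from transformers import AutoModelForSequenceClassification, AutoTokenizer
from neural_compressor.utils.pytorch import load

model_name = "distilbert-base-uncased-finetuned-sst-2-english"
tokenizer = AutoTokenizer.from_pretrained(model_name)
fp32_model = AutoModelForSequenceClassification.from_pretrained(model_name)

# Restore the quantized weights onto the FP32 model definition.
int8_model = load("./quantized_model", fp32_model)

inputs = tokenizer("A genuinely enjoyable film.", return_tensors="pt")
with torch.no_grad():
    prediction = int8_model(**inputs).logits.argmax(dim=-1)
print(prediction)
```

On Intel CPUs the INT8 model typically reduces memory footprint and speeds up the matrix multiplications relative to the FP32 baseline, which is the inference-efficiency gain the description above refers to.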
Languages
- C++ 57.1%
- Python 41.6%
- Other 1.3%