This release brings significant scalability improvements, new models, bug fixes, and usability enhancements to EasyDeL.
Highlights:
- Multi-host GPU Training: EasyDeL now scales seamlessly across multiple GPUs and hosts for demanding training workloads.
- New Models: Expand your NLP arsenal with the addition of Gemma2, OLMo, and Aya models.
- Improved KV Cache Quantization: Enhanced KV cache quantization achieves +21% accuracy compared to the previous version.
- Simplified Model Management: Load and save pretrained models effortlessly using the new `model.from_pretrained` and `model.save_pretrained` methods.
- Enhanced Generation Pipeline: The `GenerationPipeLine` now supports streaming token generation, ideal for real-time applications.
- Introducing the ApiEngine: Leverage the power of the new `ApiEngine` and `engine_client` for seamless integration with your applications.
Other Changes:
- Fixed GPU Flash Attention bugs for increased stability.
- Updated required `jax` version to `>=0.4.28` for optimal performance. Versions `0.4.29` or higher are recommended if available.
- Streamlined the `structure` import process and resolved multi-host training issues.
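
To confirm that an environment satisfies the `jax` requirement listed above, a small check along the following lines may help. This is only a sketch and not part of EasyDeL: the helper names `parse_version`, `classify_jax_version`, and `installed_jax_status` are hypothetical, and the version parsing is deliberately simplified (it reads only the leading numeric components and ignores pre-release or local suffixes).

```python
import re
from importlib import metadata

# Thresholds taken from the notes above.
MIN_JAX = (0, 4, 28)          # required minimum
RECOMMENDED_JAX = (0, 4, 29)  # recommended if available


def parse_version(version: str) -> tuple:
    """Return the leading numeric components, e.g. '0.4.29.dev1' -> (0, 4, 29)."""
    match = re.match(r"\d+(?:\.\d+)*", version.strip())
    if match is None:
        raise ValueError(f"unrecognized version string: {version!r}")
    return tuple(int(part) for part in match.group(0).split("."))


def classify_jax_version(version: str) -> str:
    """Classify a jax version string against the thresholds above."""
    parsed = parse_version(version)
    if parsed < MIN_JAX:
        return "too old"
    if parsed < RECOMMENDED_JAX:
        return "supported"
    return "recommended"


def installed_jax_status() -> str:
    """Classify the installed jax, or report that it is missing."""
    try:
        return classify_jax_version(metadata.version("jax"))
    except metadata.PackageNotFoundError:
        return "not installed"


if __name__ == "__main__":
    print(f"jax status: {installed_jax_status()}")
```

A result of "too old" or "not installed" suggests upgrading `jax` before installing this release.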
Upgrade:
To upgrade to EasyDeL v0.0.69, use the following command:
pip install --upgrade easydel==0.0.69