Skip to content

efficient-edge/Cheetah

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

31 Commits
 
 

Repository files navigation

Cheetah

Cheetah is an optimization zoo for vision transformer (ViT) that consists of all popular ViT optimization techniques (sparse, pruning and multi-exit). Prior to our implementation, I list all influencing papers about ViT acceleration and develop them as optimization modules for any ViT models. Unlike FasterTransformer and DeepSpeed, Cheetah pays more attention to vision transformer instead of BERT, and provids more developer-friendly code through modular design (users can use their favorite optimizations to accelerate ViT). In the end, I will introduce how to profile your new ViT models and deploy them to NVIDIA Trition server (including how to accelerate them with TensorRT).

ViT Models

-->

Contributors

Releases

No releases published

Packages

No packages published