Skip to content

[Tracker] TorchAO activation sparsity acceleration 🚀 #2095

Open
@jcaip

Description

@jcaip

This is a tracker issue for all the different ways we can accelerate training / inference with activation sparsity in TorchAO.

Inference

  • Accelerate memory-bound bs=1 decode use cases with a selective weight loading kernel, like that described in TEAL / CATS.
Image

Training

  • Activation compression to accelerate 2:4 sparse training (#1920 activation sparsity + compression #2076) has an implementation that I need to benchmark / review.
  • Implement custom sparse training kernels outlined in our ICLR paper. Lower priority for now.

Metadata

Metadata

Assignees

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions