Skip to content

[FT] Continuous batching for transformers #723

@NathanHB

Description

@NathanHB

Issue encountered

Evaluating models with transformers is very slow

Solution/Feature

Continuous batching for transformers models.

Metadata

Metadata

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions