[Bugfix] Fix KV head calculation for MPT models when using GQA #3594

Annotations

2 warnings

This job succeeded