Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
[Inference] Qwen2 support fp8 inference (#8954)
* qwen2 fp8 * fp8 check * fp8 cutlass * int8 cachekv * a8w8c8_fp8
- Loading branch information