Commit aca4a33

Wenxh/fp8 on a100 v5 (#1)
Group Gemm Version
1 parent 03e3bda commit aca4a33

11 files changed, +997 -446 lines changed

requirements-cuda.txt

Lines changed: 7 additions & 1 deletion
@@ -8,4 +8,10 @@ vllm-nccl-cu12>=2.18,<2.19 # for downloading nccl library
 torch == 2.2.1
 xformers == 0.0.25 # Requires PyTorch 2.2.1
 
-cupy-cuda12x
+# Dependencies for pycublas-moe-groupe-gemm
+gitpython
+pytest
+loguru
+# In case of invalide url, please install from this file:
+# pip install gitpython pytest loguru vllm/model_executor/layers/fused_moe/pycublas.zip
+git+https://github.com/wenxcs/pycublas.git@moe-group-gemm
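
The hunk above adds four dependencies and a fallback install path, so it can help to confirm they resolve before exercising the group GEMM code. The following is a minimal sketch, not part of the commit: the helper name `missing_dependencies` is hypothetical, and the import name `pycublas` is an assumption based only on the repository URL (the diff does not show the package's module name). `gitpython` is imported as `git`.

```python
from importlib.util import find_spec

# pip distribution name -> top-level import name.
# "pycublas" as an import name is an assumption; the diff only shows the repo URL.
REQUIRED = {
    "gitpython": "git",
    "pytest": "pytest",
    "loguru": "loguru",
    "pycublas": "pycublas",
}


def missing_dependencies(required=REQUIRED):
    """Return the sorted pip names whose top-level module cannot be located."""
    return sorted(
        pip_name
        for pip_name, module in required.items()
        if find_spec(module) is None
    )


if __name__ == "__main__":
    missing = missing_dependencies()
    if missing:
        print("Missing:", ", ".join(missing))
    else:
        print("All listed dependencies are importable.")
```

If `pycublas` is reported missing because the git URL is unreachable, the comment in the diff points to the bundled `pycublas.zip` as the alternative install source.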
