Skip to content

Conversation

@WyldeCat
Copy link
Collaborator

@WyldeCat WyldeCat commented Oct 22, 2025

  • This branch is intended for submission to the original vLLM repository.
  • Validation on 1% of MMLU-Pro.
Final Results:
Model: motif-small
Total samples: 1203
Correct answers: 910
Overall accuracy: 75.64%

Category-wise Results:
  business: 9.25% (73/789)
  law: 5.81% (64/1101)
  psychology: 9.02% (72/798)
  biology: 6.97% (50/717)
  chemistry: 7.07% (80/1132)
  history: 7.09% (27/381)
  other: 7.25% (67/924)
  health: 8.19% (67/818)
  economics: 7.70% (65/844)
  math: 8.73% (118/1351)
  physics: 7.78% (101/1299)
  computer science: 10.00% (41/410)
  philosophy: 5.01% (25/499)
  engineering: 6.19% (60/969)
  • Generation throughput fluctuates between 1500 ~ 3000 tokens/s, possibly depending on the context length.
    • used --batch-size 128

@ca1207

This comment was marked as resolved.

@WyldeCat

This comment was marked as resolved.

@ca1207
Copy link

ca1207 commented Oct 23, 2025

docs/models/supported_models.md
tests/models/registry.py

should modify above files
ref (https://github.com/vllm-project/vllm/pull/25866/files)

This reverts commit 3125d79.
Signed-off-by: WyldeCat <skan1543@gmail.com>
@WyldeCat WyldeCat force-pushed the feat/motif branch 3 times, most recently from 230bd2f to 52a3f87 Compare October 23, 2025 05:06
@WyldeCat
Copy link
Collaborator Author

@ca1207
Thanks for your review!

  • Added sign-off to revert commit
  • Updated docs/models/supported_models.md, tests/models/registry.py

@ca1207
Copy link

ca1207 commented Oct 23, 2025

Should GroupedDiffAttn support for rocm be part of this pr?

Signed-off-by: WyldeCat <skan1543@gmail.com>
Signed-off-by: WyldeCat <skan1543@gmail.com>
@WyldeCat
Copy link
Collaborator Author

Should GroupedDiffAttn support for rocm be part of this pr?

I think separating PRs makes progress easier.

@ca1207
Copy link

ca1207 commented Oct 23, 2025

LGTM

ca1207 and others added 5 commits November 10, 2025 22:32
Signed-off-by: ca1207 <ca1207zzz@gmail.com>
…nsfers in EPLB (vllm-project#28369)

Signed-off-by: Sage Moore <sage@neuralmagic.com>
Signed-off-by: zjy0516 <riverclouds.zhu@qq.com>
Signed-off-by: ca1207 <ca1207zzz@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants