-
Notifications
You must be signed in to change notification settings - Fork 281
Add DS/QWEN Examples #2333
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: master
Are you sure you want to change the base?
Add DS/QWEN Examples #2333
Conversation
Signed-off-by: yiliu30 <yi4.liu@intel.com>
PR Reviewer Guide 🔍Here are some key observations to aid the review process:
|
PR Code Suggestions ✨Explore these optional code suggestions:
|
for more information, see https://pre-commit.ci
User description
Signed-off-by: yiliu30 yi4.liu@intel.com
PR Type
Enhancement
Description
Added DS/QWEN quantization examples
Included quantization scripts for different schemes
Added generation script using vLLM
Diagram Walkthrough
File Walkthrough
2 files
Added quantization script for DS/QWENAdded generation script using vLLM5 files