-
Notifications
You must be signed in to change notification settings - Fork 12
Issues: foundation-model-stack/fms-acceleration
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Slow down observed for BigCode Santa Coder
bug
Something isn't working
help wanted
Extra attention is needed
question
Further information is requested
#110
opened Nov 13, 2024 by
fabianlim
ScatterMoE to support LoRA Adapters
help wanted
Extra attention is needed
question
Further information is requested
#103
opened Nov 6, 2024 by
fabianlim
ScatterMoE to support Quantized PEFT
help wanted
Extra attention is needed
question
Further information is requested
#101
opened Nov 6, 2024 by
fabianlim
Numba JIT TypingErrors Thrown on Multipack Functions
bug
Something isn't working
question
Further information is requested
#100
opened Nov 6, 2024 by
fabianlim
Slowdown and Higher Memory Consumption for GPTQ-LoRA with Bfloat16
question
Further information is requested
#84
opened Sep 12, 2024 by
achew010
Ensure Model is Correctly Loaded for Augmentation Purposes
question
Further information is requested
#77
opened Aug 29, 2024 by
fabianlim
Inconsistency in Padding-Free Benchmarks with Different Transformers Versions
question
Further information is requested
#70
opened Aug 19, 2024 by
achew010
Quantized Peft Benchmark Experiments Run Out of Memory with Non-Zero Lora Dropout
question
Further information is requested
#50
opened Jul 12, 2024 by
achew010
ProTip!
Updated in the last three days: updated:>2024-12-29.