-
Notifications
You must be signed in to change notification settings - Fork 1.1k
Pull requests: modelscope/ms-swift
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[feat] support activation cpu offload in fsdp and fsdp2
#7201
opened Dec 24, 2025 by
meichangsu1
Loading…
1 of 4 tasks
support cce、tiledmlp、activation cpu offload
#7169
opened Dec 23, 2025 by
meichangsu1
Loading…
1 of 4 tasks
Improve vLLM examples regarding vllm_engine_kwargs use
#7133
opened Dec 19, 2025 by
3manifold
Loading…
1 task done
[feat] support TiledMLP in Deepspeed and FSDP2
#7090
opened Dec 17, 2025 by
kevssim
Loading…
2 of 4 tasks
[bugfix] fix missing generate method for InternVL-2.5
#7019
opened Dec 12, 2025 by
xwy-bit
Loading…
1 of 4 tasks
feat: Add support for enabling and configuring msprobe via command-line and config.json
#6834
opened Dec 1, 2025 by
Vectorwh
Loading…
2 of 4 tasks
Add conditional distillation support for GKD trainer
#6542
opened Nov 11, 2025 by
woshixiaobai2019
Loading…
3 tasks
Add Tensor Input Support: Enable .pt file processing with <tensor> tags for latent representations
#6504
opened Nov 9, 2025 by
Marshall-mk
Loading…
1 of 4 tasks
[Fix Bug] Enhance
ProgressCallbackNew to initialize training bar with current step
#6415
opened Nov 3, 2025 by
YushunXiang
Loading…
1 of 4 tasks
feat: Enable for exporting unmerged HF Lora Adapter
#6225
opened Oct 20, 2025 by
jason9693
Loading…
1 of 4 tasks
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.