-
Notifications
You must be signed in to change notification settings - Fork 454
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Documents Updates #2574
Documents Updates #2574
Conversation
# Conflicts: # docs/source_en/Instruction/Commend-line-parameters.md # docs/source_en/Instruction/Supported-models-datasets.md
docs/source/Instruction/命令行参数.md
Outdated
@@ -239,7 +239,7 @@ Vera使用`target_modules`, `target_regex`, `modules_to_save`三个参数. | |||
|
|||
- use_liger: 使用liger-kernel进行训练. | |||
|
|||
### TorchAcc参数 | |||
#### TorchAcc |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
请还原
README.md
Outdated
@@ -155,7 +155,7 @@ You can contact us and communicate with us by adding our group: | |||
- 🔥2024.04.11: Support Model Evaluation with MMLU/ARC/CEval datasets(also user custom eval datasets) with one command! Check [this documentation](docs/source_en/Instruction/LLM-eval.md) for details. Meanwhile, we support a trick way to do multiple ablation experiments, check [this documentation](docs/source_en/Instruction/LLM-exp.md) to use. | |||
- 🔥2024.04.11: Support **c4ai-command-r** series: c4ai-command-r-plus, c4ai-command-r-v01, use [this script](https://github.com/modelscope/swift/blob/main/examples/pytorch/llm/scripts/c4ai_command_r_plus/lora_mp/sft.sh) to train. | |||
- 2024.04.10: Use SWIFT to fine-tune the qwen-7b-chat model to enhance its function call capabilities, and combine it with [Modelscope-Agent](https://github.com/modelscope/modelscope-agent) for best practices, which can be found [here](https://github.com/modelscope/swift/tree/main/docs/source_en/LLM/Agent-best-practice.md#Usage-with-Modelscope_Agent). | |||
- 🔥2024.04.09: Support ruozhiba dataset. Search `ruozhiba` in [this documentation](docs/source_en/Instruction/Supported-models-datasets.md) to begin training! | |||
- 🔥2024.04.09: Support ruozhiba dataset. Search `ruozhiba` in [this documentation](docs/source_en/Instruction/Supported-models-and-datasets) to begin training! |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
检查链接是否正确
--model Qwen/Qwen2-7B-Instruct \ | ||
--train_type lora \ | ||
--lorap_lr_ratio 16 \ | ||
--dataset swift/self-cognition#1000 \ |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
请同步中文版代码
- Requires: Additional requirements for using this model | ||
The table below introduces the models integrated with ms-swift: | ||
|
||
- Model ID: Model ID for the Magic Tower Model |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
清检查这行
- Tags: Tags of the dataset. | ||
|
||
The table below introduces information about the datasets integrated with ms-swift: | ||
- Dataset ID: Magic Tower dataset ID |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
请检查这行
This reverts commit 6cd3f46.
PR type
PR information
Write the detail information belongs to this PR.
Experiment results
Paste your experiment result here(if needed).