Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

support swift deploy stats #1593

Merged
merged 12 commits into from
Aug 5, 2024

Conversation

Jintao-Huang
Copy link
Collaborator

@Jintao-Huang Jintao-Huang commented Aug 4, 2024

PR type

  • Bug Fix
  • New Feature
  • Document Updates
  • More Models or Datasets Support
  1. 文档更新:推理时如何自定义val_dataset; megatron如何转换qwen2-72b-instruct
  2. 支持部署时显示统计量. 部署新增参数verbose, log_interval. 控制多久打印一次统计量.
  3. megatron中安装对应版本的megatron-lm (使用commit-hash控制),保证稳定性
  4. 修复lmdeploy部署时的bug. session_id导致的并发低的问题.
  5. 部署时,支持base模型使用chat/completions 接口.
  6. megatron部分重构函数注册部分, 方便新增模型
  7. 新增get_model_list_client_async.

English:

  1. Document update: How to customize val_dataset during inference; How to convert qwen2-72b-instruct in Megatron.
  2. Support displaying metrics during deployment. Added parameters verbose and log_interval to control how often metrics are printed.
  3. Ensure stability by installing the corresponding version of megatron-lm in Megatron (controlled using commit hash).
  4. Fixed bug during lmdeploy deployment. Resolved the concurrency issue caused by session_id.
  5. During deployment, support the base model using chat/completions interface.
  6. Partially refactored function registration in Megatron to facilitate adding new models.
  7. Added get_model_list_client_async.

@Jintao-Huang Jintao-Huang changed the title support lmdeploy stats support swift deploy stats Aug 4, 2024
@Jintao-Huang Jintao-Huang merged commit 5fc9a83 into modelscope:main Aug 5, 2024
3 of 4 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants