feat: support configurable docker container mem_limit/cpu_shares #5
Open
qxo wants to merge 1 commit into gpustack:main
Conversation
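* The MEM/CPU limits of model containers can be configured through the instance node environment variables GPUSTACK_RUNTIME_DEPLOY_DEFAULT_MEM / GPUSTACK_RUNTIME_DEPLOY_DEFAULT_CPU, or through the environment variables ENV_MEN / ENV_CPU in the model edit UI.
* The pause container now has MEM/CPU limits by default; they can also be configured through the worker environment variables GPUSTACK_RUNTIME_DEPLOY_PAUSE_MEM / GPUSTACK_RUNTIME_DEPLOY_PAUSE_CPU_SHARES.
* To prevent memory over-allocation, specifying mem_limit does not set mem_reservation=mem_limit by default; mem_reservation can be set separately when needed.
* TODO: the naming of the constants above is open to discussion; more meaningful names are welcome.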
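For reviewers, the resolution described above could look roughly like the sketch below. It is not the code in this PR. It assumes that a model-level value overrides the node-level default, that the CPU value is a relative share weight passed as `cpu_shares`, and that the result is handed to docker-py's `containers.run()`, which accepts `mem_limit`, `mem_reservation` and `cpu_shares`. The helper name `resolve_limits` is hypothetical.

```python
from typing import Mapping, Optional


def resolve_limits(
    node_env: Mapping[str, str],
    model_env: Mapping[str, str],
    mem_reservation: Optional[str] = None,
) -> dict:
    """Build resource kwargs in the shape docker-py's containers.run() accepts."""
    # Assumption: a model-level value overrides the node-level default.
    mem = model_env.get("ENV_MEN") or node_env.get("GPUSTACK_RUNTIME_DEPLOY_DEFAULT_MEM")
    cpu = model_env.get("ENV_CPU") or node_env.get("GPUSTACK_RUNTIME_DEPLOY_DEFAULT_CPU")

    kwargs: dict = {}
    if mem:
        kwargs["mem_limit"] = mem  # e.g. "8g"
    if cpu:
        # Assumption: the value is a relative CPU-share weight (an integer).
        kwargs["cpu_shares"] = int(cpu)
    # mem_reservation is only set when requested explicitly; it is never
    # derived from mem_limit, to avoid over-reserving memory on the node.
    if mem_reservation:
        kwargs["mem_reservation"] = mem_reservation
    return kwargs
```

With only `GPUSTACK_RUNTIME_DEPLOY_DEFAULT_MEM=8g` set on the node, the sketch yields a `mem_limit` and nothing else, so no reservation is implied.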
Contributor
Please make the above changes in gpustack/gpustack. The runtime already provides a configuration entry point here: runtime/gpustack_runtime/deployer/__types__.py, lines 232 to 244 in c1c6ac1.