Commit
perf: always keep one midnight rose (OpenRouterTeam#83)
sambarnes authored Mar 23, 2024
1 parent 936f731 commit af26dd5
Showing 1 changed file with 1 addition and 2 deletions.
3 changes: 1 addition & 2 deletions modal/runner/containers/vllm_unified.py
@@ -174,8 +174,7 @@ def __init__(self):
 model_name=_midnight_rose,
 gpu=modal.gpu.H100(count=1),
 concurrent_inputs=4,
-# TODO: Keep it warm once launched
-# keep_warm=1,
+keep_warm=1,
 max_containers=1,
 quantization="GPTQ",
 )