[Bug] [RayService] Cannot move from single app to multi-app without downtime if using GCS FT #1297
Comments
This might be related to setting |
Thanks @smit-kiri for reporting this issue! To me, this does not seem to be related to GCS FT. I can reproduce the issue by:
Ray Serve seems not to allow in-place upgrades between API V1 (single app) and API V2 (multi app). A workaround involves not only updating |
Thanks @kevin85421!
Cool. I am still a bit confused. Do you only update serveConfig / serveConfigV2, or do you also update other fields? In the former case, it will only update the serve configurations in-place, while the latter case will trigger a zero-downtime upgrade. In my understanding, the former case will always report the exception |
We triggered a zero-downtime upgrade by updating the docker image |
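The distinction discussed above (changing only `serveConfig` / `serveConfigV2` is applied in place, while changing other fields triggers a zero-downtime upgrade) can be sketched roughly as follows. This is only an illustration of the rule as described in this thread, not KubeRay's actual implementation: `classify_update` is a hypothetical helper, and the flat spec dictionaries with a `rayClusterConfig` key and `image` values are assumed for the example.

```python
# Hypothetical sketch of the update rule described in this thread.
# This is NOT KubeRay code; the spec layout below is an assumption.

# Fields that, per the discussion, are updated in place when changed alone.
SERVE_FIELDS = {"serveConfig", "serveConfigV2"}


def classify_update(old_spec: dict, new_spec: dict) -> str:
    """Return 'no-op', 'in-place', or 'zero-downtime' for a spec change."""
    # Collect every top-level field whose value differs between the specs.
    changed = {
        key
        for key in old_spec.keys() | new_spec.keys()
        if old_spec.get(key) != new_spec.get(key)
    }
    if not changed:
        return "no-op"
    # Only serve configuration changed: applied in place on the same cluster.
    if changed <= SERVE_FIELDS:
        return "in-place"
    # Anything else changed (for example the image): zero-downtime upgrade.
    return "zero-downtime"


single_app = {
    "serveConfig": {"importPath": "demo:app"},
    "rayClusterConfig": {"image": "demo:v1"},
}
multi_app_same_image = {
    "serveConfigV2": "applications: [...]",
    "rayClusterConfig": {"image": "demo:v1"},
}
multi_app_new_image = {
    "serveConfigV2": "applications: [...]",
    "rayClusterConfig": {"image": "demo:v2"},
}

print(classify_update(single_app, multi_app_same_image))  # in-place
print(classify_update(single_app, multi_app_new_image))   # zero-downtime
```

Under this reading, switching from single app to multi-app without touching anything else falls into the in-place case, which is the case that reportedly fails; changing the image as well puts the change on the zero-downtime path.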
Update the doc: ray-project/ray@ec19d15.
ray-project/ray#38647 is merged. Closing this issue.
Search before asking
KubeRay Component
ray-operator
What happened + What you expected to happen
I'm trying to move all our workloads from single-application to multi-application RayService with the release of KubeRay v0.6.0, and it does not seem possible to do so without downtime if we're using GCS FT. I see the following error:
Reproduction script
Single application:
demo.py
Dockerfile
rayservice_config.yaml
Multi-application
demo1.py
demo2.py
Dockerfile
rayservice_config.yaml
Deploy the single application code first. Then try to deploy the multi-application code. You should see an error.
Anything else
A workaround here is:
If you don't reboot the Redis instance, you run into the same error again when trying to add GCS FT back in.
Are you willing to submit a PR?