You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I following this instruction kuberay-gpu-training-example to install ray cluster in my desktop, which has one GPU. I create a new fresh cluster locally instead of remote GCP. The installation succeeds, and I can access the dashboard http://localhost:8265/ , however, the job submission on the console fails with the message:
Meanwhile, http://localhost:8265/api/version can result a json message: {"version": "4", "ray_version": "2.2.0", "ray_commit": "b6af0887ee5f2e460202133791ad941a41f15beb"}
So any suggestion for this?
The steps I did are:
helm repo add kuberay https://ray-project.github.io/kuberay-helm/
helm repo update
helm install kuberay-operator kuberay/kuberay-operator --version 1.0.0
# Create a Ray cluster
kubectl apply -f https://raw.githubusercontent.com/ray-project/ray/master/doc/source/cluster/kubernetes/configs/ray-cluster.gpu.yaml
# port forwarding
kubectl port-forward --address 0.0.0.0 services/raycluster-head-svc 8265:8265
# Test cluster (optional)
ray job submit --address http://localhost:8265 -- python -c "import ray; ray.init(); print(ray.cluster_resources())"
The text was updated successfully, but these errors were encountered:
Issue
I following this instruction kuberay-gpu-training-example to install ray cluster in my desktop, which has one GPU. I create a new fresh cluster locally instead of remote GCP. The installation succeeds, and I can access the dashboard http://localhost:8265/ , however, the job submission on the console fails with the message:
Meanwhile,
http://localhost:8265/api/version
can result a json message:{"version": "4", "ray_version": "2.2.0", "ray_commit": "b6af0887ee5f2e460202133791ad941a41f15beb"}
So any suggestion for this?
The steps I did are:
The text was updated successfully, but these errors were encountered: