[Doc][Train] Add accelerator_type to Ray Train user guide (ray-project#44882)

Document the new `ScalingConfig(accelerator_type)` configuration.

Signed-off-by: Hongpeng Guo <hpguo@anyscale.com>
hongpeng-guo authored Apr 24, 2024
1 parent 2890cd0 commit 0da794c
Showing 1 changed file with 24 additions and 0 deletions.
24 changes: 24 additions & 0 deletions doc/source/train/user-guides/using-gpus.rst
@@ -104,6 +104,30 @@ You can get a list of associated devices with :meth:`ray.train.torch.get_devices`
trainer.fit()


Setting the GPU type
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Ray Train allows you to specify the accelerator type for each worker.
This is useful if you want to use a specific accelerator type for model training.
In a heterogeneous Ray cluster, this means that your training workers are forced to run on the specified GPU type
rather than on an arbitrary GPU node. You can find the supported ``accelerator_type`` values in
:ref:`the available accelerator types <accelerator_types>`.

For example, you can specify ``accelerator_type="A100"`` in the :class:`~ray.train.ScalingConfig` if you want to
assign each worker an NVIDIA A100 GPU.

.. tip::
Ensure that your cluster has instances with the specified accelerator type
or is able to autoscale to fulfill the request.

.. testcode::

    from ray.train import ScalingConfig

    scaling_config = ScalingConfig(
        num_workers=1,
        use_gpu=True,
        accelerator_type="A100",
    )
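Conceptually, this constraint behaves like an extra scheduling resource attached to each worker's request. The snippet below is a minimal, hypothetical sketch (not Ray's actual implementation or public API) of how such a request might translate into a per-worker resource bundle, under the assumption that GPU nodes expose a custom resource label of the form ``accelerator_type:<TYPE>``:

```python
# Hypothetical sketch (not Ray's public API): model a per-worker
# accelerator_type request as a resource bundle.
# Assumption: GPU nodes are labeled with a custom resource named
# "accelerator_type:<TYPE>" that the scheduler can match against.

def worker_resource_bundle(num_gpus, accelerator_type=None):
    """Build the resource request for a single training worker."""
    bundle = {"GPU": num_gpus}
    if accelerator_type is not None:
        # Requesting a tiny fractional amount of the label is enough to
        # constrain placement to nodes with the matching accelerator,
        # without consuming a meaningful share of that resource.
        bundle[f"accelerator_type:{accelerator_type}"] = 0.001
    return bundle

print(worker_resource_bundle(1, "A100"))
# {'GPU': 1, 'accelerator_type:A100': 0.001}
```

With this framing, omitting ``accelerator_type`` simply drops the label from the bundle, so the worker can land on any GPU node.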


(PyTorch) Setting the communication backend
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

