-
Notifications
You must be signed in to change notification settings - Fork 4.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
HPUAccelerator: remove support in set_visible_devices_envs #5929
Conversation
Hi @nelyahu in the link https://docs.habana.ai/en/latest/PyTorch/Reference/PT_Multiple_Tenants_on_HPU/Multiple_Workloads_Single_Docker.html specificed in the code. It mentioned |
@delock on Intel Gaudi accelerator (HPU), the visible module ids are determines by the |
Hi @nelyahu From the title and description this PR intends to avoid set_visible_devices_envs on this line: DeepSpeed/deepspeed/launcher/launch.py Line 166 in ffe0af2
This PR also nullify this line which does not set_visible_devices_envs, is it also intended? DeepSpeed/deepspeed/launcher/runner.py Line 408 in ffe0af2
|
@delock i want to avoid DeepSpeed accessing to this env var at all. |
1a3c304
to
5b9f50e
Compare
Hi @nelyahu, the latest push added cosmetic change from other 8 files which is unrelated. |
The way deepspeed sets it is not correct with all HPU instances and may lead to incorrect behavior.
5b9f50e
to
fc382db
Compare
Yes, formatter version issue. fixed |
The way deepspeed sets it is not correct with all HPU instances and may lead to incorrect behavior.