[XLA:GPU] Add support for SM101a and SM120a architectures (Blackwell) #22029

sergey-kozub · 2025-01-29T17:16:28Z

In addition to SM120a, also add SM101a mentioned in the PTX 8.7 spec (https://docs.nvidia.com/cuda/parallel-thread-execution/#release-notes), which is a slight variation of SM100a.

Bumping the max supported PTX version to 8.7, as the LLVM PR (llvm/llvm-project#124155) adding the support is now integrated to OpenXLA.

…(Blackwell) Imported from GitHub PR #22029 In addition to SM120a, also add SM101a mentioned in the PTX 8.7 spec (https://docs.nvidia.com/cuda/parallel-thread-execution/#release-notes), which is a slight variation of SM100a. Bumping the max supported PTX version to 8.7, as the LLVM PR (llvm/llvm-project#124155) adding the support is now integrated to OpenXLA. Copybara import of the project: -- be59b7a by Sergey Kozub <skozub@nvidia.com>: [XLA:GPU] Add support for SM101a and SM120a architectures (Blackwell) Merging this change closes #22029 FUTURE_COPYBARA_INTEGRATE_REVIEW=#22029 from openxla:devel/sm120a be59b7a PiperOrigin-RevId: 721049239

…(Blackwell) Imported from GitHub PR openxla/xla#22029 In addition to SM120a, also add SM101a mentioned in the PTX 8.7 spec (https://docs.nvidia.com/cuda/parallel-thread-execution/#release-notes), which is a slight variation of SM100a. Bumping the max supported PTX version to 8.7, as the LLVM PR (llvm/llvm-project#124155) adding the support is now integrated to OpenXLA. Copybara import of the project: -- be59b7a51721637d880207e7adb69a18c3a92bea by Sergey Kozub <skozub@nvidia.com>: [XLA:GPU] Add support for SM101a and SM120a architectures (Blackwell) Merging this change closes #22029 FUTURE_COPYBARA_INTEGRATE_REVIEW=openxla/xla#22029 from openxla:devel/sm120a be59b7a51721637d880207e7adb69a18c3a92bea PiperOrigin-RevId: 721049239

…(Blackwell) Imported from GitHub PR #22029 In addition to SM120a, also add SM101a mentioned in the PTX 8.7 spec (https://docs.nvidia.com/cuda/parallel-thread-execution/#release-notes), which is a slight variation of SM100a. Bumping the max supported PTX version to 8.7, as the LLVM PR (llvm/llvm-project#124155) adding the support is now integrated to OpenXLA. Copybara import of the project: -- be59b7a by Sergey Kozub <skozub@nvidia.com>: [XLA:GPU] Add support for SM101a and SM120a architectures (Blackwell) Merging this change closes #22029 FUTURE_COPYBARA_INTEGRATE_REVIEW=#22029 from openxla:devel/sm120a be59b7a PiperOrigin-RevId: 721049239

…(Blackwell) Imported from GitHub PR openxla/xla#22029 In addition to SM120a, also add SM101a mentioned in the PTX 8.7 spec (https://docs.nvidia.com/cuda/parallel-thread-execution/#release-notes), which is a slight variation of SM100a. Bumping the max supported PTX version to 8.7, as the LLVM PR (llvm/llvm-project#124155) adding the support is now integrated to OpenXLA. Copybara import of the project: -- be59b7a51721637d880207e7adb69a18c3a92bea by Sergey Kozub <skozub@nvidia.com>: [XLA:GPU] Add support for SM101a and SM120a architectures (Blackwell) Merging this change closes #22029 FUTURE_COPYBARA_INTEGRATE_REVIEW=openxla/xla#22029 from openxla:devel/sm120a be59b7a51721637d880207e7adb69a18c3a92bea PiperOrigin-RevId: 721049239

…(Blackwell) Imported from GitHub PR openxla/xla#22029 In addition to SM120a, also add SM101a mentioned in the PTX 8.7 spec (https://docs.nvidia.com/cuda/parallel-thread-execution/#release-notes), which is a slight variation of SM100a. Bumping the max supported PTX version to 8.7, as the LLVM PR (llvm/llvm-project#124155) adding the support is now integrated to OpenXLA. Copybara import of the project: -- be59b7a51721637d880207e7adb69a18c3a92bea by Sergey Kozub <skozub@nvidia.com>: [XLA:GPU] Add support for SM101a and SM120a architectures (Blackwell) Merging this change closes #22029 PiperOrigin-RevId: 721088886

[XLA:GPU] Add support for SM101a and SM120a architectures (Blackwell)

be59b7a

sergey-kozub requested a review from reedwm January 29, 2025 17:16

sergey-kozub self-assigned this Jan 29, 2025

reedwm approved these changes Jan 29, 2025

View reviewed changes

copybara-service bot mentioned this pull request Jan 29, 2025

PR #22029: [XLA:GPU] Add support for SM101a and SM120a architectures (Blackwell) #22049

Merged

copybara-service bot mentioned this pull request Jan 29, 2025

PR #22029: [XLA:GPU] Add support for SM101a and SM120a architectures (Blackwell) tensorflow/tensorflow#86134

Merged

copybara-service bot closed this in 08d91b4 Jan 29, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[XLA:GPU] Add support for SM101a and SM120a architectures (Blackwell) #22029

[XLA:GPU] Add support for SM101a and SM120a architectures (Blackwell) #22029

Uh oh!

sergey-kozub commented Jan 29, 2025

Uh oh!

Uh oh!

[XLA:GPU] Add support for SM101a and SM120a architectures (Blackwell) #22029

[XLA:GPU] Add support for SM101a and SM120a architectures (Blackwell) #22029

Uh oh!

Conversation

sergey-kozub commented Jan 29, 2025

Uh oh!

Uh oh!