Add support for a CUDA 12.9.1 image with PyTorch 2.8.0 #7

lfu77 · 2025-10-23T18:58:42Z

This PR adds a Makefile recipe to build a Determined base image with CUDA 12.9.1 and PyTorch 2.8.0

I encountered some issues trying to build the image with PyTorch 2.9.0, I believe that we should be overriding this anyways when we build the actual augment images though

I was also unable to add 10.0 to the TORCH_CUDA_ARCH_LIST. I think this should be the Blackwell version.

TESTED:

Ran make build-gpt-neox-deepspeed-gpu-torch-280 and then docker run -it 77824367d1e6 /bin/bash to check the nvcc version is 12.9.1

lfu77 added 4 commits October 22, 2025 21:57

update the Makefile with new recipes

b7a3910

fix the pytorch version to 2.8.0

c1c93c4

builds up until deepspeed install

825bb6d

build completes

e65f18f

lfu77 requested a review from mmonaco October 23, 2025 18:58

upgrade to python 3.11.7

dc3a1da

lfu77 closed this Oct 24, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for a CUDA 12.9.1 image with PyTorch 2.8.0 #7

Add support for a CUDA 12.9.1 image with PyTorch 2.8.0 #7

Uh oh!

lfu77 commented Oct 23, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Add support for a CUDA 12.9.1 image with PyTorch 2.8.0 #7

Add support for a CUDA 12.9.1 image with PyTorch 2.8.0 #7

Uh oh!

Conversation

lfu77 commented Oct 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

lfu77 commented Oct 23, 2025 •

edited

Loading