Fix Windows build and add Windows CPU tests #806

NicolasHug · 2025-08-05T08:18:40Z

This PR fixes the Windows wheel build, and adds test jobs that mimic the setup we have for other platforms: we build a wheel, and then install and run the tests from that wheel on a separate machine.

The vast majority of tests are passing as-is, without any modification needed. Worth noting that for comparing frames, we are going through the same path as on OSX, with higher tolerances:

torchcodec/test/utils.py

Lines 82 to 83 in 05a6ff5

    
    else:
        torch.testing.assert_close(*args, **kwargs, atol=3, rtol=0)
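To make the `atol=3, rtol=0` setting concrete, here is a minimal pure-Python sketch of the closeness rule that `torch.testing.assert_close` documents, `|actual - expected| <= atol + rtol * |expected|`. The helper name `within_tolerance` and the sample pixel values are hypothetical and only illustrate the rule; this is not torchcodec's test utility.

```python
def within_tolerance(actual, expected, atol=3.0, rtol=0.0):
    """Return True if every pair satisfies |a - e| <= atol + rtol * |e|."""
    if len(actual) != len(expected):
        return False
    return all(
        abs(a - e) <= atol + rtol * abs(e) for a, e in zip(actual, expected)
    )


# Hypothetical 8-bit pixel values: every element differs by at most 3.
reference = [0, 128, 255, 17]
decoded = [2, 125, 255, 20]
print(within_tolerance(decoded, reference))            # True

# A single element that is off by 4 is enough to fail the check.
print(within_tolerance([4, 128, 255, 17], reference))  # False
```

With `rtol=0` the bound does not scale with the pixel value, so dark and bright pixels get the same absolute slack of 3.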

I had a few problems with the audio encoder though. In increasing order of importance:

We can't always validate parameters, so we (correctly) error later in avcodec_open2(). This is described in this comment: Fix Windows build and add Windows CPU tests #806 (comment)
Encoding mp3 files seems to be buggy. We can encode, but we're getting weird errors when decoding what we just encoded. I don't think it's a decoding issue though, because the decoder tests are using mp3 files just fine. I have opened Encoding mp3 on Windows is probably incorrect #837 to follow up.
We are getting segfaults and memory access errors with FFmpeg 5 (and only with FFmpeg 5!!). I opened FFmpeg 5 on Windows errors in AudioEncoder's avcodec_open2 when passing bad parameters #836 to follow up. You'll see more details there, but the segfault happens internally within avcodec_open2(), even after validating that the parameters are valid pointers. Because it only happens on Windows and on FFmpeg 5, I want to declare that this is an FFmpeg 5 bug.

Note: we're not erroring on warnings on Windows, and there are a lot of those. I'll re-enable this as a follow-up:

torchcodec/src/torchcodec/_core/CMakeLists.txt

Lines 14 to 17 in 05a6ff5

    
    if (WIN32)
        # TODO set warnings as errors on Windows as well.
        # set(TORCHCODEC_WERROR_OPTION "/WX")
    else()

Massive thanks to @traversaro for unblocking us with the #806 (comment) fix.


NicolasHug · 2025-08-22T11:32:51Z

src/torchcodec/_core/Encoder.cpp

-      avio_close(avFormatContext_->pb);
+      if (avFormatContext_->pb->error == 0) {
+        avio_close(avFormatContext_->pb);
+      }


Above is a drive-by as I was trying to fix the problem with FFmpeg 5. I'm not sure it fixed anything, but it is probably still a good change to have? I can extract it out in another PR if it's preferred.

Does this code imply that if the AVIOContext stored in avFormatContext_ has an error, we do not need to flush / close it?

Based on my digging, it's also that it's risky to call flush when there's an error state. Seems reasonable to keep it in.

Yeah, it's because when error isn't 0, flushing or calling avio_close() can potentially access invalid data.

NicolasHug · 2025-08-22T11:34:33Z

test/test_encoders.py

+        with pytest.raises(
+            RuntimeError,
+            match=avcodec_open2_failed_msg if IS_WINDOWS else "invalid sample rate=10",
+        ):


Above and in other tests below: on Windows, we're hitting this early return:

torchcodec/src/torchcodec/_core/Encoder.cpp

Lines 35 to 38 in 05a6ff5

    void validateSampleRate(const AVCodec& avCodec, int sampleRate) {
      if (avCodec.supported_samplerates == nullptr) {
        return;
      }

So we're unable to validate some parameters early, and we just fail in the call to avcodec_open2() later:

torchcodec/src/torchcodec/_core/Encoder.cpp

Lines 213 to 217 in 05a6ff5

    int status = avcodec_open2(avCodecContext_.get(), avCodec, nullptr);
    TORCH_CHECK(
        status == AVSUCCESS,
        "avcodec_open2 failed: ",
        getFFMPEGErrorStringFromErrorCode(status));
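The control flow described above can be sketched in plain Python to show why the same bad sample rate produces two different error messages. Everything here is a hypothetical stand-in: `validate_sample_rate` mirrors the early return in `validateSampleRate`, `open_codec` plays the role of `avcodec_open2()`, and the `rate >= 8000` acceptance rule and the `<error string>` placeholder are assumptions made purely for illustration.

```python
def validate_sample_rate(supported_sample_rates, sample_rate):
    # Mirrors the early return: with no advertised list, nothing can be
    # validated at this stage.
    if supported_sample_rates is None:
        return
    if sample_rate not in supported_sample_rates:
        raise RuntimeError(f"invalid sample rate={sample_rate}")


def open_codec(sample_rate):
    # Hypothetical stand-in for avcodec_open2(): the codec itself rejects
    # the value. The acceptance rule is invented for this sketch.
    if sample_rate < 8000:
        raise RuntimeError("avcodec_open2 failed: <error string>")


def error_message(supported_sample_rates, sample_rate):
    """Run validation then codec opening; return the error text, if any."""
    try:
        validate_sample_rate(supported_sample_rates, sample_rate)
        open_codec(sample_rate)
    except RuntimeError as exc:
        return str(exc)
    return None


# The codec advertises its rates: the early validation catches the problem.
print(error_message([44100, 48000], 10))  # invalid sample rate=10

# No advertised rates (the Windows case in this thread): the late check fails.
print(error_message(None, 10))            # avcodec_open2 failed: <error string>
```

This is why the test matches on a different message when `IS_WINDOWS` is true: the input is rejected in both cases, just at a different stage.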

NicolasHug · 2025-08-22T11:36:18Z

src/torchcodec/_core/CMakeLists.txt

-            ARCHIVE_OUTPUT_DIRECTORY_RELEASE ${CMAKE_CURRENT_BINARY_DIR}
-        )
-    endif()
-


@traversaro correctly pointed out that this can now be removed, as a consequence of the other cmake_build_type fix below #806 (comment)

NicolasHug · 2025-08-22T11:41:32Z

setup.py

+        subprocess.check_call(
+            ["cmake", "--install", ".", "--config", cmake_build_type],
+            cwd=self.build_temp,
+        )


Above is the main fix that was needed for the extensions to load properly, something I had been blocked on for a few weeks. The fix is from @traversaro :

NicolasHug#1

IIUC, on Windows and with our current setup, cmake is generating multiple build files. Without this fix, the different build files would be inheriting different build configurations. Typically we'd be building some parts with the Release build type, while others would be inheriting a different build type, causing problems at runtime when loading the libraries.

@traversaro thank you so much again for unblocking us!
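As a rough illustration of the build-type mismatch described above, the sketch below builds the CMake command lines so that the build and install steps are pinned to the same configuration. Only the `cmake --install . --config <type>` call appears in this PR's diff; the matching `cmake --build . --config <type>` step and the helper name `cmake_commands` are assumptions about how a setup.py might be organized, not the actual torchcodec code.

```python
def cmake_commands(cmake_build_type):
    """Return (build, install) commands pinned to one configuration."""
    # With multi-config generators (such as Visual Studio), the configuration
    # is chosen at build/install time via --config, not only at configure time.
    build = ["cmake", "--build", ".", "--config", cmake_build_type]
    install = ["cmake", "--install", ".", "--config", cmake_build_type]
    return build, install


build_cmd, install_cmd = cmake_commands("Release")
print(" ".join(build_cmd))    # cmake --build . --config Release
print(" ".join(install_cmd))  # cmake --install . --config Release

# In a real setup.py these would be run with something like
# subprocess.check_call(cmd, cwd=build_temp), as in the diff above.
```

Passing the same value to both steps is the point: if one step falls back to a default configuration, the installed libraries can end up mixing build types.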

NicolasHug · 2025-08-22T11:45:23Z

.github/workflows/windows_wheel.yaml

+    strategy:
+      fail-fast: false
+      matrix:
+        python-version: ['3.9', '3.10', '3.11', '3.12']


Will revert to just 3.9 before merging, like in the other jobs. This is just to show that the jobs are green across multiple versions

Also, you probably don't need to review the rest of this file too closely, because it's mostly copy/pasted from our existing test job (e.g. the one for Linux).

NicolasHug · 2025-08-22T11:45:57Z

.github/workflows/windows_wheel.yaml

@@ -36,6 +36,9 @@ jobs:
      with-rocm: disable
      with-cuda: disable
      build-python-only: "disable"
+      # Explicitly avoid 3.13 because 3.13t builds don't work.
+      # TODO remove eventually.
+      python-versions: '["3.9", "3.10", "3.11", "3.12"]'


Will remove this before merging. This is just to show that the jobs are green across versions.

Note that we may still try to address the 3.13t issue, but that's for a follow-up.

Dan-Flores · 2025-08-22T13:59:50Z

test/test_encoders.py

@@ -295,8 +315,15 @@ def test_against_cli(
            rtol, atol = 0, 1e-3
        else:
            rtol, atol = None, None
+
+        if IS_WINDOWS and format == "mp3":


Will this condition be hit? It might be a duplicate of the check and pytest.skip on line 260.

The check above only skips on FFmpeg <= 5, but FFmpeg 6 and 7 will still reach this line :)

Dan-Flores · 2025-08-22T13:59:54Z

test/test_encoders.py

-        torch.testing.assert_close(
-            self.decode(encoded_file).data, self.decode(encoded_output).data
-        )
+        if not (IS_WINDOWS and format == "mp3"):


Same question here, is this a duplicate check with line 319? On a similar note, it might be better to explicitly call pytest.skip whenever we are not running a test.

It's not a duplicate because this will be reached by FFmpeg 6 and 7. You're right that we should generally prefer using pytest.skip to just return, or to doing what we're doing here: it's better to know that a test was skipped, rather than seeing it be green when in fact it never ran.

In this specific case I think it's best not to call pytest.skip, because the test is still doing something meaningful just above. Specifically it calls the encoding methods, without error. We are only skipping the assert_close() check.
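The difference between the two guards discussed in this thread can be sketched as follows. The conditions are reconstructed from the comments (a full skip for mp3 on Windows with FFmpeg <= 5, and a comparison-only guard for mp3 on Windows in general); the function names and returned labels are hypothetical and not part of the test suite.

```python
def should_skip_test(is_windows, fmt, ffmpeg_major):
    # Early pytest.skip-style guard: the whole test is skipped.
    return is_windows and fmt == "mp3" and ffmpeg_major <= 5


def should_compare_output(is_windows, fmt):
    # Later guard: encoding still runs, only the assert_close-style
    # comparison is left out.
    return not (is_windows and fmt == "mp3")


def run_case(is_windows, fmt, ffmpeg_major):
    if should_skip_test(is_windows, fmt, ffmpeg_major):
        return "skipped"
    # ... the encoding calls would run here, and must not raise ...
    if should_compare_output(is_windows, fmt):
        return "encoded and compared"
    return "encoded only"


print(run_case(True, "mp3", 5))    # skipped
print(run_case(True, "mp3", 7))    # encoded only
print(run_case(True, "flac", 7))   # encoded and compared
print(run_case(False, "mp3", 5))   # encoded and compared
```

The "encoded only" case is the one that FFmpeg 6 and 7 reach on Windows, which is why the second guard is not redundant with the first.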

Dan-Flores · 2025-08-22T14:00:53Z

test/test_encoders.py

@@ -414,7 +447,7 @@ def test_num_channels(

        sample_rate = 16_000
        source_samples = torch.rand(num_channels_input, 1_000)
-        format = "mp3"
+        format = "flac"


Does this change enable the test to pass on Windows? If so, we could parameterize the format and add an IS_WINDOWS check to skip if IS_WINDOWS and format == "mp3", as in tests above.

Correct, I changed from mp3 to flac due to the issues we had on Windows.

I agree that we could parameterize, but I'm not sure it's necessary for the purpose of this test, as it wouldn't make it more robust. Mostly this test is just about checking the number of output channels is respected, which is agnostic to the format.

NicolasHug added 2 commits August 5, 2025 08:50

Add test wheel job

d81c3e3

Make pybind extension a pyd instead of dll

78ea520

meta-cla bot added the CLA Signed label Aug 5, 2025

NicolasHug marked this pull request as draft August 5, 2025 08:18

NicolasHug added 26 commits August 5, 2025 09:22

empty

a0dd832

libtorchcodec_custom_ops may not be able to find libtorchcodec_core. …

1b5f926

…Set its RPATH to ORIGIN so it can find libtorchcodec_core

Try to add more debugging output on Windows

218ca2d

Print more debug

4cce232

More debug stuff

ab53ca9

Use MSVC toolchain when building FFmpeg on Windows

8450297

Use msvc-built FFmpeg binaries when building the wheel

3b79ff5

Merge branch 'buildffmpegwindowsmsvc' into windows-tests

e4f00a4

Try init_dll_path like in audio. But they only do it on 3.8

ac8116d

revert previous. Now add Python3_FIND_ABI and explicitly set Python3 …

9573fc2

…as dependency of custom_ops target

Set core lib dependencies as PRIVATE???

963590e

Same logic but try exposing the include directories as public

2f2c91d

OK at this point I have no idea

d896211

Add test job on Windows.

01f9dc0

Revert "OK at this point I have no idea"

60df908

This reverts commit d896211.

Revert "Same logic but try exposing the include directories as public"

09d077e

This reverts commit 2f2c91d.

Revert "Set core lib dependencies as PRIVATE???"

61ef07b

This reverts commit 963590e.

debug

1ae09aa

oops

6921d4c

Continuing

dab2d8e

oops

ff91aca

Try to build from source

bc919cc

DEBUG stuff

d988393

fix

9da6539

empty

99d0308

Make it a MODULE?????

7247adc

NicolasHug and others added 9 commits August 21, 2025 13:29

Use flac in test_num_channels

f25a0a7

Merge commit '8a30f92' into windows-tests

89e213d

Fix json list?

281cc38

Test on more ffmpeg versions

f2c3414

skip mp3 tests on windows when ffmpeg <= 5

12df6c7

better close??

9048f6c

use 5.0.1 instead of 5.1.2

c02ffc5

Use 5.1.1

4e4f739

cleanup

e40dfc2


traversaro mentioned this pull request Aug 22, 2025

Add lerobot python package conda-forge/staged-recipes#30714

Open

10 tasks


NicolasHug changed the title ~~Test Windows CPU wheel~~ Fix Windows build and add Windows CPU tests Aug 22, 2025

NicolasHug marked this pull request as ready for review August 22, 2025 12:47

Dan-Flores reviewed Aug 22, 2025


scotts approved these changes Aug 26, 2025


NicolasHug added 5 commits August 26, 2025 14:26

Rely on defaults for python versions

ff6d5d1

Set 3.10 as minimum supported version

f83e9da

Merge branch 'main' of github.com:pytorch/torchcodec into windows-tests

23c20de

Merge branch 'threeten' into windows-tests

7657857

Set 3.10 for windows as well

b531d4b

Dan-Flores approved these changes Aug 26, 2025


NicolasHug removed the ciflow/binaries/all label Aug 26, 2025

Merge branch 'main' into windows-tests

73e4680
