Temporarily unbreak some LTXAV (LTX 2.0) workflows to give people time to migrate. #12605

comfyanonymous merged 1 commit into master from
Conversation
📝 Walkthrough

This change introduces a compatibility mode pathway to the LTXAVTEModel class, gated by a new optional `compat_mode` flag.

🚥 Pre-merge checks: ✅ 2 passed, ❌ 1 failed (1 warning)
Actionable comments posted: 1
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Inline comments:
In `@comfy/text_encoders/lt.py`:
- Around line 154-160: The code forces out to float32 then feeds it to
compat-mode connectors, causing dtype mismatches with connector weights (which
may be bf16/fp16); instead of out = out.float(), cast out to the connectors'
dtype before calling self.video_embeddings_connector and
self.audio_embeddings_connector (e.g., determine the connector dtype from a
connector parameter or weight:
next(self.video_embeddings_connector.parameters()).dtype or
self.video_embeddings_connector.weight.dtype) and do out =
out.to(connector_dtype) (or ensure both connectors use the same dtype) so the
forward passes use the connector's dtype and avoid scalar type errors.
```python
out = out.float()

if self.compat_mode:
    out_vid = self.video_embeddings_connector(out)[0]
    out_audio = self.audio_embeddings_connector(out)[0]
    out = torch.concat((out_vid, out_audio), dim=-1)
```
🧩 Analysis chain
🏁 Script executed:

```shell
cat -n comfy/text_encoders/lt.py | head -180 | tail -80
```

Repository: Comfy-Org/ComfyUI
Length of output: 4318
🏁 Script executed:

```shell
# Search for where the connectors are initialized
rg -n "video_embeddings_connector|audio_embeddings_connector" comfy/text_encoders/lt.py -A 2 -B 2
```

Repository: Comfy-Org/ComfyUI
Length of output: 1895
🏁 Script executed:

```shell
# Check the projection's dtype handling
rg -n "projection|dtype" comfy/text_encoders/lt.py | head -20
```

Repository: Comfy-Org/ComfyUI
Length of output: 1793
🏁 Script executed:

```shell
# Check git history for line 154 (the float() call)
git log -p --follow comfy/text_encoders/lt.py | grep -A 15 -B 15 "out = out.float()" | head -80
```

Repository: Comfy-Org/ComfyUI
Length of output: 43
🌐 Web query:
PyTorch float32 input bf16 fp16 module weights dtype mismatch behavior without autocast
💡 Result:
Without autocast, PyTorch generally does not automatically cast activations to match a module's parameter dtype for "typed" ops like GEMM/linear, matmul, and conv. If your input is float32 but the module weights are float16 or bfloat16, you'll typically get a runtime error such as:

- `RuntimeError: expected scalar type Half but found Float`
- `RuntimeError: expected scalar type Float but found BFloat16`
- `RuntimeError: mat1 and mat2 must have the same dtype` (common for addmm/matmul/linear) [1], [2]

This differs from many elementwise arithmetic ops (add/sub/mul/div), which follow PyTorch type promotion rules and may upcast instead of erroring [3].
What autocast changes
Inside torch.autocast(...), PyTorch will automatically cast inputs for certain eligible ops (e.g., mm) so you can feed float32 tensors and still run the op in float16/bfloat16, producing lower-precision outputs without manual casts [4]. PyTorch also warns that tensors produced inside autocast regions (often float16) can later cause dtype mismatch errors when used outside the region with tensors of different dtype—so you may need to cast outputs back to float32 when leaving autocast [5].
Practical fixes (no autocast)

- Make dtypes match explicitly: `x = x.to(weight.dtype)` (or keep the module in float32).
- Or use `torch.autocast(device_type="cuda", dtype=torch.float16)` (or `torch.bfloat16`) around the forward pass where you want mixed precision [4], [5].
Sources: [1] [2] [3] [4] [5]
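The behavior described above is easy to reproduce. The following is an illustrative sketch (standalone, not part of the PR's code) contrasting the typed-op error with elementwise promotion, and showing the cast-to-weight-dtype fix pattern the review recommends:

```python
import torch

# A linear layer whose weights are bfloat16, as the compat-mode connectors may be.
linear = torch.nn.Linear(4, 4).to(torch.bfloat16)
x = torch.randn(2, 4)  # float32 input

# Typed ops (the matmul inside Linear) do NOT auto-promote: this raises a dtype error.
try:
    linear(x)
except RuntimeError as e:
    print("typed op failed:", e)

# Elementwise ops DO follow type promotion: float32 + bfloat16 -> float32.
y = x + torch.randn(2, 4, dtype=torch.bfloat16)
print(y.dtype)  # torch.float32

# The fix pattern from the review: cast the input to the weights' dtype first.
out = linear(x.to(next(linear.parameters()).dtype))
print(out.dtype)  # torch.bfloat16
```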
Fix dtype mismatch between output and compat-mode connectors.

At line 154, `out` is forced to float32, but the connectors (initialized at lines 117 and 125) use the projection's dtype, which is often bf16 or fp16. Passing float32 tensors to modules with bf16/fp16 weights will raise a runtime error like `expected scalar type Half but found Float`.
Cast to connector dtype before the forward pass, then the result stays in the connector's dtype:
Proposed fix:

```diff
-out = out.float()
-
-if self.compat_mode:
-    out_vid = self.video_embeddings_connector(out)[0]
-    out_audio = self.audio_embeddings_connector(out)[0]
-    out = torch.concat((out_vid, out_audio), dim=-1)
+if self.compat_mode:
+    compat_dtype = next(self.video_embeddings_connector.parameters()).dtype
+    out_compat = out.to(dtype=compat_dtype)
+    out_vid = self.video_embeddings_connector(out_compat)[0]
+    out_audio = self.audio_embeddings_connector(out_compat)[0]
+    out = torch.concat((out_vid, out_audio), dim=-1)
+else:
+    out = out.float()
```

📝 Committable suggestion
‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.
```python
if self.compat_mode:
    compat_dtype = next(self.video_embeddings_connector.parameters()).dtype
    out_compat = out.to(dtype=compat_dtype)
    out_vid = self.video_embeddings_connector(out_compat)[0]
    out_audio = self.audio_embeddings_connector(out_compat)[0]
    out = torch.concat((out_vid, out_audio), dim=-1)
else:
    out = out.float()
```
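The suggested pattern above can be exercised end to end with a minimal stand-in module. This sketch is hypothetical: the connector names mirror the review, but the dimensions are invented and plain `nn.Linear` layers replace the real connectors (which, per the diff, return tuples indexed with `[0]`):

```python
import torch
import torch.nn as nn

class CompatSketch(nn.Module):
    """Minimal stand-in for the compat-mode forward path discussed above."""

    def __init__(self, dim=8, compat_mode=True, dtype=torch.bfloat16):
        super().__init__()
        self.compat_mode = compat_mode
        self.video_embeddings_connector = nn.Linear(dim, dim, dtype=dtype)
        self.audio_embeddings_connector = nn.Linear(dim, dim, dtype=dtype)

    def forward(self, out):
        if self.compat_mode:
            # Cast to the connectors' dtype so the linear layers do not
            # raise a scalar-type mismatch on float32 input.
            compat_dtype = next(self.video_embeddings_connector.parameters()).dtype
            out_compat = out.to(dtype=compat_dtype)
            out_vid = self.video_embeddings_connector(out_compat)
            out_audio = self.audio_embeddings_connector(out_compat)
            out = torch.concat((out_vid, out_audio), dim=-1)
        else:
            out = out.float()
        return out

m = CompatSketch()
result = m(torch.randn(2, 8))  # float32 input works despite bf16 weights
print(result.shape, result.dtype)
```

Note that the output stays in the connectors' dtype on the compat path, matching the review's observation that no cast back to float32 is needed there.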
This will eventually be removed again, which will break many workflows that don't use the official LTXAV (LTX 2.0) files.
If you use the official LTXV files you are good. If you use non-official files, please migrate.