Fix: UnboundLocalError for variable 'dim' about issue #7449

weeknan · 2025-07-24T12:28:19Z

Fix `UnboundLocalError` in `ZeroLinear.backward()` when training only bias parameters, as mentioned in #7435

This PR addresses an issue in the `ZeroLinear.backward()` method, where the local variable `dim` could be referenced before assignment. This happens specifically when:

- Only the bias parameters are set to `requires_grad=True`, and
- The training setup uses ZeRO Stage 3, AMP, and gradient checkpointing.

Problem

When only the bias requires gradients, the branch that sets `dim = grad_output.dim()` is skipped, but `dim` is still read later in the computation, which raises an `UnboundLocalError`.

Fix

Move the assignment `dim = grad_output.dim()` so that it runs unconditionally. `dim` is then always defined before any branch of the gradient computation reads it.
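The pattern behind the bug and the fix can be sketched in plain Python. This is a simplified stand-in, not the DeepSpeed source: the function names `backward_buggy` and `backward_fixed`, their parameters, and the string "steps" they return are all hypothetical, chosen only to show how a variable assigned inside one branch becomes unbound when that branch is skipped.

```python
def backward_buggy(grad_ndim, weight_needs_grad, bias_needs_grad):
    """Hypothetical stand-in: `dim` is assigned only inside the weight branch."""
    steps = []
    if weight_needs_grad:
        dim = grad_ndim  # the only assignment of `dim`
        steps.append("weight: flatten" if dim > 2 else "weight: direct")
    if bias_needs_grad:
        # Reads `dim`; it is unbound if the weight branch above was skipped.
        steps.append("bias: sum leading dims" if dim > 2 else "bias: sum dim 0")
    return steps


def backward_fixed(grad_ndim, weight_needs_grad, bias_needs_grad):
    """Same hypothetical logic, with `dim` assigned unconditionally."""
    dim = grad_ndim  # always defined before either branch runs
    steps = []
    if weight_needs_grad:
        steps.append("weight: flatten" if dim > 2 else "weight: direct")
    if bias_needs_grad:
        steps.append("bias: sum leading dims" if dim > 2 else "bias: sum dim 0")
    return steps


if __name__ == "__main__":
    # Bias-only training: the weight branch is skipped.
    try:
        backward_buggy(3, weight_needs_grad=False, bias_needs_grad=True)
    except UnboundLocalError as exc:
        print("buggy:", type(exc).__name__)
    print("fixed:", backward_fixed(3, weight_needs_grad=False, bias_needs_grad=True))
```

When both branches run, the two versions behave identically; they differ only in the bias-only case, which matches the conditions described in this PR.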

Impact

The backward pass no longer fails when only the bias parameters are trainable, which makes it more robust across different training setups.

Signed-off-by: weeknan <zhounan0431@163.com>

## Fix `UnboundLocalError` in `ZeroLinear.backward()` when training only bias parameters, as mentioned in deepspeedai#7435

This PR addresses an issue in the `ZeroLinear.backward()` method, where the local variable `dim` could be referenced before assignment. This happens specifically when:

- Only the bias parameters are set to `requires_grad=True`, and
- The training setup uses **ZeRO Stage 3**, **AMP**, and **gradient checkpointing**.

### Problem

When only the bias requires gradients, the condition for setting `dim = grad_output.dim()` is skipped, but the value of `dim` is still used later in the computation, leading to an `UnboundLocalError`.

### Fix

Move the assignment `dim = grad_output.dim()` to occur unconditionally, so that `dim` is always defined before being used in any branch of the gradient computation logic.

### Impact

This makes the backward pass more robust across different training setups.

Signed-off-by: weeknan <zhounan0431@163.com>
Co-authored-by: Olatunji Ruwase <tjruwase@gmail.com>
Co-authored-by: Logan Adams <114770087+loadams@users.noreply.github.com>

The same commit message also appears twice more, each time with one additional sign-off:

Signed-off-by: qimcis <chixie.mcisaac@gmail.com>

Signed-off-by: lym <letusgo126@126.com>

weeknan requested review from tjruwase and tohtana as code owners July 24, 2025 12:28

weeknan changed the title ~~Fix: UnboundLocalError for variable 'dim' about issue #7435~~ Fix: UnboundLocalError for variable 'dim' about issue Jul 24, 2025

Fix: UnboundLocalError for variable 'dim'

c18caf0

Signed-off-by: weeknan <zhounan0431@163.com>

weeknan force-pushed the fix-UnboundLocalError-bug branch from 0fbabd3 to c18caf0 Compare July 24, 2025 12:33

hwchen2017 approved these changes Jul 24, 2025

tjruwase and others added 2 commits July 25, 2025 14:19

Merge branch 'master' into fix-UnboundLocalError-bug

909a646

Merge branch 'master' into fix-UnboundLocalError-bug

19d5cf2

loadams approved these changes Jul 28, 2025

loadams merged commit 56fed13 into deepspeedai:master Jul 28, 2025
9 checks passed

weeknan deleted the fix-UnboundLocalError-bug branch July 29, 2025 14:25
