fix bug in Generic GEMM with no bias #126

marchioa · 2025-10-21T15:21:01Z

In generic platform, GEMM was not correctly hoisting the bias tensor when required.
To solve the issue, bias hoisting has been moved from MatMulParser.parseNodeCtxt to GEMMParser.parseNode.
Moreover, the default value of noBiasHoisting flag in GenericGEMMParser has been changed from True to False to be compliant with the template.

Added

testFloatGEMMnobias

Changed

Generic\Parser.py file (MatMulParser, GEMMParser, and GenericGEMMParser)

Fixed

fix bias hoisting in GEMM with no bias

PR Merge Checklist

The PR is rebased on the latest devel commit and pointing to devel.
Your PR reviewed and approved.
All checks are passing.
The CHANGELOG.md file has been updated.
If the docker was modified, change back its link after review.

coderabbitai · 2025-10-21T15:25:14Z

📝 Walkthrough

Summary by CodeRabbit

Release Notes

Bug Fixes
- Fixed bias hoisting behavior in generic GEMM operations when no bias tensor is present.
Changes
- Updated default bias hoisting configuration for generic GEMM operations.

Walkthrough

GEMM parser initialization now delegates bias-hoisting to the base class. Code that synthesized fake C/bias tensors when bias hoisting was disabled was removed. parseNode gained explicit handling for the 2-input case when hoisting is disabled. GenericGEMMParser default noBiasHoisting changed to False.

Changes

Cohort / File(s)	Summary
GEMM Parser Refactor `Deeploy/Targets/Generic/Parsers.py`	- `GEMMParser.__init__` now calls `super().__init__(noBiasHoisting)` and no longer assigns `self.noBiasHoisting` directly. - Removed creation/injection of synthetic C (bias) tensors and mock bias matrices in `parseNodeCtxt` when bias hoisting was disabled. - `parseNode` now explicitly creates a bias C tensor when there are exactly 2 inputs and bias hoisting is disabled. - When 3 inputs are present, existing C handling remains; the prior fallback that synthesized a mock bias for the no-hoist path was removed. - `GenericGEMMParser.__init__` default `noBiasHoisting` changed from `True` to `False`.
Changelog `CHANGELOG.md`	- Added entry documenting a fix for bias hoisting in generic GEMM with no bias.

Sequence Diagram(s)

sequenceDiagram
  autonumber
  participant Caller
  participant GEMMParser
  participant BaseParser
  Note over GEMMParser,BaseParser: Initialization
  Caller->>GEMMParser: instantiate(noBiasHoisting)
  GEMMParser->>BaseParser: super().__init__(noBiasHoisting)
  Note right of BaseParser: Base handles bias-hoisting flag

  Note over Caller,GEMMParser: parseNode (high-level)
  Caller->>GEMMParser: parseNode(node)
  alt node.inputs == 3
    GEMMParser->>GEMMParser: use provided C (bias) tensor
  else node.inputs == 2
    alt noBiasHoisting == False
      GEMMParser->>BaseParser: rely on base hoisting behavior (no synthetic C)
    else noBiasHoisting == True
      GEMMParser->>GEMMParser: create explicit C bias tensor for 2-input node
    end
  end
  GEMMParser-->>Caller: parsed representation (with/without C)

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~20 minutes

Potential review focus:

Consistency of C tensor shape and C_batched flags across parseNode and parseNodeCtxt.
Contract between GEMMParser and base parser regarding noBiasHoisting.
Call sites or tests that expected automatic synthetic C insertion when hoisting was disabled.

Pre-merge checks and finishing touches

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 0.00% which is insufficient. The required threshold is 80.00%.	You can run `@coderabbitai generate docstrings` to improve docstring coverage.

✅ Passed checks (2 passed)

Check name	Status	Explanation
Title Check	✅ Passed	The PR title "fix bug in Generic GEMM with no bias" directly and accurately reflects the main changes in the changeset. The raw summary confirms that the changes center on fixing bias hoisting issues in generic GEMM when no bias is present, with modifications to the initialization flow and removal of synthetic bias tensor creation paths. The title is concise, specific, and clearly conveys the primary change from the developer's perspective without being vague or misleading.
Description Check	✅ Passed	The pull request description provided by the author clearly relates to the changeset and describes the specific changes made. The description accurately identifies the issue (GEMM not correctly hoisting the bias tensor in the generic platform), explains the solution (moving bias hoisting from MatMulParser.parseNodeCtxt to GEMMParser.parseNode), and mentions the adjustment to GenericGEMMParser's noBiasHoisting default from True to False. These elements directly correspond to the changes documented in the raw summary of the Parsers.py file and the CHANGELOG.md update. The description is neither vague nor off-topic—it provides meaningful context about the purpose and scope of the changes.

✨ Finishing touches

📝 Generate docstrings

🧪 Generate unit tests (beta)

Create PR with unit tests
Post copyable unit tests in a comment

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Actionable comments posted: 1

📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 15c4a23 and 0641fc9.

📒 Files selected for processing (1)

Deeploy/Targets/Generic/Parsers.py (3 hunks)

🔇 Additional comments (2)

Deeploy/Targets/Generic/Parsers.py (2)

1738-1739: LGTM: Initialization now delegates to base class.

The initialization correctly passes noBiasHoisting to MatMulParser.__init__, which stores it as self.noBiasHoisting. This is cleaner than the previous pattern.

2269-2269: I need to search the codebase more thoroughly to understand the impact of this change.Based on the evidence gathered, I can now generate the rewritten review comment. The key finding is that the test testFloatGEMMnobias mentioned in the PR description does NOT exist in the codebase, and the default parameter change is a BREAKING CHANGE affecting two platforms.

The PR claims to add testFloatGEMMnobias, but this test does not exist in the codebase. Verify test coverage before merging.

The change to noBiasHoisting default from True to False is a breaking behavioral change affecting GEMM operations on two platforms:

Deeploy/Targets/Generic/Platform.py:41 (Generic platform)

Deeploy/Targets/MemPool/Platform.py:74 (MemPool platform)

Both instantiations rely on the default parameter, so they will now enable bias hoisting by default instead of disabling it.

Required actions:

Confirm whether the test mentioned in the PR description (testFloatGEMMnobias) was intended to be added as part of this PR

Ensure existing GEMM test suite passes with the new default behavior

Document the breaking change or update code to explicitly pass noBiasHoisting=True to maintain backward compatibility

Deeploy/Targets/Generic/Parsers.py

lukamac

amazing work, beautiful!

Before it gets my stamp of approval, can you please make sure to:

Add your change to the changelog
Make sure your branch is rebased on devel, i.e., that your commits sit directly on top of the devel branch. I pulled locally your branch and you have been doing some merging. Try the command git rebase -i and just pick your commits.

lukamac

lgtm

In generic platform, GEMM was not correctly hoisting the bias tensor when required. To solve the issue, bias hoisting has been moved from `MatMulParser.parseNodeCtxt` to `GEMMParser.parseNode`. Moreover, the default value of `noBiasHoisting` flag in `GenericGEMMParser` has been changed from True to False to be compliant with the template. ## Added - testFloatGEMMnobias ## Changed - Generic\Parser.py file (`MatMulParser`, `GEMMParser`, and `GenericGEMMParser`) ## Fixed - fix bias hoisting in GEMM with no bias Co-authored-by: Alex Marchioni <alex.marchioni@chip.it>

In generic platform, GEMM was not correctly hoisting the bias tensor when required. To solve the issue, bias hoisting has been moved from `MatMulParser.parseNodeCtxt` to `GEMMParser.parseNode`. Moreover, the default value of `noBiasHoisting` flag in `GenericGEMMParser` has been changed from True to False to be compliant with the template. ## Added - testFloatGEMMnobias ## Changed - Generic\Parser.py file (`MatMulParser`, `GEMMParser`, and `GenericGEMMParser`) ## Fixed - fix bias hoisting in GEMM with no bias Co-authored-by: Alex Marchioni <alex.marchioni@outlook.it> Co-authored-by: Alex Marchioni <alex.marchioni@chip.it>

…ws (#1) * CMAKE FIX: include pulp-open config file * Adding pulp open target for gvsoc emulation + Testrunner_tiled_PULPOpen.py * First commit for working iDMA after refactoring * Further changes for iDMA integration * Fix for event unit error when using iDMA + updated testrunners for pulp_open * First changes for pulp-open rtl simulation inside Deeploy * Updated CMake macro for pulp-open simulation on modelsim * fix bug in Generic GEMM with no bias (pulp-platform#126) In generic platform, GEMM was not correctly hoisting the bias tensor when required. To solve the issue, bias hoisting has been moved from `MatMulParser.parseNodeCtxt` to `GEMMParser.parseNode`. Moreover, the default value of `noBiasHoisting` flag in `GenericGEMMParser` has been changed from True to False to be compliant with the template. ## Added - testFloatGEMMnobias ## Changed - Generic\Parser.py file (`MatMulParser`, `GEMMParser`, and `GenericGEMMParser`) ## Fixed - fix bias hoisting in GEMM with no bias Co-authored-by: Alex Marchioni <alex.marchioni@chip.it> * Added new platform: PULP OPEN + MCHAN * Updated changelog --------- Signed-off-by: RiccardoGandolfi <riccardogandi95@gmail.com> Co-authored-by: RiccardoGandolfi <riccardo.gandolfi@chips.it> Co-authored-by: RiccardoGandolfi <riccardo.gandolfiu@chips.it> Co-authored-by: Alex Marchioni <alex.marchioni@outlook.it> Co-authored-by: Alex Marchioni <alex.marchioni@chip.it>

…ws (#1) * CMAKE FIX: include pulp-open config file * Adding pulp open target for gvsoc emulation + Testrunner_tiled_PULPOpen.py * First commit for working iDMA after refactoring * Further changes for iDMA integration * Fix for event unit error when using iDMA + updated testrunners for pulp_open * First changes for pulp-open rtl simulation inside Deeploy * Updated CMake macro for pulp-open simulation on modelsim * fix bug in Generic GEMM with no bias (pulp-platform#126) In generic platform, GEMM was not correctly hoisting the bias tensor when required. To solve the issue, bias hoisting has been moved from `MatMulParser.parseNodeCtxt` to `GEMMParser.parseNode`. Moreover, the default value of `noBiasHoisting` flag in `GenericGEMMParser` has been changed from True to False to be compliant with the template. - testFloatGEMMnobias - Generic\Parser.py file (`MatMulParser`, `GEMMParser`, and `GenericGEMMParser`) - fix bias hoisting in GEMM with no bias Co-authored-by: Alex Marchioni <alex.marchioni@chip.it> * Added new platform: PULP OPEN + MCHAN * Updated changelog ---------

…ws (#1) * CMAKE FIX: include pulp-open config file * Adding pulp open target for gvsoc emulation + Testrunner_tiled_PULPOpen.py * First commit for working iDMA after refactoring * Further changes for iDMA integration * Fix for event unit error when using iDMA + updated testrunners for pulp_open * First changes for pulp-open rtl simulation inside Deeploy * Updated CMake macro for pulp-open simulation on modelsim * fix bug in Generic GEMM with no bias (pulp-platform#126) In generic platform, GEMM was not correctly hoisting the bias tensor when required. To solve the issue, bias hoisting has been moved from `MatMulParser.parseNodeCtxt` to `GEMMParser.parseNode`. Moreover, the default value of `noBiasHoisting` flag in `GenericGEMMParser` has been changed from True to False to be compliant with the template. - testFloatGEMMnobias - Generic\Parser.py file (`MatMulParser`, `GEMMParser`, and `GenericGEMMParser`) - fix bias hoisting in GEMM with no bias * Added new platform: PULP OPEN + MCHAN * Updated changelog ---------

…ws (#1) (#3) * CMAKE FIX: include pulp-open config file * Adding pulp open target for gvsoc emulation + Testrunner_tiled_PULPOpen.py * First commit for working iDMA after refactoring * Further changes for iDMA integration * Fix for event unit error when using iDMA + updated testrunners for pulp_open * First changes for pulp-open rtl simulation inside Deeploy * Updated CMake macro for pulp-open simulation on modelsim * fix bug in Generic GEMM with no bias (pulp-platform#126) In generic platform, GEMM was not correctly hoisting the bias tensor when required. To solve the issue, bias hoisting has been moved from `MatMulParser.parseNodeCtxt` to `GEMMParser.parseNode`. Moreover, the default value of `noBiasHoisting` flag in `GenericGEMMParser` has been changed from True to False to be compliant with the template. - testFloatGEMMnobias - Generic\Parser.py file (`MatMulParser`, `GEMMParser`, and `GenericGEMMParser`) - fix bias hoisting in GEMM with no bias * Added new platform: PULP OPEN + MCHAN * Updated changelog ---------

…ws (#1) * CMAKE FIX: include pulp-open config file * Adding pulp open target for gvsoc emulation + Testrunner_tiled_PULPOpen.py * First commit for working iDMA after refactoring * Further changes for iDMA integration * Fix for event unit error when using iDMA + updated testrunners for pulp_open * First changes for pulp-open rtl simulation inside Deeploy * Updated CMake macro for pulp-open simulation on modelsim * fix bug in Generic GEMM with no bias (pulp-platform#126) In generic platform, GEMM was not correctly hoisting the bias tensor when required. To solve the issue, bias hoisting has been moved from `MatMulParser.parseNodeCtxt` to `GEMMParser.parseNode`. Moreover, the default value of `noBiasHoisting` flag in `GenericGEMMParser` has been changed from True to False to be compliant with the template. - testFloatGEMMnobias - Generic\Parser.py file (`MatMulParser`, `GEMMParser`, and `GenericGEMMParser`) - fix bias hoisting in GEMM with no bias Co-authored-by: Alex Marchioni <alex.marchioni@chip.it> * Added new platform: PULP OPEN + MCHAN * Updated changelog --------- Signed-off-by: RiccardoGandolfi <riccardogandi95@gmail.com> Co-authored-by: RiccardoGandolfi <riccardo.gandolfi@chips.it> Co-authored-by: RiccardoGandolfi <riccardo.gandolfiu@chips.it> Co-authored-by: Alex Marchioni <alex.marchioni@outlook.it> Co-authored-by: Alex Marchioni <alex.marchioni@chip.it>

* fix bug in Generic GEMM with no bias (pulp-platform#126) In generic platform, GEMM was not correctly hoisting the bias tensor when required. To solve the issue, bias hoisting has been moved from `MatMulParser.parseNodeCtxt` to `GEMMParser.parseNode`. Moreover, the default value of `noBiasHoisting` flag in `GenericGEMMParser` has been changed from True to False to be compliant with the template. ## Added - testFloatGEMMnobias ## Changed - Generic\Parser.py file (`MatMulParser`, `GEMMParser`, and `GenericGEMMParser`) ## Fixed - fix bias hoisting in GEMM with no bias Co-authored-by: Alex Marchioni <alex.marchioni@chip.it> * Support Fully Asynchronous DMAs (pulp-platform#114) This pull request introduces improvements to the DMA code generation for several backends (`SnitchDma` and `Mchan`), to enable proper double-buffering by overlapping DMA transfers with kernel calls. Additionally, it refactors the profiling infrastructure for Snitch tiling and improves the readability of the generated code by adding some helpful comments. ### Added - Profiling-aware tiling mixins: `ProfilingDoubleBufferingTilingMixIn` and `ProfilingSingleBufferingTilingMixIn` integrated into the Snitch and PULP tiling generators. - Optional comments injected into generated code (DMA templates `_initTemplate`, `_allocTemplate`, `_waitTemplate`) for improved readability and traceability. - Profiling instrumentation for tile-level DMA and kernel execution integrated into the tiling passes for Snitch backends. ### Changed - Refactored DMA code-generation in the backends (`SnitchDma`, `Mchan`) to enable full overlap of DMA and compute for double-buffering, replacing the earlier (incorrect) synchronization scheme. - Simplified tiling generator logic by leveraging the profiling mix-ins and consolidating redundant template assignments, improving maintainability and code generation clarity. - Improved the waiting-strategy architecture: introduced `PerTensorWaitingStrategy` alongside existing `TensorGroupWaitingStrategy`, enabling finer-grained control of DMA futures in DB mode. ### Fixed - Corrected DMA synchronization bug that previously prevented effective overlapping of transfer and compute in DB mode, especially noticeable for memory-bound kernels. * iDMA Integration into Deeploy + Fixes for RTL and GVSOC pulp-open flows (#1) * CMAKE FIX: include pulp-open config file * Adding pulp open target for gvsoc emulation + Testrunner_tiled_PULPOpen.py * First commit for working iDMA after refactoring * Further changes for iDMA integration * Fix for event unit error when using iDMA + updated testrunners for pulp_open * First changes for pulp-open rtl simulation inside Deeploy * Updated CMake macro for pulp-open simulation on modelsim * fix bug in Generic GEMM with no bias (pulp-platform#126) In generic platform, GEMM was not correctly hoisting the bias tensor when required. To solve the issue, bias hoisting has been moved from `MatMulParser.parseNodeCtxt` to `GEMMParser.parseNode`. Moreover, the default value of `noBiasHoisting` flag in `GenericGEMMParser` has been changed from True to False to be compliant with the template. - testFloatGEMMnobias - Generic\Parser.py file (`MatMulParser`, `GEMMParser`, and `GenericGEMMParser`) - fix bias hoisting in GEMM with no bias Co-authored-by: Alex Marchioni <alex.marchioni@chip.it> * Added new platform: PULP OPEN + MCHAN * Updated changelog --------- Signed-off-by: RiccardoGandolfi <riccardogandi95@gmail.com> Co-authored-by: RiccardoGandolfi <riccardo.gandolfi@chips.it> Co-authored-by: RiccardoGandolfi <riccardo.gandolfiu@chips.it> Co-authored-by: Alex Marchioni <alex.marchioni@outlook.it> Co-authored-by: Alex Marchioni <alex.marchioni@chip.it> * Re-alignment fixes after commit 06a2b46 --------- Signed-off-by: RiccardoGandolfi <riccardogandi95@gmail.com> Co-authored-by: Alex Marchioni <alex.marchioni@outlook.it> Co-authored-by: Alex Marchioni <alex.marchioni@chip.it> Co-authored-by: Philip Wiese <wiesep@iis.ee.ethz.ch> Co-authored-by: RiccardoGandolfi <riccardo.gandolfi@chips.it> Co-authored-by: RiccardoGandolfi <riccardo.gandolfiu@chips.it>

This release includes improvements to the tiling and DMA code generation, new networks and operators, improved CI workflows, migration to PyTest, and support for PyPi package releases. Note: Since the release tag references the Docker container tagged with the release tag (ghcr.io/pulp-platform/deeploy:v0.2.1), the CI will initially fail. The Deeploy Docker image must be built after the release PR is merged and the CI restarted. ### List of Pull Requests - PyPi Package Deployment + Remove Banshee Dept [#154](#154) - PyTest Migration [#144](#144) - Update submodule `pulp-nn-mixed` [#145](#145) - Improve Profiling [#138](#138) - FP32 ReduceMean operator improvement [#137](#137) - Support for RMSNorm (Pow and Sqrt operators) [#136](#136) - Demo TinyViT compatibility with tiled Siracusa [#124](#124) - TinyViT on non-tiled Siracusa [#117](#117) - Support Fully Asynchronous DMAs [#114](#114) - Disallow shape inference [#128](#128) - Remove memory-aware node bindings [#123](#123) - Fix missing const's layout transformation and refactor NCHWtoNHWC passes [#122](#122) - Fix aliasing [#125](#125) - Support for 1D Autoencoder [#98](#98) - Refactor Logging for Improved Debugging [#115](#115) - Add reuse-tool as an SPDX license header linter [#113](#113) - Bug fixes, API Cleanup and Reduce Compiler Warning on PULP [#112](#112) - Fix PULP GEMM `batch` serialization [#109](#109) - Split CI Workflows by Platform and Task, Improve Formatting and Linting Reliability [#108](#108) - Refactor tiling code generation [#105](#105) - Change order of typeMatching entries [#68](#68) - Node Mangling to avoid duplication [#93](#93) - Prepare Post v0.2.0 Release [#104](#104) - Use Docker digests instead of arch-specific tags [#106](#106) - Fix `Unsqueeze` Op. when using ONNX opset 13 or higher (from attribute to input) [#119](#119) - Fix bias hoisting in generic GEMM with no bias [#126](#126)

marchioa requested review from Victor-Jung, Xeratec and lukamac as code owners October 21, 2025 15:21

coderabbitai bot reviewed Oct 21, 2025

View reviewed changes

Deeploy/Targets/Generic/Parsers.py Show resolved Hide resolved

Xeratec added the Bug Something isn't working label Oct 23, 2025

Xeratec added this to Deeploy Oct 23, 2025

Xeratec moved this to Need Reviewer in Deeploy Oct 23, 2025

Xeratec added this to the Release 0.2.1 milestone Oct 23, 2025

Xeratec assigned marchioa Oct 23, 2025

Xeratec moved this from Need Reviewer to In review in Deeploy Oct 28, 2025

lukamac suggested changes Oct 30, 2025

View reviewed changes

fix bug in Generic GEMM with no bias, add testFloatGEMMnobias

df3b648

lukamac self-requested a review October 30, 2025 16:31

marchioa force-pushed the fix-gemm-nobias branch from 7185e32 to df3b648 Compare October 31, 2025 09:08

lukamac approved these changes Oct 31, 2025

View reviewed changes

lukamac merged commit 23e9f02 into pulp-platform:devel Oct 31, 2025
134 checks passed

github-project-automation bot moved this from In review to Done in Deeploy Oct 31, 2025

marchioa deleted the fix-gemm-nobias branch November 6, 2025 08:03

This was referenced Feb 5, 2026

Prepare for Release v0.2.1 #158

Merged

Release v0.2.1 #161

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix bug in Generic GEMM with no bias #126

fix bug in Generic GEMM with no bias #126

Uh oh!

marchioa commented Oct 21, 2025 •

edited by Xeratec

Loading

Uh oh!

coderabbitai bot commented Oct 21, 2025 •

edited

Loading

Summary by CodeRabbit

Release Notes

Walkthrough

Changes

Sequence Diagram(s)

Estimated code review effort

Uh oh!

coderabbitai bot left a comment

Uh oh!

Uh oh!

lukamac left a comment

Uh oh!

lukamac left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

fix bug in Generic GEMM with no bias #126

fix bug in Generic GEMM with no bias #126

Uh oh!

Conversation

marchioa commented Oct 21, 2025 • edited by Xeratec Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Added

Changed

Fixed

PR Merge Checklist

Uh oh!

coderabbitai bot commented Oct 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary by CodeRabbit

Release Notes

Walkthrough

Changes

Sequence Diagram(s)

Estimated code review effort

Pre-merge checks and finishing touches

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

lukamac left a comment

Choose a reason for hiding this comment

Uh oh!

lukamac left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

marchioa commented Oct 21, 2025 •

edited by Xeratec

Loading

coderabbitai bot commented Oct 21, 2025 •

edited

Loading