
Conversation

@JuanPedroGHM (Member)

Copy of #1007 due to branching conflicts

Due Diligence

  • General:
  • Implementation:
    • unit tests: all split configurations tested
    • unit tests: multiple dtypes tested
    • NEW unit tests: MPS tested (1 MPI process, 1 GPU)
    • benchmarks: created for new functionality
    • benchmarks: performance improved or maintained
    • documentation updated where needed

Description

Issue/s resolved: #

Changes proposed:

Type of change

Memory requirements

Performance

Does this change modify the behaviour of other functions? If so, which?

yes / no

- instead use str(ht_array.device) (see the device sketch after this list)
- use signal.device to get the correct rank
- put more functionality into input_check
- add batch-processing check
- add stride to 2D convolution
- fix convgenpad regarding the circular boundary condition
- prepare convolution2D for batch processing (not yet implemented)
- improve indexing in convolution2D for arbitrary dimensions
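
A minimal sketch of the device-handling point from the first two bullets, assuming only Heat's public DNDarray API (the shape and split value are illustrative):

```python
import heat as ht

x = ht.zeros((4, 4), split=0)
# str() of a Heat device yields the device string (e.g. 'cpu:0');
# the commit switches to this form instead of the device object
# itself, which would not compare equal to a string
print(str(x.device))
```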

test_signal.py
- Start adding a second dimension to test_only_balanced_kernels
- Split up 1D tests into subtests

- test conv_input_check
- test conv_batchprocessing_check
- test batch convolutions 1D with and without stride
- test 1D batch processing with and without stride
- move the batch-processing test out of test_convolve into its own function
- 2D batch-processing tests, test NotImplementedError
- test odd and even kernels separately, with and without stride (see the subtest sketch after this list)
- test large signal and kernel for different modes, with and without stride
- test kernel size 1 with and without stride
- remove test_convolve
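
A hypothetical sketch of the subtest layout described above; ht.convolve2d and its stride parameter are the interface this PR proposes, so both are assumptions here:

```python
import unittest

import heat as ht


class TestConvolve2D(unittest.TestCase):
    def test_odd_even_kernels_with_stride(self):
        # odd and even kernel sizes, each with and without stride,
        # as separate subtests instead of one monolithic test
        for kernel_size in (3, 4):
            for stride in (1, 2):
                with self.subTest(kernel_size=kernel_size, stride=stride):
                    signal = ht.random.randn(16, 16, split=0)
                    kernel = ht.ones((kernel_size, kernel_size))
                    # convolve2d and stride are assumed from this PR
                    result = ht.convolve2d(signal, kernel, stride=stride)
                    self.assertEqual(result.ndim, 2)
```
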
@JuanPedroGHM added this to the 2.0 milestone on Aug 12, 2025
@JuanPedroGHM self-assigned this on Aug 12, 2025
@github-project-automation bot moved this to Todo in Roadmap on Aug 12, 2025
@lolacaro force-pushed the feature920/distributed-2D-convolution branch from d8359df to dd0f899 on Aug 12, 2025 08:10
@JuanPedroGHM linked an issue (Implement convolve2d()) on Aug 12, 2025 that may be closed by this pull request
- Tests for different modes
- Tests for different kernels
- Tests for different strides
- Different formats
- Edge cases
- Feature still missing: Batch convolutions for 2D
Open question: How to handle distributed arrays in conv_pad if a rank is empty? Or rather, do not pad if it is empty. In that case, however, the last non-empty rank is the reference for padding.
Side note: maybe raise a RuntimeError (or similar) if boundary == "replicate", the split dimension is 1, and any rank is empty, since torch will throw an error (reproduced in the sketch below).
- Still broken: local-index computation for even kernels
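
The torch behaviour behind the side note can be reproduced directly; a minimal standalone sketch (the shape is illustrative of an empty local chunk):

```python
import torch
import torch.nn.functional as F

# an empty chunk along the padded dimension, as an empty rank would
# hold after splitting
chunk = torch.empty(1, 1, 0)
try:
    F.pad(chunk, (1, 1), mode="replicate")
except RuntimeError as e:
    # torch rejects replicate padding when the padded dimension is empty
    print(e)
```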

Fix:
- correct array shapes for the DNDarray result
- 2D convolutions with stride 1 for distributed kernels

convolution2d_large_signal_and_kernel still throws an error for distributed kernels.
Issue: the flip of the kernel did not occur along the split dimension.
Solution: use positive indexing.

- Added scripts/flip_text.py to demonstrate the problem
- Adjusted signal.py to positive indexing (illustrated in the sketch below)
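
A small illustration of the positive-indexing fix, independent of Heat; the variable names are hypothetical, but the mechanism matches the described bug: with negative axis labels, a comparison against the (positive) split dimension misses, so the split axis is skipped when flipping.

```python
import torch

kernel = torch.arange(12).reshape(3, 4)
split = 0              # the distributed dimension
flip_dims = [-2, -1]   # negative labels for the same two axes

# a membership test with negative labels misses the split dimension
assert split not in flip_dims

# normalising to positive indices makes the comparison (and hence the
# flip along the split dimension) behave as intended
positive = [d % kernel.ndim for d in flip_dims]
assert split in positive
flipped = torch.flip(kernel, dims=positive)
```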

Open problem: convolve2d with a distributed kernel still fails, likely due to a broadcasting issue.
- kernel chunks were not distributed by v.comm.bcast
- switch to v.comm.Bcast (the bcast/Bcast distinction is sketched below)
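
Heat wraps MPI in its own communicator class, but the distinction the commit relies on is the standard mpi4py one; a standalone sketch with raw mpi4py (shapes and dtype are illustrative):

```python
from mpi4py import MPI

import torch

comm = MPI.COMM_WORLD
buf = torch.empty(4)
if comm.rank == 0:
    buf = torch.arange(4, dtype=torch.float32)

# lowercase bcast pickles the object and RETURNS the result -- if the
# return value is dropped, non-root ranks keep their old data:
#     buf = comm.bcast(buf, root=0)

# uppercase Bcast writes into the existing buffer in place, so every
# rank ends up with the root's data without rebinding the name
comm.Bcast(buf.numpy(), root=0)
```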

Additional fix: line 915 - check a.is_distributed() before calling stride[a.split] (see the guard sketch below)
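
A hedged sketch of that guard; the helper name and the stride fallback are hypothetical:

```python
def split_axis_stride(a, stride):
    """Hypothetical helper mirroring the described fix: a.split is
    None on a non-distributed DNDarray, so stride[a.split] would
    raise if used unguarded."""
    if a.is_distributed():
        return stride[a.split]
    return 1
```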

All tests pass but one:
- test_convolve2d_local_chunks_error
Problem: the chunk error was tested after padding, but now also occurs after the halo computation.

Solution:
- adjust the test to comm sizes > 3, because otherwise no problem arises
- to my knowledge, the chunk test is also valid after the halo computation, so no additional fix is needed

Remaining failure:
- communication in _allgather fails
- only appeared for mpirun -n > 3 and convolve2d with stride and v.is_distributed()
- the problem likely relates to issues within convolve2d where ranks lose the information despite axis=None (a reproduction sketch follows)
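
A hypothetical reproduction of that failing configuration; ht.convolve2d and its stride parameter are the interface this PR proposes, and the shapes are illustrative:

```python
# run with: mpirun -n 4 python repro.py
import heat as ht

a = ht.random.randn(32, 32, split=0)   # distributed signal
v = ht.ones((4, 4), split=0)           # distributed kernel
out = ht.convolve2d(a, v, stride=2)    # reportedly fails in _allgather
print(out.shape)
```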
