Fix UNet implementation with arbitrary channel sizes (#243) #276
base: master
Conversation
Fix UNet implementation to support input with channel sizes other than 3
Hi Vinayakjeet, thanks for the PR! Unfortunately, I don't think this does what we want yet. The problem is that:

julia> using Metalhead
julia> model = UNet((128,128),1,3,Metalhead.backbone(DenseNet(121)))
ERROR: DimensionMismatch: layer Conv((7, 7), 3 => 64, pad=3, stride=2, bias=false) expects size(input, 3) == 3, but got 128×128×1×1 Array{Flux.NilNumber.Nil, 4}
Stacktrace:
[1] _size_check(layer::Flux.Conv{2, 2, typeof(identity), Array{…}, Bool}, x::Array{Flux.NilNumber.Nil, 4}, ::Pair{Int64, Int64})
@ Flux ~/.julia/packages/Flux/jgpVj/src/layers/basic.jl:195
[2] (::Flux.Conv{2, 2, typeof(identity), Array{Float32, 4}, Bool})(x::Array{Flux.NilNumber.Nil, 4})
@ Flux ~/.julia/packages/Flux/jgpVj/src/layers/conv.jl:198
[3] #outputsize#340
@ ~/.julia/packages/Flux/jgpVj/src/outputsize.jl:93 [inlined]
[4] outputsize(m::Flux.Conv{2, 2, typeof(identity), Array{Float32, 4}, Bool}, inputsizes::NTuple{4, Int64})
@ Flux ~/.julia/packages/Flux/jgpVj/src/outputsize.jl:91
[5] unetlayers(layers::Vector{…}, sz::NTuple{…}; outplanes::Nothing, skip_upscale::Int64, m_middle::typeof(Metalhead.unet_middle_block))
@ Metalhead ~/Code/Metalhead.jl/src/convnets/unet.jl:34
[6] unet(encoder_backbone::Flux.Chain{…}, imgdims::Tuple{…}, inchannels::Int64, outplanes::Int64, final::typeof(Metalhead.unet_final_block), fdownscale::Int64)
@ Metalhead ~/Code/Metalhead.jl/src/convnets/unet.jl:81
[7] unet
@ ~/Code/Metalhead.jl/src/convnets/unet.jl:76 [inlined]
[8] #UNet#175
@ ~/Code/Metalhead.jl/src/convnets/unet.jl:120 [inlined]
[9] UNet(imsize::Tuple{Int64, Int64}, inchannels::Int64, outplanes::Int64, encoder_backbone::Flux.Chain{Tuple{…}})
@ Metalhead ~/Code/Metalhead.jl/src/convnets/unet.jl:118
[10] top-level scope
@ REPL[3]:1
Some type information was truncated. Use `show(err)` to see complete types.

I would suggest that you try and rewrite the function in such a way that …
src/convnets/unet.jl
Outdated
  encoder_backbone = Metalhead.backbone(DenseNet(121)); pretrain::Bool = false)
- layers = unet(encoder_backbone, (imsize..., inchannels, 1), outplanes)
+ layers = unet(encoder_backbone, imsize, inchannels, outplanes)
`inchannels` should somehow be passed in to the encoder backbone here. Of course, we will have to decide how to deal with the case where the user passes in a backbone that is already initialised for a given channel count and also passes `inchannels` separately.
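For illustration, one possible policy for that conflict is to detect the mismatch and error out rather than silently rebuilding the layer. The sketch below assumes the first flattened layer is a Flux Conv (as the stack trace above shows) with `groups == 1`, and uses Metalhead's unexported flatten_chains; the helper name check_backbone_channels is hypothetical.

using Flux, Metalhead

# Hypothetical helper: refuse a backbone whose first Conv was built for a
# different input channel count than the user-supplied `inchannels`.
function check_backbone_channels(encoder_backbone, inchannels)
    firstconv = first(Metalhead.flatten_chains(encoder_backbone))
    expected = size(firstconv.weight, 3)  # input channels of the first Conv
    expected == inchannels || throw(ArgumentError(
        "backbone expects $expected input channels, but inchannels = $inchannels"))
    return encoder_backbone
end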
Modified the first convolutional layer of the encoder backbone to match the input's channel size; the dimension mismatch error is thus prevented.
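A minimal sketch of what that modification could look like (not the PR's exact code): it assumes the first flattened layer is a Flux Conv with weight, stride, pad, and bias fields, as in the stack trace above, and the helper name adapt_first_conv is hypothetical. Note the rebuilt layer gets freshly initialised weights.

using Flux, Metalhead

# Hypothetical helper: rebuild the backbone's first Conv so it accepts
# `inchannels` input channels, keeping kernel size, stride, and padding.
function adapt_first_conv(encoder_backbone, inchannels)
    layers = collect(Metalhead.flatten_chains(encoder_backbone))
    old = layers[1]                        # e.g. Conv((7, 7), 3 => 64, ...)
    k = size(old.weight)[1:2]              # kernel size
    outch = size(old.weight, 4)            # output channels
    newconv = Conv(k, inchannels => outch;
                   stride = old.stride, pad = old.pad,
                   bias = old.bias isa AbstractArray)
    # Flattening discards the nested Chain structure but keeps the forward pass.
    return Chain(newconv, layers[2:end]...)
end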
skip_upscale = fdownscale)
function unet(encoder_backbone, imgdims, inchannels::Integer, outplanes::Integer,
              final::Any = unet_final_block, fdownscale::Integer = 0)
backbonelayers = collect(flatten_chains(encoder_backbone))
Please pay attention to the formatting; you lost the indentation here.
Indentation issue resolved
2nd try
3rd try
4th try
5th try
6th try
As a beginner contributor to the codebase, could you review the logic I have implemented? Additionally, I have encountered a MethodError indicating a mismatch in method signatures for the unet function. It appears there might be an issue with how the encoder_backbone is instantiated or used within the unet function. Could you please review the instantiation and usage of the encoder_backbone?
#243
Bug Description:
The current UNet implementation in the Metalhead package only works with input tensors of channel size 3. This restriction causes compatibility issues when users try to use UNet with inputs of other channel sizes.
Patch Description:
To address this limitation, I've modified the UNet implementation to support input tensors with arbitrary channel sizes. The UNet model can now handle inputs with any number of channels.
Test Case:
using Metalhead
UNet((128, 128), 1, 3, Metalhead.backbone(DenseNet(121)))
This builds the UNet model without errors.
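As a lighter-weight check, one could also run Flux's shape-inference pass instead of a real forward pass. This is a sketch; the expected output shape assumes the UNet preserves the spatial dimensions.

using Flux, Metalhead

model = UNet((128, 128), 1, 3, Metalhead.backbone(DenseNet(121)))
# outputsize propagates Nil-valued arrays through the model, so it surfaces
# channel mismatches like the one reported above without allocating real data.
Flux.outputsize(model, (128, 128, 1, 1))  # expected: (128, 128, 3, 1)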