According to the docs, data should be stored in WHCN order (width, height, # channels, batch size). In other words, a 100×100 RGB image would be a 100×100×3×1 array, and a batch of 50 would be a 100×100×3×50 array.
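A minimal sketch of what those shapes look like in practice (the array names here are illustrative, not from any Flux example):

```julia
# WHCN: width × height × channels × batch size
w, h, c, n = 100, 100, 3, 50
img   = rand(Float32, w, h, c, 1)  # a single 100×100 RGB image
batch = rand(Float32, w, h, c, n)  # a batch of 50 such images
size(img)    # (100, 100, 3, 1)
size(batch)  # (100, 100, 3, 50)
```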
But many examples use HWCN, for example:

```julia
# Function to convert the RGB image to Float64 Arrays
function getarray(X)
    Float32.(permutedims(channelview(X), (2, 3, 1)))
end
```
The correct transform should be `Float32.(permutedims(channelview(X), (3, 2, 1)))`, because `channelview(X)` returns a "CHW" array. Likewise, the MNIST example doesn't use any `permutedims`, so it just keeps the wrong "HW" order. Fortunately, some are correct, for example, this one from MLDatasets. The three cases are shown in this gist.
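The two permutations can be compared without pulling in an image library, using a plain array as a stand-in for what `channelview` returns (a C×H×W array); the 100×150 dimensions here are arbitrary:

```julia
# channelview(X) on an H×W image of RGB pixels yields a C×H×W array.
chw = rand(Float32, 3, 100, 150)           # stand-in: 3 channels, 100 high, 150 wide
whc_correct = permutedims(chw, (3, 2, 1))  # W×H×C — what Flux expects
hwc_wrong   = permutedims(chw, (2, 3, 1))  # H×W×C — the "transposed" variant
size(whc_correct)  # (150, 100, 3)
size(hwc_wrong)    # (100, 150, 3)
```

A non-square example makes the mistake visible; on square images both permutations yield the same shape, which is part of why the bug is easy to miss.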
So basically, many examples in fact run models on "transposed" image datasets. The good (or bad?) part is that CNNs are robust enough to cope with this distortion, so we can't detect it from statistics such as accuracy, or even by eye. But those examples suggest a misleading preprocessing pipeline and should be fixed. (To be honest, I'm posting this issue because I was misled by it myself...)
I think FluxML should also document why we use the WHCN order. The only explanation I have is that CUDA.jl uses cuDNN, and cuDNN supports the NCHW order (row-major). If we look at the NCHW memory representation in cuDNN, we can see that it is exactly the same as the memory representation of WHCN in Julia (column-major).
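A small sketch of that layout argument, using only Base Julia (no cuDNN needed): Julia arrays are column-major, so the first dimension varies fastest in memory. A WHCN array therefore walks memory in the order W, then H, then C, then N — the same order a row-major NCHW array does, where W is the last (fastest) axis.

```julia
# Column-major: strides grow from left to right.
a = reshape(collect(1:24), 4, 3, 2, 1)  # W=4, H=3, C=2, N=1 (WHCN)
strides(a)  # (1, 4, 12, 24): W varies fastest, N slowest,
            # matching row-major NCHW, where W is the innermost axis.
```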