
the code question in semantic_seg #20

Open
Ianresearch opened this issue Dec 9, 2021 · 8 comments

@Ianresearch

Hi, I have a question about the logit_scale and logit_bias in semantic_seg. The shape of these parameters is (1, num_classes, 1, 1); why isn't it (1, num_classes, 512, 512), which would match the input image size for semantic segmentation?

@tonysy
Member

tonysy commented Dec 9, 2021

Hi, logit_scale and logit_bias are the class-wise magnitude and margin (a scalar for each class), respectively, as described in the original paper.

@Ianresearch
Author

Thank you for the reply. If I use the method in semantic segmentation and the input image size is (512, 512), should the parameter shape be (1, num_classes, 512, 512)? I think the shape (1, num_classes, 1, 1) in your code is meant for image classification. Is that correct?

@tonysy
Member

tonysy commented Dec 9, 2021

The code should work for this scenario, as broadcasting is applied automatically. You can try this example:

>>> import torch
>>> data = torch.randn(1,10, 512,512)
>>> a = torch.ones(1,10,1,1)
>>> b = torch.zeros(1,10,1,1)
>>> out = a * data + b
>>> out.shape
torch.Size([1, 10, 512, 512])

@Ianresearch
Author

Ianresearch commented Dec 13, 2021

Hi tonysy, I used this paper's method in my U-Net semantic segmentation, but the result did not improve. Is there something wrong in my implementation? The U-Net output has 4 classes, the image size is 512*512, and the last layer is: result = conv_bn_relu(inputchannel, outputchannel=4). In the first stage, I train this U-Net with lr=0.02. In the second stage, I freeze all parameters in the net except the last layer, and add this code:

# add the long-tail DisAlign adjustment
confidence = self.confidence_layer(result)  # self.confidence_layer = conv_bn_relu(inchannel=4, outchannel=1)
confidence = torch.sigmoid(confidence)
# only adjust the foreground classification scores
scores_tmp = confidence * (result * self.logit_scale + self.logit_bias)
result = scores_tmp + (1 - confidence) * result

At the same time, in the second stage I re-weight the cross-entropy loss using the method in Sec. 3.2.2 (ρ=0.3), and train the above net again with lr=0.02. Looking forward to your answer.
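For reference, a self-contained sketch of the adjustment head described above; the module name is hypothetical and the confidence layer is simplified to a plain 1x1 convolution rather than conv_bn_relu:

import torch
import torch.nn as nn

class DisAlignHead(nn.Module):
    # Sketch of the second-stage adjustment described in this comment.
    # logit_scale and logit_bias are per-class scalars, shape (1, C, 1, 1),
    # and broadcast over the (H, W) spatial dimensions.
    def __init__(self, num_classes=4):
        super().__init__()
        self.logit_scale = nn.Parameter(torch.ones(1, num_classes, 1, 1))
        self.logit_bias = nn.Parameter(torch.zeros(1, num_classes, 1, 1))
        self.confidence_layer = nn.Conv2d(num_classes, 1, kernel_size=1)

    def forward(self, result):
        # result: (N, num_classes, H, W) logits from the frozen U-Net
        confidence = torch.sigmoid(self.confidence_layer(result))
        # only adjust the foreground classification scores
        scores_tmp = confidence * (result * self.logit_scale + self.logit_bias)
        return scores_tmp + (1 - confidence) * result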

@tonysy
Member

tonysy commented Dec 13, 2021

First, in DisAlign, all layers learned in stage one are frozen during the second stage.
Second, the proposed method mainly targets long-tail distributions, which typically involve many tail classes.

Thus, you can freeze all the stage-one layers, remove the confidence layer, and only use the GRW for your case (confidence estimation gave only a minor improvement on the segmentation task in our recent experiments).

Such as:

result = result * self.logit_scale + self.logit_bias

Then learn only the logit_scale and logit_bias with the GRWCrossEntropyLoss:

class GRWCrossEntropyLoss(nn.Module):
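A minimal sketch of what such a generalized re-weighting (GRW) loss could look like, assuming the per-class weight is (1 / freq_j) ** rho, normalized to sum to the number of classes, following Sec. 3.2.2 of the paper; the actual implementation in this repo may differ:

import torch
import torch.nn as nn
import torch.nn.functional as F

class GRWCrossEntropyLoss(nn.Module):
    # Sketch only: class_freq is the list of per-class pixel frequencies,
    # rho is the re-weighting exponent from Sec. 3.2.2 (e.g. 0.3).
    def __init__(self, class_freq, rho=0.3, ignore_index=255):
        super().__init__()
        freq = torch.as_tensor(class_freq, dtype=torch.float)
        weight = (1.0 / freq) ** rho
        weight = weight * len(freq) / weight.sum()  # normalize to sum to C
        self.register_buffer("weight", weight)
        self.ignore_index = ignore_index

    def forward(self, logits, target):
        # logits: (N, C, H, W); target: (N, H, W) for segmentation
        return F.cross_entropy(logits, target, weight=self.weight,
                               ignore_index=self.ignore_index)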

@Ianresearch
Author

I will try again. The dataset has a long-tail distribution; the largest class ratio is 74% and the smallest is only 0.2% (74%, 11%, 14.8%, 0.2%). Thank you very much.
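For that distribution, the GRW weights with ρ=0.3 work out as follows (again assuming the weighting form (1 / freq_j) ** rho, normalized to sum to the number of classes, from Sec. 3.2.2):

import torch

freq = torch.tensor([0.74, 0.11, 0.148, 0.002])
rho = 0.3
weight = (1.0 / freq) ** rho
weight = weight * len(freq) / weight.sum()
print(weight)  # approximately tensor([0.39, 0.69, 0.63, 2.29])

The 0.2% class gets roughly six times the weight of the 74% class, so the re-weighting is fairly mild at ρ=0.3 for this distribution.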

@tonysy
Copy link
Member

tonysy commented Dec 13, 2021

Hi, the case described is imbalanced classification, not long-tail. Long-tail means there exist many tail classes (typically hundreds or thousands of classes in total).

@Fly-dream12

In this project, where is the code for imbalanced image classification, and which script should be used? @tonysy
