feat: replace bce by focal loss in linknet loss #824
Conversation
Codecov Report
```diff
@@            Coverage Diff             @@
##             main     #824      +/-   ##
==========================================
- Coverage   96.01%   95.98%   -0.03%
==========================================
  Files         131      131
  Lines        5019     5033      +14
==========================================
+ Hits         4819     4831      +12
- Misses        200      202       +2
```
Thanks, I added a comment on the implementation! Have you tried to check if it improves training perf?
```python
p_t = (seg_target[seg_mask] * pred_prob) + ((1 - seg_target[seg_mask]) * (1 - pred_prob))
# Compute alpha factor
alpha_factor = seg_target[seg_mask] * alpha + (1 - seg_target[seg_mask]) * (1 - alpha)
# Compute the final loss
focal_loss = (alpha_factor * (1. - p_t) ** gamma * bce_loss[seg_mask]).mean()
```
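For context, a minimal self-contained sketch of how these lines fit together end to end. The `out_map` logits argument and the default `alpha`/`gamma` values are assumptions for illustration, not part of the diff; `seg_target` (float 0/1 map) and `seg_mask` (bool validity mask) follow the names in the snippet:

```python
import torch
import torch.nn.functional as F

def focal_loss(out_map, seg_target, seg_mask, alpha=0.25, gamma=2.0):
    # Unreduced BCE so each element can be weighted before averaging
    bce_loss = F.binary_cross_entropy_with_logits(out_map, seg_target, reduction="none")
    pred_prob = torch.sigmoid(out_map[seg_mask])
    # p_t: probability the model assigns to the ground-truth class of each element
    p_t = seg_target[seg_mask] * pred_prob + (1 - seg_target[seg_mask]) * (1 - pred_prob)
    # Class-balancing weight: alpha for positives, 1 - alpha for negatives
    alpha_factor = seg_target[seg_mask] * alpha + (1 - seg_target[seg_mask]) * (1 - alpha)
    # (1 - p_t) ** gamma down-weights well-classified elements
    return (alpha_factor * (1.0 - p_t) ** gamma * bce_loss[seg_mask]).mean()
```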
I think we need to address the masking + reduction problem of the dice loss first: once masked, the tensor is flattened to 1D. So if one class has 10 times more masked region than another, this will be a problem.

I'd suggest changing

```python
seg_target[mask].mean()
```

to

```python
mask = mask.to(dtype=torch.float32)
# Average on N, H, W
class_loss = (seg_target * mask).sum((0, 2, 3)) / mask.sum((0, 2, 3))
loss = class_loss.mean()
```

or averaging it on H, W only before the final mean.
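To make the suggested reduction concrete, here is a toy illustration assuming NCHW tensors (all names, shapes, and values here are hypothetical):

```python
import torch

N, C, H, W = 2, 3, 8, 8
per_pixel_loss = torch.rand(N, C, H, W)   # e.g. an unreduced per-pixel loss map
mask = torch.rand(N, C, H, W) > 0.5       # bool mask of valid pixels

mask = mask.to(dtype=torch.float32)
# Sum over batch and spatial dims per class, divide by that class's valid-pixel
# count: each class contributes equally to the final mean regardless of its area
class_loss = (per_pixel_loss * mask).sum((0, 2, 3)) / mask.sum((0, 2, 3))
loss = class_loss.mean()
print(class_loss.shape)  # torch.Size([3]) -> one scalar per class
```

Note that a class whose mask is entirely `False` would divide by zero here; clamping `mask.sum((0, 2, 3))` to a minimum of 1 is one way to guard against that.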
I am not sure I understand well here: do you want to remove each `...[mask]` occurrence?
So the difference is the following:

- `my_tensor[mask]` is a 1D tensor with a number of elements equal to the number of `True` values in the mask
- `my_tensor * mask.to(dtype=torch.float32)` has the same shape as `my_tensor`; it only puts zeros on the elements that are masked out

Now if you perform a reduction operation like mean:

- in the first case, you divide by the number of elements in `my_tensor[mask]` = `mask.sum()`
- in the second one, you divide by the number of elements in `my_tensor`

And this extends to dimension-specific operations: if we index with the mask, we lose the separation of the dimensions and end up with a contiguous 1D tensor. For properly scaling the loss, the first case widely increases the contribution of classes with the highest amount of positive mask (it makes no difference if there is only a single class). And since we specifically want to help balance the less-frequent classes here, I suggest leveraging the second option with my suggestion above 👍
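A tiny toy example of the two behaviours (values made up for illustration):

```python
import torch

t = torch.tensor([[1.0, 2.0], [3.0, 4.0]])
mask = torch.tensor([[True, False], [True, True]])

indexed = t[mask]                        # tensor([1., 3., 4.]): 1D, mask.sum() elements
multiplied = t * mask.to(torch.float32)  # same 2x2 shape, masked-out entry zeroed

print(indexed.mean())          # 8 / 3: divides by mask.sum()
print(multiplied.mean())       # 8 / 4: divides by t.numel()
print(multiplied.mean(dim=0))  # per-dimension reductions remain possible
```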
(and we need to do this for both the dice loss and the focal loss)
It will only make a difference in the multi-class case, and when the mask isn't all `True`, but that might be safer!
Either way, I think we should run a training with this configuration to make sure it yields a positive change 👍
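For the dice side, a hedged sketch of what that reduction could look like (my own formulation under the NCHW assumption, not the repository's actual code):

```python
import torch

def dice_loss(prob_map, seg_target, seg_mask, eps=1e-8):
    # prob_map, seg_target, seg_mask: (N, C, H, W) probabilities, 0/1 targets, bool mask
    mask = seg_mask.to(dtype=torch.float32)
    # Reduce over N, H, W separately per class so frequent classes don't dominate
    inter = (prob_map * seg_target * mask).sum((0, 2, 3))
    cardinality = ((prob_map + seg_target) * mask).sum((0, 2, 3))
    dice = 1 - 2 * inter / (cardinality + eps)
    return dice.mean()
```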
A few corrections in the loss computation and we'll be good to go!
Final cosmetic adjustments and we're good to go 👍
My bad, a few fixes to do on my previous suggestion
Thanks!
* feat: replace bce by focal loss in linknet loss
* fix: requested changes
* fix: mask reduction
* fix: mask reduction
* fix: loss reduction
* fix: final adjustements
* fix: final changes
Revert "feat: replace bce by focal loss in linknet loss (#824)"
This reverts commit 6511183.
* backup
* onnx classification
* fix: Fixed some ResNet architecture imprecisions (#828)
* feat: Added new resnets
* feat: Added ResNet101
* fix: Fixed ResNet31 & ResNet34 wide
* feat: Added new pretrained resnets
* style: Fixed isort
* fix: Fixed ResNet architectures
* refactor: Refactored LinkNet
* feat: Added more LinkNets
* fix: Fixed MAGResNet
* docs: Updated documentation
* refactor: Removed ResNet101
* fix: Fixed warning
* fix: Fixed a few bugs
* test: Updated unittests
* docs: Fixed docstrings
* update with new models
* feat: replace bce by focal loss in linknet loss (#824)
* feat: replace bce by focal loss in linknet loss
* fix: requested changes
* fix: mask reduction
* fix: mask reduction
* fix: loss reduction
* fix: final adjustements
* fix: final changes
* Revert "feat: replace bce by focal loss in linknet loss (#824)" (this reverts commit 6511183)
* Revert "fix: Fixed some ResNet architecture imprecisions (#828)" (this reverts commit 72e5e0d)
* happy codacy
* sapply suggestions
* fix-setup
* remove onnx from test req
* move onnx deps ftm to torch
* up
* up
* revert requirements
* fix
* update docstring
* up

Co-authored-by: F-G Fernandez <76527547+fg-mindee@users.noreply.github.com>
Co-authored-by: Charles Gaillard <charles@mindee.co>
Following a suggestion by @fg-mindee and @SiddhantBahuguna, this PR replaces the BCE loss with a focal loss in the LinkNet loss to increase recall on imbalanced classes.
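For reference, the focal loss of Lin et al. (2017) rescales the cross-entropy term as

$$\mathrm{FL}(p_t) = -\alpha_t \,(1 - p_t)^{\gamma} \,\log(p_t)$$

where $p_t$ is the predicted probability of the ground-truth class: setting $\gamma = 0$ and $\alpha_t = 1$ recovers plain BCE, while $\gamma > 0$ down-weights easy, well-classified pixels so the rare positive class contributes more to the gradient.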
Any feedback is welcome!