DropPath Implementation

https://github.com/huggingface/pytorch-image-models/blob/a6e8598aaf90261402f3e9e9a3f12eac81356e9d/timm/models/layers/drop.py#L140

Hi, 
I had two questions about the implementation of the dropPath:
1. Why do we do it per sample, as far as I understand from https://arxiv.org/pdf/1603.09382.pdf, you either take the whole batch or drop it all together with probability p_l, why is it done per sample here?
2. What is the _div(keep_prob) used for, I can't see that in the equation of the paper as well, can you please clarify the reason behind that?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DropPath Implementation #2118

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

DropPath Implementation #2118

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions