I don't see anywhere that the padding mask is applied to the inputs or the loss during training. Given this, a label that is padded as '0' would be treated as an actual class, wouldn't it?
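To make the concern concrete, here is a minimal sketch in plain Python of what I would expect the loss to do. None of this is taken from the code in question: `PAD_ID`, `masked_cross_entropy`, `unmasked_cross_entropy`, and the toy probabilities are all hypothetical, and I am assuming padded labels are filled with 0.

```python
import math

PAD_ID = 0  # assumption: padded label positions are filled with 0


def masked_cross_entropy(log_probs, labels, mask):
    """Mean negative log-likelihood over positions where mask == 1.

    log_probs: per-position lists of log-probabilities over the classes
    labels:    integer class ids (padded positions hold PAD_ID)
    mask:      1 for a real position, 0 for padding
    """
    total, count = 0.0, 0
    for lp, y, m in zip(log_probs, labels, mask):
        if m:  # padded positions are skipped, so their label 0 is never scored
            total += -lp[y]
            count += 1
    return total / count if count else 0.0


def unmasked_cross_entropy(log_probs, labels):
    """Same loss without a mask: padded labels are scored as class 0."""
    return sum(-lp[y] for lp, y in zip(log_probs, labels)) / len(labels)


# Toy batch: two real positions followed by one padded position.
probs = [[0.1, 0.8, 0.1], [0.1, 0.1, 0.8], [0.1, 0.8, 0.1]]
log_probs = [[math.log(p) for p in row] for row in probs]
labels = [1, 2, PAD_ID]
mask = [1, 1, 0]

masked = masked_cross_entropy(log_probs, labels, mask)
unmasked = unmasked_cross_entropy(log_probs, labels)
```

Without the mask, the padded position is penalized for not predicting class 0, so `unmasked` comes out larger than `masked` on this toy batch and the model would be pushed toward predicting class 0 on padding.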