suggested improvement to normalize on base #8

Closed
@tensorfoo

Description

Hi, great work so far. Just one thing: I've noticed that even though the original wav2vec2 base model did not use normalization (it was enabled only for large), the author has suggested that future models, even small ones, use normalization.

So I would suggest that any future training run for CLSRIL-23 be done with normalization, as suggested. I tried using CLSRIL-23 as a base for pretraining, and it seems to work fine with normalization=True, but also adding model.extractor_mode='layer_norm', as suggested, gave me an error.
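
For reference, here is a rough sketch of the two settings I mean, written as Hydra-style overrides. The key names `task.normalize` and `model.extractor_mode` are my assumption about how fairseq lays out its wav2vec 2.0 config, and `build_overrides` is just a hypothetical helper for illustration, not part of any library. One possible cause of the error, which I have not confirmed, is that the checkpoint's feature extractor was built in the default mode, so its parameters may not line up with a layer_norm extractor.

```python
from typing import List


def build_overrides(normalize: bool, layer_norm_extractor: bool) -> List[str]:
    """Assemble Hydra-style overrides for a wav2vec 2.0 pretraining run.

    Hypothetical helper: the key names below are assumptions about the
    fairseq config layout, not something confirmed in this issue.
    """
    # Input normalization of the raw waveform (worked fine for me when enabled).
    overrides = [f"task.normalize={str(normalize).lower()}"]
    if layer_norm_extractor:
        # Layer norm in the conv feature extractor; this is the setting that
        # produced an error when continuing from the CLSRIL-23 checkpoint.
        overrides.append("model.extractor_mode=layer_norm")
    return overrides


if __name__ == "__main__":
    # Combination that worked: normalization only.
    print(" ".join(build_overrides(True, False)))
    # Combination that errored: normalization plus layer_norm extractor.
    print(" ".join(build_overrides(True, True)))
```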

Labels: enhancement
