Adding test presets for mms + support for any encoder in reference model #1249

Merged
gilkeren1 merged 1 commit into main_w2v2_pretraining from gilkeren/test_presets
Aug 20, 2025

Conversation

gilkeren1

What does this PR do? Please describe:
As discussed in the meeting, this PR adds two test presets for mms that run quickly (about 10 minutes) to verify that PRs do not break training: one preset for llm-asr and one for ctc.

Both tests currently pass:
test_ctc_seed2_test_presets
test_llm_asr_seed2_test_presets

Fixes #{issue number}
n/a

Does your PR introduce any breaking changes? If yes, please list them:
No

Check list:

  • Was the content of this PR discussed and approved via a GitHub issue? (no need for typos or documentation improvements)
  • Did you read the contributor guideline?
  • Did you make sure that your PR does only one thing instead of bundling different changes together?
  • Did you make sure to update the documentation with your changes? (if necessary)
  • Did you write any new necessary tests?
  • Did you verify new and existing tests pass locally with your changes?
  • Did you update the CHANGELOG? (no need for typos, documentation, or minor internal changes)

All ok

gilkeren1 requested a review from cbalioglu as a code owner August 19, 2025 17:42
meta-cla bot added the CLA Signed label Aug 19, 2025
gilkeren1 requested review from artemru and mattsetz and removed request for cbalioglu and artemru August 19, 2025 17:42
@wav2vec2_asr_arch("300m_bib1143_3292")
def bib1143_300m_3292() -> Wav2Vec2AsrConfig:
    config = bib1143_300m()
    config.vocab_info.size = 3292

Why this vocab size? Can you leave a comment?

gilkeren1 force-pushed the gilkeren/test_presets branch from 4fb03b0 to e91f6f8 August 20, 2025 15:58
artemru changed the title from "Adding test presets for mms" to "Adding test presets for mms + support for any encoder in reference model" Aug 20, 2025
gilkeren1 force-pushed the gilkeren/test_presets branch from e91f6f8 to 1ef0bf9 August 20, 2025 20:59
gilkeren1 merged commit a5c0410 into main_w2v2_pretraining Aug 20, 2025
15 checks passed
2 participants