ASR Eval

Code for the yearly evaluation of Norwegian speech recognition models.

Install

Use uv or pdm to install the dependencies from pyproject.toml. With pdm:

pdm install

Predict

Fill in the placeholder arguments in the prediction command below. The model name can be one of "usm", "chirp", "gcloud" or "azure", or any Hugging Face model, e.g. "NbAiLab/nb-whisper-large".

pdm run python -m asr_eval.predict -m <modelname> -i <input_file> -o <output_file> -A <audio_path>
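
When several models are to be compared, the command above can be generated once per model. The following is a minimal sketch and not part of asr_eval: the helper names (`build_predict_command`, `output_path_for`, `run_predictions`) and the `.csv` output naming are assumptions made for illustration. Only the command-line flags are taken from the command above.

```python
import subprocess
from pathlib import Path


def build_predict_command(model, input_file, output_file, audio_path):
    """Return the prediction command as an argument list for subprocess."""
    return [
        "pdm", "run", "python", "-m", "asr_eval.predict",
        "-m", model,
        "-i", str(input_file),
        "-o", str(output_file),
        "-A", str(audio_path),
    ]


def output_path_for(model, output_dir):
    """Derive an output file name from the model name (hypothetical naming).

    Slashes in Hugging Face model names are replaced so the name is a
    valid file name. The .csv extension is an assumption.
    """
    return Path(output_dir) / (model.replace("/", "_") + ".csv")


def run_predictions(models, input_file, output_dir, audio_path):
    """Run the prediction command once per model, stopping on the first failure."""
    Path(output_dir).mkdir(parents=True, exist_ok=True)
    for model in models:
        cmd = build_predict_command(
            model, input_file, output_path_for(model, output_dir), audio_path
        )
        subprocess.run(cmd, check=True)
```

Calling `run_predictions(["azure", "NbAiLab/nb-whisper-large"], <input_file>, <output_dir>, <audio_path>)` with the placeholders filled in would then run the prediction module once per model and write one output file per model.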

Evaluate speech recognition predictions

The main evaluation script expects a CSV file with the ground truth in a column called "standardized_text" and the predicted text in a column called "predictions". The ground truth must be standardized, i.e. without capital letters or punctuation. The script also expects a language code for the written standard: "nob" (Bokmål) or "nno" (Nynorsk).

pdm run python -m asr_eval -l nob path/to/your/input_file.csv
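
Before running the evaluation, it can help to check that the input file has the expected columns. The following is a minimal sketch and not part of asr_eval: the function names are hypothetical, and `standardize` is only one plausible reading of "without capital letters or punctuation" (it removes ASCII punctuation, including hyphens and apostrophes). The standardization actually used for the ground truth may differ.

```python
import csv
import string

# Column names and language codes as described above.
REQUIRED_COLUMNS = ("standardized_text", "predictions")
LANGUAGE_CODES = ("nob", "nno")

# Translation table that deletes every ASCII punctuation character.
_PUNCTUATION_TABLE = str.maketrans("", "", string.punctuation)


def standardize(text):
    """Lowercase the text, drop ASCII punctuation and collapse whitespace."""
    return " ".join(text.translate(_PUNCTUATION_TABLE).lower().split())


def missing_columns(header):
    """Return the required column names that are absent from a CSV header."""
    return [column for column in REQUIRED_COLUMNS if column not in header]


def read_header(csv_path):
    """Read the first row of a CSV file as a list of column names."""
    with open(csv_path, newline="", encoding="utf-8") as csv_file:
        return next(csv.reader(csv_file), [])
```

For a given file, `missing_columns(read_header("path/to/your/input_file.csv"))` returns an empty list when both required columns are present.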

Sprakbanken/asr_eval
