Code to do yearly evaluation of Norwegian speech recognition models
Use uv or pdm to install dependencies from pyproject.toml
pdm install
The placeholder arguments in the prediction command below must be filled in. The model name can be one of "usm", "chirp", "gcloud", "azure" or any huggingface model, e.g. "NbAiLab/nb-whisper-large".
pdm run python -m asr_eval.predict -m <modelname> -i <input_file> -o <output_file> -A <audio_path>
The main evaluation script expects a csv-file where the ground truth is standardized (without capital letters or punctuation) in a column called "standardized_text" and predicted text is in a column called "predictions". It also expects a language code for the written standard ("nob" or "nno").
pdm run python -m asr_eval -l nob path/to/your/input_file.csv