Name	Name	Last commit message	Last commit date
parent directory ..
fsts	fsts
logits	logits
LICENSE	LICENSE
README.md	README.md
logits2fsts.py	logits2fsts.py
symbols.txt	symbols.txt

Name

Last commit message

Last commit date

Argentinian Spanish CTC phoneme lattices

This directory contains CTC posterior labeling distributions for a Spanish phoneme recognition task. These were generated using the Persephone toolkit trained on a subset of a crowdsourced high-quality Argentinian Spanish speech data set and evaluated on a small held-out subset.

The labeling distributions are presented in two equivalent formats:

In the directory logits as 2-dimensional NumPy arrays of shape (F, L) where F is the number of frames and L is the size of the labeling alphabet. Each row holds the per-frame logits of the L-dimensional categorical distribution over labels.
In the directory fsts as serialized finite automata in OpenFst format.

Provide feedback