This repository was archived by the owner on Jun 27, 2023. It is now read-only.

Description
I need the start and end times of every word that Sphinx thinks it has identified.
I found out about segments() and realized that it is exactly what I need:
from pocketsphinx import AudioFile

for phrase in AudioFile(audio_file="output.wav"):
    print(phrase.segments(detailed=True))
However, it is not clear to me what the frame number represents. It was immediately apparent that it is neither seconds nor a sample index.
I need a way to convert these frame numbers into seconds.
Edit: Also, some words are followed by a number in parentheses, e.g. 'and(2)'. What does it represent?
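For reference, a minimal sketch of the conversion being asked about. It rests on two assumptions that should be checked against the decoder configuration in use: that the frame rate is the PocketSphinx default of 100 frames per second (the `-frate` option), so one frame spans 10 ms, and that each detailed segment is a `(word, prob, start_frame, end_frame)` tuple. The helper names below are hypothetical and not part of the library. The `(2)` suffix is treated here as a marker for an alternate pronunciation entry in the pronunciation dictionary and is stripped to recover the plain word.

```python
import re

# Assumption: PocketSphinx default frame rate (-frate), in frames per second.
FRAME_RATE = 100


def frames_to_seconds(frame, frame_rate=FRAME_RATE):
    """Convert a frame index into seconds for the given frame rate."""
    return frame / float(frame_rate)


def strip_variant(word):
    """Drop a trailing pronunciation-variant marker such as '(2)'."""
    return re.sub(r"\(\d+\)$", "", word)


def to_timestamps(segments, frame_rate=FRAME_RATE):
    """Map (word, prob, start_frame, end_frame) tuples to (word, start_s, end_s)."""
    return [
        (
            strip_variant(word),
            frames_to_seconds(start, frame_rate),
            frames_to_seconds(end, frame_rate),
        )
        for word, _prob, start, end in segments
    ]


# Made-up sample in the assumed tuple layout, not real decoder output.
sample_segments = [("and(2)", -120, 45, 62)]
timestamps = to_timestamps(sample_segments)
```

With the real decoder, `to_timestamps(phrase.segments(detailed=True))` would be the corresponding call, provided the tuple layout matches the assumption above. If the decoder was configured with a different frame rate, pass it as `frame_rate`.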