updated tokenization rule in proc.py in timesync data #125

Merged
keighrim merged 1 commit into main from 123-fix-timesync-proc.py
Nov 14, 2025

Conversation

@keighrim
Member

Another fix for #123.

The tokenization rule in proc.py no longer splits punctuation from words, since forced-alignment tools usually lack sophisticated tokenization and often just split on whitespace.
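
To illustrate the difference described above, here is a minimal sketch, not the actual code in proc.py. Both function names are hypothetical, and the regex used for the "before" behavior is an assumption about what punctuation splitting might look like.

```python
import re


def tokenize_splitting_punct(text: str) -> list[str]:
    # Hypothetical "before" behavior: each punctuation mark becomes its own token.
    return re.findall(r"\w+|[^\w\s]", text)


def tokenize_whitespace(text: str) -> list[str]:
    # Hypothetical "after" behavior: split on whitespace only, so punctuation
    # stays attached to its word, as most forced-alignment tools would emit it.
    return text.split()


sample = "Hello, world! It's fine."
print(tokenize_splitting_punct(sample))
# ['Hello', ',', 'world', '!', 'It', "'", 's', 'fine', '.']
print(tokenize_whitespace(sample))
# ['Hello,', 'world!', "It's", 'fine.']
```

With whitespace-only splitting, the token count of the reference text should line up with the token count a whitespace-based aligner produces, which is presumably what the evaluation needs.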
@clams-bot clams-bot added this to infra Nov 14, 2025
@github-project-automation github-project-automation Bot moved this to Todo in infra Nov 14, 2025
@keighrim
Member Author

Merging and closing the underlying issue, as the FA eval.py is fully updated and functional.

@keighrim keighrim merged commit d51eb64 into main Nov 14, 2025
2 checks passed
@github-project-automation github-project-automation Bot moved this from Todo to Done in infra Nov 14, 2025
@keighrim keighrim deleted the 123-fix-timesync-proc.py branch February 2, 2026 14:11