In this project, I undertook the task of fine-tuning a whisper speech to text base model to enhance pronunciation learning, particularly focusing on broken words or fragmented speech segments. The primary objective was to develop a robust system capable of accurately transcribing whispered speech, especially in scenarios where words are partially uttered or fragmented. Leveraged advanced transfer learning techniques and deep learning architectures to achieve an impressive accuracy rate of nearly 95%. Collaborated with educators to integrate the model into language learning applications, demonstrating a commitment to leveraging technology for educational enhancement.
-
Notifications
You must be signed in to change notification settings - Fork 0
bilalhameed248/Whisper-Fine-Tuning-For-Pronunciation-Learning
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
Fine Tuning of Whisper Speech To Text Base Model For Pronunciation Learning
Topics
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published