Skip to content

Conversation

@liam-mceneaney
Copy link

@liam-mceneaney liam-mceneaney commented Mar 5, 2024

Demo for Android capable of streaming audio in X second sizes.

If anyone has any better ideas for handling the audio buffer on android let me know please! Currently if there is too much audio data to be transcribed I just drop the buffer. This usually happens on silence or if you speak in nonsense. I plan on implementing a VAD library to mitigate silence being processed.

In assets/models I converted distil-small to ggml and it appears to process audio 2x faster than real time. Currently running on an S24 Ultra with just the CPU.

Screenshot_20240304_195942_WhisperCppDemo

Digipom and others added 7 commits November 29, 2023 17:02
Warning about %lu on 32-bit targets. Updated to %zu.
Emulated the objc example of storing a buffer of audio data and
processing in chunks.  Current implementation has issues with handling
situations in which there is too much audio data, current solution is to
dump the buffer which means lost text.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants