Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Explore completely hands free operations instead of push to talk / image #33

Open
MrCsabaToth opened this issue Aug 16, 2024 · 1 comment
Labels
enhancement New feature or request

Comments

@MrCsabaToth
Copy link
Member

Currently the app operates in a push to talk manner (also push to signal the end of talk, the native STT often times cuts the session short while I'm still speaking).
It'd be the best to have completely hand free operations somehow. The activation can be by keyword. We could also utilize gesture detection in case of multi modal operation.
Even in the demo video the multi modal scenes were hard to record.

@MrCsabaToth MrCsabaToth added the enhancement New feature or request label Aug 16, 2024
@MrCsabaToth
Copy link
Member Author

Roman Jaquez at https://gdg.community.dev/events/details/google-gdg-surrey-presents-beyond-chatbots-unlocking-geminis-potential-through-flutter/ told me that the https://pub.dev/packages/speech_to_text plugin support voice activation. So we could offer that when the user enables Android native Speech Services and this would not be usable in translation mode.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant