Integration of Text-to-Speech (TTS) Feature for Enhanced Immersion #2

Open
SverreNystad opened this issue Sep 9, 2023 · 3 comments

SverreNystad commented Sep 9, 2023

While the GPT Dungeon Master provides textual responses, having an auditory component can significantly enhance the immersion for players, especially during intense or dramatic moments in the game.

I propose the integration of a Text-to-Speech (TTS) feature that can convert the GPT DM's textual responses into spoken words. This would allow players to hear the DM's narration, making the gameplay experience more engaging and lifelike.

Proposed Features:

  • Voice Selection: Provide options for different voice tones (e.g., deep, mystical, cheerful) to match the mood of the game.
  • Volume Control: Allow users to adjust the TTS volume independently of other game sounds.
  • Speed Adjustment: Give users the ability to speed up or slow down the TTS to their preference.
  • Pause/Resume: Implement a feature to pause and resume the TTS, especially useful during longer narrations.
  • Emotions: Make the model
  • Skip: Many players find long dialogue irritating, so we should provide a way to skip the current speech.
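
To make the control features above concrete, here is a minimal sketch of the playback state they imply. It is not tied to any TTS provider and produces no audio: `NarrationSettings`, `NarrationPlayer`, the voice label, and the 0.5 to 2.0 speed range are all hypothetical names and values chosen for illustration.

```python
from collections import deque
from dataclasses import dataclass
from typing import Deque, Optional


def _clamp(value: float, low: float, high: float) -> float:
    """Keep a user-supplied value inside the allowed range."""
    return min(max(value, low), high)


@dataclass
class NarrationSettings:
    voice: str = "deep"   # hypothetical voice label (e.g. deep, mystical, cheerful)
    volume: float = 1.0   # 0.0 to 1.0, separate from other game sounds
    speed: float = 1.0    # playback rate multiplier (assumed range 0.5 to 2.0)

    def __post_init__(self) -> None:
        self.volume = _clamp(self.volume, 0.0, 1.0)
        self.speed = _clamp(self.speed, 0.5, 2.0)


class NarrationPlayer:
    """Tracks which narration is current and whether it is paused."""

    def __init__(self, settings: Optional[NarrationSettings] = None) -> None:
        self.settings = settings or NarrationSettings()
        self.queue: Deque[str] = deque()
        self.current: Optional[str] = None
        self.paused = False

    def enqueue(self, text: str) -> None:
        # Start immediately if nothing is being narrated.
        self.queue.append(text)
        if self.current is None:
            self._advance()

    def pause(self) -> None:
        if self.current is not None:
            self.paused = True

    def resume(self) -> None:
        self.paused = False

    def skip(self) -> None:
        # Drop the current narration and move to the next one, if any.
        self._advance()

    def _advance(self) -> None:
        self.current = self.queue.popleft() if self.queue else None
        self.paused = False
```

A real implementation would hand `current` and `settings` to whichever TTS backend is chosen; keeping the state separate from the backend should make it easier to swap providers later.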

SverreNystad commented Sep 9, 2023

SverreNystad self-assigned this on Sep 9, 2023

I started out using Watson, but Lovo seems better because it lets us change the speed and emotion of the voice. We could use a classifier, or the LLM with function calling, to pick the appropriate emotion and speaking speed for each narration.
https://api-doc.lovo.ai/
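
As a rough sketch of the function-calling idea, the LLM could be given a tool definition in the JSON Schema style used by function-calling APIs, and its returned arguments could be validated before being passed on to the TTS service. Everything named here is an assumption for illustration: the tool name `set_narration_style`, the emotion labels, and the speed range are hypothetical and are not taken from the Lovo API.

```python
import json

# Hypothetical emotion labels; the real set depends on the chosen TTS service.
EMOTIONS = ("neutral", "cheerful", "mystical", "menacing")

# Tool definition to send to the LLM alongside the DM's narration text.
NARRATION_STYLE_TOOL = {
    "name": "set_narration_style",
    "description": "Choose the emotion and speaking speed for the next narration.",
    "parameters": {
        "type": "object",
        "properties": {
            "emotion": {"type": "string", "enum": list(EMOTIONS)},
            "speed": {"type": "number", "minimum": 0.5, "maximum": 2.0},
        },
        "required": ["emotion"],
    },
}


def parse_style_call(arguments_json: str) -> dict:
    """Validate the arguments string the LLM returned for the tool call.

    Falls back to a neutral voice at normal speed when the model's
    output is malformed or out of range.
    """
    try:
        args = json.loads(arguments_json)
    except json.JSONDecodeError:
        args = {}
    if not isinstance(args, dict):
        args = {}

    emotion = args.get("emotion")
    if emotion not in EMOTIONS:
        emotion = "neutral"

    try:
        speed = float(args.get("speed", 1.0))
    except (TypeError, ValueError):
        speed = 1.0
    speed = min(max(speed, 0.5), 2.0)  # clamp to the assumed supported range

    return {"emotion": emotion, "speed": speed}
```

For example, `parse_style_call('{"emotion": "menacing", "speed": 0.8}')` returns `{"emotion": "menacing", "speed": 0.8}`, while unparseable output falls back to the neutral defaults. The validated values would then need to be mapped onto whatever parameters the selected TTS API actually exposes.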

We need to discuss which text-to-speech service is the best fit.

SverreNystad changed the title from "Text to speach" to "Integration of Text-to-Speech (TTS) Feature for Enhanced Immersion" on Sep 9, 2023