[Feature Request]: Voice Input and Audio Response #1877

Closed
1 task done
paresh2806 opened this issue Aug 8, 2024 · 1 comment

@paresh2806
Contributor

Is there an existing issue for the same feature request?

  • I have checked the existing issues.

Is your feature request related to a problem?

No response

Describe the feature you'd like

Summary:
Add voice input and audio response capabilities to the chat application.

Description:

  1. Voice Input:

    • Allow users to speak instead of typing their messages.
    • Implement speech-to-text functionality that transcribes the spoken words.
    • The transcribed text will be passed to the chat completion API call, similar to the current text input method.

    [Image: Voice Input Example]

  2. Audio Response:

    • Provide users with the option to listen to the responses.
    • Implement text-to-speech functionality that converts the text responses into audio.
    • Users can choose to play the audio response instead of reading the text.

    [Image: Audio Response Example]
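
The two steps above could be wired together roughly as follows. This is only a minimal sketch under stated assumptions, not the project's implementation: `chat_turn`, `ChatTurn`, and the `fake_*` stand-ins are hypothetical names, and a real integration would replace the stand-ins with actual speech-to-text, chat completion, and text-to-speech calls. It shows that a spoken message is transcribed and then sent through the same completion call as typed text, and that the audio reply is opt-in.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class ChatTurn:
    question: str           # text that was sent to the chat completion call
    answer: str             # text reply
    audio: Optional[bytes]  # synthesized reply, or None if not requested/available


def chat_turn(
    complete: Callable[[str], str],
    *,
    text: Optional[str] = None,
    audio: Optional[bytes] = None,
    transcribe: Optional[Callable[[bytes], str]] = None,
    synthesize: Optional[Callable[[str], bytes]] = None,
    want_audio: bool = False,
) -> ChatTurn:
    """Run one chat turn from typed text or recorded audio (hypothetical helper)."""
    # Voice input: transcribe first, then treat the result like typed text.
    # If both text and audio are given, the audio transcript takes precedence.
    if audio is not None:
        if transcribe is None:
            raise ValueError("audio input requires a speech-to-text function")
        text = transcribe(audio)
    if not text or not text.strip():
        raise ValueError("no message to send")

    question = text.strip()
    answer = complete(question)  # same call path as the existing text input

    # Audio response: only when the user asked for it, a text-to-speech
    # backend is configured, and there is a non-empty answer to read out.
    reply_audio = None
    if want_audio and synthesize is not None and answer:
        reply_audio = synthesize(answer)
    return ChatTurn(question, answer, reply_audio)


# Stand-in backends so the sketch runs without any external service.
def fake_transcribe(clip: bytes) -> str:
    return clip.decode("utf-8")


def fake_complete(question: str) -> str:
    return f"You asked: {question}"


def fake_synthesize(answer: str) -> bytes:
    return answer.encode("utf-8")


typed = chat_turn(fake_complete, text="What is RAG?")
spoken = chat_turn(
    fake_complete,
    audio=b"What is RAG?",
    transcribe=fake_transcribe,
    synthesize=fake_synthesize,
    want_audio=True,
)
```

Passing the backends in as plain callables keeps the chat logic independent of any particular speech provider, so the typed and spoken paths differ only in the transcription step.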

Benefits:

  • Improves accessibility and convenience.
  • Enhances user experience with multiple interaction methods.

Implementation:

  • Integrate speech-to-text and text-to-speech APIs.
  • Add UI elements for recording and playing audio.

Describe implementation you've considered

No response

Documentation, adoption, use case

No response

Additional information

No response

@KevinHuSh KevinHuSh mentioned this issue Sep 12, 2024
46 tasks
cike8899 added a commit to cike8899/ragflow that referenced this issue Sep 14, 2024
KevinHuSh pushed a commit that referenced this issue Sep 14, 2024
### What problem does this PR solve?

feat: Supports text output and sound output #1877

### Type of change

- [x] New Feature (non-breaking change which adds functionality)
cike8899 added a commit to cike8899/ragflow that referenced this issue Sep 14, 2024
KevinHuSh pushed a commit that referenced this issue Sep 14, 2024
…to the tab of the conversation #1877 (#2440)

### What problem does this PR solve?

feat: After the voice in the new conversation window is played, jump to
the tab of the conversation #1877

### Type of change

- [x] New Feature (non-breaking change which adds functionality)
cike8899 added a commit to cike8899/ragflow that referenced this issue Sep 14, 2024
KevinHuSh pushed a commit that referenced this issue Sep 14, 2024
…llowed to be turned on #1877 (#2446)

### What problem does this PR solve?

feat: If the tts model is not set, the Text to Speech switch is not
allowed to be turned on #1877

### Type of change

- [x] New Feature (non-breaking change which adds functionality)
cike8899 added a commit to cike8899/ragflow that referenced this issue Sep 14, 2024
KevinHuSh pushed a commit that referenced this issue Sep 14, 2024
…ly message when the answer is empty #1877 (#2447)

### What problem does this PR solve?

feat: When voice is turned on, the page will not display an empty reply
message when the answer is empty #1877

### Type of change

- [x] New Feature (non-breaking change which adds functionality)
@peizimo

peizimo commented Oct 22, 2024

Voice input is not available right now. If I want to implement this feature myself, how should I go about it? Thank you.

3 participants