Remove redundant steps & improve README.md #26

ethanzrd · 2023-08-10T19:53:49Z

The installation script also installs portaudio as part of the Conda environment to avoid having the user install it on their device, eliminating the first step.
The README file addresses troubleshooting and initial configuration.

…ed "whisper-playground," integrated configuration options for the transcription device and compute type, improved clarity of README instructions, and streamlined package selection by removing redundancies.

…izationPlayground

saharmor · 2023-08-10T20:26:14Z

README.md

-If you want minimal latency, use the real-time mode. If you don't mind growing latency and prioritize accuracy, use the sequential mode.
+## Troubleshooting
+
+- If you're unable to connect from the client to the server, use an ngrok tunnel to expose port 8000.


Why would this happen?

Honestly, no idea. Worked fine on MacOS, but didn't work on Windows. If I had to take a guess, I'd say the connection is blocked.

Honestly, no idea. Worked fine on MacOS, but didn't work on Windows. If I had to take a guess, I'd say the connection is blocked.

After changing http://0.0.0.0:8000/ to http://localhost:8000/ in App.js, it is able to run.

@Epresin Good find! Appreciate it :)
Seems to work just fine on MacOS as well, so it's probably the safer bet :)

saharmor · 2023-08-10T20:26:43Z

README.md

+1. On MacOS, there's a clash between av files preventing transcription (works well on Google Colab with Python 3.8).
+2. In the sequential mode, there may be uncontrolled speaker swapping, which can be fixed by using pyannote's building blocks and handling speakers manually.
+3. In real-time mode, audio data not meeting the transcription timeout won't be transcribed.
+4. Speechless batches will cause errors.


Add a link to the issue you've opened

saharmor · 2023-08-10T20:27:40Z

backend/config.py

@@ -1,6 +1,9 @@
 from diart import PipelineConfig
 from enum import Enum

+TRANSCRIPTION_DEVICE = "cuda"  # use 'cpu' if it doesn't work
+COMPUTE_TYPE = "int8_float16"  # use float32 with cpu


Maybe I'm missing something, but the comment seems off. Float 32 though using int8_float16

If one is using "cpu" as their transcription device, it should be float32. float16 wouldn't make that much of a difference even if supported.
int8_float16 works well with cuda.

saharmor · 2023-08-10T20:28:23Z

@ethanzrd I approved and merged but please reply to my comments or just go ahead and fix them if relevant

ethanzrd and others added 6 commits August 10, 2023 22:46

Revised installation script to generate a fresh Conda environment nam…

7681c13

…ed "whisper-playground," integrated configuration options for the transcription device and compute type, improved clarity of README instructions, and streamlined package selection by removing redundancies.

Merge remote-tracking branch 'origin/diarizationPlayground' into diar…

61627e8

…izationPlayground

Merge branch 'saharmor:main' into diarizationPlayground

61c2bf9

Improved README.md, removed unnecessary steps

160e893

Merge remote-tracking branch 'origin/diarizationPlayground' into diar…

4530809

…izationPlayground

Update README.md

e850738

saharmor reviewed Aug 10, 2023

View reviewed changes

saharmor approved these changes Aug 10, 2023

View reviewed changes

saharmor merged commit f840fd1 into saharmor:main Aug 10, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove redundant steps & improve README.md #26

Remove redundant steps & improve README.md #26

ethanzrd commented Aug 10, 2023

saharmor Aug 10, 2023

ethanzrd Aug 10, 2023

Epresin Aug 11, 2023

ethanzrd Aug 11, 2023

saharmor Aug 10, 2023

saharmor Aug 10, 2023

ethanzrd Aug 10, 2023 •

edited

Loading

saharmor commented Aug 10, 2023

Remove redundant steps & improve README.md #26

Remove redundant steps & improve README.md #26

Conversation

ethanzrd commented Aug 10, 2023

saharmor Aug 10, 2023

Choose a reason for hiding this comment

ethanzrd Aug 10, 2023

Choose a reason for hiding this comment

Epresin Aug 11, 2023

Choose a reason for hiding this comment

ethanzrd Aug 11, 2023

Choose a reason for hiding this comment

saharmor Aug 10, 2023

Choose a reason for hiding this comment

saharmor Aug 10, 2023

Choose a reason for hiding this comment

ethanzrd Aug 10, 2023 • edited Loading

Choose a reason for hiding this comment

saharmor commented Aug 10, 2023

ethanzrd Aug 10, 2023 •

edited

Loading