Skip to content

Conversation

@neooriginal
Copy link
Collaborator

@neooriginal neooriginal commented Oct 23, 2025

I did also implement backend/scripts/stt/k_compare_transcripts_performance.py but i could not test it yet.

This pull request introduces ElevenLabs Scribe as a new speech-to-text (STT) integration, making it available as a selectable transcription service alongside existing providers. The changes span environment configuration, requirements, core streaming logic, database storage, performance testing, and include a new integration test script. The main updates focus on supporting ElevenLabs Scribe throughout the backend pipeline, enabling its use for real-time and batch transcription.

ElevenLabs Scribe Integration

  • Added the elevenlabs Python package to requirements.txt and the ELEVENLABS_API_KEY environment variable to .env.template for authentication and configuration. [1] [2]
  • Implemented ElevenLabs Scribe support in the streaming pipeline (process_audio_elevenlabs), updated service selection logic to prioritize ElevenLabs when configured, and mapped supported languages. [1] [2] [3]
  • Integrated ElevenLabs Scribe into the main transcription router, including socket management, audio streaming, and resource cleanup. [1] [2] [3] [4] [5] [6] [7] [8]

Database and Performance Testing

  • Updated the database logic to store and retrieve ElevenLabs Scribe transcripts, ensuring results are available alongside other providers.
  • Enhanced the transcript performance comparison script to include ElevenLabs, allowing for benchmarking and validation against other STT services. [1] [2] [3] [4] [5]

Testing and Validation

  • Added a standalone integration test script (test_elevenlabs_stt.py) to verify ElevenLabs Scribe connectivity, service selection, and basic audio streaming functionality.

@beastoin
Copy link
Collaborator

beastoin commented Nov 5, 2025

I cannot merge if the pr was not tested. pls get back to me with the test results.

this pr is clean. nice job. thanks man.

@neooriginal
Copy link
Collaborator Author

I cannot merge if the pr was not tested. pls get back to me with the test results.

this pr is clean. nice job. thanks man.

It was fully tested, don't worry. I've been running it on my custom backend for a few weeks now.

@neooriginal neooriginal changed the title [proof-of-concept] add elevenlabs scribe add elevenlabs scribe Nov 5, 2025
@neooriginal
Copy link
Collaborator Author

I can't sadly test backend/scripts/stt/k_compare_transcripts_perfor mance.py as I don't have the funds to test all of then. If you want me to test it, please get back to me on discord with the elevenlabs API key.

@Benson665
Copy link

Thank you all on your support

@neooriginal
Copy link
Collaborator Author

@beastoin

@mdmohsin7
Copy link
Member

Scribe v2 is real-time but does not support diarization, scribe v1 supports diarization but is not real-time

@neooriginal
Copy link
Collaborator Author

Scribe v2 is real-time but does not support diarization, scribe v1 supports diarization but is not real-time

Thats a good point, i did not check before migrating to v2. Sorry about that.
Want me to revert back to v1? Deepgram is terrible

@mdmohsin7
Copy link
Member

Also there are a few issues in code, elevenlabs_socket, elevenlabs_socket1, elevenlabs_socket2, you'd probably want to replace elevenlabs_socket and its instances with elevenlabs_socket1

@neooriginal
Copy link
Collaborator Author

Also there are a few issues in code, elevenlabs_socket, elevenlabs_socket1, elevenlabs_socket2, you'd probably want to replace elevenlabs_socket and its instances with elevenlabs_socket1

Got it, will it be merged if fixed?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants