-
Notifications
You must be signed in to change notification settings - Fork 658
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Containerized servings.py #17
Conversation
Thanks for this, we'll have a look in a bit... noticed that you were trying to add references... quick note on them: |
The references were just for testing, what comes out, if I through in a German reference. Outcome: Funny 😉 Looking forward to actual fine-tuning capabilities. Are those LoRAs? |
lol expected
LoRAs - not sure yet... we're open to folks adding that, and I can provide some guidance! |
IF it was transformers I would be able to help quickly with LoRA PEFT. But as I see you have your very custom GPT implementation. |
I am happy to lead through those rough edges if it helps, otherwise we'll have to wait till we are able to do this! :( |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
really appreciate you putting this together. a few changes requested
Signed-off-by: Lucas Hänke de Cansino <lhc@next-boss.eu>
Signed-off-by: Lucas Hänke de Cansino <lhc@next-boss.eu>
Signed-off-by: Lucas Hänke de Cansino <lhc@next-boss.eu>
Signed-off-by: Lucas Hänke de Cansino <lhc@next-boss.eu>
Signed-off-by: Lucas Hänke de Cansino <lhc@next-boss.eu>
I do not understand why this API is using a very non-standard "X-Payload" header to contain the request body, and despite the link to For anyone else trying to figure this out, here is a curl -X POST http://localhost:8869/tts \
-H "X-Payload: {\"text\": \"Hello, this is a test!\", \"guidance\": 3.0, \"top_p\": 0.95, \"speaker_ref_path\": \"assets/av
a.flac\"}" \
-o output.mp3 EDIT: I have also discovered this:
so... that's fun. It's currently hardcoded to only support up to 220 characters of input. I can't see where the 220 character limit is actually being enforced, but I can't think of any use case where 220 characters is reliably large enough to use. I am excited to see more options in the open source TTS space, so hopefully that limitation is lifted soon. |
Yeah was wondering about X-Payload as well and thought of changing it to but just kept it as is in the end. For the length, I think the current implementation expects you to basically feed the API a text in chunks, with a max char length of 220... |
Signed-off-by: Lucas Hänke de Cansino <lhc@next-boss.eu>
@coder543, we will lift this limit soon. Synthesising arbitrary lengths of text is on our roadmap after we ship optimisations to reduce inference latency & fine-tuning support. What are you building with TTS today? |
Will fix this shortly |
@sidroopdaska With a good enough synthesized voice, I would enjoy being able to paste in an article and have it read it to me sometimes. So, I was just playing around with that kind of thing. |
Added Dockerfile & docker-compose.yml for containerized deployment.
Had to make a slight adjustment to
servings.py
changing host ip from127.0.0.1
to0.0.0.0
.Hope this helps others with quick and convenient deployment and testing.
Addresses #48