Containerized servings.py #17

l4b4r4b4b4 · 2024-02-08T08:50:45Z

Added Dockerfile & docker-compose.yml for containerized deployment.

I had to make a slight adjustment to serving.py, changing the host IP from 127.0.0.1 to 0.0.0.0 so that the server is reachable from outside the container.
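For readers wondering why the host change matters: a server bound to 127.0.0.1 inside a container only accepts connections from inside that container, so a published port would not reach it. Below is a minimal standard-library sketch of the two bind addresses, not the project's code. `bound_address` is a hypothetical helper, and port 0 simply asks the OS for any free port.

```python
import socket


def bound_address(host: str) -> tuple[str, int]:
    """Bind a TCP socket to `host` on an OS-chosen port and return the address it got."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
        sock.bind((host, 0))  # port 0: let the OS pick a free port
        return sock.getsockname()


if __name__ == "__main__":
    # Loopback only: reachable solely from the same network namespace.
    print(bound_address("127.0.0.1"))
    # All interfaces: reachable through a port published by the container runtime.
    print(bound_address("0.0.0.0"))
```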

Hope this helps others with quick and convenient deployment and testing.

Addresses #48

vatsalaggarwal · 2024-02-08T14:15:13Z

Thanks for this, we'll have a look in a bit...

I noticed that you were trying to add references. A quick note on them:
i) we don't support cross-lingual cloning yet, so I think the German reference is unlikely to work
ii) we don't support references shorter than 30 seconds, and anything that is unclean or has background noise is very unlikely to work!

l4b4r4b4b4 · 2024-02-08T14:24:26Z

The references were just for testing what comes out if I throw in a German reference. Outcome: funny 😉

Looking forward to actual fine-tuning capabilities. Are those LoRAs?

vatsalaggarwal · 2024-02-08T16:06:28Z

> The references were just for testing what comes out if I throw in a German reference. Outcome: funny 😉

lol expected

> Looking forward to actual fine-tuning capabilities. Are those LoRAs?

I am not sure fine-tuning will be able to make that 20s sample with background noise work :P ... Have you tried other samples that are clean and longer than 30s, that you're hoping to fine-tune on but couldn't get to work zero-shot?

LoRAs - not sure yet... we're open to folks adding that, and I can provide some guidance!

l4b4r4b4b4 · 2024-02-09T10:37:48Z

If it were transformers, I would be able to help quickly with LoRA via PEFT. But as far as I can see, you have your own very custom GPT implementation.

vatsalaggarwal · 2024-02-09T10:38:44Z

I am happy to guide you through those rough edges if it helps; otherwise we'll have to wait until we are able to do this ourselves! :(

sidroopdaska

really appreciate you putting this together. a few changes requested

Dockerfile

README.md

.gitignore

fam/llm/serving.py

docker-compose.yml

coder543 · 2024-02-15T01:27:02Z

I do not understand why this API is using a very non-standard "X-Payload" header to contain the request body, and despite the link to /docs, there is no real documentation there. The FastAPI implementation does not explicitly say that TTSRequest is the expected request type, likely because pulling it out of a header isn't a common approach, so the generated docs assume there are no parameters at all.

For anyone else trying to figure this out, here is a curl command that serves as a basic example with the way the API is currently structured:

    curl -X POST http://localhost:8869/tts \
         -H "X-Payload: {\"text\": \"Hello, this is a test!\", \"guidance\": 3.0, \"top_p\": 0.95, \"speaker_ref_path\": \"assets/ava.flac\"}" \
         -o output.mp3
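For anyone who would rather call this from Python, here is a rough standard-library equivalent of the curl command above. It is only a sketch: the endpoint, port, and payload fields are copied from the curl example, while `build_tts_request` and the reference path used in the demo are hypothetical. The request is built but not sent, so it can be inspected without a running server.

```python
import json
import urllib.request


def build_tts_request(
    text: str,
    speaker_ref_path: str,
    guidance: float = 3.0,
    top_p: float = 0.95,
    base_url: str = "http://localhost:8869",
) -> urllib.request.Request:
    """Build (but do not send) a POST to /tts with the JSON payload in X-Payload."""
    payload = {
        "text": text,
        "guidance": guidance,
        "top_p": top_p,
        "speaker_ref_path": speaker_ref_path,
    }
    return urllib.request.Request(
        url=f"{base_url}/tts",
        method="POST",
        # The JSON travels in a header, not in the request body.
        headers={"X-Payload": json.dumps(payload)},
    )


if __name__ == "__main__":
    # "path/to/reference.flac" is a placeholder, not a file from the repository.
    req = build_tts_request("Hello, this is a test!", "path/to/reference.flac")
    print(req.get_method(), req.full_url)
    # urllib stores header names capitalized as "X-payload"; HTTP header
    # names are case-insensitive, so the server still sees X-Payload.
    print(req.get_header("X-payload"))
```

To actually send it, pass the request to `urllib.request.urlopen(req)` while the server is running and write `resp.read()` to a file, which corresponds to curl's `-o output.mp3`.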

EDIT:

I have also discovered this:

    # NOTE: supports max. 220 characters atm.
    # Long form synthesis coming soon...
    MAX_CHARS = 220

so... that's fun. It's currently hardcoded to only support up to 220 characters of input. I can't see where the 220 character limit is actually being enforced, but I can't think of any use case where 220 characters is reliably large enough to use.

I am excited to see more options in the open source TTS space, so hopefully that limitation is lifted soon.

l4b4r4b4b4 · 2024-02-15T09:58:15Z

Yeah, I was wondering about X-Payload as well and thought of changing it, but just kept it as is in the end.

As for the length, I think the current implementation expects you to feed the API the text in chunks of at most 220 characters each...
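If client-side chunking is indeed the intended workaround, it could look roughly like the sketch below. This is an assumption about usage, not something the thread confirms; in particular, how the resulting audio segments would be joined is not covered. `chunk_text` is a hypothetical helper, and `MAX_CHARS` mirrors the constant quoted earlier in the thread.

```python
import re

MAX_CHARS = 220  # limit quoted earlier in this thread


def chunk_text(text: str, max_chars: int = MAX_CHARS) -> list[str]:
    """Split text into chunks of at most max_chars, preferring sentence boundaries."""
    chunks: list[str] = []
    current = ""
    # Split after sentence-ending punctuation that is followed by whitespace.
    for sentence in re.split(r"(?<=[.!?])\s+", text.strip()):
        # Fall back to word boundaries for sentences that are too long on their own.
        pieces = [sentence] if len(sentence) <= max_chars else sentence.split()
        for piece in pieces:
            # Hard-split any single word that is longer than the limit.
            while len(piece) > max_chars:
                if current:
                    chunks.append(current)
                    current = ""
                chunks.append(piece[:max_chars])
                piece = piece[max_chars:]
            candidate = f"{current} {piece}" if current else piece
            if len(candidate) <= max_chars:
                current = candidate
            else:
                chunks.append(current)
                current = piece
    if current:
        chunks.append(current)
    return chunks


if __name__ == "__main__":
    article = "This is one sentence of a longer article. " * 12
    for chunk in chunk_text(article):
        print(len(chunk), chunk[:40])
```

Each chunk could then be sent as its own request. Splitting on sentence boundaries first is a design choice meant to keep prosody natural at the seams; mid-sentence splits happen only when a single sentence exceeds the limit.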

sidroopdaska · 2024-02-21T17:22:55Z

> I am excited to see more options in the open source TTS space, so hopefully that limitation is lifted soon.

@coder543, we will lift this limit soon. Synthesising arbitrary lengths of text is on our roadmap after we ship optimisations to reduce inference latency & fine-tuning support. What are you building with TTS today?

sidroopdaska · 2024-02-21T17:23:39Z

> I do not understand why this API is using a very non-standard "X-Payload" header to contain the request body

Will fix this shortly

coder543 · 2024-02-21T17:33:46Z

@sidroopdaska With a good enough synthesized voice, I would enjoy being able to paste in an article and have it read it to me sometimes. So, I was just playing around with that kind of thing.

l4b4r4b4b4 and others added 4 commits February 7, 2024 19:10

Containerized

4edb7ed

containerized

a5bec38

containerized 0.1

cfdccd1

assets

91a0686

sidroopdaska requested changes Feb 9, 2024

l4b4r4b4b4 added 5 commits February 10, 2024 04:49

Update README.md

fdb126a

Signed-off-by: Lucas Hänke de Cansino <lhc@next-boss.eu>

Update serving.py

8992274

Signed-off-by: Lucas Hänke de Cansino <lhc@next-boss.eu>

Update .gitignore

0e3e05f

Signed-off-by: Lucas Hänke de Cansino <lhc@next-boss.eu>

Delete assets/GER_F_SylviaF.flac

8f4fd53

Signed-off-by: Lucas Hänke de Cansino <lhc@next-boss.eu>

Delete assets/barackobamafederalplaza.flac

d502179

Signed-off-by: Lucas Hänke de Cansino <lhc@next-boss.eu>

djmaze mentioned this pull request Feb 14, 2024

I created a working Dockerfile to build it consistently (and on my system)... #48

Closed

djmaze reviewed Feb 15, 2024

djmaze reviewed Feb 15, 2024

djmaze reviewed Feb 15, 2024

djmaze reviewed Feb 16, 2024

l4b4r4b4b4 added 2 commits February 16, 2024 11:27

requested changes

3cfa47c

add flash-attn

0943734

djmaze reviewed Feb 17, 2024

Merge branch 'main' into main

8226a9d

Signed-off-by: Lucas Hänke de Cansino <lhc@next-boss.eu>

Merge remote-tracking branch 'upstream/main'

6137d25

sid and others added 6 commits February 21, 2024 22:45

Merge remote-tracking branch 'upstream/main'

1d1e0e5

update: docker compose with common configs

13ce337

feat: add health check endpoint

87a609c

feat: make services naming terse

50b78cf

feat: reduce health check durations

0d946e3

update: README.md

b66d740

sidroopdaska approved these changes Feb 22, 2024

sidroopdaska merged commit 33cd288 into metavoiceio:main Feb 22, 2024
