Machine Learning

1 readers

1 users here now

Community Rules:

Be nice. No offensive behavior, insults or attacks: we encourage a diverse community in which members feel safe and have a voice.
Make your post clear and comprehensive: posts that lack insight or effort will be removed. (ex: questions which are easily googled)
Beginner or career related questions go elsewhere. This community is focused in discussion of research and new projects that advance the state-of-the-art.
Limit self-promotion. Comments and posts should be first and foremost about topics of interest to ML observers and practitioners. Limited self-promotion is tolerated, but the sub is not here as merely a source for free advertisement. Such posts will be removed at the discretion of the mods.

founded 2 years ago

MODERATORS

communick@academy.garden

[P]Coqui released XTTSv2 (alien.top)

submitted 2 years ago by coinfelix@alien.top to c/machinelearning@academy.garden

4 comments fedilink hide all child comments

XTTSv2 is released. I’d say it’s a big jump in quality.

Better voice cloning
Better audio
Impressive prosody and expressiveness
Added more languages, I guess total 16 languages.
Non-EN languages sounds way better
Streaming under 200ms ( I have 3090)
Finetuning code

Here you can try https://huggingface.co/spaces/coqui/xtts

top 4 comments

sorted by: hot top controversial new old

[–] satireplusplus@alien.top 1 points 2 years ago

incredible

[–] m-pana@alien.top 1 points 2 years ago

Does anyone know if there is a detailed model description somewhere? They don't seem to have a full technical report anywhere and the documentation just describes the model API.

[–] tonyabracadabra@alien.top 1 points 2 years ago

What is the best infra to deploy the API?

[–] tonyabracadabra@alien.top 1 points 2 years ago

What is the best way to deploy it as an API?