this post was submitted on 10 Nov 2023
1 points (100.0% liked)

Machine Learning

1 readers
1 users here now

Community Rules:

founded 1 year ago
MODERATORS
 

XTTSv2 is released. I’d say it’s a big jump in quality.

  • Better voice cloning
  • Better audio
  • Impressive prosody and expressiveness
  • Added more languages, I guess total 16 languages.
  • Non-EN languages sounds way better
  • Streaming under 200ms ( I have 3090)
  • Finetuning code

Here you can try https://huggingface.co/spaces/coqui/xtts

top 4 comments
sorted by: hot top controversial new old
[–] satireplusplus@alien.top 1 points 1 year ago
[–] m-pana@alien.top 1 points 1 year ago

Does anyone know if there is a detailed model description somewhere? They don't seem to have a full technical report anywhere and the documentation just describes the model API.

[–] tonyabracadabra@alien.top 1 points 11 months ago

What is the best infra to deploy the API?

[–] tonyabracadabra@alien.top 1 points 11 months ago

What is the best way to deploy it as an API?