this post was submitted on 30 Nov 2023
1 points (100.0% liked)

LocalLLaMA

3 readers
1 users here now

Community to discuss about Llama, the family of large language models created by Meta AI.

founded 1 year ago
MODERATORS
 

I just recently started playing with Coqui XTTS and I have to say my results have been horrid. I am familiar with 11labs, and have always had great results. My background is originally in audio/video production, so I am very capable of giving it whatever exact formats it needs, however my results so far sound NOTHING like the source material. Very robotic, very distorted. I am assume from all the gushing I have seen regarding this tool that it must be user error. Currently I am just using it as a extension on Oobabooga as that was the easiest way to get it up with a UI. Please let me know any tips and tricks you guys have learned! Thank you!

Current workflow:
Record in Adobe Audition
24bit, sample rate 22050
WAV Format

you are viewing a single comment's thread
view the rest of the comments
[โ€“] ----Val----@alien.top 1 points 11 months ago (1 children)

Check which model you are using. The latest 2.0.3 XTTSv2 is really wonky. Manually revert it to 2.0.2.

[โ€“] aallsbury@alien.top 1 points 11 months ago

Do you know an easy way to revert using the Oobabooga extension? Thanks!