this post was submitted on 29 Oct 2023
1 points (100.0% liked)

Machine Learning

1 readers
1 users here now

Community Rules:

founded 1 year ago
MODERATORS
 

This post is to ask for help regarding a personal project of mine.

So as a heads up, I'm very new to Machine Learning. I mostly a engaged in development stuff. But recently I took on a project where I have to convert text to lip-synced video file.

I need to first generate a WAV file from text. For that, Im looking for a TTS software. I just want a somewhat human-like voice for my project so I am not looking for a very high-quality voice.

I tried to use Tortoise TTS but I failed during the installation process and I can't find a good enough tutorial I can follow. Also, it seems Tortoise and many other AI tools work with a NVIDIA GPU which I don't have (I got a system with AMD integrated graphics). So does anyone have a tutorial or suggestion how to install tortoise?

Or do you have any suggestion for any other TTS to use?

you are viewing a single comment's thread
view the rest of the comments
[–] mr_birrd@alien.top 1 points 1 year ago

You can use the VITS TTS model