LocalLLaMA

3 readers

1 users here now

Community to discuss about Llama, the family of large language models created by Meta AI.

founded 1 year ago

MODERATORS

communick@poweruser.forum

Voice Models? I have no idea where to start with that. (alien.top)

submitted 11 months ago by Lance_lake@alien.top to c/localllama@poweruser.forum

2 comments fedilink hide all child comments

So I saw the post, "Hugging Face Removes Singing AI Models of Xi Jinping But Not of Biden" and I was curious..

How does one set up a singing model (or a speaking model that can copy other people)?

Is it just TTS and fine tuning the settings of pitch, tone, etc or is there a program that takes a description of the voice and uses a model to make it?

How does one dive into this kind of AI stuff on a home system?

you are viewing a single comment's thread
view the rest of the comments

[–] a_beautiful_rhind@alien.top 1 points 11 months ago

so-vits-svc has an interface you can do inference on static files or live voice. I only trained RVC and not a so-vits yet, but it's very good with decent audio. I have tried it with other peoples models.

To do a song and put it on youtube, you will still need to know some audio engineering.