so-vits-svc has an interface you can do inference on static files or live voice. I only trained RVC and not a so-vits yet, but it's very good with decent audio. I have tried it with other peoples models.
To do a song and put it on youtube, you will still need to know some audio engineering.