So I saw the post, "Hugging Face Removes Singing AI Models of Xi Jinping But Not of Biden" and I was curious..
How does one set up a singing model (or a speaking model that can copy other people)?
Is it just TTS and fine tuning the settings of pitch, tone, etc or is there a program that takes a description of the voice and uses a model to make it?
How does one dive into this kind of AI stuff on a home system?
So.. Some censoring?