this post was submitted on 28 Nov 2023

Machine Learning

Weird question, I suppose. I've tried Bark (it works, but it hasn't been updated in a while), and Tortoise and Tortoise-Fast (neither installs properly, and neither has been updated in a while either)...

Is there any AI text-to-speech model that is still being updated?

top 4 comments
[–] ludflu@alien.top 1 points 9 months ago

I just started using Silero for a side project and it works pretty nicely. The repo says it was updated last month.

[–] cyborgsnowflake@alien.top 1 points 9 months ago

There is a fork of Tortoise by mrq that works well and could use more participants.

[–] GinjaTurtles@alien.top 1 points 9 months ago

I’ve messed around with a bunch of open-source AI TTS models that I can self-host. Here’s my 2 cents:

  • mrq has a repo where you can fine-tune Tortoise on your own audio samples using a GUI (https://git.ecker.tech/mrq/ai-voice-cloning). There are some good YouTube videos by Jarrod’s Journey about this.
  • If you want some of the best-sounding local TTS, a fine-tuned Tortoise model plus a fine-tuned RVC model is going to give very nice quality.
  • The Tortoise maintainer recently added HiFi-GAN for even faster inference, but I don’t think you can fine-tune this HiFi-GAN model yet, since it’s a custom implementation for Tortoise.
  • One model that sounds incredible, and that I’m going to look into next, is Google SoundStorm. I believe a few people have implemented an open-source PyTorch SoundStorm model on GitHub.
  • I’m not sure how good a fine-tuned version of SoundStorm would be, but this is what I’m going to try out next when I have time (work sucks).
[–] epic_awesome@alien.top 1 points 9 months ago

Coqui TTS is always updated.