Material1276

joined 10 months ago
[–] Material1276@alien.top 1 points 10 months ago

I don't know of any public ones, but they may be out there. I've only seen a few referenced , places like here:

https://www.youtube.com/watch?v=FEOAnDgCD5A

There's a couple of research papers and names mentioned in there. Maybe you can hunt those papers/names in google and see if there are any references to models.

[–] Material1276@alien.top 1 points 10 months ago

I was struggling at first and had that American twang coming through...

But I managed to get a very clear, short clip of an English actor from an interview. There was no background noises, it was very clear. I made sure to clip out any non speech from the start or end of the audio, then saved it as a 22050HZ mono 16bit wav.

That seems to have done it! I get a pretty good representation of the voice and it 99% seems to stay in character with the occasional slight slip.

I also occasionally get a little gibberish, which seems to be when my model is trying to say somehthing like " ' " (which occasionally slips through when its generating text and I look at the backend of whats being sent for audio processing). Im guessing its possible to filter this out with a regex or something.

[–] Material1276@alien.top 1 points 10 months ago

Another consideration is that I was told by someone with multiple cards, that if you split your layers across multiple cards, they don't all process the layers simultaneously.

So, if you are on 3x cards, you don't get a parallel performance benefit of all cards working at the same time. It processes layers on card 1, then card 2, then card 3.

The slowest card will obviously have the worst speed. Not sure what this will do for your load times of a model or your electricity bill, as well as the fact you need a system big enough to fit them all in.

[–] Material1276@alien.top 1 points 10 months ago

Heres a link to a up to date ranking of models for RP. Currently 400+ models ranked.

http://ayumi.m8geil.de/ayumi_bench_v3_results.html