farmingvillein

joined 1 year ago
[–] farmingvillein@alien.top 1 points 11 months ago

Sounds stupid and reductionist, but I'd start with doing speech-to-text and then run a small # of examples through 3.5-turbo & GPT-4, asking it to annotate where a laugh track should be added.

Good chance that it'll do a pretty decent job, with some careful prompting.

Then, based on cost requirements, you can try collecting some labels and fine-tuning a model like Mistral (which you could also just try upfront as well).

[–] farmingvillein@alien.top 1 points 11 months ago (1 children)

I think OP is responding to (without commenting on correctness...)

At some point, if coding is good enough, it might be able to write its own better LLM system.

[–] farmingvillein@alien.top 1 points 1 year ago

what is happening to this sub