TheBlackLounge

joined 11 months ago
[–] TheBlackLounge@lemm.ee 1 points 2 weeks ago

You need an editor for traditional transcription tools too :) and it's A LOT more work. They don't even do punctuation or names.

[–] TheBlackLounge@lemm.ee 2 points 2 weeks ago

I use it for generating subtitles. It figures out context, it ignores stuttering, it does punctuation etc. It's really is just better. With clean audio it transcribes like a human does.

It does better than other techniques with dirty audio, but when it fails it fails weird, which is the big issue here.

[–] TheBlackLounge@lemm.ee 22 points 2 weeks ago (5 children)

Whisper really is a lot better when it works, and it's free. The problem is that it refuses to produce gibberish or give up when it doesn't work. You'll always need an editor.

[–] TheBlackLounge@lemm.ee 1 points 2 weeks ago* (last edited 2 weeks ago)

The architecture changed, there is still progress to be made there. But LLMs will forever be stuck in 2021, all data afterwards is tainted. Not a lot has been added.

In fact, Whisper was developed to transcribe videos for more training data, because they ran out of text data. These bad transcriptions are in newer models.

[–] TheBlackLounge@lemm.ee 25 points 2 weeks ago (1 children)

It's actually extremely good at figuring out confusing text. It gets weird when the audio quality is bad.

I use it for generating subs for obscure movies.

[–] TheBlackLounge@lemm.ee 6 points 1 month ago

Something very basic and transparent like lemmy's 'scaled', sure

[–] TheBlackLounge@lemm.ee 167 points 1 month ago

Transphobe mad about people deadnaming his company

[–] TheBlackLounge@lemm.ee 15 points 3 months ago* (last edited 3 months ago)

Just this past year

  • Builtin offline translation
  • A pdf editor
  • Firefox View
  • And of course a whole bunch of privacy, security, performance, and developer features
[–] TheBlackLounge@lemm.ee 5 points 3 months ago

Artificial intelligence is intelligent like artificial grass is grass. That's how the word artificial works. It just means man-made, says nothing about quality.

[–] TheBlackLounge@lemm.ee 2 points 5 months ago

It used to mean all generated output though. Calling only mistakes hallucinations is new, definitely because of hype.

[–] TheBlackLounge@lemm.ee 7 points 5 months ago (2 children)

So is bullshitting. More so, only human minds can bullshit.

We anthropomorphize machines all the time, it's fine.

I'd prefer we'd start calling all genai output hallucinations again. It used to be like 10 years ago, but somewhere along the line marketing decided hallucinated truths aren't "hallucinations".

view more: next ›