this post was submitted on 03 May 2024
856 points (97.7% liked)
Technology
59427 readers
4177 users here now
This is a most excellent place for technology news and articles.
Our Rules
- Follow the lemmy.world rules.
- Only tech related content.
- Be excellent to each another!
- Mod approved content bots can post up to 10 articles per day.
- Threads asking for personal tech support may be deleted.
- Politics threads may be removed.
- No memes allowed as posts, OK to post as comments.
- Only approved bots from the list below, to ask if your bot can be added please contact us.
- Check for duplicates before posting, duplicates may be removed
Approved Bots
founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
1% correct is never "fairly high" wtf
Also if you want a computer that you don't have to double check, you literally are expecting software to embody the concept of God. This is fucking stupid.
It's all about context. Asking a bunch of 4 year olds questions about trigonometry, 1% of answers being correct would be fairly high. 'Fairly high' basically only means 'as high as expected' or 'higher than expected'.
Hence, it is useless. If I cannot expect it to be more or less always correct, I can skip using it and just look stuff up myself.
Obviously the only contexts that would apply here are ones where you expect a correct answer. Why would we be evaluating a software that claims to be helpful against 4 year old asked to do calculus? I have to question your ability to reason for insinuating this.
So confirmed. God or nothing. Why don't you go back to quills? Computers cannot read your mind and write this message automatically, hence they are useless
That's the whole point, I don't expect correct answers. Neither from a 4 year old nor from a probabilistic language model.
And you don't expect a correct answer because it isn't 100% of the time. Some lemmings are basically just clones of Sheldon Cooper
I don't expect a correct answer because I've used these models quite a lot last year. At least half the answers were hallucinated. And it's still a common complaint about this product as well if you look at actual reviews (e.g., pretty sure Marques Brownlee mentions it).
Like most people, I have no interest in engaging in conversation with someone who gives me zero reason to.
Not that it's any of your business, but quality matters to me more than anything else, which is why I like tools that help me deliver it
Not reading anything else you write