this post was submitted on 14 Jun 2026
907 points (99.6% liked)

People Twitter

10095 readers
215 users here now

People tweeting stuff. We allow tweets from anyone.

RULES:

  1. Mark NSFW content.
  2. No doxxing people.
  3. Must be a pic of the tweet or similar. No direct links to the tweet.
  4. No bullying or international politics.
  5. Be excellent to each other.
  6. Provide an archived link to the tweet (or similar) being shown if it's a major figure or a politician. Archive.is is the best way.

founded 3 years ago
[–] errer@lemmy.world 13 points 1 week ago (3 children)

I mean, the poster above you is wrong: they use math tools internally now when you ask math questions. It’s very obvious in Gemini. Yes, the raw LLM trying to autocomplete the answer to a math problem is gonna be wrong, but that’s not how they’re used to solve problems like that anymore.

[–] sbv@sh.itjust.works 8 points 1 week ago

The LLM has to choose to use the calculating tools. Gemini tried to do this one solo:

4 + 2 + 2 + 2 + 1+ 2 + 0 = 15

Tbf, it did four of these calculations, and 75% were correct.
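
For anyone who wants to check the sum quoted above, here's a quick sketch. It only covers that one calculation (the other three weren't posted), and the variable names are just made up for illustration:

```python
# Terms exactly as quoted in the comment; 15 is the total the model reported.
terms = [4, 2, 2, 2, 1, 2, 0]
claimed_total = 15

# Let the interpreter do the arithmetic instead of predicting it.
actual_total = sum(terms)

print(actual_total)                   # 13
print(actual_total == claimed_total)  # False
```

So the quoted answer is off by 2, which is the kind of slip that handing the arithmetic to a tool avoids.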

[–] baines@lemmy.cafe 5 points 1 week ago

no way i’d want to drive on a bridge built on their supposed math

[–] wonderingwanderer@sopuli.xyz 2 points 1 week ago

That makes sense. I clearly don't keep up with the frontier models...