this post was submitted on 28 Mar 2025
-7 points (38.7% liked)

Google has finally arrived

Some observations on the model

  • Gemini 2.5 Pro is an absolute beast at coding, perhaps the best model right now
  • They spent all the computing resources on training it on coding data and forgot to give it a distinct personality
  • Doesn't reason as well as Grok 3 (Think) or Claude 3.7 Sonnet (thinking)
  • On par with o3-mini-high in general mathematics

If you're a coder, you'll absolutely love it; otherwise, you'll be fine with other frontier reasoning models (DeepSeek R1, if you ask me).

top 4 comments
[–] heavydust@sh.itjust.works 1 points 4 days ago (1 children)

If you’re a coder, you’ll absolutely love it

Please show me an example of C++ refactoring on a real application, not yet another ReactJS template.

[–] sunilkumardash9@lemmy.world 1 points 4 days ago* (last edited 4 days ago) (1 children)

Have you tried doing the same?

[–] heavydust@sh.itjust.works 4 points 4 days ago

Every month. It's hallucinating APIs and language features.

[–] simple@lemm.ee 1 points 4 days ago

I've been using it and people are sleeping on it. It's easily the best LLM on the market right now, even if you're not using it for coding. It has very good reasoning skills, and it doesn't have the issues other reasoning models do, where they overthink or keep saying "but wait" and muddle their own output.