this post was submitted on 06 Jan 2026

388 points (97.1% liked)

Programming

24338 readers

328 users here now

Welcome to the main community in programming.dev! Feel free to post anything relating to programming here!

Cross posting is strongly encouraged in the instance. If you feel your post or another person's post makes sense in another community cross post into it.

Hope you enjoy the instance!

Rules

Follow the programming.dev instance rules
Keep content related to programming in some way
If you're posting long videos try to add in some form of tldr for those who don't want to watch videos

Wormhole

Follow the wormhole through a path of communities !webdev@programming.dev

founded 2 years ago

MODERATORS

snowe@programming.dev

Ategon@programming.dev

MaungaHikoi@lemmy.nz

UlrikHD@programming.dev

388

Experienced software developers assumed AI would save them a chunk of time. But in one experiment, their tasks took 20% longer (fortune.com)

submitted 4 days ago by codeinabox@programming.dev to c/programming@programming.dev

102 comments fedilink hide all child comments

you are viewing a single comment's thread
view the rest of the comments

[–] BradleyUffner@lemmy.world 90 points 4 days ago* (last edited 4 days ago) (3 children)

... That keeps making the same mistakes over and over again because it never actually learns from what you try to teach it.

[–] zaphod@sopuli.xyz 48 points 4 days ago (3 children)

Yep, the junior is capable of learning.

[–] InternetCitizen2@lemmy.world 18 points 4 days ago (1 children)

Wait till I get hired as junior

[–] Clent@lemmy.dbzer0.com 0 points 3 days ago

Yeah, not all people who enter the industry should be doing so.

Most of this was boomers being boomers and claiming anyone and everyone should code.

[–] aport@programming.dev 3 points 3 days ago

My job believes the solution to this is a 7,000 line agents.md file

[–] 30p87@feddit.org 4 points 4 days ago

Sometimes. And if they're not, they'll be replaced or replace themselves.

[+] VoterFrog@lemmy.world -9 points 4 days ago (4 children)

This is not really true.

The way you teach an LLM, outside of training your own, is with rules files and MCP tools. Record your architectural constraints, favored dependencies, and style guide information in your rule files and the output you get is going to be vastly improved. Give the agent access to more information with MCP tools and it will make more informed decisions. Update them whenever you run into issues and the vast majority of your repeated problems will be resolved.

[–] UnspecificGravity@piefed.social 18 points 4 days ago* (last edited 4 days ago) (2 children)

Well, that's what they say, but then it doesn't actually work, and even if it did it's not any easier or cheaper than teaching humans to do it.

More to the point, that is exactly what the people in this study were doing.

[–] Clent@lemmy.dbzer0.com 1 points 3 days ago

If it's doesn't work for you, it's because you're a failure!

Still not convinced these LLM bros aren't junior developers (at best) who someone gave a senior title to because everyone else left their shit hole company.

[–] VoterFrog@lemmy.world -1 points 4 days ago* (last edited 4 days ago) (1 children)

More to the point, that is exactly what the people in this study were doing.

They don't really do into a lot of detail about what they were doing. But they have a table on limitations of the study that would indicate it is not.

We do not provide evidence that: There are not ways of using existing AI systems more effectively to achieve positive speedup in our exact setting. Cursor does not sample many tokens from LLMs, it may not use optimal prompting/scaffolding, and domain/repository-specific training/finetuning/few-shot learning could yield positive speedup.

Back to this:

even if it did it’s not any easier or cheaper than teaching humans to do it.

In my experience, the kinds of information that an AI needs to do its job effectively has a significant overlap with the info humans need when just starting on a project. The biggest problem for onboarding is typically poor or outdated internal documentation. Fix that for your humans and you have it for your LLMs at no extra cost. Use an LLM to convert your docs into rules files and to keep them up to date.

[–] UnspecificGravity@piefed.social 11 points 4 days ago (1 children)

Your argument depends entirely on the assumption that you know more about using AI to support coding than the experienced devs that participated in this study. You want to support that claim with more than a "trust me, bro"?

[–] VoterFrog@lemmy.world 0 points 4 days ago (1 children)

Do you think that like nobody has access to AI or something? These guys are the ultimate authorities on AI usage? I won't claim to be but I am a 15 YOE dev working with AI right now and I've found the quality is a lot better with better rules and context.

And, ultimately, I don't really care if you believe me or not. I'm not here to sell you anything. Don't use it the tools, doesn't matter to me. Anybody else who does use them, give my advice a try an see if it helps you.

[–] UnspecificGravity@piefed.social 2 points 4 days ago (1 children)

These guys all said the same thing before they participated in a study that proved that they were less efficient than their peers.

[–] VoterFrog@lemmy.world 0 points 4 days ago

Again, read and understand the limitations of the study. Just the portion I quoted you alone is enough to show you that you're leaning way too heavily on conclusions that they don't even claim to provide evidence for.

[–] aport@programming.dev 2 points 3 days ago (1 children)

Codex literally lies about being connected to configured MCP servers.

[–] VoterFrog@lemmy.world -2 points 3 days ago (1 children)

Are you trying to make a point that agents can't use MCP based off of a picture of a tweet you saw or something?

[–] aport@programming.dev 2 points 3 days ago

I'm talking from my personal, daily experience using codex.

[–] raspberriesareyummy@lemmy.world 2 points 4 days ago

That is a moronic take. You would be better off learning to structure your approach to SW development than trying to learn how to use a glorified slop machine to plagiarize other people's works.

[–] criss_cross@lemmy.world 0 points 3 days ago

In theory yes.

In practice I find the more stuff like this you throw at it the more rope it has to hang itself with. And you spend so much time prompt adjusting so it doesn’t do the wrong things that you were better off just doing half of the tasks yourself.

[+] plantfanatic@sh.itjust.works -28 points 4 days ago (2 children)

This is why you use a downloaded llm and customize it, there’s ways to fix these issues.

[–] BradleyUffner@lemmy.world 25 points 4 days ago* (last edited 4 days ago) (2 children)

Unless you are retraining the model locally at your 23 acre data center in your garage after every interaction, it's still not learning anything. You are just dumping more data in to its temporary context.

[+] plantfanatic@sh.itjust.works -9 points 4 days ago* (last edited 4 days ago) (1 children)

What part of customize did you not understand?

And lots fit on personal computers dude, do you even know what different llms there are…?

One for programming doesn’t need all the fluff of books and art, so now it’s a manageable size. Llms are customizable to any degree, use your own data library for the context data even!

[–] BradleyUffner@lemmy.world 11 points 4 days ago* (last edited 4 days ago) (2 children)

What part about how LLMs actually work do you not understand?

"Customizing" is just dumping more data in to it's context. You can't actually change the root behavior of an LLM without rebuilding it's model.

[–] plantfanatic@sh.itjust.works -5 points 4 days ago* (last edited 4 days ago) (2 children)

"Customizing" is just dumping more data in to it's context.

Yes, which would fix the incorrect coding issues. It’s not an llm issue, it’s too much data. Or remove the context causing that issue. These require a little legwork and knowledge to make useful. Like anything else.

You really don’t know how these work do you?

[–] BradleyUffner@lemmy.world 8 points 4 days ago* (last edited 4 days ago) (1 children)

You do understand that the model weights and the context are not the same thing right? They operate completely differently and have different purposes.

Trying to change the model's behavior using instructions in the context is going to fail. That's like trying to change how a word processor works by typing in to the document. Sure, you can kind of get the formatting you want if you manhandle the data, but you haven't changed how the application works.

[–] SchmidtGenetics@lemmy.world -2 points 4 days ago (1 children)

Why are you so focused on just the training? The data is ALSO the issue.

Of course if you ignore one fix, that works, of course you can only cry it’s not fixable.

But it is.

[–] BradleyUffner@lemmy.world 7 points 4 days ago* (last edited 4 days ago) (1 children)

Why are you so focused on just the training?

Because I work with LLMs daily. I understand how they work. No matter how much I type at an LLM, its behavior will never fundamentally change without regenerating the model. It never learns anything from the content of the context.

The model is the LLM. The context is the document of a word processor.

A Jr developer will actually learn and grow in to a Sr developer and will retain that knowledge as they move from job to job. That is fundamentally different from how an LLM works.

I'm not anti-AI. I'm not "crying" about their issues. I'm just discussing the from a practical standpoint.

LLMs do not learn.

[–] SchmidtGenetics@lemmy.world -5 points 4 days ago* (last edited 4 days ago) (1 children)

Because I work with LLMs daily. I understand how they work.

Clearly you don’t, because context data modifies how the training data extrapolates.

You can use something, while not being educated on how to use it. And just using something does not mean you understand how they work. Your comments have made it QUITE clear that you have no idea.

People who just whing about AI and pretend they know how they work are the worst kind of people right now.

[–] BradleyUffner@lemmy.world 7 points 4 days ago* (last edited 4 days ago)

Your comments have made it QUITE clear that you have no idea.

Odd, I can say the exact same thing about your comments on the subject.

We are clearly at an impasse that won't be solved through this discussion.

[–] tja@sh.itjust.works 7 points 4 days ago (1 children)

But

All the fluff from books and art

Is not inside the context, that comes from training. So you know how an llm works?

[–] moomoomoo309@programming.dev 9 points 4 days ago* (last edited 4 days ago)

Yeah, but LLMs still consistently don't follow all rules they're given, they randomly will not follow one or more with no indication they did so, so you can't really fix these issues consistently, just most of the time.

Edit: to put this a little more clearly after a bit more thought: It's not even necessarily a problem that it doesn't always follow rules, it's more so a problem that when it doesn't follow the rules, there's no indication it did so. If it had that, it would actually be fine!