this post was submitted on 07 Jun 2026

121 points (83.8% liked)

Technology

86580 readers

4584 users here now

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related news or articles.
Be excellent to each other!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
Check for duplicates before posting, duplicates may be removed
Accounts 7 days and younger will have their posts automatically removed.

Approved Bots

founded 3 years ago

MODERATORS

L3s@lemmy.world

enu@lemmy.world

technopagan@lemmy.world

L4s@lemmy.world

L3s@hackingne.ws

121

AI Agent Uncovers 21 Zero-Days in FFmpeg; Chrome Patches Record 429 Bugs (thehackernews.com)

submitted 1 month ago by themachinestops@lemmy.dbzer0.com to c/technology@lemmy.world

39 comments fedilink hide all child comments

top 39 comments

sorted by: hot top controversial new old

[–] Mondez@lemdro.id 111 points 1 month ago (4 children)

What these articles never say is how many hallucinated bugs the LLM found that either weren't real or were actually exploitable. The LLM didn't find these with any confidence it highlighted areas of interest that actual security researchers then needed to investigate and confirm or rule out.

[–] hard_zero1@discuss.tchncs.de 18 points 1 month ago* (last edited 1 month ago) (1 children)

In the article it says the ffmpeg vulns were found by an "autonomous" agent and that it produced a proof-of-concept for each. So what do you base your claims on? They seem quite contrary to that.

Even if there was still a lot of human work involved, it seems that the LLM-Agents can help a lot with security research, as the number of (real) zero-days that are beeing found recently (with the help of AI) seems to spike (telling from what I read, e.g. here on Lemmy, or the number of security updates for my distro).

[–] Mondez@lemdro.id 5 points 1 month ago (1 children)

It's states they were produced which I'm taking to mean found and it's ambigously worded so it's unclear if the article is actually claiming it generated PoC for all of them. It doesn't say how many if any hallucinated results were produced or how much effort was needed to fully confirm, basically down played the human involvement.

It's great that so many bugs are being found and squashed but it's implied LLMs are doing all the work when what they are actually doing is assisting as a tool.

[–] hard_zero1@discuss.tchncs.de 5 points 1 month ago* (last edited 1 month ago)

I agree that the wording is a bit ambiguous, I interpreted it the way it seems more natural to me. In the post by the researcher(s) themselves, it says in the tldr paragraph that the "agent produces concrete, reproducible PoC inputs to confirm its findings" but also that they (probably humans) "explored the exploitability of the issues and developed a PoC demonstrating a RCE exploit primitive". Apparently it finds the vulnerabilities very concretely but humans were involved for the full-blown exploit. It also doesn't say much about the number of false-positives.

I'm not in the business, so I can't tell how much of the work such agents are actually saving. Since the articles don't say much about the amount of human involvement, the imagination conveyed by them probably depends strongly on the (knowledge of the) reader. But in my opinion it is a bit of stretch to say this is downplaying it. It should be noted though, that the article probably sources its information from a post by the company selling that AI.

With that information, the "without any confidence" and "area of interest" parts of your previous post still seem misleading.

[–] Cocodapuf@lemmy.world 3 points 1 month ago (1 children)

What these articles never say is how many hallucinated bugs the LLM found that either weren't real or were actually exploitable.

It literally wouldn't matter if it did.

The fact that it found exploitable bugs means that these bugs need to be addressed. To be clear, I care much more about the security flaws and fixing them than how they were discovered.

[–] wholookshere@lemmy.blahaj.zone 10 points 1 month ago (1 children)

I feel like you missed the forest for the trees.

The question is how many were made up?

[–] Cocodapuf@lemmy.world -2 points 1 month ago (1 children)

I saw that, and you're right, I wasn't answering that question. What I was saying was that I thought the question was irrelevant and ignoring a bigger issue.

[–] wholookshere@lemmy.blahaj.zone 3 points 1 month ago

I disagree that its ignoring the bigger problem, which is that slop like this is overwhelming devs to get fixes out ASAP faster than they can fix.

So now we have AI big reports feeding AI big fixes in a lot of projects.

The assumption that what AI finds is correct in the first place is.... Probably wrong.

It makes stuff up all the bloody time, so how many of these bugs were made up, or not actually bugs?

[+] hard_zero1@discuss.tchncs.de -16 points 1 month ago* (last edited 1 month ago) (2 children)

I'm frankly quite annoyed by the amount and extend of anti-AI hate in this community. It almost seems like this is a pure anti-AI rage community. The capabilities and the utility of LLMs are basically always denied or downplayed as much as possible without running into obvious contradictions.

It would be so nice to have some differentiated and insightful discussions here, about what can or cannot be done with AI, positive and negative impacts it has, possible new use cases, how AI should or should not be used, how the overall benefits of AI can be maximized and the overall negative effects minimized, what the world with AI could be like in the future, ...

[–] Mondez@lemdro.id 11 points 1 month ago (1 children)

I was trying to have some insightful discussion on the actual capability of LLM which is difficult when the involvement of the human element is played down amd the role of the LLM is played up to feed the hype machine. It's hard to acknowledge the real capabilities and weaknesses when the capabilities are always over reported and the weaknesses down played or denied.

It's great that so many bugs are getting discovered but as I say there is no reporting on what effort was needed to sift and review the LLM output or how functional or understandable any PoC were... The article doesn't directly even state the PoC were directly produced by the LLM and reads very ambigously.

[–] hard_zero1@discuss.tchncs.de 0 points 1 month ago

I think some of that is because the reporting is focused on the new stuff, that was previously not possible. That human work is involved and some of the weaknesses are not really new. But also because the information in this case comes from a company that wants to sell their AI. I agree that the reporting is probably biased and not really sharp and therefore limited in usefulness.

Also, my (second) comment was not specifically about your comment but generally about the "vibe" of this community

[–] GnuLinuxDude@lemmy.ml 5 points 1 month ago (1 children)

what the world with AI could be like im the future

Imagine this trend line increasing a bit more rapidly https://en.wikipedia.org/wiki/Carbon_dioxide_in_the_atmosphere_of_Earth

[–] hard_zero1@discuss.tchncs.de 0 points 1 month ago* (last edited 1 month ago) (1 children)

CO2 emissions are a huge problem, of course. But it is not specific to AI. Data centers are starting to become a significant factor of energy consumption but I think it will stay very manageable compared to other consumers and given the utility it provides. And since data centers luckily natively require electricity, it is much easier, compared to e.g. transportation, to switch them to renewables. And renewables are very often already the cheapest source of energy anyways. So I think AI is just another thing that humans do that requires energy, and it comes with the same tradeoffs (the utility vs. the cost of sourcing that energy). So in my opinion we should mainly focus on accelerating the transition to green energy.

Here's a good overview about AI carbon emissions I just found: https://www.carbonbrief.org/ai-five-charts-that-put-data-centre-energy-use-and-emissions-into-context/

[–] Dyskolos@lemmy.zip 1 points 1 month ago

I expected it to be worse. And the org seems trustworthy. Though that's a tough topic to trust any source, considering the amount of money involved.

[–] greyscale@lemmy.grey.ooo 40 points 1 month ago (9 children)

What the fuck is a zero day in the context of ffmpeg?

Its not like its a system service that you can get ingress through..

"AI found 21 bugs in massive video project" sounds like junior developer shit hungry to get some shit on their resume.

Even if it wasn't AI slop, this wouldn't be impressive.

[–] VibeSurgeon@piefed.social 38 points 1 month ago (1 children)

Its not like its a system service that you can get ingress through..

With a competently crafted payload, you could perhaps get in via someone's transcoding pipeline.

[–] greyscale@lemmy.grey.ooo 4 points 1 month ago (2 children)

Does nobody isolate ffmpeg and friends from their application?

I can't imagine you'd have much fun breaking into a container that terminates the moment the original ffmpeg stops, or over-runs its max execution time..

[–] panda_abyss@lemmy.ca 21 points 1 month ago (1 children)

Container escapes do exist, and they have shared kernel with the host

[–] Passerby6497@lemmy.world 5 points 1 month ago

If you're running rootless containers, it's less of a concern. I'm trying to move all of my public containers to podman for this reason

[–] VibeSurgeon@piefed.social 1 points 1 month ago

Sure, you'd need a second exploit to escalate from there.

ffmpeg is expected to run for extended periods of time, given its use in transcoding.

[–] karlhungus@lemmy.ca 28 points 1 month ago (1 children)

My understanding is that ffmpeg is the bedrock that all video streaming services use. I'm suspicious it's a bigger deal than you think

[–] filcuk@lemmy.zip 4 points 1 month ago

Don't forget various conversion services, which includes photo cloud backups.

[–] Zarxrax@lemmy.world 22 points 1 month ago

From the article

Most are heap or stack overflows in parsers and demuxers, spanning components from the TS demuxer to the VP9 decoder. depthfirst says some already carry CVE identifiers; its writeup lists nine, CVE-2026-39210 through CVE-2026-39218, and notes the rest are fixed but not yet numbered. It also published a PoC.

[–] tomalley8342@lemmy.world 17 points 1 month ago

I imagine there are many web services around the world which use ffmpeg to handle user submitted content.

[–] fonix232@fedia.io 14 points 1 month ago

Tell me you know nothing about the intricacies of media playback (especially hardware accelerated), without telling me...

[–] gandalf_der_12te@feddit.org 11 points 1 month ago (1 children)

if you upload any image on lemmy, it goes through ffmpeg to be converted into a webp file.

now, you can feed arbitrary input to that ffmpeg process

[–] null@lemmy.zip 1 points 1 month ago (2 children)

I never thought of doing that with ffmpeg. Why ffmpeg, instead of imagmagick?

[–] gandalf_der_12te@feddit.org 11 points 1 month ago

ffmpeg is on any system, has a consistent user interface for many different conversions

[–] 8uurg@lemmy.world 5 points 1 month ago* (last edited 1 month ago)

It just so happens that many video codecs are based on image formats, so ffmpeg already has a lot of the complex machinery to do so available to also implement these image formats - internally it can just handle it as a single frame of video with specialized formats for that.

Imagemagick (and other tools) also work, but why use multiple pieces of software if what you already have is adequate? ImageMagick is also software, and can also have vurnabilities.

[–] nitroemdash@lemmy.wtf 2 points 1 month ago (1 children)

FFMPEG in the command line generally has permission to access the entire non-sudo filesystem and delete files.

[–] greyscale@lemmy.grey.ooo 1 points 1 month ago (1 children)

Yes but why are we allowing user input to be fed to an executable in that environment?

[–] LodeMike@lemmy.today 4 points 1 month ago

This is the environment that almost all user software is executed.

[–] Jason2357@lemmy.ca 0 points 1 month ago

Indeed. AI agents cannot "discover" zero days, they can only discover vulnerabilities. The humans can create zero days by reporting the vulnerability in the media without disclosing it properly.

[–] bcnelson@lemmy.world 11 points 1 month ago

The term 0-day vulnerability in security has a specific meaning. It means that it was not found by security researchers and is instead being actively exploited on the day that it's discovered. I'm pretty sure none of these by definition or zero days

[–] mecen@lemmy.ca 1 points 1 month ago

"Chrome's record landed after Google overhauled its bounty program to cope with a flood of AI-generated reports."

How did they change it?