this post was submitted on 17 Aug 2024
103 points (99.0% liked)

top 18 comments
[–] hernanca@beehaw.org 45 points 1 month ago (2 children)

Why not go full data nihilist and say that every file is just a natural number expressed in binary?
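For anyone who wants to see that literally, here is a minimal Python sketch of the round trip between a file's bytes and a single natural number. The function names are made up for illustration, and the leading 0x01 marker byte is just one assumed way to keep leading zero bytes from getting lost; nothing here comes from a real tool.

```python
def file_bytes_to_number(data: bytes) -> int:
    """Interpret a file's bytes as one big natural number (hypothetical helper)."""
    # Prefix a 0x01 marker byte so leading zero bytes survive the round trip.
    return int.from_bytes(b"\x01" + data, byteorder="big")


def number_to_file_bytes(n: int) -> bytes:
    """Invert file_bytes_to_number (hypothetical helper)."""
    length = (n.bit_length() + 7) // 8
    raw = n.to_bytes(length, byteorder="big")
    # A valid encoding always starts with the 0x01 marker byte.
    if not raw or raw[0] != 1:
        raise ValueError("number was not produced by file_bytes_to_number")
    return raw[1:]


if __name__ == "__main__":
    number = file_bytes_to_number(b"hi")
    print(number)                        # the "file" as a single integer
    print(number_to_file_bytes(number))  # and back to bytes again
```

The same two calls work on the contents of any file read in binary mode, so in that narrow sense every file really is one (very large) number.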

[–] rickyrigatoni@lemm.ee 4 points 1 month ago

Yeah but I legally own this particular number >:E

[–] Daxtron2@startrek.website 3 points 1 month ago

I'm good with that

[–] Steve 38 points 1 month ago (2 children)

What about copyrighted code?
Like for instance, GPU drivers?

[–] FaceDeer@fedia.io 21 points 1 month ago

Yes, that would also be statistical correlations to an AI model. The specific kind of information they're being trained on doesn't affect the underlying mechanism of model training.

[–] prex@aussie.zone 8 points 1 month ago

I mentioned it before:

If they use any GPL code for their model then any output would be a derived work and a violation of the GPL.

[–] todd_bonzalez@lemm.ee 35 points 1 month ago (1 children)

Torrents are just statistical correlations to our torrent clients.

[–] Thann@lemmy.ml 18 points 1 month ago

Siri, create a random dataset in the likeness of Peter Jackson's The Lord of the Rings: The Return of the King, director's cut, on DVD.

[–] kibiz0r@midwest.social 34 points 1 month ago (2 children)

Aren’t MP3s just a statistical correlation?

Besides, you really don’t need to zoom in on “but muh license agreement” to roast these AI turds.

They’re very clear: We’re gonna put creatives out of work, we’re gonna sell a unified product to replace them, and we’re gonna use their own labor to build their replacements.

That’s anticompetitive.

Nail 'em on that instead of trying to thread the needle on reining in the tech lords without damaging e.g. linguistic analysis researchers.

[–] sxan@midwest.social 4 points 1 month ago

“We’re gonna put creatives out of work, we’re gonna sell a unified product to replace them, and we’re gonna use their own labor to build their replacements.”

Yes, but: it's short-sighted, and wrong. Until we have a sea change in the LLM/AGI space, "creatives" will be needed for seed data. LLMs that are recursively trained on their own output degrade and produce worse output over time.

The "yes" part is that companies are looking to stop paying people for their work, while still hoping that Creative Commons types keep posting online for free harvesting.

[–] Daxtron2@startrek.website -1 points 1 month ago

The tools exist for creatives to use.

[–] gbin@lemmy.ca 24 points 1 month ago (1 children)

Copies are just very strong statistical correlations.

[–] CrabAndBroom@lemmy.ml 3 points 1 month ago (1 children)

Yeah I'm just downloading random data for fun in little tiny bits. If that data happens to arrange itself in the form of the latest episode of Doctor Who then that's not my problem.

[–] B0rax@feddit.org 2 points 1 month ago

Reminder for everyone who does not know: Doctor Who is available on BBC iPlayer, and episodes can be downloaded with get_iplayer.

If you are outside the UK, all you need is a DNS or VPN.

[–] CAPSpirou@lemmy.dbzer0.com 15 points 1 month ago

These files are just correlated bits and bytes, nothing more.

[–] Even_Adder@lemmy.dbzer0.com 0 points 1 month ago (1 children)

Damn, this article is so biased.

[–] FaceDeer@fedia.io 5 points 1 month ago (1 children)

Seemed pretty fair and fact-based to me. What bias are you seeing?

[–] Even_Adder@lemmy.dbzer0.com 1 points 1 month ago

I think it's really disingenuous to mention the DeviantArt/Midjourney/Runway AI/Stability AI lawsuit without talking about how most of the infringement claims were dismissed by the judge.