this post was submitted on 17 Aug 2023
436 points (96.2% liked)

Technology

cross-posted from: https://nom.mom/post/121481

OpenAI could be fined up to $150,000 for each piece of infringing content. https://arstechnica.com/tech-policy/2023/08/report-potential-nyt-lawsuit-could-force-openai-to-wipe-chatgpt-and-start-over/#comments

[–] ArmokGoB@lemmy.dbzer0.com 14 points 1 year ago (2 children)

I disagree. I think that there should be zero regulation of the datasets as long as the produced content is noticeably derivative, in the same way that humans can produce derivative works using other tools.

[–] adrian783@lemmy.world 1 point 1 year ago (1 children)

LLMs are not human, the process used to train an LLM is not human-like, and LLMs don't have human needs or desires, or rights for that matter.

comparing it to humans has been a flawed analogy since day 1.

[–] synceDD@lemmy.world 2 points 1 year ago

LLM has no desires = no derivative works? Let an LLM handle your comments; they'd make more sense.

[–] HelloHotel@lemmy.world 1 points 1 year ago* (last edited 1 year ago) (1 children)

Good in theory. The problem is that if your bot is given too much exposure to a specific piece of media, and the "creativity" value that adds random noise (and for some setups forces it to improvise) is too low, you get whatever impression the content made on the AI, like an imperfect photocopy (a non-expert's explanation of "memorization"). Too high and you get random noise.
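The tradeoff described here maps onto the sampling "temperature" most text generators expose. A minimal sketch of temperature sampling (illustrative only; function and variable names are made up, not any specific model's API):

```python
import math
import random

def sample_with_temperature(logits, temperature):
    """Sample a token index from raw model scores (logits).

    A low temperature sharpens the distribution toward the single most
    likely token, which is how near-verbatim "photocopy" output can
    emerge from a heavily memorized passage; a high temperature
    flattens the distribution toward uniform random noise.
    """
    # Scale logits by temperature, then apply a numerically stable softmax.
    scaled = [score / temperature for score in logits]
    peak = max(scaled)
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    probs = [e / total for e in exps]
    # Draw one index according to the resulting distribution.
    return random.choices(range(len(probs)), weights=probs, k=1)[0]
```

With `temperature=0.01` the highest-scoring token is chosen almost every time; with `temperature=100` the choice is close to a uniform coin flip across the vocabulary.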

[–] ArmokGoB@lemmy.dbzer0.com 2 points 1 year ago

if your bot is given too much exposure to a specific piece of media, and the "creativity" value that adds random noise (and for some setups forces it to improvise) is too low, you get whatever impression the content made on the AI, like an imperfect photocopy

Then it's a cheap copy, not noticeably derivative, and whoever is hosting the trained bot should probably take it down.

Too high and you get random noise.

Then the bot is trash. Legal and non-infringing, but trash.

There is a happy medium where SD, MJ, and many other text-to-image generators currently exist. You can prompt in such a way (or exploit other vulnerabilities) to create "imperfect photocopies," but you can also create cheap, infringing works with any number of digital and physical tools.