this post was submitted on 29 Sep 2023
398 points (93.8% liked)

Technology

72320 readers
4424 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS
 

Authors using a new tool to search a list of 183,000 books used to train AI are furious to find their works on the list.

you are viewing a single comment's thread
view the rest of the comments
[–] Gibdos@feddit.de 24 points 2 years ago (42 children)

I certainly hope that none of these authors have ever read a book before or have been inspired by something written by another author.

[–] adriaan@sh.itjust.works 33 points 2 years ago (2 children)

That would be a much better comparison if it was artificial intelligence, but these are just reinforcement learning models. They do not get inspired.

[–] Hackerman_uwu@lemmy.world 9 points 2 years ago (1 children)

More to the point: they replicate patterns of words.

[–] lloram239@feddit.de 7 points 2 years ago (1 children)
[–] FontMasterFlex@lemmy.world 1 points 2 years ago

That's a Bingo!

[–] Shurimal@kbin.social 8 points 2 years ago (2 children)

just reinforcement learning models

...like the naturally occuring neural networks are.

[–] Khalic@kbin.social 26 points 2 years ago (2 children)

The brain does not work the way you think… (I work in the field, bio-informatics). What you call “neural networks” come from an early misunderstanding of how the brain stores information. It’s a LOT more complicated and frankly, barely understood.

[–] canihasaccount@lemmy.world 11 points 2 years ago (1 children)

Yeah, accurately simulating a single pyramidal neuron requires an eight-layer deep neural network:

https://www.cell.com/neuron/pdf/S0896-6273(21)00501-8.pdf

[–] demonsword@lemmy.world 3 points 2 years ago

that was an interesting read, thank you

so its barely understood, but this definitely is not it. got it.

[–] lemmyvore@feddit.nl 10 points 2 years ago (1 children)

Tell you what, you get a landmark legal decision classifying LLM as people and then we'll talk.

Until then it's software being fed content in a way not permitted by its license i.e. the makers of that software committing copyright infringement.

[–] Touching_Grass@lemmy.world 6 points 2 years ago* (last edited 2 years ago) (1 children)

What exactly was not permitted by the license? Reading?

[–] sab@lemmy.world 14 points 2 years ago (3 children)

Using it to (create a tool to) create derivatives of the work on a massive scale.

[–] SirGolan@lemmy.sdf.org 8 points 2 years ago

Wikipedia: In copyright law, a derivative work is an expressive creation that includes major copyrightable elements of a first, previously created original work.

I think you may be off a bit on what a derivative work is. I don't see LLMs spouting out major copyrightable elements of books. They can give a summary sure, but Cliff Notes would like to have a word if you think that's copyright infringement.

[–] lloram239@feddit.de 5 points 2 years ago (1 children)

Better tell that Google and their search index, book scanning project and knowledge graph.

[–] sab@lemmy.world -1 points 2 years ago* (last edited 2 years ago)

I didn't know those were LLMs, TIL.

[–] Touching_Grass@lemmy.world -1 points 2 years ago (1 children)

Well when that happens we have laws. So no problems

[–] sab@lemmy.world 2 points 2 years ago (1 children)

Would you be okay with applying that argument for any crime?

[–] Touching_Grass@lemmy.world -1 points 2 years ago (1 children)

Ever heard of the early 21st century classic Minority Report

[–] sab@lemmy.world 3 points 2 years ago

You're missing the point. I'll make your example more specific.

Well when fraud/rape/murder happens we have laws. So no problems.

Those things happen. Creating a LLM based on copyrighted material without permission happens - it's not a hypothetical. But even then, giving a punishment after the fact does not make the initial crime "no problem", as you put it.

load more comments (39 replies)