this post was submitted on 22 Aug 2023
691 points (95.9% liked)

Technology


OpenAI now tries to hide that ChatGPT was trained on copyrighted books, including J.K. Rowling's Harry Potter series

A new research paper laid out ways in which AI developers should try and avoid showing LLMs have been trained on copyrighted material.

[–] Even_Adder@lemmy.dbzer0.com 5 points 1 year ago (7 children)

I mean, this is the exact way the U.S. Copyright Office's guidance says they think you should use it.

[–] habanhero@lemmy.ca 2 points 1 year ago (6 children)

Sure, but even under this guidance copyright owners of the training data are still shafted, based on how the data is scraped pretty much freely. Can an opportunist generate an unofficial sequel to Harry Potter, do the minimum to ensure they get copyright and reap the reward from it?

[–] Even_Adder@lemmy.dbzer0.com 3 points 1 year ago (5 children)

That's how copyright has always worked. Fair use is complex, but as long as you're not straight up copying someone's work you're fine. 50 Shades of Grey started out as Twilight fanfiction. So yeah, you could.

[–] habanhero@lemmy.ca 2 points 1 year ago* (last edited 1 year ago) (1 children)

Yes, fair use has its stipulations, but AI is breaking new ground here. We are no longer dealing with reaction videos; we are in an era where literally dozens of pages of content can be generated in a matter of minutes, with relatively little human involvement. Perhaps it's time to revisit whether the law still holds in light of these new technologies and tools.

[–] Even_Adder@lemmy.dbzer0.com 1 points 1 year ago (1 children)

You should read this article by Kit Walsh, who’s a senior staff attorney at the EFF.

[–] habanhero@lemmy.ca 1 points 1 year ago

Interesting read! Definitely a useful breakdown and I see the reasoning. Thanks for sharing.
