487
‘Impossible’ to create AI tools like ChatGPT without copyrighted material, OpenAI says
(www.theguardian.com)
This is a most excellent place for technology news and articles.
Every work is protected by copyright, unless stated otherwise by the author.
If you want to create a capable system, you want real data and you want a wide range of it, including data that is rarely considered to be a protected work, despite being one.
I can guarantee you that you're going to have a pretty hard time finding a dataset with diverse data containing things like napkin doodles or bathroom stall writing that's compiled with permission of every copyright holder involved.
How hard it is doesn't matter. If you can't compensate people for using their work, or excluding work people don't want users, you just don’t get that data.
There's plenty of stuff in the public domain.
And artists are being compensated now fairly?
Previous wrongs don't make this instance right.