this post was submitted on 29 Aug 2023
60 points (95.5% liked)
Technology
75329 readers
2220 users here now
This is a most excellent place for technology news and articles.
Our Rules
- Follow the lemmy.world rules.
- Only tech related news or articles.
- Be excellent to each other!
- Mod approved content bots can post up to 10 articles per day.
- Threads asking for personal tech support may be deleted.
- Politics threads may be removed.
- No memes allowed as posts, OK to post as comments.
- Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
- Check for duplicates before posting, duplicates may be removed
- Accounts 7 days and younger will have their posts automatically removed.
Approved Bots
founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
I see what you mean yes, but of course such large resources are required to train the model - not run it. So reasonably as long as a bunch of users can pool resources to compete with big tech, there will always be an 'un-watermark-able' crowd out there, making all the watermakrs essentially useless because they only got half the picture.
And how training these models works is insanely parallel, so reasonably - if (ideally a FOSS) project pops up allowing users to donate cpu time to train the model as a whole - users could actually have more computational power than the big tech companies
The resources to train these models are such that even Google/Amazon/MS are doing it selectively. Google.../AWS/Azure are some of the biggest resources on the planet and these companies get it for as close to "free" as it is ever going to get, and even they ration those resources.
A bunch of kids on 4chan aren't going to even make a drop in the bucket. And, in the act of organizing, the ringleaders will likely get caught for violating copyright law. Because, unless the open source model really is the best (and it won't be), they are using proprietary code that they modified to remove said watermarks.
As for "We'll folding@home it!": Those efforts are largely dead because people realized power matters (and you are almost always better off just donating some cash so they can buy some AWS time). But, again, you aren't getting a massive movement out of people who just need to make untraceable deepfakes.
Also, this also ignores all of the shady ass crap being done to get that training data. Like... there is a reason MS bought Github.
I think youre mixing together a couple angles here to try n make a point.
'Unless the open source model is the best...theyre using proprietary code' youre talking about a hypothetical program hypothetically being stolen and referencing it as a definite?
As per the companies, of course they only use certain resoures, theyre companies they need returns to exist. A couple million down the drain could be some CEO's next bonus, so they won't do anything theyre into sure they'll get something from (even if only short term)
As per the 4chan, was that a coincidence or are you referencing unstable diffusion? Cause they did do almost exactly that (before of course it got mismanaged cause the nsfw industry is always been a bit ghetto)
And like sure fold it at home or donate for aws, same end result really doesn't matter what the user's are comfortable with
And whew finally sure ms bought github but like you think stable diffusion bought the internet? Courts have proven webscraping is legal...
Ik this is a wall of text but like I said these arguments all feel like a bunch of thoughts tangentially related
I am covering "a couple angles" because this falls apart under almost the most cursory of examination.
But sure. If we are at a point where there is sufficient publicly available training data, a FOSS product performs comparable to the flagship products of 100 billion dollar companies, and training costs have been reduced to the point that a couple kids on 4chan can train up a new model over the course of a few days: Sure.
Until we reach that point? Actually, even after we reach that point, it would still be unlikely. Because if training is that cheap then you can bet said companies have funded the development of new technologies that allow them to take advantage of their.. advantages.