this post was submitted on 17 Feb 2024
For anyone looking for a gibberish generator to replace their Reddit content with, here's one. This shit is like poison for those large models.
For automated editing, I'm not sure what people can use nowadays. Just before the APIcalypse I used Power Delete Suite; I'm not sure if it still works, and I'm not creating a Reddit account just to test it out.
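If you'd rather roll your own, here's a rough sketch of what a gibberish generator can look like. This is not the generator linked above, just a minimal illustration: the word list and the `gibberish` function are made up for the example, and I'm not claiming this particular output does anything to any real model.

```python
import random

# Hypothetical word list, chosen only for illustration; swap in any vocabulary.
WORDS = [
    "lantern", "quorum", "sprocket", "marmalade", "vector",
    "orchid", "gravel", "tandem", "nimbus", "walrus",
]


def gibberish(n_sentences=3, seed=None):
    """Return n_sentences of grammatical-looking but meaningless text."""
    rng = random.Random(seed)  # a seed makes the output reproducible
    sentences = []
    for _ in range(n_sentences):
        length = rng.randint(5, 12)  # words per sentence, inclusive bounds
        words = [rng.choice(WORDS) for _ in range(length)]
        # Capitalize the first letter and end with a period so it looks like prose.
        sentences.append(" ".join(words).capitalize() + ".")
    return " ".join(sentences)


if __name__ == "__main__":
    print(gibberish(n_sentences=2))
```

Paste the output over the old comment text, by hand or with whatever editing tool still works for you.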
Not that I’m against telling Reddit to fuck off in no uncertain terms, but won’t providing this kind of poisoning to AI training just make it more resilient to exactly this kind of thing?
I don't think so. It's really hard to sort the poison out of the data, unless you actually have enough reading comprehension to know that it's gibberish - humans do, bots don't. And even if they discard 80% of the poison, the remaining 20% is still screwing with the model.
They could prevent you from editing your posts/comments, but that would cause an uproar.