Well know if they start using redactions as much as em dashes
Showerthoughts
A "Showerthought" is a simple term used to describe the thoughts that pop into your head while you're doing everyday things like taking a shower, driving, or just daydreaming. The most popular seem to be lighthearted clever little truths, hidden in daily life.
Here are some examples to inspire your own showerthoughts:
- Both “200” and “160” are 2 minutes in microwave math
- When you’re a kid, you don’t realize you’re also watching your mom and dad grow up.
- More dreams have been destroyed by alarm clocks than anything else
Rules
- All posts must be showerthoughts
- The entire showerthought must be in the title
- No politics
- If your topic is in a grey area, please phrase it to emphasize the fascinating aspects, not the dramatic aspects. You can do this by avoiding overly politicized terms such as "capitalism" and "communism". If you must make comparisons, you can say something is different without saying something is better/worse.
- A good place for politics is c/politicaldiscussion
- Posts must be original/unique
- Adhere to Lemmy's Code of Conduct and the TOS
If you made it this far, showerthoughts is accepting new mods. This community is generally tame so its not a lot of work, but having a few more mods would help reports get addressed a little sooner.
Whats it like to be a mod? Reports just show up as messages in your Lemmy inbox, and if a different mod has already addressed the report, the message goes away and you never worry about it.
And also everyone's vague notes about, like, the Sword of Truth mass isekai hatefic they wanted to write back in 2024 but then gave up on because they mentioned the Battle of Cable Street and then had to stare at a wall for a bit and walk away in shame
highly doubt it, they're trained on publicly available data mostly
If by “publicly” you mean “any data source that it managed to connect to, public or private”, then yes….
Not entirely though.
Like we know grok was trained on the fbi's cp stash, right?
Do you have any source on that?
Those files are kinda a nightmare to navigate in their bare state. And the datasets are huge. I doubt anyone training AI would allow them to go through knowingly, less it was specifically a police invesigation and case law focused AI that was designed to process and categorize that kind of data.
Most AI are designed for functional discussion and factual data processing. It's not a great idea to just feed in random trash.
They scrape data indiscriminately; I'm sure any Epstein files publicly accessible on the internet have been added to their databases. Perhaps they'd be filtered out before being used to train models but I'm skeptical they take that level of care with the data.
I had to use cloudflare to stop AI crawlers from using like 60% of my 16 core server that runs this instance. They were spending that much time pulling fediverse content, multiple bots without and wait time between requests. You really think they'd reject epstein files but seek out our combined output?
not a good idea
Which would stop them.