this post was submitted on 28 Feb 2024
92 points (96.0% liked)

[–] terminhell@lemmy.dbzer0.com 22 points 6 months ago (1 children)

Can we get a list of companies NOT doing this? I'd assume it's going to be much shorter.

[–] IllNess@infosec.pub 9 points 6 months ago (2 children)

All these AI and machine learning companies are taking content directly from websites and ignoring robots.txt files.

If your content is able to be crawled, even without being listed on search engines, I don't think it really matters.

[–] Please_Do_Not@lemm.ee 12 points 6 months ago (7 children)

I work in marketing, and every client I work with who has a WordPress website is using AI to write a lot of their content. This is going to lead to circularly trained AI for sure.

[–] ininewcrow@lemmy.ca 2 points 6 months ago (2 children)

Are you sure your clients aren't AI also?

[–] pavnilschanda@lemmy.world 5 points 6 months ago (1 children)

Dead internet theory in action

[–] ininewcrow@lemmy.ca 1 points 6 months ago

It was half dead and suicidal even before AI

[–] NewAgeOldPerson@lemmy.world 1 points 6 months ago

No way for me to know. My programming doesn't allow it.

[–] donuts@kbin.social 11 points 6 months ago (2 children)

Funny how all of these social media platforms that were so happy to describe themselves as "the public town square of the internet" or whatever are now claiming that they own everything that everyone ever posted. So, which is it? Because it obviously cannot be both.

[–] monkeyslikebananas2@lemmy.world 2 points 6 months ago

Depends on which one is more convenient that day.

[–] Squire1039@lemm.ee 2 points 6 months ago

both

Town square when they lure you in, they own everything when they sell your ass off.

[–] EdibleFriend@lemmy.world 7 points 6 months ago (2 children)

Bro...tumblr is full of some WEIRD FUCKIN SHIT YO

[–] nickhammes@lemmy.world 3 points 6 months ago (1 children)

I, for one, am looking forward to the rise of generative AI trained on 2014 tumblr, hallucinating Superwholock jokes where they don't belong, cosplayers dyeing themselves grey in a bathtub, and DashCon references where nobody expects them

[–] EdibleFriend@lemmy.world 2 points 6 months ago

Bro this shit is gonna make AI UwU

[–] RadicalCandour@startrek.website 2 points 6 months ago (1 children)

Hey now, don’t kink shame the weirdos

[–] EdibleFriend@lemmy.world 2 points 6 months ago

I know because I was one of those weirdos lol

[–] gofsckyourself@lemmy.world 7 points 6 months ago* (last edited 6 months ago) (1 children)

I always thought it was scummy as fuck that WordPress.org, a 501c3 nonprofit, is allowed to funnel business to WordPress.com which is a completely separate for-profit entity.

They are even allowed to trick people into thinking they are the same by using the name and trademarks, which they explicitly state you cannot do. But wp.com gets a free pass for some reason? Scummy as fuck.

[–] TORFdot0@lemmy.world 1 points 6 months ago

Yeah I’ve never liked WordPress. But it’s pretty much the de facto CMS for noobs. I have always used my own self-built CMSs on frameworks like Laravel, but it’s not really practical for non-tech people or even businesses to develop their own CMS unless they have really specific needs.

I’m going to be honest, I didn’t even realize that WordPress.org existed and was a non-profit; I just thought making the source available was something they did because you can’t really not do that as a PHP framework.

[–] herrcaptain@lemmy.ca 6 points 6 months ago (4 children)

I'm assuming this just relates to WordPress.com rather than the open-source WordPress.org but it's still a bummer. I've worked with the open source platform for over a dozen years and have started to kinda loathe what it's turned into but I'm not sure I'm yet at the point where I'm ready to migrate a bunch of sites to something else. This could be that push if they keep going down this road.

God, am I getting too old for this shit? I'm a pretty technical person but this AI nonsense is just relentless. I'm not philosophically against the idea of AI as like any tool it has the potential to better the world, but every tech company and their dog are going all in on using it for commercial bullshit that seems to provide very little value to society. Even fucking Mozilla is going in that direction.

[–] fruitycoder@sh.itjust.works 2 points 6 months ago (3 children)

Mozilla seems to lean more towards local and privacy-preserving AI dev, no? Both are really lacking in the space IMHO

Like I'm not interested in what the collective of digital knowledge looks like behind several corporate filters and a giant rent-seeking moat.

[–] Traister101@lemmy.today 2 points 6 months ago* (last edited 6 months ago)

It's the new NFTs and crypto, but it's not blatantly a scam, so the companies that skipped out on those sure as shit will be hopping onto AI

[–] KingThrillgore@lemmy.ml 2 points 6 months ago

There are already several WordPress plugins to block out generative AI. I expect the community to have a less than chipper attitude toward Automattic over this.

[–] CosmoNova@lemmy.world 1 points 6 months ago* (last edited 6 months ago)

I don’t really know what to say to cheer you up. Industrial revolutions are as important and exciting as they are painful, even dreadful to many. I’ve seen no signs of this one being different. There will be a lot of losers before we can expect widespread benefits for society from it. The current working class will suffer great losses and will have to fight so another can reap the benefits later.

[–] SuperSynthia@lemmy.world 5 points 6 months ago

Not only am I really glad to not be on tumblr, but this further shows I shouldn't use WordPress for my website even though there is an open-source version

[–] autotldr@lemmings.world 4 points 6 months ago

This is the best summary I could come up with:


To complicate matters even further, advertising content that isn’t even owned by Automattic, including ads from an old Apple Music campaign, has also reportedly made its way into the training data set.

The plans at Automattic have been so controversial internally, that a product manager has even started pulling his own photos off Tumblr to make sure they’re not used to train AI, according to 404.

Generative AI has become a big business ever since OpenAI first launched ChatGPT in late 2022 and text-prompt image creators soon followed from a number of companies.

But major publishers have complained, with some even filing lawsuits, alleging that much of the data used to train these systems was either pirated or doesn’t constitute “fair use” under existing copyright regimes.

In response to emailed questions on Tuesday, Automattic directed Gizmodo to a new post that more or less confirmed 404 Media’s reporting, while trying to sell the move to consumers as an opportunity to “give you more control over the content you’ve created.”

“We also plan to take that a step further and regularly update any partners about people who newly opt-out and ask that their content be removed from past sources and future training.”


The original article contains 536 words, the summary contains 201 words. Saved 62%. I'm a bot and I'm open source!

[–] phoneymouse@lemmy.world 4 points 6 months ago

All of this is predicated on having some company that can afford to pay and wants this data. Or, the next tech bubble will just be VCs throwing money at AI companies training their models on the old internet.

[–] harsh3466@lemmy.ml 4 points 6 months ago (1 children)

Shit like this should be opt in by default. But no. Instead of respecting the users they count on ignorance, forgetfulness, and obfuscation for this kind of fuckery.

[–] agent_flounder@lemmy.world 2 points 6 months ago

Anything to make a buck.

[–] AlmightySnoo@lemmy.world 4 points 6 months ago

In the 2000s we had AdSense. So now we're getting... AISense?

[–] LunaCtld@lemmy.world 3 points 6 months ago (2 children)

I welcome this change actually. Now users can clearly see what others have been saying forever: If you don't pay for the product, you ARE the product.

[–] TheImpressiveX@lemmy.ml 1 points 6 months ago

If you don't pay for the product, you ARE the product.

Well, that's not always true. I don't pay for Wikipedia, am I the product?

[–] MossBear@lemmy.world 0 points 6 months ago (2 children)

Explain how I'm the product relative to Linux.

[–] Crack0n7uesday@lemmy.world 2 points 6 months ago* (last edited 6 months ago) (1 children)

With Linux you pay for support if you ever need it. Most end users will never need support, but businesses running Linux servers pay Red Hat a shitload to support them in case shit ever hits the fan. It's like giving away a free car, but only certain people know how to do maintenance on it, and they all work for the manufacturer.

[–] MossBear@lemmy.world 1 points 6 months ago

I'm not a business, so it doesn't apply to me.

[–] RizzRustbolt@lemmy.world 1 points 6 months ago

Have you told anyone to switch to Linux?

[–] RizzRustbolt@lemmy.world 3 points 6 months ago

Matt's selling it.

The teams at WordPress and Tumblr have made it known that they absolutely don't want this shit.

[–] Kid_Thunder@kbin.social 3 points 6 months ago

It's crazy that it sounds like paying customers might also have to opt-out.

[–] CosmoNova@lemmy.world 2 points 6 months ago

Remember when Xitter started selling the checkmark and now every platform is rolling out something identical? What about Netflix cracking down on sharing and adding ads to their lowest tier? Yeah this is that.

[–] FrostKing@lemmy.world 2 points 6 months ago (1 children)

Can someone please outline the main reasons people are upset with these sites for choosing to do this?

[–] Ultraviolet@lemmy.world 4 points 6 months ago (1 children)

There are 3 very important things that have to be respected when using someone's work. Consent, credit, and compensation. The data is being taken without the consent of users, they're not being credited for anything, and they don't receive so much as a cent in exchange.

[–] FrostKing@lemmy.world 1 points 6 months ago

Makes sense

[–] echodot@feddit.uk 1 points 6 months ago

Good, maybe now everyone will stop using bloody Shopify.
