this post was submitted on 08 Sep 2023
305 points (94.5% liked)


But what if you do? Will you get caught?

top 43 comments
[–] teft@startrek.website 113 points 1 year ago (2 children)

If it's publicly accessible it's scrape-able. He already tried to make tweets not publicly accessible and look how long that lasted.

[–] olympicyes@lemmy.world 17 points 1 year ago (2 children)

It’s not even clear that it’s illegal to scrape publicly available data, so I don’t know what the TOS would be enforcing.

[–] coffee_poops@sh.itjust.works 9 points 1 year ago (1 children)

It doesn't have to be illegal for them to sue you for violating their policies. It would be a civil suit for damages.

[–] Eranziel@lemmy.world 16 points 1 year ago (2 children)

Lol, then they would have to demonstrate that there were damages. The worst a TOS violation will get you is a ban.

[–] kometes@lemmy.world 4 points 1 year ago (1 children)
[–] MonkderZweite@feddit.ch 4 points 1 year ago (1 children)
[–] kometes@lemmy.world 2 points 1 year ago (1 children)

It's still a billionaire suing you for those peanuts.

[–] MonkderZweite@feddit.ch 1 points 1 year ago

Suing for 50 dollars or so? Fine.

[–] coffee_poops@sh.itjust.works 3 points 1 year ago

Unfortunately, they have more money to blow on legal fees. The threat of a suit is enough to keep most people from fucking around and finding out.

It's not publicly available anymore. If you're not logged in, you don't see anything except tweets you have a direct link to. Even then you don't see any replies, and the number of tweets you can see per day is limited.

[–] Kuro@lemm.ee 15 points 1 year ago* (last edited 1 year ago)

Yep, I would never remember on the odd occasions I looked at Twitter, then I'd just leave the site after being prompted to log in.

[–] Darkard@lemmy.world 45 points 1 year ago (2 children)

I hope someone makes some manic bot that scrapes every last tweet and posts it on a duplicate site called Y.

[–] SinningStromgald@lemmy.world 12 points 1 year ago* (last edited 1 year ago)

Might as well go whole hog and do the entire alphabet. Then do one for every iteration of every letter combination.

[–] veloxization@yiffit.net 39 points 1 year ago

He's still mad at those researchers for scraping the data that shows that ever since he took over, the antisemitism, racism and general bigotry have gone up on the platform.

[–] Sanctus@lemmy.world 30 points 1 year ago (1 children)
[–] cheese_greater@lemmy.world 0 points 1 year ago (1 children)

Let the Supreme Court enforce it ;)

[–] Sanctus@lemmy.world -1 points 1 year ago

It should, but it won't.

[–] tonytins@pawb.social 24 points 1 year ago (1 children)

How on earth do they plan on enforcing that? xD

[–] xavier666@lemm.ee 12 points 1 year ago (1 children)

They don't have to enforce it. If someone says bad things about Twitter by analysing their content, Twitter can sue them for scraping.

[–] IphtashuFitz@lemmy.world 16 points 1 year ago

“Our interns spent 500 hours collecting the raw data”.

[–] underisk@lemmy.ml 23 points 1 year ago (1 children)

I’m pretty sure both parties must agree to the terms before they legally bind anyone, so wouldn’t this just apply to logged-in users?

[–] thepianistfroggollum@lemmynsfw.com 7 points 1 year ago* (last edited 1 year ago) (3 children)

Accessing the website is often viewed as accepting the terms, so that wouldn't hold up. Not that they'd have a legal standpoint on the issue.

[–] NeoNachtwaechter@lemmy.world 15 points 1 year ago (1 children)

> Accessing the website is often viewed as accepting the terms

The scraping bot can't read the terms

But even if it could, it wouldn't give a damn :-)

[–] mojo@lemm.ee 12 points 1 year ago (1 children)

By reading this message you agree to my terms that I'm really cool

[–] eskimofry@lemmy.one 3 points 1 year ago

Lol and your username

[–] TheEntity@kbin.social 8 points 1 year ago

How do you read the terms without accessing their website?

[–] ChunkMcHorkle@lemmy.world 22 points 1 year ago* (last edited 4 months ago)

deleted by creator

[–] 7fb2adfb45bafcc01c80@lemmy.world 15 points 1 year ago* (last edited 1 year ago) (2 children)

I thought this was an article about the X Window System based on the preview for the article. Boy, are those two similar-looking.

[–] MJBrune@lemmy.world 2 points 1 year ago

Realistically, very few people know about the X Window System, and even fewer care about it.

You could always join wayland.social

[–] dingleberry@discuss.tchncs.de 14 points 1 year ago (1 children)

Just update robots.txt, coward!

[–] Iwasondigg@lemmy.one 6 points 1 year ago

Took a look at their robots.txt, it appears to block all bots except Google.
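
For anyone who wants to check this kind of thing themselves, here is a minimal sketch using Python's standard `urllib.robotparser`. The robots.txt text below is a made-up stand-in that only mirrors the "everyone blocked except Google" pattern described above; it is not the site's actual file. The `MyScraper` agent name, the `is_allowed` helper, and the example.com URL are likewise hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt for illustration only (NOT the real file of any site):
# Googlebot may fetch everything, every other user agent is disallowed.
SAMPLE_ROBOTS_TXT = """\
User-agent: Googlebot
Allow: /

User-agent: *
Disallow: /
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if the given robots.txt text permits user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network request is made here.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# Hypothetical URL and agent names, chosen only to exercise both rules.
page = "https://example.com/some/page"
googlebot_ok = is_allowed(SAMPLE_ROBOTS_TXT, "Googlebot", page)
other_ok = is_allowed(SAMPLE_ROBOTS_TXT, "MyScraper", page)

print("Googlebot allowed:", googlebot_ok)
print("MyScraper allowed:", other_ok)
```

Keep in mind that robots.txt is only a request to well-behaved crawlers. Nothing in it technically stops a bot that chooses to ignore it.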

[–] pseudorandom@kbin.social 14 points 1 year ago

Or just stop using X altogether.

[–] cheese_greater@lemmy.world 14 points 1 year ago

Crawling for me, not thee!

[–] BlinkerFluid@lemmy.one 9 points 1 year ago

don't!

You heard him, scrape more.

[–] YurkshireLad@lemmy.ca 6 points 1 year ago

So he’s going to sue Google then?