this post was submitted on 23 Jun 2024
343 points (97.8% liked)
Technology
59472 readers
3747 users here now
This is a most excellent place for technology news and articles.
Our Rules
- Follow the lemmy.world rules.
- Only tech related content.
- Be excellent to each another!
- Mod approved content bots can post up to 10 articles per day.
- Threads asking for personal tech support may be deleted.
- Politics threads may be removed.
- No memes allowed as posts, OK to post as comments.
- Only approved bots from the list below, to ask if your bot can be added please contact us.
- Check for duplicates before posting, duplicates may be removed
Approved Bots
founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
They can't even be punished.
robots.txt
is just a convention, not a regulation. It's totally not enforceable.The only legal framework we have is copyright law. Those who oppose this behavior will have to demonstrate copyright violation, and that may be difficult to do since the law hasn't caught up.
It's true robots is not regulation but if it's proven they ignore it on purpose it will be a major point in future lawsuits. And those are the next step.
It won't have any relevance at all.
Either scraping to transform the information in the page is fair use, and consent isn't necessary, or it is not fair use, and the absence of a robots.txt doesn't constitute consent. There's no middle ground where a robots.txt can mean anything.
Yeah I know. But I wanted to point out that the comment in the article wasn't so much a real consideration as business risk analysis 101. Along with a healthy dose of corporate spin.