this post was submitted on 16 Apr 2026
9 points (80.0% liked)

Programming

26563 readers
268 users here now

Welcome to the main community in programming.dev! Feel free to post anything relating to programming here!

Cross posting is strongly encouraged in the instance. If you feel your post or another person's post makes sense in another community cross post into it.

Hope you enjoy the instance!

Rules

Rules

  • Follow the programming.dev instance rules
  • Keep content related to programming in some way
  • If you're posting long videos try to add in some form of tldr for those who don't want to watch videos

Wormhole

Follow the wormhole through a path of communities !webdev@programming.dev



founded 2 years ago
MODERATORS
 

I'm trying to get to a reason on this, but my point reach to a limit.

I've the feels that scraping the internet for public accessible data, like for example open and public music on Spotify wouldn't be a crime, but the distribution would be. At the same token, this is seem as a crime, while Google does the same and nothing happens, even worse, if this get regulated, Google would have a huge advantage on anyone else.

So, my deeper question is: "Is copyright dead?"

you are viewing a single comment's thread
view the rest of the comments
[โ€“] Shin@piefed.social 1 points 2 days ago (1 children)

That's the point, my focus is on the "Europe" as a general place, since they need to sync the "law" to some degree, there is different levels, but the base line are the same.

Most public data, like all the music in Spotify don't require a cookie. So I could in theory scrape all the Spotify music to "listem later". This wouldn't be "illigal", but if that's the case Annas Archive should be "fine"... (I know that they are distributing, and this is the fight)

But, if they scrapped the music, and I scrape we would have the same "dataset", so if I download the Annas "dataset", would it be different from mine? So if I prefer to download the Anna's dataset instead of scrape myself, would this be illigal? They aren't selling (on the contrary of Google).

There is way to many questions in my head :(

[โ€“] TehPers@beehaw.org 2 points 2 days ago

This wouldn't be "illigal", but if that's the case Annas Archive should be "fine"... (I know that they are distributing, and this is the fight)

I don't know much about European law, but redistribution changes things a lot here in the US. At least here, it then gets into copyright law, and you'd be reproducing copyrighted works without authorization (the Internet Archive attempted to get around this with books by getting legitimate copies of the books, digitizing them, then "lending" the digital copies of those books).

So if I prefer to download the Anna's dataset instead of scrape myself, would this be illigal?

No idea in Europe. In the US, it might be, depending on what the contents of the work are. I believe Anna's Archive would count as piracy in this case, though scraping directly from Spotify might not be because they are redistributing the music with authorization from the copyright holder. It gets pretty confusing, honestly.

Regardless, if you aren't doing things at large scale, even if you are breaking a law by downloading pirated content, it's unlikely anyone will care. People usually only really start caring if you start redistributing stuff, so as long as you aren't hosting what you're scraping, you're unlikely to run into any trouble.