this post was submitted on 08 Jun 2023
4 points (100.0% liked)

Lemmy

12514 readers
42 users here now

Everything about Lemmy; bugs, gripes, praises, and advocacy.

For discussion about the lemmy.ml instance, go to !meta@lemmy.ml.

founded 4 years ago
MODERATORS
 

(attempt to cross-post from /c/programming )

Idea: Scrape all the posts from a subreddit as they're being made, and "archive" them on a lemmy instance, making it very clear it's being rehosted, and linking back to the original. It would probably have to be a "closed" lemmy instance specifically for this purpose. The tool would run for multiple subreddits, allowing Lemmy users to still be updated about and discuss any potential content that gets left behind.

Thoughts? It's probably iffy copyright-wise, but I think I can square my conscience with it.

you are viewing a single comment's thread
view the rest of the comments
[โ€“] usernotfound@lemmy.ml 0 points 1 year ago* (last edited 1 year ago) (1 children)

Yeah, exactly.

Also to reduce the chances of it colliding with an existing community. It would be an entire Lemmy instance dedicated to reddit mirroring, Lemmit ;)

But to be fair, I wasn't particularly looking forward to hosting and maintaining my own instance, but coding the tool part should be easy.

[โ€“] Barbarian@sh.itjust.works 0 points 1 year ago (1 children)

Just be aware that it might not work. Reddit implemented rate limits on page loads to combat the inevitable web scraping as they turn off the API. Test out how fast you can pull pages before putting in any real coding time.

load more comments (1 replies)