The archive warriors have been downloading Reddit for a while already. 15.6 billion items and counting. You can help too:
It just lists the names of people archiving Reddit. Where can I get the archived data? Do I have to ask one of those people to send me a zip file?
The data is integrated into the Internet Archive and available e.g. via the Wayback Machine. Not sure if you can get the whole Reddit dataset.
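For anyone who wants to look up a single archived page rather than the whole dataset, here is a minimal sketch of how the Wayback Machine URLs can be put together. The helper names `wayback_url` and `availability_query` are made up for illustration, and the code only builds the URLs; it does not check that a capture of a given Reddit page actually exists.

```python
from urllib.parse import urlencode

# Public Wayback Machine endpoints (the helper names below are hypothetical).
WAYBACK_PREFIX = "https://web.archive.org/web"
AVAILABILITY_ENDPOINT = "https://archive.org/wayback/available"


def wayback_url(original_url: str, timestamp: str) -> str:
    """Build a Wayback Machine URL for a page.

    `timestamp` is YYYYMMDDhhmmss or a prefix of it (e.g. "2023");
    the Wayback Machine redirects to the capture closest to that time.
    """
    return f"{WAYBACK_PREFIX}/{timestamp}/{original_url}"


def availability_query(original_url: str, timestamp: str = "") -> str:
    """Build a query URL for the Wayback availability API.

    Fetching the returned URL gives JSON describing the closest capture,
    if there is one. This function does not perform the request itself.
    """
    params = {"url": original_url}
    if timestamp:
        params["timestamp"] = timestamp
    return f"{AVAILABILITY_ENDPOINT}?{urlencode(params)}"


if __name__ == "__main__":
    print(wayback_url("https://www.reddit.com/r/linux/", "2023"))
    print(availability_query("reddit.com/r/linux", "20230101"))
```

Opening the first printed URL in a browser should land on the nearest capture, assuming that page was archived at all.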
It's not an archive but RedLib provides an alternative frontend which deals with most of the hostile design
There is lemmit.online which does this purely.
It is pretty busy but may already do some of it. You could request a niche and very useful community like the sway one. Or use your own server to fetch these results and post to the open Web.
Unfortunately they don't take requests for new subreddits anymore. In addition, they don't mirror comments so in terms of answers to questions it's probably not that helpful.
True. But that may be due to how the bot works?
As a sidenote, you can get around the VPN block with Redlib by just adding safe- to the start of most reddit URLs. So like instead of reddit.com/r/linux or whatever you can do safereddit.com/r/linux and it should work without needing a login.
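If you want to do that rewrite automatically (say, in a script or a bookmarklet-style helper), something like this sketch would work. The function name `to_safereddit` is hypothetical, and two things are assumptions on my part: that dropping a leading `www.` is fine, and that other subdomains such as `old.reddit.com` should be left alone, since the trick is only said to work for "most" URLs.

```python
from urllib.parse import urlsplit, urlunsplit


def to_safereddit(url: str) -> str:
    """Rewrite a reddit.com URL to its safereddit.com equivalent.

    URLs on other hosts (including other reddit subdomains) are
    returned unchanged. Scheme-less input like "reddit.com/r/linux"
    is accepted and returned scheme-less.
    """
    has_scheme = "://" in url
    parts = urlsplit(url if has_scheme else "https://" + url)

    host = parts.netloc.lower()
    if host.startswith("www."):
        host = host[len("www."):]  # assumption: the www. prefix can be dropped
    if host != "reddit.com":
        return url  # not a plain reddit.com link, leave it alone

    rewritten = urlunsplit(
        (parts.scheme, "safereddit.com", parts.path, parts.query, parts.fragment)
    )
    # Give back the same shape the caller passed in.
    return rewritten if has_scheme else rewritten[len("https://"):]


if __name__ == "__main__":
    print(to_safereddit("reddit.com/r/linux"))
    print(to_safereddit("https://www.reddit.com/r/linux?sort=new"))
```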
I mean, if people here don't like how Reddit took advantage of user comment data, why should we archive the same without consent from the people who wrote them? Legally speaking Reddit holds the copyright also.
so just use chatgpt or gemini - pretty sure they sucked in all of reddit to form their KB
Even if that's so, I have had many occasions where I thought that for something simple, ChatGPT could do the job. I ended up having a back and forth for hours (last case of that being yesterday) until I got it fixed. For most cases (but not yesterday's) I found it much faster by looking it up online.
I mostly use Mistral personally. You also can use llava for image analysis
using llm ai for tech support is monumentally stupid lmao
How is it worse than taking advice off of the Internet? At the end of the day you need to be aware of what you are doing.
Mistral has helped me with a variety of tasks such as finding tools and choosing ZFS geometry
BTW - thanks for Mistral. Another tool in the box!
Quite right!
You need to take it all (AI or internet searches) with a huge pinch of salt. Even ye olde textbooks were not infallible and were often out of date, so sodium chloride was also required even then.
The code either works or it doesn't - it's all in the testing. If you deploy AI suggestions without thought you deserve the consequences.
I think the reliability of the response also depends on the prompt. Certain prompts decrease the reliability issues.