this post was submitted on 31 Jan 2025
37 points (93.0% liked)

Privacy

33164 readers
663 users here now

A place to discuss privacy and freedom in the digital world.

Privacy has become a very important issue in modern society, with companies and governments constantly abusing their power, more and more people are waking up to the importance of digital privacy.

In this community everyone is welcome to post links and discuss topics related to privacy.

Some Rules

Related communities

much thanks to @gary_host_laptop for the logo design :)

founded 5 years ago
MODERATORS
 

I will be stuck in low or no internet areas and having a way to save a whole website (such as a small community wiki or something) to browse while bored would be very nice. It'd be nice if its features like search could be kept working. Any suggestions for a Foss app that can do this?

all 16 comments
sorted by: hot top controversial new old
[–] MonkderVierte@lemmy.ml 3 points 22 hours ago

There's "Save as web archive", might fit your need. Careful, has Admob and Firebase Analytics trackers.

[–] kekmacska@lemmy.zip 10 points 1 day ago

you can't. How do you imagine saving the sql databases, that you need for logging in and viewing user profiles and so on? at most, you can save a snapshot

[–] user224@lemmy.sdf.org 10 points 1 day ago

I used wget to download static sites, or at least ones with simpler JavaScript, but it won't download any required files that are only linked in JS code, so it probably won't work for many sites.

You also need to be careful when spanning hosts so that you don't accidentally (attempt to) download the entire internet. And rate-limiting, useragent, robots file, filename limitations (so that it doesn't save files with filename characters that have other uses in URLs like # and ?), filename extensions (to serve them back with correct mimetype), getting filenames from server rather than URL when appropriate, converting links (works in HTML files only), and I am probably forgetting something else.

Oh, and it's a single process doing one request at a time, so even just a page with too many images will take ages. E.g.: http://cyber.dabamos.de/88x31/ (currently offline).

You can then easily serve them using NGINX, or just browse as files, though the latter may not work well on something like a phone. Oh, one more thing, image.jpg and Image.jpg would conflict on Android, and some websites have differences like that. It can only be stored within Termux (and served using NGINX in Termux).

[–] asudox@lemmy.asudox.dev 6 points 1 day ago (1 children)

Would you be ok with reading wikipedia?

There's this app that is for viewing wikipedia pages both online and offline: https://f-droid.org/packages/org.nsh07.wikireader/

[–] marcie@lemmy.ml 4 points 1 day ago* (last edited 1 day ago) (2 children)

This will help but the wiki I'm thinking of using is for a video game and for a ttrpg 🤔

[–] asudox@lemmy.asudox.dev 4 points 1 day ago* (last edited 1 day ago)

You can download the website's static files then (html, css, images, etc.) but features such as search won't function if it works by querying some database.

Iirc most browsers have a way to make website's available offline. I know chromium has it, but firefox does not. You'd probably need an extension for that. Or you can download the static files, store them in a directory manually and then open the index.html with firefox. That should work.

[–] f@ieji.de 1 points 1 day ago (1 children)

@marcie @asudox On Firefox, can you try CTRL+S and choosing complete web page save? It usually is enough. Though if it calls an API for searching, that's not gonna work.

[–] asudox@lemmy.asudox.dev 3 points 1 day ago

"functions on android"

[–] Boomkop3@reddthat.com 3 points 1 day ago (1 children)

a chrome extension called WebScrapBook does the trick. Install it on a chromium based browser, such as kiwi or one that isn't getting discontinued

[–] ReversalHatchery@beehaw.org 2 points 18 hours ago (1 children)

firefox has that too, they don't need to use chrome for that

[–] Boomkop3@reddthat.com 2 points 11 hours ago

Oh nice! I only knew about the chrome one. I might just switch as my current browser is not getting maintained for long anymore

[–] flipflop@sh.itjust.works 3 points 1 day ago (1 children)
[–] marcie@lemmy.ml 2 points 1 day ago (2 children)

While it seems chromium based browsers are able to download pages to view later, it doesnt seem it saves a whole website.

[–] Trent@lemmy.ml 4 points 1 day ago

Kiwix isn't a web browser exactly and doesn't download web pages the way your browser saves them. It uses a specialized file format, and it can be used to back up an entire site. For instance the kiwix library has an offline copy of wikipedia (no images), but it weighs in at more than 100GB last I looked.