this post was submitted on 13 Nov 2023
1 points (100.0% liked)

Emacs

310 readers
1 users here now

A community for the timeless and infinitely powerful editor. Want to see what Emacs is capable of?!

Get Emacs

Rules

  1. Posts should be emacs related
  2. Be kind please
  3. Yes, we already know: Google results for "emacs" and "vi" link to each other. We good.

Emacs Resources

Emacs Tutorials

Useful Emacs configuration files and distributions

Quick pain-saver tip

founded 1 year ago
MODERATORS
 

Are there any existing plaintext file format to for storing discussion forums posts like this one or ubuntuforums. I want to archive the discussion I like locally. i have been using singlefilez for download the whole page into my machine, but i prefer plaintext formats. When I tried Org-web-tools, it does not seem to properly extract reddit discussion pages for example.

I suppose I can write a scraper and dump content in json format. I'd prefer a plaintext format like org-mode and was designed with some thought put into this, instead of me cobbling something together.

you are viewing a single comment's thread
view the rest of the comments
[–] rafulafu@alien.top 1 points 10 months ago

I suppose I can write a scraper and dump content in json format.

No need, reddit already provides their data in JSON form. Generally just append .json at the end of the URL and you get your JSON, for example

https://www.reddit.com/r/emacs/comments/17u00j0/extracting_forums_posts_like_reddit_discussions/

->

https://www.reddit.com/r/emacs/comments/17u00j0/extracting_forums_posts_like_reddit_discussions.json