this post was submitted on 30 Jan 2026
75 points (91.2% liked)

Selfhosted

55462 readers
1233 users here now

A place to share alternatives to popular online services that can be self-hosted without giving up privacy or locking you into a service you don't control.

Rules:

  1. Be civil: we're here to support and learn from one another. Insults won't be tolerated. Flame wars are frowned upon.

  2. No spam posting.

  3. Posts have to be centered around self-hosting. There are other communities for discussing hardware or home computing. If it's not obvious why your post topic revolves around selfhosting, please include details to make it clear.

  4. Don't duplicate the full text of your blog or github here. Just post the link for folks to click.

  5. Submission headline should match the article title (don’t cherry-pick information from the title to fit your agenda).

  6. No trolling.

  7. No low-effort posts. This is subjective and will largely be determined by the community member reports.

Resources:

Any issues on the community? Report it using the report flag.

Questions? DM the mods!

founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] Armillarian@pawb.social 1 points 2 days ago (1 children)

I think its better if their github mention the minimum token count requirement to selfhost this. I don't think it will ever reach something usable for normal selfhost user.

Based on your statement i think most of your experience come from corporate AI usage... Which deploy multiple agent system in their AI and hosted in large data center.

I do selfhost my own, and even tried my hand at building something like this myself. It runs pretty well, I'm able to have it integrate with HomeAssistant and kubectl. It can be done with consumer GPUs, I have a 4000 and it runs fine. You don't get as much context, but it's about minimizing what the LLM needs to know while calling agents. You have one LLM context that's running a todo list, you start a new one that is charge of step 1, which spins off more contexts for each subtask, etc. It's not that each agent needs it's own GPU, it's that each agent needs it's own context.