this post was submitted on 20 Jun 2023
9 points (100.0% liked)

Selfhosted

40246 readers
845 users here now

A place to share alternatives to popular online services that can be self-hosted without giving up privacy or locking you into a service you don't control.

Rules:

  1. Be civil: we're here to support and learn from one another. Insults won't be tolerated. Flame wars are frowned upon.

  2. No spam posting.

  3. Posts have to be centered around self-hosting. There are other communities for discussing hardware or home computing. If it's not obvious why your post topic revolves around selfhosting, please include details to make it clear.

  4. Don't duplicate the full text of your blog or github here. Just post the link for folks to click.

  5. Submission headline should match the article title (don’t cherry-pick information from the title to fit your agenda).

  6. No trolling.

Resources:

Any issues on the community? Report it using the report flag.

Questions? DM the mods!

founded 1 year ago
MODERATORS
 

Are there any Discord servers or somewhere in the Matrix to chat about hosting a Lemmy instance? I've got Lemmy running, but I think there are several of us in the same boat struggling with federation performance issues and it might be good to have some place to chat real time.

you are viewing a single comment's thread
view the rest of the comments
[–] chiisana@lemmy.chiisana.net 1 points 1 year ago (1 children)

If you look here: https://lemmy.world/comment/65982

At least specs and capacity wise, it doesn't suggest it is hitting a wall.

The more I dug into things, the more I think the limitation comes from an age old issue in that if your service is expected to connect to a lot of flakey destinations, you're not going to be in for a good time. I think the big instance backend is trying to send federation event messages, and a bunch of smaller federated destinations have shuttered (because they're not getting all the messages, so they just go and sign up on the big instances to see everything), which results in the big instances' out going connection have to wait for timeout and/or discover the recipient is no longer available, which results in a backed up queue of messages to send out.

When I posted a reply to myself on lemmy.world, it took 17 seconds to reach my instance (hosted in a data centre w/ sub 200ms ping to lemmy.world itself, so not a network latency issue here), which exceeds the 10 seconds limit per defined by Lemmy. Increasing it on the application protocol level won't help, because as more small instances come up, they too would also like to subscribe to big hubs, which will just further exacerbate the lag.

I think the current implementation is very naive and can scale a bit, but will likely be insufficient as the fediverse grows, not as the individual instance's user grows. That is, the bottle neck will not so much be "this can support instance up to 100K users" but rather "now that there's 100K users, we'd also have 50K servers trying to federate with us". And to work around that, you're going to need a lot more than Postgres horizontal scaling... you'd need message buses and workers that can ensure jobs (i.e.: outward federation) can be sent effectively.

[–] xtremeownage@lemmyonline.com 0 points 1 year ago* (last edited 1 year ago) (1 children)

That is a VERY small server....

MY server, has 32 cores, 64 threads, 256G of ram, and 130T of storage (4T of which is NVMe)

Sheesh, that is prob why that instance is dragging!!

https://lemmy.world/post/56228

[–] chiisana@lemmy.chiisana.net 0 points 1 year ago (1 children)

They've bumped the server much more than the original posted VM. I was pointing to the zabbix charts and actual usage. Notice CPU is sub 20%, and the network usage being sub 200Mbits. There's plenty of headroom.

[–] xtremeownage@lemmyonline.com 0 points 1 year ago (1 children)

I found the newest link- https://lemmy.world/comment/379405

Ok, that is a pretty sizable chunk of hardware.

[–] chiisana@lemmy.chiisana.net 0 points 1 year ago (1 children)

I care less about what it is running on, but what is consumed. At sub 20% usage, it really doesn't matter what the hardware is, because the overall spec is not the bottle neck.

[–] xtremeownage@lemmyonline.com 1 points 1 year ago* (last edited 1 year ago)

Your original link is from 9 days ago, before the massive surge hit.

https://lemmy.world/post/56228 Came 8 days ago, with reports of it being pretty well saturated.

Remember- the big surge, is in the last 3-4 days.

Fediverse stats: https://fediverse.observer/dailystats

In the last 4 days, they have went up over 400% in size.