overview for farcaller

Building my own log aggregation and search server in c/selfhosted@lemmy.world

[–] farcaller@fstab.sh 2 points 3 days ago (1 children)

That's a good point. Mind that in most production environments you'd be firewalled rather hard (especailly when it comes to logs processing which oftentimes ends up having PII). I wouldn’t trust any service that tries to use DoT or DoH in there that I couldn’t snoop on. Many deployments nowadays allow you to "punch" firewall holes based on the outgoing dns requests to an allowlisted domain, so chances are you actually want to use the glibc resolver and not try to be fancy.

That said, smaller images are always good in my book!

Building my own log aggregation and search server in c/selfhosted@lemmy.world

[–] farcaller@fstab.sh 7 points 3 days ago (3 children)

You’re nailing your goal then!

I would still steer you slightly towards documenting your architectural decisions more. It's a good skill to have and will help you in a long run.

You have dozens of crate dependencies and only you know why they are in there. A high-level document on how your system interconnects and how the algorithms under the hood work will be a huge help to anyone who comes looking through your source code. We become better programmers not by reading the source code, but by understanding what it actually does.

Here's a random trivia: your server depends on trust-dns-resolver. Why? Why wasn’t the stock resolver enough? Is that a design choice or you just wanted to have fun? There is no wrong answer but without the design notes it's hard to figure your intent.

Building my own log aggregation and search server in c/selfhosted@lemmy.world

[–] farcaller@fstab.sh 24 points 3 days ago (5 children)

This looks nice, but there's plenty free alternatives in this space which warrants a section in the readme with the comparison to other products.

You mention ram usage, but it’s oftentimes a product of event size. Based on your numbers, your average event size is about 800 bytes. Let’s call it 1kb. That’s one million events per day. It’s surely sounds more promising than Elastic, but not reaching Loki numbers, or, if you focus on efficiency, is way behind Victoriametrics Logs (based on peeking at their benches).

I think the important bits you need to add is how you store the logs (i.e. which indices you build) and what are your trade-offs. Grep is an efficient logs processor which barely uses any ram but incurs dramatic I/O costs, after all.

Enterprises will be looking at different numbers and they have lots of SaaS products to choose from. Homelab users are absolutely your target audience and you can have it by making a better UI than the alternative (victoriametrics logs aren’t that comfortable to work with) or making resource usage lower (people run k8s clusters on RPis, they sure wonder about every megabyte of ram lost) or making the deployment easier (fire and forget, and when you come to it, it works).

It sounds like lots of things and I don’t want to be discouraging. What you started there is really nice-looking. Good job!

Is there a way to guarantee a mobile device or tablet can only access my own services and block all other traffic? in c/selfhosted@lemmy.world

[–] farcaller@fstab.sh 6 points 4 days ago (3 children)

You can enforce an always-on VPN (for at least ipsec) via an MDM profile. This kind of features isn’t found in the casual user setup options, but there's plenty of knobs to tune in the enterprise profile configurator.

And yes, you can easily install that profile on your phone after.

Question: self-hosted instant messanger. in c/selfhosted@lemmy.world

[–] farcaller@fstab.sh 1 points 4 days ago

It is pretty bad. After this thread I tried using Element X again only to learn that its "favorites" aren’t the same as Element's "favorites" and more so you can’t set someone a favorite in E-X, at least not of your server is Conduit. It's just silently ignored.

What are your AI use cases? in c/technology@lemmy.world

[–] farcaller@fstab.sh 1 points 4 days ago

Local models are really good at tokenizing the text and figuring the intent in the user input. Not perfect, but much better than any possible regexps you can think of. And it's a trivial operation you can run even on a CPU model.

RAID array filesystem recommendations in c/selfhosted@lemmy.world

[–] farcaller@fstab.sh 3 points 5 days ago

I would absolutely recommend a file system with snapshot capabilities for a home server. One of btrfs mirror, dm-raid (raid5) with btrfs, or zfs would work. The practical differences would be negligible at this scale and you can just pick whatever you fancy.

Question: self-hosted instant messanger. in c/selfhosted@lemmy.world

[–] farcaller@fstab.sh 2 points 6 days ago (3 children)

I’ve been having sync issues with conduit lately, takes minutes for the mobile app to catch up. No way to purge old media, or to use something S3-compatible for its storage either.

Also, element x doesn’t support spaces, so if you want to bridge other chats into matrix they all are going to be messed up together.

I like matrix as a concept, but both servers and clients are in a bit of a shitshow state (same as xmpp was years ago).

Dropbox is laying off 20% of its staff in c/technology@lemmy.world

[–] farcaller@fstab.sh 1 points 2 weeks ago

The windows client does, yes. But I’ve found that to be fragile on occasions.

Dropbox is laying off 20% of its staff in c/technology@lemmy.world

[–] farcaller@fstab.sh 14 points 2 weeks ago (6 children)

Technically, it does have a windows client. It's just in various states of being broken.

How annoying is it to connect to VPN/use Tailscale instead of being able to access the service directly? in c/selfhosted@lemmy.world

[–] farcaller@fstab.sh 6 points 2 weeks ago

For the last 10 days tailscale clocked 1% battery on my phone. I honestly didn’t even consider turning it off for battery savings.

*Permanently Deleted* in c/selfhosted@lemmy.world

[–] farcaller@fstab.sh 5 points 3 weeks ago

If tailscale inside a container allows you to talk to it via “direct” connection and not a derp proxy, then it will offer you better service isolation (can set the tailscale ACLs for this specific service) without sacrificing performance.

Tailscale pushes for it because it just ties you in more. It allows to to utilize the ACLs better, to see your thing in their service mesh, and every service will count against the free node limit.

In practice, I often do both. E.g. I’ll have my http ingress exposed to tailscale and route a bunch of different services through it at a single tailscale node, where the access control is done by services individually. But I’ll also run a pod-to-pod tailscale between two k8s clusters because tailscale ACL is just convenient.

7

How to split short term and long term VictoriaMetrics storage (farcaller.net)

submitted 5 months ago by farcaller@fstab.sh to c/selfhosted@lemmy.world

1 comments fedilink

I finally got to cleaning up the metrics in my homelab and researched the means to separate my long-term and short-term data. This way you can scrape all kinds of noisy sources (e.g. kubernetes) while having a separate store for things you want to observe on longer time windows (months and years). The best thing? It's transparent for grafana and the like, so you can keep all your dashboards intact.

24

Self-hosted alternative to synology drive? (fstab.sh)

submitted 5 months ago by farcaller@fstab.sh to c/selfhosted@lemmy.world

16 comments fedilink

I moved off a Synology NAS to a self-managed machine and one thing I still struggle to replace is something like a synology drive. Here are my requirements:

server side store data in a plain FS (I want transparency)
client side (windows), it must support VFS (download files when needed, support offloading of large files)
having snapshots of data is a must

I have a 40gbit uplink to my desktop, so if everything else fails I’ll just use samba with zfs snapshots exposed to VSS, but we’re talking some large files still (think several hundreds of MBs) and I’m not sure Blender will be happy working off a network disk.

I’ve been pointed to next/own-cloud previously, but they don’t seem to cover my use case, I think. Should I actually try one of those? I browsed around owncloud's storage bit (which is written in go), and it seems mostly fitting, but I’ve been told I should steer away from ownCloud towards nextCloud.

36

Why fediverse clients reinvent the C2S APIs and don't use ActivityPub? (fstab.sh)

submitted 1 year ago by farcaller@fstab.sh to c/fediverse@lemmy.world

8 comments fedilink

I’m reading the ActivityPub spec here and it seems pretty fit for client-to-server communications. Yeah, it might be somewhat bulkier than your typical rest api, but it's more universal, which begs the question: why do mastodon and lemmy both decided to implement custom (and incompatible) APIs for their clients to talk to the servers? Wouldn’t it be more straightforward if e.g. my voyager app talked ActivityPub to lemmy.world which then talked ActivityPub to lemmy.ml or something.

What am I missing?

1

I made a lemmy community directory (fstab.sh)

submitted 1 year ago* (last edited 1 year ago) by farcaller@fstab.sh to c/fediverse@lemmy.world

0 comments fedilink

I wasn't sure how to find the communities I'm interested in, so I quickly hacked together a scraper that makes a list of all the communities(1) of all the servers mine is federating to(2).

You can find it (with a very trivial UI) at directory.fstab.sh. Hover over the link to see the description. Use the search bar to search by text.

Is this something useful or there was a better way to do the same?

(1) it does its best to scrape them all but incidents might happen
(2) updated nightly