this post was submitted on 22 Oct 2023

How do you all monitor your server performance? (alien.top)

submitted 2 years ago by Michaelscarn69-@alien.top to c/main@selfhosted.forum

As in, when I watch YouTube tutorials, I often see YouTubers with a small widget on their desktop giving them an overview of their RAM usage, security level, etc. What apps do you all use to track this?

(page 2) 42 comments

[–] Large_Yams@alien.top 1 points 2 years ago

I don't track their performance, I just track if they're up or down.

I use Uptime Kuma running on the free tier of fly.io, so I can tell if my cluster has had a catastrophic failure. There's no point in running the alerting system on the same system it monitors.

[–] LumePart@alien.top 1 points 2 years ago

Zabbix for hardware, certificate monitoring

Prometheus for service monitoring (e.g. how many people are actually using my Jellyfin server, so I know if I need to scale, etc.)

[–] maximus459@alien.top 1 points 2 years ago

Observium..

If it's just one server, Netdata is a better option..

[–] roh4@alien.top 1 points 2 years ago

First for PRTG.

[–] xupetas@alien.top 1 points 2 years ago

Nagios for service/QoS, Grafana for dashboarding some more specific items. I'm planning to eventually switch to Zabbix, but Nagios is so simple that I have a hard time justifying moving over 400 monitored services to it.

[–] BouncyPancake@alien.top 1 points 2 years ago

If it's down, I assume performance is bad

[–] thibmaek@alien.top 1 points 2 years ago

Quick checks: Proxmox dashboard, htop or glances, Portainer

Extensive monitoring: Prometheus (node-exporter), Rsyslog server, Loki, Grafana, Uptime Kuma, Alertmanager (via Gotify)

[–] __aa__aa@alien.top 1 points 2 years ago

I literally tried them all. Nagios is the best one

[–] weilah_@alien.top 1 points 2 years ago

Uptime Kuma for my services

Netdata + Prometheus + Grafana for server health (alerts and visualization)

[–] 2000nesman@alien.top 1 points 2 years ago

Prometheus and Grafana

[–] Nasach@alien.top 1 points 2 years ago

I use Netdata for both dashboards and alerts. Works great and is easy to set up.

[–] lunakoa@alien.top 1 points 2 years ago

It's not well liked, but I use Nagios Core for alerts and then jump to Grafana, which has Prometheus, InfluxDB, and MySQL backends, for trends like CPU usage, hard drive temps, etc.

[–] weller_rocks@alien.top 1 points 2 years ago

easiest by far to set up, plenty of metrics

https://www.netdata.cloud/

[–] Dogeek@alien.top 1 points 2 years ago

Oh lord, I have so much info to give! For the setup, it's running on Kubernetes 1.28.2, so YMMV. My monitoring stack is:

Grafana -- Dashboards
Alertmanager -- Alerting
Prometheus -- Time series Database
Loki -- Logs database
Promtail -- Log collector
Mimir -- Long-term metrics storage
Tempo -- Distributed tracing (think Datadog APM, but with Grafana); lets you track requests through a network of services, invaluable for linking your reverse proxy to your apps, to your SSO, to your database...
SMTP Relay -- A homemade SMTP relay that eases setting up mail alerts, allows me to push mail through mailjet using my domain
Node-exporter -- exports metrics for the server
Exportarr -- exports metrics for sonarr/radarr etc
pihole-exporter -- exports pihole metrics for prometheus scraping
smart-exporter -- exports S.M.A.R.T metrics (for HDD health)
ntfy -- for notifications to my phone (other than mail)

The rest is pretty much the same: if a service exports Prometheus metrics by default, I use that and write a ServiceMonitor and a Service manifest for it. It usually looks like this:

apiVersion: monitoring.coreos.com/v1
kind: ServiceMonitor
metadata:
  name: traefik
  labels:
    app.kubernetes.io/component: traefik
    app.kubernetes.io/instance: traefik
    app.kubernetes.io/managed-by: kustomize
    app.kubernetes.io/name: traefik
    app.kubernetes.io/part-of: traefik
spec:
  selector:
    matchLabels:
      app.kubernetes.io/name: traefik-metrics
  endpoints:
  - port: metrics
    interval: 30s
    path: /metrics
    scheme: http
    tlsConfig:
      insecureSkipVerify: true
  namespaceSelector:
    matchNames:
    - traefik

---
apiVersion: v1
kind: Service
metadata:
  name: traefik-metrics
  namespace: traefik
  labels:
    app.kubernetes.io/name: traefik-metrics
spec:
  type: ClusterIP
  ports:
    - protocol: TCP
      name: metrics
      port: 8082
  selector:
    app.kubernetes.io/name: traefik

If the app doesn't include a Prometheus endpoint, I just find an existing exporter for it. Most popular apps have one, along with ready-made Grafana dashboards.

For alerting, I create a PrometheusRule object with the Prometheus query and the message to send me. Depending on the severity, it's either a mail (low to medium severity incidents) or a phone notification (high severity). I try to keep mails and notifications to a minimum: only load, CPU, RAM, and potential SMART errors give me alerts.
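
To make the alerting description above concrete, here is a minimal sketch of what such a PrometheusRule might look like. It is not the manifest actually used in this setup. The object name, namespace, `release` label, thresholds, and `severity` values are all assumptions for illustration. The metrics are standard node-exporter ones.

```yaml
# Sketch only: names, thresholds, and label values are assumptions.
apiVersion: monitoring.coreos.com/v1
kind: PrometheusRule
metadata:
  name: node-alerts            # hypothetical name
  namespace: monitoring        # hypothetical namespace
  labels:
    release: prometheus        # assumption: must match your Prometheus ruleSelector
spec:
  groups:
    - name: node.rules
      rules:
        - alert: HostLowMemory
          # Fires when less than 10% of RAM has been available for 10 minutes.
          expr: node_memory_MemAvailable_bytes / node_memory_MemTotal_bytes < 0.10
          for: 10m
          labels:
            severity: low      # e.g. routed to mail by Alertmanager
          annotations:
            summary: "Low memory on {{ $labels.instance }}"
        - alert: HostHighCpuUsage
          # Average non-idle CPU above 90% for 15 minutes.
          expr: 100 - (avg by (instance) (rate(node_cpu_seconds_total{mode="idle"}[5m])) * 100) > 90
          for: 15m
          labels:
            severity: high     # e.g. routed to a phone notification
          annotations:
            summary: "High CPU usage on {{ $labels.instance }}"
```

The `severity` label is what an Alertmanager route would typically match on to pick mail or a phone notification. The `for:` durations are there to keep short spikes from generating noise.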

[–] Olleye@alien.top 1 points 2 years ago

Use PRTG; it's free for up to 100 sensors.

Best Monitoring tool ever ☝🏻🙂

[–] chuchodavids@alien.top 1 points 2 years ago

None. There is no need for a performance monitor for my home lab. I just have an alert if one of my main three services is down. That is all I need.

[–] servergeek82@alien.top 1 points 2 years ago

Glances, Uptime Kuma, and a back-end script that restarts a service if it's down. If that doesn't work, I get a notification via Gotify. Simple and sweet

[–] ElevenNotes@alien.top 1 points 2 years ago

Netdata, monitoring a few thousand servers (virtual) that way.

[–] Savancik@alien.top 1 points 2 years ago

Girlfriend first, Alertmanager second. Girlfriend is usually faster.

[–] bobbarker4444@alien.top 1 points 2 years ago

I just check the Proxmox dashboard every now and then. Honestly, if everything is working, I'm not too worried about exact RAM levels at any given moment

[–] Pesfreak92@alien.top 1 points 2 years ago (2 children)

Uptime Kuma and Grafana. Uptime Kuma to monitor whether a service is up and running, and Grafana to monitor host metrics like CPU, RAM, SSD usage, etc.

[–] Michaelscarn69-@alien.top 1 points 2 years ago

Thank you for this. I appreciate the support.

[–] Reasonable-Ladder300@alien.top 1 points 2 years ago

Same here. I also have some autoscaling mechanisms set up in Docker Swarm to scale certain services when the load is high

[–] jln_brtn@alien.top 1 points 2 years ago (3 children)

Nobody mentioned htop 🤔

[–] thekrautboy@alien.top 1 points 2 years ago

htop is a selfhosted service?

[–] thekrautboy@alien.top 1 points 2 years ago

Just to make sure: You are aware that a search option here exists, yes? And you keep refusing to use it for whatever reason?

[–] The_Axelander@alien.top 1 points 2 years ago

I use Checkmk with notifications to a Telegram bot
