Prometheus: Infrastructure monitoring ๐
Warning system about problems before they occur.
What controls:
- ๐ฅ๏ธ Server metrics: CPU, RAM, disk, network via node_exporter
- ๐ Service availability: blackbox-HTTP/TCP/ICMP checks
- ๐ Collecting metrics from applications: Nextcloud, Home Assistant, and others
- ๐จ Alertmanager: notifications in Telegram/Discord when thresholds are exceeded
- ๐ Powerful queries via PromQL for deep analysis
How it works:
- Prometheus collects metrics on a schedule (scrap)
- Do you see graphs in Grafana or set up your dashboards
- In case of an anomaly, an alert is triggered and the administrator receives a notification.
For administrators: Flexible alert rules, recording rules, federation of metrics, long-term storage via Thanos.
Access: via Grafana (grafana.potatoenergy.ru) โข according to Potato Energy credentials (management is only based on the rights of the admin group)