← All projects

SecureBytes NOC Stack

Active March 23, 2026 · 5 min read

Internal observability stack — Grafana dashboards over Prometheus with Node Exporter on every host. Sixty-second scrape interval, push notifications via ntfy.

NOC stack observability pipeline
60s
scrape interval
12
dashboard panels
1
consolidated view

What it covers

Host layer
Uptime, CPU, memory, disk, and temperature for every node and container. Status rows turn green when a host responds to scrape, red when it does not.
Resource layer
Memory used and percentage, disk at root, stacked side-by-side. Comparing lightweight LXCs against heavy compute boxes is a glance, not a calculation.
Network layer
Inbound and outbound throughput per host, plus separate panels for LAN traffic and WAN egress. Useful for catching a runaway service before it shows up as a complaint.
Host status, uptime, CPU, temperature panels Memory, disk, traffic panels Disk trends, per-host traffic

Node Exporter on everything, no exceptions

Cost is ~10MB of RAM per host. The value is that nothing is invisible when something behaves weirdly. Curating what's worth monitoring up-front is premature optimization for a homelab.

60-second scrape interval

Faster intervals sound better but quintuple storage cost without changing how the data is actually used. 60s is fine for everything except acute incidents, and during those you're on the host with top.

One consolidated dashboard

Separating by category broke the muscle memory — every glance became a navigation problem. One dashboard, scroll to find what you need. Boring, but it works.

Roadmap

Loki for log aggregation — traces, metrics, and logs on one query surface
Alertmanager rules tuned by severity — currently every alert fires at the same level
Sanitized dashboard JSON pushed to a clean public repo

Stack

GrafanaPrometheusNode ExporterntfyDocker Compose
← Previous
Tailscale Zero-Trust Access
Next →
Network Automation Toolkit