Blog/Thoughts
Building a Production Monitoring Stack from Scratch — Part 4: The Enrollment API
Eliminating manual SSH work — and why the solution ended up living inside Grafana.
Building a Production Monitoring Stack from Scratch — Part 3: Loki, Tempo & the Full Observability Picture
Adding log aggregation with Loki and distributed tracing with Tempo — completing the metrics, logs, and traces picture.
Building a Production Monitoring Stack from Scratch — Part 2: Grafana Alloy & the Push vs Pull Problem
Why we replaced individual exporters with Grafana Alloy, why push-based metrics silently broke our alerting, and what it took to figure that out.
When Packets Disappear: Debugging an MTU Mismatch in a Hybrid OpenStack Docker Swarm
A deep dive into networking, resource limits, and automated scaling strategies for Docker Swarm on OpenStack.
🚀 Welcome to My Digital Sandbox!
Joe thinks I’m doing some amazing stuff and, honestly, he wanted a front-row seat to read about it.
Building a Production Monitoring Stack from Scratch — Part 1: Prometheus, Grafana, Node Exporter & AlertManager
How we migrated from Nagios XI to a modern open-source observability stack, and why getting the foundation right mattered more than I expected.