Infrastructure tutorials

Production-grade guides for Linux, servers, security and performance. Copy-paste commands, multi-distro support, written by engineers who run this in production.

monitoring Intermediate

Implement Node.js application monitoring with Prometheus metrics and Grafana dashboards

Set up comprehensive Node.js application monitoring using Prometheus metrics collection with the prom-client library and create custom Grafana dashboards for performance insights and alerting.

45 min 4 distros 21 views
monitoring Intermediate

Configure NTP monitoring with Grafana dashboards and Prometheus alerting

Set up comprehensive time synchronization monitoring using chrony, Prometheus node exporter, and custom Grafana dashboards with alerting for time drift and NTP service failures.

25 min 4 distros 76 views
monitoring Intermediate

Configure Jaeger alerting with Prometheus and Grafana for distributed tracing observability

Set up comprehensive monitoring and alerting for Jaeger distributed tracing using Prometheus metrics collection and Grafana dashboards with automated alerts for trace performance issues.

45 min 4 distros 63 views
monitoring Intermediate

Configure MariaDB 11.6 performance monitoring with Prometheus and Grafana dashboards

Set up comprehensive MariaDB monitoring with mysqld_exporter, Prometheus metrics collection, and custom Grafana dashboards for database performance insights and alerting.

35 min 4 distros 90 views
monitoring Intermediate

Configure PM2 log rotation and advanced monitoring with Prometheus integration

Set up comprehensive PM2 monitoring with automated log rotation, Prometheus metrics, cluster mode management, and production-ready health checks for Node.js applications.

45 min 4 distros 69 views
monitoring Intermediate

Monitor Caddy and Consul integration with Prometheus and Grafana dashboards

Set up comprehensive monitoring for Caddy reverse proxy and Consul service discovery with Prometheus metrics collection and Grafana dashboards for performance insights and alerting.

45 min 4 distros 54 views
security Advanced

Configure CockroachDB SSL encryption and security hardening

Set up SSL/TLS encryption, authentication, and advanced security policies for CockroachDB 24.3 in production environments with certificate management and cluster monitoring.

45 min 4 distros 87 views
monitoring Intermediate

Configure MySQL backup monitoring with Prometheus alerts and Grafana dashboards

Set up automated monitoring for MySQL backups with Prometheus metrics collection, alerting rules for backup failures, and comprehensive Grafana dashboards for backup status visualization.

45 min 4 distros 78 views
monitoring Intermediate

Configure Prometheus Alertmanager with custom webhook integrations for Slack, Microsoft Teams, and PagerDuty notifications

Set up Prometheus Alertmanager with webhook receivers for Slack, Microsoft Teams, and PagerDuty notifications. Configure routing rules, test notifications, and implement alert escalation workflows for comprehensive monitoring coverage.

45 min 4 distros 202 views
monitoring Advanced

Implement Alertmanager high availability clustering with automatic failover and load balancing

Set up a production-grade Alertmanager cluster with gossip protocol for high availability, automatic failover, and load balancing. Ensure your monitoring alerts remain operational even when individual nodes fail.

45 min 4 distros 162 views
databases Advanced

Set up CockroachDB backup and disaster recovery automation with systemd timers and monitoring

Configure automated backup strategies for CockroachDB with systemd timers, implement comprehensive disaster recovery procedures, and set up monitoring with Prometheus and Grafana for production-grade database infrastructure.

90 min 4 distros 95 views
monitoring Intermediate

Configure Prometheus Alertmanager with Slack integration for team notifications

Set up Prometheus Alertmanager to route alerts to Slack channels with custom notification rules. Configure alerting rules, webhook integration, and team-based routing for production monitoring workflows.

25 min 4 distros 130 views

Need help?

Don't want to manage this yourself?

We handle infrastructure for businesses that depend on uptime. From initial setup to ongoing operations.

Talk to an engineer