Infrastructure tutorials

Production-grade guides for Linux, servers, security, and performance, with copy-paste commands and multi-distro support, written by engineers who run these systems in production.

monitoring Intermediate

Implement Node.js application monitoring with Prometheus metrics and Grafana dashboards

Instrument a Node.js application with the prom-client library to expose Prometheus metrics, then build custom Grafana dashboards for performance insights and alerting.

45 min 4 distros 37 views
monitoring Intermediate

Configure NTP monitoring with Grafana dashboards and Prometheus alerting

Set up comprehensive time synchronization monitoring using chrony, the Prometheus Node Exporter, and custom Grafana dashboards with alerting for time drift and NTP service failures.

25 min 4 distros 87 views
monitoring Intermediate

Configure Jaeger alerting with Prometheus and Grafana for distributed tracing observability

Set up comprehensive monitoring and alerting for Jaeger distributed tracing using Prometheus metrics collection and Grafana dashboards with automated alerts for trace performance issues.

45 min 4 distros 73 views
monitoring Intermediate

Configure MariaDB 11.6 performance monitoring with Prometheus and Grafana dashboards

Set up comprehensive MariaDB monitoring with mysqld_exporter, Prometheus metrics collection, and custom Grafana dashboards for database performance insights and alerting.

35 min 4 distros 103 views
monitoring Intermediate

Configure PM2 log rotation and advanced monitoring with Prometheus integration

Set up comprehensive PM2 monitoring with automated log rotation, Prometheus metrics, cluster mode management, and production-ready health checks for Node.js applications.

45 min 4 distros 79 views
monitoring Intermediate

Monitor Caddy and Consul integration with Prometheus and Grafana dashboards

Set up comprehensive monitoring for Caddy reverse proxy and Consul service discovery with Prometheus metrics collection and Grafana dashboards for performance insights and alerting.

45 min 4 distros 64 views
security Advanced

Configure CockroachDB SSL encryption and security hardening

Set up SSL/TLS encryption, authentication, and advanced security policies for CockroachDB 24.3 in production environments with certificate management and cluster monitoring.

45 min 4 distros 97 views
monitoring Intermediate

Configure MySQL backup monitoring with Prometheus alerts and Grafana dashboards

Set up automated monitoring for MySQL backups with Prometheus metrics collection, alerting rules for backup failures, and comprehensive Grafana dashboards for backup status visualization.

45 min 4 distros 88 views
monitoring Intermediate

Configure Prometheus Alertmanager with custom webhook integrations for Slack, Microsoft Teams, and PagerDuty notifications

Set up Prometheus Alertmanager with webhook receivers for Slack, Microsoft Teams, and PagerDuty notifications. Configure routing rules, test notifications, and implement alert escalation workflows for comprehensive monitoring coverage.

45 min 4 distros 217 views
monitoring Advanced

Implement Alertmanager high availability clustering with automatic failover and load balancing

Set up a production-grade Alertmanager cluster that uses the gossip protocol for high availability, automatic failover, and load balancing, so that alert delivery keeps working even when individual nodes fail.

45 min 4 distros 174 views
databases Advanced

Set up CockroachDB backup and disaster recovery automation with systemd timers and monitoring

Configure automated backup strategies for CockroachDB with systemd timers, implement comprehensive disaster recovery procedures, and set up monitoring with Prometheus and Grafana for production-grade database infrastructure.

90 min 4 distros 105 views
monitoring Intermediate

Configure Prometheus Alertmanager with Slack integration for team notifications

Set up Prometheus Alertmanager to route alerts to Slack channels with custom notification rules. Configure alerting rules, webhook integration, and team-based routing for production monitoring workflows.

25 min 4 distros 141 views
