Infrastructure tutorials

Production-grade guides for Linux, servers, security, and performance. Copy-paste commands, multi-distro support, written by engineers who run these systems in production.

monitoring Intermediate

Implement Node.js application monitoring with Prometheus metrics and Grafana dashboards

Set up comprehensive Node.js application monitoring using Prometheus metrics collection with the prom-client library and create custom Grafana dashboards for performance insights and alerting.

45 min 4 distros 21 views
security Advanced

Set up container runtime security with Falco and Sysdig for threat detection

Configure Falco for runtime security monitoring and Sysdig Agent for container visibility to detect threats in Kubernetes environments. Implement behavioral analysis, custom security rules, and threat detection policies for production container workloads.

45 min 4 distros 57 views
monitoring Intermediate

Configure NTP monitoring with Grafana dashboards and Prometheus alerting

Set up comprehensive time synchronization monitoring using chrony, the Prometheus Node Exporter, and custom Grafana dashboards with alerting for time drift and NTP service failures.

25 min 4 distros 76 views
monitoring Intermediate

Configure Prometheus Alertmanager with custom webhook integrations for Slack, Microsoft Teams, and PagerDuty notifications

Set up Prometheus Alertmanager with webhook receivers for Slack, Microsoft Teams, and PagerDuty notifications. Configure routing rules, test notifications, and implement alert escalation workflows for comprehensive monitoring coverage.

45 min 4 distros 202 views
monitoring Advanced

Implement Alertmanager high availability clustering with automatic failover and load balancing

Set up a production-grade Alertmanager cluster with a gossip protocol for high availability, automatic failover, and load balancing. Ensure your monitoring alerts remain operational even when individual nodes fail.

45 min 4 distros 153 views
monitoring Intermediate

Configure Prometheus Alertmanager with Slack integration for team notifications

Set up Prometheus Alertmanager to route alerts to Slack channels with custom notification rules. Configure alerting rules, webhook integration, and team-based routing for production monitoring workflows.

25 min 4 distros 120 views
monitoring Advanced

Configure advanced Grafana dashboards and alerting with Prometheus integration

Build production-ready Grafana dashboards with dynamic variables, custom panels, and sophisticated alert rules. Integrate Prometheus metrics for comprehensive monitoring with multi-condition alerting and notification channels.

45 min 4 distros 466 views
monitoring Intermediate

Configure advanced Uptime Kuma monitoring with API integrations and automated alerting

Set up Uptime Kuma with advanced API endpoints, custom notification channels, and automated monitoring workflows. Configure health check automation, status page customization, and enterprise alerting rules for production environments.

45 min 4 distros 156 views
monitoring Intermediate

Monitor system time drift with Prometheus and Grafana alerts

Set up comprehensive time synchronization monitoring with Prometheus Node Exporter metrics, Grafana dashboards, and automated alerting to catch system clock drift in production environments before it causes problems.

45 min 4 distros 168 views
monitoring Intermediate

Configure Prometheus alerting rules for cgroup metrics monitoring and container resource alerts

Set up comprehensive Prometheus alerting rules for monitoring cgroup resource usage with automated threshold alerts for CPU, memory, and I/O limits. Configure Alertmanager notifications for container resource exhaustion and system health monitoring.

45 min 4 distros 227 views
monitoring Intermediate

Implement Apache log analysis with GoAccess and ELK stack for real-time monitoring

Set up comprehensive Apache log monitoring with GoAccess for real-time analysis and the ELK stack for centralized log processing, then build Kibana dashboards for web server insights and automated alerting.

45 min 4 distros 189 views
monitoring Intermediate

Set up Apache Airflow performance monitoring with Datadog agent integration and custom dashboards

Configure comprehensive Apache Airflow monitoring using the Datadog agent to track DAG performance, task execution metrics, and resource utilization, with custom dashboards and automated alerting for production workflow management.

45 min 4 distros 211 views
