Infrastructure tutorials

Production-grade guides for Linux, servers, security and performance. Copy-paste commands, multi-distro support, written by engineers who run this in production.

devops Intermediate

Set up advanced Airflow alerting with Slack integration and custom notification rules

Configure Apache Airflow to send intelligent alerts to Slack channels when DAGs fail, with custom notification rules based on task importance and execution context. Includes automated retry logic and escalation workflows.

45 min 4 distros 198 views
security Advanced

Configure audit compliance automation with Ansible for security monitoring and reporting

Automate security audit compliance with Ansible playbooks that enforce security policies, monitor system configurations, and generate compliance reports across your Linux infrastructure fleet.

45 min 4 distros 108 views
monitoring Intermediate

Set up Kubernetes monitoring with Prometheus Operator and custom metrics

Deploy a production-grade monitoring stack with Prometheus Operator, configure ServiceMonitor resources for automatic scraping, and create custom alerting rules with Grafana dashboards for comprehensive Kubernetes cluster observability.

45 min 4 distros 119 views
monitoring Advanced

Configure advanced Grafana dashboards and alerting with Prometheus integration

Build production-ready Grafana dashboards with dynamic variables, custom panels, and sophisticated alert rules. Integrate Prometheus metrics for comprehensive monitoring with multi-condition alerting and notification channels.

45 min 4 distros 476 views
security Intermediate

Set up audit log analysis dashboard with Grafana and Prometheus for security monitoring

Build a comprehensive security monitoring dashboard that collects Linux audit logs through auditd, exports metrics to Prometheus, and visualizes security events in Grafana with automated alerting for suspicious activities.

45 min 4 distros 102 views
linux Intermediate

Implement backup rotation policies with automated cleanup using systemd timers and shell scripts

Set up automated backup rotation with systemd timers to manage storage space, implement retention policies for different backup types, and create monitoring alerts for backup health and cleanup processes.

45 min 4 distros 159 views
security Advanced

Configure Linux audit system with SIEM integration for Splunk security monitoring

Set up auditd with custom security rules, configure Splunk Universal Forwarder for log shipping, and implement real-time monitoring workflows for comprehensive security event tracking and compliance reporting.

45 min 4 distros 181 views
devops Advanced

Set up Kubernetes persistent volume snapshots and backup automation

Configure CSI snapshot controllers, create persistent volume snapshots, and implement automated backup strategies with Velero for production Kubernetes environments.

45 min 4 distros 140 views
databases Intermediate

Implement ClickHouse backup automation with compression and S3 integration

Set up automated ClickHouse backups with compression, S3 storage, and systemd timers. Includes monitoring, encryption, and recovery procedures for production environments.

45 min 4 distros 131 views
monitoring Intermediate

Install and configure Filebeat 8.15 for efficient log shipping to ELK stack

Set up Filebeat 8.15 to collect and ship logs from multiple sources to Elasticsearch. Configure SSL/TLS, tune performance, and set up monitoring for production-grade log aggregation.

45 min 4 distros 125 views
monitoring Intermediate

Set up Alertmanager with email and Slack notifications for monitoring alerts

Configure Prometheus Alertmanager with SMTP email and Slack webhook integrations for monitoring notifications. Set up alert routing rules and test notification delivery.

25 min 4 distros 125 views
hosting Intermediate

Configure OpenResty load balancing with upstream health checks and automatic failover

Set up OpenResty with intelligent load balancing across multiple backend servers, health monitoring, and automatic failover to maintain high availability for web applications.

25 min 4 distros 106 views
