Infrastructure tutorials

Production-grade guides for Linux, servers, security and performance. Copy-paste commands, multi-distro support, written by engineers who run this in production.

monitoring Intermediate

Set up Prometheus and Grafana monitoring stack with Docker Compose

Deploy a complete Prometheus and Grafana monitoring solution using Docker Compose with persistent storage, custom dashboards, and Alertmanager integration for production-ready observability.

25 min 4 distros 38 views
monitoring Intermediate

Set up Telegraf custom plugins for application monitoring with Prometheus and InfluxDB integration

Learn to build custom Telegraf input plugins for application metrics collection, configure dual output to Prometheus and InfluxDB backends, and create comprehensive monitoring dashboards with Grafana for production observability.

45 min 4 distros 42 views
devops Intermediate

Setup advanced Airflow alerting with Slack integration and custom notification rules

Configure Apache Airflow to send intelligent alerts to Slack channels when DAGs fail, with custom notification rules based on task importance and execution context. Includes automated retry logic and escalation workflows.

45 min 4 distros 51 views
monitoring Intermediate

Set up Kubernetes monitoring with Prometheus Operator and custom metrics

Deploy a production-grade monitoring stack with Prometheus Operator, configure ServiceMonitor resources for automatic scraping, and create custom alerting rules with Grafana dashboards for comprehensive Kubernetes cluster observability.

45 min 4 distros 33 views
devops Advanced

Implement Istio observability with Jaeger tracing and Kiali dashboard for Kubernetes service mesh

Configure comprehensive observability for your Kubernetes service mesh with Jaeger distributed tracing, Kiali visualization, and Prometheus metrics integration. Get complete visibility into microservice communication patterns, performance bottlenecks, and service dependencies.

45 min 4 distros 56 views
linux Intermediate

Implement backup rotation policies with automated cleanup using systemd timers and shell scripts

Set up automated backup rotation with systemd timers to manage storage space, implement retention policies for different backup types, and create monitoring alerts for backup health and cleanup processes.

45 min 4 distros 48 views
performance Advanced

Optimize OpenResty performance with advanced caching strategies and Lua optimization

Configure high-performance caching with Redis integration, implement microcaching strategies, and optimize Lua scripts for memory management in production OpenResty deployments.

45 min 4 distros 51 views
databases Intermediate

Implement ClickHouse backup automation with compression and S3 integration

Set up automated ClickHouse backups with compression, S3 storage, and systemd timers. Includes monitoring, encryption, and recovery procedures for production environments.

45 min 4 distros 55 views
monitoring Intermediate

Install and configure Filebeat 8.15 for efficient log shipping to ELK stack

Set up Filebeat 8.15 to collect and ship logs from multiple sources to Elasticsearch. Configure SSL/TLS security, performance optimization, and monitoring for production-grade log aggregation.

45 min 4 distros 55 views
security Advanced

Implement OpenResty rate limiting and API protection with Lua middleware

Build production-grade API protection using OpenResty's Lua middleware with advanced rate limiting algorithms, request throttling, and comprehensive security policies for high-traffic web applications.

45 min 4 distros 48 views
devops Intermediate

Implement Django continuous deployment with Git hooks and automated testing

Set up automated Django deployment with Git hooks, testing pipelines, and production rollbacks. Configure continuous integration with database migrations, static file management, and zero-downtime deployments.

45 min 4 distros 74 views
devops Advanced

Configure Consul multi-datacenter WAN federation for geographic redundancy

Set up Consul WAN federation to connect multiple datacenters for global service discovery and failover. This tutorial covers primary and secondary datacenter configuration with ACL token replication and cross-datacenter networking.

45 min 4 distros 72 views

Need help?

Don't want to manage this yourself?

We handle infrastructure for businesses that depend on uptime. From initial setup to ongoing operations.

Talk to an engineer