Infrastructure tutorials

Production-grade guides for Linux, servers, security and performance. Copy-paste commands, multi-distro support, written by engineers who run this in production.

monitoring Intermediate

Configure Prometheus alerting rules for cgroup metrics monitoring and container resource alerts

Set up comprehensive Prometheus alerting rules for monitoring cgroup resource usage with automated threshold alerts for CPU, memory, and I/O limits. Configure Alertmanager notifications for container resource exhaustion and system health monitoring.

45 min 4 distros 66 views
devops Advanced

Implement Kubernetes workload rightsizing with VPA recommendations and cost analysis

Set up Vertical Pod Autoscaler to automatically optimize resource requests and limits for your Kubernetes workloads. Create cost analysis dashboards to track resource utilization and identify opportunities for rightsizing containers in production clusters.

45 min 4 distros 111 views
monitoring Intermediate

Set up Apache Airflow performance monitoring with DataDog agent integration and custom dashboards

Configure comprehensive Apache Airflow monitoring using DataDog agent to track DAG performance, task execution metrics, and resource utilization with custom dashboards and automated alerting for production workflow management.

45 min 4 distros 96 views
devops Advanced

Setup Tekton Pipelines 0.62 for Kubernetes CI/CD with security scanning integration

Configure Tekton Pipelines for cloud-native CI/CD with automated security scanning, Git webhooks, and production-ready RBAC policies. Build secure container pipelines with integrated vulnerability detection and compliance checks.

45 min 4 distros 73 views
databases Advanced

Configure Elasticsearch 8 cross-cluster replication for disaster recovery

Set up cross-cluster replication between Elasticsearch 8 clusters to ensure data resilience and business continuity. This advanced configuration creates automatic data synchronization across geographically distributed clusters for disaster recovery scenarios.

45 min 4 distros 108 views
networking Advanced

Configure advanced gRPC load balancing with Envoy Proxy health checks and circuit breakers

Set up Envoy Proxy with intelligent gRPC load balancing, health checks, and circuit breakers for production microservices. Includes SSL termination, monitoring integration, and security hardening.

45 min 4 distros 60 views
monitoring Intermediate

Configure Prometheus monitoring for ArgoCD with metrics collection and Grafana dashboards

Set up comprehensive ArgoCD monitoring with Prometheus metrics collection, custom service discovery, and Grafana dashboards. Configure alerting rules for deployment failures and performance issues to maintain GitOps visibility.

45 min 4 distros 43 views
devops Intermediate

Configure Traefik with Consul service discovery for dynamic load balancing

Set up Traefik 3.1 reverse proxy with Consul service discovery to automatically route traffic to services without manual configuration. Learn dynamic routing, SSL automation, health checks, and monitoring setup for production environments.

25 min 4 distros 111 views
monitoring Intermediate

Configure Prometheus Alertmanager with email notifications for production monitoring

Set up Prometheus Alertmanager to send email notifications when your systems trigger alerts. This tutorial covers SMTP configuration, alert routing rules, and email template customization for production monitoring workflows.

35 min 4 distros 127 views
devops Intermediate

Configure Kubernetes External DNS for automatic DNS record management with cloud providers

Set up External DNS controller to automatically create and manage DNS records for your Kubernetes services and ingresses. This tutorial covers installation, cloud provider integration, and security configuration for production-ready DNS automation.

25 min 4 distros 175 views
monitoring Advanced

Configure Thanos Receiver clustering for high availability and load distribution

Set up Thanos Receiver clustering with hashring configuration to distribute Prometheus remote write traffic across multiple replicas for high availability and scalability.

45 min 4 distros 152 views
monitoring Advanced

Configure advanced Grafana dashboards and alerting with Prometheus integration

Learn to build production-grade Grafana dashboards with advanced templating, custom Prometheus queries, sophisticated alerting rules, and automated provisioning for scalable monitoring infrastructure.

45 min 4 distros 250 views

Need help?

Don't want to manage this yourself?

We handle infrastructure for businesses that depend on uptime. From initial setup to ongoing operations.

Talk to an engineer