Infrastructure tutorials

Production-grade guides for Linux, servers, security and performance. Copy-paste commands, multi-distro support, written by engineers who run this in production.

monitoring Intermediate

Set up Apache Airflow performance monitoring with DataDog agent integration and custom dashboards

Configure comprehensive Apache Airflow monitoring using DataDog agent to track DAG performance, task execution metrics, and resource utilization with custom dashboards and automated alerting for production workflow management.

45 min 4 distros 211 views
databases Intermediate

Implement Elasticsearch 8 snapshot lifecycle management with S3 storage for automated backups

Set up automated Elasticsearch 8 backups using snapshot lifecycle management policies with S3 repository storage. Configure retention policies, scheduling, and monitoring for production backup strategies.

45 min 4 distros 238 views
monitoring Intermediate

Configure Loki and Promtail for centralized Docker log aggregation and analysis

Set up Grafana Loki and Promtail to collect, aggregate, and analyze logs from Docker containers. Configure retention policies, integrate with Grafana for visualization, and enable real-time log monitoring across your infrastructure.

25 min 4 distros 200 views
devops Intermediate

Set up GitLab container registry mirror and proxy cache for improved performance

Configure GitLab's built-in container registry as a mirror and proxy cache to reduce Docker Hub rate limits, speed up image pulls, and improve CI/CD pipeline performance across your organization.

45 min 4 distros 243 views
devops Intermediate

Implement Consul backup and disaster recovery with automated snapshots and restoration

Set up automated Consul snapshots with GPG encryption, systemd timers, and complete disaster recovery procedures. Includes monitoring integration with Prometheus and automated restoration workflows for production environments.

45 min 4 distros 253 views
devops Intermediate

Set up GitLab backup and disaster recovery with automated restoration

Configure automated GitLab backups with disaster recovery procedures and automated restoration scripts. Includes monitoring, alerting, and production-grade recovery workflows.

45 min 4 distros 241 views
devops Advanced

Configure Apache Airflow high availability with CeleryExecutor and Redis clustering for production deployments

Set up Apache Airflow with CeleryExecutor and Redis clustering for high availability production deployments. Configure multiple workers, load balancing, monitoring, and automated failover to handle enterprise-scale workflow orchestration with zero downtime.

45 min 4 distros 302 views
devops Advanced

Configure Spark Kubernetes Operator with MinIO for cloud-native analytics

Deploy Apache Spark on Kubernetes with the Spark Operator and MinIO object storage for scalable big data processing. Configure RBAC, SSL certificates, and persistent storage for production-ready analytics workloads.

45 min 4 distros 217 views
databases Intermediate

Configure TimescaleDB backup and recovery with pgBackRest for automated PostgreSQL protection

Set up pgBackRest with TimescaleDB for automated backups, point-in-time recovery, and database protection. Includes configuration for local and remote repositories, scheduled backups, and comprehensive restore procedures.

25 min 4 distros 226 views
security Advanced

Configure Apache Airflow DAG security and secrets management with RBAC policies and encryption

Implement comprehensive security for Apache Airflow DAGs using role-based access control, HashiCorp Vault integration, and encrypted secrets management. Configure granular permissions, audit logging, and isolation policies for production workflows.

45 min 4 distros 219 views
monitoring Intermediate

Configure Grafana dashboards for TimescaleDB analytics with real-time metrics and alerting

Build comprehensive Grafana dashboards for TimescaleDB analytics with time-series visualizations, custom queries, and automated alerting for performance monitoring and data insights.

45 min 4 distros 208 views
performance Advanced

Optimize ClickHouse performance for high-throughput workloads with advanced tuning and memory management

Learn how to optimize ClickHouse for high-throughput analytics workloads through advanced memory configuration, query performance tuning, storage engine optimization, and connection pooling strategies.

45 min 4 distros 219 views

Need help?

Don't want to manage this yourself?

We handle infrastructure for businesses that depend on uptime. From initial setup to ongoing operations.

Talk to an engineer