Infrastructure tutorials

Production-grade guides for Linux, servers, security and performance. Copy-paste commands, multi-distro support, written by engineers who run this in production.

security Advanced

Configure Airflow DAG security and isolation with RBAC policies

Set up comprehensive security for Apache Airflow with role-based access control, DAG-level permissions, and resource isolation. Configure user authentication, implement fine-grained security policies, and establish monitoring for production-grade workflow orchestration.

45 min 4 distros 443 views
security Intermediate

Set up OSSEC agent deployment and centralized management with automated configuration

Deploy and manage OSSEC Host-based Intrusion Detection System (HIDS) across multiple servers with centralized monitoring, automated agent configuration, and real-time security event processing for enterprise-scale infrastructure protection.

45 min 4 distros 389 views
monitoring Advanced

Integrate OpenTelemetry with ELK stack for unified observability and distributed tracing

Set up a comprehensive observability stack by integrating OpenTelemetry Collector with Elasticsearch, Logstash, and Kibana for distributed tracing, metrics collection, and unified monitoring across microservices and applications.

45 min 4 distros 318 views
monitoring Intermediate

Set up multi-location monitoring with distributed Uptime Kuma instances

Deploy multiple Uptime Kuma instances across different geographic locations to monitor your services from various vantage points. This setup provides comprehensive monitoring coverage, reduces false positives, and enables regional performance analysis with centralized alerting.

45 min 4 distros 330 views
performance Advanced

Configure Apache Airflow performance optimization with connection pooling and resource tuning

Optimize Apache Airflow performance through advanced connection pooling, resource tuning, and Celery executor configuration. Learn to scale workers, configure database pools, and implement comprehensive monitoring for production workflows.

45 min 4 distros 685 views
databases Advanced

Set up Elasticsearch 8 cross-cluster replication for disaster recovery and high availability

Configure Elasticsearch 8 cross-cluster replication (CCR) to replicate indices across multiple clusters for disaster recovery. This tutorial covers security setup, remote cluster connections, replication policies, and automated failover procedures.

45 min 4 distros 382 views
monitoring Advanced

Implement Thanos multi-cluster federation for global Prometheus metrics aggregation

Set up Thanos components across multiple Kubernetes clusters to enable global metrics federation, long-term storage, and unified querying of Prometheus data with high availability and unlimited retention.

45 min 4 distros 340 views
monitoring Intermediate

Set up OpenTelemetry metrics collection with Prometheus integration for distributed system monitoring

Configure OpenTelemetry Collector to gather metrics from distributed services and export them to Prometheus for comprehensive observability monitoring. This integration provides unified metrics collection across your microservices architecture.

25 min 4 distros 366 views
devops Intermediate

Integrate Redis 7 with microservices architecture for caching and session management

Set up Redis 7 as a centralized caching layer and session store for microservices, with service discovery integration and clustering for high availability. Configure distributed session management patterns and implement Redis clustering for horizontal scalability.

45 min 4 distros 410 views
databases Intermediate

Implement Redis backup automation with RDB and AOF persistence

Set up automated Redis backups using RDB snapshots and AOF logging with verification scripts and monitoring alerts. Configure persistent data storage, automated backup scheduling, and comprehensive backup validation for production Redis deployments.

45 min 4 distros 287 views
databases Advanced

Configure MinIO high availability clustering for production with multi-node setup and automatic failover

Set up a production-ready MinIO cluster with multiple nodes, erasure coding, and automatic failover. This tutorial covers distributed mode configuration, load balancing with SSL termination, and health monitoring for enterprise object storage.

45 min 4 distros 476 views
databases Advanced

Optimize CockroachDB 24.3 performance with advanced tuning and configuration

Master advanced CockroachDB 24.3 performance optimization through memory, CPU, storage, and query tuning. Configure cluster parameters, connection pooling, and indexing strategies for production-grade distributed SQL workloads.

45 min 4 distros 368 views

Need help?

Don't want to manage this yourself?

We handle infrastructure for businesses that depend on uptime. From initial setup to ongoing operations.

Talk to an engineer