Infrastructure tutorials

Production-grade guides for Linux, servers, security and performance. Copy-paste commands, multi-distro support, written by engineers who run this in production.

monitoring Advanced

Implement Thanos multi-cluster federation for global Prometheus metrics aggregation

Set up Thanos components across multiple Kubernetes clusters to enable global metrics federation, long-term storage, and unified querying of Prometheus data with high availability and unlimited retention.

45 min 4 distros 313 views
monitoring Advanced

Implement Prometheus federation for multi-cluster monitoring with centralized metrics aggregation

Set up hierarchical Prometheus federation to monitor multiple Kubernetes clusters with a central aggregation layer. Configure global and local Prometheus instances with federated scrape jobs, service discovery, and unified dashboards for enterprise-scale observability.

45 min 4 distros 327 views
databases Advanced

Configure MinIO high availability clustering for production with multi-node setup and automatic failover

Set up a production-ready MinIO cluster with multiple nodes, erasure coding, and automatic failover. This tutorial covers distributed mode configuration, load balancing with SSL termination, and health monitoring for enterprise object storage.

45 min 4 distros 435 views
devops Advanced

Advanced Nomad job templates and deployment strategies with rolling updates and canary deployments

Master production-grade Nomad job templates with HCL syntax, implement rolling updates with health checks, and deploy advanced blue-green and canary deployment patterns for resilient containerized workloads.

45 min 4 distros 436 views
monitoring Intermediate

Set up log alerting with Fluentd and Prometheus Alertmanager for centralized monitoring

Configure Fluentd to collect and parse logs, integrate with Prometheus metrics collection, and set up Alertmanager for intelligent routing of log-based alerts to multiple notification channels.

45 min 4 distros 341 views
devops Intermediate

Integrate Consul with Kubernetes service discovery and automatic configuration

Set up Consul for dynamic service discovery in Kubernetes clusters with automatic service registration, health checks, and configuration management for microservices orchestration.

45 min 4 distros 261 views
devops Intermediate

Setup Kubernetes Ingress NGINX with cert-manager for automated SSL certificates

Learn to deploy NGINX Ingress Controller with cert-manager for automatic SSL certificate provisioning and renewal using Let's Encrypt in production Kubernetes clusters.

35 min 4 distros 397 views
monitoring Intermediate

Set up distributed tracing for Node.js and Python microservices with OpenTelemetry and Jaeger

Configure end-to-end distributed tracing across Node.js and Python microservices using OpenTelemetry instrumentation and Jaeger backend with Elasticsearch for centralized trace collection and analysis.

45 min 4 distros 280 views
devops Intermediate

Configure Kubernetes horizontal pod autoscaler for dynamic scaling based on resource metrics

Set up HPA with CPU and memory targets for automatic pod scaling. Configure metrics server and Prometheus adapter for custom metrics monitoring. Enable dynamic workload scaling based on resource utilization.

45 min 4 distros 373 views
monitoring Intermediate

Monitor Kubernetes clusters with Prometheus and Grafana for container orchestration insights

Set up comprehensive Kubernetes monitoring using the Prometheus Operator and Grafana with persistent storage, RBAC, ServiceMonitors, and custom dashboards for complete cluster observability.

45 min 4 distros 430 views
devops Advanced

Set up Apache Airflow high availability with CeleryExecutor and Redis clustering

Deploy Apache Airflow in high availability mode using CeleryExecutor with Redis clustering for task distribution, PostgreSQL connection pooling, and load-balanced webservers for production-grade workflow orchestration.

45 min 4 distros 658 views
devops Advanced

Setup Apache Airflow cluster with Kubernetes Executor for auto-scaling workflows

Deploy production-grade Apache Airflow with Kubernetes Executor for dynamic workflow scaling. Configure PostgreSQL backend, RBAC authentication, and auto-scaling policies with Prometheus monitoring integration.

45 min 4 distros 400 views

Need help?

Don't want to manage this yourself?

We handle infrastructure for businesses that depend on uptime. From initial setup to ongoing operations.

Talk to an engineer