Infrastructure tutorials

Production-grade guides for Linux, servers, security, and performance. Copy-paste commands, multi-distro support, written by engineers who run these systems in production.

devops Advanced

Configure Spark on Kubernetes with cluster autoscaling for dynamic workloads

Deploy Apache Spark 3.5 on Kubernetes with automatic cluster scaling, dynamic resource allocation, and comprehensive monitoring for production data processing workloads.

45 min 4 distros 185 views
devops Advanced

Set up Spark Streaming with Kafka and Delta Lake for real-time analytics

Configure Apache Spark 3.5 with Kafka integration and Delta Lake support for building production-grade real-time analytics pipelines with ACID transactions and streaming capabilities.

45 min 4 distros 156 views
performance Advanced

Implement Spark SQL performance optimization with Catalyst optimizer and advanced tuning

Optimize Apache Spark 3.5 SQL performance using the Catalyst optimizer, with advanced query tuning, adaptive query execution, and production-grade configuration for high-throughput analytics workloads.

45 min 4 distros 87 views
devops Intermediate

Set up GitLab CI/CD with Kubernetes runners for scalable deployments

Configure GitLab Runner with Kubernetes executor to automatically scale CI/CD workloads. Set up RBAC permissions, deploy pipelines to Kubernetes clusters, and implement resource management policies for efficient container orchestration.

45 min 4 distros 105 views
devops Intermediate

Set up advanced Airflow alerting with Slack integration and custom notification rules

Configure Apache Airflow to send intelligent alerts to Slack channels when DAGs fail, with custom notification rules based on task importance and execution context. Includes automated retry logic and escalation workflows.

45 min 4 distros 198 views
devops Advanced

Implement Kafka Streams exactly-once processing semantics with Java applications

Configure a Kafka cluster and Java applications for exactly-once semantics (EOS), covering transaction state management, idempotent producers, and consumer isolation levels for reliable stream processing.

45 min 4 distros 116 views
devops Advanced

Configure Kafka Streams state stores and RocksDB optimization for high-performance streaming applications

Tune RocksDB-backed Kafka Streams state stores for high-throughput streaming applications. Covers custom state store configurations, RocksDB tuning parameters, and monitoring techniques for production-grade stream processing.

45 min 4 distros 173 views
databases Intermediate

Set up DuckDB with Apache Airflow for automated data pipelines

Configure DuckDB as a high-performance analytical database backend for Apache Airflow workflows. Build automated data pipelines that process files, APIs, and databases using DuckDB's columnar engine.

45 min 4 distros 164 views
monitoring Intermediate

Set up Nagios Core 4.5 distributed monitoring with NRPE for remote host checks

Configure a Nagios Core server with NRPE agents for distributed monitoring across multiple servers. Set up remote host checks, service monitoring, and centralized alerting for comprehensive infrastructure oversight.

45 min 4 distros 184 views
devops Advanced

Optimize Apache Airflow DAG performance with production best practices

Optimize Apache Airflow DAGs for production with parallelism tuning, resource allocation strategies, and performance monitoring. Learn executor configuration, task dependency optimization, and troubleshooting techniques for high-throughput workflows.

45 min 4 distros 241 views
monitoring Intermediate

Set up Apache Airflow performance monitoring with Datadog Agent integration and custom dashboards

Configure comprehensive Apache Airflow monitoring with the Datadog Agent to track DAG performance, task execution metrics, and resource utilization, with custom dashboards and automated alerting for production workflow management.

45 min 4 distros 211 views
devops Advanced

Implement Airflow load balancing with Celery executor for high availability workflows

Set up Apache Airflow with Celery executor, Redis cluster backend, HAProxy load balancing, and auto-scaling workers for production-grade workflow orchestration with high availability.

45 min 4 distros 293 views
