Infrastructure tutorials

Production-grade guides for Linux, servers, security, and performance. Copy-paste commands, multi-distro support, written by engineers who run these systems in production.

monitoring Intermediate

Configure Fluentd with Kubernetes DaemonSet and log routing for centralized collection

Deploy Fluentd as a DaemonSet on Kubernetes for centralized log collection with multi-format parsing, routing to multiple outputs, and RBAC security. Includes configuration for Elasticsearch, S3, and custom log sources.

45 min 4 distros 237 views
devops Intermediate

Implement Kafka Streams processing applications with Java and Scala for real-time data analytics

Build production-ready Kafka Streams applications using Java and Scala for real-time data processing, including stateless transformations, stateful aggregations, windowing, and stream joins with exactly-once semantics.

45 min 4 distros 242 views
devops Advanced

Set up Kafka Schema Registry with Avro serialization for data processing

Configure Confluent Schema Registry with Avro serialization for production Kafka deployments. Includes schema evolution, producer/consumer integration, and SSL security configuration.

45 min 4 distros 248 views
devops Intermediate

Configure Kafka Schema Registry with Avro serialization for data processing

Set up Confluent Schema Registry with Apache Kafka to manage Avro schemas for data serialization. Configure producers and consumers with schema evolution and compatibility rules for production data processing workflows.

25 min 4 distros 302 views
databases Advanced

Optimize ClickHouse performance for large datasets with advanced tuning and memory management

Optimize ClickHouse for high-performance analytics on multi-terabyte datasets with advanced memory tuning, storage engine configuration, query optimization strategies, and distributed clustering for production workloads.

45 min 4 distros 323 views
databases Intermediate

Implement MinIO data lifecycle management and automated archival with S3 retention policies

Configure MinIO lifecycle policies to automatically transition and expire objects based on age and storage class. Set up intelligent tiering and automated archival to reduce storage costs and meet data retention requirements.

35 min 4 distros 843 views
monitoring Intermediate

Set up ClickHouse monitoring with Prometheus and Grafana dashboards

Configure comprehensive ClickHouse monitoring using Prometheus for metrics collection and Grafana for visualization. Set up system metrics, query performance monitoring, and alerting rules for production ClickHouse deployments.

45 min 4 distros 561 views
security Advanced

Configure ClickHouse users and RBAC for production environments with authentication and access control

Secure your ClickHouse deployment with proper user authentication, role-based access control, and production-grade security policies. Learn to create users, manage roles, implement quota systems, and monitor access patterns for enterprise environments.

45 min 4 distros 773 views
databases Intermediate

Implement automated ClickHouse backups with S3 storage and monitoring

Set up automated backup solutions for ClickHouse databases with S3 storage, retention policies, and monitoring alerts. This tutorial covers backup tool installation, S3 configuration, scheduling with systemd timers, and health monitoring.

45 min 4 distros 454 views
devops Advanced

Implement an Apache Spark 3.5 cluster with YARN and HDFS for distributed computing

Set up a production-grade Apache Spark 3.5 cluster with YARN resource management and HDFS distributed storage for scalable big data processing. This tutorial covers multi-node Hadoop cluster configuration, YARN integration, and monitoring setup.

45 min 4 distros 507 views
databases Advanced

Set up Spark 3.5 Delta Lake with MinIO for ACID transactions and big data analytics

Configure Apache Spark 3.5 with Delta Lake and MinIO object storage for ACID transactions, data versioning, and distributed analytics processing. Includes a complete setup for a production-grade data lake architecture.

45 min 4 distros 588 views
devops Advanced

Implement Kafka Streams applications for real-time data processing with Java and Scala

Build production-grade Kafka Streams applications for real-time data processing using Java and Scala. Configure stream processing topologies, implement windowing operations, and deploy scalable streaming applications with monitoring.

45 min 4 distros 630 views
