Infrastructure tutorials

Production-grade guides for Linux, servers, security, and performance. Copy-paste commands, multi-distro support, written by engineers who run these systems in production.

monitoring Advanced

Set up Jaeger sampling strategies for high-volume production tracing

Configure advanced Jaeger sampling strategies to efficiently capture traces in high-traffic production environments while controlling storage costs and maintaining observability.

45 min 4 distros 275 views
networking Advanced

Implement Consul multi-datacenter replication with WAN federation

Set up Consul WAN federation to replicate services and configuration across multiple datacenters with ACL token replication, health monitoring, and automatic failover capabilities.

45 min 4 distros 235 views
databases Advanced

Configure MariaDB Galera cluster for multi-master replication with automatic failover

Set up a highly available MariaDB Galera cluster with multi-master replication and automatic failover for production database workloads. This tutorial covers cluster initialization, node configuration, and monitoring setup across multiple servers.

45 min 4 distros 324 views
security Advanced

Configure a ClamAV cluster on Kubernetes for high-availability antivirus scanning

Deploy a distributed ClamAV antivirus cluster on Kubernetes with persistent storage, load balancing, and monitoring for enterprise-grade threat detection and scanning.

45 min 4 distros 353 views
monitoring Intermediate

Configure Prometheus Alertmanager with custom webhook integrations for Slack, Microsoft Teams, and PagerDuty notifications

Set up Prometheus Alertmanager with webhook receivers for Slack, Microsoft Teams, and PagerDuty notifications. Configure routing rules, test notifications, and implement alert escalation workflows for comprehensive monitoring coverage.

45 min 4 distros 517 views
databases Advanced

Configure Elasticsearch 8 cross-cluster replication for disaster recovery

Set up cross-cluster replication between Elasticsearch 8 clusters to ensure data resilience and business continuity. This advanced configuration creates automatic data synchronization across geographically distributed clusters for disaster recovery scenarios.

45 min 4 distros 430 views
devops Advanced

Set up Consul multi-datacenter replication with ACL token replication

Configure Consul WAN federation across multiple datacenters with secure ACL token replication. Enable automatic token synchronization, gossip encryption, and cross-datacenter service discovery for distributed infrastructure.

45 min 4 distros 276 views
monitoring Intermediate

Configure Fluentd with Kubernetes DaemonSet and log routing for centralized collection

Deploy Fluentd as a DaemonSet on Kubernetes for centralized log collection with multi-format parsing, routing to multiple outputs, and RBAC security. Includes configuration for Elasticsearch, S3, and custom log sources.

45 min 4 distros 396 views
devops Advanced

Implement Istio circuit breaker and retry policies for microservices resilience and fault tolerance

Configure Istio destination rules with circuit breaker patterns, implement retry policies with exponential backoff, and set up comprehensive fault injection testing for microservices resilience in Kubernetes service mesh environments.

45 min 4 distros 409 views
security Advanced

Configure Keycloak high-availability clustering for production with load balancing and failover

Set up a production-ready Keycloak cluster with PostgreSQL backend, HAProxy load balancing, and automatic failover to ensure identity services remain available during node failures and high traffic.

90 min 4 distros 618 views
monitoring Intermediate

Configure Prometheus Alertmanager for email and Slack notifications with webhook integration

Set up Prometheus Alertmanager to send critical alerts via email and Slack channels with custom webhook integration. This tutorial covers installation, SMTP configuration, routing rules, and alert notification testing.

25 min 4 distros 354 views
databases Intermediate

Monitor PostgreSQL performance with the pg_stat_statements extension for query analysis and optimization

Set up the pg_stat_statements extension to collect query statistics, analyze slow queries, and optimize PostgreSQL database performance with automated monitoring alerts.

25 min 4 distros 497 views
