Infrastructure tutorials

Production-grade guides for Linux, servers, security and performance. Copy-paste commands, multi-distro support, written by engineers who run this in production.

monitoring Advanced

Configure Thanos Receiver clustering for high availability and load distribution

Set up Thanos Receiver clustering with hashring configuration to distribute Prometheus remote write traffic across multiple replicas for high availability and scalability.

45 min 4 distros 203 views
monitoring Advanced

Configure advanced Grafana dashboards and alerting with Prometheus integration

Learn to build production-grade Grafana dashboards with advanced templating, custom Prometheus queries, sophisticated alerting rules, and automated provisioning for scalable monitoring infrastructure.

45 min 4 distros 353 views
monitoring Intermediate

Set up multi-location monitoring with distributed Uptime Kuma instances

Deploy multiple Uptime Kuma instances across different geographic locations to monitor your services from various vantage points. This setup provides comprehensive monitoring coverage, reduces false positives, and enables regional performance analysis with centralized alerting.

45 min 4 distros 253 views
monitoring Advanced

Implement Thanos multi-cluster federation for global Prometheus metrics aggregation

Set up Thanos components across multiple Kubernetes clusters to enable global metrics federation, long-term storage, and unified querying of Prometheus data with high availability and unlimited retention.

45 min 4 distros 244 views
monitoring Intermediate

Monitor Elasticsearch cluster with Prometheus and Grafana dashboards

Set up comprehensive Elasticsearch cluster monitoring using Prometheus Elasticsearch Exporter and Grafana dashboards. Configure alerting rules for cluster health, performance metrics, and automated notifications.

45 min 4 distros 240 views
monitoring Advanced

Implement Prometheus federation for multi-cluster monitoring with centralized metrics aggregation

Set up hierarchical Prometheus federation to monitor multiple Kubernetes clusters with a central aggregation layer. Configure global and local Prometheus instances with federated scrape jobs, service discovery, and unified dashboards for enterprise-scale observability.

45 min 4 distros 266 views
devops Intermediate

Integrate Consul with Kubernetes service discovery and automatic configuration

Set up Consul for dynamic service discovery in Kubernetes clusters with automatic service registration, health checks, and configuration management for microservices orchestration.

45 min 4 distros 223 views
devops Advanced

Configure Istio security policies with mutual TLS and authorization for Kubernetes service mesh

Configure mutual TLS authentication and authorization policies in Istio service mesh for secure microservices communication. This tutorial covers PeerAuthentication, AuthorizationPolicy, and RBAC configuration for production Kubernetes environments.

45 min 4 distros 288 views
monitoring Intermediate

Implement custom Prometheus exporters for application metrics collection and monitoring

Build production-grade custom Prometheus exporters in Python and Go to collect application-specific metrics. Learn exporter architecture, metric types, systemd deployment, and Prometheus integration for comprehensive application monitoring.

45 min 4 distros 501 views
devops Advanced

Setup Apache Airflow cluster with Kubernetes Executor for auto-scaling workflows

Deploy production-grade Apache Airflow with Kubernetes Executor for dynamic workflow scaling. Configure PostgreSQL backend, RBAC authentication, and auto-scaling policies with Prometheus monitoring integration.

45 min 4 distros 344 views
devops Advanced

Set up Istio multi-cluster service mesh with cross-cluster communication

Deploy and configure Istio across multiple Kubernetes clusters with secure cross-cluster communication, shared service discovery, and unified traffic management for distributed microservices architecture.

45 min 4 distros 362 views
monitoring Advanced

Configure Prometheus long-term storage with Thanos for unlimited data retention

Deploy Thanos components with Prometheus to achieve unlimited data retention using object storage. This advanced setup enables querying years of historical metrics while maintaining high availability and reducing local storage costs.

45 min 4 distros 516 views

Need help?

Don't want to manage this yourself?

We handle infrastructure for businesses that depend on uptime. From initial setup to ongoing operations.

Talk to an engineer