Tutorials Monitoring

Monitoring

Logging, alerting, metrics, observability

monitoring Advanced

Implement Prometheus federation for multi-cluster monitoring with centralized metrics aggregation

Set up hierarchical Prometheus federation to monitor multiple Kubernetes clusters with a central aggregation layer. Configure global and local Prometheus instances with federated scrape jobs, service discovery, and unified dashboards for enterprise-scale observability.

45 min 4 distros 228 views
monitoring Advanced

Set up Jaeger multi-datacenter replication for disaster recovery and high availability

Configure Jaeger distributed tracing with multi-datacenter replication for high availability and disaster recovery. Learn to set up primary and secondary datacenters with automated failover and cross-region data synchronization.

90 min 4 distros 395 views
monitoring Advanced

Implement OpenTelemetry distributed context propagation across microservices with automatic instrumentation

Set up comprehensive distributed tracing across microservices using OpenTelemetry with automatic context propagation, trace correlation headers, and framework-specific auto-instrumentation for Python, Java, and Node.js applications.

45 min 4 distros 468 views
monitoring Advanced

Configure Thanos Ruler for distributed alerting across multiple Prometheus clusters

Set up Thanos Ruler to create a unified alerting layer across distributed Prometheus instances. This tutorial covers installation, global rule configuration, and cross-cluster alert federation for enterprise monitoring.

45 min 4 distros 475 views
monitoring Advanced

Set up Thanos Receiver for remote write scalability with Prometheus integration

Configure Thanos Receiver to handle high-volume remote write traffic from multiple Prometheus instances. This tutorial covers installation, multi-tenancy setup, and performance optimization for large-scale metrics ingestion.

45 min 4 distros 471 views
monitoring Advanced

Configure Prometheus long-term storage with Thanos for unlimited data retention

Deploy Thanos components with Prometheus to achieve unlimited data retention using object storage. This advanced setup enables querying years of historical metrics while maintaining high availability and reducing local storage costs.

45 min 4 distros 438 views
monitoring Advanced

Set up Jaeger high availability clustering with load balancing and failover

Deploy a production-grade Jaeger distributed tracing cluster with Elasticsearch backend, load-balanced collectors, and automatic failover for enterprise-scale microservices monitoring.

45 min 4 distros 453 views
monitoring Advanced

Set up Zabbix 7 high availability cluster with PostgreSQL replication and automated failover

Build a production-grade Zabbix 7 high availability cluster with PostgreSQL streaming replication, HAProxy load balancing, and automated failover capabilities for enterprise monitoring.

90 min 4 distros 612 views