Infrastructure tutorials

Production-grade guides for Linux, servers, security and performance. Copy-paste commands, multi-distro support, written by engineers who run this in production.

monitoring Intermediate

Configure Prometheus Alertmanager with custom webhook integrations for Slack, Microsoft Teams, and PagerDuty notifications

Set up Prometheus Alertmanager with webhook receivers for Slack, Microsoft Teams, and PagerDuty notifications. Configure routing rules, test notifications, and implement alert escalation workflows for comprehensive monitoring coverage.

45 min 4 distros 202 views
databases Advanced

Configure MongoDB sharding with zone-based data distribution for geographic workloads

Set up MongoDB sharding with geographic zones to distribute data based on location, ensuring optimal performance for global applications and regulatory compliance.

45 min 4 distros 85 views
monitoring Advanced

Implement Alertmanager high availability clustering with automatic failover and load balancing

Set up a production-grade Alertmanager cluster with gossip protocol for high availability, automatic failover, and load balancing. Ensure your monitoring alerts remain operational even when individual nodes fail.

45 min 4 distros 162 views
databases Advanced

Set up CockroachDB backup and disaster recovery automation with systemd timers and monitoring

Configure automated backup strategies for CockroachDB with systemd timers, implement comprehensive disaster recovery procedures, and set up monitoring with Prometheus and Grafana for production-grade database infrastructure.

90 min 4 distros 105 views
monitoring Intermediate

Configure Prometheus Alertmanager with Slack integration for team notifications

Set up Prometheus Alertmanager to route alerts to Slack channels with custom notification rules. Configure alerting rules, webhook integration, and team-based routing for production monitoring workflows.

25 min 4 distros 130 views
hosting Intermediate

Setup Tomcat SSL certificates with Let's Encrypt automation

Configure Apache Tomcat 11 with automated SSL certificate management using Let's Encrypt and Certbot. Set up HTTPS connector, Java keystore integration, and automatic certificate renewal for production environments.

25 min 4 distros 117 views
monitoring Intermediate

Implement Tomcat application monitoring with JMX and Grafana dashboards

Set up comprehensive Apache Tomcat 10 monitoring using JMX metrics, Prometheus JMX Exporter, and Grafana dashboards to track application performance, memory usage, and request metrics in real-time.

45 min 4 distros 107 views
monitoring Intermediate

Set up centralized logging with Winston and Elasticsearch for Node.js applications

Configure Winston logging in Node.js with Elasticsearch transport for centralized log aggregation, monitoring, and analysis using Kibana dashboards.

45 min 4 distros 91 views
performance Advanced

Configure Nginx Redis cluster caching for high availability and performance optimization

Set up Nginx with Redis cluster caching to achieve high availability and optimized performance. This advanced configuration includes cluster setup, failover testing, and performance tuning for production environments.

45 min 4 distros 136 views
hosting Advanced

Implement NGINX Plus active health checks for enterprise environments

Configure NGINX Plus active health checks to automatically detect and remove unhealthy upstream servers, ensuring high availability and optimal load balancing for enterprise applications.

45 min 4 distros 121 views
databases Advanced

Configure Elasticsearch cross-cluster replication for disaster recovery

Set up Elasticsearch cross-cluster replication (CCR) to replicate indices between clusters for disaster recovery. Configure remote clusters, create follower indices, and monitor replication status for high availability.

45 min 4 distros 92 views
databases Advanced

Configure InfluxDB 2.7 clustering for high availability with data replication and automated failover

Set up a production-ready InfluxDB Enterprise cluster with automatic data replication, failover mechanisms, and comprehensive monitoring using Grafana dashboards for time-series workloads.

45 min 4 distros 134 views

Need help?

Don't want to manage this yourself?

We handle infrastructure for businesses that depend on uptime. From initial setup to ongoing operations.

Talk to an engineer