Infrastructure tutorials

Production-grade guides for Linux, servers, security and performance. Copy-paste commands, multi-distro support, written by engineers who run this in production.

monitoring Advanced

Implement Alertmanager high availability clustering with automatic failover and load balancing

Set up a production-grade Alertmanager cluster with gossip protocol for high availability, automatic failover, and load balancing. Ensure your monitoring alerts remain operational even when individual nodes fail.

45 min 4 distros 268 views
monitoring Advanced

Monitor Kubernetes network policies with Prometheus and Grafana for enhanced cluster security

Set up comprehensive monitoring for Kubernetes network policies using Prometheus and Grafana. Configure CNI metrics collection, create security dashboards, and implement alerting for policy violations and traffic anomalies.

45 min 4 distros 196 views
databases Advanced

Set up CockroachDB backup and disaster recovery automation with systemd timers and monitoring

Configure automated backup strategies for CockroachDB with systemd timers, implement comprehensive disaster recovery procedures, and set up monitoring with Prometheus and Grafana for production-grade database infrastructure.

90 min 4 distros 200 views
monitoring Intermediate

Configure Prometheus Alertmanager with Slack integration for team notifications

Set up Prometheus Alertmanager to route alerts to Slack channels with custom notification rules. Configure alerting rules, webhook integration, and team-based routing for production monitoring workflows.

25 min 4 distros 304 views
devops Intermediate

Set up GitLab CI/CD with Kubernetes runners for scalable deployments

Configure GitLab Runner with Kubernetes executor to automatically scale CI/CD workloads. Set up RBAC permissions, deploy pipelines to Kubernetes clusters, and implement resource management policies for efficient container orchestration.

45 min 4 distros 197 views
monitoring Advanced

Configure advanced SNMP alerting with Prometheus Alertmanager for network monitoring

Set up comprehensive SNMP monitoring with Prometheus exporters and create intelligent alerting rules in Alertmanager to proactively monitor network devices, interfaces, and performance metrics.

45 min 4 distros 198 views
monitoring Intermediate

Set up centralized logging with Winston and Elasticsearch for Node.js applications

Configure Winston logging in Node.js with Elasticsearch transport for centralized log aggregation, monitoring, and analysis using Kibana dashboards.

45 min 4 distros 201 views
devops Intermediate

Configure GitLab backup automation with GPG encryption

Set up automated GitLab backups with GPG encryption, remote storage, and systemd timers. This tutorial covers backup script creation, encryption setup, and monitoring for production GitLab instances.

45 min 4 distros 181 views
monitoring Intermediate

Implement Node.js application monitoring with Prometheus metrics collection and Grafana dashboards

Set up comprehensive Node.js application monitoring with Prometheus metrics collection, custom performance tracking, and Grafana dashboards for real-time observability and alerting.

45 min 4 distros 189 views
security Advanced

Configure advanced nftables logging and monitoring for network security

Set up comprehensive nftables logging with structured syslog integration, Prometheus metrics collection, and ELK stack analysis. Configure Grafana dashboards for real-time firewall monitoring and automated alerting for security events.

45 min 4 distros 226 views
monitoring Intermediate

Setup Prometheus Blackbox Exporter for endpoint monitoring with SSL and alerting

Configure Prometheus Blackbox Exporter to monitor HTTP endpoints, SSL certificates, and DNS resolution with automated alerting rules and Grafana dashboards.

25 min 4 distros 492 views
monitoring Intermediate

Configure Prometheus alerting with AlertManager notifications and webhook integration

Set up AlertManager to handle Prometheus alerts with email, Slack, and webhook notifications. Configure alerting rules, notification channels, and test the complete alerting pipeline.

25 min 4 distros 412 views

Need help?

Don't want to manage this yourself?

We handle infrastructure for businesses that depend on uptime. From initial setup to ongoing operations.

Talk to an engineer