HAProxy High Availability with keepalived VRRP

Configure an HAProxy load balancer with keepalived VRRP clustering for automatic failover. Set up virtual IP failover and health checks, and monitor the cluster for production high availability.

Prerequisites

Two Linux servers on the same subnet (VRRP advertisements and the virtual IP need a shared layer 2 network)
Root or sudo access on both servers
Backend servers to load balance
Basic understanding of networking concepts

What this solves

HAProxy provides load balancing, but a single HAProxy instance creates a single point of failure. This tutorial sets up two HAProxy nodes with keepalived for VRRP (Virtual Router Redundancy Protocol) clustering. When the primary HAProxy fails, keepalived automatically moves the virtual IP to the backup node, ensuring continuous service availability without manual intervention.

Architecture overview

You'll configure two HAProxy servers (primary and backup) with keepalived managing a shared virtual IP address. The primary node owns the virtual IP and handles all traffic. If HAProxy on the primary fails its health check, or the primary stops sending VRRP advertisements, keepalived on the backup node takes over the MASTER role and moves the virtual IP automatically.

Component	Primary Node	Backup Node	Virtual IP
HAProxy	203.0.113.10	203.0.113.11	203.0.113.100
keepalived	Priority 110	Priority 100	Floats between nodes
Backend servers	203.0.113.20	203.0.113.21	Load balanced targets

Step-by-step configuration

Update system packages on both nodes

Start by updating both HAProxy nodes to ensure you get the latest packages and security updates.

# Debian/Ubuntu
sudo apt update && sudo apt upgrade -y

# AlmaLinux/Rocky/RHEL/Fedora
sudo dnf update -y

Install HAProxy and keepalived on both nodes

Install both HAProxy for load balancing and keepalived for VRRP clustering on each node.

# Debian/Ubuntu
sudo apt install -y haproxy keepalived

# AlmaLinux/Rocky/RHEL/Fedora
sudo dnf install -y haproxy keepalived

Enable IP forwarding

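Set two kernel parameters on both nodes. net.ipv4.ip_nonlocal_bind lets a process bind to an address the node does not currently hold, which matters if you bind HAProxy to the virtual IP explicitly rather than to *. net.ipv4.ip_forward enables packet forwarding; keepalived does not need it to move the virtual IP, so it only matters if the nodes also route traffic.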

echo 'net.ipv4.ip_forward = 1' | sudo tee -a /etc/sysctl.conf
echo 'net.ipv4.ip_nonlocal_bind = 1' | sudo tee -a /etc/sysctl.conf
sudo sysctl -p
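
To confirm that both parameters took effect, you could read them back from /proc/sys. The following is a minimal sketch; the function name check_sysctl and its optional third argument (an alternative root directory, useful for dry runs) are illustrative and not part of any package.

```shell
#!/usr/bin/env bash
# Sketch: confirm a kernel parameter has the expected value.
# check_sysctl is a hypothetical helper; the third argument defaults to /proc/sys.
check_sysctl() {
    local key=$1 expected=$2 root=${3:-/proc/sys}
    local path="$root/${key//.//}"   # net.ipv4.ip_forward -> net/ipv4/ip_forward
    local actual
    if [[ ! -r "$path" ]]; then
        echo "MISSING $key"
        return 1
    fi
    actual=$(<"$path")
    if [[ "$actual" == "$expected" ]]; then
        echo "OK $key = $actual"
    else
        echo "MISMATCH $key = $actual (expected $expected)"
        return 1
    fi
}
```

On a node you would call it as check_sysctl net.ipv4.ip_nonlocal_bind 1 and check_sysctl net.ipv4.ip_forward 1; a non-zero exit status means the value is missing or different.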

Configure HAProxy on both nodes

Create an identical HAProxy configuration in /etc/haproxy/haproxy.cfg on both nodes. This config includes health checks for backend servers and statistics monitoring.

global
    log stdout local0
    chroot /var/lib/haproxy
    stats socket /run/haproxy/admin.sock mode 660 level admin
    stats timeout 30s
    user haproxy
    group haproxy
    daemon

defaults
    mode http
    log global
    option httplog
    option dontlognull
    option redispatch
    retries 3
    timeout connect 5000
    timeout client 50000
    timeout server 50000
    errorfile 400 /etc/haproxy/errors/400.http
    errorfile 403 /etc/haproxy/errors/403.http
    errorfile 408 /etc/haproxy/errors/408.http
    errorfile 500 /etc/haproxy/errors/500.http
    errorfile 502 /etc/haproxy/errors/502.http
    errorfile 503 /etc/haproxy/errors/503.http
    errorfile 504 /etc/haproxy/errors/504.http

frontend web_frontend
    bind *:80
    bind *:443
    default_backend web_servers

backend web_servers
    balance roundrobin
    option httpchk GET /health
    http-check expect status 200
    server web1 203.0.113.20:80 check inter 3000 rise 2 fall 3
    server web2 203.0.113.21:80 check inter 3000 rise 2 fall 3

listen stats
    bind *:8404
    stats enable
    stats uri /stats
    stats refresh 30s
    stats admin if TRUE
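
The rise and fall values on the server lines control how many consecutive check results flip a backend's state. The sketch below replays a sequence of check results to show the effect; simulate_checks is an illustrative helper rather than an HAProxy tool, and it assumes the server starts in the UP state.

```shell
#!/usr/bin/env bash
# Sketch: replay health-check results (1 = pass, 0 = fail) and print the
# server state after each check, using consecutive-result counters.
# simulate_checks is a hypothetical helper name.
simulate_checks() {
    local rise=$1 fall=$2
    shift 2
    local state=UP ok=0 bad=0 result
    local -a states=()
    for result in "$@"; do
        if (( result )); then
            ok=$(( ok + 1 )); bad=0
            if [[ $state == DOWN ]] && (( ok >= rise )); then state=UP; fi
        else
            bad=$(( bad + 1 )); ok=0
            if [[ $state == UP ]] && (( bad >= fall )); then state=DOWN; fi
        fi
        states+=("$state")
    done
    echo "${states[*]}"
}

# rise 2, fall 3: three failures take the server out, two passes bring it back.
simulate_checks 2 3 1 0 0 0 1 1   # prints: UP UP UP DOWN DOWN UP
```

With inter 3000 and fall 3, a dead backend is therefore taken out of rotation after roughly nine seconds, ignoring the time each individual check takes.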

Configure keepalived on the primary node

Configure keepalived on the primary node (203.0.113.10) in /etc/keepalived/keepalived.conf with the higher priority and a health check script that monitors HAProxy. Replace eth0 with the node's actual interface name.

global_defs {
    router_id LB_DEVEL_PRIMARY
    vrrp_skip_check_adv_addr
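    # vrrp_strict is intentionally not set: strict mode does not support the
    # authentication block below and can add firewall rules that block the virtual IP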
    vrrp_garp_interval 0
    vrrp_gna_interval 0
}

vrrp_script chk_haproxy {
    script "/usr/bin/killall -0 haproxy"
    interval 2
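    # Negative weight: a failing check lowers priority by 20 (110 -> 90),
    # which puts the primary below the backup's 100 and triggers failover
    weight -20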
    fall 3
    rise 2
}

vrrp_instance VI_1 {
    state MASTER
    interface eth0
    virtual_router_id 51
    priority 110
    advert_int 1
    authentication {
        auth_type PASS
        auth_pass changeme123
    }
    virtual_ipaddress {
        203.0.113.100
    }
    track_script {
        chk_haproxy
    }
    notify_master "/etc/keepalived/notify_master.sh"
    notify_backup "/etc/keepalived/notify_backup.sh"
    notify_fault "/etc/keepalived/notify_fault.sh"
}

Configure keepalived on the backup node

Configure keepalived on the backup node (203.0.113.11) with lower priority. The configuration is identical except for state, priority, and router_id.

global_defs {
    router_id LB_DEVEL_BACKUP
    vrrp_skip_check_adv_addr
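    # vrrp_strict is intentionally not set: strict mode does not support the
    # authentication block below and can add firewall rules that block the virtual IP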
    vrrp_garp_interval 0
    vrrp_gna_interval 0
}

vrrp_script chk_haproxy {
    script "/usr/bin/killall -0 haproxy"
    interval 2
    # Negative weight: a failing check lowers this node's priority by 20
    weight -20
    fall 3
    rise 2
}

vrrp_instance VI_1 {
    state BACKUP
    interface eth0
    virtual_router_id 51
    priority 100
    advert_int 1
    authentication {
        auth_type PASS
        auth_pass changeme123
    }
    virtual_ipaddress {
        203.0.113.100
    }
    track_script {
        chk_haproxy
    }
    notify_master "/etc/keepalived/notify_master.sh"
    notify_backup "/etc/keepalived/notify_backup.sh"
    notify_fault "/etc/keepalived/notify_fault.sh"
}
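
How the track_script weight interacts with the two priorities can be checked with a little arithmetic. According to the keepalived documentation, a positive weight is added to the priority while the script passes, and a negative weight is applied while it fails. The sketch below models that rule for the priorities used here (110 and 100); effective_priority and elect_master are illustrative helpers, and ties, which VRRP resolves by IP address, are not modeled.

```shell
#!/usr/bin/env bash
# Sketch of how a track_script weight shifts a node's VRRP priority.
# Both function names are hypothetical.

# Args: base priority, script weight, script result (1 = passing, 0 = failing)
effective_priority() {
    local base=$1 weight=$2 ok=$3
    if (( weight > 0 && ok == 1 )); then
        echo $(( base + weight ))   # positive weight: bonus while the check passes
    elif (( weight < 0 && ok == 0 )); then
        echo $(( base + weight ))   # negative weight: penalty while the check fails
    else
        echo "$base"
    fi
}

# Args: weight, primary check result, backup check result
elect_master() {
    local weight=$1 p b
    p=$(effective_priority 110 "$weight" "$2")
    b=$(effective_priority 100 "$weight" "$3")
    if (( p > b )); then
        echo "primary ($p vs $b)"
    else
        echo "backup ($b vs $p)"
    fi
}

elect_master 2 0 1     # HAProxy down on primary, weight 2:   prints "primary (110 vs 102)"
elect_master -20 0 1   # HAProxy down on primary, weight -20: prints "backup (100 vs 90)"
```

The first call shows why a small positive weight cannot move the virtual IP when the priorities are ten apart: the weight's magnitude has to exceed the gap between the two priorities before a failed check changes the election result.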

Create keepalived notification scripts

Create notification scripts on both nodes to log state changes and send alerts when failover occurs.

#!/bin/bash
# Save as /etc/keepalived/notify_master.sh
echo "$(date): Became MASTER" | logger -t keepalived
# Optional: Send email or webhook notification (replace the placeholder URL)
curl -X POST https://your-webhook-url.com/alerts -d "HAProxy cluster: $(hostname) became MASTER"

#!/bin/bash
# Save as /etc/keepalived/notify_backup.sh
echo "$(date): Became BACKUP" | logger -t keepalived
# Optional: Send email or webhook notification (replace the placeholder URL)
curl -X POST https://your-webhook-url.com/alerts -d "HAProxy cluster: $(hostname) became BACKUP"

#!/bin/bash
# Save as /etc/keepalived/notify_fault.sh
echo "$(date): Entered FAULT state" | logger -t keepalived
# Optional: Send critical alert (replace the placeholder URL)
curl -X POST https://your-webhook-url.com/alerts -d "CRITICAL: HAProxy cluster: $(hostname) FAULT"

Make the scripts executable on both nodes:

sudo chmod +x /etc/keepalived/notify_*.sh
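
As an alternative to three nearly identical scripts, keepalived also offers a generic notify hook that calls one script for every transition. The sketch below assumes the documented argument order for that hook ($1 is GROUP or INSTANCE, $2 the instance name, $3 the new state); format_transition is an illustrative helper name.

```shell
#!/usr/bin/env bash
# Sketch: one handler for every state transition.
# Assumes keepalived's generic notify hook passes:
#   $1 = GROUP or INSTANCE, $2 = instance name, $3 = new state
# format_transition is a hypothetical helper.
format_transition() {
    local type=$1 name=$2 state=$3
    case "$state" in
        MASTER) echo "$name ($type): became MASTER, now holds the virtual IP" ;;
        BACKUP) echo "$name ($type): became BACKUP" ;;
        FAULT)  echo "CRITICAL: $name ($type): entered FAULT state" ;;
        *)      echo "$name ($type): unknown state '$state'"; return 1 ;;
    esac
}

# When keepalived invokes the script with its three arguments, log the message.
if (( $# >= 3 )); then
    format_transition "$1" "$2" "$3" | logger -t keepalived
fi
```

If you try this, the script would be referenced from the vrrp_instance block with a single notify line pointing at wherever you save it, in place of the three notify_* lines.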

Configure firewall rules for VRRP

Open the required ports for HAProxy and monitoring, and allow VRRP traffic between the two nodes. VRRP is its own IP protocol (number 112), not a TCP or UDP port, so a port rule does not match it.

# Debian/Ubuntu (ufw)
# If you manage the nodes over SSH, allow SSH before enabling ufw
sudo ufw allow 80/tcp
sudo ufw allow 443/tcp
sudo ufw allow 8404/tcp
# Allow all traffic from each peer node, which includes VRRP advertisements
sudo ufw allow from 203.0.113.10
sudo ufw allow from 203.0.113.11
sudo ufw --force enable

# AlmaLinux/Rocky/RHEL/Fedora (firewalld)
sudo firewall-cmd --permanent --add-port=80/tcp
sudo firewall-cmd --permanent --add-port=443/tcp
sudo firewall-cmd --permanent --add-port=8404/tcp
sudo firewall-cmd --permanent --add-rich-rule="rule family='ipv4' source address='203.0.113.10' protocol value='vrrp' accept"
sudo firewall-cmd --permanent --add-rich-rule="rule family='ipv4' source address='203.0.113.11' protocol value='vrrp' accept"
sudo firewall-cmd --reload

Start and enable services on both nodes

Enable and start HAProxy and keepalived services. Start HAProxy first, then keepalived to ensure proper health check initialization.

sudo systemctl enable --now haproxy
sudo systemctl enable --now keepalived

Configure health checks with monitoring script

Create a more sophisticated health check script that monitors both HAProxy process and port availability.

#!/bin/bash
# Save as /usr/local/bin/check_haproxy.sh

# Check if HAProxy process is running
if ! pgrep -x haproxy > /dev/null; then
    logger -t haproxy-check "HAProxy process not found"
    exit 1
fi

# Check if HAProxy is listening on port 80 (ss is used because netstat is often not installed)
if ! ss -tlnp | grep ':80 ' | grep -q haproxy; then
    logger -t haproxy-check "HAProxy not listening on port 80"
    exit 1
fi

# Check HAProxy stats page; -f fails on HTTP errors, --max-time keeps the check short
if ! curl -sf --max-time 1 http://localhost:8404/stats > /dev/null; then
    logger -t haproxy-check "HAProxy stats page unreachable"
    exit 1
fi

# Success is silent so the log is not flooded every few seconds
exit 0

Make the script executable and update keepalived configuration:

sudo chmod +x /usr/local/bin/check_haproxy.sh

Update the vrrp_script section in /etc/keepalived/keepalived.conf on both nodes:

vrrp_script chk_haproxy {
    script "/usr/local/bin/check_haproxy.sh"
    interval 3
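    # Negative weight: a failing check lowers priority by 20, enough to drop
    # the primary (110) below the backup (100)
    weight -20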
    fall 3
    rise 2
    timeout 2
}

Test automatic failover

Verify initial cluster state

Check which node currently owns the virtual IP and confirm both nodes are running properly.

# Check virtual IP assignment
ip addr show | grep 203.0.113.100

# Check keepalived status
sudo systemctl status keepalived

# Check HAProxy status
sudo systemctl status haproxy

# View keepalived logs
sudo journalctl -u keepalived -f
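
For scripted checks, grepping the full ip addr output can be replaced with an exact comparison of the address field. This is a minimal sketch that reads the one-line-per-address format produced by ip -o -4 addr show; holds_vip is an illustrative helper name.

```shell
#!/usr/bin/env bash
# Sketch: report whether this node currently holds the virtual IP.
# Reads `ip -o -4 addr show` style output on stdin, where the fourth field
# is ADDRESS/PREFIX. holds_vip is a hypothetical helper.
holds_vip() {
    local vip=$1
    awk -v vip="$vip" '
        { split($4, a, "/"); if (a[1] == vip) found = 1 }
        END { exit (found ? 0 : 1) }'
}

# Usage on a live node:
#   ip -o -4 addr show | holds_vip 203.0.113.100 && echo "holds VIP" || echo "no VIP"
```

The exit status makes it easy to use in a monitoring check or a cron job that alerts when neither node, or both nodes, report the address.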

Test HAProxy failover

Stop HAProxy on the primary node to trigger automatic failover to the backup node.

# Stop HAProxy to trigger failover
sudo systemctl stop haproxy

# Watch keepalived logs for state change
sudo journalctl -u keepalived -f

On the backup node, verify it became the new master:

# Check if this node now has the virtual IP
ip addr show | grep 203.0.113.100

# Check keepalived logs
sudo journalctl -u keepalived -f

Test service recovery

Restart HAProxy on the original primary node. Because it has the higher priority and keepalived preempts by default, it should reclaim the MASTER role and the virtual IP once its health check passes again.

# Start HAProxy again
sudo systemctl start haproxy

# Check current state
sudo journalctl -u keepalived --since "1 minute ago"
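
To put a number on how long a failover interrupts traffic, you could probe the virtual IP from a client machine while stopping HAProxy on the primary. The sketch below counts failed probes; count_failures is an illustrative helper, and the probe can be any command that exits 0 on success.

```shell
#!/usr/bin/env bash
# Sketch: run a probe command repeatedly and count how many attempts fail.
# Args: number of attempts, delay between attempts in seconds, probe command...
# count_failures is a hypothetical helper name.
count_failures() {
    local attempts=$1 delay=$2
    shift 2
    local i failed=0
    for (( i = 0; i < attempts; i++ )); do
        "$@" > /dev/null 2>&1 || failed=$(( failed + 1 ))
        sleep "$delay"
    done
    echo "$failed"
}

# Usage from a client while triggering the failover:
#   count_failures 60 0.5 curl -fsS --max-time 1 http://203.0.113.100/
```

Multiplying the result by the delay gives a rough estimate of the outage window; the actual figure also depends on the probe timeout and on how quickly neighbors update their ARP caches.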

Monitor cluster health

Set up monitoring commands to track cluster status and performance. For comprehensive monitoring, consider integrating with Prometheus and Grafana.

#!/bin/bash
# Save as /usr/local/bin/cluster-status.sh

echo "=== HAProxy Cluster Status ==="
echo "Date: $(date)"
echo

# Check which node has VIP
echo "Virtual IP Status:"
ip addr show | grep -A 1 -B 1 203.0.113.100 || echo "Virtual IP not found on this node"
echo

# Check keepalived state
echo "Keepalived State:"
sudo systemctl is-active keepalived
echo

# Check HAProxy state
echo "HAProxy State:"
sudo systemctl is-active haproxy
echo

# Show recent keepalived events
echo "Recent Keepalived Events:"
sudo journalctl -u keepalived --since "10 minutes ago" --no-pager
echo

# Show HAProxy stats (the ;csv suffix returns CSV: field 2 is the server name, field 18 its status)
echo "HAProxy Backend Status:"
stats=$(curl -sf "http://localhost:8404/stats;csv") || echo "Stats unavailable"
echo "$stats" | grep -E "web1|web2" | awk -F, '{print $2 ": " $18}'
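
HAProxy can export the stats page as CSV when ;csv is appended to the stats URI, which is easier to parse than the HTML page. The sketch below filters that CSV by backend name; it assumes the usual column layout (field 1 proxy name, field 2 server name, field 18 status), and server_status is an illustrative helper name.

```shell
#!/usr/bin/env bash
# Sketch: summarize per-server status from HAProxy's CSV stats export.
# Assumes field 1 = proxy name, field 2 = server name, field 18 = status.
# server_status is a hypothetical helper; it reads the CSV on stdin.
server_status() {
    local backend=$1
    awk -F, -v be="$backend" '
        $1 == be && $2 != "FRONTEND" && $2 != "BACKEND" { print $2 ": " $18 }'
}

# Usage on a live node:
#   curl -s "http://localhost:8404/stats;csv" | server_status web_servers
```

Matching on the backend name rather than on web1 or web2 means the summary keeps working when servers are added to or removed from the backend.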

Verify your setup

# Test virtual IP responds
curl -I http://203.0.113.100

# Check HAProxy stats
curl http://203.0.113.100:8404/stats

# Check which node holds the virtual IP (run on each node)
sudo ip addr show | grep 203.0.113.100

# Check service status on both nodes
sudo systemctl status haproxy keepalived

# View cluster communication
sudo tcpdump -i any vrrp

Common issues

Symptom	Cause	Fix
Split-brain (both nodes master)	Firewall blocks VRRP traffic	Open protocol 112 between nodes: `sudo ufw allow from peer_ip`
Virtual IP not accessible	Interface binding issues	Check interface name in config matches: `ip link show`
Failover not happening	Health check script failing	Test script manually: `/usr/local/bin/check_haproxy.sh`
Authentication errors in logs	Mismatched VRRP passwords	Ensure auth_pass identical on both nodes
VRRP instance flapping	Network latency or packet loss	Increase advert_int to 3 seconds in config
Backend servers show as down	Health check URL not responding	Verify `/health` endpoint exists on backends

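Security note: Change the default VRRP authentication password from 'changeme123' to a strong password in production. Use the same password on both nodes for proper cluster communication. keepalived uses only the first eight characters of auth_pass, and the PASS type sends it in clear text, so treat it as protection against misconfiguration rather than as strong security.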

Next steps

Configure SSL termination with HAProxy for HTTPS traffic handling
Implement rate limiting and DDoS protection for security hardening
Set up comprehensive HAProxy monitoring with metrics and alerting
Configure advanced health checks and circuit breakers for resilient load balancing

Automated install script

Run this to automate the entire setup

install.sh

#!/usr/bin/env bash

set -euo pipefail

# HAProxy High Availability with Keepalived Setup Script
# Usage: ./install.sh <role> <primary_ip> <backup_ip> <vip> [backend1_ip] [backend2_ip]
# Role: primary or backup

# Color codes
RED='\033[0;31m'
GREEN='\033[0;32m'
YELLOW='\033[1;33m'
NC='\033[0m'

log() { echo -e "${GREEN}[INFO]${NC} $1"; }
warn() { echo -e "${YELLOW}[WARN]${NC} $1"; }
error() { echo -e "${RED}[ERROR]${NC} $1" >&2; }

usage() {
    cat << EOF
Usage: $0 <role> <primary_ip> <backup_ip> <vip> [backend1_ip] [backend2_ip]

Parameters:
  role         - Node role: 'primary' or 'backup'
  primary_ip   - IP address of primary HAProxy node
  backup_ip    - IP address of backup HAProxy node
  vip          - Virtual IP address for failover
  backend1_ip  - First backend server IP (default: 192.168.1.20)
  backend2_ip  - Second backend server IP (default: 192.168.1.21)

Example:
  $0 primary 203.0.113.10 203.0.113.11 203.0.113.100 203.0.113.20 203.0.113.21
EOF
    exit 1
}

cleanup() {
    error "Installation failed. Performing cleanup..."
    systemctl stop haproxy keepalived 2>/dev/null || true
    exit 1
}

trap cleanup ERR

# Parameter validation
[[ $# -lt 4 || $# -gt 6 ]] && usage
[[ ! "$1" =~ ^(primary|backup)$ ]] && { error "Role must be 'primary' or 'backup'"; usage; }

ROLE="$1"
PRIMARY_IP="$2"
BACKUP_IP="$3"
VIP="$4"
BACKEND1_IP="${5:-192.168.1.20}"
BACKEND2_IP="${6:-192.168.1.21}"

# Check if running as root
[[ $EUID -ne 0 ]] && { error "This script must be run as root"; exit 1; }

# Detect distribution
if [[ -f /etc/os-release ]]; then
    . /etc/os-release
    case "$ID" in
        ubuntu|debian)
            PKG_MGR="apt"
            PKG_UPDATE="apt update"
            PKG_INSTALL="apt install -y"
            PKG_UPGRADE="apt upgrade -y"
            HAPROXY_CONFIG="/etc/haproxy/haproxy.cfg"
            KEEPALIVED_CONFIG="/etc/keepalived/keepalived.conf"
            NETWORK_INTERFACE=$(ip route | grep default | awk '{print $5}' | head -1)
            ;;
        almalinux|rocky|centos|rhel|ol|fedora)
            PKG_MGR="dnf"
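            # makecache refreshes metadata and exits 0; "check-update || true" stored in a
            # variable would pass "||" and "true" to dnf as arguments
            PKG_UPDATE="dnf makecache"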
            PKG_INSTALL="dnf install -y"
            PKG_UPGRADE="dnf upgrade -y"
            HAPROXY_CONFIG="/etc/haproxy/haproxy.cfg"
            KEEPALIVED_CONFIG="/etc/keepalived/keepalived.conf"
            NETWORK_INTERFACE=$(ip route | grep default | awk '{print $5}' | head -1)
            ;;
        amzn)
            PKG_MGR="yum"
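            # makecache refreshes metadata and exits 0; "check-update || true" stored in a
            # variable would pass "||" and "true" to yum as arguments
            PKG_UPDATE="yum makecache"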
            PKG_INSTALL="yum install -y"
            PKG_UPGRADE="yum upgrade -y"
            HAPROXY_CONFIG="/etc/haproxy/haproxy.cfg"
            KEEPALIVED_CONFIG="/etc/keepalived/keepalived.conf"
            NETWORK_INTERFACE=$(ip route | grep default | awk '{print $5}' | head -1)
            ;;
        *)
            error "Unsupported distribution: $ID"
            exit 1
            ;;
    esac
else
    error "Cannot detect distribution"
    exit 1
fi

log "[1/8] Updating system packages..."
$PKG_UPDATE
$PKG_UPGRADE

log "[2/8] Installing HAProxy and keepalived..."
$PKG_INSTALL haproxy keepalived

log "[3/8] Configuring kernel parameters..."
cat > /etc/sysctl.d/99-haproxy-keepalived.conf << 'EOF'
net.ipv4.ip_forward = 1
net.ipv4.ip_nonlocal_bind = 1
EOF
sysctl -p /etc/sysctl.d/99-haproxy-keepalived.conf

log "[4/8] Configuring HAProxy..."
cp "$HAPROXY_CONFIG" "${HAPROXY_CONFIG}.backup"
cat > "$HAPROXY_CONFIG" << EOF
global
    log stdout local0
    chroot /var/lib/haproxy
    stats socket /run/haproxy/admin.sock mode 660 level admin
    stats timeout 30s
    user haproxy
    group haproxy
    daemon

defaults
    mode http
    log global
    option httplog
    option dontlognull
    option redispatch
    retries 3
    timeout connect 5000
    timeout client 50000
    timeout server 50000

frontend web_frontend
    bind *:80
    bind *:443
    default_backend web_servers

backend web_servers
    balance roundrobin
    option httpchk GET /
    http-check expect status 200
    server web1 $BACKEND1_IP:80 check inter 3000 rise 2 fall 3
    server web2 $BACKEND2_IP:80 check inter 3000 rise 2 fall 3

listen stats
    bind *:8404
    stats enable
    stats uri /stats
    stats refresh 30s
    stats admin if TRUE
EOF
chown root:root "$HAPROXY_CONFIG"
chmod 644 "$HAPROXY_CONFIG"

log "[5/8] Creating keepalived notification scripts..."
mkdir -p /etc/keepalived/scripts
cat > /etc/keepalived/scripts/notify_master.sh << 'EOF'
#!/bin/bash
echo "$(date): Becoming MASTER" >> /var/log/keepalived-notifications.log
EOF

cat > /etc/keepalived/scripts/notify_backup.sh << 'EOF'
#!/bin/bash
echo "$(date): Becoming BACKUP" >> /var/log/keepalived-notifications.log
EOF

cat > /etc/keepalived/scripts/notify_fault.sh << 'EOF'
#!/bin/bash
echo "$(date): Entering FAULT state" >> /var/log/keepalived-notifications.log
EOF

chmod 755 /etc/keepalived/scripts/*.sh

log "[6/8] Configuring keepalived for $ROLE node..."
if [[ "$ROLE" == "primary" ]]; then
    PRIORITY=110
    STATE="MASTER"
else
    PRIORITY=100
    STATE="BACKUP"
fi

cat > "$KEEPALIVED_CONFIG" << EOF
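global_defs {
    router_id LB_DEVEL_${ROLE^^}
    vrrp_skip_check_adv_addr
    # vrrp_strict is intentionally not set: strict mode does not support the
    # authentication block below and can add firewall rules that block the virtual IP
    vrrp_garp_interval 0
    vrrp_gna_interval 0
}

vrrp_script chk_haproxy {
    script "/usr/bin/killall -0 haproxy"
    interval 2
    # Negative weight: a failing check lowers priority by 20, enough to drop
    # the primary (110) below the backup (100)
    weight -20
    fall 3
    rise 2
}

vrrp_instance VI_1 {
    state $STATE
    interface $NETWORK_INTERFACE
    virtual_router_id 51
    priority $PRIORITY
    advert_int 1
    authentication {
        auth_type PASS
        # Export VRRP_AUTH_PASS (an assumed variable name) with the same value on both
        # nodes before running; the random fallback will not match between nodes
        auth_pass ${VRRP_AUTH_PASS:-$(openssl rand -base64 6)}
    }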
    virtual_ipaddress {
        $VIP
    }
    track_script {
        chk_haproxy
    }
    notify_master "/etc/keepalived/scripts/notify_master.sh"
    notify_backup "/etc/keepalived/scripts/notify_backup.sh"
    notify_fault "/etc/keepalived/scripts/notify_fault.sh"
}
EOF
chown root:root "$KEEPALIVED_CONFIG"
chmod 644 "$KEEPALIVED_CONFIG"

log "[7/8] Enabling and starting services..."
systemctl enable haproxy keepalived
systemctl restart haproxy keepalived

log "[8/8] Verifying installation..."
sleep 5

# Check if services are running
if systemctl is-active --quiet haproxy; then
    log "✓ HAProxy is running"
else
    error "✗ HAProxy failed to start"
    exit 1
fi

if systemctl is-active --quiet keepalived; then
    log "✓ Keepalived is running"
else
    error "✗ Keepalived failed to start"
    exit 1
fi

# Check if ports are listening
if ss -tlnp | grep -q ":80 "; then
    log "✓ HAProxy listening on port 80"
else
    warn "⚠ HAProxy not listening on port 80"
fi

if ss -tlnp | grep -q ":8404 "; then
    log "✓ HAProxy stats available on port 8404"
else
    warn "⚠ HAProxy stats not available on port 8404"
fi

log "HAProxy High Availability setup completed successfully!"
log "Role: $ROLE"
log "Virtual IP: $VIP"
log "Backend servers: $BACKEND1_IP, $BACKEND2_IP"
log "Stats URL: http://$VIP:8404/stats"
warn "Make sure to configure the same keepalived authentication password on both nodes"
warn "Verify backend servers are accessible before putting into production"

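Review the script before running. Execute it as root on each node with the role and addresses as arguments, for example: sudo bash install.sh primary 203.0.113.10 203.0.113.11 203.0.113.100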
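
The script checks the number of arguments and the role, but it writes the IP addresses into the configuration files without validating them. A check along the following lines could be added before the files are generated. It is a sketch; is_ipv4 is an illustrative helper and not part of the script above.

```shell
#!/usr/bin/env bash
# Sketch: validate a dotted-quad IPv4 address before writing it into a config.
# is_ipv4 is a hypothetical helper name.
is_ipv4() {
    local ip=$1 octet
    local re='^[0-9]{1,3}(\.[0-9]{1,3}){3}$'
    local -a octets
    [[ $ip =~ $re ]] || return 1
    IFS=. read -r -a octets <<< "$ip"
    for octet in "${octets[@]}"; do
        (( 10#$octet <= 255 )) || return 1   # 10# forces base 10 for values like "08"
    done
    return 0
}

# Example: validate each address argument before using it
for candidate in 203.0.113.10 203.0.113.100 203.0.113.256; do
    if is_ipv4 "$candidate"; then
        echo "$candidate: valid"
    else
        echo "$candidate: invalid"
    fi
done
```

In the install script this would be called for PRIMARY_IP, BACKUP_IP, VIP, and the backend addresses right after they are assigned, exiting through usage on the first invalid value.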
