BGP Route Update Delay During HA Pair When Multiple BGP Peers Configured With Graceful restart Enabled

Symptom

Environment

Topology

Topology

Cause

Explained in examples below. 

Scenario 1

HA Link-monitor on FW1 detects both ports 1/9 and 1/10 down. It triggers fail-over.

BGP peering on FW2 on take-over initiates/receives BGP open from 1/10 will wait for link 1/9 up before it sends BGP update messages to all its peers( in this case 1/10 will wait unit 1/9 comes up). If port 1/9 dose not come up unit graceful restart time expire it will BGP update message to 1/10.

This is expected behavior. BGP waits for all its configured peers to be in open state before send update messages. Though this appears to be delay in route updates due to HA fail-over. This is a overlap of BGP convergence on a PA unit with HA fail-over and not a BGP update delay due to HA fail-over.

Scenario 2

User-added image


BGP waits for all its peers to be reachable before it sends update message. Once grace timer reaches. It ignores the unreachable peer and send updates to rest of the peers. This is to avoid excess update messages which consume more resource.

Resolution

  1. This is expected behavior. BGP waits for all its configured peers to be in open state before send update messages.

  2.  Though this appears to be delay in route updates due to HA fail-over, This is a overlap of BGP convergence on a PA unit with HA fail-over and not a BGP update delay due to HA fail-over.

  3. The Graceful restart timers can be adjusted (lowered) to reduce the time it takes for an update to be sent.

  4. While lowering the graceful restart timers can speed up convergence in this scenario, how quickly the update can be sent is still reliant on the time it takes for BGP peering and RIB update to complete.