infoblox_nios:high_availability
Differences
This shows you the differences between two versions of the page.
| Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
| infoblox_nios:high_availability [2023/12/01 15:08] – [High Availability] bstafford | infoblox_nios:high_availability [2025/12/15 15:30] (current) – bstafford | ||
|---|---|---|---|
| Line 3: | Line 3: | ||
| Documentation [[https:// | Documentation [[https:// | ||
| [[https:// | [[https:// | ||
| - | ===== HA failover on DNS Nameservers ===== | ||
| + | General blog article [[https:// | ||
| + | ===== Changing HA Pair Types ===== | ||
| + | Cutting over from HA physical to HA virtual. When I cut the passive to vNIOS, it did not change the Member Type to Virtual NIOS. After I cut the second member of the HA pair, the Member Type changed to Virtual NIOS without intervention. | ||
| + | ===== DFP ===== | ||
| + | When using DFP, NIOS uses the LAN1 port to establish DoT on TCP-443 to Infoblox Anycast. This is true EVEN IF THE NIOS is HA. NIOS will not use the HA VIP for TCP-443. However, any plaintext queries will come from the HA VIP. | ||
| + | ===== LACP ===== | ||
| + | NIOS does not support LACP. In addition, for the bonding of LAN1/LAN2, NIOS only supports mode 1 (active-backup) bonding. Only one NIC will be " | ||
| + | ===== Cloud Support ===== | ||
| + | As of NIOS 9.0.4, HA in AWS, Azure and GCP are supported. | ||
| + | |||
| + | * HA is not supported on only Azure TE-926 appliance because the underlying Azure VM doesn' | ||
| + | * HA is not supported on Azure/GCP TE-825 appliance because the underlying Azure/GCP VM doesn' | ||
| + | |||
| + | Documentation on [[https:// | ||
| + | ===== Change IP Settings ===== | ||
| + | If you edit the subnet mask or default gateway of the VIP or either of the HA ports or either of the LAN ports of a HA pair, both members will do a product restart (not full reboot) at the same time when you save your changes. | ||
| + | |||
| + | You can edit the MGMT interface of one none in a HA pair. It will reboot that node but not the other node of the HA pair. | ||
| + | |||
| + | ===== Make Standalone ===== | ||
| + | If you take a HA member and make it standalone, the active appliance will make the LAN1 interface IP be set to the current HA VIP address. If MGMT is used, that will stay the same. The device will then reboot. | ||
| + | |||
| + | The other device will keep its LAN1 and MGMT IP address and also its DNS name and also its local admin accounts but will be made into a standalone device. | ||
| + | |||
| + | ===== Proximity ===== | ||
| + | NIOS HA pairs are designed to be deployed next to each other in adjacent racks. Deploying a HA pair over two separate sites (i.e. between two DC/data centers) connected with dark fibre is not supported. It may well work but it is bad practice because of the risk of split-brain should anything happen to the fibre. | ||
| + | |||
| + | e.g. examples of fibre cuts. | ||
| + | * 2025-09-21 [[https:// | ||
| + | |||
| + | As per [[https:// | ||
| + | ===== HA Failover on DNS Nameservers ===== | ||
| + | |||
| + | From the [[https:// | ||
| + | # | ||
| When an HA failover occurs on NIOS, there is an approximate 4-5 second time interval in which the network is adjusted for the new active node and the new passive node. During this failover period, the active node becomes unresponsive. After the new active node comes up on the network, | When an HA failover occurs on NIOS, there is an approximate 4-5 second time interval in which the network is adjusted for the new active node and the new passive node. During this failover period, the active node becomes unresponsive. After the new active node comes up on the network, | ||
| Line 15: | Line 49: | ||
| e.g. If LAN1 is for production and LAN2 is for OOB network, if LAN2 on the active node fails, there is no failover and the OOB network looses access to services on LAN2. | e.g. If LAN1 is for production and LAN2 is for OOB network, if LAN2 on the active node fails, there is no failover and the OOB network looses access to services on LAN2. | ||
| + | ===== NSX ===== | ||
| + | The only time I saw a customer deploy NIOS HA on NSX, they had to bypass NSX and expose the VM to ESXi directly because they couldn' | ||
| + | * port group is on NSX " | ||
| + | * "MAC address changes" | ||
| + | * " | ||
| + | |||
| + | Without " | ||
| + | |||
| + | ===== KB Article ===== | ||
| + | * When does an HA failover occur? [[https:// | ||
| + | * High Availability (HA) and network usage [[https:// | ||
| + | |||
| + | ===== HA Priority ===== | ||
| + | "The priority is based on system status and events" | ||
| + | |||
| + | Please refer to the following different start-up behaviors for more information on VRRP priorities. | ||
| + | * Both starting at same time and both previously non-active. Both nodes would wait 3 to 12 seconds (depending on arping) of listening and if nothing received, they would go active with priority penalty of 1. | ||
| + | If both go active, they will both sending VRRP advertisements on their HA ports using the VRRP shared mac address and their priorities will keep increase or decrease depending on whether they are receiving VRRP packets on their LAN port. If one of the two nodes has been active for any longer than the other, its priorities will stay bigger than their partner' | ||
| + | * Both starting at same time and both previously active. Similar to A, except they wait 3 to 9 seconds and would have a priority penalty of 171. If both nodes end of restarting due to dual active conditions, they would both have been previously passive, which would be case A. | ||
| + | * Both starting at same time - one previously active, other previously non-active. Previously active node would go active after 3 to 9 seconds with a priority penalty of 171. The passive node would have 3 seconds (after the initial 0-9) to get an advertisement. If it doesn' | ||
| + | * Node starting when other node is active and the starting node was previously active. Assuming that the running node is up and healthy, it will be active with priority 180 - so the joining node will listen for 3 to 9 seconds - and if it receives a VRRP advertisement within this time - it will go passive. If not, it will go active and its priority set to 171 - so if it gets an advertisement from the other node within the next 10 seconds - it will go passive - and we will have case E. If not, we will get to dual active. | ||
| + | * Node starting when other node is active and the starting node was previously passive. Assuming that the running node is up and healthy, it will be active with priority 180 - so the joining node will listen for 3 to 12 seconds - and if it receives a VRRP advertisement within this time - it will go passive. If not, it will go active and its priority will be 1 - so if it gets an advertisement from the other node within the next 3 minutes - it will go passive - and we will have case E (again). If not, we will get to dual active. | ||
infoblox_nios/high_availability.1701443316.txt.gz · Last modified: by bstafford
