VMware Host Isolation Response in a Nutanix Environment #NoSAN

I was recently discussing the Nutanix solution with a friend of mine and fellow VCDX, Michael Webster (@vcdxnz001) and he asked what the recommended Host Isolation Response is for Nutanix.

At this stage I must advise there is no formal recommendation, but an Official vSphere on Nutanix Best Practice guide is in the works and will be released asap.

Back to my conversation with Michael, Being that Nutanix is an IP Storage solution, my initial feeling is that Host isolation Response should be set to “Shutdown”, but I didn’t go into any more detail with Michael, so I thought it best to post a quick explanation.

This post also assumes basic knowledge of vSphere as well as the Nutanix platform, for those of you who are not familiar with Nutanix please review the following links prior to reading the remainder of this post.

About Nutanix | How Nutanix Works | 8 Strategies for a Modern Datacenter

So back on topic, in other posts I have written for IP Storage, such as (Example Architectural Decision – Host Isolation Response for IP Storage) I have concluded that “Shutdown” was the most suitable setting and recommended specifying isolation addresses of the NAS controllers.

But as Nutanix changes the game and has one virtual storage controller per ESXi host, so does this change the recommendation?

In short, No, but for those who are interested, here is why.

If we leave the default isolation address, (being the default gateway for ESXi Management), in the event the gateway is down, it will trigger an isolation response even if the rest of the network is operating fine, thus an unnecessary outage would occur.

If we configure das.isolationaddress1 & 2 with the Management IP address of any two Nutanix Controller VMs (192.168.1.x , 192.168.1.y in my below diagram) then an isolation response will only be triggered if both Nutanix Controller VMs (CVMs) are not responding, in which case, the VMs should be Shutdown as the Nutanix cluster may not be function properly with two Controllers offline concurrently as its configured by default for N+1 (or replication factor of “2” in Nutanix speak).

The below is a high level example of the above configuration.

NutanixHostIsolation

Related Articles

1. Example Architectural Decision – Host Isolation Response for a Nutanix Environment

2. Storage DRS and Nutanix – To use, or not to use, that is the question?

3. VMware HA and IP Storage

4 thoughts on “VMware Host Isolation Response in a Nutanix Environment #NoSAN

    • Hi Luca,

      Your correct, if local access to the CVM is lost, Autopath will redirect to another CVM. This is part of the reason configuring the das.isolationaddress1 and 2 to point to other CVMs and setting isolation response to “Shutdown” is important.

      In the event the isolation addresses both cannot be reached, (ie: The ESXi host cant connect to two other CVMs) the CVM will likely not be able to function correctly, in this scenario, with isolation response set to “Shutdown” the host will shutdown the VMs which is important to allow them to be restarted on hosts which are not isolated (if the isolation only impacts one host) and to prevent “split brain” once the isolated host connectivity is restored.

      If the isolation addresses can be reached, then we dont have an issue and autopath shouldnt kick in unless we’re performing maintenance on the local CVM or there is an outage/failure of the CVM, but this isn’t strictly related to isolation addresses/response which is what this post is discussing, although I should write a post explaining the resiliency of the Nutanix solution as its complimentary to this post.

      Thanks for the comment.

      • Ok, got it.
        So to check if I understood correctly, in a 3-nodes Nutanix block, ESXi-1 will have isolation addresses of CVM-2 and CVM-3 for example, on ESXi-2 CVM-1 and CVM-3, and on ESXi-3 the ip of CVM-1 and CVM-2, right?

        Thanks again, Luca.

  1. Pingback: Nutanix Platform Link-O-Rama | vcdx133.com

Leave a Reply to Josh OdgersCancel reply