ESXi host takes more than 60 seconds to fail over the path when a storage processor reboots on an EMC Unity storage array

Article ID: 332682

Updated On:

Products

VMware vSphere ESXi

Issue/Introduction

During a Storage Processor (SP) reboot on an EMC Unity storage array, an ESXi host can take more than 60 seconds to fail over the path to the peer SP.

Cause

The ESXi host probes and iterates through all the paths before it finds the active path and performs a path failover. Internally, the host uses MODE SENSE and RTPG (Report Target Port Groups) requests to probe the paths, and each of these requests has a 10-second time-out. During an SP reboot, the host gets no response from the target while probing a path and waits for the commands to time out. The time-out for each path is therefore around 20 seconds (10 seconds each for the MODE SENSE and RTPG requests).
 
The overall delay is directly proportional to the number of paths per SP. For example, with 4 paths per SP, the overall failover time is double that with 2 paths per SP. Hitting a long time-out is unlikely, because the SP will force its link down, which triggers Registered State Change Notifications (RSCN) across the fabric, and the ESXi host then fails over immediately.
 
Note: The impact is seen only if there is active I/O in progress during the SP reboot.
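
The arithmetic behind the delay can be sketched as follows. This is only a rough estimate, not a description of the actual ESXi implementation: it assumes that every path to the rebooting SP is probed one after another and that both probe commands run to their full 10-second time-out on each path. The function and constant names are hypothetical and chosen for illustration.

```python
# Per-command time-outs, in seconds, as stated in the Cause section.
MODE_SENSE_TIMEOUT_S = 10
RTPG_TIMEOUT_S = 10


def estimate_failover_delay(paths_per_sp: int) -> int:
    """Estimate the worst-case failover delay in seconds.

    Assumption (not confirmed by the article): paths to the rebooting SP
    are probed sequentially, and both the MODE SENSE and the RTPG request
    time out on every one of them.
    """
    if paths_per_sp < 0:
        raise ValueError("paths_per_sp must not be negative")
    per_path_timeout = MODE_SENSE_TIMEOUT_S + RTPG_TIMEOUT_S  # about 20 s per path
    return paths_per_sp * per_path_timeout


if __name__ == "__main__":
    for paths in (2, 4):
        delay = estimate_failover_delay(paths)
        print(f"{paths} paths per SP: about {delay} seconds")
```

Under these assumptions, doubling the number of paths per SP doubles the estimated delay, which matches the proportionality described above.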

Resolution

This is a known issue affecting ESXi hosts and EMC Unity storage arrays.
 
Currently, there is no resolution. However, EMC and VMware engineering teams are diligently working together on a solution.
 
To work around this issue, use these options:

Notes: