Symptoms:
When a vSAN witness node is managed by a VLCM cluster image and there is a HMS appliance configured in the VC, then we could see the below errors due to a race condition where vLCM and HMS both try to perform some lifecycle operations on the witness hosts simultaneously.
VLCM ESXi upgrade fails with "A general system error occurred: After host 'xxxxxxxxxx' remediation was completed, the compliance check reported the host as 'non-compliant'. The image on the host does not match the image set for the cluster."
When further checking esxupdate.log and lifecycle.log, two types of error messages can be seen:
VMware vCenter Server 8.0.x
VMware vCenter Server 8.0.1
The root cause of this issue is a race condition between the vLCM upgrade and the HMS appliance performing a VIB push (in order to install the hbr-agent VIB) at the same time. HMS appliance is trying to make this push and install hbr-agent VIB by itself since HMS is not yet integrated with vLCM.
Ideally, this should not happen, as the HMS appliance should not be automatically pushing VIBs onto the vSAN witness host that is managed by vLCM. For hosts that are managed by vLCM, the expectation is that the user would need to manually install the hbr-agent VIB (as opposed to the HMS appliance automatically doing this install).
Workaround:
Follow the below steps to resolve the issue: