Attempting to place a host in maintenance mode does not succeed on clusters where the hosts have GPUs configured.
The vpxd logs show the following error:
[Originator@6876 sub=VmCheck item=DetectAndFixPolicyViolations opID=WorkQueue-185c9f38] CompatCheck results: (vim.vm.check.Result) [
--> (vim.vm.check.Result) {
--> vm = 'vim.VirtualMachine:',
--> host = 'vim.HostSystem:8',
--> warning = (vmodl.MethodFault) [
--> (vim.fault.MigrationFault) {
--> faultMessage = (vmodl.LocalizableMessage) [
--> (vmodl.LocalizableMessage) {
--> key = "com.vmware.vim.vpxd.vmcheck.vgpuRelocateOrCloneWarning",
--> }
--> ],
--> msg = ""
--> },
--> (vim.fault.MigrationFault) {
--> faultMessage = (vmodl.LocalizableMessage) [
--> (vmodl.LocalizableMessage) {
--> key = "com.vmware.vim.vpxd.vmcheck.vgpuRelocateOrCloneWarning",
--> }
--> ],
--> msg = ""
--> }
--> ],
--> error = (vmodl.MethodFault) [
--> (vim.fault.MigrationFeatureNotSupported) {
--> faultMessage = (vmodl.LocalizableMessage) [
--> (vmodl.LocalizableMessage) {
--> key = "com.vmware.vim.vpxd.vmcheck.vgpuMigrateNotSupported",
--> }
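To confirm that a log contains this signature, the message keys from the excerpt above can be searched for directly. The following is a minimal Python sketch, not an official tool. The function names are hypothetical, and the log location mentioned in the usage note is an assumption that should be verified for your vCenter Server version.

```python
import re
from collections import Counter

# Matches the vGPU-related message keys seen in the excerpt above, e.g.
# com.vmware.vim.vpxd.vmcheck.vgpuMigrateNotSupported
VGPU_KEY_PATTERN = re.compile(
    r'key = "(com\.vmware\.vim\.vpxd\.vmcheck\.vgpu\w+)"'
)

# The key reported in the error list of the compatibility check.
NOT_SUPPORTED_KEY = "com.vmware.vim.vpxd.vmcheck.vgpuMigrateNotSupported"


def count_vgpu_check_keys(log_text: str) -> Counter:
    """Count each vGPU compatibility-check message key found in the log text."""
    return Counter(VGPU_KEY_PATTERN.findall(log_text))


def migration_blocked(log_text: str) -> bool:
    """Return True if the log text contains the vgpuMigrateNotSupported key."""
    return count_vgpu_check_keys(log_text)[NOT_SUPPORTED_KEY] > 0


def scan_file(path: str) -> Counter:
    """Read a log file (hypothetical helper) and count the vGPU message keys."""
    with open(path, encoding="utf-8", errors="replace") as fh:
        return count_vgpu_check_keys(fh.read())
```

For example, `scan_file("/var/log/vmware/vpxd/vpxd.log")` would return the per-key counts for that file, assuming the log is at that path. The pattern works on both the multi-line log format and a flattened single-line copy, because it only looks at the `key = "..."` fragments.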
vSphere Kubernetes Service
VMware Private AI Foundation with NVIDIA
This is by design. NVIDIA disables vMotion support when Unified Virtual Memory (UVM) is enabled.
There are two options, depending on workload requirements:
1. Create a Custom VM Class for NVIDIA vGPU Devices
The steps are described in "Create a Custom VM Class for NVIDIA vGPU Devices".
2. Continue using passthrough (DirectPath I/O), but disable the vMotion dependency
SDDC KB for a similar issue: 409562
NVIDIA documentation on UVM
Reference: NVIDIA AI Enterprise User Guide