VMware vSphere
In a vSphere environment, the hypervisor's role is to present the GPU hardware or vGPU profile to the virtual machine. Once the Guest OS identifies the PCI device, the vSphere layer has fulfilled its requirement. The actual allocation, scheduling, and utilization of the GPU are controlled entirely by the Guest OS and the internal NVIDIA drivers.
Because the vSphere layer is functioning as designed by presenting the hardware, further troubleshooting must be performed within the guest software stack.
For instructions on verifying or configuring the host-level setup, refer to: Installing and configuring the NVIDIA VIB on ESXi (367541).