Unable to install, activate or deploy Private AI Foundation (PAIF) through PAIF QuickStart.
While connected to the vCenter Appliance shell, the following symptoms are observed:
dcli> com vmware vcenter namespacemanagement supervisors summary list
items:
- supervisor: <supervisor-ID>
dcli> com vmware vcenter namespacemanagement supervisors zone bindings list --supervisor <supervisor ID>
- zone: <cluster-ID>
marked_for_removal: False
resource_allocation:
vm_reservations:
- reserved_vm_class: <custom vmclass>
count: #
messages:
- severity: INFO
details:
error reconciling Zone reservation; the failed operation will be retried: the cpu reservation value specified in the config spec )('0') is invalid
type: MANAGEMENT
status: ERROR
error reconciling Zone reservation; the failed operation will be retried: the memory reservation value specified in the config spec )('0') is invalid
VMware Private AI Foundation (PAIF)
vCenter 9.0.2
This is caused by product limitation and guardrails to prevent the creation of a PAIF cluster associated with a reserved vmclass that does not have any reservations defined for CPU or MEM.
PAIF does not allow for updating the configuration of a reserved vmclass that is being used by a Supervisor cluster.
Configure a reserved vmclass appropriately for use with PAIF.
dcli> com vmware vcenter namespacemanagement supervisors summary list
dcli> com vmware vcenter namespacemanagement supervisors zones bindings list --supervisor <supervisor-ID>dcli> com vmware vcenter namespacemanagement supervisors zones bindings update --supervisor <supervisor-id> --resource-allocation-vm-reservations '[{"reserved_vm_class": "<reserved vmclass name>", "count": <count #>}]' --zone <zone-id>This step is written with the intentions to replace the incorrectly configured vmclass with the newly created vmclass.
For multiple reserved vmclass entries, see the below example:
'[{"reserved_vm_class": "<vmclass A>", "count": <count #>}, {"reserved_vm_class": "<vmclass B>", "count": <count #>}]'
dcli> com vmware vcenter namespacemanagement supervisors zone bindings list --supervisor <supervisor ID>
error reconciling zone reservation; the failed operation will be retried: no user-created directpath profile exists for the accelerator device in the VM configSpecSee Step 1 of this KB.