A RabbitMQ tile upgrade from version 2.4.2 to 2.4.3 hangs after one node is upgraded and remains in progress for many hours. Cancelling the task and re-running the upgrade results in the same stuck state.
During the upgrade, the following errors are observed in the RabbitMQ logs:
[error] Error on AMQP connection XXX - Federation link (upstream: test, policy: testp), user: 'federation', state: opening):
[error] access to vhost 'vhost' refused for user 'federation'
[warning] Federation exchange 'EEE' in vhost 'vhost' did not connect to exchange 'EEE' in vhost 'vhost' on amqps: YYY/vhost. Reason: {error, AAA [warning] <BBB> not_allowed}
(Where XXX, YYY, AAA, and BBB represent redacted values.)
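To check whether a collected log shows the same symptom, the error lines above can be matched with a small script. The following is a minimal sketch, not an official diagnostic tool: the function name `find_federation_failures` and both regular expressions are illustrative assumptions derived only from the excerpts in this article, and the exact log wording may differ between RabbitMQ versions.

```python
import re

# Patterns derived from the log excerpts in this article (assumption:
# other RabbitMQ versions may word these messages differently).
VHOST_REFUSED = re.compile(r"access to vhost '([^']+)' refused for user '([^']+)'")
FEDERATION_LINK = re.compile(r"Federation link \(upstream: ([^,]+), policy: ([^)]+)\)")


def find_federation_failures(log_text):
    """Return the refused (vhost, user) pairs and the (upstream, policy)
    pairs of federation links mentioned in the given log text."""
    refused = sorted(set(VHOST_REFUSED.findall(log_text)))
    links = sorted(set(FEDERATION_LINK.findall(log_text)))
    return {"refused": refused, "links": links}


# Sample input copied from the redacted log lines shown above.
sample = """\
[error] Error on AMQP connection XXX - Federation link (upstream: test, policy: testp), user: 'federation', state: opening):
[error] access to vhost 'vhost' refused for user 'federation'
"""

result = find_federation_failures(sample)
print(result)
```

A non-empty `refused` list together with a non-empty `links` list suggests that federation links were active and being rejected while the node was upgrading, which matches the symptom described here.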
Active RabbitMQ federation links during the tile upgrade caused the upgrade task to hang after one node was upgraded.
This is a product issue triggered when federation remains enabled during the node upgrade process, and a fix is planned for a future Tanzu RabbitMQ Tile release. Subscribe to this KB article to receive updates when a fix and target version are available.
Workaround
Use one of the following workarounds during the upgrade.
Note: Disabling federation may temporarily impact cross-cluster or cross-vhost message flow, so plan a maintenance window accordingly.