RabbitMQ's stop_app operation may hang after removing the Mnesia database due to a network partition event especially when attempting to restart multiple nodes simultaneously.
All RabbitmQ and Erlang versions
This issue is primarily caused by:
1. Force stop nodes:
rabbitmqctl stop_app --timeout 60
2. Clean Mnesia directories on all nodes:
sudo rm -rf /var/lib/rabbitmq/mnesia/*
3 . Restart nodes sequentially with delays between each start:
Start one node at a time, waiting for it to initialize fully before starting the next.
Prevention Strategies References
Net Tick Time (Inter-node Communication Heartbeats)
Clustering and Network Partitions