VMware NSX for vSphere 6.x 控制器从群集中排除且显示以下消息:Zookeeper 客户端已断开连接 (Zookeeper client disconnected)
search cancel

VMware NSX for vSphere 6.x 控制器从群集中排除且显示以下消息:Zookeeper 客户端已断开连接 (Zookeeper client disconnected)

book

Article ID: 307660

calendar_today

Updated On:

Products

VMware NSX

Issue/Introduction

Symptoms:
免责声明:本文为 VMware NSX for vSphere 6.x controller is excluded from the cluster with the message: Zookeeper client disconnected (2135291) 的翻译版本。尽管我们会不断努力为本文提供最佳翻译版本,但本地化的内容可能会过时。有关最新内容,请参见英文版本。


 

  • 部署的 VMware NSX for vSphere 6.x 控制器从控制器群集断开。
  • 适用于正在运行的 NSX Controller 的 TCP 侦听器不再显示在 show network connections of-type tcp 命令的输出中
  • NSX Controller 日志包含类似于以下内容的条目:

    D0525 13:46:07.185200 31975 rpc-broker.cc:369] Registering address resolution for: 20.5.1.11:7777
    D0525 13:46:07.185246 31975 rpc-tcp.cc:548] Handshake complete, both peers support the same protocol.
    D0525 13:46:07.197654 31975 rpc-tcp.cc:1048] Rejecting a connection from peer 10.5.1.11:42195/ef447643-xxxx-xxxx-xxxx-35630df39060, cluster 9f7ea8ff-xxxx-xxxx-xxxx-628e834aa8a5, which doesn't match our cluster (00000000-0000-0000-0000-000000000000).
    D0525 13:46:07.222869 31975 rpc-tcp.cc:1048] Rejecting a connection from peer 100.5.1.11:42195/ef447643-xxxx-xxxx-xxxx-35630df39060, cluster 9f7ea8ff-xxxx-xxxx-xxxx-628e834aa8a5, which doesn't match our cluster (00000000-0000-0000-0000-000000000000)

    有关详细信息,请参见 Collecting diagnostic information for VMware NSX for vSphere 6.x (2074678)
  • 断开的控制器尝试使用全部为零的无效 UUID 加入群集。
  • show control-cluster history 命令会显示类似于以下内容的消息:

    INFO.20150530-000550.1774:D0530 13:25:29.452639 1983 zookeeper_client.cc:774] Zookeeper client disconnected!
  • 在 NSX Controller 控制台上运行 show log cloudnet/cloudnet_java-zookeeper*.log 命令会包含类似于以下内容的条目:

    cloudnet_java-zookeeper.20150530-000550.1806.log-2015-05-30 13:25:07,382 47956539 [SyncThread:1] WARN org.apache.zookeeper.server.persistence.FileTxnLog - fsync-ing the write ahead log in SyncThread:1 took 3219ms which will adversely effect operation latency. See the ZooKeeper troubleshooting guide.

    注意:上述日志摘录仅为示例。日期、时间和环境变量可能会因环境而有所不同。

Environment

VMware NSX for vSphere 6.2.x
VMware NSX for vSphere 6.1.x

Cause

由于磁盘性能较慢对 NSX Controller 群集造成了不利影响,会导致此问题。控制器 zookeeper 进程会在单线程中处理所有 I/O 事件。如果文件写入操作在消耗资源,则控制器保持活动消息可能会停止。

Resolution

VMware 建议在低延迟磁盘上部署 NSX for vSphere 控制器。有关详细信息,请参见 Troubleshooting storage issues when using VMware products (2013160)
 
要进一步减少导致控制器超时的磁盘问题,请确保 /var 磁盘未共享或虚拟化,并且磁盘未满或几乎已满。此外,确保 NTP 在所有控制器上正确配置,并且时间戳在需要日志分析时能够保持同步。

Additional Information