Copycluster failing

Article ID: 271955

Updated On:

Products

Network Observability, CA Performance Management

Issue/Introduction

We are planning to migrate a single-node Data Repository (Vertica) database from one server to another. We followed the documentation below, installed the Data Repository on the new server, and set up passwordless SSH between the source and destination hosts.

Migrate the Data Repository

However, running the copycluster script fails with the following error:

2023-08-07 22:32:52 localhost vbr Remote procedure put_objects on xxxxxxxx failed: Error accessing remote storage: failed accessing remote storage on xxxxxxxx: rsync: safe_read failed to read 1 bytes [sender]: Connection timed out (110)
  rsync error: error in rsync protocol data stream (code 12) at io.c(276) [sender=3.1.3]
  : returncode=12
2023-08-07 22:32:52 localhost vsql /opt/vertica/bin/vsql -q -t -X -ddrdata -p5433 -Udradmin -hxxxxxxxx
2023-08-07 22:32:52 localhost vsql select log_vbr_invocations('Copy Cluster Task', '/tmp/vbr/vbr_2023-08-07-213010_BFGJR5P0.log', '7JWH8IH4KCULVXT0MEGI7P1TAPSG6Y28', 'Fail');
2023-08-07 22:32:52 localhost vsql Read from vsql: Logged vbr invocations
2023-08-07 22:32:52 localhost vbr Error (possibly expected) closing backup location rsync://[#.#.#.#]:50000/: [Errno 111] Connection refused
2023-08-07 22:32:52 10.197.229.235 vbr File unlocked: 7
2023-08-07 22:32:52 #.#.#.# vbr File unlocked: 7
2023-08-07 22:32:52 localhost vbr Traceback (most recent call last):
    File "/opt/vertica/bin/vbr.py", line 11242, in main
      if vbr_task.run():
    File "/opt/vertica/bin/vbr.py", line 5821, in run
      result = self._run()
    File "/opt/vertica/bin/vbr.py", line 9835, in _run
      Parallel.foreach(replicate_node, self._participating_nodes)
    File "/opt/vertica/bin/vbr.py", line 10921, in foreach
      cls.map(func, iterable, threads_num=threads_num)
    File "/opt/vertica/bin/vbr.py", line 10915, in map
      raise exc_info[0](err_msg).with_traceback(exc_info[2])
    File "/opt/vertica/bin/vbr.py", line 10899, in map
      if not thr.join(Parallel.WAIT_QUANTUM):
    File "/opt/vertica/bin/vbr.py", line 10875, in join
      return self._thr.join(*args, **kwargs)
    File "/opt/vertica/bin/vbr.py", line 10850, in join
      raise self._exc_info[0](err_msg).with_traceback(self._exc_info[2])
    File "/opt/vertica/bin/vbr.py", line 10828, in run
      super(Parallel.CancellableThread, self).run()
    File "/opt/vertica/oss/python3/lib/python3.7/threading.py", line 870, in run
      self._target(*self._args, **self._kwargs)
    File "/opt/vertica/bin/vbr.py", line 10863, in do_run
      self._result = func(val)
    File "/opt/vertica/bin/vbr.py", line 9824, in replicate_node
      remote_copier.put_object, [obj.storage_id, obj.obj_type, obj.loc_id, obj.length, obj.loc_id])
    File "/opt/vertica/bin/vbr.py", line 5596, in _put_object
      func(*obj_metadata)
    File "/opt/vertica/bin/vbr.py", line 5353, in put_object
      self._copy_obj(storage_id, obj_type, loc_id, length, dest_loc_id)
    File "/opt/vertica/bin/vbr.py", line 5276, in _copy_obj
      self._run_batch_if_ready(key, batch, idx)
    File "/opt/vertica/bin/vbr.py", line 5290, in _run_batch_if_ready
      self._run_work_thread(key, batch)
    File "/opt/vertica/bin/vbr.py", line 5247, in _run_work_thread
      thr.join()
    File "/opt/vertica/bin/vbr.py", line 10850, in join
      raise self._exc_info[0](err_msg).with_traceback(self._exc_info[2])
    File "/opt/vertica/bin/vbr.py", line 10828, in run
      super(Parallel.CancellableThread, self).run()
    File "/opt/vertica/oss/python3/lib/python3.7/threading.py", line 870, in run
      self._target(*self._args, **self._kwargs)
    File "/opt/vertica/bin/vbr.py", line 5331, in _copy_objs
      storage_location, remote_storage_location, obj_info, worker_ix)
    File "/opt/vertica/bin/vbr.py", line 5367, in _remote_copy_objs
      storage_location, remote_storage_location, obj_info)
    File "/opt/vertica/bin/vbr.py", line 1233, in invoke
      raise VbrError("On host %s: %s"%(hostname, error_msg))
  VbrError: On host 10.197.229.235: Error accessing remote storage: failed accessing remote storage on #.#.#.#: rsync: safe_read failed to read 1 bytes [sender]: Connection timed out (110)
  rsync error: error in rsync protocol data stream (code 12) at io.c(276) [sender=3.1.3]
  : returncode=12

Environment

DX NetOps Performance Management, any version

Cause

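The rsync port used by copycluster (50000 in the log above) is blocked between the source and destination hosts, so the rsync transfer fails with "Connection timed out (110)".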
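
One way to confirm a blocked port is to probe it from the source host. The following is a minimal Python sketch, not part of the product; the function name probe_port and the host name in the usage comment are hypothetical. A "timeout" result is consistent with a firewall silently dropping traffic. A "refused" result means the host answered but nothing was listening on that port at the time of the test, which can also happen when no rsync daemon is running, so treat it as a hint rather than proof.

```python
import socket


def probe_port(host: str, port: int, timeout: float = 5.0) -> str:
    """Try a TCP connection and classify the outcome.

    Returns "open", "refused", "timeout", or "error".
    """
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return "open"
    except ConnectionRefusedError:
        # The host replied, but no process is listening on the port.
        return "refused"
    except (socket.timeout, TimeoutError):
        # No reply at all, which often indicates a filtered port.
        return "timeout"
    except OSError:
        # Name resolution failure, unreachable network, and so on.
        return "error"


# Usage (hypothetical destination host, rsync port from the log above):
# print(probe_port("dest-host.example.com", 50000))
```

Run the probe from the source Data Repository host toward the destination host, since that is the direction of the failing rsync transfer.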

Resolution

Either unblock port 50000 between the source and destination hosts, or edit the copycluster.ini file to use another port that is known to be open by changing the following setting:

port_rsync = 50000
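
If the port is changed by script rather than by hand, the edit can be sketched as below. This is a hedged illustration only: the helper name set_rsync_port is hypothetical, and it assumes the file contains a line of the form port_rsync = <number>. It rewrites only that line and leaves every other line, including comments, untouched.

```python
import re

# Matches a whole "port_rsync = <digits>" line, keeping the key and spacing.
_PORT_LINE = re.compile(r"^([ \t]*port_rsync[ \t]*=[ \t]*)\d+[ \t]*$", re.MULTILINE)


def set_rsync_port(ini_text: str, new_port: int) -> str:
    """Return ini_text with the port_rsync value replaced by new_port."""
    if not 1 <= new_port <= 65535:
        raise ValueError(f"invalid TCP port: {new_port}")
    updated, count = _PORT_LINE.subn(
        lambda m: f"{m.group(1)}{new_port}", ini_text
    )
    if count == 0:
        raise ValueError("no port_rsync setting found")
    return updated


# Usage (the path is an assumption; use the location of your own file):
# with open("copycluster.ini") as fh:
#     text = fh.read()
# with open("copycluster.ini", "w") as fh:
#     fh.write(set_rsync_port(text, 50001))
```

Back up the original file before overwriting it, and confirm that the new port is open between both hosts before rerunning copycluster.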