Backup hangs while starting
Hello and Good Day,
We are currently managing two servers under the same two-server license. These are similarly configured Linux CentOS servers which exhibit no other issues.
One of these systems runs backups without issue:
[root@localhost bin]# cat /etc/redhat-release
CentOS release 5.6 (Final)
[root@localhost bin]# uname -a
Linux localhost.localdomain 2.6.18-371.12.1.el5 #1 SMP Wed Sep 3 16:22:34 EDT 2014 x86_64 x86_64 x86_64 GNU/Linux
[root@localhost bin]#
The other system is hanging at the beginning of the backup process and strace is indicating that it's waiting for something, though I'm not sure what:
[root@localhost ~]# cat /etc/redhat-release
CentOS release 5.5 (Final)
[root@localhost ~]# uname -a
Linux localhost.localdomain 2.6.18-194.26.1.el5 #1 SMP Tue Nov 9 12:54:20 EST 2010 x86_64 x86_64 x86_64 GNU/Linux
[root@localhost ~]#
For example, the end of an strace (in seconds from initiating the command):
The command is run:
SYSTEMNAME=21073;strace /usr/sbin/acrocmd backup disk --volume=1-2 --backuptype=full --loc=/backups/${SYSTEMNAME}/current/root --arc=${SYSTEMNAME}
And after several pages of output, the end is seen:
futex(0x7fffcb7f3194, FUTEX_WAIT_PRIVATE, 1, NULL) = 0
futex(0x7fffcb7f31c0, FUTEX_WAKE_PRIVATE, 1) = 0
read(11, 0x7fffcb7f2f00, 32) = -1 EAGAIN (Resource temporarily unavailable)
poll([{fd=11, events=POLLIN|POLLPRI}, {fd=10, events=POLLOUT}], 2, -1) = 1 ([{fd=10, revents=POLLOUT}])
read(11, 0x7fffcb7f2f00, 32) = -1 EAGAIN (Resource temporarily unavailable)
poll([{fd=11, events=POLLIN|POLLPRI}, {fd=10, events=POLLOUT}], 2, -1) = 1 ([{fd=10, revents=POLLOUT}])
sendmsg(10, {msg_name(0)=NULL, msg_iov(1)=[{"\4\0\4\0\0\0", 6}], msg_controllen=0, msg_flags=0}, MSG_NOSIGNAL) = 6
read(11, 0x7fffcb7f2f00, 32) = -1 EAGAIN (Resource temporarily unavailable)
poll([{fd=11, events=POLLIN|POLLPRI}, {fd=10, events=POLLOUT}], 2, -1) = 1 ([{fd=10, revents=POLLOUT}])
read(11, 0x7fffcb7f2f00, 32) = -1 EAGAIN (Resource temporarily unavailable)
poll([{fd=11, events=POLLIN|POLLPRI}, {fd=10, events=POLLOUT}], 2, -1) = 1 ([{fd=10, revents=POLLOUT}])
sendmsg(10, {msg_name(0)=NULL, msg_iov(1)=[{"\20\0\0\0", 4}], msg_controllen=0, msg_flags=0}, MSG_NOSIGNAL) = 4
futex(0x40dc1eb4, FUTEX_WAKE_OP_PRIVATE, 1, 1, 0x40dc1eb0, {FUTEX_OP_SET, 0, FUTEX_OP_CMP_GT, 1}) = 1
futex(0x7fffcb7f3144, FUTEX_WAIT_PRIVATE, 1, NULL
This system worked without fault when Acronis was tested a few months ago, the other system continues to function without fault.
Please advise what steps we should take to further troubleshoot and resolve the issue.

- Log in to post comments