Hello All,
I intend to replace my old backup scripts (based on rsync, an ssh tunnel and nfs) with a state-of-the-art solution.
The main use case is to back up three computers over the network to an external server. Root access to all devices is available.
Restic looks like an almost perfect match.
However, I am concerned about the robustness of the repository and of the backup process in general.
For testing purposes, I interrupted the backup process and tried to resume it afterwards.
In most of my test cases, resuming the backup fails.
My assumption was that a restic unlock followed by a restic rebuild-index should be enough before restarting the backup.
Apparently this is not the case.
To me, it looks as if the repository is still in an unusable state.
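For reference, the sequence I assumed would be sufficient (unlock, then rebuild-index, then the same backup command again) can be sketched as a small shell script. The repository, password file, exclude file and path are the ones from my setup; the RESTIC and DRY_RUN variables and the function names are hypothetical helpers introduced only for this sketch.

```shell
#!/bin/sh
# Sketch of the recovery sequence described above; not a verified procedure.
# RESTIC and DRY_RUN are hypothetical helper variables for this example.
RESTIC="${RESTIC:-./restic}"
REPO="sftp:backupserver:rm-repo"
PWFILE="rm-repo-pw"
EXCLUDES="rm-repo-exclude"
SOURCE="/mnt/rechenmonster/"

run() {
    # Print the command; execute it only when DRY_RUN is not set to 1.
    echo "+ $*"
    if [ "${DRY_RUN:-0}" != "1" ]; then
        "$@"
    fi
}

recover_and_backup() {
    # Each step runs only if the previous one succeeded.
    run "$RESTIC" --password-file "$PWFILE" -r "$REPO" unlock &&
    run "$RESTIC" --password-file "$PWFILE" -r "$REPO" rebuild-index &&
    run "$RESTIC" --password-file "$PWFILE" --exclude-file "$EXCLUDES" \
        -r "$REPO" --verbose backup "$SOURCE"
}
```

Calling `DRY_RUN=1 recover_and_backup` only prints the three commands; without DRY_RUN they are executed in order and the chain stops at the first failure.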
root@minilin:~# ./restic --password-file rm-repo-pw --exclude-file rm-repo-exclude -r sftp:backupserver:rm-repo --verbose backup /mnt/rechenmonster/
open repository
repository 1452324a opened successfully, password is correct
lock repository
load index files
start scan on [/mnt/rechenmonster/]
start backup on [/mnt/rechenmonster/]
scan finished in 90.723s: 63304 files, 211.953 GiB
uploaded intermediate index 9edcd9f4
subprocess ssh: Write failed: Broken pipe
Save(<lock/e48056334c>) returned error, retrying after 639.631388ms: ssh command exited: exit status 255
Save(<lock/e48056334c>) returned error, retrying after 1.03550331s: ssh command exited: exit status 255
Save(<lock/e48056334c>) returned error, retrying after 1.549049741s: ssh command exited: exit status 255
Save(<lock/e48056334c>) returned error, retrying after 2.510793368s: ssh command exited: exit status 255
Save(<lock/e48056334c>) returned error, retrying after 3.752043085s: ssh command exited: exit status 255
Save(<lock/e48056334c>) returned error, retrying after 2.994066285s: ssh command exited: exit status 255
Save(<lock/e48056334c>) returned error, retrying after 4.514784368s: ssh command exited: exit status 255
Save(<lock/e48056334c>) returned error, retrying after 11.648304725s: ssh command exited: exit status 255
Save(<lock/e48056334c>) returned error, retrying after 14.114089155s: ssh command exited: exit status 255
Save(<lock/e48056334c>) returned error, retrying after 18.06126997s: ssh command exited: exit status 255
unable to refresh lock: ssh command exited: exit status 255
Save(<lock/72a8acf055>) returned error, retrying after 508.546024ms: ssh command exited: exit status 255
Save(<lock/72a8acf055>) returned error, retrying after 814.253356ms: ssh command exited: exit status 255
[30:01] 28.92% 13415 files 61.292 GiB, total 63304 files 211.953 GiB, 0 errors ETA 1:13:52
Save(<lock/72a8acf055>) returned error, retrying after 2.165714921s: ssh command exited: exit status 255
Save(<lock/72a8acf055>) returned error, retrying after 2.541480516s: ssh command exited: exit status 255
Save(<lock/72a8acf055>) returned error, retrying after 5.429713763s: ssh command exited: exit status 255
Save(<lock/72a8acf055>) returned error, retrying after 4.530872255s: ssh command exited: exit status 255
Save(<lock/72a8acf055>) returned error, retrying after 11.165183953s: ssh command exited: exit status 255
Save(<lock/72a8acf055>) returned error, retrying after 12.267683116s: ssh command exited: exit status 255
Save(<lock/72a8acf055>) returned error, retrying after 28.369074974s: ssh command exited: exit status 255
unable to refresh lock: ssh command exited: exit status 255
Save(<data/5ad029a2a4>) returned error, retrying after 343.080735ms: ssh command exited: exit status 255
Save(<data/5ad029a2a4>) returned error, retrying after 813.277506ms: ssh command exited: exit status 255
Save(<data/5ad029a2a4>) returned error, retrying after 1.506813376s: ssh command exited: exit status 255
Save(<data/5ad029a2a4>) returned error, retrying after 1.211843837s: ssh command exited: exit status 255
Save(<data/5ad029a2a4>) returned error, retrying after 1.752838669s: ssh command exited: exit status 255
Save(<data/5ad029a2a4>) returned error, retrying after 2.130523529s: ssh command exited: exit status 255
Save(<data/5ad029a2a4>) returned error, retrying after 3.921231028s: ssh command exited: exit status 255
Save(<data/5ad029a2a4>) returned error, retrying after 9.039136062s: ssh command exited: exit status 255
Save(<data/5ad029a2a4>) returned error, retrying after 17.5063102s: ssh command exited: exit status 255
Save(<data/5ad029a2a4>) returned error, retrying after 12.589100081s: ssh command exited: exit status 255
Remove(<lock/4d7f027202>) returned error, retrying after 251.417301ms: ssh command exited: exit status 255
Remove(<lock/4d7f027202>) returned error, retrying after 412.907106ms: ssh command exited: exit status 255
Remove(<lock/4d7f027202>) returned error, retrying after 764.639135ms: ssh command exited: exit status 255
Remove(<lock/4d7f027202>) returned error, retrying after 915.773596ms: ssh command exited: exit status 255
Remove(<lock/4d7f027202>) returned error, retrying after 3.716604584s: ssh command exited: exit status 255
Remove(<lock/4d7f027202>) returned error, retrying after 3.404711931s: ssh command exited: exit status 255
Remove(<lock/4d7f027202>) returned error, retrying after 4.714168987s: ssh command exited: exit status 255
Remove(<lock/4d7f027202>) returned error, retrying after 6.37924004s: ssh command exited: exit status 255
Remove(<lock/4d7f027202>) returned error, retrying after 9.267381356s: ssh command exited: exit status 255
Remove(<lock/4d7f027202>) returned error, retrying after 22.90579546s: ssh command exited: exit status 255
Fatal: unable to save snapshot: ssh command exited: exit status 255
Remove(<lock/4d7f027202>) returned error, retrying after 398.60313ms: ssh command exited: exit status 255
Remove(<lock/4d7f027202>) returned error, retrying after 1.033939473s: ssh command exited: exit status 255
Remove(<lock/4d7f027202>) returned error, retrying after 1.504770267s: ssh command exited: exit status 255
Remove(<lock/4d7f027202>) returned error, retrying after 1.79081597s: ssh command exited: exit status 255
Remove(<lock/4d7f027202>) returned error, retrying after 2.504306698s: ssh command exited: exit status 255
Remove(<lock/4d7f027202>) returned error, retrying after 5.631618235s: ssh command exited: exit status 255
Remove(<lock/4d7f027202>) returned error, retrying after 7.325304333s: ssh command exited: exit status 255
Remove(<lock/4d7f027202>) returned error, retrying after 11.199762921s: ssh command exited: exit status 255
Remove(<lock/4d7f027202>) returned error, retrying after 19.211147779s: ssh command exited: exit status 255
Remove(<lock/4d7f027202>) returned error, retrying after 26.490950035s: ssh command exited: exit status 255
error in cleanup handler: ssh command exited: exit status 255
root@minilin:~# ./restic --password-file rm-repo-pw --exclude-file rm-repo-exclude -r sftp:backupserver:rm-repo --verbose backup /mnt/rechenmonster/
open repository
repository 1452324a opened successfully, password is correct
lock repository
load index files
start scan on [/mnt/rechenmonster/]
start backup on [/mnt/rechenmonster/]
scan finished in 122.380s: 63304 files, 211.953 GiB
subprocess ssh: Write failed: Broken pipe
Save(<lock/44e973c1fd>) returned error, retrying after 534.843163ms: ssh command exited: exit status 255
Save(<lock/44e973c1fd>) returned error, retrying after 866.13663ms: ssh command exited: exit status 255
Save(<lock/44e973c1fd>) returned error, retrying after 1.140025974s: ssh command exited: exit status 255
Save(<lock/44e973c1fd>) returned error, retrying after 1.666121786s: ssh command exited: exit status 255
Save(<lock/44e973c1fd>) returned error, retrying after 1.64631936s: ssh command exited: exit status 255
Save(<lock/44e973c1fd>) returned error, retrying after 4.380748912s: ssh command exited: exit status 255
Save(<lock/44e973c1fd>) returned error, retrying after 6.573224506s: ssh command exited: exit status 255
[5:17] 6.29% 8221 files 13.327 GiB, total 63304 files 211.953 GiB, 0 errors ETA 1:18:57
Save(<lock/44e973c1fd>) returned error, retrying after 17.045598589s: ssh command exited: exit status 255
Save(<lock/44e973c1fd>) returned error, retrying after 12.830693829s: ssh command exited: exit status 255
unable to refresh lock: ssh command exited: exit status 255
Save(<data/0ace7664f9>) returned error, retrying after 549.495525ms: ssh command exited: exit status 255
Save(<data/0ace7664f9>) returned error, retrying after 419.53713ms: ssh command exited: exit status 255
Save(<data/0ace7664f9>) returned error, retrying after 1.010968958s: ssh command exited: exit status 255
Save(<data/0ace7664f9>) returned error, retrying after 1.153458669s: ssh command exited: exit status 255
Save(<data/0ace7664f9>) returned error, retrying after 2.923233639s: ssh command exited: exit status 255
Save(<data/0ace7664f9>) returned error, retrying after 5.669796535s: ssh command exited: exit status 255
Save(<data/0ace7664f9>) returned error, retrying after 7.477980579s: ssh command exited: exit status 255
Save(<data/0ace7664f9>) returned error, retrying after 6.085261698s: ssh command exited: exit status 255
Save(<data/0ace7664f9>) returned error, retrying after 14.933166759s: ssh command exited: exit status 255
Save(<data/0ace7664f9>) returned error, retrying after 15.308642956s: ssh command exited: exit status 255
Remove(<lock/a5f408e26b>) returned error, retrying after 600.706632ms: ssh command exited: exit status 255
Remove(<lock/a5f408e26b>) returned error, retrying after 1.091228601s: ssh command exited: exit status 255
Remove(<lock/a5f408e26b>) returned error, retrying after 1.492707128s: ssh command exited: exit status 255
Remove(<lock/a5f408e26b>) returned error, retrying after 847.919561ms: ssh command exited: exit status 255
Remove(<lock/a5f408e26b>) returned error, retrying after 2.926447392s: ssh command exited: exit status 255
Remove(<lock/a5f408e26b>) returned error, retrying after 2.436503605s: ssh command exited: exit status 255
Remove(<lock/a5f408e26b>) returned error, retrying after 6.33416481s: ssh command exited: exit status 255
Remove(<lock/a5f408e26b>) returned error, retrying after 4.493453035s: ssh command exited: exit status 255
Remove(<lock/a5f408e26b>) returned error, retrying after 17.82109473s: ssh command exited: exit status 255
Remove(<lock/a5f408e26b>) returned error, retrying after 24.20504431s: ssh command exited: exit status 255
Fatal: unable to save snapshot: ssh command exited: exit status 255
Remove(<lock/a5f408e26b>) returned error, retrying after 638.496395ms: ssh command exited: exit status 255
Remove(<lock/a5f408e26b>) returned error, retrying after 873.597202ms: ssh command exited: exit status 255
Remove(<lock/a5f408e26b>) returned error, retrying after 1.458896684s: ssh command exited: exit status 255
Remove(<lock/a5f408e26b>) returned error, retrying after 1.109790543s: ssh command exited: exit status 255
Remove(<lock/a5f408e26b>) returned error, retrying after 1.780625266s: ssh command exited: exit status 255
Remove(<lock/a5f408e26b>) returned error, retrying after 1.988941899s: ssh command exited: exit status 255
Remove(<lock/a5f408e26b>) returned error, retrying after 3.247499977s: ssh command exited: exit status 255
Remove(<lock/a5f408e26b>) returned error, retrying after 11.74492638s: ssh command exited: exit status 255
Remove(<lock/a5f408e26b>) returned error, retrying after 7.512791006s: ssh command exited: exit status 255
Remove(<lock/a5f408e26b>) returned error, retrying after 25.917067357s: ssh command exited: exit status 255
error in cleanup handler: ssh command exited: exit status 255
Note: during these runs the network connection was stable, and the backup server was up and running the whole time.
Based on your experience / expertise:
Is it guaranteed that existing data in a repository never gets corrupted if the connection is lost during an operation (backup, rebuild-index, prune, etc.)?
What do I have to do to safely continue operations after a connection loss?
Best regards