Continuous backup

rawtaz · July 25, 2018, 2:13pm

Deduplication will still work, it’s done on a block level.

You can use restic find to look for files you want to restore.

But yes, it’s a bit of an issue that you don’t see your entire filesystem/tree in your snapshots, it’s much messier to restore stuff with this approach.

I can’t help but think that if it takes 50 minutes for a full scan, there’s something very slow with your filesystem/disks?

askielboe · July 26, 2018, 9:13am

Ahh good point about the deduplication. Still think this feels too much like a hack to really use in practise. But the idea is nice.

Not sure why my backups are taking so long. Using a MacBook Pro from earlier this year with a 500 GB SSD. I guess I just I need to work some more on my exclude files.

For now I’ve changed by schedule to backup my working dir once per hour (6 min) and then the whole home folder two times per day (1 hour). I find similar times on other computers that I back up (all MacOS).

I like the idea of continuous backup, but not sure if it really fits the philosophy of restic at the moment (but I might be wrong).

Edit: Think I might have found the reason for my slow backups: Restic runs lstat on files excluded by extension - intended behaviour?

whereisaaron · July 26, 2018, 6:59pm

I can’t help but think that if it takes 50 minutes for a full scan, there’s something very slow with your filesystem/disks?

Per-file speed will vary, from your local 400,000 IOPS SSD to your 200 IOPS hard disk to your 200 IOPS high-latency network file system. But the problem is scan time is linear with with number of files, so there will always be a number of files that will make restic slow. If you have many millions of files, there is not something wrong with your filesystem that restic is ‘slow’

But yes, it’s a bit of an issue that you don’t see your entire filesystem/tree in your snapshots, it’s much messier to restore stuff with this approach.

Very good point @askielboe @rawtaz! That’s a huge fly in my continuous-wrapper ointment

rawtaz · July 26, 2018, 8:24pm

I see your point. I’m not sure what would be classified as “normal” scan time for X number of files and Y number of directories on an SSD.

On one of my systems, a MacBook Pro (Early 2015) the scan looks like this:

scanned 109233 directories, 631102 files in 0:41

That is not even one million files, indeed, but if it’s linear as you say and I were to have five million files, the scan would still take just about five minutes.

With that math on the same system I’d need to have around 50 million files for the scan to take 50 minutes.

@askielboe Would you mind showing the “scanned …” line of your output when the scan takes 50 minutes?

All this said, there’s of course lots of other variables involved in the scanning speed. I’m just surprised to hear 50 minutes scan time on a laptop with a fast SSD

askielboe · July 26, 2018, 10:07pm

So here is one where it scanned about a million files in 54 minutes (the scan time varies a bit with the load):

scan finished in 3243.814s: 1055943 files, 139.734 GiB

I’ve since then added a bunch more to my exclude file which cuts the scan time down quite a bit. I think the main culprit is a lot of protobuf files which I ignore(d) using *.pb. This yields scan times of ~ 30 min (including the other excludes I’ve added):

scan finished in 1789.684s: 756149 files, 93.377 GiB

If I instead exclude the path that contains the .pb-files I get a scan time of around 5 minutes:

scan finished in 286.896s: 757815 files, 93.381 GiB

So avoiding filename wildcard excludes (and extending my exclude file in general) seems to have fixed the issue for now.

askielboe · July 26, 2018, 10:10pm

Since this is a bit off topic, and because I already created a new forum post, I thought I’d just post the reply over here instead: Restic runs lstat on files excluded by extension - intended behaviour? - #2 by askielboe

whereisaaron · July 27, 2018, 3:41am

It could be an SSD stuck the other side of SATA controller? Which would massively impact the speed. 50 minutes for a local NVMe SSD would sound slow, as normally you could read every byte of a whole 1TB NVMe SSD in a fraction of that time

tomwaldnz · October 1, 2018, 7:43am

I also find Restic very slow for incremental backups. It does a lot of disk reading given my file system is close to static. Reading every file to see if a hash has changed is really inefficient - thorough but inefficient. Looking at the file modified date and only checking files changed since the last backup could reduce backup time by orders of magnitude for rarely changing sets of files - which is probably most backups.

On EC2 servers you have a burst balance for disk use, and running a backup that reads every file in your backup set could easily use up most or all of your disk credit. That would leave your production workloads running slowly. It’s the “ebs burst balance”.

Even on a dedicated server or home PC this is inefficient and slows down regular computer use.

I’d like to change from Borg to Restic because of a few problems with Borg, but I don’t really want to have my server or PC having to read GB or TB of data daily when I’ve usually changed about 10 files totaling about 20MB.

I really like Restic, and hope that one day I can use it as my primary backup program. For now I think I’ll continue to use it for weekly or monthly backups, but I don’t think it’s suitable for daily or more frequent backups.

whereisaaron · October 1, 2018, 12:43pm

Hi @tomwaldnz, restic already only checks the modified date (and not even the size) for repeat backups of the same file. But it does all the file checks linearly, one file at a time. So most of the backup time is just wasted/idle time waiting for file stat calls to return. Hence is it also much slower on high latency filesystems.

https://restic.readthedocs.io/en/latest/040_backup.html
“When you backup the same directory again (maybe with new or changed files) restic will find the old snapshot in the repo and by default only reads those files that are new or have been modified since the last snapshot. This is decided based on the modify date of the file in the file system.”

The scan is done as a linear task, and the linear file stat process starts in parallel. But each task is based on a linear algorithm right now, that scales linearly. It gets twice as slow if you latency it twice as long, and twice as slow if you have twice as many files.

The slowness is because restic usually can’t utilize all the available filesystem bandwidth and/or network bandwidth. The new restore will make a that not true for restores, which will be better than linear in the next release.

tomwaldnz · October 2, 2018, 8:21am

Thanks @whereisaaron, based on your information I’ve done some research and worked out why I was getting excessive disk access. It turns out my virus scanner, Avira, was virus scanning every file that Restic wanted to back up even if it was unchanged. I guess that’s either a bug or a feature. Once I disabled that it was quite fast.

restic/restic/blob/0882aca3a87e02a3c0bdac86c3a23a6e392dda9b/internal/archiver/archiver.go#L437-L466


      
          // fileChanged returns true if the file's content has changed since the node
          // was created.
          func fileChanged(fi os.FileInfo, node *restic.Node) bool {
          	if node == nil {
          		return true
          	}
          
          	// check type change
          	if node.Type != "file" {
          		return true
          	}
          
          	// check modification timestamp
          	if !fi.ModTime().Equal(node.ModTime) {
          		return true
          	}
          
          	// check size
          	extFI := fs.ExtendedStat(fi)
          	if uint64(fi.Size()) != node.Size || uint64(extFI.Size) != node.Size {

This file has been truncated. show original

You can see it checks:

The file type (was it a file before, and is now a symlink?)
The modification time
The file size
The file inode (which makes restic re-read files on fuse-based file systems like sshfs)

I’ve updated the documentation: Backing up — restic 0.16.3 documentation

tomwaldnz · October 2, 2018, 4:50pm

Absolutely. My initial test suggested that excluding the restic process from scanning didn’t work, but I had another go and it took. Either I did it wrong or it needed a reboot to take effect.