Prev: GRE Keys
Next: Spanning-tree Protocol implementation
From: David Brown on 28 Oct 2009 05:56 terryc wrote: > On Mon, 26 Oct 2009 20:52:26 +0100, David Brown wrote: > >> Gabriel Knight wrote: >>> Hi all I need a free program to backup a ubuntu server for my school >>> class, it has to be as good or better than Rdiff and Rsync the server >>> will use SSH, MYSql and be a file and web server and do a couple of >>> other things. I need it to be either a gui or text box program. >>> >>> >> As others have said, the obvious choice here would be ... rsync. > > *sync isn't a abckup system. It is just a copy system. A backup system means having copies of your data along with a way to restore it. You are right that rsync on its own is not a backup system, but it can form the backbone of a backup system. If you use it appropriately (either with your own scripts or command line arguments, or using a ready-made tool such as rsnapshot or dirvish), you then have a backup system.
From: terryc on 28 Oct 2009 06:57 On Tue, 27 Oct 2009 21:48:21 -0700, Keith Keller wrote: > On 2009-10-28, terryc <newsninespam-spam(a)woa.com.au> wrote: >> On Wed, 28 Oct 2009 14:57:55 +1100, Grant wrote: >> >>>>*sync isn't a abckup system. It is just a copy system. >>> >>> And a backup is not a copy? >> >> A backup system has multiple copies, with historical content. > > How do you come to that definition? A backup system has as many copies > as desired/needed by business practice. If one copy without history is > sufficient, then that's a good backup. The basic reason is that problems can happen with "the copy". No matter what your system, one copy is no copy at certain times. As others have pointed out, it is the system, not the tools (tar, cpio, rsync, etc). As you say, it is about what he really needs. My 2c is that depending on amount of data and frequency of changing, he could probably just dump his databases and just burn a DVD each night. $AUS50 for a dvd burner and AUS50c/DVD. Even cheap DVD should last a few years before they are toasters.
From: David Brown on 28 Oct 2009 07:34 terryc wrote: > On Tue, 27 Oct 2009 21:48:21 -0700, Keith Keller wrote: > >> On 2009-10-28, terryc <newsninespam-spam(a)woa.com.au> wrote: >>> On Wed, 28 Oct 2009 14:57:55 +1100, Grant wrote: >>> >>>>> *sync isn't a abckup system. It is just a copy system. >>>> And a backup is not a copy? >>> A backup system has multiple copies, with historical content. >> How do you come to that definition? A backup system has as many copies >> as desired/needed by business practice. If one copy without history is >> sufficient, then that's a good backup. > > The basic reason is that problems can happen with "the copy". No matter > what your system, one copy is no copy at certain times. > For an example, consider what happened to a very large software company recently when trying to do a hardware upgrade of the storage system. There was only one copy of the data (an old copy at that). Before doing the hardware upgrade, they wanted to make a new copy - but there was not enough space on the backup system. So they deleted their only copy, and started making a new copy. Since it was going to take days to make the new copy, and they couldn't be bothered waiting, they cancelled the copy and did the hardware upgrade anyway. When that failed, everything was gone. As you say, having a single copy is not a backup system by itself. And even multiple copies on multiple devices at multiple locations is not a backup system - you need a tried and tested recovery procedure to make it a backup system. > As others have pointed out, it is the system, not the tools (tar, cpio, > rsync, etc). > However, the tools form the central part of the backup system, and determine the main features you have available. Thus it does make sense to talk about an rsync backup system. > As you say, it is about what he really needs. > > My 2c is that depending on amount of data and frequency of changing, he > could probably just dump his databases and just burn a DVD each night. > $AUS50 for a dvd burner and AUS50c/DVD. Even cheap DVD should last a few > years before they are toasters.
From: terryc on 28 Oct 2009 09:18 On Wed, 28 Oct 2009 12:34:58 +0100, David Brown wrote: > Since it was going to take days to make the > new copy, and they couldn't be bothered waiting, they cancelled the copy > and did the hardware upgrade anyway. When that failed, everything was > gone. I actually worked, briefly, for a company that would not buy new backup tapes. After weeks of requesting new tapes, I eventually worked into the EDP mangers office and dumped a backup report that took up half a box of paper, normally 50 pages, and said "It is my belief that if we needed to recover from a backup, the company could not do it". Then I held up a sample of the 2nd hand tapes they were so proud of obtaining and left tape coating over the managers desk. {:-). > > As you say, having a single copy is not a backup system by itself. > > And even multiple copies on multiple devices at multiple locations is > not a backup system - you need a tried and tested recovery procedure to > make it a backup system. First thing you check when you become responsible for the company backups. I've also come across a couple of those in my time. This isn't too bad if you have a good GFS system and the paper trail (or transaction logs) still exists. > >> As others have pointed out, it is the system, not the tools (tar, cpio, >> rsync, etc). >> >> > However, the tools form the central part of the backup system, and > determine the main features you have available. Thus it does make sense > to talk about an rsync backup system. Between scripts running tar, cpio and diff, there hasn't been much else in many "branded" backup systems I've used. The part that makes them easy to use is the database, query and file selection facilities. The only other trick you need to know is that many put multiple jobs onto one tape so you need to use /dev/nst0 (no rewind) and move pass the end of tape(?) marker to be able to manually recover the contents of some of these tapes.
From: David Brown on 28 Oct 2009 09:53
terryc wrote: > On Wed, 28 Oct 2009 12:34:58 +0100, David Brown wrote: > >> Since it was going to take days to make the >> new copy, and they couldn't be bothered waiting, they cancelled the copy >> and did the hardware upgrade anyway. When that failed, everything was >> gone. > In case anyone didn't spot the reference: <http://www.hiptop3.com/archives/sidekick-backup-problems-blamed-on-management> > I actually worked, briefly, for a company that would not buy new backup > tapes. After weeks of requesting new tapes, I eventually worked into the > EDP mangers office and dumped a backup report that took up half a box of > paper, normally 50 pages, and said "It is my belief that if we needed to > recover from a backup, the company could not do it". Then I held up a > sample of the 2nd hand tapes they were so proud of obtaining and left > tape coating over the managers desk. {:-). I've heard of that kind of thing many times - people often make the mistake of equating "tape" with "good backup system", regardless of the procedures used for the backup. And since backup tapes cost money, it's easy to think that you can save money by buying fewer tapes without thinking through the consequences. >> As you say, having a single copy is not a backup system by itself. >> >> And even multiple copies on multiple devices at multiple locations is >> not a backup system - you need a tried and tested recovery procedure to >> make it a backup system. > > First thing you check when you become responsible for the company > backups. I've also come across a couple of those in my time. This isn't > too bad if you have a good GFS system and the paper trail (or transaction > logs) still exists. > You don't necessarily need something like a GFS. It's often okay for a recovery procedure to be somewhat slow and inconvenient, as long as it works reliably, and within the required timeframe. But while /you/ may check the recovery procedure, a great many people do not - that's why I emphasis it so much each time the topic comes up. Again, it's often a fault with tape-based backups. People think they've got a great system with incremental and differential backups, but when disaster strikes they have huge troubles. Maybe they find that they need to feed in a week's worth of differential backup tapes just to restore a single file. Or they find that when the hardware failure / fire / breakin put their server and backup machine out of commission, they can't read the tapes on a new machine. Or perhaps their backup software won't let them restore part of their data - so they can't restore part of yesterday's backup without destroying today's good data. There are many ways for restores to fail even when the data is technically safe, but people often don't think through and test their recovery procedures. >>> As others have pointed out, it is the system, not the tools (tar, cpio, >>> rsync, etc). >>> >>> >> However, the tools form the central part of the backup system, and >> determine the main features you have available. Thus it does make sense >> to talk about an rsync backup system. > > Between scripts running tar, cpio and diff, there hasn't been much else > in many "branded" backup systems I've used. The part that makes them easy > to use is the database, query and file selection facilities. > I find rsync particularly useful for doing offsite backups over the Internet, since it minimises the traffic. > The only other trick you need to know is that many put multiple jobs onto > one tape so you need to use /dev/nst0 (no rewind) and move pass the end > of tape(?) marker to be able to manually recover the contents of some of > these tapes. My experience with tape-based backups is that it involved too much manual work (changing tapes), recovery was slow and awkward, it was very difficult to be sure of the reliability of the system, and we had no practical (or cheap) way to test recovery via another machine - if the server and the backup machine with the tape drive died, we would have great difficulty getting the data back. For many years now, I've used rsync to two separate machines in two locations, with hardlinked copies to give snapshots without taking more space than necessary. Of course, the type of backup strategy you need depends entirely on the situation and the quantity of data. |