Arkeia, a commercial and professional network backup solution

ArticleCategory: [Choose a category for your article]

Applications

AuthorImage:[Here we need a little image form you]

[Photo of the Author]

TranslationInfo:[Author and translation history]

original in en Georges Tarbouriech

AboutTheAuthor:[A small biography about the author]

Georges is a long time Unix user (commercial and free Unix systems). He uses Arkeia to backup a very heterogeneous network.

Abstract:[Here you write a little summary]

Arkeia 4.2.3-1 is the latest version of a commercial and professional network backup solution from Knox Software http://www.arkeia.com. There is also a personal and free version available together with some Linux distributions.
You probably think, commercial software has nothing to do within LinuxFocus.
Well, this commercial product is aimed at professional people and Linux is getting more and more involved in the professional field.

ArticleIllustration:[This is the title picture for your article]

[Illustration]

ArticleBody:[The article body]

Introduction

Network administration is a job of a thousand tasks . One of these tasks is the backup of data. The amount of data in a network seems to increase every day, especially now that we have big hard disks, big applications and many more users.
The backup strategy had to evolve towards a more sophisticated strategy compared to what it was ten years ago.
Back then it was possible to use a tape drive attached to a server, using Unix commands such as dump, tar, cpio or find (-cpio is an option for find) to backup an application server.
Of course, it's still a solution, as long  as the number of servers, clients, applications haven't been growing exponentially.
If your network has grown exponentially you'll have to look for a better solution than the previous one.
When we say better, we mean faster!
With "faster" in mind you will try to find the right way to do backups.
When your network contains many different OSes, it becomes rather tricky to find the "best" solution. As usual, the "best" solution should be cheap, efficient and easy to implement! Seriously looking at it you will see that it's always a compromise, and the best for you is not the best for others.
So, the first thing to take into account concerns your needs: software, hardware.
There are a lot of products available, but most of them run proprietary software... and they are quite expensive (this is is an understatement).
Let's not forget the hardware part: you need at least, a number of tape drives or robots.
Last, but not least, you must know which machine is supposed to become a backup server.
This server must have a big hard disk, a lot of resources (memory, cpu) and SCSI ports.
It must be able to manage various backups at once.
Where to go from here?

Presentation

The choice of Arkeia
Many big companies "offer" network backup solutions as we mentioned earlier.
Arkeia was first selected because of its support of many OSes.
Backup servers can be: AIX 4.1, DEC Alpha Unix 4.0, HP-UX 10, IRIX 6.2 and up, Linux 2.* (x86), Solaris 2.5 and up and NT 4.0 Server Intel.
Backup clients can be: the same as the servers plus a lot more, such as SCO v5 (Intel), BSD 3.0, 4.0, Novell 4.11, FreeBSD 2.2.6, Windows 95, 98, NT 4.0 Server (Alpha) and NT 4.0 Workstation (Intel).
This list is not exhaustive and new OSes are added all the time. Go to http://www.arkeia.com for more information.
Most of the servers have a graphical interface, based on a proprietary Xlib for Unices and based on Java for Microsoft products.
The second reason for our choice was the price. We won't to quote any price here as it depends on your network, the number and type of servers and clients, if you use robots or not (in this case, you need to buy a license for the library managing the robot).
Just go to the Arkeia web site. It allows you to get a quotation from them tailored to your network configuration.
The last reason was the tape drive management. Arkeia is able to manage almost every type of tape drive: DAT, EXABYTE, QIC...
As we already owned a lot of QIC tape drives, it was a solved problem.

What does it look like?

adm.jpg
The main window

As we have seen before, Arkeia uses a GUI on the server side.
You need to log into Arkeia through a login window from where you have access to every task specific window backup management. These windows are opened from a tool-bar menu, contextual menu or from icons.
All this can be configured according to your taste (predefined backgrounds or color of your choice).
There is not much more to say as everything is obvious and user-friendly.
On Unix the GUI (graphical user interface) will work with most window managers.
On Microsoft things, you'll have to install THE Microsoft JVM (Java Virtual Machine) before using the GUI.
The GUI is quite practical whether you like its design or not.

How does it work?

Arkeia uses a parallel backup technology through a TCP/IP network.
A backup server manages the tape drives and gets the data from the clients using multi-flow.
It operates according to client/server standards using shared memory and message queues. IPC (Inter Process Communication) has to be perfectly configured on your backup server.
The user manual will teach you how to tune your backup server.
The configuration is quite logical: you need to install client software together with server software on the backup server and client software on the clients. So far, so good!
On the server side you'll have to define everything concerning tape drives, drivepacks, tape pools, tapes and savepacks.
- Tape drives: the first step of the configuring process.
Select "Drives management" from the "Devices" menu. You have to define a name for the drive, its authorizations (Read, write, clean, delete) and of course its device. This will depend on your system: for example /dev/rmt/1h on Solaris or /dev/rmt/tps1d2 on Irix or /dev/st0 on Linux.
You must define each drive and the corresponding device.
Let's say you have four QIC tape drives attached to your SGI O2 server (Irix 6.5):
The first one could be called QICone with /dev/rmt/tps1d2 as the device, the second one QICtwo with /dev/rmt/tps1d3 as the device, and so on.
The figure "1" after "tps" is the SCSI controller number and the figure "2" after "d" is the device ID.
You have to know how SCSI works. IDs, daisy chains etc...
If you own a robot, the configuring process is almost identical and the documentation tells you everything you need.
- Drivepacks:
From the same menu item, select "Drivepacks" and create a drivepack corresponding to the tape drive.
In line with the previous example, I assume you have a QICone tape drive. You could call your drivepack QICone Pack. Do the same for each defined tape drive.
Now, each drivepack must match its tape drive. Selecting the tape drive and the corresponding drivepack will do the trick.
- Tape pool:
This step allows you to create as many pools as you need. From the "Tapes management" menu, select "Pools management". Click in the "New" icon and fill in the fields to create a pool.
- Tapes:
Each pool must contain one or more tapes. The "Tapes in pool" window can opened by double-clicking on the pool. If you select "New" you open the "Create tape" window. Just fill the fields.
By the way, there is a very useful online help, one per window telling you what to do.
- Savepacks:
Last but not least, you can now create one or more savepacks. Savepacks contain the tree you want to backup for a specific client. You can browse through the tree with a navigator to select directories or files.
Repeat the operation for each client and for each tree to backup.
Now, you are ready for an interactive backup of various clients at the same time.

A real example

We have a funny network with four servers to backup.
The backup server is an SGI O2, and the clients are a SUN applications server running Solaris 2.6, a Linux communication server running RH6.0, two NT4.0 applications servers.
Let's make an interactive backup of the four machines.
We create the four tape drives: the names will be QIC Sun, QIC Linux, QIC Pcsvr and QIC Pcdev.
Original, isn't it?
Now let's define the four devices: QIC Sun on /dev/rmt/tps1d4, QIC Linux on /dev/rmt/tps1d5, QIC Pcsvr on /dev/rmt/tps1d3 and QIC Pcdev on /dev/rmt/tps1d2.

We give them all the rights to set it up as we like: we are the administrator and we are working as root.
Awful, isn't it?
Next, we create the corresponding drivepacks: Sun Pack, Linux Pack, Pcsvr Pack and Pcdev Pack, again with a lot of imagination.
Now, the tapes pool: as usual, Sun Pool, Linux Pool, Pcsvr Pool and Pcdev Pool.
We create a pool for each pack. That's a personal choice. You could create only one pool with all four tapes.
If your pool contains many tapes, Arkeia manages the drives. That means, if you don't set a priority for each drive, Arkeia decides what to backup on which drive.
This is probably the feature I dislike the most.
Last, let's define a tape within each pool: and again, Sun Tape, Linux Tape, Pcsvr Tape and Pcdev tape.
Don't worry, this only has to be done once. You'll be able to use all that stuff every time you'll need it.
What we want is a full backup for each server.
Let's define the savepacks: Sun, Linux, Pcsvr and Pcdev will be the names.
Of course, Arkeia is properly installed on the clients so you can see every machine in the browser. Just select them by clicking in the corresponding check box.
Let's launch the interactive backup. Select the "Interactive backup" menu item. When the window is displayed, you select a savepack, a drivepack and a pool for each server.
In our example, to backup the Sun, the savepack to select is Sun, the drivepack is Sun Pack and the pool is Sun Pool. Choose the type of backup (total in this example, but you could have selected incremental), the tape strategy (use new tapes for this example, alternatively we could have said complete existing tapes) and use e-mail notification or not (it's up to you). Just click in the check box and the Sun backup starts.
Repeate this step for each server and that's all. Four backups are running at once and you can have a coffee.

linux.jpg
The backup window

About 40 minutes later it's all over.
Some important facts: our network is a category 5... but under construction! That means: not all the cabling is cat 5, hubs are either 10 or 10/100 Mbit, only some machines have real 100Mb network cards...
Nevertheless, we did backup about 3Gb and something like 150000 files in about 40 minutes.
On true cat 5 networks with 100Mbit, you could have done it about 3 times faster.
Speed depends on many factors: network, machines, tape drives, tape size. Our example is a specific case and doesn't reflect the fantastic speed you can get on some configuration. If your network is true cat 5, if you own robots, if you use DAT, if your machines are very fast... you can expect about 70 Mb/min. In our example, we get a mean of about 25Mb/min.
This is for each backup. Considering they are done altogether gives 100Mb/min.

linuxlog.jpg pcdevlog.jpg pcsvrlog.jpg sunlog.jpg
The logs

However, an interesting way to understand Arkeia efficiency is to compare these results to the old backup strategy on the same hardware.
Before using Arkeia, backups were done directly from each machine, every tape drive being connected to their SCSI port.
We still use the same machines and the same tape drives but we gained about 50% speed.
Sun backup time with its own tape drive: 1 hour; with Arkeia: 38 minutes. Pcdev backup time with its own tape drive: more than 1 hour; with Arkeia: 32 minutes. And so on...
Don't we forget to backup the backup server: that time we don't use the network, and of course it becomes a bit slower (about 22Mb/min).
This example concerns network servers. Obviously, you can do the same for your network clients.
According to Arkeia, with the right hardware you should be able to backup 128 machines on 32 devices in parallel.
I have not been able to test such a configuration. We are poor...where I work.
We can't talk about backup without mentioning restoration: It's as easy and fast as backup. Using the navigator you can restore exactly what you want and even where you want, that is, you can restore into some other directory or machine.

More features

We haven't said a word about periodic backup. Of course Arkeia is able to manage backup programmed to run at a specific time. On Unix, it will use cron and on Microsoft products it will be a service.
You can define different intervals for a periodic backup: day, week, month...
It's possible to manage three levels for each backup. For example, you want to program your periodic backup for 1 month. Level 1 corresponds to monthly backup, level 2 is weekly and level 3 is daily. That means level 3 will run 6 times per week, level 2 will run 3 times per month and of course level 1 will only run once.
The side effect: you will need a lot of tapes and of course as many tape drives as servers to backup!
Unless you teach your dog to change the tapes at night between backups...
A part from that, it works the same way as interactive backup. There's a lot to say about periodic backup but we would need to write a new article just for it. Let's just say that  this is probably one of the greatest features of Arkeia.
Another interesting point concerns logs. You get logs for everything: tapes, devices, backup...
It does take some disk space, but it's quite useful.
These logs are quite well organized and give a lot of information. Arkeia manages them in a nice way, archiving them by month.
 

avlog.jpg
The available logs window

The online help deserves to be mentioned. You can work with Arkeia without reading the full documentation. Well, as soon as you don't want to go into advanced administration.
The full documentation is contained in the program CD-Rom in PDF format, allowing you to easily print it and to create your own manuals.
This doc contains very important information about platform specific configuration, security, troubleshooting, tools, robot libraries...
If you need more, there is a mailing list dedicated to Arkeia.

So what?

This article cannot be exhaustive. There is so much more to say about this software.
Arkeia didn't try to reinvent the wheel, but to use everything allowed to improve performance.
IPC is one of the keys. Once again, Thanks to Unix.
Let's mention the personal shareware version for Linux. It's intended for non commercial use. Nothing prevents you from testing it on your private network.
As it is shareware, if you send a contribution of 20$ or more, 25% will be donated to Software in the Public Interest.
Something else to mention: I got Arkeia from the European division. The person in charge knows very well what she is talking about and does a very good job (Hello Sandy!).
This company is the first one among the numerous I work with, who have been able to handle the matter entirely by e-mail. No fax, no post mail except for the software package. And last but not least, only plain platform independent TEXT!!!
This is totally off-topic, but I'm fed up with those Word or Excel docs running around the Internet. First, they are all full of potential macro viruses. Second, everybody on this earth is NOT obliged to own Word or Excel (How were we working before?). Third, a text file is about 10 to 100 times smaller than the same text as a Word document, and at a time where the bandwidth on the Internet is getting rather tight, size does matter.
Sorry for that digression!
Now, I will get a lot of new friends, so let's go back to Arkeia.
I do believe, Arkeia is a good network backup solution. There are points I don't quite like, but I think those people are working hard to improve their product. Personally I would like more features about tape management, such as the ability to decide how many tapes I want to use within a strategy or a possibility to overwrite a tape during backup. In short , I would like to be the boss!
The price is an important element at the time of choice. Arkeia seems very competitive in that field.
If you're looking for such a tool, don't forget to try this one. Even if you own Arkeia from a Linux distribution, go and have a look at http://www.arkeia.com to learn more about it.

We're living a great time!