Chapter 5. Archiving and Retrieving

NetWorker includes an optional Archive product for storage of data. Data archiving is the process of taking a snapshot of files or directories as they reside on primary media (usually disk) at a given point in time for long-term storage on media called archive volumes. Archive volumes are similar to backup volumes, except that their retention policy is “retain forever.” Archive volumes never expire.

Archiving is normally performed on data associated with specific projects, rather than on an entire system. Unlike data backup, users typically initiate data archiving whenever they need, so a network-wide archiving policy is usually impractical.


Note: If you are not sure if your site or your system has Networker Archive, ask your system administrator.

The principal goals of performing a backup and archiving differ:

Furthermore, backups are always associated with a level—full, differential, or incremental—whereas archives are not.

After archiving is complete, NetWorker allows you to delete archived files and directories to free disk space, or leave them in place. This option is called grooming. However, if you need to retrieve the files someday, you should be prepared for a wait. Your site's policy may be to move archives offsite, so getting them back could take a day or more. Check with your administrator for details.

Because archived data might be deleted from the system immediately or eventually, NetWorker provides an extra measure of security to make sure that archived data is correct. NetWorker verifies data in two ways:

Like the basic NetWorker program, the Archive product consists of two programs, an archive program for storing data on archive volumes and a retrieval program for bring the data back to your system. If NetWorker has been installed on your system, you will see the icons, as shown in Figure 5-1.

Figure 5-1. Networker Archive and Retrieve Icons


This chapter explains archiving data and retrieving data, in separate sections.

Archiving Data

This section explains

  • starting the Archive program

  • starting and monitoring an archive

  • grooming your filesystem

Starting the Archive Program

To start archiving, click on the Archive icon or type this command at the system prompt:

% nwarchive&

The NetWorker Archive window appears, as shown in Figure 5-2.

Figure 5-2. NetWorker Archive Window


This window resembles the NetWorker Backup window. See Figure 2-2 in “Using NetWorker Windows” in Chapter 2. For details on that program, refer to “Backing Up Data” in Chapter 4.

Most menu choices have shortcuts elsewhere in the window. The exceptions are “File Details” in the View menu, which produces a long listing of files in the right-hand panel, and “Speedbar” in the Options menu, which alternately hides and enables the speedbar buttons.

Use features of this window as follows:

speedbar  

These buttons perform the following functions:

  • Change Server: switches to a different NetWorker server

  • Mark: designates a file or directory hierarchy for archiving

  • Unmark: turns off the designation for archiving

  • Search: brings up a dialog box to help you find a named file

  • Start: begins the archive, if you have marked files

Selection field 

Shows the hostname of the chosen server and client and the currently selected (working) directory in the panels below. You can type a new pathname into the Selection field if desired.

left panel 

Displays the currently selected path on the client machine, all the way up to root. To display all directories and files in the currently selected directory, double-click the directory folder, or choose “Expand One Level” from the Tree menu.

To display all levels of subdirectories, choose “Expand Branch” from the Tree menu. To hide all the expanded subdirectories, double-click again on the directory folder, or choose “Collapse Branch” from the Tree menu.

right panel 

Displays all files in the currently selected directory, often as a scrolling list. Files appear as pages with the corner bent over, and directories appear as manila folders. To mark a file or directory for archiving, click on its check box to the left. A check mark appears in the box, as shown for the NSRman directory in Figure 5-2. To unmark a file or directory, click again on its check box.


Tip: Marking a directory also marks all the files it contains, as well as all subdirectories and everything they contain. To archive most (but not all) files in a directory, first mark the directory, and then unmark what you do not want to archive.


Starting and Monitoring an Archive

To begin the archive, click the Start button on the speedbar, or choose “Start Archive” from the File menu. The Archive Options window appears, as shown in Figure 5-3.

Figure 5-3. Archive Options Window


Use features of this window as follows:

Archive Title 

View the pathname of the currently selected directory.

Annotation field 


Type a comment that might help somebody remember or understand what this archive contains.


Tip: Because archives theoretically last forever, NetWorker does not track them in the online index. Consequently, when you retrieve archived data, the retrieve program shows only the archives that a server has made during a selected period, or for a certain user. Thus the best way to remember your archives is to create an annotation that remains meaningful over the years or to keep a record of the date.


Archive Pool field 


Specify a volume pool for your archive. If you leave this field blank, the volume pool defaults to Archive.

Operations list 

Click one of these buttons:

  • Clone: Click to clone your archive data (that is, make an exact duplicate) on a volume other than the regular archive volume. This action depends on the availability of two devices, each with the proper volume mounted. Verify their availability with your system administrator, if necessary.

  • Verify: Click to verify that the data on archive media matches the data on local disk.

  • Groom: Click to remove files and directories from local disk after archiving is complete.

Clone Pool field 


Enter the clone pool you wish to use, if you are cloning your archive data (click the Clone button, as explained above). Make sure that this volume is mounted and available. If you specify a Clone Pool, the clone is sorted to a volume associated with the pool you specified. If you leave this field blank, the clone pool defaults to Archive Clone.

Archive button 

Click to continue the archiving process. The Archive Status window appears to inform you of progress, as shown in Figure 5-4.

Figure 5-4. Archive Status Window


If you selected “Verify” in the Archive Options window, NetWorker checks the data written on the archive volume. The archive completion report in the Archive Status window indicates whether or not data could be verified as correct.

Grooming Your Filesystem

If you selected “Groom” in the Archive Options window, NetWorker deletes marked files after archiving is complete.


Note: Before you select the Groom option, return to the Archive window to see which directories and files are marked. Make sure that you have not marked directories or files that you wish to retain on disk.


Retrieving Data

Retrieval is the copying of files back to local disk from the long-term storage media called archive volumes. When you run the NetWorker retrieve program, NetWorker displays the names of the archive save sets on that server, listed by client name. You can retrieve a particular save set only if you have administrator or archive user privileges for that server. You can search for specific archives or alter the sort order of archive save sets in the viewing list.

When you choose an archive save set to retrieve, and your administrator ensures that the relevant archive volume (or a clone of that volume) is mounted on the NetWorker server, retrieval can begin.

Retrieved save sets can be relocated, renamed, or allowed to overwrite existing files of the same name, as with the NetWorker recover program.

This section explains

  • tracking archived data

  • starting the NetWorker retrieve program

  • viewing archive save set details

  • retrieving a save set

Tracking Archived Data

Archiving and backing up have different purposes. Archiving is a long-term strategy for preserving data and conserving disk space. Backup is short-term insurance against the disaster of data loss. To make it easy to recover lost or corrupted files, NetWorker keeps track of backups in the online file index. Maintaining this index requires considerable disk space. Because archives theoretically last forever, NetWorker does not track archived files in the online index.

Consequently, when you retrieve archived data, the windows do not show the structure of your files and directories. It shows only the archives that a server has made during a selected period, or for a certain user. Thus the best way to remember your archives is to keep a record of the date, or create an annotation that remains meaningful over the years.

Starting the NetWorker Retrieve Program

To retrieve archived files and directories, click on the Retrieve icon or type the following command at the system prompt:

% nwretrieve&

The NetWorker Archive Retrieve window appears, initially with no save sets listed. To see a list of archive save sets, click the Query button.

The window fills up with data, as shown in Figure 5-5.

Figure 5-5. NetWorker Archive Retrieve Window


Use features of this window as follows:

Search annotation for field 


Type a search string (an annotation text or a portion of one) and click the Query button, as explained below. You can use regular expressions in the search string. All archive save sets with matching annotations appear highlighted in the Archives list.

Archives scrolling list 


View a list of archive save sets available from the server.

Query button 

Click to display a list of archive save sets. If you have specified a search string in the “Search annotation for” field, the save sets with matching annotations appear highlighted in the Archives list.

Details button 

Click to view details about the most recently highlighted archive save set, as explained in the next section, “Viewing Archive Save Set Details.”

Viewing Archive Save Set Details

To see details about the most recently highlighted archive save set, click the Details button. The Retrieve Details window appears, as shown in Figure 5-6.

Figure 5-6. Retrieve Details Window


In this window, “name” represents the save set, “files” the number of files in the save set, “ssid” the save set ID, and “size” the number of bytes.

This window also shows the archive annotation, which you can search for in the NetWorker Archive Retrieve window.


Note: There is no point in selecting multiple save sets for viewing details; the most recently highlighted archive save set is the only one NetWorker reports on.


Retrieving a Save Set

To select a save set for retrieval, highlight it in the Archives list, and choose “Start Retrieve” from the File menu.


Tip: You can select multiple save sets for retrieval. NetWorker retrieves them in the order in which they were archived, so as not to overwrite later copies of files if you have Overwrite existing files selected.

The Retrieve Status window appears, as shown in Figure 5-7.

Figure 5-7. Retrieve Status Window


Use features of the Retrieve Status window as follows:

Archives list  

View all archive save sets selected for retrieval in the NetWorker Archive Retrieve window.

Relocate to field 


Specify a new or existing directory where retrieved files can be copied.

Overwrite existing files 


Select to cause NetWorker Retrieve to replace any files on local disk with the same pathname as files retrieved from the archive volume.

Ok button 

Click when you are ready to start the retrieve. The appropriate archive volume(s) must be mounted beforehand.

Cancel button 

Click to stop a retrieve in progress, or to dismiss the Retrieve Status window before retrieval has begun.

Status field 

View informational messages that inform you of progress of the retrieve.