In: Categories » Computers and technology » Data recovery » Evaluating Storage Media Requirements
The next step in developing a backup and recovery strategy is to determine the storage media requirements. Based on the information you have already gathered, you should have a pretty good idea of how much data is going to be backed up, how many copies you will need to keep, and how long you will keep them. If this is a new backup domain, you can use this information to determine what type of tape drives and media will work best. If this is an existing domain, you can determine if you have enough drive and library capacity.
Every piece is interrelated. The size of the servers is dependent on the number of drives. The number of drives is dependent on the amount of data being backed up and the backup window. The backup window is related to the number of drives and the speed of the drives. The ability to use the bandwidth of the drives is related to the network speed and layout. That is why we picked a starting point of analyzing the data and moved forward from there.
As you move forward with evaluating storage media, you need to consider the selection of a suitable drive technology. The number of different drive types and the number of different libraries that support the different drives complicate this decision. You should gather enough information to make an informed decision. One place that can provide a comparison of all the different types is storagemountain. This Web site shows all the different technologies and gives speeds, capacities, and load/unload times, as well as access times. If backup speed is the primary concern, a fast drive with a high-density cartridge might be the best solution. If recovery speed is more important, you might want to look at a fast drive with fast access, but high-density may not suit your requirements. If you want to recover a single file from the end of a tape, it would be faster to find it on a 20-GB tape than a 120-GB tape.
One of the most critical steps in evaluating the storage media requirements is determining the actual backup window. This must be the actual amount of time you are allowed to have backups running, while at the same time, controlling the backup hardware, using a major part of the network, and using the resources on the systems being backed up. The size of your backup window is becoming a much harder measurement to define. You must be able to determine the number of hours in a day and in a week that can be dedicated to backups, as this is an integral part of the equation to determine media requirements.
You need to know how much data needs to be backed up during each window. Generally, the largest backups will be the full backups. In the past, most administrators performed daily incremental backups and did all their full backups over the weekend when most people were not working. This concept is changing. It is very common now for a percentage of the systems, say, one-fifth, to have full backups done each day and the remaining systems to have incremental backups each day. The weekends are saved to do maintenance or to catch up. If this is closer to your model, then your window would be the time each day when backups are performed, and the amount of data would be the average sum of the total data that would be backed up, roughly a fifth of your total data. If you do not have specific operational information on the amount of data that will make up your incremental backups, you can estimate using a percentage of change to calculate the amount of data. It is common to use 20 percent, unless you have a more accurate measurement. The goal here is to try to get as close as possible to your actual environment.
Now that we have the amount of data and the number of hours needed to store that data, all we have left to do is some basic math. Just take the total amount of data that has to be backed up daily and divide by the duration of the daily backup window:
Ideal data transfer rate = Amount of data to back up ÷ Backup window
If you have 100 GB of data and an 8-hour window, your ideal data transfer rate would be 12.5 GB/hr.
After you have an idea of the ideal data transfer rate, you can then look at the different drive types to see which might offer the best fit for your needs. Not surprisingly, this is a little more complicated than just looking at the base numbers, though. With potential drive technology, you must consider both performance and capacity. In larger enterprise environments, one size usually does not fit all. As mentioned several times, you need to look at the recovery requirements first and work back. This might mean you will need two different types of drives, some that are very high performance but with less capacity and some that offer higher capacity with lower performance. Data that is being kept for long retention periods, especially to fulfill legal requirements, might be better suited for the lower-performance but higher-capacity media. Data that might be required for immediate restores where time is money might be better suited for the high-performance media. It is not uncommon to have backups done to high-performance drives and media and then the images vaulted to high-capacity drives and media for off-site storage.
This information can be very helpful in determining which drive technology you need, but never forget these are all theoretical numbers and are given without taking into account the internal drive compression. Drive manufacturers advertise compression rates for the different drive technologies. These vary depending on the drive but are also theoretical numbers. These specifications can change with new firmware levels or versions of the drives. To get the most accurate numbers, contact the drive vendor or go to their Web site, where you'll find up-to-date specification sheets.
When you start actually figuring how many of which kind of drive you will need, we recommend using the native transfer rates and capacities without compression. It is very difficult to estimate what kind of compression rate you will experience, as it is totally dependent on the makeup of your data. Some data is very compressible, while other data will yield very little compression. If you do your architecture based on no compression, the only surprises you should experience should be good ones; you will have plenty of capacity with room for growth.
After selecting the appropriate drive technology that provides the performance and cartridge capacity you need, you next want to look at how many cartridges you will need to have available. This involves all the elements we have looked at so far. The number of cartridges required depends on the amount of data that you are backing up, the frequency of your backups, your retention periods, and the capacity of the media used to store your backups. A simple formula that can be used is as follows:
Number of tapes = (Total data to back up × Frequency of backups × Retention period)/Tape capacity
Following is an example:
Total amount of data = 100 GB
Full backups per month = 4
Retention period for full backups = 6 months
Incremental backups per month = 30
Retention period for incremental backups = 1 month
Preliminary calculations:
Size of full backups = 100 GB × 4 per month × 6 months = 2.4 TB
Size of incremental backups = (20 percent of 100 GB) × 30 × 1 month = 600 GB
Total data stored = 2.4 TB + 600 GB = 3 TB
Solution:
Tape drive = DLT 7000
Tape capacity without compression = 31.5 GB
Total tapes needed for full backups = 2.4 TB / 31.5 GB = 76.2 = 77
Total tapes needed for incremental backups = 600 GB / 31.5 GB = 19.1 = 20
Total tapes needed = 77 + 20 = 97
By looking at this example, you would expect to have a minimum of 97 active cartridges at any given time. This also assumes that all the cartridges will be filled to capacity and there will be no unused tape. These calculations are based on no compression. This does give you an idea of the steps necessary to plan for an appropriately sized tape library. We would never recommend implementing an enterprise backup strategy that does not include a robotic tape library with a barcode reader. Without these, the management can become overwhelming and very susceptible to human error. It is much better to turn over media management to an enterprise backup application.
When figuring out how many slots are required to support your environment, do not forget to include some slots for cleaning tapes and at least two for the catalog backups. Actually, you will want to reserve twice as many slots for catalog backups as are needed so you can keep a copy of the catalog. If you are including an off-site storage solution of some type (vaulting) as part of your backup strategy, you need to include this in your total capacity calculations, since creating duplicate copies requires additional tapes.
legal notice
Our website is not responsible for the information contained by this article. Web-articles is a free articles resource.
Suggestion: If you need fresh, daily updated content for your website, feel free to use our service. Click here for more information.
Useful tools and features
If you like this article (tutorial), please link to it from your web page using the information above.
related articles
Once you understand the general backup requirements for all of the data and the business and legal requirements, you should have a pretty good idea of how much data needs to be backed up and at least a minimum requirement for the frequency. The trick in establishing the ideal frequency policy is to come up with a schedule that gives you adequate protection with minimal media usage. You don't want to back up any more often than needed to get the necessary level of protection, since 'more often' means more tapes, more data being mo...
2. Layout NetBackup Domain
Now the fun begins. You have gathered tons of data and know more about your enterprise than you ever thought was possible. It is time to put all of this knowledge to use. If this is the first time an actual backup and recovery strategy has been implemented, you will be able to tailor the backup domain. If this is an upgrade or application change, you will probably have to work within the confines of the existing layout, making changes as required. Using NetBackup as the application in this domain, you first want to list a...
3. Specific NetBackup Configuration Elements
There are several library manufacturers, each with an entire line of libraries from small to very large. Part of this decision will be based on the drive technology you select, as some libraries support only certain drives. The considerations for selecting a library are as follows: Does it handle the desired drive type? Will it handle the required number of drives? Does it support the needed number of slots? Does it have expansion capability? ...
4. Define some storage media to be used for the backups
The physical cartridges are called tapes, media, or volumes. When we discuss them within NetBackup Media Manager, they are referred to as volumes. These are the actual tapes that will be used to hold the data that is being backed up. Once you have configured one or more robotic devices, you can use the Volume Configuration Wizard. An easy way to proceed, at least for the first time, is to use this tool and to inventory the robotic libraries. If you are not using a library or have media without barcodes, you can use the Volume Con...
5. Using the Activity Monitor
Now that you have successfully installed and configured your backup domain, you are ready to sit back and take it easy. But wait, someone knocking on your door wants to know the status of their backup or restore. I guess you will have to start monitoring the backup and restore processes. While we are at it, we might as well look at monitoring some of the other elements in the backup domain that might need our attention. In this article, we go through some of the tools available to monitor the activities of our example backup and ...