S13: HPC Archive Solutions Made Simple
Sunday, Nov. 13
Alan Powers, CSC
Jason Hick, NERSC
Matt Cary, NASA Advanced Simulation Facility
This tutorial provides an overview of the different hardware technologies and software features used in HPC archival storage systems. The tutorial will touch on the following topics: tape libraries, tape drives, disk cache, HSM, data movers, migration, data recall, storage policies, tape pools, tape compaction, tape striping, archive metrics, data integrity, importing and exporting data, and disaster recovery. Other topics of interest developing an RFP. # How to determine recall and migration policies by profiling existing data files? # What requirements to consider for architecting an archive system? # What future requirements should be considered when sizing an archive system? Highlights of the archive policies of four major HPC sites: National Energy Research Scientific Computing Center (DOE/NERSC - 22 PB), NASA Advanced Supercomputing Facility (NASA/NAS - 28 PB), NOAA Environmental Security Computing Center (NOAA/NESCC - 14 PB), and Geophysical Fluid Dynamics Laboratory (NOAA/GFDL - 28 PB) - will be presented.