Keynote Speakers

 

Prof. Geoffrey Voelker, Department of Computer Science and Engineering, University of California San Diego, will give the keynote speech entitled "Automated Availability Management in Large-Scale Storage Systems".


Abstract

Availability is a storage system property that is both highly desired and yet minimally engineered.  While many systems provide mechanisms to improve availability -- such as redundancy and failure recovery -- how to best configure these mechanisms is typically left to the system manager.  Unfortunately, few individuals have the skills to properly manage the trade-offs involved, let alone the time to adapt these decisions to changing conditions.  Instead, most systems are configured statically and with only a cursory understanding of how the configuration will impact overall performance or availability.   While this issue can be problematic even for individual storage arrays, it becomes increasingly important as systems are distributed -- and absolutely critical for the wide-area peer-to-peer storage infrastructures being explored.

In this talk I will describe the motivation, architecture, and implementation for a peer-to-peer storage system, called TotalRecall, that automates the task of availability management.  In particular, the TotalRecall system automatically measures and estimates the availability of its constituent host components, predicts their future availability based on past behavior, calculates the appropriate redundancy mechanisms and repair policies, and  delivers user-specified availability while maximizing efficiency.  I will end by discussing open research problems in this area.