Keynote Speakers
Prof. Geoffrey Voelker, Department of Computer Science
and Engineering, University of California San Diego, will give the
keynote speech entitled "Automated
Availability Management in
Large-Scale Storage Systems".
Availability
is a storage system property that is both highly desired and yet
minimally engineered. While many systems provide mechanisms to
improve availability -- such as redundancy and failure recovery -- how
to best configure these mechanisms is typically left to the system
manager. Unfortunately, few individuals have the skills to
properly manage the trade-offs involved, let alone the time to adapt
these decisions to changing conditions. Instead, most systems are
configured statically and with only a cursory understanding of how the
configuration will impact overall performance or
availability. While this issue can be problematic even for
individual storage arrays, it becomes increasingly important as systems
are distributed -- and absolutely critical for the wide-area
peer-to-peer storage infrastructures being explored.
In this talk I will describe the motivation, architecture, and
implementation for a peer-to-peer storage system, called TotalRecall,
that automates the task of availability management. In
particular, the TotalRecall system automatically measures and estimates
the availability of its constituent host components, predicts their
future availability based on past behavior, calculates the appropriate
redundancy mechanisms and repair policies, and delivers
user-specified availability while maximizing efficiency. I will
end by discussing open research problems in this area.