Spectral based Methods that Streamline the Search for Failure Scenarios in Large-Scale Distributed Systems

Fern Y. Hunt, Katherine Morrison, and Christopher Dabrowski

Keywords

grid computing, cloud computing, spectral expansion, non-homogeneous Markov Chain

Abstract

We report our work on the development of analytical and numerical methods that enable the detection of failure scenarios in distributed grid computing, cloud computing and other large scale systems.The spectral (i.e. eigenvalue and eigenvector) properties of the matrices associated with a non-homogeneous absorbing Markov Chain are used to quickly compute the long time proportion of tasks completed at a given setting of parameters. This enables the discovery of critical ranges of parameter values where system performance deteriorates and fails.

Important Links:



Go Back