Resilience is an important aspect of computing systems. Previous work on resilience has often focused on the design and architectural aspects of such systems, and not on the quantification of resilience. In addition, quantification is often restricted to a limited portion of the system. In networked systems, where multiple heterogeneous components interact in a complex manner, resilience quantification becomes a nontrivial problem. This paper proposes a model for quantifying resilience on the basis of the interdependencies of services and their adaptation. It combines performance and adaptability metrics to compute resilience of individual services that are then applied to a Markov network that computes the overall system resilience. The adaptation metric, here called adaptivity, computes how often the service adapts and evaluates the efficiency of such adaptations in terms of performance improvement. This paper also presents an evaluation that considers critical infrastructure systems.
History
Publication title
IBM Journal of Research and Development
Volume
57
Issue
5
Pagination
3:1-3:9
ISSN
2151-8556
Department/School
School of Information and Communication Technology
Publisher
IBM Corp
Place of publication
1 New Orchard Road, Armonk, USA, Ny, 10504
Rights statement
Copyright 2013 by International Business Machines Corporation
Repository Status
Restricted
Socio-economic Objectives
Information systems, technologies and services not elsewhere classified