|
Showing 1 - 3 of
3 matches in All Departments
This book addresses the question of how system software should be
designed to account for faults, and which fault tolerance features
it should provide for highest reliability. With this second edition
of Software Design for Resilient Computer Systems the book is
thoroughly updated to contain the newest advice regarding software
resilience. With additional chapters on computer system performance
and system resilience, as well as online resources, the new edition
is ideal for researchers and industry professionals. The authors
first show how the system software interacts with the hardware to
tolerate faults. They analyze and further develop the theory of
fault tolerance to understand the different ways to increase the
reliability of a system, with special attention on the role of
system software in this process. They further develop the general
algorithm of fault tolerance (GAFT) with its three main processes:
hardware checking, preparation for recovery, and the recovery
procedure. For each of the three processes, they analyze the
requirements and properties theoretically and give possible
implementation scenarios and system software support required.
Based on the theoretical results, the authors derive an
Oberon-based programming language with direct support of the three
processes of GAFT. In the last part of this book, they introduce a
simulator, using it as a proof of concept implementation of a novel
fault tolerant processor architecture (ERRIC) and its newly
developed runtime system feature-wise and performance-wise. Due to
the wide reaching nature of the content, this book applies to a
host of industries and research areas, including military,
aviation, intensive health care, industrial control, and space
exploration.
This book addresses the question of how system software should be
designed to account for faults, and which fault tolerance features
it should provide for highest reliability. The authors first show
how the system software interacts with the hardware to tolerate
faults. They analyze and further develop the theory of fault
tolerance to understand the different ways to increase the
reliability of a system, with special attention on the role of
system software in this process. They further develop the general
algorithm of fault tolerance (GAFT) with its three main processes:
hardware checking, preparation for recovery, and the recovery
procedure. For each of the three processes, they analyze the
requirements and properties theoretically and give possible
implementation scenarios and system software support required.
Based on the theoretical results, the authors derive an
Oberon-based programming language with direct support of the three
processes of GAFT. In the last part of this book, they introduce a
simulator, using it as a proof of concept implementation of a novel
fault tolerant processor architecture (ERRIC) and its newly
developed runtime system feature-wise and performance-wise. The
content applies to industries such as military, aviation, intensive
health care, industrial control, space exploration, etc.
This book introduces an approach to active system control design
and development to improve the properties of our technological
systems. It extends concepts of control and data accumulation by
explaining how the system model should be organized to improve the
properties of the system under consideration. The authors define
these properties as reliability, performance and energy-efficiency,
and self-adaption. They describe how they bridge the gap between
data accumulation and analysis in terms of interpolation with the
real physical models when data used for interpretation of the
system conditions. The authors introduce a principle of active
system control and safety - an approach that explains what a model
of a system should have, making computer systems more efficient, a
crucial new concern in application domains such as safety critical,
embedded and low-power autonomous systems like transport,
healthcare, and other dynamic systems with moving substances and
elements. On a theoretical level, this book further extends the
concept of fault tolerance, introducing a system level of design
for improving overall efficiency. On a practical level it
illustrates how active system approach might help our systems be
self-evolving.
|
|