Search

Browse Subject Areas

For Authors

Submit a Proposal

Join Our Mailing List

Enter your email address:

Enter your first name:

Enter your last name:

Choose subjects that interest you
Hold down the CTRL key for multiple selection



Building Dependable Distributed Systems

By Wenbing Zhao
Series: Performability Engineering Series
Copyright: 2014   |   Status: Published
ISBN: 9781118549438  |  Hardcover  |  
364 pages | 97 illustrations
Price: $175 USD
Add To Cart

One Line Description
A one-volume guide to the most essential techniques for designing and building dependable distributed systems

Audience
The primary market is networked and distributed computing system designers and developers, software architects, senior software engineers, and technical managers. The book will also prove very useful for upper-level undergraduate and graduate students in computer science and computer engineering interested in dependable computing research.

Description
Instead of covering a broad range of research works for each dependability strategy, this useful reference focuses on only a selected few (usually the most seminal works, the most practical approaches, or the first publication of each approach), explaining each in depth, usually with a comprehensive set of examples. Each technique is dissected thoroughly enough so that readers who are not familiar with dependable distributed computing can actually grasp the technique after studying the book.
Building Dependable Distributed Systems consists of eight chapters. The first introduces the basic concepts and terminology of dependable distributed computing, and also provides an overview of the primary means of achieving dependability. Checkpointing and logging mechanisms, which are the most commonly used means of achieving limited degree of fault tolerance, are described in the second chapter. Works on recovery-oriented computing, focusing on the practical techniques that reduce the fault detection and recovery times for Internet-based applications, are covered in chapter three. Chapter four outlines the replication techniques for data and service fault tolerance. This chapter also pays particular attention to optimistic replication and the CAP theorem. Chapter five explains a few seminal works on group communication systems. Chapter six introduces the distributed consensus problem and covers a number of Paxos family algorithms in depth. The Byzantine generals problem and its latest solutions, including the seminal Practical Byzantine Fault Tolerance (PBFT) algorithm and a number of its derivatives, are introduced in chapter seven. The final chapter details the latest research results surrounding application-aware Byzantine fault tolerance, which represents an important step forward in the practical use of Byzantine fault tolerance techniques.


Back to Top
Author / Editor Details
Wenbing Zhao received his PhD in electrical and computer engineering from the University of California, Santa Barbara, in 2002. Currently, he is an Associate Professor in the Department of Electrical and Computer Engineering at Cleveland State University. Dr. Zhao has more than 80 academic publications to his credit, and three of his recent research papers in the area of dependable distributed computing have won best paper awards. Dr. Zhao also has a U.S. patent on consistent time service for fault-tolerant distributed systems.

Back to Top

Table of Contents
List of Figures
List of Tables
Acknowledgments
Preface

1 Introduction to Dependable Distributed Computing
1.1 Basic Concepts and Terminologies
1.2 Means to Achieve Dependability
2 Logging and Checkpointing
2.1 System Model
2.2 Checkpoint-Based Protocols
2.3 Log Based Protocols
3 Recovery-Oriented Computing
3.1 System Model
3.2 Fault Detection and Localization
3.3 Microreboot
3.4 Overcoming Operator Errors
4 Data and Service Replication
4.1 Service Replication
4.2 Data Replication
4.3 Optimistic Replication
4.4 CAP Theorem
5 Group Communication Systems
5.1 System Model
5.2 Sequencer Based Group Communication System Sender Based Group Communication System
5.3 Vector Clock Based Group Communication System
6 Consensus and the Paxos Algorithms
6.1 The Consensus Problem
6.2 The Paxos Algorithm
6.3 Multi-Paxos
6.4 Dynamic Paxos
6.5 Fast Paxos
6.6 Implementations of the Paxos Family Algorithms
7 Byzantine Fault Tolerance
7.1 The Byzantine Generals Problem
7.2 Practical Byzantine Fault Tolerance
7.3 Fast Byzantine Agreement
7.4 Speculative Byzantine Fault Tolerance
8 Application-Aware Byzantine Fault Tolerance
8.1 High Throughput BFT Systems: Networked
File Systems
8.2 Exploiting Deep Application Semantics: Web
Services Coordination
Index


Back to Top


BISAC SUBJECT HEADINGS
COM047000: COMPUTERS: Distributed Systems & Computing
COM051300 : COMPUTERS / Programming / Algorithms
TEC 020000: Technology & Engineering/ Manufacturing
 
BIC CODES
UTR: Distributed Systems
UMB: Algorithms & data structures
KND: Manufacturing Industries

Back to Top


Description
BISAC & BIC Codes
Author/Editor Details
Table of Contents
Bookmark this page