Fault-Tolerant Systems

Fault-Tolerant Systems

Author: Israel Koren

Publisher: Elsevier

Published: 2010-07-19

Total Pages: 399

ISBN-13: 0080492681

DOWNLOAD EBOOK

Fault-Tolerant Systems is the first book on fault tolerance design with a systems approach to both hardware and software. No other text on the market takes this approach, nor offers the comprehensive and up-to-date treatment that Koren and Krishna provide. This book incorporates case studies that highlight six different computer systems with fault-tolerance techniques implemented in their design. A complete ancillary package is available to lecturers, including online solutions manual for instructors and PowerPoint slides. Students, designers, and architects of high performance processors will value this comprehensive overview of the field. - The first book on fault tolerance design with a systems approach - Comprehensive coverage of both hardware and software fault tolerance, as well as information and time redundancy - Incorporated case studies highlight six different computer systems with fault-tolerance techniques implemented in their design - Available to lecturers is a complete ancillary package including online solutions manual for instructors and PowerPoint slides


Patterns for Fault Tolerant Software

Patterns for Fault Tolerant Software

Author: Robert S. Hanmer

Publisher: John Wiley & Sons

Published: 2013-07-12

Total Pages: 272

ISBN-13: 1118351541

DOWNLOAD EBOOK

Software patterns have revolutionized the way developer’s and architects think about how software is designed, built and documented. This new title in Wiley’s prestigious Series in Software Design Patterns presents proven techniques to achieve patterns for fault tolerant software. This is a key reference for experts seeking to select a technique appropriate for a given system. Readers are guided from concepts and terminology, through common principles and methods, to advanced techniques and practices in the development of software systems. References will provide access points to the key literature, including descriptions of exemplar applications of each technique. Organized into a collection of software techniques, specific techniques can be easily found with sufficient detail to allow appropriate choices for the system being designed.


Software Fault Tolerance Techniques and Implementation

Software Fault Tolerance Techniques and Implementation

Author: Laura L. Pullum

Publisher: Artech House

Published: 2001

Total Pages: 358

ISBN-13: 1580531377

DOWNLOAD EBOOK

Look to this innovative resource for the most-comprehensive coverage of software fault tolerance techniques available in a single volume. It offers you a thorough understanding of the operation of critical software fault tolerance techniques and guides you through their design, operation and performance. You get an in-depth discussion on the advantages and disadvantages of specific techniques, so you can decide which ones are best suited for your work.


Fault-Tolerant Design

Fault-Tolerant Design

Author: Elena Dubrova

Publisher: Springer Science & Business Media

Published: 2013-03-15

Total Pages: 195

ISBN-13: 1461421136

DOWNLOAD EBOOK

This textbook serves as an introduction to fault-tolerance, intended for upper-division undergraduate students, graduate-level students and practicing engineers in need of an overview of the field. Readers will develop skills in modeling and evaluating fault-tolerant architectures in terms of reliability, availability and safety. They will gain a thorough understanding of fault tolerant computers, including both the theory of how to design and evaluate them and the practical knowledge of achieving fault-tolerance in electronic, communication and software systems. Coverage includes fault-tolerance techniques through hardware, software, information and time redundancy. The content is designed to be highly accessible, including numerous examples and exercises. Solutions and powerpoint slides are available for instructors.


Fault-Tolerant Systems

Fault-Tolerant Systems

Author: Israel Koren

Publisher: Morgan Kaufmann

Published: 2020-09-01

Total Pages: 418

ISBN-13: 0128181060

DOWNLOAD EBOOK

Fault-Tolerant Systems, Second Edition, is the first book on fault tolerance design utilizing a systems approach to both hardware and software. No other text takes this approach or offers the comprehensive and up-to-date treatment that Koren and Krishna provide. The book comprehensively covers the design of fault-tolerant hardware and software, use of fault-tolerance techniques to improve manufacturing yields, and design and analysis of networks. Incorporating case studies that highlight more than ten different computer systems with fault-tolerance techniques implemented in their design, the book includes critical material on methods to protect against threats to encryption subsystems used for security purposes. The text's updated content will help students and practitioners in electrical and computer engineering and computer science learn how to design reliable computing systems, and how to analyze fault-tolerant computing systems. - Delivers the first book on fault tolerance design with a systems approach - Offers comprehensive coverage of both hardware and software fault tolerance, as well as information and time redundancy - Features fully updated content plus new chapters on failure mechanisms and fault-tolerance in cyber-physical systems - Provides a complete ancillary package, including an on-line solutions manual for instructors and PowerPoint slides


Fault Tolerance in Distributed Systems

Fault Tolerance in Distributed Systems

Author: Pankaj Jalote

Publisher: Prentice Hall

Published: 1994

Total Pages: 456

ISBN-13:

DOWNLOAD EBOOK

Fault tolerance is an approach by which reliability of a computer system can be increased beyond what can be achieved by traditional methods. Comprehensive and self-contained, this book explores the information available on software supported fault tolerance techniques, with a focus on fault tolerance in distributed systems.


Software-Implemented Hardware Fault Tolerance

Software-Implemented Hardware Fault Tolerance

Author: Olga Goloubeva

Publisher: Springer Science & Business Media

Published: 2006-09-19

Total Pages: 238

ISBN-13: 0387329374

DOWNLOAD EBOOK

This book presents the theory behind software-implemented hardware fault tolerance, as well as the practical aspects needed to put it to work on real examples. By evaluating accurately the advantages and disadvantages of the already available approaches, the book provides a guide to developers willing to adopt software-implemented hardware fault tolerance in their applications. Moreover, the book identifies open issues for researchers willing to improve the already available techniques.


Software Engineering of Fault Tolerant Systems

Software Engineering of Fault Tolerant Systems

Author: Patrizio Pelliccione

Publisher: World Scientific

Published: 2007

Total Pages: 293

ISBN-13: 9812705031

DOWNLOAD EBOOK

When architecting dependable systems, fault tolerance is required to improve the overall system robustness. Many studies have been proposed, but the solutions are usually commissioned late during the design and implementation phases of the software life-cycle (e.g., Java and Windows NT exception handling), thus reducing the error recovery effectiveness. Since the system design typically models only normal behaviors of the system while ignoring exceptional ones, the generated system implementation is unable to handle abnormal events. Consequently, the system may fail in unexpected ways due to some faults. Researchers have advocated that fault tolerance management during the entire life-cycle improves the overall system robustness and that different classes of exceptions must be identified for each identified phase of software development, depending on the abstraction level of the software system being modeled. This book builds on this trend and investigates how fault tolerance mechanisms can be used when engineering a software system. New problems will arise, new models are needed at different abstraction levels, methodologies for mode driven engineering of such systems must be defined, new technologies are required, and new validation and verification environments are necessary.


Design And Analysis Of Reliable And Fault-tolerant Computer Systems

Design And Analysis Of Reliable And Fault-tolerant Computer Systems

Author: Mostafa I Abd-el-barr

Publisher: World Scientific

Published: 2006-12-15

Total Pages: 463

ISBN-13: 190897978X

DOWNLOAD EBOOK

Covering both the theoretical and practical aspects of fault-tolerant mobile systems, and fault tolerance and analysis, this book tackles the current issues of reliability-based optimization of computer networks, fault-tolerant mobile systems, and fault tolerance and reliability of high speed and hierarchical networks.The book is divided into six parts to facilitate coverage of the material by course instructors and computer systems professionals. The sequence of chapters in each part ensures the gradual coverage of issues from the basics to the most recent developments. A useful set of references, including electronic sources, is listed at the end of each chapter./a


Fault-Tolerant Message-Passing Distributed Systems

Fault-Tolerant Message-Passing Distributed Systems

Author: Michel Raynal

Publisher: Springer

Published: 2018-09-08

Total Pages: 468

ISBN-13: 3319941410

DOWNLOAD EBOOK

This book presents the most important fault-tolerant distributed programming abstractions and their associated distributed algorithms, in particular in terms of reliable communication and agreement, which lie at the heart of nearly all distributed applications. These programming abstractions, distributed objects or services, allow software designers and programmers to cope with asynchrony and the most important types of failures such as process crashes, message losses, and malicious behaviors of computing entities, widely known under the term "Byzantine fault-tolerance". The author introduces these notions in an incremental manner, starting from a clear specification, followed by algorithms which are first described intuitively and then proved correct. The book also presents impossibility results in classic distributed computing models, along with strategies, mainly failure detectors and randomization, that allow us to enrich these models. In this sense, the book constitutes an introduction to the science of distributed computing, with applications in all domains of distributed systems, such as cloud computing and blockchains. Each chapter comes with exercises and bibliographic notes to help the reader approach, understand, and master the fascinating field of fault-tolerant distributed computing.