System Software Reliability

Author: Hoang Pham

Publisher: Springer Science & Business Media

Published: 2007-04-21

Total Pages: 442

ISBN-10: 1846282950

Computer software reliability has never been more important. Computers are used in areas as diverse as air traffic control, nuclear reactors, real-time military systems, industrial process control, security system control, biometric scanning systems, automotive, mechanical and safety control, and hospital patient monitoring systems. Many of these applications require critical functionality even as software grows in size and complexity. This book is an introduction to software reliability engineering and a survey of the state-of-the-art techniques, methodologies and tools used to assess the reliability of software and combined software-hardware systems. Current research results are reported and future directions are signposted. This text will interest: graduate students as a course textbook introducing software reliability engineering; reliability engineers as a broad, up-to-date survey of the field; and researchers and lecturers in universities and research institutions as a one-volume reference.


Database Reliability Engineering

Author: Laine Campbell

Publisher: O'Reilly Media, Inc.

Published: 2017-10-26

Total Pages: 309

ISBN-10: 149192621X

The infrastructure-as-code revolution in IT is also affecting database administration. With this practical book, developers, system administrators, and junior to mid-level DBAs will learn how the modern practice of site reliability engineering applies to the craft of database architecture and operations. Authors Laine Campbell and Charity Majors provide a framework for professionals looking to join the ranks of today’s database reliability engineers (DBRE). You’ll begin by exploring core operational concepts that DBREs need to master. Then you’ll examine a wide range of database persistence options, including how to implement key technologies to provide resilient, scalable, and performant data storage and retrieval. With a firm foundation in database reliability engineering, you’ll be ready to dive into the architecture and operations of any modern database.

This book covers:

* Service-level requirements and risk management
* Building and evolving an architecture for operational visibility
* Infrastructure engineering and infrastructure management
* How to facilitate the release management process
* Data storage, indexing, and replication
* Identifying datastore characteristics and best use cases
* Datastore architectural components and data-driven architectures


Site Reliability Engineering

Author: Niall Richard Murphy

Publisher: O'Reilly Media, Inc.

Published: 2016-03-23

Total Pages: 552

ISBN-10: 1491951176

The overwhelming majority of a software system’s lifespan is spent in use, not in design or implementation. So, why does conventional wisdom insist that software engineers focus primarily on the design and development of large-scale computing systems? In this collection of essays and articles, key members of Google’s Site Reliability Team explain how and why their commitment to the entire lifecycle has enabled the company to successfully build, deploy, monitor, and maintain some of the largest software systems in the world. You’ll learn the principles and practices that enable Google engineers to make systems more scalable, reliable, and efficient, with lessons directly applicable to your organization.

This book is divided into four sections:

* Introduction: Learn what site reliability engineering is and why it differs from conventional IT industry practices
* Principles: Examine the patterns, behaviors, and areas of concern that influence the work of a site reliability engineer (SRE)
* Practices: Understand the theory and practice of an SRE’s day-to-day work: building and operating large distributed computing systems
* Management: Explore Google's best practices for training, communication, and meetings that your organization can use


The DevOps 2.0 Toolkit

Author: Viktor Farcic

Publisher:

Published: 2016-08-31

Total Pages: 462

ISBN-13: 9781785289194

Automating the Continuous Deployment Pipeline with Containerized Microservices

About This Book

* First principles of DevOps, Ansible, Docker, Kubernetes, microservices
* Architect your software in a better and more efficient way with microservices packed as immutable containers
* Practical guide describing an extremely modern and advanced DevOps toolchain that can be improved continuously

Who This Book Is For

If you are an intermediate-level developer who wants to master the whole microservices development and deployment lifecycle using some of the latest and greatest practices and tools, this is the book for you. Familiarity with the basics of DevOps and continuous deployment will be useful.

What You Will Learn

* Get to grips with the fundamentals of DevOps
* Architect software more efficiently with the help of microservices
* Use Docker, Kubernetes, Ansible, Ubuntu, Docker Swarm and more
* Implement fast, reliable and continuous deployments with zero downtime and the ability to roll back
* Learn about centralized logging and monitoring of your cluster
* Design self-healing systems capable of recovery from both hardware and software failures

In Detail

Building a complete modern DevOps toolchain requires not only the whole microservices development and a complete deployment lifecycle, but also the latest and greatest practices and tools. Viktor Farcic argues from first principles how to build a DevOps toolchain. This book shows you how to chain together Docker, Kubernetes, Ansible, Ubuntu, and other tools to build the complete DevOps toolkit.

Style and Approach

This book follows a unique, hands-on approach, familiarizing you with the DevOps 2.0 toolkit in a very practical manner. Although there will be a lot of theory, you won't be able to complete this book by reading it on the metro on your way to work. You'll need to be in front of your computer and get your hands dirty.


Practical Site Reliability Engineering

Author: Pethuru Raj Chelliah

Publisher: Packt Publishing Ltd

Published: 2018-11-30

Total Pages: 379

ISBN-10: 1788838696

Create, deploy, and manage applications at scale using SRE principles

Key Features

* Build and run highly available, scalable, and secure software
* Explore abstract SRE in a simplified and streamlined way
* Enhance the reliability of cloud environments through SRE enhancements

Book Description

Site reliability engineering (SRE) is being touted as the most competent paradigm in establishing and ensuring next-generation high-quality software solutions. This book starts by introducing you to the SRE paradigm and covers the need for highly reliable IT platforms and infrastructures. As you make your way through the next set of chapters, you will learn to develop microservices using Spring Boot and make use of RESTful frameworks. You will also learn about GitHub for deployment, containerization, and Docker containers. Practical Site Reliability Engineering teaches you to set up and sustain containerized cloud environments, and also covers architectural and design patterns and reliability implementation techniques such as reactive programming, and languages such as Ballerina and Rust. In the concluding chapters, you will become well versed in service mesh solutions such as Istio and Linkerd, and understand service resilience test practices, API gateways, and edge/fog computing. By the end of this book, you will have gained experience in working with SRE concepts and be able to deliver highly reliable apps and services.

What you will learn

* Understand how to achieve your SRE goals
* Grasp Docker-enabled containerization concepts
* Leverage enterprise DevOps capabilities and Microservices architecture (MSA)
* Get to grips with the service mesh concept and frameworks such as Istio and Linkerd
* Discover best practices for performance and resiliency
* Follow software reliability prediction approaches and enable patterns
* Understand Kubernetes for container and cloud orchestration
* Explore the end-to-end software engineering process for the containerized world

Who this book is for

Practical Site Reliability Engineering helps software developers, IT professionals, DevOps engineers, performance specialists, and system engineers understand how the emerging domain of SRE comes in handy in automating and accelerating the process of designing, developing, debugging, and deploying highly reliable applications and services.