The Art of Site Reliability Engineering (SRE) with Azure: Building and Deploying Applications That Endure

Beloki, Unai Huete

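  • Publisher: Apress
  • Publication date: 2022-09-20
  • Price: $1,500
  • VIP price: $1,425 (95% of list price)
  • Language: English
  • Pages: 280
  • Binding: Quality Paper - also called trade paper
  • ISBN: 1484287037
  • ISBN-13: 9781484287033
  • Related categories: DevOps, Microsoft Azure
  • Overseas special-order title (requires separate checkout)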

Product Description

Gain a foundational understanding of SRE and learn its basic concepts and architectural best practices for deploying Azure IaaS, PaaS, and microservices-based resilient architectures.

The book starts with the basic concepts of SRE operations and developer needs, followed by the definitions and acronyms around Service Level Agreements in real-world scenarios. Moving forward, you will learn how to build resilient IaaS solutions, PaaS solutions, and microservices architectures in Azure. You will go through Azure reference architectures for highly available storage, networking, and virtual machine computing, with Availability Sets, Availability Zones, and Scale Sets as the main scenarios. You will explore similar reference architectures for Platform Services such as App Services with Web Apps, and work with data solutions like Azure SQL and Azure Cosmos DB.

Next, you will learn automation to enable SRE with Azure DevOps Pipelines and GitHub Actions. You'll also gain an understanding of how an open culture around post-mortems helps in optimizing SRE and the overall company culture around managing and running IT systems and application workloads. You'll be introduced to incident management and monitoring practices using Azure Monitor, Log Analytics, and Grafana, which form the foundation for monitoring Azure and hybrid workloads.

As an extra, the book covers two new testing solutions: Azure Chaos Studio and Azure Load Testing. These solutions will make it easier to test the resilience of your services.

After reading this book, you will understand the underlying concepts of SRE and how to implement them on the Azure public cloud.


What You Will Learn:

  • Learn SRE definitions and metrics like SLI/SLO/SLA, Error Budget, toil, MTTR, MTTF, and MTBF
  • Understand Azure Well-Architected Framework (WAF) and Disaster Recovery scenarios on Azure
  • Understand resiliency and how to design resilient solutions in Azure for different architecture types and services
  • Master core DevOps concepts and the difference between SRE and tools like Azure DevOps and GitHub
  • Utilize Azure observability tools like Azure Monitor, Application Insights, KQL or Grafana
  • Understand Incident Response and Blameless Post-Mortems and how to improve collaboration using ChatOps practices with Microsoft tools
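
As a rough illustration of the metrics named in the first item above, the following Python sketch computes an error budget and an availability figure. It is not taken from the book: the function names, the 30-day window, and the convention MTBF = MTTF + MTTR (one common definition for repairable systems) are all assumptions made for this example.

```python
def error_budget_minutes(slo: float, window_days: int = 30) -> float:
    """Downtime (in minutes) permitted by an availability SLO over a window.

    `slo` is a fraction such as 0.999; the 30-day default is an assumption.
    """
    if not 0.0 < slo <= 1.0:
        raise ValueError("slo must be a fraction in (0, 1]")
    total_minutes = window_days * 24 * 60
    return (1.0 - slo) * total_minutes


def mtbf_hours(mttf_hours: float, mttr_hours: float) -> float:
    """Mean time between failures, assuming MTBF = MTTF + MTTR."""
    return mttf_hours + mttr_hours


def availability(mttf_hours: float, mttr_hours: float) -> float:
    """Steady-state availability as uptime divided by total time."""
    return mttf_hours / (mttf_hours + mttr_hours)


if __name__ == "__main__":
    # Hypothetical inputs chosen only to exercise the functions.
    print(error_budget_minutes(0.999))
    print(mtbf_hours(999.0, 1.0))
    print(availability(999.0, 1.0))
```

Under these assumptions, a 99.9% SLO over 30 days leaves a budget of roughly 43 minutes of downtime; other windows or metric conventions would give different figures.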


Who This Book Is For:
IT operations administrators, engineers, security team members, as well as developers or DevOps engineers.
