The Art of Site Reliability Engineering (SRE) with Azure: Building and Deploying Applications That Endure
Beloki, Unai Huete
- Publisher: Apress
- Publication date: 2022-09-20
- Price: $1,510
- VIP price: 5% off, $1,435
- Language: English
- Pages: 280
- Binding: Quality Paper - also called trade paper
- ISBN: 1484287037
- ISBN-13: 9781484287033
Related categories:
DevOps, Microsoft Azure
Overseas special-order book (requires separate checkout)
Product description
Gain a foundational understanding of SRE and learn its basic concepts and architectural best practices for deploying Azure IaaS, PaaS, and microservices-based resilient architectures.
The book starts with the base concepts of SRE operations and developer needs, followed by definitions and acronyms of Service Level Agreements in real-world scenarios. Moving forward, you will learn how to build resilient IaaS solutions, PaaS solutions, and microservices architectures in Azure. You will go through Azure reference architectures for highly available storage, networking, and virtual machine computing, with Availability Sets, Availability Zones, and Scale Sets as the main scenarios. You will explore similar reference architectures for Platform Services such as App Services with Web Apps, and work with data solutions like Azure SQL and Azure Cosmos DB.
Next, you will learn automation to enable SRE with Azure DevOps Pipelines and GitHub Actions. You'll also gain an understanding of how an open culture around post-mortems dramatically helps in optimizing SRE and the overall company culture around managing and running IT systems and application workloads. You'll be exposed to incident management and monitoring practices using Azure Monitor, Log Analytics, and Grafana, which form the foundation for monitoring workloads running in Azure and in hybrid environments.
As an extra, the book covers two new testing solutions: Azure Chaos Studio and Azure Load Testing. These solutions will make it easier to test the resilience of your services.
After reading this book, you will understand the underlying concepts of SRE and how to implement them using the Azure public cloud.
What You Will Learn:
- Learn SRE definitions and metrics like SLI/SLO/SLA, Error Budget, toil, MTTR, MTTF, and MTBF
- Understand the Azure Well-Architected Framework (WAF) and Disaster Recovery scenarios on Azure
- Understand resiliency and how to design resilient solutions in Azure for different architecture types and services
- Master core DevOps concepts and the difference between SRE and tools like Azure DevOps and GitHub
- Use Azure observability tools such as Azure Monitor, Application Insights, KQL, and Grafana
- Understand Incident Response and Blameless Post-Mortems and how to improve collaboration using ChatOps practices with Microsoft tools
Who Is This Book For:
IT operations administrators, engineers, security team members, as well as developers or DevOps engineers.