Mastering Prometheus: Gain expert tips to monitoring your infrastructure, applications, and services

Hegedus, William

  • 出版商: Packt Publishing
  • 出版日期: 2024-04-19
  • 售價: $1,890
  • 貴賓價: 9.5$1,796
  • 語言: 英文
  • 頁數: 310
  • 裝訂: Quality Paper - also called trade paper
  • ISBN: 1805125664
  • ISBN-13: 9781805125662
  • 立即出貨 (庫存=1)

相關主題

商品描述

Learn how to effectively implement, manage, and optimize Prometheus for monitoring your systems

Key Features
  • Achieve high availability with Prometheus by using Thanos
  • Integrate Prometheus into your broader observability stack with OpenTelemetry
  • Tweak, tune, and debug Prometheus to reliably scale without limits
  • Purchase of the print or Kindle book includes a free PDF eBook
Book Description

With an increased focus on observability and reliability, establishing a scalable and reliable monitoring environment is more important than ever. Over the last decade, Prometheus has emerged as the leading open-source, time-series based monitoring software catering to this demand. This book is your guide to scaling, operating, and extending Prometheus from small on-premises workloads to multi-cloud globally distributed workloads and everything in between.

Starting with an introduction to Prometheus and its role in observability, the book provides a walkthrough of its deployment. You'll explore Prometheus's query language and TSDB data model, followed by dynamic service discovery for monitoring targets and refining alerting through custom templates and formatting. The book then demonstrates horizontal scaling of Prometheus via sharding and federation, while equipping you with debugging techniques and strategies to fine-tune data ingestion. Advancing through the chapters, you'll manage Prometheus at scale through CI validations and templating with Jsonnet, and integrate Prometheus with other projects such as OpenTelemetry, Thanos, VictoriaMetrics, and Mimir.

By the end of this book, you'll have practical knowledge of Prometheus and its ecosystem, which will help you discern when, why, and how to scale it to meet your ever-growing needs.

What you will learn
  • Deploy Prometheus and Node Exporter to public clouds and Kubernetes
  • Gain in-depth knowledge of how Prometheus's underlying code works
  • Build your own custom service-discovery providers for Prometheus
  • Debug Prometheus performance issues to identify cardinality issues in your environment
  • Use VictoriaMetrics and/or Grafana Mimir for remote storage of Prometheus data
  • Define and implement SLO-based alerting
Who this book is for

The book is for site reliability engineers (SREs), developers, and platform engineers involved in the monitoring and observability of their team or company's systems. A background in Prometheus is assumed, so the book dedicates minimal time to the basics of getting Prometheus up and running. Whether you aim to expand monitoring capabilities, streamline configuration management, or enhance integration with existing tools, this book will help you maximize the potential of your Prometheus monitoring stack.

Table of Contents
  1. Observability, Monitoring, and Prometheus
  2. Deploying Prometheus
  3. The Prometheus Data Model and PromQL
  4. Using Service Discovery
  5. Effective Alerting with Prometheus
  6. Advancing Prometheus: Sharding, Federation, and HA
  7. Optimizing and Debugging Prometheus
  8. Enabling Systems Monitoring with the Node Exporter
  9. Utilizing Remote Storage Systems with Prometheus
  10. Extending Prometheus Globally with Thanos
  11. Jsonnet and Monitoring Mixins
  12. Utilizing Continuous Integration (CI) Pipelines with Prometheus
  13. Defining and Alerting on SLOs
  14. Integrating OpenTelemetry with Prometheus
  15. Beyond Prometheus

商品描述(中文翻譯)

學習如何有效地實施、管理和優化Prometheus,以監控您的系統

主要特點
- 通過使用Thanos實現Prometheus的高可用性
- 使用OpenTelemetry將Prometheus整合到更廣泛的可觀察性堆棧中
- 調整、優化和調試Prometheus,以可靠地無限擴展
- 購買印刷版或Kindle電子書,即可獲得免費的PDF電子書

書籍描述
隨著對可觀察性和可靠性的關注增加,建立可擴展和可靠的監控環境比以往任何時候都更加重要。在過去的十年中,Prometheus已成為領先的開源、基於時間序列的監控軟件,滿足了這一需求。本書將指導您如何從小型本地工作負載擴展到多雲全球分佈工作負載等各種情況下,擴展、操作和擴展Prometheus。

從介紹Prometheus及其在可觀察性中的角色開始,本書提供了部署Prometheus的詳細步驟。您將探索Prometheus的查詢語言和TSDB數據模型,然後通過自定義模板和格式調整警報,進一步完善監控目標的動態服務發現。本書還演示了通過分片和聯邦擴展Prometheus,同時提供調試技術和策略來微調數據輸入。通過逐章深入,您將通過CI驗證和使用Jsonnet進行模板化來管理大規模的Prometheus,並將Prometheus與其他項目(如OpenTelemetry、Thanos、VictoriaMetrics和Mimir)集成。

通過閱讀本書,您將獲得有關Prometheus及其生態系統的實用知識,這將幫助您判斷何時、為什麼以及如何擴展Prometheus以滿足不斷增長的需求。

您將學到什麼
- 在公共雲和Kubernetes上部署Prometheus和Node Exporter
- 深入了解Prometheus底層代碼的工作原理
- 為Prometheus構建自己的自定義服務發現提供程序
- 調試Prometheus性能問題,以識別環境中的基數問題
- 使用VictoriaMetrics和/或Grafana Mimir遠程存儲Prometheus數據
- 定義和實施基於SLO的警報

本書適合對其團隊或公司系統的監控和可觀察性負責的站點可靠性工程師(SRE)、開發人員和平台工程師。假設您具有Prometheus的背景知識,因此本書只花費很少的時間介紹如何啟動和運行Prometheus。無論您的目標是擴展監控能力、簡化配置管理還是增強與現有工具的集成,本書都將幫助您最大限度地發揮Prometheus監控堆棧的潛力。

目錄
1. 可觀察性、監控和Prometheus
2. 部署Prometheus
3. Prometheus數據模型和PromQL
4. 使用服務發現
5. 有效的Prometheus警報
6. 進階Prometheus:分片、聯邦和高可用性
7. 優化和調試Prometheus
8. 使用Node Exporter實現系統監控
9. 使用遠程存儲系統的Prometheus
10. 通過Thanos在全球範圍內擴展Prometheus
11. Jsonnet和監控Mixin
12. 使用Prometheus的持續集成(CI)流水線
13. 定義和警報SLO
14. 將OpenTelemetry與Prometheus集成
15. 超越Prometheus