Data Engineering with dbt: A practical guide to building a cloud-based, pragmatic, and dependable data platform with SQL

Zagni, Roberto

  • Publisher: Packt Publishing
  • Publication date: 2023-06-30
  • List price: $2,270
  • VIP price: 5% off, $2,157
  • Language: English
  • Pages: 578
  • 裝訂: Quality Paper - also called trade paper
  • ISBN: 1803246286
  • ISBN-13: 9781803246284
  • Related categories: SQL
  • Overseas special-order book (requires separate checkout)


Use easy-to-apply patterns in SQL and Python to adopt modern analytics engineering and build agile platforms with dbt that are well-tested and simple to extend and run. Purchase of the print or Kindle book includes a free PDF eBook.

Key Features:

  • Build a solid dbt base and learn data modeling and the modern data stack to become an analytics engineer
  • Build automated and reliable pipelines to deploy, test, run, and monitor ELTs with dbt Cloud
  • Guided dbt + Snowflake project to build a pattern-based architecture that delivers reliable datasets

Book Description:

dbt Cloud helps professional analytics engineers automate the application of powerful and proven patterns to transform data from ingestion to delivery, enabling real DataOps.

This book begins by introducing you to dbt and its role in the data stack, along with how it uses simple SQL to build your data platform, helping you and your team work better together. You'll find out how to leverage data modeling, data quality, master data management, and more to build a simple-to-understand and future-proof solution. As you advance, you'll explore the modern data stack, understand how data-related careers are changing, and see how dbt enables this transition into the emerging role of an analytics engineer. The chapters help you build a sample project using the free version of dbt Cloud, Snowflake, and GitHub to create a professional DevOps setup with continuous integration, automated deployment, ELT run, scheduling, and monitoring, solving practical cases you encounter in your daily work.

By the end of this dbt book, you'll be able to build an end-to-end pragmatic data platform by ingesting data exported from your source systems, coding the needed transformations, including master data and the desired business rules, and building well-formed dimensional models or wide tables that'll enable you to build reports with the BI tool of your choice.

What You Will Learn:

  • Create a dbt Cloud account and understand the ELT workflow
  • Combine Snowflake and dbt for building modern data engineering pipelines
  • Use SQL to transform raw data into usable data, and test its accuracy
  • Write dbt macros and use Jinja to apply software engineering principles
  • Test data and transformations to ensure reliability and data quality
  • Build a lightweight pragmatic data platform using proven patterns
  • Write easy-to-maintain idempotent code using dbt materialization

Who this book is for:

This book is for data engineers, analytics engineers, BI professionals, and data analysts who want to learn how to build simple, future-proof, and maintainable data platforms in an agile way. Project managers, data team managers, and decision makers looking to understand the importance of building a data platform and foster a culture of high-performing data teams will also find this book useful. Basic knowledge of SQL and data modeling will help you get the most out of the many layers of this book. The book also includes primers on many data-related subjects to help juniors get started.

