Fundamentals of Data Engineering: Plan and Build Robust Data Systems (Paperback)

Reis, Joe; Housley, Matt

  • Publisher: O'Reilly
  • Publication date: 2022-07-26
  • List price: $2,800
  • Sale price: $2,240 (80% of list price; limited-time offer until 2024-04-28)
  • Language: English
  • Pages: 446
  • Binding: Quality paper (also called trade paper)
  • ISBN: 1098108302
  • ISBN-13: 9781098108304
  • Related categories: Big Data, Data Science
  • Ships immediately

Product description

Data engineering has grown rapidly in the past decade, leaving many software engineers, data scientists, and analysts looking for a comprehensive view of this practice. With this practical book, you'll learn how to plan and build systems to serve the needs of your organization and customers by evaluating the best technologies available through the framework of the data engineering lifecycle.

Authors Joe Reis and Matt Housley walk you through the data engineering lifecycle and show you how to stitch together a variety of cloud technologies to serve the needs of downstream data consumers. You'll understand how to apply the concepts of data generation, ingestion, orchestration, transformation, storage, and governance that are critical in any data environment regardless of the underlying technology.

This book will help you:

  • Get a concise overview of the entire data engineering landscape
  • Assess data engineering problems using an end-to-end framework of best practices
  • Cut through marketing hype when choosing data technologies, architecture, and processes
  • Use the data engineering lifecycle to design and build a robust architecture
  • Incorporate data governance and security across the data engineering lifecycle
