Architecting a Modern Data Warehouse for Large Enterprises: Build Multi-Cloud Modern Distributed Data Warehouses with Azure and AWS

Kumar, Anjani, Mishra, Abhishek, Kumar, Sanjeev

  • Publisher: Apress
  • Publication date: 2023-12-28
  • List price: $2,020
  • VIP price: $1,919 (5% off)
  • Language: English
  • Pages: 266
  • Binding: Quality Paper - also called trade paper
  • ISBN: 9798868800283
  • ISBN-13: 9798868800283
  • Related categories: Amazon Web Services, Microsoft Azure
  • Overseas special-order title (requires separate checkout)

Product Description

Design and architect new-generation, cloud-based data warehouses using Azure and AWS. This book provides an in-depth understanding of how to build modern cloud-native data warehouses, as well as their history and evolution.

The book starts by covering foundational data warehouse concepts and introduces modern features such as distributed processing, big data storage, data streaming, and processing data on the cloud. You will gain an understanding of the synergy, relevance, and usage of standard data warehousing practices in the modern world of distributed data processing. The authors walk you through the essential concepts of Data Mesh, Data Lake, Lakehouse, and Delta Lake. They also demonstrate the services and offerings available on Azure and AWS that deal with data orchestration, data democratization, data governance, data security, and business intelligence.

After completing this book, you will be ready to design and architect enterprise-grade, cloud-based modern data warehouses using industry best practices and guidelines.

What You Will Learn

  • Understand the core concepts underlying modern data warehouses
  • Design and build cloud-native data warehouses
  • Gain a practical approach to architecting and building data warehouses on Azure and AWS
  • Implement modern data warehousing components such as Data Mesh, Data Lake, Delta Lake, and Lakehouse
  • Process data through pandas and evaluate your model's performance using metrics such as F1-score, precision, and recall
  • Apply deep learning to supervised, semi-supervised, and unsupervised anomaly detection tasks for tabular datasets and time series applications

Who This Book Is For

Experienced developers, cloud architects, and technology enthusiasts looking to build cloud-based modern data warehouses using Azure and AWS

About the Authors

Anjani Kumar is the Managing Director and Founder of MultiCloud4u, a rapidly growing startup that helps clients and partners seamlessly implement data-driven solutions for their digital businesses. With a background in computer science, Anjani began his career researching and developing multilingual systems that were powered by distributed processing and data synchronization across remote regions of India. He later collaborated with companies such as Mahindra Satyam, Microsoft, RBS, and Sapient to create data warehouses and other data-based systems that could handle high-volume data processing and transformation.

Abhishek Mishra is a Cloud Architect at a leading organization and has more than a decade and a half of experience building and architecting software solutions for large and complex enterprises across the globe. He has deep expertise in enabling digital transformations for his customers using the cloud and artificial intelligence.

Sanjeev Kumar heads a global data and analytics practice at the leading and oldest multinational shoe company, headquartered in Switzerland. He has 19+ years of experience modeling modern data solutions for organizations in multiple industries. He has consulted with some of the top multinational firms and enabled digital transformation for large enterprises using modern data warehouses in the cloud. He is an expert in multiple fields of modern data management and execution, including data strategy, automation, data governance, architecture, metadata, modeling, business intelligence, data management, and analytics.
