Business Intelligence with Databricks SQL: Concepts, tools, and techniques for scaling business intelligence on the data lakehouse

Gupta, Vihag

  • 出版商: Packt Publishing
  • 出版日期: 2022-09-16
  • 售價: $1,810
  • 貴賓價: 9.5$1,720
  • 語言: 英文
  • 頁數: 348
  • 裝訂: Quality Paper - also called trade paper
  • ISBN: 1803235330
  • ISBN-13: 9781803235332
  • 相關分類: SQL
  • 下單後立即進貨 (約3~4週)

商品描述

Master critical skills needed to deploy and use Databricks SQL and elevate your BI from the warehouse to the lakehouse with confidence


Key Features:

  • Learn about business intelligence on the lakehouse with features and functions of Databricks SQL
  • Make the most of Databricks SQL by getting to grips with the enablers of its data warehousing capabilities
  • A unique approach to teaching concepts and techniques with follow-along scenarios on real datasets


Book Description:

In this new era of data platform system design, data lakes and data warehouses are giving way to the lakehouse - a new type of data platform system that aims to unify all data analytics into a single platform. Databricks, with its Databricks SQL product suite, is the hottest lakehouse platform out there, harnessing the power of Apache Spark(TM), Delta Lake, and other innovations to enable data warehousing capabilities on the lakehouse with data lake economics.

This book is a comprehensive hands-on guide that helps you explore all the advanced features, use cases, and technology components of Databricks SQL. You'll start with the lakehouse architecture fundamentals and understand how Databricks SQL fits into it. The book then shows you how to use the platform, from exploring data, executing queries, building reports, and using dashboards through to learning the administrative aspects of the lakehouse - data security, governance, and management of the computational power of the lakehouse. You'll also delve into the core technology enablers of Databricks SQL - Delta Lake and Photon. Finally, you'll get hands-on with advanced SQL commands for ingesting data and maintaining the lakehouse.

By the end of this book, you'll have mastered Databricks SQL and be able to deploy and deliver fast, scalable business intelligence on the lakehouse.


What You Will Learn:

  • Understand how Databricks SQL fits into the Databricks Lakehouse Platform
  • Perform everyday analytics with Databricks SQL Workbench and business intelligence tools
  • Organize and catalog your data assets
  • Program the data security model to protect and govern your data
  • Tune SQL warehouses (computing clusters) for optimal query experience
  • Tune the Delta Lake storage format for maximum query performance
  • Deliver extreme performance with the Photon query execution engine
  • Implement advanced data ingestion patterns with Databricks SQL


Who this book is for:

This book is for business intelligence practitioners, data warehouse administrators, and data engineers who are new to Databrick SQL and want to learn how to deliver high-quality insights unhindered by the scale of data or infrastructure. This book is also for anyone looking to study the advanced technologies that power Databricks SQL. Basic knowledge of data warehouses, SQL-based analytics, and ETL processes is recommended to effectively learn the concepts introduced in this book and appreciate the innovation behind the platform.

商品描述(中文翻譯)

掌握部署和使用Databricks SQL所需的關鍵技能,並自信地將您的商業智能從數據倉庫提升到湖屋。

主要特點:
- 了解在湖屋上的商業智能,以及Databricks SQL的功能和功能。
- 通過瞭解其數據倉庫功能的實現方式,充分利用Databricks SQL。
- 通過在真實數據集上的實際場景進行跟隨,以獨特的教學方法教授概念和技術。

書籍描述:
在這個新的數據平台系統設計時代,數據湖和數據倉庫正在讓位給湖屋-一種旨在將所有數據分析統一到一個平台的新型數據平台系統。 Databricks憑藉其Databricks SQL產品套件成為最熱門的湖屋平台,利用Apache Spark(TM)、Delta Lake和其他創新技術的威力,在湖屋上實現數據倉庫功能並實現數據湖經濟。

本書是一本全面的實踐指南,幫助您探索Databricks SQL的所有高級功能、用例和技術組件。您將從湖屋架構基礎知識入手,了解Databricks SQL的適用性。然後,本書將向您展示如何使用該平台,從探索數據、執行查詢、構建報告和使用儀表板,到學習湖屋的管理方面-數據安全、治理和計算能力管理。您還將深入研究Databricks SQL的核心技術支持-Delta Lake和Photon。最後,您將親自體驗使用高級SQL命令進行數據輸入和維護湖屋。

通過閱讀本書,您將掌握Databricks SQL,能夠在湖屋上部署並提供快速、可擴展的商業智能。

學到什麼:
- 瞭解Databricks SQL如何融入Databricks湖屋平台。
- 使用Databricks SQL Workbench和商業智能工具進行日常分析。
- 組織和分類您的數據資產。
- 編程數據安全模型以保護和管理數據。
- 調整SQL倉庫(計算集群)以獲得最佳查詢體驗。
- 調整Delta Lake存儲格式以獲得最大的查詢性能。
- 使用Photon查詢執行引擎實現極致性能。
- 使用Databricks SQL實現高級數據輸入模式。

本書適合商業智能從業人員、數據倉庫管理員和數據工程師,他們對Databricks SQL尚不熟悉,並希望學習如何在不受數據或基礎設施規模限制的情況下提供高質量的洞察力。本書還適合任何希望研究驅動Databricks SQL的先進技術的人。建議您具備數據倉庫、基於SQL的分析和ETL流程的基本知識,以有效學習本書介紹的概念並欣賞該平台背後的創新。