Snowflake Cookbook: Techniques for building modern cloud data warehousing solutions
暫譯: Snowflake 食譜:構建現代雲端數據倉儲解決方案的技術

Qureshi, Hamid Mahmood, Sharif, Hammad

  • 出版商: Packt Publishing
  • 出版日期: 2021-02-26
  • 售價: $2,030
  • 貴賓價: 9.5$1,929
  • 語言: 英文
  • 頁數: 330
  • 裝訂: Quality Paper - also called trade paper
  • ISBN: 1800560613
  • ISBN-13: 9781800560611
  • 海外代購書籍(需單獨結帳)

相關主題

商品描述

Develop modern solutions with Snowflake's unique architecture and integration capabilities; process bulk and real-time data into a data lake; and leverage time travel, cloning, and data-sharing features to optimize data operations

Key Features

  • Build and scale modern data solutions using the all-in-one Snowflake platform
  • Perform advanced cloud analytics for implementing big data and data science solutions
  • Make quicker and better-informed business decisions by uncovering key insights from your data

Book Description

Snowflake is a unique cloud-based data warehousing platform built from scratch to perform data management on the cloud. This book introduces you to Snowflake's unique architecture, which places it at the forefront of cloud data warehouses.

You'll explore the compute model available with Snowflake, and find out how Snowflake allows extensive scaling through the virtual warehouses. You will then learn how to configure a virtual warehouse for optimizing cost and performance. Moving on, you'll get to grips with the data ecosystem and discover how Snowflake integrates with other technologies for staging and loading data.

As you progress through the chapters, you will leverage Snowflake's capabilities to process a series of SQL statements using tasks to build data pipelines and find out how you can create modern data solutions and pipelines designed to provide high performance and scalability. You will also get to grips with creating role hierarchies, adding custom roles, and setting default roles for users before covering advanced topics such as data sharing, cloning, and performance optimization.

By the end of this Snowflake book, you will be well-versed in Snowflake's architecture for building modern analytical solutions and understand best practices for solving commonly faced problems using practical recipes.

What you will learn

  • Get to grips with data warehousing techniques aligned with Snowflake's cloud architecture
  • Broaden your skills as a data warehouse designer to cover the Snowflake ecosystem
  • Transfer skills from on-premise data warehousing to the Snowflake cloud analytics platform
  • Optimize performance and costs associated with a Snowflake solution
  • Stage data on object stores and load it into Snowflake
  • Secure data and share it efficiently for access
  • Manage transactions and extend Snowflake using stored procedures
  • Extend cloud data applications using Spark Connector

Who this book is for

This book is for data warehouse developers, data analysts, database administrators, and anyone involved in designing, implementing, and optimizing a Snowflake data warehouse. Knowledge of data warehousing and database and cloud concepts will be useful. Basic familiarity with Snowflake is beneficial, but not necessary.

商品描述(中文翻譯)

**使用 Snowflake 獨特的架構和整合能力開發現代解決方案;將批量和即時數據處理成數據湖;利用時間旅行、克隆和數據共享功能來優化數據操作**

#### 主要特點

- 使用全方位的 Snowflake 平台構建和擴展現代數據解決方案
- 執行先進的雲端分析以實施大數據和數據科學解決方案
- 通過從數據中發掘關鍵見解,做出更快且更明智的商業決策

#### 書籍描述

Snowflake 是一個獨特的雲端數據倉儲平台,從零開始構建以在雲端執行數據管理。本書將介紹 Snowflake 的獨特架構,使其在雲端數據倉儲中處於領先地位。

您將探索 Snowflake 提供的計算模型,並了解 Snowflake 如何通過虛擬倉庫實現廣泛的擴展。接下來,您將學習如何配置虛擬倉庫以優化成本和性能。隨著進展,您將熟悉數據生態系統,並發現 Snowflake 如何與其他技術整合以進行數據的暫存和加載。

在各章節中,您將利用 Snowflake 的能力,使用任務處理一系列 SQL 語句來構建數據管道,並了解如何創建旨在提供高性能和可擴展性的現代數據解決方案和管道。您還將學習創建角色層級、添加自定義角色以及為用戶設置默認角色,然後再深入探討數據共享、克隆和性能優化等高級主題。

在本書結束時,您將熟悉 Snowflake 的架構,以構建現代分析解決方案,並了解使用實用配方解決常見問題的最佳實踐。

#### 您將學到什麼

- 熟悉與 Snowflake 雲端架構相符的數據倉儲技術
- 擴展作為數據倉儲設計師的技能,涵蓋 Snowflake 生態系統
- 將本地數據倉儲的技能轉移到 Snowflake 雲端分析平台
- 優化與 Snowflake 解決方案相關的性能和成本
- 在物件存儲上暫存數據並將其加載到 Snowflake
- 確保數據安全並有效共享以供訪問
- 管理事務並使用存儲過程擴展 Snowflake
- 使用 Spark Connector 擴展雲端數據應用程序

#### 本書適合誰

本書適合數據倉儲開發人員、數據分析師、數據庫管理員以及任何參與設計、實施和優化 Snowflake 數據倉儲的人士。對數據倉儲、數據庫和雲端概念的了解將會有幫助。對 Snowflake 的基本熟悉將是有益的,但不是必要的。

作者簡介

Hamid Qureshi is a senior cloud and data warehouse professional with almost two decades of total experience, having architected, designed, and led the implementation of several data warehouse and business intelligence solutions. He has extensive experience and certifications across various data analytics platforms, ranging from Teradata, Oracle, and Hadoop to modern, cloud-based tools such as Snowflake. Having worked extensively with traditional technologies, combined with his knowledge of modern platforms, he has accumulated substantial practical expertise in data warehousing and analytics in Snowflake, which he has subsequently captured in his publications.

Hammad Sharif is an experienced data architect with more than a decade of experience in the information domain, covering governance, warehousing, data lakes, streaming data, and machine learning.

He has worked with a leading data warehouse vendor for a decade as part of a professional services organization, advising customers in telco, retail, life sciences, and financial industries located in Asia, Europe, and Australia during presales and post-sales implementation cycles.

Hammad holds an MSc. in computer science and has published conference papers in the domains of machine learning, sensor networks, software engineering, and remote sensing.

作者簡介(中文翻譯)

哈米德·庫雷希擁有近二十年的雲端和數據倉儲專業經驗,曾架構、設計並主導多個數據倉儲和商業智慧解決方案的實施。他在各種數據分析平台上擁有廣泛的經驗和認證,涵蓋從 Teradata、Oracle 和 Hadoop 到現代雲端工具如 Snowflake。他在傳統技術方面有著豐富的工作經驗,並結合對現代平台的了解,積累了在 Snowflake 中進行數據倉儲和分析的實際專業知識,並在其出版物中進行了詳細記錄。

哈馬德·沙里夫是一位經驗豐富的數據架構師,在信息領域擁有超過十年的經驗,涵蓋治理、數據倉儲、數據湖、流數據和機器學習。

他在一家領先的數據倉儲供應商工作了十年,作為專業服務組織的一部分,為位於亞洲、歐洲和澳洲的電信、零售、生命科學和金融行業的客戶提供建議,涵蓋售前和售後實施周期。

哈馬德擁有計算機科學碩士學位,並在機器學習、傳感器網絡、軟體工程和遙感等領域發表了會議論文。

目錄大綱

  1. Getting Started with Snowflake
  2. Managing the Data Life Cycle
  3. Loading and Extracting Data into and out of Snowflake
  4. Building Data Pipelines in Snowflake
  5. Data Protection and Security in Snowflake
  6. Performance and Cost Optimization
  7. Secure Data Sharing
  8. Back to the Future with Time Travel
  9. Advanced SQL Techniques
  10. Extending Snowflake's Capabilities

目錄大綱(中文翻譯)


  1. Getting Started with Snowflake

  2. Managing the Data Life Cycle

  3. Loading and Extracting Data into and out of Snowflake

  4. Building Data Pipelines in Snowflake

  5. Data Protection and Security in Snowflake

  6. Performance and Cost Optimization

  7. Secure Data Sharing

  8. Back to the Future with Time Travel

  9. Advanced SQL Techniques

  10. Extending Snowflake's Capabilities