Building Big Data Pipelines with Apache Beam: Use a single programming model for both batch and stream data processing (Paperback)
暫譯: 使用 Apache Beam 建構大數據管道：單一程式模型處理批次與串流數據

Name: Building Big Data Pipelines with Apache Beam: Use a single programming model for both batch and stream data processing (Paperback)
Price: 1795 TWD
Availability: OnlineOnly
Author: Lukavský, Jan
ISBN: 1800564937

Lukavský, Jan

出版商: Packt Publishing
出版日期: 2022-01-21
售價: $1,890
貴賓價: 9.5 折 $1,795
語言: 英文
頁數: 342
裝訂: Quality Paper - also called trade paper
ISBN: 1800564937
ISBN-13: 9781800564930
相關分類: 大數據 Big-data

海外代購書籍(需單獨結帳)

買這商品的人也買了...

~~$969~~ $918

Python and HDF5 (Paperback)
$453

InfluxDB 原理與實戰
~~$2,180~~ $2,071

Machine Learning for Algorithmic Trading, 2/e (Paperback)
~~$1,700~~ $1,615

Python Algorithmic Trading Cookbook: All the recipes you need to implement your own algorithmic trading strategies in Python
~~$1,786~~ $1,692

Kubeflow for Machine Learning: From Lab to Production
$499

事件流實戰
~~$780~~ $616

MongoDB 技術手冊, 3/e (MongoDB: The Definitive Guide: Powerful and Scalable Data Storage, 3/e)
~~$2,200~~ $2,090

Algorithmic Short Selling with Python: Refine your algorithmic trading edge, consistently generate investment ideas, and build a robust long/short pro (Paperback)
~~$880~~ $695

從 OS 等級探究：Redis 運作原理程式逐行講解
~~$1,800~~ $1,710

Advanced Python Programming : Accelerate your Python programs using proven techniques and design patterns, 2/e (Paperback)
$1,428

Time Series Analysis with Python Cookbook: Practical recipes for exploratory data analysis, data preparation, forecasting, and model evaluation (Paperback)
$448

跨數據中心機器學習：賦能多雲智能數算融合
~~$714~~ $678

使用 GitOps 實現 Kubernetes 的持續部署：模式、流程及工具
~~$599~~ $569

Elasticsearch 數據搜索與分析實戰
~~$539~~ $512

Kafka 實戰
~~$880~~ $695

資料視覺化｜使用 Python 與 JavaScript, 2/e (Data Visualization with Python and JavaScript: Scrape, Clean, Explore, and Transform Your Data, 2/e)
~~$1,860~~ $1,767

Google Cloud Platform for Data Science: A Crash Course on Big Data, Machine Learning, and Data Analytics Services (Paperback)
~~$1,700~~ $1,615

Practical Machine Learning on Databricks: Seamlessly transition ML models and MLOps on Databricks (Paperback)
~~$780~~ $616

Terraform 建置與執行, 3/e (Terraform: Up and Running: Writing Infrastructure as Code, 3/e)
$509

基於 GPT-3、ChatGPT、GPT-4 等 Transformer 架構的自然語言處理
~~$680~~ $537

資料科學：困難部分 (Data Science: The Hard Parts: Techniques for Excelling at Data Science)
$607

CUDA 並行編程與性能優化
~~$1,690~~ $1,605

GPU Programming with C++ and CUDA: Uncover effective techniques for writing efficient GPU-parallel C++ applications (Paperback)
~~$499~~ $424

最強 AI 組合技！NotebookLM / Gemini / Nano Banana / Veo 3 【影音生成進化版】
~~$650~~ $513

Nano Banana 藝術宇宙 - Veo x Sora: 多模態 AI 創作時代

商品描述

Implement, run, operate, and test data processing pipelines using Apache Beam

Key Features:

Understand how to improve usability and productivity when implementing Beam pipelines
Learn how to use stateful processing to implement complex use cases using Apache Beam
Implement, test, and run Apache Beam pipelines with the help of expert tips and techniques

Book Description:

Apache Beam is an open source unified programming model for implementing and executing data processing pipelines, including Extract, Transform, and Load (ETL), batch, and stream processing.

This book will help you to confidently build data processing pipelines with Apache Beam. You'll start with an overview of Apache Beam and understand how to use it to implement basic pipelines. You'll also learn how to test and run the pipelines efficiently. As you progress, you'll explore how to structure your code for reusability and also use various Domain Specific Languages (DSLs). Later chapters will show you how to use schemas and query your data using (streaming) SQL. Finally, you'll understand advanced Apache Beam concepts, such as implementing your own I/O connectors.

By the end of this book, you'll have gained a deep understanding of the Apache Beam model and be able to apply it to solve problems.

What You Will Learn:

Understand the core concepts and architecture of Apache Beam
Implement stateless and stateful data processing pipelines
Use state and timers for processing real-time event processing
Structure your code for reusability
Use streaming SQL to process real-time data for increasing productivity and data accessibility
Run a pipeline using a portable runner and implement data processing using the Apache Beam Python SDK
Implement Apache Beam I/O connectors using the Splittable DoFn API

Who this book is for:

This book is for data engineers, data scientists, and data analysts who want to learn how Apache Beam works. Intermediate-level knowledge of the Java programming language is assumed.

商品描述(中文翻譯)

使用 Apache Beam 實作、運行、操作和測試資料處理管道

主要特點：

了解在實作 Beam 管道時如何改善可用性和生產力

學習如何使用有狀態處理來實作複雜的使用案例，使用 Apache Beam

在專家提示和技巧的幫助下，實作、測試和運行 Apache Beam 管道

書籍描述：

Apache Beam 是一個開源的統一程式設計模型，用於實作和執行資料處理管道，包括提取、轉換和加載（ETL）、批次和串流處理。

這本書將幫助你自信地使用 Apache Beam 建立資料處理管道。你將從 Apache Beam 的概述開始，了解如何使用它來實作基本管道。你還將學習如何有效地測試和運行這些管道。隨著進展，你將探索如何結構化你的程式碼以便重用，並使用各種特定領域語言（DSL）。後面的章節將向你展示如何使用模式和使用（串流）SQL 查詢你的資料。最後，你將理解進階的 Apache Beam 概念，例如實作自己的 I/O 連接器。

在這本書結束時，你將深入了解 Apache Beam 模型，並能夠應用它來解決問題。

你將學到什麼：