Stream Processing with Apache Spark: Best Practices for Scaling and Optimizing Apache Spark

Price: 2142 TWD
Availability: InStock
Author: François Garillot, Gerard Maas
ISBN: 1491944242

Publisher: O'Reilly
Publication date: 2019-07-16
List price: $2,380
Sale price: 5% off, $2,261
VIP price: 10% off, $2,142
Language: English
Pages: 452
Binding: Paperback
ISBN: 1491944242
ISBN-13: 9781491944240
Related category: Spark

Ships immediately (stock < 4)

Product description

To build analytics tools that provide faster insights, knowing how to process data in real time is a must, and moving from batch processing to stream processing is absolutely required. Fortunately, the Spark in-memory framework/platform for processing data has added an extension devoted to fault-tolerant stream processing: Spark Streaming.

If you're familiar with Apache Spark and want to learn how to implement it for streaming jobs, this practical book is a must.

- Understand how Spark Streaming fits in the big picture
- Learn core concepts such as Spark RDDs, Spark Streaming clusters, and the fundamentals of a DStream
- Discover how to create a robust deployment
- Dive into streaming algorithmics
- Learn how to tune, measure, and monitor Spark Streaming

About the authors

Gerard Maas is a Principal Engineer at Lightbend, where he works on the seamless integration of Structured Streaming and other scalable stream processing technologies into the Lightbend Platform. Previously, he worked at a cloud-native IoT startup, where he led the data processing team in building the streaming pipelines that pushed Spark Streaming to its limits in terms of throughput. Back then, he published the first comprehensive guide to tuning Spark Streaming performance.

Gerard has held leading roles at several startups and large enterprises, building data science governance, cloud-native IoT platforms, telecom platforms, and scalable APIs. He is a regular speaker at technology conferences and contributes to small and large open source projects. Gerard has a degree in Computer Engineering from the Simón Bolívar University, Venezuela. You can find him on Twitter as @maasg.

François Garillot is based in Seattle, where he works on distributed computing at Facebook. He received a Ph.D. from École Polytechnique in 2011 and worked on Spark Streaming's back-pressure at Lightbend in 2015. His interests include type systems and leveraging programming languages to make analytics simpler to express, and he has a passion for Scala, Spark, and roasted arabica. When not at work, he can be found enjoying the mountains of the Pacific Northwest.
