Spark in Action
Petar Zecevic, Marko Bonaci
- 出版商: Manning
- 出版日期: 2016-11-26
- 定價: $1,650
- 售價: 6.0 折 $990
- 語言: 英文
- 頁數: 472
- 裝訂: Paperback
- ISBN: 1617292605
- ISBN-13: 9781617292606
-
相關分類:
Spark
-
其他版本:
Spark in Action ,2/e (Paperback)
買這商品的人也買了...
-
$620$527 -
$380$342 -
$1,311$1,242 -
$1,808The Art of SEO: Mastering Search Engine Optimization, 3/e (Paperback)
-
$341圖解機器學習
-
$1,568Spark: Big Data Cluster Computing in Production (Paperback)
-
$2,470$2,347 -
$580$458 -
$540$459 -
$1,197Fast Data Processing with Spark 2 - Third Edition
-
$1,970$1,872 -
$403機器學習導論 (An Introduction to Machine Learning)
-
$590$502 -
$1,575Expert Hadoop Administration: Managing, Tuning, and Securing Spark, YARN, and HDFS (paperback)
-
$680$578 -
$590$460 -
$580$493 -
$500$425 -
$1,610$1,530 -
$3,500$3,325 -
$1,940$1,843 -
$540$459 -
$798Deep Learning with Hadoop (Paperback)
-
$1,575High Performance Spark: Best Practices for Scaling and Optimizing Apache Spark (Paperback)
-
$2,330$2,214
相關主題
商品描述
Summary
Spark in Action teaches you the theory and skills you need to effectively handle batch and streaming data using Spark. Fully updated for Spark 2.0.
Purchase of the print book includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications.
About the Technology
Big data systems distribute datasets across clusters of machines, making it a challenge to efficiently query, stream, and interpret them. Spark can help. It is a processing system designed specifically for distributed data. It provides easy-to-use interfaces, along with the performance you need for production-quality analytics and machine learning. Spark 2 also adds improved programming APIs, better performance, and countless other upgrades.
About the Book
Spark in Action teaches you the theory and skills you need to effectively handle batch and streaming data using Spark. You'll get comfortable with the Spark CLI as you work through a few introductory examples. Then, you'll start programming Spark using its core APIs. Along the way, you'll work with structured data using Spark SQL, process near-real-time streaming data, apply machine learning algorithms, and munge graph data using Spark GraphX. For a zero-effort startup, you can download the preconfigured virtual machine ready for you to try the book's code.
What's Inside
- Updated for Spark 2.0
- Real-life case studies
- Spark DevOps with Docker
- Examples in Scala, and online in Java and Python
About the Reader
Written for experienced programmers with some background in big data or machine learning.
About the Authors
Petar Zečević and Marko Bonaći are seasoned developers heavily involved in the Spark community.
Table of Contents
PART 1 - FIRST STEPS
PART 2 - MEET THE SPARK FAMILY
PART 3 - SPARK OPS
PART 4 - BRINGING IT TOGETHER
- Introduction to Apache Spark
- Spark fundamentals
- Writing Spark applications
- The Spark API in depth
- Sparkling queries with Spark SQL
- Ingesting data with Spark Streaming
- Getting smart with MLlib
- ML: classification and clustering
- Connecting the dots with GraphX
- Running Spark
- Running on a Spark standalone cluster
- Running on YARN and Mesos
- Case study: real-time dashboard
- Deep learning on Spark with H2O
商品描述(中文翻譯)
摘要
Spark in Action教授您使用Spark有效處理批次和流式數據所需的理論和技能。全面更新至Spark 2.0。
購買印刷版書籍將包含Manning Publications提供的PDF、Kindle和ePub格式的免費電子書。
關於技術
大數據系統將數據集分佈在機器集群中,這使得高效查詢、流式傳輸和解釋數據成為一個挑戰。Spark可以幫助您。它是一個專為分佈式數據設計的處理系統。它提供易於使用的界面,以及您在生產質量分析和機器學習中所需的性能。Spark 2還增加了改進的編程API、更好的性能和無數其他升級。
關於本書
Spark in Action教授您使用Spark有效處理批次和流式數據所需的理論和技能。通過一些入門示例,您將熟悉Spark CLI。然後,您將使用其核心API編程Spark。在此過程中,您將使用Spark SQL處理結構化數據,處理近實時流數據,應用機器學習算法,並使用Spark GraphX處理圖數據。為了讓您輕鬆開始,您可以下載預配置的虛擬機器,準備好試用本書的代碼。
內容簡介
- 更新至Spark 2.0
- 真實案例研究
- 使用Docker進行Spark DevOps
- Scala示例,以及Java和Python的線上示例
讀者對象
本書適合有一定大數據或機器學習背景的經驗豐富的程序員。
作者簡介
Petar Zečević和Marko Bonaći是深度參與Spark社區的經驗豐富的開發人員。
目錄
第1部分 - 初步
第2部分 - 認識Spark家族
第3部分 - Spark操作
第4部分 - 綜合應用
- Apache Spark簡介
- Spark基礎知識
- 編寫Spark應用程序
- 深入了解Spark API
- 使用Spark SQL進行查詢
- 使用Spark Streaming接收數據
- 使用MLlib進行智能處理
- 機器學習:分類和聚類
- 使用GraphX連接數據
- 運行Spark
- 在Spark獨立集群上運行
- 在YARN和Mesos上運行
- 案例研究:實時儀表板
- 使用H2O在Spark上進行深度學習