Beginning Apache Spark 2: With Resilient Distributed Datasets, Spark SQL, Structured Streaming and Spark Machine Learning library
暫譯: Apache Spark 2 入門：包含彈性分散式資料集、Spark SQL、結構化串流及 Spark 機器學習庫

Name: Beginning Apache Spark 2: With Resilient Distributed Datasets, Spark SQL, Structured Streaming and Spark Machine Learning library
Price: 1200 TWD
Availability: InStock
Author: Hien Luu
ISBN: 1484235789

Hien Luu

出版商: Apress
出版日期: 2018-08-17
定價: $1,500
售價: 8.0 折 $1,200
語言: 英文
頁數: 408
裝訂: Paperback
ISBN: 1484235789
ISBN-13: 9781484235782
相關分類: Spark

立即出貨 (庫存 < 3)

商品描述

Develop applications for the big data landscape with Spark and Hadoop. This book also explains the role of Spark in developing scalable machine learning and analytics applications with Cloud technologies. Beginning Apache Spark 2 gives you an introduction to Apache Spark and shows you how to work with it.

Along the way, you’ll discover resilient distributed datasets (RDDs); use Spark SQL for structured data; and learn stream processing and build real-time applications with Spark Structured Streaming. Furthermore, you’ll learn the fundamentals of Spark ML for machine learning and much more.

After you read this book, you will have the fundamentals to become proficient in using Apache Spark and know when and how to apply it to your big data applications.

What You Will Learn

Understand Spark unified data processing platform
How to run Spark in Spark Shell or Databricks
Use and manipulate RDDs
Deal with structured data using Spark SQL through its operations and advanced functions
Build real-time applications using Spark Structured Streaming
Develop intelligent applications with the Spark Machine Learning library

Who This Book Is For

Programmers and developers active in big data, Hadoop, and Java but who are new to the Apache Spark platform.

商品描述(中文翻譯)

開發大數據環境中的應用程式，使用 Spark 和 Hadoop。本書還解釋了 Spark 在開發可擴展的機器學習和分析應用程式中與雲技術的角色。《Beginning Apache Spark 2》為您介紹 Apache Spark，並展示如何使用它。

在這個過程中，您將發現彈性分散式資料集（RDD）；使用 Spark SQL 處理結構化資料；學習串流處理並使用 Spark Structured Streaming 建立即時應用程式。此外，您還將學習 Spark ML 的基本概念，以便進行機器學習及更多內容。

閱讀完本書後，您將掌握使用 Apache Spark 的基本知識，並了解何時以及如何將其應用於您的大數據應用程式。

您將學到的內容：

- 了解 Spark 統一資料處理平台
- 如何在 Spark Shell 或 Databricks 中運行 Spark
- 使用和操作 RDD
- 通過 Spark SQL 的操作和高級功能處理結構化資料
- 使用 Spark Structured Streaming 建立即時應用程式
- 使用 Spark 機器學習庫開發智能應用程式

本書適合對象：

活躍於大數據、Hadoop 和 Java 的程式設計師和開發人員，但對 Apache Spark 平台較為陌生。

Beginning Apache Spark 2: With Resilient Distributed Datasets, Spark SQL, Structured Streaming and Spark Machine Learning library 暫譯: Apache Spark 2 入門：包含彈性分散式資料集、Spark SQL、結構化串流及 Spark 機器學習庫

Hien Luu

商品描述

商品描述(中文翻譯)

類似商品

Beginning Apache Spark 2: With Resilient Distributed Datasets, Spark SQL, Structured Streaming and Spark Machine Learning library
暫譯: Apache Spark 2 入門：包含彈性分散式資料集、Spark SQL、結構化串流及 Spark 機器學習庫