Big Data Analytics with Hadoop 3

Sridhar Alla

商品描述

Dive deep into Big Data concepts, platforms, analytics and their applications using the power of Hadoop 3 About This Book * Leverage the power of Hadoop 3 to build effective big data analytics solutions on-premise and on cloud * Integrate Hadoop with other big data tools such as R, Python, Apache Spark and Apache Flink * Get deep insights from your Big Data using Hadoop 3 with the help of real-world examples Who This Book Is For If you are looking to build high-performance analytics solutions for your enterprise or business using Hadoop 3's powerful features, this book is for you. If you're new to Big Data analytics, this book will also help you. A basic understanding of the Java programming language is required for this book. What You Will Learn * Explore the new features of Hadoop 3 along with HDFS, YARN and MapReduce. * Get well-versed with the analytical capabilities of Hadoop ecosystem using practical examples * Integrate Hadoop with R and Python for more efficient big data processing * Learn to use Hadoop with Apache Spark and Apache Flink for real-time data analytics * Setup a Hadoop cluster on AWS cloud * Perform Big Data Analytics on AWS using Elastic Map Reduce In Detail Apache Hadoop is the most popular platform for big data processing, and can be combined with a host of other big data tools to build powerful analytics solutions. This book shows you how to do just that, with the help of practical examples. You will start with getting a quick overview of the new features introduced in Hadoop 3 along with HDFS, MapReduce and YARN , and how they enable faster, more efficient big data processing. Further, you will learn how to integrate Hadoop with the open source tools such as Python and R to analyse and visualise data and to perform statistical computing on Big Data. The book will also show you how to use Hadoop 3 with Apache Spark and Apache Flink for real-time data analytics and stream processing, and demonstrates how to use Hadoop to build analytics solutions on the cloud. Finally, you will learn to build an end to end pipeline to perform Big Data Analytics using practical use cases. By the end of this book, you will be well-versed with the analytical capabilities of the Hadoop ecosystem. You will be able to build powerful solutions to perform Big Data analytics and get insights from your Big Data without any hassle.

商品描述(中文翻譯)

深入探索大數據概念、平台、分析及其在Hadoop 3上的應用力量

關於本書
* 利用Hadoop 3的力量,在本地和雲端上建立有效的大數據分析解決方案
* 將Hadoop與其他大數據工具(如R、Python、Apache Spark和Apache Flink)整合
* 通過實際示例,利用Hadoop 3從大數據中獲取深入洞察

本書適合對於使用Hadoop 3的強大功能為企業或業務建立高性能分析解決方案的讀者。如果您對於大數據分析還不熟悉,本書也將對您有所幫助。本書需要讀者對Java編程語言有基本的理解。

您將學到什麼
* 探索Hadoop 3的新功能,包括HDFS、YARN和MapReduce。
* 通過實際示例熟悉Hadoop生態系統的分析能力
* 將Hadoop與R和Python整合,以實現更高效的大數據處理
* 學習使用Hadoop與Apache Spark和Apache Flink進行實時數據分析
* 在AWS雲上設置Hadoop集群
* 使用Elastic Map Reduce在AWS上進行大數據分析

詳細內容
Apache Hadoop是最受歡迎的大數據處理平台,可以與其他大數據工具結合使用,構建強大的分析解決方案。本書通過實際示例向您展示如何實現這一目標。您將首先快速瞭解Hadoop 3引入的新功能,包括HDFS、MapReduce和YARN,以及它們如何實現更快、更高效的大數據處理。此外,您還將學習如何將Hadoop與Python和R等開源工具整合,以分析和可視化數據,並在大數據上進行統計計算。本書還將向您展示如何使用Hadoop 3與Apache Spark和Apache Flink進行實時數據分析和流處理,並演示如何使用Hadoop在雲上構建分析解決方案。最後,您將學習如何通過實際用例構建端到端的管道,執行大數據分析。通過閱讀本書,您將熟悉Hadoop生態系統的分析能力,能夠輕鬆構建強大的大數據分析解決方案,並從大數據中獲取洞察。