Hadoop MapReduce Cookbook

Srinath Perera, Thilina Gunarathne

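  • Publisher: Packt Publishing
  • Publication date: 2013-01-30
  • List price: $1,890
  • VIP price: $1,796 (5% off)
  • Language: English
  • Pages: 300
  • Binding: Paperback
  • ISBN: 1849517282
  • ISBN-13: 9781849517287
  • Related categories: Hadoop, Distributed architecture
  • Stocked on order (approx. 3-4 weeks)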

Product description

Learn how to use Hadoop MapReduce to analyze large and complex datasets with this comprehensive cookbook. Over fifty recipes with step-by-step instructions quickly take your Hadoop skills to the next level.

Overview

  • Learn to process large and complex data sets, starting simply, then diving in deep
  • Solve complex big data problems such as classification, finding relationships, online marketing, and recommendations
  • More than 50 Hadoop MapReduce recipes, presented in a simple and straightforward manner, with step-by-step instructions and real-world examples

In Detail

We are facing an avalanche of data. The unstructured data we gather can contain insights that hold the key to business success or failure. The ability to analyze and process this data with Hadoop MapReduce is one of the most sought-after skills in today's job market.

"Hadoop MapReduce Cookbook" is a one-stop guide to processing large and complex data sets using the Hadoop ecosystem. The book starts with simple examples and then works through in-depth big data use cases.

"Hadoop MapReduce Cookbook" presents more than 50 ready-to-use Hadoop MapReduce recipes in a simple and straightforward manner, with step-by-step instructions and real-world examples.

Start by installing Hadoop, then learn to configure, extend, and administer it. From there, write simple examples, learn MapReduce patterns, harness the wider Hadoop landscape, and finally move to the cloud.

The book covers topics such as setting up Hadoop security and using MapReduce for analytics, classification, online marketing, recommendations, and search use cases. You will learn how to harness components from the Hadoop ecosystem including HBase, Hive, Pig, and Mahout, then learn how to set up cloud environments to perform Hadoop MapReduce computations.

"Hadoop MapReduce Cookbook" teaches you how to process large and complex data sets using real examples, providing a comprehensive guide to getting things done with Hadoop MapReduce.

What you will learn from this book

  • How to install Hadoop MapReduce and HDFS to begin running examples
  • How to configure and administer Hadoop and HDFS securely
  • How Hadoop works internally and how it can be extended to suit your needs
  • How to use HBase, Hive, Pig, Mahout, and Nutch to get things done easily and efficiently
  • How to use MapReduce to solve many types of analytics problems
  • How to solve complex problems such as classification, finding relationships, online marketing, and recommendations
  • How to use MapReduce for massive text data processing
  • How to use cloud environments to perform Hadoop computation

Approach

Individual self-contained code recipes. Solve specific problems using individual recipes, or work through the book to develop your capabilities.

Who this book is written for

If you are a big data enthusiast striving to use Hadoop to solve your problems, this book is for you. Aimed at Java programmers with some knowledge of Hadoop MapReduce, it is also a comprehensive reference for developers and system administrators who want to get up to speed with Hadoop.
