Hadoop RealWorld Solutions Cookbook Second Edition
暫譯: Hadoop 實戰解決方案食譜(第二版)
Tanmay Deshpande
- 出版商: Packt Publishing
- 出版日期: 2016-03-29
- 售價: $2,400
- 貴賓價: 9.5 折 $2,280
- 語言: 英文
- 頁數: 290
- 裝訂: Paperback
- ISBN: 1784395501
- ISBN-13: 9781784395506
-
相關分類:
Hadoop
海外代購書籍(需單獨結帳)
相關主題
商品描述
Over 100+ hands-on recipes to help you learn and master the intricacies of Apache Hadoop 2.X, YARN, Hive, Pig, Oozie, Flume, Sqoop, Apache Spark, and Mahout
About This Book
- Implement outstanding Machine Learning use cases on your own analytics models and processes.
- Solutions to common problems when working with the Hadoop ecosystem.
- Step-by-step implementation of end-to-end big data use cases.
Who This Book Is For
Readers who have a basic knowledge of big data systems and want to advance their knowledge with hands-on recipes.
What You Will Learn
- Installing and maintaining Hadoop 2.X cluster and its ecosystem.
- Write advanced Map Reduce programs and understand design patterns.
- Advanced Data Analysis using the Hive, Pig, and Map Reduce programs.
- Import and export data from various sources using Sqoop and Flume.
- Data storage in various file formats such as Text, Sequential, Parquet, ORC, and RC Files.
- Machine learning principles with libraries such as Mahout
- Batch and Stream data processing using Apache Spark
In Detail
Big data is the current requirement. Most organizations produce huge amount of data every day. With the arrival of Hadoop-like tools, it has become easier for everyone to solve big data problems with great efficiency and at minimal cost. Grasping Machine Learning techniques will help you greatly in building predictive models and using this data to make the right decisions for your organization.
Hadoop Real World Solutions Cookbook gives readers insights into learning and mastering big data via recipes. The book not only clarifies most big data tools in the market but also provides best practices for using them. The book provides recipes that are based on the latest versions of Apache Hadoop 2.X, YARN, Hive, Pig, Sqoop, Flume, Apache Spark, Mahout and many more such ecosystem tools. This real-world-solution cookbook is packed with handy recipes you can apply to your own everyday issues. Each chapter provides in-depth recipes that can be referenced easily. This book provides detailed practices on the latest technologies such as YARN and Apache Spark. Readers will be able to consider themselves as big data experts on completion of this book.
This guide is an invaluable tutorial if you are planning to implement a big data warehouse for your business.
商品描述(中文翻譯)
超過 100 個實作食譜,幫助您學習和掌握 Apache Hadoop 2.X、YARN、Hive、Pig、Oozie、Flume、Sqoop、Apache Spark 和 Mahout 的複雜性
本書簡介
- 在您自己的分析模型和流程上實作出色的機器學習用例。
- 解決在使用 Hadoop 生態系統時常見的問題。
- 逐步實作端到端的大數據用例。
本書適合誰閱讀
對大數據系統有基本了解並希望透過實作食譜提升知識的讀者。
您將學到什麼
- 安裝和維護 Hadoop 2.X 集群及其生態系統。
- 撰寫進階的 Map Reduce 程式並理解設計模式。
- 使用 Hive、Pig 和 Map Reduce 程式進行進階數據分析。
- 使用 Sqoop 和 Flume 從各種來源進行數據的進口和出口。
- 以各種文件格式(如 Text、Sequential、Parquet、ORC 和 RC Files)進行數據存儲。
- 使用 Mahout 的機器學習原則。
- 使用 Apache Spark 進行批次和串流數據處理。
詳細內容
大數據是當前的需求。大多數組織每天產生大量數據。隨著 Hadoop 類工具的出現,解決大數據問題變得更加容易,效率高且成本低。掌握機器學習技術將大大幫助您建立預測模型,並利用這些數據為您的組織做出正確的決策。
《Hadoop 實務解決方案食譜》為讀者提供了透過食譜學習和掌握大數據的見解。本書不僅澄清了市場上大多數大數據工具,還提供了使用它們的最佳實踐。本書提供的食譜基於最新版本的 Apache Hadoop 2.X、YARN、Hive、Pig、Sqoop、Flume、Apache Spark、Mahout 及其他許多生態系統工具。這本實務解決方案食譜充滿了您可以應用於日常問題的實用食譜。每一章都提供了深入的食譜,便於參考。本書詳細介紹了最新技術,如 YARN 和 Apache Spark。讀者在完成本書後將能夠自信地認為自己是大數據專家。
如果您計劃為您的業務實施大數據倉庫,這本指南將是無價的教程。