YARN Essentials

Amol Fasale, Nirmal Kumar

  • 出版商: Packt Publishing
  • 出版日期: 2015-02-28
  • 售價: $1,360
  • 貴賓價: 9.5$1,292
  • 語言: 英文
  • 頁數: 176
  • 裝訂: Paperback
  • ISBN: 1784391735
  • ISBN-13: 9781784391737
  • 下單後立即進貨 (約3~4週)

商品描述

A comprehensive, hands-on guide to install, administer, and configure settings in YARN

About This Book

  • Learn the inner workings of YARN and how its robust and generic framework enables optimal resource utilization across multiple applications
  • Get to grips with single and multi-node installation, administration, and real-time distributed application development
  • A step-by-step self-learning guide to help you perform optimal resource utilization in a cluster

Who This Book Is For

If you have a working knowledge of Hadoop 1.x but want to start afresh with YARN, this book is ideal for you. You will be able to install and administer a YARN cluster and also discover the configuration settings to fine-tune your cluster both in terms of performance and scalability. This book will help you develop, deploy, and run multiple applications/frameworks on the same shared YARN cluster.

What You Will Learn

  • Understand how existing MapReduce applications can run on top of YARN and how they are backward compatible
  • Explore the YARN concepts, terminologies, architecture, key components, and interaction between the components
  • Set up a standalone and multi-node clustered YARN environment
  • Design, develop, and run different frameworks such as MapReduce, Apache Storm, Apache Tez, and Giraffe on top of YARN
  • Get to grips with the built-in support for multitenancy in YARN
  • Discover the motivation behind YARN's architecture design, implementations, and why YARN was needed
  • Learn how failures at each level are gracefully handled by the new framework to achieve fault tolerance and scalability

In Detail

YARN is the next generation generic resource platform used to manage resources in a typical cluster and is designed to support multitenancy in its core architecture. As optimal resource utilization is central to the design of YARN, learning how to fully utilize the available fine-grained resources (RAM, CPU cycles, and so on) in the cluster becomes vital.

This book is an easy-to-follow, self-learning guide to help you start working with YARN. Beginning with an overview of YARN and Hadoop, you will dive into the pitfalls of Hadoop 1.x and how YARN takes us to the next level. You will learn the concepts, terminology, architecture, core components, and key interactions, and cover the installation and administration of a YARN cluster as well as learning about YARN application development with new and emerging data processing frameworks.