Pro Apache Phoenix: An SQL Driver for HBase

Shakil Akhtar, Ravi Magham

  • 出版商: Apress
  • 出版日期: 2016-12-30
  • 售價: $1,440
  • 貴賓價: 9.5$1,368
  • 語言: 英文
  • 頁數: 140
  • 裝訂: Paperback
  • ISBN: 1484223691
  • ISBN-13: 9781484223697
  • 相關分類: NoSQLSQL
  • 海外代購書籍(需單獨結帳)

商品描述

Leverage Phoenix as an ANSI SQL engine built on top of the highly distributed and scalable NoSQL framework HBase. Learn the basics and best practices that are being adopted in Phoenix to enable a high write and read throughput in a big data space. 

This book includes real-world cases such as Internet of Things devices that send continuous streams to Phoenix, and the book explains how key features such as joins, indexes, transactions, and functions help you understand the simple, flexible, and powerful API that Phoenix provides. Examples are provided using real-time data and data-driven businesses that show you how to collect, analyze, and act in seconds.  

Pro Apache Phoenix covers the nuances of setting up a distributed HBase cluster with Phoenix libraries, running performance benchmarks, configuring parameters for production scenarios, and viewing the results. The book also shows how Phoenix plays well with other key frameworks in the Hadoop ecosystem such as Apache Spark, Pig, Flume, and Sqoop.

You will learn how to:

  • Handle a petabyte data store by applying familiar SQL techniques
  • Store, analyze, and manipulate data in a NoSQL Hadoop echo system with HBase
  • Apply best practices while working with a scalable data store on Hadoop and HBase
  • Integrate popular frameworks (Apache Spark, Pig, Flume) to simplify big data analysis
  • Demonstrate real-time use cases and big data modeling techniques

Who This Book Is For

Data engineers, Big Data administrators, and architects.



商品描述(中文翻譯)

利用Phoenix作為一個建立在高度分散和可擴展的NoSQL框架HBase之上的ANSI SQL引擎。學習Phoenix中正在被採用的基礎知識和最佳實踐,以實現在大數據空間中的高寫入和讀取吞吐量。

本書包括實際案例,例如連續流傳送到Phoenix的物聯網設備,並且解釋了關鍵功能(如連接、索引、事務和函數)如何幫助您理解Phoenix提供的簡單、靈活和強大的API。使用實時數據和數據驅動的業務提供了示例,向您展示如何在幾秒鐘內收集、分析和執行。

《Pro Apache Phoenix》介紹了使用Phoenix庫設置分散式HBase集群的細微差別,運行性能基準測試,配置生產場景的參數以及查看結果。本書還展示了Phoenix如何與Hadoop生態系統中的其他關鍵框架(如Apache Spark、Pig、Flume和Sqoop)良好配合。

您將學習如何:
- 通過應用熟悉的SQL技術處理PB級數據存儲
- 在HBase的NoSQL Hadoop生態系統中存儲、分析和操作數據
- 在使用Hadoop和HBase的可擴展數據存儲時應用最佳實踐
- 整合流行框架(Apache Spark、Pig、Flume)以簡化大數據分析
- 展示實時用例和大數據建模技術

本書適合數據工程師、大數據管理員和架構師閱讀。