Microsoft SQL Server 2012 with Hadoop

Debarchan Sarkar

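  • Publisher: Packt Publishing
  • Publication date: 2013-08-10
  • List price: $1,700
  • VIP price: $1,615 (5% off)
  • Language: English
  • Pages: 96
  • Binding: Paperback
  • ISBN: 1782177981
  • ISBN-13: 9781782177982
  • Related categories: Hadoop, MSSQL, SQL
  • Ordered from the supplier once your order is placed (about 3-4 weeks)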

Product description

Getting SQL Server talking to Hadoop is a smooth process when you follow this tutorial. Learn all the tools and techniques you need to integrate the data and then extract powerful business insights from the merged result.

Overview

  • Integrate data from unstructured (Hadoop) and structured (SQL Server 2012) sources
  • Configure and install connectors for a bi-directional transfer of data
  • Full of illustrations, diagrams, and tips with clear, step-by-step instructions and practical examples

In Detail

With the explosion of data, the open source Apache Hadoop project is gaining traction, thanks to the large ecosystem that has grown up around the core functionality of its distributed file system (HDFS) and MapReduce. Getting SQL Server to talk to Hadoop has become increasingly important because the two are complementary: petabytes of unstructured data can be stored in Hadoop but take hours to query, while terabytes of structured data can be stored in SQL Server 2012 and queried in seconds. This creates a need to transfer and integrate data between Hadoop and SQL Server.

Microsoft SQL Server 2012 with Hadoop is aimed at SQL Server developers. It quickly shows you how to get Hadoop activated on SQL Server 2012 (it ships with this version). Once this is done, the book focuses on how to manage big data with Hadoop and use Hadoop Hive to query the data. It also covers topics such as using SQL Server's in-memory functions and using BI tools with big data.

Microsoft SQL Server 2012 with Hadoop focuses on data integration techniques between relational (SQL Server 2012) and non-relational (Hadoop) worlds. It will walk you through different tools for the bi-directional movement of data with practical examples.

You will learn to use open source connectors such as SQOOP to import and export data between SQL Server 2012 and Hadoop, and to work with leading in-memory BI tools to create ETL solutions using the Hive ODBC driver for your data movement projects. Finally, this book gives you a glimpse of present-day self-service BI tools such as Excel and Power View, which consume Hadoop data and provide powerful insights into it.
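To give a rough idea of what a SQOOP import from SQL Server into Hadoop involves, here is a minimal Python sketch that assembles the command line for a `sqoop import` run. It is not taken from the book. The helper name `build_sqoop_import_args` and the host, database, table, user, and HDFS path values are all hypothetical, and the sketch assumes that Sqoop and the Microsoft JDBC driver for SQL Server are already installed on the Hadoop side.

```python
import shlex
from typing import List


def build_sqoop_import_args(
    host: str,
    database: str,
    table: str,
    username: str,
    target_dir: str,
    port: int = 1433,
    num_mappers: int = 1,
) -> List[str]:
    """Build the argument list for a `sqoop import` from SQL Server (hypothetical helper)."""
    # JDBC URL in the form used by Microsoft's SQL Server JDBC driver.
    jdbc_url = f"jdbc:sqlserver://{host}:{port};databaseName={database}"
    return [
        "sqoop", "import",
        "--connect", jdbc_url,
        "--username", username,
        "-P",  # prompt for the password rather than putting it on the command line
        "--table", table,             # source table in SQL Server
        "--target-dir", target_dir,   # destination directory in HDFS
        "--num-mappers", str(num_mappers),
    ]


if __name__ == "__main__":
    # All values below are placeholders for illustration only.
    args = build_sqoop_import_args(
        host="sqlhost",
        database="SalesDb",
        table="Orders",
        username="etl_user",
        target_dir="/data/orders",
    )
    # Print a shell-safe version of the command for inspection.
    print(shlex.join(args))
```

On a machine where Sqoop is available, the returned list could be passed to `subprocess.run(args, check=True)`. Building the command as a list rather than a single string avoids shell-quoting problems with the semicolon in the JDBC URL.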

What you will learn from this book

  • Use the Native SQOOP Connector for data movement between SQL Server 2012 and Hadoop
  • Configure and use the Hive ODBC driver to enable any ODBC compliant client to consume Hadoop data
  • Create ETL solutions and automate data movement jobs between SQL Server 2012 and Hadoop using SQL Server Integration Services
  • Provide powerful reporting on the integrated data in just a few clicks using Microsoft self-service BI tools
  • Merge structured and unstructured data in a common warehouse, an essential step for analysis

Approach

This book is a step-by-step, hands-on tutorial that teaches you to work with big data on SQL Server through examples of increasing complexity.

Who this book is written for

Microsoft SQL Server 2012 with Hadoop is specifically targeted at readers who want to cross-pollinate their Hadoop skills with SQL Server 2012 business intelligence and data analytics. A basic understanding of traditional RDBMS technologies and query processing techniques is essential.
