Entity Resolution and Information Quality (Paperback)

John R. Talburt

  • 出版商: Morgan Kaufmann
  • 出版日期: 2010-12-08
  • 定價: $1,730
  • 售價: 8.5$1,471
  • 語言: 英文
  • 頁數: 256
  • 裝訂: Paperback
  • ISBN: 0123819725
  • ISBN-13: 9780123819727
  • 相關分類: Data ScienceInformation-management
  • 立即出貨(限量) (庫存=3)

商品描述

Customers and products are the heart of any business, and corporations collect more data about them every year. However, just because you have data doesn't mean you can use it effectively. If not properly integrated, data can actually encourage false conclusions that result in bad decisions and lost opportunities. Entity Resolution (ER) is a powerful tool for transforming data into accurate, value-added information. Using entity resolution methods and techniques, you can identify equivalent records from multiple sources corresponding to the same real-world person, place, or thing.

This emerging area of data management is clearly explained throughout the book. It teaches you the process of locating and linking information about the same entity - eliminating duplications - and making crucial business decisions based on the results. This book is an authoritative, vendor-independent technical reference for researchers, graduate students and practitioners, including architects, technical analysts, and solution developers. In short, Entity Resolution and Information Quality gives you the applied level know-how you need to aggregate data from disparate sources and form accurate customer and product profiles that support effective marketing and sales. It is an invaluable guide for succeeding in today's info-centric environment.



  • First authoritative reference explaining entity resolution and how to use it effectively
  • Provides practical system design advice to help you get a competitive advantage
  • Includes a companion site with synthetic customer data for applicatory exercises, and access to a Java-based Entity Resolution program.

商品描述(中文翻譯)

顧客和產品是任何企業的核心,企業每年都會收集更多有關它們的數據。然而,僅僅擁有數據並不意味著您可以有效地使用它。如果數據沒有得到正確整合,實際上可能會導致錯誤的結論,進而導致糟糕的決策和損失的機會。實體解析(ER)是將數據轉化為準確、增值信息的強大工具。使用實體解析的方法和技術,您可以識別來自多個來源的相等記錄,對應於同一個現實世界的人、地方或事物。

本書清晰地解釋了這一新興的數據管理領域。它教導您定位和鏈接有關同一實體的信息的過程,消除重複,並基於結果做出關鍵的業務決策。本書是一本權威的、與供應商無關的技術參考書,適用於研究人員、研究生和從業人員,包括架構師、技術分析師和解決方案開發人員。簡而言之,實體解析和信息質量為您提供了應用級別的專業知識,幫助您從不同來源聚合數據,形成支持有效市場營銷和銷售的準確客戶和產品概況。在當今信息中心環境中取得成功的無價指南。

以下是本書的特點:
- 首本權威參考書,解釋實體解析及其有效使用方法
- 提供實用的系統設計建議,幫助您獲得競爭優勢
- 包含一個附帶網站,提供應用練習所需的合成客戶數據,以及使用Java的實體解析程序的訪問權限。