Managing Gigabytes: Compressing and Indexing Documents and Images, 2/e (Hardcover)

Ian H. Witten, Alistair Moffat, Timothy C. Bell

買這商品的人也買了...

商品描述


Order This Book | Authors | Contents | Web-Enhanced | Related Titles

"This book is the Bible for anyone who needs to manage large data collections. It's required reading for our search gurus at Infoseek. The authors have done an outstanding job of incorporating and describing the most significant new research in information retrieval over the past five years into this second edition."
Steve Kirsch, Cofounder, Infoseek Corporation

"The new edition of Witten, Moffat, and Bell not only has newer and better text search algorithms but much material on image analysis and joint image/text processing. If you care about search engines, you need this book: it is the only one with full details of how they work. The book is both detailed and enjoyable; the authors have combined elegant writing with top-grade programming."
Michael Lesk, National Science Foundation

"The coverage of compression, file organizations, and indexing techniques for full text and document management systems is unsurpassed. Students, researchers, and practitioners will all benefit from reading this book."
Bruce Croft, Director, Center for Intelligent Information Retrieval at the University of Massachusetts

In this fully updated second edition of the highly acclaimed Managing Gigabytes, authors Witten, Moffat, and Bell continue to provide unparalleled coverage of state-of-the-art techniques for compressing and indexing data. Whatever your field, if you work with large quantities of information, this book is essential reading--an authoritative theoretical resource and a practical guide to meeting the toughest storage and access challenges. It covers the latest developments in compression and indexing and their application on the Web and in digital libraries. It also details dozens of powerful techniques supported by mg, the authors' own system for compressing, storing, and retrieving text, images, and textual images. mg's source code is freely available on the Web.

Authors:

The authors (Ian H. Witten, Alistair Moffat, and Timothy C. Bell) all hold senior faculty positions at leading southern hemisphere universities and have undertaken innovative research in the areas addressed in this book. Collectively, they have authored eight books and over 300 research papers. They also serve on the program committees of many international conferences, including the IEEE Data Compression Conference and the ACM Digital Libraries and Information Retrieval conferences.

Table of Contents:

PREFACE
1. OVERVIEW
2. TEXT COMPRESSION
3. INDEXING
4. QUERYING
5. INDEX CONSTRUCTION
6. IMAGE COMPRESSION
7. TEXTUAL IMAGES
8. MIXED TEXT AND IMAGES
9. IMPLEMENTATION
10. THE INFORMATION EXPLOSION
A. GUIDE TO THE MG SYSTEM
B. GUIDE TO THE NZDL
REFERENCES
INDEX

Web-Enhanced:

The authors' website for the book.

Related Titles:

Multimedia Information & Systems
Database
Computer & Communication Networks

商品描述(中文翻譯)

「訂購本書」|「作者」|「目錄」|「網路增強」|「相關書籍」

「這本書是任何需要管理大型資料集的人的聖經。對於我們在Infoseek的搜尋專家來說,這是必讀的。作者在第二版中將過去五年來資訊檢索領域最重要的新研究融入並描述得非常出色。」
Steve Kirsch,Infoseek Corporation聯合創辦人

「Witten、Moffat和Bell的新版不僅有更新且更好的文字搜尋演算法,還有許多關於影像分析和影像/文字處理的內容。如果你關心搜尋引擎,你需要這本書:它是唯一一本詳細介紹搜尋引擎如何運作的書籍。這本書既詳細又有趣;作者們將優雅的寫作與高品質的程式設計結合在一起。」
Michael Lesk,美國國家科學基金會

「這本書對於全文和文件管理系統的壓縮、檔案組織和索引技術的涵蓋範圍是無與倫比的。學生、研究人員和從業人員都將從閱讀這本書中受益。」
Bruce Croft,麻省大學智能資訊檢索中心主任

在這本高度讚譽的《Managing Gigabytes》第二版中,作者Witten、Moffat和Bell繼續提供無與倫比的最新壓縮和索引技術的全面介紹。無論你從事哪個領域,如果你處理大量資訊,這本書是必讀的——它是權威的理論資源和應對最嚴峻的儲存和存取挑戰的實用指南。它涵蓋了壓縮和索引的最新發展及其在網路和數位圖書館中的應用。它還詳細介紹了數十種強大的技術,這些技術由作者們自己的mg系統支持,用於壓縮、儲存和檢索文字、影像和文字影像。mg的原始碼可以在網路上免費取得。

「作者:」
作者(Ian H. Witten、Alistair Moffat和Timothy C. Bell)都在南半球領先的大學擔任高級教職,並在本書所涉及的領域進行了創新研究。他們共同撰寫了八本書和300多篇研究論文。他們還擔任許多國際會議的程序委員會成員,包括IEEE資料壓縮會議和ACM數位圖書館和資訊檢索會議。

「目錄:」
前言
1. 概述
2. 文字壓縮
3. 索引
4. 查詢
5. 索引建構
6. 影像壓縮
7. 文字影像
8. 混合文字和影像
9. 實作
10. 資訊爆炸
A. MG系統指南
B. NZDL指南
參考文獻
索引

「網路增強:」