Data Compression for Data Mining Algorithms
暫譯: 數據挖掘演算法的數據壓縮
Wang, Xiaochun
- 出版商: Morgan Kaufmann
- 出版日期: 2026-05-07
- 售價: $6,190
- 貴賓價: 9.5 折 $5,880
- 語言: 英文
- 頁數: 364
- 裝訂: Quality Paper - also called trade paper
- ISBN: 0443405417
- ISBN-13: 9780443405419
-
相關分類:
Data-mining、DeepLearning
海外代購書籍(需單獨結帳)
相關主題
商品描述
Data Compression for Data Mining Algorithms tackles the important problems in the design of more efficient data mining algorithms by way of data compression techniques and provides the first systematic and comprehensive description of the relationships between data compression mechanisms and the computations involved in data mining algorithms. Data mining algorithms are powerful analytical techniques used across various disciplines, including business, engineering, and science. However, in the big data era, tasks such as association rule mining and classification often require multiple scans of databases, while clustering and outlier detection methods typically depend on Euclidean distance for similarity measures, leading to high computational costs. Data Compression for Data Mining Algorithms addresses these challenges by focusing on the scalarization of data mining algorithms, leveraging data compression techniques to reduce dataset sizes and applying information theory principles to minimize computations involved in tasks such as feature selection and similarity computation. The book features the latest developments in both lossless and lossy data compression methods and provides a comprehensive exposition of data compression methods for data mining algorithm design from multiple points of view. Key discussions include Huffman coding, scalar and vector quantization, transforms, subbands, wavelet-based compression for scalable algorithms, and the role of neural networks, particularly deep learning, in feature selection and dimensionality reduction. The book's contents are well-balanced for both theoretical analysis and real-world applications, and the chapters are well organized to compose a solid overview of the data compression techniques for data mining. To provide the reader with a more complete understanding of the material, projects and problems solved with Python are interspersed throughout the text.
商品描述(中文翻譯)
《資料壓縮與資料探勘演算法》探討了設計更有效率的資料探勘演算法中重要的問題,透過資料壓縮技術提供了資料壓縮機制與資料探勘演算法中涉及的計算之間關係的首個系統性和全面性描述。資料探勘演算法是用於各種學科的強大分析技術,包括商業、工程和科學。然而,在大數據時代,像是關聯規則挖掘和分類等任務通常需要對資料庫進行多次掃描,而聚類和異常檢測方法通常依賴於歐幾里得距離作為相似性度量,這導致了高計算成本。
《資料壓縮與資料探勘演算法》針對這些挑戰,專注於資料探勘演算法的標量化,利用資料壓縮技術來減少資料集的大小,並應用資訊理論原則來最小化在特徵選擇和相似性計算等任務中所涉及的計算。這本書展示了最新的無損和有損資料壓縮方法的發展,並從多個角度提供了資料探勘演算法設計的資料壓縮方法的全面闡述。
主要討論內容包括霍夫曼編碼、標量和向量量化、變換、子帶、基於小波的可擴展演算法壓縮,以及神經網絡,特別是深度學習在特徵選擇和降維中的角色。書中的內容在理論分析和實際應用之間取得了良好的平衡,章節組織良好,構成了資料探勘資料壓縮技術的堅實概述。為了讓讀者對材料有更完整的理解,書中穿插了使用 Python 解決的專案和問題。