Text Mining Application Programming (Paperback)
暫譯: 文本挖掘應用程式設計

Name: Text Mining Application Programming (Paperback)
Price: 2058 TWD
Availability: InStock
Author: Manu Konchady
ISBN: 1584504609

Manu Konchady

出版商: Charles River Media
出版日期: 2006-05-04
售價: $2,100
貴賓價: 9.8 折 $2,058
語言: 英文
頁數: 432
裝訂: Paperback
ISBN: 1584504609
ISBN-13: 9781584504603
相關分類: Text-mining

立即出貨(限量) (庫存=1)

買這商品的人也買了...

~~$480~~ $379

系統分析與設計概論 (Essential of Systems Analysis and Design)
~~$880~~ $695

深入淺出設計模式 (Head First Design Patterns)
~~$860~~ $679

資料庫系統原理 (Fundamentals of Database Systems, 4/e)
~~$880~~ $695

深入淺出 Java 程式設計, 2/e (Head First Java, 2/e)
~~$780~~ $663

鳥哥的 Linux 私房菜基礎學習篇, 2/e
~~$650~~ $507

ASP.NET 2.0 深度剖析範例集
~~$680~~ $578

Microsoft SQL Server 2005 管理實務
~~$550~~ $467

SQL 語法範例辭典
~~$980~~ $774

Linux 驅動程式, 3/e (Linux Device Drivers, 3/e)
~~$750~~ $593

精通 MFC 視窗程式設計─Visual Studio 2005 版
~~$880~~ $695

操作介面設計模式 (Designing Interfaces)
~~$750~~ $592

Visual C# 2005 程式開發與介面設計秘訣
~~$680~~ $537

Ajax 實戰手冊 (Ajax in Action)
~~$450~~ $355

Ajax 技術手冊 (Foundations of Ajax)
~~$780~~ $616

Ajax 快速上手 (Head Rush Ajax)
~~$720~~ $568

聖殿祭司的 ASP.NET 2.0 專家技術手冊─使用 C#
~~$1,200~~ $948

Linux 核心詳解, 3/e (Understanding the Linux Kernel, 3/e)
~~$1,400~~ $1,372

The Text Mining Handbook: Advanced Approaches in Analyzing Unstructured Data
~~$990~~ $891

C++ Primer, 4/e (中文版)
~~$580~~ $452

ASP.NET AJAX 應用剖析立即上手
~~$600~~ $480

現代嵌入式系統開發專案實務－菜鳥成長日誌與專案經理的私房菜
$1,440

Handbook of Digital Forensics and Investigation (Paperback)
~~$1,630~~ $1,548

C# 4.0 How-To (Paperback)
~~$1,810~~ $1,719

C# in Depth, 2/e (Paperback)
~~$2,160~~ $2,052

Digital Forensics with Open Source Tools (Paperback)

商品描述

Description

Text Mining Application Programming teaches software developers how to mine the vast amounts of information available on the Web, internal networks, and desktop files and turn it into usable data. The book helps developers understand the problems associated with managing unstructured text, and explains how to build your own mining tools using standard statistical methods from Information Theory, Artificial Intelligence, and Operations Research. Each of the topics covered are thoroughly explained and then a practical implementation is provided.

The book begins with a brief overview of text data, where it can be found, and the typical search engines and tools used to search and gather this text. It details how to build tools for extracting and using the text, and covers the mathematics behind many of the algorithms used in building these tools. From there you’ll learn how to build tokens from text, construct indexes, and detect patterns in text. You’ll also find methods to extract the names of people, places, and organizations from an email, a news article, or a web page. The next portion of the book teaches you how to find information on the Web, the structure of the Web, and building spiders to crawl the Web. Text categorization is also described in the context of managing email. The final part of the book covers information monitoring, summarization, and a simple Question & Answer (Q&A) system. The code used in the book is written in Perl, but knowledge of Perl is not necessary to run the software. Developers with an intermediate level of experience with Perl can customize the software. Although the book is about programming, methods are explained with English-like pseudocode and the source code is provided on the CD-ROM.

After reading this book you’ll be ready to tap into the bevy of information available online in ways you never thought possible.

Features

Teaches developers how to build text mining applications to manage vast amounts of text and turn it into useful data

Covers key topics such as information extraction, clustering, building spiders, text categorization, summarization, and natural language query systems

Shows step-by-step techniques for implementing text mining solutions, and provides customizable solutions

商品描述(中文翻譯)

**描述**

《文本挖掘應用程式設計》教導軟體開發人員如何從網路、內部網路和桌面檔案中挖掘大量可用的資訊，並將其轉化為可用的數據。本書幫助開發人員理解管理非結構化文本所面臨的問題，並解釋如何使用資訊理論、人工智慧和運籌學中的標準統計方法來構建自己的挖掘工具。每個主題都進行了詳細的解釋，並提供了實際的實作範例。

本書首先簡要概述了文本數據、其來源以及用於搜尋和收集這些文本的典型搜尋引擎和工具。接著詳細說明如何構建提取和使用文本的工具，並涵蓋了許多用於構建這些工具的算法背後的數學知識。然後，您將學習如何從文本中構建標記、構建索引以及檢測文本中的模式。您還會找到從電子郵件、新聞文章或網頁中提取人名、地名和組織名稱的方法。本書的下一部分教您如何在網路上尋找資訊、網路的結構，以及構建爬蟲來爬取網路。文本分類也在管理電子郵件的背景下進行了描述。本書的最後部分涵蓋了資訊監控、摘要以及一個簡單的問答系統。本書中使用的程式碼是用 Perl 編寫的，但運行軟體並不需要 Perl 知識。具有中級 Perl 經驗的開發人員可以自定義該軟體。雖然本書是關於程式設計，但方法是用類似英語的偽代碼進行解釋，並且源代碼隨附在 CD-ROM 中。

閱讀完本書後，您將準備好以您從未想過的方式利用網上豐富的資訊。

**特點**

- 教導開發人員如何構建文本挖掘應用程式，以管理大量文本並將其轉化為有用的數據
- 涵蓋關鍵主題，如資訊提取、聚類、構建爬蟲、文本分類、摘要和自然語言查詢系統
- 展示逐步實施文本挖掘解決方案的技術，並提供可自定義的解決方案