Product description
The areas of natural language processing and computational linguistics have continued to grow in recent years, driven by the demand to automatically process text and spoken data. With the processing power and techniques now available, research is scaling up from lab prototypes to real-world, proven applications.
This book teaches the principles of natural language processing. It first covers practical linguistic issues such as encoding and annotation schemes; the definition of words, tokens, parts of speech and morphology; and key machine-learning concepts such as entropy, regression and classification, which are used throughout the book. It then details the language-processing functions involved: part-of-speech tagging with rule-based and stochastic techniques, phrase-structure grammars written in Prolog, syntactic formalisms and parsing techniques, semantics, predicate logic and lexical semantics, and the analysis of discourse with applications in dialogue systems. A key feature of the book is the author's hands-on approach throughout, with sample code in Prolog and Perl, extensive exercises, and a detailed introduction to Prolog. The reader is supported by a companion website that contains teaching slides, programs and additional material.
The second edition is a complete revision of the techniques presented in the book, reflecting advances in the field. The author redesigned or updated all the chapters, added two new ones, and considerably expanded the sections on machine-learning techniques.