-
出版商:
Morgan & Claypool
-
出版日期:
2015-05-01
-
售價:
$2,710
-
貴賓價:
9.5 折
$2,575
-
語言:
英文
-
頁數:
254
-
裝訂:
Paperback
-
ISBN:
1627054464
-
ISBN-13:
9781627054461
商品描述
Artificial Intelligence federates numerous scientific fields in the aim of developing machines able to assist human operators performing complex treatments; most of which demand high cognitive skills (e.g. learning or decision processes). Central to this quest is to give machines the ability to estimate the likeness or similarity between things in the way human beings estimate the similarity between stimuli. In this context, this book focuses on semantic measures: approaches designed for comparing semantic entities such as units of language, e.g. words, sentences, or concepts and instances defined into knowledge bases. The aim of these measures is to assess the similarity or relatedness of such semantic entities by taking into account their semantics, i.e. their meaning; intuitively, the words tea and coffee, which both refer to stimulating beverage, will be estimated to be more semantically similar than the words toffee (confection) and coffee, despite that the last pair has a higher syntactic similarity. The two state-of-the-art approaches for estimating and quantifying semantic similarities/relatedness of semantic entities are presented in detail: the first one relies on corpora analysis and is based on Natural Language Processing techniques and semantic models while the second is based on more or less formal, computer-readable and workable forms of knowledge such as semantic networks, thesauri or ontologies. Semantic measures are widely used today to compare units of language, concepts, instances or even resources indexed by them (e.g., documents, genes). They are central elements of a large variety of Natural Language Processing applications and knowledge-based treatments, and have therefore naturally been subject to intensive and interdisciplinary research efforts during last decades. Beyond a simple inventory and categorization of existing measures, the aim of this monograph is to convey novices as well as researchers of these domains toward a better understanding of semantic similarity estimation and more generally semantic measures. To this end, we propose an in-depth characterization of existing proposals by discussing their features, the assumptions on which they are based and empirical results regarding their performance in particular applications. By answering these questions and by providing a detailed discussion on the foundations of semantic measures, our aim is to give the reader key knowledge required to: (i) select the more relevant methods according to a particular usage context, (ii) understand the challenges offered to this field of study, (iii) distinguish room of improvements for state-of-the-art approaches and (iv) stimulate creativity toward the development of new approaches. In this aim, several definitions, theoretical and practical details, as well as concrete applications are presented.
商品描述(中文翻譯)
人工智慧整合了眾多科學領域,旨在開發能夠協助人類操作員執行複雜處理的機器;這些處理大多需要高認知技能(例如學習或決策過程)。在這個追求的核心是賦予機器評估事物之間相似性或類似性的能力,這種能力類似於人類對刺激之間相似性的評估。在這個背景下,本書專注於語義度量:旨在比較語義實體的方法,例如語言單位,如單詞、句子或定義在知識庫中的概念和實例。這些度量的目的是評估這些語義實體的相似性或相關性,考慮到它們的語義,即它們的意義;直觀地說,單詞「茶」和「咖啡」都指代刺激性飲料,因此會被評估為在語義上比「太妃糖」(糖果)和「咖啡」更相似,儘管後者在句法上具有更高的相似性。兩種最先進的方法用於估計和量化語義實體的語義相似性/相關性,將在此詳細介紹:第一種依賴於語料庫分析,基於自然語言處理技術和語義模型,而第二種則基於或多或少正式的、計算機可讀且可操作的知識形式,如語義網絡、同義詞庫或本體論。語義度量今天被廣泛用於比較語言單位、概念、實例甚至由它們索引的資源(例如文件、基因)。它們是各種自然語言處理應用和基於知識的處理的核心元素,因此在過去幾十年中,自然成為了密集和跨學科研究的主題。除了對現有度量的簡單清單和分類外,本專著的目的是引導新手以及這些領域的研究者更好地理解語義相似性估計以及更一般的語義度量。為此,我們提出對現有提案的深入特徵描述,討論它們的特點、所基於的假設以及在特定應用中的實證結果。通過回答這些問題並提供有關語義度量基礎的詳細討論,我們的目標是為讀者提供關鍵知識,以便:(i) 根據特定使用情境選擇更相關的方法,(ii) 理解該研究領域所面臨的挑戰,(iii) 辨別最先進方法的改進空間,以及 (iv) 激發創造力以開發新方法。為此,將呈現幾個定義、理論和實踐細節,以及具體應用。