Audio Processing and Speech Recognition: Concepts, Techniques and Research Overviews (SpringerBriefs in Applied Sciences and Technology)
Soumya Sen, Anjan Dutta, Nilanjan Dey
- Publisher: Springer
- Publication date: 2019-02-20
- List price: $2,370
- VIP price: 5% off, $2,252
- Language: English
- Pages: 96
- Binding: Paperback
- ISBN: 9811360979
- ISBN-13: 9789811360978
- Related categories: Speech recognition
Product description
This book offers an overview of audio processing, including the latest advances in the methodologies used in audio processing and speech recognition. First, it discusses the importance of audio indexing and the classical information retrieval problem, and presents two major indexing techniques, namely Large Vocabulary Continuous Speech Recognition (LVCSR) and Phonetic Search. It then offers brief insights into the human speech production system and its modeling, which are required to produce artificial speech. It also discusses the various components of an automatic speech recognition (ASR) system.
Describing the chronological developments in ASR systems, and briefly examining the statistical models used in ASR as well as the related mathematical deductions, the book summarizes a number of state-of-the-art classification techniques and their application in audio/speech classification.
By providing insights into various aspects of audio/speech processing and speech recognition, this book appeals to a wide audience, from researchers and postgraduate students to those new to the field.