Distant Speech Recognition (Hardcover)
暫譯: 遠程語音識別(精裝版)

Matthias Woelfel, John McDonough

  • 出版商: Wiley
  • 出版日期: 2009-06-01
  • 售價: $4,780
  • 貴賓價: 9.5$4,541
  • 語言: 英文
  • 頁數: 594
  • 裝訂: Hardcover
  • ISBN: 0470517042
  • ISBN-13: 9780470517048
  • 相關分類: 語音辨識 Speech-recognition
  • 海外代購書籍(需單獨結帳)

買這商品的人也買了...

相關主題

商品描述

A complete overview of distant automatic speech recognition

 

The performance of conventional Automatic Speech Recognition (ASR) systems degrades dramatically as soon as the microphone is moved away from the mouth of the speaker. This is due to a broad variety of effects such as background noise, overlapping speech from other speakers, and reverberation. While traditional ASR systems underperform for speech captured with far-field sensors, there are a number of novel techniques within the recognition system as well as techniques developed in other areas of signal processing that can mitigate the deleterious effects of noise and reverberation, as well as separating speech from overlapping speakers.

 

Distant Speech Recognitionpresents a contemporary and comprehensive description of both theoretic abstraction and practical issues inherent in the distant ASR problem.

 

Key Features:

 

  • Covers the entire topic of distant ASR and offers practical solutions to overcome the problems related to it
  • Provides documentation and sample scripts to enable readers to construct state-of-the-art distant speech recognition systems
  • Gives relevant background information in acoustics and filter techniques,
  • Explains the extraction and enhancement of classification relevant speech features
  • Describes maximum likelihood as well as discriminative parameter estimation, and maximum likelihood normalization techniques
  • Discusses the use of multi-microphone configurations for speaker tracking and channel combination
  • Presents several applications of the methods and technologies described in this book
  • Accompanying website with open source software and tools to construct state-of-the-art distant speech recognition systems

 

This reference will be an invaluable resource for researchers, developers, engineers and other professionals, as well as advanced students in speech technology, signal processing, acoustics, statistics and artificial intelligence fields.

商品描述(中文翻譯)

遠距自動語音辨識的完整概述

傳統自動語音辨識(ASR)系統的性能在麥克風離開說話者的嘴巴時會急劇下降。這是由於多種因素造成的,例如背景噪音、其他說話者的重疊語音以及混響。雖然傳統的 ASR 系統在使用遠場感測器捕捉語音時表現不佳,但在辨識系統內部以及其他信號處理領域中,有許多新技術可以減輕噪音和混響的有害影響,並將語音從重疊的說話者中分離出來。

遠距語音辨識提供了對遠距 ASR 問題中理論抽象和實際問題的當代全面描述。

主要特點:


  • 涵蓋遠距 ASR 的整個主題,並提供實用解決方案以克服相關問題

  • 提供文檔和範例腳本,幫助讀者構建最先進的遠距語音辨識系統

  • 提供聲學和濾波技術的相關背景資訊

  • 解釋分類相關語音特徵的提取和增強

  • 描述最大似然估計和區別性參數估計,以及最大似然正規化技術

  • 討論多麥克風配置在說話者追蹤和通道組合中的應用

  • 介紹本書中所描述的方法和技術的幾個應用

  • 附帶網站提供開源軟體和工具,以構建最先進的遠距語音辨識系統

這本參考書將成為研究人員、開發者、工程師及其他專業人士,以及語音技術、信號處理、聲學、統計和人工智慧領域的高級學生的重要資源。