Audio Source Separation and Speech Enhancement

  • 出版商: Wiley
  • 出版日期: 2018-10-22
  • 定價: $4,800
  • 售價: 9.5$4,560
  • 語言: 英文
  • 頁數: 504
  • 裝訂: Hardcover
  • ISBN: 1119279895
  • ISBN-13: 9781119279891
  • 相關分類: Machine Learning
  • 立即出貨 (庫存=1)

買這商品的人也買了...

商品描述

Learn the technology behind hearing aids, Siri, and Echo 

Audio source separation and speech enhancement aim to extract one or more source signals of interest from an audio recording involving several sound sources. These technologies are among the most studied in audio signal processing today and bear a critical role in the success of hearing aids, hands-free phones, voice command and other noise-robust audio analysis systems, and music post-production software.

Research on this topic has followed three convergent paths, starting with sensor array processing, computational auditory scene analysis, and machine learning based approaches such as independent component analysis, respectively. This book is the first one to provide a comprehensive overview by presenting the common foundations and the differences between these techniques in a unified setting.

Key features:

  • Consolidated perspective on audio source separation and speech enhancement.
  • Both historical perspective and latest advances in the field, e.g. deep neural networks.
  • Diverse disciplines: array processing, machine learning, and statistical signal processing.
  • Covers the most important techniques for both single-channel and multichannel processing.

This book provides both introductory and advanced material suitable for people with basic knowledge of signal processing and machine learning. Thanks to its comprehensiveness, it will help students select a promising research track, researchers leverage the acquired cross-domain knowledge to design improved techniques, and engineers and developers choose the right technology for their target application scenario. It will also be useful for practitioners from other fields (e.g., acoustics, multimedia, phonetics, and musicology) willing to exploit audio source separation or speech enhancement as pre-processing tools for their own needs.

商品描述(中文翻譯)

學習助聽器、Siri和Echo背後的技術

音頻源分離和語音增強旨在從涉及多個聲源的音頻錄音中提取一個或多個感興趣的源信號。這些技術是當今音頻信號處理中最研究的領域之一,對於助聽器、免提電話、語音命令和其他抗噪音音頻分析系統以及音樂後期製作軟件的成功起著關鍵作用。

對這個主題的研究遵循了三條匯合的路徑,分別從傳感器陣列處理、計算聽覺場景分析和基於機器學習的方法(如獨立成分分析)開始。本書是第一本在統一的框架下提供全面概述的書籍,介紹了這些技術的共同基礎和差異。

主要特點:

- 對音頻源分離和語音增強提供整合的觀點。
- 包括領域的歷史背景和最新進展,例如深度神經網絡。
- 涵蓋多個學科:陣列處理、機器學習和統計信號處理。
- 涵蓋單通道和多通道處理的最重要技術。

本書提供了適合具有基本信號處理和機器學習知識的人的入門和高級材料。由於其全面性,它將幫助學生選擇有前途的研究方向,研究人員利用跨領域知識設計改進的技術,以及工程師和開發人員為目標應用場景選擇合適的技術。對於其他領域的從業人員(例如聲學、多媒體、語音學和音樂學),希望利用音頻源分離或語音增強作為其自身需求的預處理工具,本書也將非常有用。