Developments in Speech Synthesis

Mark Tatham, Katherine Morton

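  • Publisher: Wiley
  • Publication date: 2005-04-15
  • List price: $4,500
  • Sale price: $3,825 (15% off)
  • Language: English
  • Pages: 356
  • Binding: Hardcover
  • ISBN: 047085538X
  • ISBN-13: 9780470855386
  • Related categories: Artificial Intelligence, Speech Recognition
  • Ships immediately (stock < 4)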

Description:

As the need grows to understand the processes involved in producing and perceiving spoken language, this timely publication addresses that need in an accessible reference. Drawing on material from many years of teaching and research, Developments in Speech Synthesis provides a complete account of the theory of speech. By bringing together the common goals and methods of speech synthesis in a single resource, the book points the way towards a comprehensive view of the processes involved in human speech. It also covers applications in speech technology and speech synthesis.

It is ideal for intermediate students of linguistics and phonetics who wish to proceed further, and for researchers and engineers in telecommunications working in speech technology and speech synthesis who need a comprehensive overview of the field and an understanding of the objectives and achievements of the study of speech production and perception.

Table of Contents:

Acknowledgements.

Introduction.

Part I: Current Work.

1. High-Level and Low-Level Synthesis.

2. Low-Level Synthesisers: Current Status.

3. Text-To-Speech.

4. Different Low-Level Synthesisers: What Can Be Expected?

5. Low-Level Synthesis Potential.

Part II: A New Direction for Speech Synthesis.

6. A View of Naturalness.

7. Physical Parameters and Abstract Information Channels.

8. Variability and System Integrity.

9. Automatic Speech Recognition.

Part III: High-Level Control.

10. The Need for High-Level Control.

11. The Input to High-Level Control.

12. Problems for Automatic Text Markup.

Part IV: Areas for Improvement.

13. Filling Gaps.

14. Using Different Units.

15. Waveform Concatenation Systems: Naturalness and Large Databases.

16. Unit Selection Systems.

Part V: Markup.

17. VoiceXML.

18. Speech Synthesis Markup Language (SSML).

19. SABLE.

20. The Need for Prosodic Markup.

Part VI: Strengthening the High-Level Model.

21. Speech.

22. Basic Concepts.

23. Underlying Basic Disciplines: Expression Studies.

24. Labelling Expressive/Emotive Content.

25. The Proposed Model.

26. Types of Model.

Part VII: Expanded Static and Dynamic Modelling.

27. The Underlying Linguistics System.

28. Planes for Synthesis.

Part VIII: The Prosodic Framework, Coding and Intonation.

29. The Phonological Prosodic Framework.

30. Sample Code.

31. XML Coding.

32. Prosody: General.

33. Phonological and Phonetic Models of Intonation.

Part IX: Approaches to Natural-Sounding Synthesis.

34. The General Approach.

35. The Expression Wrapper in XML.

36. Advantages of XML in Wrapping.

37. Considerations in Characterising Expression/Emotion.

38. Summary.

Part X: Concluding Overview.

References.

Author Index.

Index.
