Articulatory Speech Synthesis from Th

Articulatory Speech Synthesis from Th

Paperback

$40.00
Eligible for FREE SHIPPING
  • Get it by Wednesday, September 27 , Order now and choose Expedited Delivery during checkout.

Overview

Articulatory Speech Synthesis from Th by Steven Levinson

This book addresses the problem of articulatory speech synthesis based on computed vocal tract geometries and the basic physics of sound production in it. Unlike conventional methods based on analysis/synthesis using the well-known source filter model, which assumes the independence of the excitation and filter, we treat the entire vocal apparatus as one mechanical system that produces sound by means of fluid dynamics. The vocal apparatus is represented as a three-dimensional time-varying mechanism and the sound propagation inside it is due to the non-planar propagation of acoustic waves through a viscous, compressible fluid described by the Navier-Stokes equations.

We propose a combined minimum energy and minimum jerk criterion to compute the dynamics of the vocal tract during articulation. Theoretical error bounds and experimental results show that this method obtains a close match to the phonetic target positions while avoiding abrupt changes in the articulatory trajectory. The vocal folds are set into aerodynamic oscillation by the flow of air from the lungs. The modulated air stream then excites the moving vocal tract. This method shows strong evidence for source-filter interaction.

Based on our results, we propose that the articulatory speech production model has the potential to synthesize speech and provide a compact parameterization of the speech signal that can be useful in a wide variety of speech signal processing problems.

Table of Contents: Introduction / Literature Review / Estimation of Dynamic Articulatory Parameters / Construction of Articulatory Model Based on MRI Data / Vocal Fold Excitation Models / Experimental Results of Articulatory Synthesis / Conclusion

Product Details

ISBN-13: 9781598291780
Publisher: Morgan and Claypool Publishers
Publication date: 04/07/2008
Series: Synthesis Lectures on Speech and Audio Processing Series
Pages: 118
Product dimensions: 7.50(w) x 9.25(h) x 0.25(d)

Customer Reviews

Most Helpful Customer Reviews

See All Customer Reviews