Speech Coding and Synthesis Using Parametric Curves

Jesus and Cawley (1997)

In Proceedings of the 5th European Conference on Speech Communication and Technology (EuroSpeech'97), Volume 2, Rhodes, Greece, pp. 597-600

Accurate modeling of co-articulation, the context-sensitive merging of the boundaries between allophones in continuous speech, is vital for natural sounding speech synthesis. This paper describes initial research investigating the use of Bézier Curves to form models of co-articulation in human speech. A 12th order, pitch synchronous line spectral pair (LSP) analysis is performed on a corpus of 239 phonetically balanced sentences of English speech. The resulting data are divided to form an inventory of the diphones occurring in the speech database. The trajectory of each line spectral pair parameter through each diphone can then be represented by a single cubic Bézier curve segment, found using the Levenberg-Marquardt curve fitting method. Results are presented showing the accuracy of Bézier models of the co-articulation between different types of speech sounds.

JesusCawley1997.pdf (Acrobat file)

JesusCawley1997.ps (PostScript file)


Photo Gallery


An interactive Bézier segment. The source code (© Gavin Cawley).
Use your mouse to adjust the control points.

Last updated 12/9/2008
lmtj@ua.pt
Luís Miguel Teixeira de Jesus

Back to Homepage