Prosody Generation Model for TTS Systems

Segmental Durations and F0 Contours with Fujisaki Model

Fotogalerie

João Paulo Teixeira

Prosody Generation Model for TTS Systems

Segmental Durations and F0 Contours with Fujisaki Model

Broschiertes Buch

Jetzt bewerten Jetzt bewerten

Autorenporträt

Andere Kunden interessierten sich auch für

K. Sreenivasa Rao
Predicting Prosody from Text for Text-to-Speech Synthesis

41,99 €
Poonam Shashikant Shetake
Devnagari Text To Speech Conversion

25,99 €
R.I. Damper (Hrsg.)
Data-Driven Techniques in Speech Synthesis

121,99 €
Marcelo Sampaio de Alencar
Music Science

142,99 €
Data-Driven Techniques in Speech Synthesis

117,99 €
Manikandan Mani
Enhancement of PQ on Wind Generation Systems by SMES coil

25,99 €
Vaibhav Hendre
Antenna Selection in MIMO for Future Generation Wireless Systems

49,99 €

Produktbeschreibung

This book presents the development of a prosody system for text-to-speech (TTS) applications. The prosody is responsible for a communicative intention and guarantees some naturalness in the uttered speech. The prosodic features consist in the imposition of the timing, characterized by the segmental durations and pauses, the intonation, characterized by the fundamental frequency (F0) curve, and by the intensity curve. The proposed prosody model consists of several sub-models, namely, the duration model to predict the segmental durations and the model to predict the F0 pattern. The segmental durations model consists of one ANN carefully selected concerning its architecture and type as well as the set of input features with the objective of minimizing the error between predicted and measured durations. One alternative model, is based on same considerations but uses one dedicated ANN for each phoneme. The alternative model, with dedicated ANNs, improved the final performance. The proposed model to predict the F0 contour is based on the Fujisaki model and consists of two sub-models. One predicts the Phrase Commands parameters and the other predicts the Accent Commands parameters.

Produktdetails

Produktdetails
Verlag: LAP Lambert Academic Publishing
Aufl.
Seitenzahl: 276
Erscheinungstermin: 8. August 2012
Englisch
Abmessung: 220mm x 150mm x 17mm
Gewicht: 429g
ISBN-13: 9783659162770
ISBN-10: 3659162779
Artikelnr.: 36215242

Herstellerkennzeichnung
Books on Demand GmbH
In de Tarpen 42
22848 Norderstedt
info@bod.de
040 53433511

Produktdetails

Verlag: LAP Lambert Academic Publishing
Aufl.
Seitenzahl: 276
Erscheinungstermin: 8. August 2012
Englisch
Abmessung: 220mm x 150mm x 17mm
Gewicht: 429g
ISBN-13: 9783659162770
ISBN-10: 3659162779
Artikelnr.: 36215242

Herstellerkennzeichnung
Books on Demand GmbH
In de Tarpen 42
22848 Norderstedt
info@bod.de
040 53433511

Autorenporträt

Doutorado pela FEUP em Engenharia Eletrotécnica e de Computadores, é professor no IPB na área de processamento de sinal. Pertence ao quadro editorial/comissão científica de algumas revistas e conferências científicas. É autor de dois livros, de alguns capítulos de livro e diversos artigos científicos sobre processamento de sinal e redes neuronais.