Document category |
Contribution to a conference or symposium |
Title |
A Multi-Level Context-Dependent Prosodic Model Applied to Durational Modeling |
Main author |
Nicolas Obin |
Co-authors |
Xavier Rodet, Anne Lacheret-Dujour |
Conference / symposium |
Interspeech. Brighton: 2009 |
Peer-reviewed |
Yes |
Year |
2009 |
Editorial status |
Unpublished |
Abstract |
We present a multi-level prosodic model based on the estimation of prosodic parameters over a set of well-defined linguistic units. Different linguistic units are used to represent different scales of prosodic variation (local and global forms), so that the linguistic factors that explain the variations of the prosodic parameters can be estimated independently at each level. The model is applied to the modeling of syllable-based durational parameters on two read speech corpora: laboratory speech and acted speech. Compared to a syllable-based baseline model, the proposed approach improves performance in terms of the temporal organization of the predicted durations (correlation score) and reduces model complexity, while showing comparable performance in terms of relative prediction error. |
Keywords |
speech synthesis / prosody / multi-level model / context-dependent model |
Team |
Analyse et synthèse sonores |
Call number |
Obin09b |
Online version URL |
http://architexte.ircam.fr/textes/Obin09b/index.pdf |
|