

One of the most recently proposed techniques for modeling the prosody of an utterance is the decomposition of its pitch, duration and/or energy contour into physiologically motivated units called atoms, based on matching pursuit. Since this model is based on the physiology of the production of sentence intonation, it is essentially language-independent. However, the intonation of an utterance in a particular language is also shaped by predominantly linguistic factors. In this research, restricted to American English with prosody annotated using standard ToBI conventions, we show that, under certain mild constraints, the positive and negative atoms identified in the pitch contour coincide very well with the high and low pitch accents and phrase accents of ToBI. By giving a linguistic interpretation of the atom decomposition model, this research enables its practical use in domains such as speech synthesis or cross-lingual prosody transfer. © Springer International Publishing Switzerland 2016.
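
The decomposition idea mentioned in the abstract can be illustrated with a minimal sketch of plain matching pursuit over a dictionary of shifted, bump-shaped atoms. Everything concrete here is an assumption made for illustration and is not taken from the paper: the gamma-like kernel shape, the parameters `k` and `theta`, the frame counts, and the function names `gamma_atom`, `build_dictionary` and `matching_pursuit` are all hypothetical, and the published model may differ in its atom shapes, its similarity measure and its constraints.

```python
import numpy as np

def gamma_atom(length, k=6, theta=4.0):
    """Hypothetical unit-norm, gamma-shaped kernel sampled at integer frames."""
    t = np.arange(length, dtype=float)
    g = t ** (k - 1) * np.exp(-t / theta)
    return g / np.linalg.norm(g)

def build_dictionary(n_frames, atom_len=40, thetas=(2.0, 4.0)):
    """Stack shifted copies of each kernel as rows of a dictionary matrix."""
    atoms = []
    for theta in thetas:
        kernel = gamma_atom(atom_len, theta=theta)
        for shift in range(n_frames - atom_len + 1):
            row = np.zeros(n_frames)
            row[shift:shift + atom_len] = kernel
            atoms.append(row)
    return np.array(atoms)

def matching_pursuit(signal, dictionary, n_atoms=5):
    """Greedy matching pursuit: pick the best-correlated atom, subtract, repeat."""
    residual = np.asarray(signal, dtype=float).copy()
    selected = []
    for _ in range(n_atoms):
        corr = dictionary @ residual           # inner product with every atom
        idx = int(np.argmax(np.abs(corr)))     # strongest match, either sign
        amp = float(corr[idx])                 # signed amplitude of that atom
        residual -= amp * dictionary[idx]      # remove its contribution
        selected.append((idx, amp))
    return selected, residual

# Synthetic contour built from one positive and one negative atom.
D = build_dictionary(n_frames=120)
contour = 2.0 * D[10] - 1.5 * D[60]
atoms, residual = matching_pursuit(contour, D, n_atoms=2)
for idx, amp in atoms:
    print(idx, "positive" if amp > 0 else "negative", round(amp, 3))
```

In this toy setting the sign of each recovered amplitude is what would be compared against ToBI labels: positive atoms as candidates for high accents and negative atoms as candidates for low accents, in the sense described in the abstract. The alignment procedure itself is not sketched here.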
| Engineering controlled terms | Atoms; Linguistics; Physiology; Speech synthesis |
|---|---|
| Engineering uncontrolled terms | American English; Atom decomposition; Energy contours; Fundamental frequency contour; Language independents; Matching pursuit; Pitch contours; ToBI |
| Engineering main heading | Decomposition |
| Funding sponsor | Funding number | Acronym |
|---|---|---|
| Schweizerischer Nationalfonds zur Förderung der Wissenschaftlichen Forschung | CRSII2-147611/1 | SNF |
| Ministarstvo Prosvete, Nauke i Tehnološkog Razvoja | TR32035 | MPNTR |
This study was supported in part by the Ministry of Education, Science and Technological Development of the Republic of Serbia (grant TR32035), and was carried out within the SCOPES project “SP2: SCOPES Project for Speech Prosody” (No. CRSII2-147611/1), supported by the Swiss National Science Foundation. The authors are grateful to Speech Morphing, Inc. of Campbell, CA, USA, for providing the speech corpus used in the experiments.
Sečujski, M.; Faculty of Technical Sciences, University of Novi Sad, Novi Sad, Serbia
© Copyright 2017 Elsevier B.V., All rights reserved.