Skip to main content
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)Volume 9811 LNCS, 2016, Pages 59-6618th International Conference on Speech and Computer, SPECOM 2016; Budapest; Hungary; 23 August 2016 through 27 August 2016; Code 179989

A linguistic interpretation of the atom decomposition of fundamental frequency contour for American English(Conference Paper)

  Save all to author list
  • aFaculty of Technical Sciences, University of Novi Sad, Novi Sad, Serbia
  • bFaculty of Electrical Engineering and Information Techologies, Ss. Cyril and Methodius University, Skopje, North Macedonia

Abstract

One of the most recently proposed techniques for modeling the prosody of an utterance is the decomposition of its pitch, duration and/or energy contour into physiologically motivated units called atoms, based on matching pursuit. Since this model is based on the physiology of the production of sentence intonation, it is essentially language independent. However, the intonation of an utterance in a particular language is obviously under the influence of factors of a predominantly linguistic nature. In this research, restricted to the case of American English with prosody annotated using standard ToBI conventions, we have shown that, under certain mild constraints, the positive and negative atoms identified in the pitch contour coincide very well with high and low pitch accents and phrase accents of ToBI. By giving a linguistic interpretation of the atom decomposition model, this research enables its practical use in domains such as speech synthesis or cross-lingual prosody transfer. © Springer International Publishing Switzerland 2016.

Author keywords

Atom decompositionPitch contourToBI

Indexed keywords

Engineering controlled terms:AtomsLinguisticsPhysiologySpeech synthesis
Engineering uncontrolled termsAmerican EnglishAtom decompositionEnergy contoursFundamental frequency contourLanguage independentsMatching pursuitPitch contoursToBI
Engineering main heading:Decomposition

Funding details

Funding sponsor Funding number Acronym
Schweizerischer Nationalfonds zur Förderung der Wissenschaftlichen Forschung
See opportunities by SNF
SNF
Ministarstvo Prosvete, Nauke i Tehnološkog RazvojaCRSII2-147611/1,TR32035MPNTR
Ministarstvo Prosvete, Nauke i Tehnološkog RazvojaMPNTR
  • 1

    The presented study was supported in part by the Ministry of Education, Science and Technological Development of the Republic of Serbia (grant TR32035), and was carried out within the SCOPES project “SP2: SCOPES Project for Speech Prosody” (No. CRSII2-147611/1), supported by Swiss National Science Foundation. The authors are grateful to the company Speech Morphing, Inc. from Campbell, CA, USA, for providing the speech corpus used in the experiments.

  • ISSN: 03029743
  • ISBN: 978-331943957-0
  • Source Type: Book Series
  • Original language: English
  • DOI: 10.1007/978-3-319-43958-7_6
  • Document Type: Conference Paper
  • Volume Editors: Ronzhin A.,Potapova R.,Nemeth G.
  • Sponsors:
  • Publisher: Springer Verlag

  Sečujski, M.; Faculty of Technical Sciences, University of Novi Sad, Novi Sad, Serbia;
© Copyright 2017 Elsevier B.V., All rights reserved.

Cited by 1 document

Honnet, P.-E. , Gerazov, B. , Gjoreski, A.
Intonation modelling using a muscle model and perceptually weighted matching pursuit
(2018) Speech Communication
View details of this citation
{"topic":{"name":"Fundamental Frequency; Speech Communication; Prosody","id":26476,"uri":"Topic/26476","prominencePercentile":12.833589,"prominencePercentileString":"12.834","overallScholarlyOutput":0},"dig":"2a16cf17e63e1c09acf1eda506ace145927a7c657777e6d83cf20596a634d507"}

SciVal Topic Prominence

Topic:
Prominence percentile: