Modelling Prominence and Emphasis Improves Unit-Selection Synthesis

Penn collection
Departmental Papers (CIS)
Subject
speech synthesis
prosody
prominence
pitch accent
unit selection
Author
Strom, Volker
Clark, Robert
Vazquez-Alvarez, Yolanda
Brenier, Jason
King, Simon
Jurafsky, Dan
Abstract

We describe the results of large-scale perception experiments showing improvements in synthesising two distinct kinds of prominence: standard pitch accents and strong emphatic accents. Previously, prominence assignment has mainly been evaluated by computing accuracy on a prominence-labelled test set. By contrast, we integrated an automatic pitch-accent classifier into the unit-selection target cost and showed that listeners preferred the resulting synthesised sentences. We also describe an improved recording script for collecting emphatic accents, and show that generating emphatic accents leads to further improvements in the fiction genre over incorporating pitch accents only. Finally, we show differences in the effects of prominence between child-directed speech and the news and fiction genres.
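
The idea of folding predicted prominence into a unit-selection target cost can be illustrated with a minimal sketch. It is not the authors' implementation: the `Unit` class, the three-level prominence scale, the weights, and the greedy per-target selection are all assumptions made for illustration. A real unit-selection system would also use a join cost and a search over whole candidate sequences.

```python
from dataclasses import dataclass

# Hypothetical ordinal scale for prominence labels (an assumption,
# not taken from the paper).
PROMINENCE_LEVEL = {"none": 0, "accent": 1, "emphasis": 2}


@dataclass(frozen=True)
class Unit:
    phone: str       # phone identity of the unit
    prominence: str  # "none", "accent", or "emphasis"


def prominence_cost(target: str, candidate: str) -> float:
    """Mismatch penalty in [0, 1]: distance on the ordinal scale."""
    diff = abs(PROMINENCE_LEVEL[target] - PROMINENCE_LEVEL[candidate])
    return diff / 2.0


def target_cost(target: Unit, candidate: Unit,
                w_phone: float = 1.0, w_prom: float = 0.5) -> float:
    """Weighted sum of sub-costs; the weights are illustrative only."""
    phone_cost = 0.0 if target.phone == candidate.phone else 1.0
    return (w_phone * phone_cost
            + w_prom * prominence_cost(target.prominence, candidate.prominence))


def best_candidate(target: Unit, candidates: list[Unit]) -> Unit:
    """Pick the candidate with the lowest target cost (no join cost here)."""
    return min(candidates, key=lambda c: target_cost(target, c))


# The target prominence would come from a classifier run on the input text.
target = Unit("ae", "accent")
candidates = [Unit("ae", "none"), Unit("ae", "accent"), Unit("eh", "accent")]
chosen = best_candidate(target, candidates)
```

With these assumed weights, a candidate whose prominence label matches the predicted one is preferred over an otherwise identical unaccented candidate, which is the effect the abstract attributes to the modified target cost.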

Date of presentation
2007-08-01
Conference name
Interspeech 2007
Comments
Published in the Proceedings of Interspeech 2007, August 2007. URL: http://hdl.handle.net/1842/1992