Modelling Prominence and Emphasis Improves Unit-Selection Synthesis

Strom, Volker; Nenkova, Ani; Clark, Robert; Vazquez-Alvarez, Yolanda; Brenier, Jason; King, Simon; Jurafsky, Dan

Files

is_p540.pdf (56.09 KB)

Penn collection

Departmental Papers (CIS)

Subject

speech synthesis
prosody
prominence
pitch accent
unit selection

Permalink

https://repository.upenn.edu/handle/20.500.14332/6439

Author

Strom, Volker
Nenkova, Ani
Clark, Robert
Vazquez-Alvarez, Yolanda
Brenier, Jason
King, Simon
Jurafsky, Dan

Abstract

We describe the results of large-scale perception experiments showing improvements in synthesising two distinct kinds of prominence: standard pitch accents and strong emphatic accents. Previously, prominence assignment has mainly been evaluated by computing accuracy on a prominence-labelled test set. By contrast, we integrated an automatic pitch-accent classifier into the unit-selection target cost and showed that listeners preferred the resulting synthesised sentences. We also describe an improved recording script for collecting emphatic accents, and show that, in the fiction genre, generating emphatic accents yields further improvements over incorporating pitch accents alone. Finally, we show that the effects of prominence differ between child-directed speech and the news and fiction genres.
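
To make the idea of folding prominence into a target cost more concrete, the following is a minimal illustrative sketch and not the authors' implementation. It assumes a hypothetical three-level prominence feature (none, pitch accent, emphasis) already predicted for each target unit, and a target cost that is a weighted sum of feature mismatches. The names `Unit`, `target_cost`, `best_candidate` and the weights are all assumptions introduced for illustration; a real unit-selection system would also include a join cost and a search over whole unit sequences, which are omitted here.

```python
from dataclasses import dataclass

# Hypothetical prominence levels, assumed for illustration only.
NONE, ACCENT, EMPHASIS = 0, 1, 2


@dataclass(frozen=True)
class Unit:
    phone: str        # phonetic identity of the unit
    prominence: int   # NONE, ACCENT or EMPHASIS


def target_cost(target: Unit, candidate: Unit,
                w_phone: float = 1.0, w_prom: float = 0.5) -> float:
    """Weighted sum of feature mismatches between target and candidate."""
    # 0 if the phones match, 1 otherwise.
    phone_mismatch = 0.0 if target.phone == candidate.phone else 1.0
    # Distance between prominence levels, scaled to the range [0, 1].
    prom_mismatch = abs(target.prominence - candidate.prominence) / 2.0
    return w_phone * phone_mismatch + w_prom * prom_mismatch


def best_candidate(target: Unit, candidates: list[Unit]) -> Unit:
    """Pick the database unit with the lowest target cost (join cost omitted)."""
    return min(candidates, key=lambda c: target_cost(target, c))


if __name__ == "__main__":
    # The target specification asks for an accented "ae" unit.
    target = Unit("ae", ACCENT)
    database = [Unit("ae", NONE), Unit("ae", ACCENT), Unit("ae", EMPHASIS)]
    print(best_candidate(target, database))
```

In this sketch, a candidate whose prominence label matches the predicted label incurs no prominence penalty, so accented targets tend to be realised with units recorded in accented contexts. The relative weight `w_prom` is a free parameter here and would need tuning in practice.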

Date of presentation

2007-08-01

Conference name

Interspeech 2007

Conference dates

August 2007

Comments

Published in the Proceedings of Interspeech 2007, August 2007. URL: http://hdl.handle.net/1842/1992

Collection

Presentations