Please use this identifier to cite or link to this item: http://hdl.handle.net/10119/4908

Title: Toward a rule-based synthesis of emotional speech on linguistic description of perception
Authors: Huang, Chun-Fang; Akagi, Masato
Issue Date: 2005
Publisher: Springer
Journal name: Lecture Notes in Computer Science
Volume: 3784
Start page: 366
End page: 373
DOI: 10.1007/11573548_47
Abstract: This paper reports rules for morphing a voice so that listeners perceive it as carrying particular primitive features, for example so that it sounds more “bright” or “dark”. In previous work we proposed a three-layered model of the perception of emotional speech, consisting of emotional speech, primitive features, and acoustic features. Through experiments and acoustic analysis, we established the relationships among the three layers and reported that these relationships are significant. We then adopted a bottom-up method to verify them: we morphed (resynthesized) a speech voice by combining acoustic features in the bottommost layer to produce a voice in which listeners could perceive one or more primitive features, which could in turn be perceived as different categories of emotion. The intermediate results show that the relationships in the previously built model are valid.
Rights: This is the author-created version of Springer, Chun-Fang Huang and Masato Akagi, Lecture Notes in Computer Science, 3784, 2005, 366-373. The original publication is available at www.springerlink.com, http://dx.doi.org/10.1007/11573548_47
URI: http://hdl.handle.net/10119/4908
Material Type: author
Appears in Collections: b10-1. 雑誌掲載論文 (Journal Articles)

Files in This Item:

ACII2005_Huang.pdf (395 Kb, Adobe PDF)
