
Please use this identifier to cite or link to this item: http://hdl.handle.net/10119/15770

Title: Study on Relationship between Degree of Emphasis and Acoustic Feature for Synthesizing Emphasized Speech
Authors: Ohtani, Yasuhiro
Akagi, Masato
Issue Date: 2019-03-06
Publisher: Research Institute of Signal Processing, Japan
Publication Title: 2019 RISP International Workshop on Nonlinear Circuits, Communications and Signal Processing (NCSP2019)
Start Page: 256
End Page: 259
Abstract: Humans can perceive not only the presence or absence of emphasis but also its degree in naturally emphasized speech. However, they cannot fully do so from synthesized speech. This paper focuses on two properties of fundamental frequency (F0) contours, the amount of decay from the accent nucleus and the variation between accent nuclei, and hypothesizes that these two properties are important for synthesizing emphasized speech. To examine this hypothesis, the paper clarifies the relationships between degrees of emphasis and F0 contours. Doing so requires comparing these relationships across stimuli, which in turn requires knowing the degree of emphasis of each stimulus and analyzing the variation of its F0 contour. A listening test was carried out to obtain the degree of emphasis of each stimulus. To analyze the variation of the F0 contour, the frequency at the barycentric point of the vowel was extracted from the F0 contour. These results yielded two findings: the degree of emphasis increases as the amount of decay from the accent nucleus to the next mora increases, and the variation of accent nuclei differs with and without emphasis. An experiment was then carried out to evaluate the hypothesis, using stimuli synthesized from non-emphasized voice by varying the amount of decay and the variation of accent nuclei. The results showed that the participants could perceive emphasis, including its degree, from the synthesized stimuli. This result clarified the relationships between the presence or absence of emphasis and the two findings. In addition, it indicates that the two properties named in the hypothesis are important for synthesizing emphasized speech that conveys the presence or absence of emphasis.
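The two F0 measurements named in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not define the "barycentric point", so the sketch assumes it is the weighted centroid time of the vowel segment (weighted, for example, by frame power), and it assumes the decay is a plain difference in Hz between the accent-nucleus mora and the next mora. The function names `barycentric_f0` and `decay_after_nucleus` and all data values are hypothetical.

```python
import numpy as np


def barycentric_f0(times, f0, weights):
    """Return F0 at the weighted centroid time of one vowel segment.

    Assumption: the "barycentric point" is the centroid of the frame
    times weighted by `weights` (e.g. frame power). `times` must be
    increasing so that linear interpolation is valid.
    """
    times = np.asarray(times, dtype=float)
    f0 = np.asarray(f0, dtype=float)
    weights = np.asarray(weights, dtype=float)
    centroid = np.sum(times * weights) / np.sum(weights)
    # Linearly interpolate the F0 contour at the centroid time.
    return float(np.interp(centroid, times, f0))


def decay_after_nucleus(mora_f0, nucleus_index):
    """Return the F0 drop (Hz) from the accent-nucleus mora to the next mora."""
    return mora_f0[nucleus_index] - mora_f0[nucleus_index + 1]


# Toy vowel segment: five frames, uniform weights (made-up values).
times = [0.00, 0.01, 0.02, 0.03, 0.04]
f0 = [200.0, 210.0, 220.0, 210.0, 200.0]
power = [1.0, 1.0, 1.0, 1.0, 1.0]
nucleus_f0 = barycentric_f0(times, f0, power)

# Toy per-mora F0 values (Hz); index 1 is taken to be the accent nucleus.
mora_f0 = [180.0, nucleus_f0, 170.0]
decay = decay_after_nucleus(mora_f0, 1)
print(nucleus_f0, decay)
```

Under these assumptions, a larger `decay` value would correspond to a higher perceived degree of emphasis, in line with the first finding reported in the abstract.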
Rights: Copyright (C) 2019 Research Institute of Signal Processing, Japan. Yasuhiro Ohtani and Masato Akagi, 2019 RISP International Workshop on Nonlinear Circuits, Communications and Signal Processing (NCSP2019), 2019, 256-259.
URI: http://hdl.handle.net/10119/15770
Type: publisher
Appears in Collections: b11-1. Conference Papers

Files in This Item:

File        Description    Size      Format
2921.pdf                   1140Kb    Adobe PDF

All items stored in this system are protected by copyright.

 


Contact: Library Information Section, Research Promotion Division, Japan Advanced Institute of Science and Technology (JAIST)