
Please use this identifier to cite or link to this item: http://hdl.handle.net/10119/11930

Title: Glottal source analysis of emotional speech
Authors: Li, Yongwei
Akagi, Masato
Issue date: 2014
Publisher: 2014 RISP International Workshop on Nonlinear Circuits, Communications and Signal Processing (NCSP'14)
Journal name: 2014 RISP International Workshop on Nonlinear Circuits, Communications and Signal Processing (NCSP'14)
Start page: 513
End page: 516
Abstract: Emotion makes speech more expressive, and emotional speech conversion is needed in many systems. Analysis of the glottal source wave plays an important role in emotional speech conversion. The purpose of this paper is to analyze the glottal source of emotional speech for emotional speech conversion, based on the Auto-Regressive eXogenous (ARX) model combined with the Liljencrants-Fant (LF) model, in which the Glottal Closure Instant (GCI) and Glottal Opening Instant (GOI) are two important parameters that greatly affect the accuracy of the ARX-LF model. Therefore, a mean-based signal method is suggested to improve the estimation accuracy of the GCI, and the GOI is estimated from the Hilbert envelope of the linear prediction (LP) residual. The ARX-LF model with accurate GCI and GOI is then applied to analyze the glottal source of emotional speech. The results show that the proposed approach improves the accuracy of the estimated glottal source wave, and that different glottal source waves can be obtained for different emotional speech.
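The abstract names the Hilbert envelope of the LP residual as the signal from which the GOI is estimated, but does not give the procedure. The following is only a minimal sketch of how such an envelope could be computed, not the authors' implementation. It assumes autocorrelation-method linear prediction and an FFT-based analytic signal. All function names (`lp_coefficients`, `lp_residual`, `hilbert_envelope`, `synthetic_voiced`), the LP order, and the toy test signal are hypothetical choices made for illustration. How the GOI is picked from the envelope is not stated in the abstract and is not attempted here.

```python
import numpy as np

def lp_coefficients(x, order):
    """Autocorrelation-method LP: returns a[1..order] such that
    x[n] is predicted by sum_k a[k] * x[n-k]."""
    x = np.asarray(x, dtype=float)
    # Autocorrelation at lags 0..order (zero lag sits at index len(x)-1).
    r = np.correlate(x, x, mode="full")[len(x) - 1 : len(x) + order]
    R = np.array([[r[abs(i - j)] for j in range(order)] for i in range(order)])
    R += 1e-9 * r[0] * np.eye(order)  # tiny ridge for numerical stability
    return np.linalg.solve(R, r[1 : order + 1])

def lp_residual(x, a):
    """Prediction error e[n] = x[n] - sum_k a[k] * x[n-k]."""
    x = np.asarray(x, dtype=float)
    e = x.copy()
    for k, ak in enumerate(a, start=1):
        e[k:] -= ak * x[:-k]
    return e

def hilbert_envelope(e):
    """Magnitude of the analytic signal of e, built by zeroing the
    negative-frequency half of the spectrum."""
    n = len(e)
    h = np.zeros(n)
    h[0] = 1.0
    if n % 2 == 0:
        h[n // 2] = 1.0
        h[1 : n // 2] = 2.0
    else:
        h[1 : (n + 1) // 2] = 2.0
    return np.abs(np.fft.ifft(np.fft.fft(e) * h))

def synthetic_voiced(n, period, first):
    """Toy stand-in for voiced speech (an assumption, not real data):
    an impulse train passed through a stable two-pole resonator."""
    ex = np.zeros(n)
    ex[first::period] = 1.0
    y = np.zeros(n)
    for i in range(n):
        y[i] = ex[i]
        if i >= 1:
            y[i] += 1.3 * y[i - 1]
        if i >= 2:
            y[i] -= 0.8 * y[i - 2]
    return y

# Usage: envelope of the LP residual of the toy signal.
x = synthetic_voiced(800, 80, 40)
env = hilbert_envelope(lp_residual(x, lp_coefficients(x, 2)))
print(int(np.argmax(env[:80])))  # strongest envelope peak in the first period
```

On this toy signal the envelope peaks close to the excitation instants, which is the property that makes the envelope useful for locating glottal events. Real speech would need a higher LP order and frame-wise analysis, and those settings are not specified in the abstract.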
Rights: This material is posted here with permission of the Research Institute of Signal Processing Japan. Yongwei Li, Masato Akagi, 2014 RISP International Workshop on Nonlinear Circuits, Communications and Signal Processing (NCSP'14), 2014, 513-516.
URI: http://hdl.handle.net/10119/11930
Material type: publisher
Appears in collections: b11-1. Conference Papers


File: NCSP2014_Yongwei.pdf
Size: 387Kb
Format: Adobe PDF



Contact: Japan Advanced Institute of Science and Technology, Research Promotion Division, Library Information Section (ir-sys[at]ml.jaist.ac.jp)