JAIST Repository: The Contribution of Acoustic Features Analysis to Model Emotion Perceptual Process for Language Diversity

トップページ| 北陸先端科学技術大学院大学| 附属図書館

一覧

コミュニティ
& コレクション
タイトル
著者
日付
学位論文
リサーチレポート・テクニカルメモランダム

登録利用者:

登録者ページ
利用者(E-people)

当システムについて

JAIST Repository >
b. 情報科学研究科・情報科学系 >
b11. 会議発表論文・発表資料等 >
b11-1. 会議発表論文・発表資料 >

このアイテムの引用には次の識別子を使用してください: http://hdl.handle.net/10119/16095

タイトル:	The Contribution of Acoustic Features Analysis to Model Emotion Perceptual Process for Language Diversity
著者:	Li, Xingfeng Akagi, Masato
キーワード:	wavelet transform speech emotion recognition emotion dimension three-layer model
発行日:	2019
出版者:	International Speech Communication Association
誌名:	Proc. Interspeech 2019
開始ページ:	3262
終了ページ:	3266
DOI:	10.21437/Interspeech.2019-2229
抄録:	The multi-layered perceptual process of emotion in human speech plays an essential role in the field of affective computing for underlying a speaker’s state. However, a comprehensive process analysis of emotion perception is still challenging due to the lack of powerful acoustic features allowing accurate inference of emotion across speaker and language diversities. Most previous research works study acoustic features mostly using Fourier transform, short time Fourier transform or linear predictive coding. Even though these features may be useful for stationary signal within short frames, they may not capture the localized event adequately as speech transmits emotion information dynamically over time. This case introduces a set of acoustic features via wavelet transform analysis of the speech signal, and specifically, models the perceptual process of emotion for language diversity. For this aim, the proposed features are analyzed in a three-layer emotion perception model across multiple languages. Experiments show that the proposed acoustic features significantly enhance the perceptual process of emotion and render a better result in multilingual emotion recognition when compared it to the widely used prosodic and spectral features, as well as their combination in literature.
Rights:	Copyright (C) 2019 International Speech Communication Association. Xingfeng Li and Masato Akagi, Proc. Interspeech 2019, 2019, 3262-3266. http://dx.doi.org/10.21437/Interspeech.2019-2229
URI:	http://hdl.handle.net/10119/16095
資料タイプ:	publisher
出現コレクション:	b11-1. 会議発表論文・発表資料 (Conference Papers)

このアイテムのファイル:

ファイル	記述	サイズ	形式
IS2019_Xingfeng.PDF		436Kb	Adobe PDF	見る/開く

当システムに保管されているアイテムはすべて著作権により保護されています。

お問合せ先 : 北陸先端科学技術大学院大学　研究推進課図書館情報係 (ir-sys[at]ml.jaist.ac.jp)