JAIST Repository >
b. 情報科学研究科・情報科学系 >
b11. 会議発表論文・発表資料等 >
b11-1. 会議発表論文・発表資料 >
このアイテムの引用には次の識別子を使用してください:
http://hdl.handle.net/10119/18192
|
タイトル: | Speech Emotion Recognition Using Multichannel Parallel Convolutional Recurrent Neural Networks based on Gammatone Auditory Filterbank |
著者: | Peng, Zhichao Zhu, Zhi Unoki, Masashi Dang, Jianwu Akagi, Masato |
発行日: | 2017-12-15 |
出版者: | APSIPA |
誌名: | Proceedings of APSIPA Annual Summit and Conference 2017 |
抄録: | Speech Emotion Recognition (SER) using deep learning methods based on computational auditory models of human auditory system is a new way to identify emotional state. In this paper, we propose to utilize multichannel parallel convolutional recurrent neural networks (MPCRNN) to extract salient features based on Gammatone auditory filterbank from raw waveform and reveal that this method is effective for speech emotion recognition. We first divide the speech signal into segments, and then get multichannel data using Gammatone auditory filterbank, which is used as a first stage before applying MPCRNN to get the most relevant features for emotion
recognition from speech. We subsequently obtain emotion state probability distribution for each speech segment. Eventually, utterance-level features are constructed from segment-level probability distributions and fed into support vector machine (SVM) to identify the emotions. According to the experimental results, speech emotion features can be effectively learned utilizing the proposed deep learning approach based on Gammatone auditory filterbank. |
Rights: | Copyright (C) 2017 APSIPA. This material is posted here with permission of APSIPA (Asia-Pacific Signal and Information Processing Association). Zhichao Peng, Zhi Zhu, Masashi Unoki, Jianwu Dang, Masato Akagi, Proceedings of APSIPA Annual Summit and Conference 2017 |
URI: | http://hdl.handle.net/10119/18192 |
資料タイプ: | publisher |
出現コレクション: | b11-1. 会議発表論文・発表資料 (Conference Papers)
|
このアイテムのファイル:
ファイル |
記述 |
サイズ | 形式 |
APSIPA2017.pdf | | 569Kb | Adobe PDF | 見る/開く |
|
当システムに保管されているアイテムはすべて著作権により保護されています。
|