JAIST Repository >
b. 情報科学研究科・情報科学系 >
b11. 会議発表論文・発表資料等 >
b11-1. 会議発表論文・発表資料 >
このアイテムの引用には次の識別子を使用してください:
http://hdl.handle.net/10119/9985
|
タイトル: | Robust front end processing for speech recognition in reverberant environments: Utilization of speech characteristics |
著者: | Petrick, Rico Lu, Xugang Unoki, Masashi Akagi, Masato Hoffmann, Ruediger |
キーワード: | reverberation robust ASR harmonicity based feature analysis temporal power envelope feature analysis |
発行日: | 2008-09-24 |
出版者: | International Speech Communication Association |
誌名: | Proceedings of INTERSPEECH 2008 |
開始ページ: | 658 |
終了ページ: | 661 |
抄録: | This paper proposes two methods for robust automatic speech recognition (ASR) in reverberant environments. Unlike other methods which mostly apply inverse filtering by blindly estimated room impulse responses to achieve dereverberation, theproposed methods are based on the utilization of the characteristics of speech. The first method - Harmonicity based Feature Analysis – takes advantage of the harmonic componentsof speech, which are assumed to be undistorted. The second method - Temporal Power Envelope Feature Analysis – utilizes the temporal modulation structure of speech, representing the phoneme level temporal events which contain most intelligibility information. Both methods increase the recognition performance remarkably in a different way. Combining both of them connects their individual advantages. In order to examine theperformance of utilizing harmonicity and modulation temporal structure for reverberant ASR, the methods are tested in clean and reverberant training. As results show, even in strong reverberantconditions both methods obtain practical applicableperformance for reverberant training. In addition, besides testing their performance in dependency on the reverberation time, their performance considering the speaker-to-microphone distanceis tested, which is another new contributions in this paper. |
Rights: | Copyright (C) 2008 International Speech Communication Association. Rico Petrick, Xugang Lu, Masashi Unoki, Masato Akagi, Ruediger Hoffmann, Proceedings of INTERSPEECH 2008, pp.658-661. |
URI: | http://hdl.handle.net/10119/9985 |
資料タイプ: | publisher |
出現コレクション: | b11-1. 会議発表論文・発表資料 (Conference Papers)
|
このアイテムのファイル:
ファイル |
記述 |
サイズ | 形式 |
IS2008_Rico.pdf | | 619Kb | Adobe PDF | 見る/開く |
|
当システムに保管されているアイテムはすべて著作権により保護されています。
|