Title: Methods for robust speech recognition in reverberant environments: A comparison
Authors: Petrick, Rico
Feher, Thomas
Unoki, Masashi
Hoffmann, Rudiger
Keywords: reverberation
robust ASR
Issue date: 2010-09
Publisher: International Speech Communication Association
Publication title: Proceedings of INTERSPEECH 2010
Start page: 582
End page: 585
Abstract: In this article the authors continue previous studies investigating methods that aim to improve the decreased recognition rate (RR) of automatic speech recognition (ASR) systems in reverberant environments. Previously, three robust front-end methods were tested: the harmonicity based feature analysis (HFA), the temporal power envelope feature analysis (TPEFA), and their combination (HFA+TPEFA). This paper additionally introduces two well-known methods into the comparison: the dereverberation method using the inverse modulation transfer function (IMTF) and the delay-and-sum beamformer (DSB). Recognition experiments are carried out for command word recognition. The reverberant environments are chosen comprehensively as functions of the reverberation time T_60 and the speaker-to-microphone distance (SMD), the most important parameters for describing reverberant distortions. The results of this first comparison of such methods confirm some earlier assumptions experimentally, e.g. the IMTF method achieves robustness only in the far field, and the DSB improves the RR slightly but is outperformed by the HFA due to its indirectivity at low frequencies.
Rights: Copyright (C) 2010 International Speech Communication Association. Rico Petrick, Thomas Feher, Masashi Unoki, and Rudiger Hoffmann, Proceedings of INTERSPEECH 2010, 2010, 582-585.
