JAIST Repository: Two-stage binaural speech enhancement with Wiener filter for high-quality speech communication

トップページ| 北陸先端科学技術大学院大学| 附属図書館

一覧

コミュニティ
& コレクション
タイトル
著者
日付
学位論文
リサーチレポート・テクニカルメモランダム

登録利用者:

登録者ページ
利用者(E-people)

当システムについて

JAIST Repository >
b. 情報科学研究科・情報科学系 >
b10. 学術雑誌論文等 >
b10-1. 雑誌掲載論文 >

このアイテムの引用には次の識別子を使用してください: https://hdl.handle.net/10119/10724

タイトル:	Two-stage binaural speech enhancement with Wiener filter for high-quality speech communication
著者:	Li, Junfeng Sakamoto, Shuichi Hongo, Satoshi Akagi, Masato Suzuki, Yôiti
キーワード:	Binaural masking level difference Equalization–cancellation model Two-stage binaural speech enhancement (TS-BASE) Binaural cue preservation Sound localization
発行日:	2010-06-02
出版者:	Elsevier
誌名:	Speech Communication
巻:	53
号:	5
開始ページ:	677
終了ページ:	689
DOI:	10.1016/j.specom.2010.04.009
抄録:	Speech enhancement has been researched extensively for many years to provide high-quality speech communication in the presence of background noise and concurrent interference signals. Human listening is robust against these acoustic interferences using only two ears, but state-of-the-art two-channel algorithms function poorly. Motivated by psychoacoustic studies of binaural hearing (equalization–cancellation (EC) theory), in this paper, we propose a two-stage binaural speech enhancement with Wiener filter (TS-BASE/WF) approach that is a two-input two-output system. In this proposed TS-BASE/WF, interference signals are first estimated by equalizing and cancelling the target signal in a way inspired by the EC theory, a time-variant Wiener filter is then applied to enhance the target signal given the noisy mixture signals. The main advantages of the proposed TS-BASE/WF are (1) effectiveness in dealing with non-stationary multiple-source interference signals, and (2) success in preserving binaural cues after processing. These advantages were confirmed according to the comprehensive objective and subjective evaluations in different acoustical spatial configurations in terms of speech enhancement and binaural cue preservation.
Rights:	NOTICE: This is the author's version of a work accepted for publication by Elsevier. Junfeng Li, Shuichi Sakamoto, Satoshi Hongo, Masato Akagi, Yôiti Suzuki, Speech Communication, 53(5), 2010, 677-689, http://dx.doi.org/10.1016/j.specom.2010.04.009
URI:	https://hdl.handle.net/10119/10724
資料タイプ:	author
出現コレクション:	b10-1. 雑誌掲載論文 (Journal Articles)

このアイテムのファイル:

ファイル	記述	サイズ	形式
17002.pdf		274Kb	Adobe PDF	見る/開く

当システムに保管されているアイテムはすべて著作権により保護されています。

お問合せ先 : 北陸先端科学技術大学院大学　研究推進課学術情報係 (ir-sys[at]ml.jaist.ac.jp)