
Please use this identifier to cite or link to this item: http://hdl.handle.net/10119/10825

Title: A study on restoration of bone-conducted speech in noisy environments with LP-based model and Gaussian mixture model
Authors: Phung, Nghia Trung
Unoki, Masashi
Akagi, Masato
Keywords: bone-conducted speech
Gaussian mixture model
linear prediction
speech intelligibility
Issue date: 2012-09
Publisher: 信号処理学会
Journal: Journal of Signal Processing
Volume: 16
Issue: 5
Start page: 409
End page: 417
Abstract: The restoration of bone-conducted speech is a very important issue that enables robust speech communication in extremely noisy environments. We proposed a method of blind restoration in our previous studies based on a scheme of linear prediction with a method of training and prediction based on the simple recurrent neural network. However, prediction based on neural networks is not suitable for training with large corpora, which is necessary for real applications. The over-training problem with simple recurrent neural networks makes it difficult to train various kinds of bone-conducted speech in one session. In addition, it is difficult to adapt the neural network model to bone-conducted speech in unknown noisy environments to build an open dataset restoration of bone-conducted speech. Thus, a method of training and prediction based on the Gaussian mixture model was used in this research, instead of a neural network. A method of re-estimating the residual ratio in the scheme of linear prediction is also proposed. We also investigated how the proposed method works to restore bone-conducted speech in extremely noisy environments. Objective and subjective evaluations were carried out to evaluate the improvements in sound quality and the intelligibility of restored speech. The results revealed that our proposed method outperformed previous methods in both human hearing and automatic speech recognition systems even in extremely noisy environments.
Rights: Copyright (C) 2012 信号処理学会. Phung Nghia Trung, Masashi Unoki and Masato Akagi, Journal of Signal Processing, 16(5), 2012, 409-417.
URI: http://hdl.handle.net/10119/10825
Resource type: publisher
Appears in collections: b10-1. Journal Articles

Files in this item:

File: 1237.pdf
Size: 2231Kb
Format: Adobe PDF

All items stored in this system are protected by copyright.

 

