Series: Research Report - School of Information Science (ISSN 0918-7553), IS-RR-2007

Please use the following identifier to cite this item: http://hdl.handle.net/10119/3735

Title: Estimation of fundamental frequency of reverberant speech by utilizing complex cepstrum analysis
Authors: Unoki, Masashi
Hosorogiya, Toshihiro
Keywords: Fundamental frequency (F_0)
F_0 estimation
reverberant speech
complex cepstrum analysis
MTF concept
source-filter model
Issue date: 2007-06-18
Publisher: School of Information Science, Japan Advanced Institute of Science and Technology
Journal: Research report (School of Information Science, Japan Advanced Institute of Science and Technology)
Volume: IS-RR-2007-008
Start page: 1
End page: 14
Abstract: This paper reports comparative evaluations of twelve typical methods for estimating fundamental frequency (F_0) on large speech datasets in artificial reverberant environments. The methods include several classic algorithms, such as the Cepstrum, AMDF, LPC, and modified autocorrelation algorithms, as well as a few modern algorithms based on instantaneous amplitude and/or frequency, such as TEMPO, IFHC, and PHIA. The comparative results revealed that the percentages of correct estimates and the SNRs of the estimated F_0s decreased drastically as reverberation time increased. They also demonstrated that homomorphic (complex cepstrum) analysis and the concept of the source-filter model were relatively effective for estimating F_0 from reverberant speech. This paper therefore proposes a new method for robustly and accurately estimating F_0 in reverberant environments by applying the MTF concept and the source-filter model to complex cepstrum analysis. In this method, the MTF concept is used to eliminate dominant reverberant characteristics from the observed reverberant speech, and the source-filter model (liftering) is used to extract source information from the processed cepstrum. Finally, F_0s are estimated from the extracted source information by using the comb-filtering method. An additional comparative evaluation of the proposed method against other typical methods was carried out. The results demonstrated that the proposed method was more robust and provided more accurate F_0 estimates in reverberant environments than the previously reported methods.
URI: http://hdl.handle.net/10119/3735
Resource type: publisher
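
The abstract notes that homomorphic (cepstral) analysis is relatively effective for reverberant speech. One reason is that reverberation acts as a convolution, and convolution in time becomes addition in the cepstral domain. The following is a minimal sketch of that property, not the authors' method: it uses the real cepstrum rather than the complex cepstrum, circular convolution instead of a real room response, and it omits the MTF-based processing and comb filtering. All names, signals, and parameter values (`fs`, `n`, `period`, the damped pulse train, the decaying-noise room response) are assumptions made for illustration.

```python
import numpy as np

def real_cepstrum(x):
    """Real cepstrum: inverse FFT of the log magnitude spectrum."""
    log_mag = np.log(np.abs(np.fft.rfft(x)))
    return np.fft.irfft(log_mag, n=len(x))

def circular_convolve(x, h):
    """Circular convolution via the FFT (inputs must have equal length)."""
    return np.fft.irfft(np.fft.rfft(x) * np.fft.rfft(h), n=len(x))

fs = 16000     # sampling rate in Hz (assumed)
n = 1024       # frame length in samples (assumed)
period = 128   # pitch period in samples, i.e. F_0 = 125 Hz (assumed)

# Hypothetical voiced source: a damped pulse train, one pulse per period.
source = np.zeros(n)
pulse_positions = np.arange(0, n, period)
source[pulse_positions] = 0.9 ** np.arange(len(pulse_positions))

# Hypothetical room impulse response: exponentially decaying noise.
rng = np.random.default_rng(0)
room = rng.standard_normal(n) * np.exp(-np.arange(n) / 100.0)

# Reverberant observation = source convolved with the room response.
reverberant = circular_convolve(source, room)

c_source = real_cepstrum(source)
c_room = real_cepstrum(room)
c_reverberant = real_cepstrum(reverberant)

# Homomorphic property: the cepstra of source and room simply add.
additive = np.allclose(c_reverberant, c_source + c_room)

# Liftering: restrict the source cepstrum to quefrencies that correspond
# to a plausible pitch range (60-400 Hz, assumed) and pick the peak.
q_min, q_max = fs // 400, fs // 60
peak = q_min + int(np.argmax(c_source[q_min:q_max + 1]))
f0_estimate = fs / peak

print(additive, f0_estimate)
```

Because the two contributions add, a method can in principle subtract or suppress the room-related part of the cepstrum before looking for the pitch peak. In this sketch the peak is picked from the clean source cepstrum only; doing so reliably from `c_reverberant` is the harder problem the report addresses.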


File: 62-1.pdf (505 Kb, Adobe PDF)



Contact: Japan Advanced Institute of Science and Technology, Research Promotion Division, Library Information Section (ir-sys[at]ml.jaist.ac.jp)