
Please use the following identifier to cite this item: http://hdl.handle.net/10119/8176

Title: A flexible spectral modification method based on temporal decomposition and Gaussian mixture model
Authors: Nguyen, Binh Phu
Akagi, Masato
Keywords: Spectral modification
Temporal decomposition
Gaussian mixture model
STRAIGHT
Issue date: 2009
Publisher: Acoustical Society of Japan (日本音響学会)
Journal title: Acoustical Science and Technology
Volume: 30
Issue: 3
Start page: 170
End page: 179
DOI: 10.1250/ast.30.170
Abstract: Manipulating spectral structure often leads to degradation of speech quality, which is mainly due to insufficient smoothness of the modified spectra between frames and ineffective spectral modification. This paper presents a new spectral modification method to improve the quality of modified speech. If frames are processed independently, discontinuous features may be generated. Therefore, a speech analysis technique called temporal decomposition (TD), which decomposes speech into event targets and event functions, is used to model the spectral evolution effectively. Instead of modifying the speech spectra frame by frame, we only need to modify event targets and event functions. This feature leads to easy modification of the speech spectra, and the smoothness of modified speech is ensured by the shape of event functions. To improve spectral modification, we explore Gaussian mixture model parameters (spectral-GMM parameters) to model the spectral envelope of each event target, and develop a new algorithm for modifying spectral-GMM parameters in accordance with formant scaling factors. We first evaluate the effectiveness of our proposed method in spectra modeling, and then apply it to two areas which require different amounts of spectral modification: emotional speech synthesis and voice gender conversion. Experimental results verify the effectiveness of our proposed method for both spectra modeling and spectral modification.
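The core idea in the abstract, that each frame is a weighted sum of a few event targets, with the weights given by smooth event functions, can be illustrated with a minimal sketch. This is not the authors' algorithm: the triangular event functions, the two-dimensional formant-like target vectors, the event positions, and the scaling factors below are all hypothetical stand-ins chosen for illustration, and the paper's spectral-GMM modeling of each target is not reproduced.

```python
# Toy illustration of temporal-decomposition-style reconstruction:
#   frame(n) = sum_k target_k * phi_k(n)
# All names and numbers here are illustrative assumptions, not values from the paper.


def triangular_event_functions(centers, n_frames):
    """Hypothetical event functions: linear ramps between neighbouring event centres.

    Between two adjacent centres the two overlapping functions sum to 1,
    so the reconstruction interpolates smoothly from one target to the next.
    """
    functions = []
    for k, c in enumerate(centers):
        left = centers[k - 1] if k > 0 else None
        right = centers[k + 1] if k + 1 < len(centers) else None
        phi = []
        for n in range(n_frames):
            if n == c:
                w = 1.0
            elif n < c:
                w = 1.0 if left is None else max(0.0, (n - left) / (c - left))
            else:
                w = 1.0 if right is None else max(0.0, (right - n) / (right - c))
            phi.append(w)
        functions.append(phi)
    return functions


def reconstruct(targets, event_functions):
    """Rebuild every frame as the event-function-weighted sum of the targets."""
    n_frames = len(event_functions[0])
    dim = len(targets[0])
    frames = []
    for n in range(n_frames):
        frame = [0.0] * dim
        for target, phi in zip(targets, event_functions):
            for d in range(dim):
                frame[d] += target[d] * phi[n]
        frames.append(frame)
    return frames


def scale_targets(targets, factors):
    """Modify only the targets (here: per-dimension scaling), never individual frames."""
    return [[value * f for value, f in zip(target, factors)] for target in targets]


# Three hypothetical event targets (two formant-like frequencies in Hz each).
targets = [[500.0, 1500.0], [700.0, 1100.0], [300.0, 2300.0]]
event_functions = triangular_event_functions(centers=[0, 4, 8], n_frames=9)

frames = reconstruct(targets, event_functions)

# Apply assumed scaling factors to the targets, then resynthesize all frames.
modified_frames = reconstruct(scale_targets(targets, [1.1, 0.9]), event_functions)
```

Because only the three targets are edited and the event functions are reused unchanged, the modified frame sequence keeps the same smooth transitions as the original, which is the property the abstract attributes to the shape of the event functions.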
Rights: Copyright (C)2009 Acoustical Society of Japan, Binh Phu Nguyen and Masato Akagi, Acoustical Science and Technology, 30(3), 2009, 170-179.
URI: http://hdl.handle.net/10119/8176
Resource type: publisher
Appears in collections: b10-1. 雑誌掲載論文 (Journal Articles)

Files in this item:

File: AST2009_Binh.pdf
Size: 143Kb
Format: Adobe PDF

All items stored in this system are protected by copyright.

 

