Please use this identifier to cite or link to this item: http://hdl.handle.net/10119/4888

Title: Spectral Modification for Voice Gender Conversion using Temporal Decomposition
Authors: Nguyen, Binh Phu; Akagi, Masato
Issue Date: 2007-07
Publisher: Research Institute of Signal Processing Japan (信号処理学会)
Journal Title: Journal of Signal Processing
Volume: 11
Issue: 4
Start Page: 333
End Page: 336
Abstract: In most state-of-the-art voice gender conversion systems, the converted speech still sounds unnatural, which is mainly attributed to the insufficient smoothness of the converted spectra between frames and ineffective spectral modification. In this paper, we present a new method for voice gender conversion using a speech analysis technique called temporal decomposition (TD). TD is used to model spectral evolution effectively. Instead of modifying speech spectra frame by frame, we only need to modify event targets and event functions, and the smoothness of the converted speech is ensured by the shape of the event functions. To overcome the ineffective spectral modification, we explore Gaussian mixture model (GMM) parameter sets for an input of TD to flexibly model the spectral envelope, and develop a new method of modifying GMM parameters in accordance with formant scaling factors. For transforming fundamental frequencies, our system is based on STRAIGHT, which is a very high-quality vocoder. Experimental results show that the quality of the speech converted by the proposed method is significantly improved.
Rights: Copyright (C) 2007 Research Institute of Signal Processing Japan. Binh Phu Nguyen and Masato Akagi, Journal of Signal Processing, 11(4), 2007, 333-336.
URI: http://hdl.handle.net/10119/4888
Resource Type: publisher
Appears in Collections: b10-1. Journal Articles


File: JSP_Binh_2007.pdf (373 KB, Adobe PDF)