JAIST Repository >
b. 情報科学研究科・情報科学系 >
b10. 学術雑誌論文等 >
b10-1. 雑誌掲載論文 >
このアイテムの引用には次の識別子を使用してください:
http://hdl.handle.net/10119/4888
|
タイトル: | Spectral Modification for Voice Gender Conversion using Temporal Decomposition |
著者: | Nguyen, Binh Phu Akagi, Masato |
発行日: | 2007-07 |
出版者: | Research Institute of Signal Processing Japan(信号処理学会) |
誌名: | Journal of Signal Processing |
巻: | 11 |
号: | 4 |
開始ページ: | 333 |
終了ページ: | 336 |
抄録: | In most state-of-the-art voice gender conversion systems, the converted speech still sounds unnatural, which is mainly attributed to the insufficient smoothness of the converted spectra between frames and ineffective spectral modification. In this paper, we present a new method for voice gender conversion using a speech analysis technique called temporal decomposition (TD). TD is used to model spectral evolution effectively. Instead of modifying speech spectra frame by frame, we only need to modify event targets and event functions, and the smoothness of the converted speech is ensured by the shape of the event functions. To overcome the ineffective spectral modification, we explore Gaussian mixture model (GMM) parameter sets for an input of TD to flexibly model the spectral envelope, and develop a new method of modifying GMM parameters in accordance with formant scaling factors. For transforming fundamental frequencies, our system is based on STRAIGHT, which is a very high-quality vocoder. Experimental results show that the quality of the speech converted by the proposed method is significantly improved. |
Rights: | Copyright (C) 2007 Research Institute of Signal Processing Japan. Binh Phu Nguyen and Masato Akagi, Journal of Signal Processing, 11(4), 2007, 333-336. |
URI: | http://hdl.handle.net/10119/4888 |
資料タイプ: | publisher |
出現コレクション: | b10-1. 雑誌掲載論文 (Journal Articles)
|
このアイテムのファイル:
ファイル |
記述 |
サイズ | 形式 |
JSP_Binh_2007.pdf | | 373Kb | Adobe PDF | 見る/開く |
|
当システムに保管されているアイテムはすべて著作権により保護されています。
|