JAIST Repository: A singing voices synthesis system to characterize vocal registers using ARX-LF model

Please use this identifier to cite or link to this item: https://hdl.handle.net/10119/11510

Title:	A singing voices synthesis system to characterize vocal registers using ARX-LF model
Authors:	Motoda, Hiroki; Akagi, Masato
Issue date:	2013-03
Publisher:	2013 International Workshop on Nonlinear Circuits, Communications and Signal Processing (NCSP'13)
Journal title:	2013 International Workshop on Nonlinear Circuits, Communications and Signal Processing (NCSP'13)
Start page:	93
End page:	96
Abstract:	This paper proposes a singing-voice synthesis system that synthesizes singing voices with the characteristics of vocal registers such as vocal fry, modal, and falsetto. Humans can sing naturally over a wide frequency range by learning how to use vocal fold vibrations to produce the vocal registers. However, even state-of-the-art singing-voice synthesis systems cannot reproduce vocal registers appropriately. The naturalness of singing voices synthesized by these systems is reduced in the low and high frequency ranges. One method for improving naturalness is to add the characteristics of the glottal source for each vocal register. In this paper, the ARX-LF model, which can formulate the glottal source for each vocal register by simulating the human voice production mechanism, was applied. A model for controlling the ARX-LF parameters corresponding to the characteristics of the glottal sources was constructed, and acoustic features related to the naturalness of the singing voice were added. Singing voice data for each vocal register were analyzed with the ARX-LF model, and the ARX-LF parameter values corresponding to the glottal source of each vocal register were obtained. The control model was constructed from the results of this analysis. Singing voices were synthesized with the control model, and the quality of the synthesized voices was evaluated. As a result, the synthesized singing voices gave almost the same impressions as actual singing voices in each vocal register. The results demonstrate the effectiveness of the proposed system for synthesizing singing voices that characterize vocal registers.
Rights:	This material is posted here with permission of the Research Institute of Signal Processing Japan. Hiroki Motoda and Masato Akagi, 2013 International Workshop on Nonlinear Circuits, Communications and Signal Processing (NCSP'13), 2013, pp.93-96.
URI:	https://hdl.handle.net/10119/11510
Resource type:	publisher
Appears in collections:	b11-1. 会議発表論文・発表資料 (Conference Papers)

Files in this item:

File	Description	Size	Format
NCSP2013_Motoda.pdf		1524Kb	Adobe PDF

All items stored in this repository are protected by copyright.

Contact: 北陸先端科学技術大学院大学 (JAIST), Research Promotion Division, Academic Information Section (ir-sys[at]ml.jaist.ac.jp)