Title: Development of an F0 control model based on F0 dynamic characteristics for singing-voice synthesis
Authors: Saitou, Takeshi
Unoki, Masashi
Akagi, Masato
Keywords: F0 fluctuation
singing-voice perception
F0 control model
singing-voice synthesis
Issue date: 2005-07
Publisher: Elsevier
Journal: Speech Communication
Volume: 46
Start page: 405
End page: 417
Abstract: Constructing natural singing-voice synthesis systems requires a fundamental frequency (F0) control model that can cope with the F0 dynamic characteristics related to singing-voice perception. This paper discusses the importance of F0 dynamic characteristics in singing-voices and demonstrates, through psychoacoustic experiments, how strongly they influence singing-voice perception. Based on these considerations, it then proposes an F0 control model that can generate F0 contours of singing-voices, together with a singing-voice synthesis system. The results show that several types of F0 fluctuation (overshoot, vibrato, preparation, and fine-fluctuation) affect the perception and quality of a singing-voice, and that overshoot has the greatest effect. Moreover, the results show that the proposed F0 control model can control F0 fluctuations, generate F0 contours of singing-voices, and be applied to natural singing-voice synthesis.
Rights: Copyright (C)2005, Elsevier. Licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International license (CC BY-NC-ND 4.0). [http://creativecommons.org/licenses/by-nc-nd/4.0/] NOTICE: This is the author's version of a work accepted for publication by Elsevier. Takeshi Saitou, Masashi Unoki, Masato Akagi, Speech Communication, 46, 2005, 405-417, https://doi.org/10.1016/j.specom.2005.01.010
URI: http://hdl.handle.net/10119/18075
