JAIST Repository >
b. 情報科学研究科・情報科学系 >
b11. 会議発表論文・発表資料等 >
b11-1. 会議発表論文・発表資料 >

このアイテムの引用には次の識別子を使用してください: http://hdl.handle.net/10119/11509

タイトル: Transformation of F0 contours for lexical tones in concatenative speech synthesis of tonal languages
著者: Phung, Trung-Nghia
Luong, Mai Chi
Akagi, Masato
キーワード: Concatenative
speech synthesis
tone transformation
quality to size ratio
発行日: 2012-12
出版者: Institute of Electrical and Electronics Engineers (IEEE)
誌名: 2012 International Conference on Speech Database and Assessments (Oriental COCOSDA)
開始ページ: 129
終了ページ: 134
DOI: 10.1109/ICSDA.2012.6422458
抄録: Concatenative speech synthesis (CSS) provides the greatest naturalness. However, it requires a huge stored database resulting a huge footprint. Reducing the capacity of stored database while preserving the quality of CSS, or improving the quality to size ratio (QSr), is still a challenge. In this paper, we propose a method of transforming fundamental frequency (F0) contours of lexical tones, developed from TD-GMM framework that successfully applied for transforming spectral sequence in previous researches, in order to improve the QSr of CSS of tonal languages that results CSS available with limited data at offline stage, storing small online footprint, while preserving perceptual quality. The experimental results show that the proposed F0 transformation outperforms conventional and state-of-the-art F0 contour transformations for transforming lexical tones in terms of speech quality. When applying the proposed F0 contour transformation for transforming lexical tones in CSS of tonal languages, the QSr is enhanced compared with the method of simple F0 exchange while the quality of synthetic speech is preserved.
Rights: This is the author's version of the work. Copyright (C) 2012 IEEE. 2012 International Conference on Speech Database and Assessments (Oriental COCOSDA), 2012, 129-134. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
URI: http://hdl.handle.net/10119/11509
資料タイプ: author
出現コレクション:b11-1. 会議発表論文・発表資料 (Conference Papers)

このアイテムのファイル:

ファイル 記述 サイズ形式
O_COCOSDA2012_Nghia.pdf194KbAdobe PDF見る/開く

当システムに保管されているアイテムはすべて著作権により保護されています。

 


お問い合わせ先 : 北陸先端科学技術大学院大学 研究推進課図書館情報係