Please use this identifier to cite or link to this item: http://hdl.handle.net/10119/15771

Title: Estimation of glottal source waveforms and vocal tract shape for singing voices with wide frequency range
Authors: Takahashi, Kyoko; Akagi, Masato
Issue Date: 2018-11-15
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Published in: 2018 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)
Start page: 1879
End page: 1887
DOI: 10.23919/APSIPA.2018.8659480
Abstract: Estimation of glottal vibration and vocal tract for singing voices is necessary for clarifying the mechanism of singing voice production. However, accurate estimation of glottal vibration and vocal tract shape in singing voices with a high fundamental frequency (f0) is difficult using simulated models such as the auto-regressive with exogenous input (ARX) model and Liljencrants-Fant (LF) model. This is caused by two problems: the inaccurate estimation method of the glottal closure instant (GCI) and the inappropriate estimation method of ARX model parameter values in singing voices with high f0. Therefore, this proposed method aims to accurately estimate glottal source waveforms and vocal tract shape for singing voices with wide frequency range. To achieve this objective, we propose two solutions: estimation of GCI using an electroglottogram (EGG) signal and estimation of ARX model parameter values using multi-stage optimization and an evaluation function including the leaking effect from forwarded periods. In experiments using simulated singing voices and real singing voices, it was indicated that the accurate estimation of GCI, the reliable estimation of the parameter values of the ARX model for singing voices with high f0, and the estimation of glottal vibration and vocal tract shape in singing voices with wide frequency range were achieved by the proposed method.
Rights: This is the author's version of the work. Copyright (C) 2018 IEEE. 2018 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2018, 1879-1887. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
URI: http://hdl.handle.net/10119/15771
Material Type: publisher
Appears in Collections: b11-1. 会議発表論文・発表資料 (Conference Papers)

Files in This Item:

File        Description   Size      Format
2908.pdf                  1307Kb    Adobe PDF
