JAIST Repository >
School of Information Science >
Conference Papers >
Conference Papers >

Please use this identifier to cite or link to this item: http://hdl.handle.net/10119/16078

Title: Estimation of glottal source waveforms and vocal tract shapes from speech signals based on ARX-LF model
Authors: Li, Yongwei
Sakakibara, Ken-Ichi
Akagi, Masato
Keywords: glottal source waveform
vocal tract shape
ARX-LF model
Issue Date: 2019-05-06
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Magazine name: 2018 11th International Symposium on Chinese Spoken Language Processing (ISCSLP)
Start page: 230
End page: 234
DOI: 10.1109/ISCSLP.2018.8706694
Abstract: The widely used method to estimate glottal source waveform and vocal tract shape is to process speech signal using inverse filter and then to fit residual signal using glottal source model. However, since source-tract interactions, estimation accuracy is reduced. In this paper, we propose a method to estimate glottal source waveform and vocal tract shape simultaneously based on analysis-by-synthesis approach with a source-filter model constructed with an auto-regressive eXogenous (ARX) model combined with the Lilijencrant-Fant (LF) model. Since the optimization of multiple parameters makes simultaneous estimation difficult, there are two steps: the glottal source parameters are initialized using the inverse filter method, then the accurate parameters of the glottal source and the vocal tract shape are estimated simultaneously using an analysis-by-synthesis approach. Experimental results with synthetic and real speech signals showed the higher estimation accuracy of the proposed method than inverse filter.
Rights: This is the author's version of the work. Copyright (C) 2019 IEEE. 2018 11th International Symposium on Chinese Spoken Language Processing (ISCSLP), 2019, pp.230-234. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
URI: http://hdl.handle.net/10119/16078
Material Type: author
Appears in Collections:b11-1. 会議発表論文・発表資料 (Conference Papers)

Files in This Item:

File Description SizeFormat
ISCSLP2018_Yongwei.pdf2074KbAdobe PDFView/Open

All items in DSpace are protected by copyright, with all rights reserved.

 


Contact : Library Information Section, Japan Advanced Institute of Science and Technology