JAIST Repository >
School of Information Science >
Conference Papers >
Conference Papers >

Please use this identifier to cite or link to this item: http://hdl.handle.net/10119/18193

Title: Study on Simultaneous Estimation of Glottal Source and Vocal Tract Parameters by ARMAX-LF Model for Speech Analysis/Synthesis
Authors: Li, Kai
Unoki, Masashi
Li, Yongwei
Dang, Jianwu
Akagi, Masato
Issue Date: 2021-12
Publisher: APSIPA
Magazine name: Proceedings, APSIPA Annual Summit and Conference 2021
Start page: 36
End page: 43
Abstract: Correct estimation of glottal source as well as vocal tract parameters is crucial for speech analysis and synthesis. Nearly all methods for estimating these parameters are based on the source-filter assumption. However, the separation and estimation of the source and filter parts are still challenging due to the unreasonable modeling related to physiological processes of speech production or inappropriate estimation procedures. We propose a model that combines the autoregressive moving average exogenous (ARMAX) and Liljencrants-Fant (LF) models, called the ARMAX-LF model, to accurately represent the physiological processes of speech production. The ARMAX model represents the vocal tract as a pole-zero filter with an additional exogenous residual signal, and the LF model represents glottal source waveform as a parametrized time-domain model. Furthermore, we propose a two-stage iterative estimation procedure to separately and simultaneously estimate the parameters of the ARMAXLF model. The estimated parameters were evaluated objectively and subjectively with synthesized vowels, synthesized consonants, and natural speech. The results indicate that the ARMAX-LF model with the estimated parameters can separately represent the glottal source and vocal tract characteristics and can be widely used in speech analysis and synthesis.
Rights: Copyright (C) 2021 APSIPA. This material is posted here with permission of APSIPA (Asia-Pacific Signal and Information Processing Association). Kai Li, Masashi Unoki, Yongwei Li, Jianwu Dang, Masato Akagi, Proceedings of APSIPA Annual Summit and Conference 2021,pp.36-43
URI: http://hdl.handle.net/10119/18193
Material Type: publisher
Appears in Collections:b11-1. 会議発表論文・発表資料 (Conference Papers)

Files in This Item:

File Description SizeFormat
APSIPA0000036.pdf441KbAdobe PDFView/Open

All items in DSpace are protected by copyright, with all rights reserved.

 


Contact : Library Information Section, Japan Advanced Institute of Science and Technology