
Please use this identifier to cite or link to this item: http://hdl.handle.net/10119/16245

Title: Mimicking Lombard Effect: An Analysis and Reconstruction
Authors: Ngo, Thuan Van; Kubo, Rieko; Akagi, Masato
Keywords: Lombard speech; perceptual mimicking; rule-based methods
Issue Date: 2020-05-01
Publisher: The Institute of Electronics, Information and Communication Engineers (IEICE)
Journal name: IEICE Transactions on Information and Systems
Volume: E103-D
Number: 5
Start page: 1108
End page: 1117
DOI: 10.1587/transinf.2019EDP7260
Abstract: Lombard speech is produced in noisy environments as a result of the Lombard effect and remains intelligible under adverse conditions. To adaptively control the intelligibility of speech transmitted by public announcement systems, this study focuses on perceptually mimicking Lombard speech against backgrounds with varying noise levels. Other approaches map neutral speech features to the corresponding Lombard speech features. Because such a mapping applies to only one noise level at a time, and the characteristics of Lombard speech vary with noise level, these approaches are unsuitable for varying noise levels. Instead, we use a rule-based method that automatically generates rules and flexibly controls features as the noise level changes. Specifically, we conduct a feature tendency analysis and propose a continuous rule generation model to estimate the effect of varying noise levels on features. The proposed techniques, which are based on a coarticulation model, MRTD, and spectral-GMM, can easily modify neutral speech features by following the generated rules. Voices with these features are then synthesized by STRAIGHT to obtain Lombard speech suited to noise of varying levels. To validate the proposed method, the quality of the mimicked speech is evaluated in subjective listening experiments on similarity, intelligibility, and naturalness. Under varying noise levels, the results show that the proposed method and a state-of-the-art method achieve equal similarity to Lombard speech. Intelligibility and naturalness are comparable with some feature modifications.
Rights: Copyright (C)2020 IEICE. Thuan Van Ngo, Rieko Kubo, and Masato Akagi, IEICE Transactions on Information and Systems, E103-D(5), 2020, pp.1108-1117. https://www.ieice.org/jpn/trans_online/
URI: http://hdl.handle.net/10119/16245
Material Type: publisher
Appears in Collections: b10-1. Journal Articles

Files in This Item:

File: IEICE-FD_Ngo.pdf
Size: 1002Kb
Format: Adobe PDF

All items in DSpace are protected by copyright, with all rights reserved.

 


Contact: Library Information Section, Japan Advanced Institute of Science and Technology