
Please use this identifier to cite or link to this item: http://hdl.handle.net/10119/18156

Title: Speak Like a Professional: Increasing Speech Intelligibility by Mimicking Professional Announcer Voice with Voice Conversion
Authors: Ho, Tuan Vu
Kobayashi, Maori
Akagi, Masato
Keywords: Voice conversion
speech intelligibility
professional announcer
speech-in-noise
Issue Date: 2022-09
Publisher: International Speech Communication Association
Journal Title: Proc. InterSpeech 2022
Start Page: 171
End Page: 175
DOI: 10.21437/Interspeech.2022-124
Abstract: In most practical scenarios, an announcement system must deliver speech messages in a noisy environment in which the background noise cannot be cancelled out. The local noise reduces speech intelligibility and increases the listening effort of the listener, and hence hampers the effectiveness of the announcement system. It has been reported that the voices of professional announcers are clearer and more comprehensible than those of non-expert speakers in noisy environments. This finding suggests that speech intelligibility might be related to the speaking style of professional announcers, which can be adapted using a voice conversion method. Motivated by this idea, this paper proposes a method for enhancing speech intelligibility in noisy environments by applying voice conversion to non-professional voices. We discovered that professional announcers and non-professional speakers form separate clusters on the speaker embedding plane. This implies that speech intelligibility can be controlled as an independent feature of speaker individuality. To examine the advantage of the converted voice in noisy environments, we conducted experiments using test words masked by pink noise at different SNR levels. The results of objective and subjective evaluations confirm that the speech intelligibility of the converted voice is higher than that of the original voice in low-SNR conditions.
Rights: Copyright (C) 2022 International Speech Communication Association. Tuan Vu Ho, Maori Kobayashi, Masato Akagi, Proc. InterSpeech2022, 2022, pp.171-175. doi: 10.21437/Interspeech.2022-124
URI: http://hdl.handle.net/10119/18156
Resource Type: publisher
Appears in Collections: b11-1. Conference Papers and Presentation Materials (Conference Papers)

Files in This Item:

File: vuho22_interspeech.pdf
Size: 832Kb
Format: Adobe PDF

All items stored in this system are protected by copyright.

Contact: Japan Advanced Institute of Science and Technology, Research Promotion Section, Library Information Unit