JAIST Repository >

AKAGI Masato ProfessorFaculty Profile

No.Bibliographical information
1 Increasing Speech Intelligibility by Mimicking Professional Announcers’ Voices and Its Physical Correlates / Tran, Dung Kim, Akagi, Masato, Unoki, Masashi, 2023 Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), pp.1187-1192, 2023-10-31, Institute of Electrical and Electronics Engineers (IEEE)
2 Music Theory-inspired Acoustic Representation for Speech Emotion Recognition / Li, Xingfeng, Shi, Xiaohan, Hu, Desheng, Li, Yongwei, Zhang, Qingchen, Wang, Zhengxia, Unoki, Masashi, Akagi, Masato, IEEE/ACM Transactions on Audio, Speech, and Language Processing, 31, pp.2534-2547, 2023-06-26, Institute of Electrical and Electronics Engineers (IEEE)
3 Data Augmentation Using McAdams-Coefficient-Based Speaker Anonymization for Fake Audio Detection / Li, Kai, Li, Sheng, Lu, Xugang, Akagi, Masato, Liu, Meng, Zhang, Lin, Zeng, Chang, Wang, Longbiao, Dang, Jianwu, Unoki, Masashi, Proc. InterSpeech 2022, pp.664-668, 2022-09, International Speech Communication Association
4 Speak Like a Professional: Increasing Speech Intelligibility by Mimicking Professional Announcer Voice with Voice Conversion / Ho, Tuan Vu, Kobayashi, Maori, Akagi, Masato, Proc. InterSpeech 2022, pp.171-175, 2022-09, International Speech Communication Association
5 Vector-quantized Variational Autoencoder for Phase-aware Speech Enhancement / Ho, Tuan Vu, Nguyen, Quoc Huy, Akagi, Masato, Unoki, Masashi, Proc. InterSpeech 2022, pp.176-180, 2022-09, International Speech Communication Association
6 Speech Emotion and Naturalness Recognitions With Multitask and Single-Task Learnings / Atmaja, Bagus Tris, Sasou, Akira, Akagi, Masato, IEEE Access, 10, pp.72381-72387, 2022-07-07, Institute of Electrical and Electronics Engineers (IEEE)
7 Survey on bimodal speech emotion recognition from acoustic and linguistic information fusion / Atmaja, Bagus Tris, Sasou, Akira, Akagi, Masato, Speech Communication, 140, pp.11-28, 2022-03-26, Elsevier
8 Acoustic features correlated to perceived urgency in evacuation announcements / Kobayashi, Maori, Hamada, Yasuhiro, Akagi, Masato, Speech Communication, 139, pp.22-34, 2022-03-06, Elsevier
9 Study on Simultaneous Estimation of Glottal Source and Vocal Tract Parameters by ARMAX-LF Model for Speech Analysis/Synthesis / Li, Kai, Unoki, Masashi, Li, Yongwei, Dang, Jianwu, Akagi, Masato, Proceedings, APSIPA Annual Summit and Conference 2021, pp.36-43, 2021-12, APSIPA
10 Hierarchical Prosody Analysis Improves Categorical and Dimensional Emotion Recognition / Li, Xingfeng, Guo, Taiyang, Hu, Xinhui, Xu, Xinkang, Dang, Jianwu, Akagi, Masato, Proceedings, APSIPA Annual Summit and Conference 2021, pp.700-704, 2021-12, APSIPA
11 Automatic Naturalness Recognition from Acted Speech Using Neural Networks / Atmaja, Bagus Tris, Sasou, Akira, Akagi, Masato, Proceedings, APSIPA Annual Summit and Conference 2021, pp.731-736, 2021-12, APSIPA
12 F_0-Noise-Robust Glottal Source and Vocal Tract Analysis Based on ARX-LF Model / Li, Yongwei, Tao, Jianhua, Erickson, Donna, Liu, Bin, Akagi, Masato, IEEE/ACM Transactions on Audio, Speech, and Language Processing, 29, pp.3375-3383, 2021-10-15, Institute of Electrical and Electronics Engineers (IEEE)
13 Increasing speech intelligibility and naturalness in noise based on concepts of modulation spectrum and modulation transfer function / Ngo, Thuanvan, Kubo, Rieko, Akagi, Masato, Speech Communication, 135, pp.11-24, 2021-10-01, Elsevier
14 Cross-Lingual Voice Conversion With Controllable Speaker Individuality Using Variational Autoencoder and Star Generative Adversarial Network / Ho, Tuan Vu, Akagi, Masato, IEEE Access, 9, pp.47503-47515, 2021-03-02, Institute of Electrical and Electronics Engineers (IEEE)
15 Predicting Valence and Arousal by Aggregating Acoustic Features for Acoustic-Linguistic Information Fusion / Atmaja, Bagus Tris, Hamada, Yasuhiro, Akagi, Masato, 2020 IEEE REGION 10 CONFERENCE (TENCON), pp.1081-1085, 2020-11-19, Institute of Electrical and Electronics Engineers (IEEE)
16 Two-stage dimensional emotion recognition by fusing predictions of acoustic and text networks using SVM / Atmaja, Bagus Tris, Akagi, Masato, Speech Communication, 126, pp.9-21, 2020-11-19, Elsevier
17 On The Differences Between Song and Speech Emotion Recognition: Effect of Feature Sets, Feature Types, and Classifiers / Atmaja, Bagus Tris, Akagi, Masato, 2020 IEEE REGION 10 CONFERENCE (TENCON), pp.968-972, 2020-11-18, Institute of Electrical and Electronics Engineers (IEEE)
18 Improving Valence Prediction in Dimensional Speech Emotion Recognition Using Linguistic Information / Atmaja, Bagus Tris, Akagi, Masato, 2020 23rd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques (O-COCOSDA), pp.166-171, 2020-11-06, Institute of Electrical and Electronics Engineers (IEEE)
19 Continuous Audiovisual Emotion Recognition Using Feature Selection and LSTM / Elbarougy, Reda, Atmaja, Bagus Tris, Akagi, Masato, Journal of Signal Processing, 24(6), pp.229-235, 2020-11-01, Research Institute of Signal Processing Japan
20 Non-parallel Voice Conversion based on Hierarchical Latent Embedding Vector Quantized Variational Autoencoder / Ho, Tuan Vu, Akagi, Masato, Proc. Joint Workshop for the Blizzard Challenge and Voice Conversion Challenge 2020, pp.140-144, 2020-10-30, International Speech Communication Association
21 Acoustic and articulatory analysis and synthesis of shouted vowels / Xue, Yawen, Marxen, Michael, Akagi, Masato, Birkholz, Peter, Computer Speech & Language, 66, pp.101156-, 2020-10-09, Elsevier
22 Segment-level Effects of Gender, Nationality and Emotion Information on Text-independent Speaker Verification / Li, Kai, Akagi, Masato, Wu, Yibo, Dang, and Jianwu, Proc. InterSpeech2020, pp.2987-2991, 2020-10, International Speech Communication Association
23 Comparison of glottal source parameter values in emotional vowels / Li, Yongwei, Tao, Jianhua, Liu, Bin, Erickson, Donna, Akagi, Masato, Proc. InterSpeech2020, pp.4103-4107, 2020-10, International Speech Communication Association
24 A Two-Stage Phase-Aware Approach for Monaural Multi-Talker Speech Separation / Yin, Lu, Li, Junfeng, Yan, Yonghong, Akagi, Masato, IEICE Transactions Information and Systems, E103-D(7), pp.1732-1743, 2020-07-01, 電子情報通信学会
25 Dimensional speech emotion recognition from speech features and word embeddings by using multitask learning / Atmaja, Bagus Tris, Akagi, Masato, APSIPA Transactions on Signal and Information Processing, 9, pp.e17-, 2020-05-27, Cambridge University Press
26 The Effect of Silence Feature in Dimensional Speech Emotion Recognition / Atmaja, Bagus Tris, Akagi, Masato, Proc. 10th International Conference on Speech Prosody 2020, pp.26-30, 2020-05-25, International Speech Communication Association
27 Mimicking Lombard Effect: An Analysis and Reconstruction / Ngo, Thuan Van, Kubo, Rieko, Akagi, Masato, IEICE Transactions on Information and Systems, E103-D(5), pp.1108-1117, 2020-05-01, 電子情報通信学会
28 Multitask Learning and Multistage Fusion for Dimensional Audiovisual Emotion Recognition / Atmaja, Bagus Tris, Akagi, Masato, 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.4482-4486, 2020-05, Institute of Electrical and Electronics Engineers (IEEE)
29 Study on relationship between warmness of speech and valence, activation or dominance / Miyagawa, Natsumi, Akagi, Masato, 2020 RISP International Workshop on Nonlinear Circuits, Communications and Signal Processing (NCSP 2020), pp.299-302, 2020-02-20, Research Institute of Signal Processing Japan
30 Influence of auditory feedback on uttering vowel speech in noisy environment / Nishigaki, Tomoya, Akagi, Masato, 2020 RISP International Workshop on Nonlinear Circuits, Communications and Signal Processing (NCSP 2020), pp.303-306, 2020-02-20, Research Institute of Signal Processing Japan
31 Effect of articulatory and acoustic features on the intelligibility of speech in noise: an articulatory synthesis study / Ngo, Thuanvan, Akagi, Masato, Birkholz, Peter, Speech Communication, 117, pp.13-20, 2020-01-22, Elsevier
32 Speech Emotion Recognition Using 3D Convolutions and Attention-Based Sliding Recurrent Networks With Auditory Front-Ends / Peng, Zhichao, Li, Xingfeng, Zhu, Zhi, Unoki, Masashi, Dang, Jianwu, Akagi, Masato, IEEE Access, 8, pp.16560-16572, 2020-01-20, Institute of Electrical and Electronics Engineers (IEEE)
33 Simultaneous Estimation of Glottal Source Waveforms and Vocal Tract Shapes from Speech Signals Based on ARX-LF Model / Li, Yongwei, Sakakibara, Ken-Ichi, Akagi, Masato, Journal of Signal Processing Systems, 92, pp.831-838, 2019-12-23, Springer
34 Combining F0 and non-negative constraint robust principal component analysis for singing voice separation / Li, Feng, Akagi, Masato, Signal Processing, 170, pp.107432-, 2019-12-14, Elsevier
35 Monaural Singing Voice Separation Using Fusion-Net with Time-Frequency Masking / Li, Feng, Qian, Kaizhi, Hasegawa-Johnson, Mark, Akagi, Masato, 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), pp.1239-1243, 2019-11-20, Institute of Electrical and Electronics Engineers (IEEE)
36 Non-parallel Voice Conversion with Controllable Speaker Individuality using Variational Autoencoder / Ho, Tuan Vu, Akagi, Masato, 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), pp.106-111, 2019-11-19, Institute of Electrical and Electronics Engineers (IEEE)
37 Speech Emotion Recognition Using Speech Feature and Word Embedding / Atmaja, Bagus Tris, Shirai, Kiyoaki, Akagi, Masato, 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), pp.519-523, 2019-11-19, Institute of Electrical and Electronics Engineers (IEEE)
38 Evaluation of the Lombard Effect Model on Synthesizing Lombard Speech in Varying Noise Level Environments with Limited Data / Ngo, Thuan Van, Kubo, Rieko, Akagi, Masato, 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), pp.133-137, 2019-11-19, Institute of Electrical and Electronics Engineers (IEEE)
39 Dimensional Emotion Recognition from Speech Using Modulation Spectral Features and Recurrent Neural Network / Peng, Zhichao, Zhu, Zhi, Unoki, Masashi, Dang, Jianwu, Akagi, Masato, 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), pp.524-528, 2019-11-19, Institute of Electrical and Electronics Engineers (IEEE)
40 Speech Emotion Recognition Based on Speech Segment Using LSTM with Attention Model / Atmaja, Bagus Tris, Akagi, Masato, 2019 IEEE International Conference on Signals and Systems (ICSigSys), 2019-07-16, Institute of Electrical and Electronics Engineers (IEEE)
41 Estimation of glottal source waveforms and vocal tract shapes from speech signals based on ARX-LF model / Li, Yongwei, Sakakibara, Ken-Ichi, Akagi, Masato, 2018 11th International Symposium on Chinese Spoken Language Processing (ISCSLP), pp.230-234, 2019-05-06, Institute of Electrical and Electronics Engineers (IEEE)
42 Blind Monaural Singing Voice Separation Using Rank-1 Constraint Robust Principal Component Analysis and Vocal Activity Detection / Li, Feng, Akagi, Masato, Neurocomputing, 350, pp.44-52, 2019-04-17, Elsevier
43 Improving multilingual speech emotion recognition by combining acoustic features in a three-layer model / Li, Xingfeng, Akagi, Masato, Speech Communication, 110, pp.1-12, 2019-04-03, Elsevier
44 Study on Nonlinear Relationships between Semantic Primitives and Emotional Dimensions for Improving Three-layered Model / Liu, Xingyu, Elbarougy, Reda Elsaid, Akagi, Masato, 2019 RISP International Workshop on Nonlinear Circuits, Communications and Signal Processing (NCSP2019), pp.522-525, 2019-03-07, Research Institute of Signal Processing, Japan
45 Study on Relations between Emotion Perception and Acoustic Features using Speech Morphing Techniques / Wang, Zi, Kobayashi, Maori, Akagi, Masato, 2019 RISP International Workshop on Nonlinear Circuits, Communications and Signal Processing (NCSP2019), pp.510-513, 2019-03-07, Research Institute of Signal Processing, Japan
46 Study on Perception of Speaker Age by Semantic Differential Method / Li, Yang, Kobayashi, Maori, Akagi, Masato, 2019 RISP International Workshop on Nonlinear Circuits, Communications and Signal Processing (NCSP2019), pp.248-251, 2019-03-06, Research Institute of Signal Processing, Japan
47 Variation of Formant Amplitude and Frequencies in Vowel Spectrum uttered under Various Noisy Environments / Matsumoto, Shumpei, Akagi, Masato, 2019 RISP International Workshop on Nonlinear Circuits, Communications and Signal Processing (NCSP2019), pp.252-255, 2019-03-06, Research Institute of Signal Processing, Japan
48 Study on Relationship between Degree of Emphasis and Acoustic Feature for Synthesizing Emphasized Speech / Ohtani, Yasuhiro, Akagi, Masato, 2019 RISP International Workshop on Nonlinear Circuits, Communications and Signal Processing (NCSP2019), pp.256-259, 2019-03-06, Research Institute of Signal Processing, Japan
49 Relationship between discomfort sound and its physical correlates / Takahashi, Yumiko, Akagi, Masato, 2019 RISP International Workshop on Nonlinear Circuits, Communications and Signal Processing (NCSP2019), pp.311-314, 2019-03-06, Research Institute of Signal Processing, Japan
50 The Contribution of Acoustic Features Analysis to Model Emotion Perceptual Process for Language Diversity / Li, Xingfeng, Akagi, Masato, Proc. Interspeech 2019, pp.3262-3266, 2019, International Speech Communication Association

1 2 3 4 5 next

 


Contact : Library Information Section, Japan Advanced Institute of Science and Technology