
Author: "AKAGI, Masato"


Showing 208 items for this author.

Issue Date | Title | Author(s)
1995 | Speaker individualities in speech spectral envelopes | Kitamura, Tatsuya; Akagi, Masato
1997 | Speaker individuality in fundamental frequency contours and its control | Akagi, Masato; Ienaga, Taro
Mar-1997 | A method of extracting a signal waveform from a noise-added waveform [in Japanese] | UNOKI, Masashi; AKAGI, Masato
6-Feb-1998 | A computational model of co-modulation masking release | Unoki, Masashi; Akagi, Masato
6-Feb-1998 | A method of signal extraction from noisy signal based on auditory scene analysis | Unoki, Masashi; Akagi, Masato
Apr-1999 | A method of signal extraction from noisy signal based on auditory scene analysis | Unoki, Masashi; Akagi, Masato
20-Apr-1999 | Noise reduction by spectral subtraction using a microphone pair [in Japanese] | MIZUMACHI, Mitsunori; AKAGI, Masato
20-Oct-1999 | A method of extracting a harmonic complex tone from noise based on auditory scene analysis [in Japanese] | UNOKI, Masashi; AKAGI, Masato
2000 | The auditory-oriented spectral distortion for evaluating speech signals distorted by additive noises | Mizumachi, Mitsunori; Akagi, Masato
Jul-2000 | A computational model of auditory sound localization based on ITD | Ito, Kazuhito; Akagi, Masato
1-Jul-2000 | A functional model of cochlear nucleus cells: response properties of anteroventral cochlear nucleus cells [in Japanese] | Maki, Katuhiro; Akagi, Masato; Hirota, Kaoru
25-Dec-2000 | 2. Genealogy of auditory models: the field of hearing (Special issue: Achievements of the 20th century and remaining issues for the 21st century in acoustics) [in Japanese] | Akagi, Masato
2001 | Computational Models of Auditory Function: A computational model of auditory sound localization | Ito, Kazuhito; Akagi, Masato
2001 | Computational Models of Auditory Function: A computational model of co-modulation masking release | Unoki, Masashi; Akagi, Masato
2002 | Enabling Society With Information Technology: Speech enhancement and segregation based on human auditory mechanisms | Akagi, Masato; Mizumachi, Mitsunori; Ishimoto, Yuichi; Unoki, Masashi
25-Dec-2002 | Response properties of a ventral cochlear nucleus cell model to amplitude-modulated tones [in Japanese] | Maki, Katuhiro; Akagi, Masato; Hirota, Kaoru
25-Dec-2002 | Time-frequency response patterns of neural firing in the early auditory system (Special section: Trends in the analysis of peripheral auditory function) [in Japanese] | Maki, Katuhiro; Ito, Kazuhito; Akagi, Masato
1-Mar-2003 | Modified Restricted Temporal Decomposition and Its Application to Low Rate Speech Coding | NGUYEN, Phu Chien; OCHI, Takao; AKAGI, Masato
25-Dec-2003 | A neural circuit model of the frequency response properties of dorsal cochlear nucleus cells: responses to tone-burst stimuli [in Japanese] | Maki, Katuhiro; Akagi, Masato; Hirota, Kaoru
2004 | A speech dereverberation method based on the MTF concept in power envelope restoration | Unoki, Masashi; Sakata, Keigo; Furukawa, Masakazu; Akagi, Masato
2004 | An improved method based on the MTF concept for restoring the power envelope from a reverberant signal | Unoki, Masashi; Furukawa, Masakazu; Sakata, Keigo; Akagi, Masato
1-Jan-2004 | Fundamental Frequency Estimation for Noisy Speech Using Entropy-Weighted Periodic and Harmonic Features | ISHIMOTO, Yuichi; ISHIZUKA, Kentaro; AIKAWA, Kiyoaki; AKAGI, Masato
1-Jun-2004 | A computational model of the temporal response properties of inferior colliculus cells [in Japanese] | Maki, Katuhiro; Akagi, Masato; Hirota, Kaoru
2005 | Toward a rule-based synthesis of emotional speech on linguistic description of perception | Huang, Chun-Fang; Akagi, Masato
2005 | Study on improving regularity of neural phase locking in single neurons of AVCN via a computational model | Ito, Kazuhito; Akagi, Masato
2005 | A computational model of cochlear nucleus neurons | Maki, Katuhiro; Akagi, Masato
28-Mar-2005 | Fundamental frequency estimation for noisy speech based on instantaneous amplitude and frequency | Ishimoto, Yuichi; Unoki, Masashi; Akagi, Masato
Jul-2005 | Development of an F0 control model based on F0 dynamic characteristics for singing-voice synthesis | Saitou, Takeshi; Unoki, Masashi; Akagi, Masato
2006 | A Model-Concept of the Selective Sound Segregation: A Prototype Model for Selective Segregation of Target Instrument Sound from the Mixed Sound of Various Instruments | Unoki, Masashi; Kubo, Masaaki; Haniu, Atsushi; Akagi, Masato
2006 | Multi-channel noise reduction in noisy environments | Li, Junfeng; Akagi, Masato; Suzuki, Yoiti
2006 | A Study on Restoration of Bone-Conducted Speech with MTF-Based and LP-Based Models | Thang, Tat Vu; Kimura, Kenji; Unoki, Masashi; Akagi, Masato
Feb-2006 | A noise reduction system based on hybrid noise estimation technique and post-filtering in arbitrary noise environments | Li, Junfeng; Akagi, Masato
1-Apr-2006 | A study on the effectiveness of estimating vocal tract transfer characteristics with the finite element method [in Japanese] | Nishimoto, Hironori; Akagi, Masato; Kitamura, Tatsuya; Suzuki, Noriko
Jul-2006 | Effect of ITD and component frequencies on perception of alarm signals in noisy environments | Nakanishi, Josaku; Unoki, Masashi; Akagi, Masato
Jul-2006 | Effects of complicated vocal tract shape on vocal tract transfer function | Nishimoto, Hironori; Akagi, Masato
1-Jul-2006 | Noise reduction method based on generalized subtractive beamformer | Li, Junfeng; Akagi, Masato
2007 | Advances for In-Vehicle and Mobile Systems: Noise reduction based on microphone array and post-filtering for robust speech recognition in car environments | Li, Junfeng; Lu, Xugang; Akagi, Masato
Apr-2007 | Limited error based event localizing temporal decomposition and its application to variable-rate speech coding | Nguyen, Phu Chien; Akagi, Masato; Nguyen, Binh Phu
Jul-2007 | Spectral Modification for Voice Gender Conversion using Temporal Decomposition | Nguyen, Binh Phu; Akagi, Masato
Oct-2007 | Speech-to-Singing Synthesis: Converting Speaking Voices to Singing Voices By Controlling Acoustic Features Unique to Singing Voices | Saitou, Takeshi; Goto, Masataka; Unoki, Masashi; Akagi, Masato
Oct-2007 | Improvement of Detectability of Alarm Signal in Noisy Environments by Utilizing Spatial Cues | Uchiyama, Hideaki; Unoki, Masashi; Akagi, Masato
5-Oct-2007 | LP-based method of blind restoration to improve intelligibility of bone-conducted speech | Thang, Tat Vu; Unoki, Masashi; Akagi, Masato
2008 | Comparative evaluation of modulation-transfer-function-based blind restoration of sub-band power envelopes of speech as a front-end processor for automatic speech recognition systems | Lu, Xugang; Unoki, Masashi; Akagi, Masato
1-May-2008 | Analysis of acoustic features unique to singing voices based on a perceptual model of singing-voice-likeness [in Japanese] | Saitou, Takeshi; Tsuji, Naoya; Unoki, Masashi; Akagi, Masato
Jun-2008 | An LP-based blind model for restoring bone-conducted speech | Vu, Thang Tat; Unoki, Masashi; Akagi, Masato
Jun-2008 | Phoneme-based Spectral Voice Conversion Using Temporal Decomposition and Gaussian Mixture Model | Nguyen, Binh Phu; Akagi, Masato
Jun-2008 | A hybrid microphone array post-filter in a diffuse noise field | Li, Junfeng; Akagi, Masato
1-Jun-2008 | A Two-Microphone Noise Reduction Method in Highly Non-stationary Multiple-Noise-Source Environments | LI, Junfeng; AKAGI, Masato; SUZUKI, Yoiti
Jul-2008 | Estimation of local peaks based on particle filter in adverse environments | Tomoike, Seiji; Akagi, Masato
23-Sep-2008 | Psychoacoustically-motivated adaptive β-order generalized spectral subtraction based on data-driven optimization | Li, Junfeng; Jiang, Hui; Akagi, Masato
24-Sep-2008 | High-quality analysis/synthesis method based on Temporal decomposition for speech modification | Nguyen, Binh Phu; Shibata, Takeshi; Akagi, Masato
24-Sep-2008 | Robust front end processing for speech recognition in reverberant environments: Utilization of speech characteristics | Petrick, Rico; Lu, Xugang; Unoki, Masashi; Akagi, Masato; Hoffmann, Ruediger
Oct-2008 | A three-layered model for expressive speech perception | Huang, Chun-Fang; Akagi, Masato
Nov-2008 | Adaptive β-order generalized spectral subtraction for speech enhancement | Li, Junfeng; Sakamoto, Shuichi; Hongo, Satoshi; Akagi, Masato; Suzuki, Yoiti
25-Dec-2008 | Sounds of Asia [in Japanese] | Akagi, Masato
2009 | A functional model of the peripheral auditory system: modeling phase locking and spike generation in the auditory nerve [in Japanese] | Maki, Katuhiro; Akagi, Masato; Hirota, Kaoru
2009 | A flexible spectral modification method based on temporal decomposition and Gaussian mixture model | Nguyen, Binh Phu; Akagi, Masato
1-Mar-2009 | A study on nonlinguistic features in singing and speaking voices by brain activity measurement | Nakamura, Tomohiko; Kitamura, Tatsuya; Akagi, Masato
1-Mar-2009 | An emotional speech recognition system based on multi-layer emotional speech perception model | Aoki, Yuusuke; Huang, Chun-Fang; Akagi, Masato
1-Mar-2009 | An MTF-based Blind Restoration Method for Improving Intelligibility of Bone-conducted Speech | Kinugasa, Kota; Unoki, Masashi; Akagi, Masato
1-Mar-2009 | Effects from Spatial Cues on Detectability of Alarm Signals in Car Environments | Kuroda, Naoki; Li, Junfeng; Iwaya, Yukio; Unoki, Masashi; Akagi, Masato
Apr-2009 | Psychoacoustically-motivated adaptive β-order generalized spectral subtraction for cochlear implant patients | Li, Junfeng; Fu, Qian-Jie; Jiang, Hui; Akagi, Masato
Jul-2009 | An MTF-based method of blind restoration for improving intelligibility of bone-conducted speech | Kinugasa, Kota; Unoki, Masashi; Akagi, Masato
25-Aug-2009 | MTF-based power envelope restoration in noisy reverberant environments | Unoki, Masashi; Yamasaki, Yutaka; Akagi, Masato
8-Sep-2009 | A model of emotional speech perception and its applications [in Japanese] | AKAGI, Masato
9-Sep-2009 | Efficient modeling of temporal structure of speech for applications in voice transformation | Nguyen, Binh Phu; Akagi, Masato
Oct-2009 | Two-stage binaural speech enhancement with Wiener filter based on equalization-cancellation model | Li, Junfeng; Sakamoto, Shuichi; Hongo, Satoshi; Akagi, Masato; Suzuki, Yoiti
2010 | Comparison of Emotion Perception among Different Cultures | Dang, Jianwu; Li, Aijun; Erickson, Donna; Suemitsu, Atsuo; Akagi, Masato; Sakuraba, Kyoko; Minematsu, Nobuaki; Hirose, Keikichi
Mar-2010 | Akagi Laboratory (Japan Advanced Institute of Science and Technology) [in Japanese] | Akagi, Masato
4-Mar-2010 | A study on brain activities elicited by synthesized emotional voices controlled with prosodic features | Hamada, Yasuhiro; Kitamura, Tatsuya; Akagi, Masato
4-Mar-2010 | A study on the IMTF-based filtering for the modulation spectrum of reverberant speech | Morita, Shota; Unoki, Masashi; Akagi, Masato
4-Mar-2010 | Experimental evaluations of TS-BASE/WF in reverberant conditions | Li, Junfeng; Sasaki, Yuuki; Akagi, Masato; Yan, Yonghong
4-Mar-2010 | Pitch perception of complex sounds with varied fundamental frequency and spectral tilt | Ishida, Mai; Akagi, Masato
2-Jun-2010 | Two-stage binaural speech enhancement with Wiener filter for high-quality speech communication | Li, Junfeng; Sakamoto, Shuichi; Hongo, Satoshi; Akagi, Masato; Suzuki, Yôiti
Jul-2010 | A Study on the IMTF-Based Filtering on the Modulation Spectrum of Reverberant Signal | Morita, Shota; Unoki, Masashi; Akagi, Masato
1-Aug-2010 | Recognition of emotional information in speech: how to represent the emotion space [in Japanese] | Akagi, Masato
30-Sep-2010 | A DOA estimation algorithm based on equalization-cancellation theory | Chau, Duc Thanh; Li, Junfeng; Akagi, Masato
1-Oct-2010 | A Hybrid Speech Emotion Recognition System Based on Spectral and Prosodic Features | ZHOU, Yu; LI, Junfeng; SUN, Yanqing; ZHANG, Jianping; YAN, Yonghong; AKAGI, Masato
Nov-2010 | Intelligibility Investigation of Single-Channel Noise Reduction Algorithms for Chinese and Japanese | Li, Junfeng; Yang, Lin; Yan, Yonghong; Thanh, Chau Duc; Akagi, Masato
2011 | An investigation on perceptual line spectral frequency (PLP-LSF) target stability against the vowel neutralization phenomenon | Phung, Trung-Nghia; Luong, Mai Chi; Akagi, Masato
Feb-2011 | An investigation on speech perception over coarticulation | Phung, Trung-Nghia; Luong, Mai Chi; Akagi, Masato
1-Mar-2011 | Study on suitable-architecture of IIR all-pass filter for digital-audio watermarking technique based on cochlear-delay characteristics | KOSUGI, Toshizo; HANIU, Atsushi; MIYAUCHI, Ryota; UNOKI, Masashi; AKAGI, Masato
2-Mar-2011 | Speech perception and recognition: humans hear speech with the brain. What about machines? [in Japanese] | AKAGI, Masato; HANIU, Atsushi
2-Mar-2011 | A binaural model accounting for spatial masking release | Mizukawa, Shinya; Akagi, Masato
2-Mar-2011 | Study on blind estimation of Speech Transmission Index in room acoustics | Ikeda, Tomohiro; Unoki, Masashi; Akagi, Masato
2-Mar-2011 | Study on detectability of target signal by utilizing differences between movements in temporal envelopes of target and background signals | Yano, Yuta; Miyauchi, Ryota; Unoki, Masashi; Akagi, Masato
2-Mar-2011 | Study on MTF-based power envelope restoration in noisy reverberant environments | Morita, Shota; Lu, Xugang; Unoki, Masashi; Akagi, Masato
3-Mar-2011 | Influences of transformed auditory feedback with first three formant frequencies | Shih, Tsungming; Suemitsu, Atsuo; Akagi, Masato
3-Mar-2011 | Towards an intelligent binaural speech enhancement system by integrating meaningful signal extraction | Chau, Duc Thanh; Li, Junfeng; Akagi, Masato
1-Apr-2011 | A study on simultaneous measurement of brain activity during speech perception and production under auditory feedback [in Japanese] | Akagi, Masato
10-May-2011 | Comparative intelligibility investigation of single-channel noise-reduction algorithms for Chinese, Japanese, and English | Li, Junfeng; Yang, Lin; Zhang, Jianping; Yan, Yonghong; Hu, Yi; Akagi, Masato; Loizou, Philipos C.
Jul-2011 | Towards intelligent binaural speech enhancement by meaningful sound extraction | Chau, Duc Thanh; Li, Junfeng; Akagi, Masato
5-Mar-2012 | Study on hearing impression of speaker identification focusing on dynamic features | Izumida, Tsuyoshi; Akagi, Masato
5-Mar-2012 | Speech enhancement technique in noisy reverberant environment using two microphone arrays | Sasaki, Yuuki; Akagi, Masato
6-Mar-2012 | Study on detectability of signals by utilizing differences in their amplitude modulation | Yano, Yuta; Miyauchi, Ryota; Unoki, Masashi; Akagi, Masato
22-Aug-2012 | Privacy protection for speech based on concepts of auditory scene analysis | AKAGI, Masato; IRIE, Yoshihiro
Sep-2012 | A study on restoration of bone-conducted speech in noisy environments with LP-based model and Gaussian mixture model | Phung, Nghia Trung; Unoki, Masashi; Akagi, Masato
Dec-2012 | Speech Emotion Recognition System Based on a Dimensional Approach Using a Three-Layered Model | Elbarougy, Reda; Akagi, Masato
Dec-2012 | A concatenative speech synthesis for monosyllabic languages with limited data | Phung, Trung-Nghia; Luong, Mai Chi; Akagi, Masato
Dec-2012 | Transformation of F0 contours for lexical tones in concatenative speech synthesis of tonal languages | Phung, Trung-Nghia; Luong, Mai Chi; Akagi, Masato
Mar-2013 | A singing voices synthesis system to characterize vocal registers using ARX-LF model | Motoda, Hiroki; Akagi, Masato
Mar-2013 | A Study on individualization of Head-Related Transfer Function in the median plane | Hisatsune, Hideki; Akagi, Masato
15-May-2013 | A study on a new recognition strategy for emotion recognition in speech [in Japanese] | Akagi, Masato
2-Jun-2013 | Exploring auditory aging can exclusively explain Japanese adults' age-related decrease in training effects of American English /r/-/l/ | Kubo, Rieko; Akagi, Masato
8-Jul-2013 | Improve equalization-cancellation-based sound localization in noisy reverberant environments using direct-to-reverberant energy ratio | Chau, Duc Thanh; Li, Junfeng; Akagi, Masato
27-Aug-2013 | Comparative investigation of objective speech intelligibility prediction measures for noise-reduced signals in Mandarin and Japanese | Li, Junfeng; Chen, Fei; Akagi, Masato; Yan, Yonghong
Sep-2013 | Acoustic sound source tracking for a moving object using precise Doppler-shift measurement | Nishie, Suminori; Akagi, Masato
2-Sep-2013 | A Hybrid TTS between Unit Selection and HMM-based TTS under limited data conditions | Phung, Trung-Nghia; Luong, Chi Mai; Akagi, Masato
Oct-2013 | Admissible range for individualization of head-related transfer function in median plane | Akagi, Masato; Hisatsune, Hideki
Oct-2013 | Cross-lingual Speech Emotion Recognition System Based on a Three-Layer Model for Human Perception | Elbarougy, Reda; Akagi, Masato
1-Nov-2013 | Improving Naturalness of HMM-Based TTS Trained with Limited Data by Temporal Decomposition | PHUNG, Trung-Nghia; PHAN, Thanh-Son; VU, Thang Tat; LUONG, Mai Chi; AKAGI, Masato
2014 | Speech recognition in noisy conditions based on speech separation using Non-negative Matrix Factorization | Du, Yuxuan; Akagi, Masato
2014 | Study on Analyzing Individuality of Instrument Sounds Using Non-negative Matrix Factorization | Kobayashi, Keisuke; Morikawa, Daisuke; Akagi, Masato
2014 | Glottal source analysis of emotional speech | Li, Yongwei; Akagi, Masato
2014 | Improving speech emotion dimensions estimation using a three-layer model of human perception | Elbarougy, Reda; Akagi, Masato
2014 | Investigation of objective measures for intelligibility prediction of noise-reduced speech for Chinese, Japanese, and English | Li, Junfeng; Xia, Risheng; Ying, Dongwen; Yan, Yonghong; Akagi, Masato
1-Apr-2014 | Speech privacy protection based on concepts of auditory scene analysis [in Japanese] | Akagi, Masato; Irie, Yoshihiro
1-Apr-2014 | A precise frequency measurement method for F0 estimation of string instruments [in Japanese] | Nishie, Suminori; Akagi, Masato
Jul-2014 | Toward relaying emotional state for speech-to-speech translator: Estimation of emotional state for synthesizing speech with emotion | Akagi, Masato; Elbarougy, Reda
Aug-2014 | Emotional Speech Recognition and Synthesis in Multiple Languages toward Affective Speech-to-Speech Translation System | Akagi, Masato; Han, Xiao; Elbarougy, Reda; Hamada, Yasuhiro; Li, Junfeng
Sep-2014 | Toward relaying an affective Speech-to-Speech translator: Cross-language perception of emotional state represented by emotion dimensions | Elbarougy, Reda; Xiao, Han; Akagi, Masato; Li, Junfeng
1-Oct-2014 | Binaural Sound Source Localization in Noisy Reverberant Environments Based on Equalization-Cancellation Theory | Chau, Thanh-Duc; Li, Junfeng; Akagi, Masato
Feb-2015 | A study on perception of emotional states in multiple languages on Valence-Activation approach | Han, Xiao; Elbarougy, Reda; Akagi, Masato; Li, Junfeng; Ngo, Thi Duyen; Bui, The Duy
1-Sep-2015 | Dependence on age of interference with phoneme perception by first- and second-language speech maskers | Kubo, Rieko; Akagi, Masato; Akahane-Yamada, Reiko
28-Oct-2015 | Toward Improving Estimation Accuracy of Emotion Dimensions in Bilingual Scenario Based on Three-layered Model | LI, Xingfeng; Akagi, Masato
Dec-2015 | Study on method to control fundamental frequency contour related to a position on Valence-Activation space | Hamada, Yasuhiro; Elbarougy, Reda; Xue, Yawen; Akagi, Masato
19-Dec-2015 | Emotional speech synthesis system based on a three-layered model using a dimensional approach | Xue, Yawen; Hamada, Yasuhiro; Akagi, Masato
2016 | Voice Conversion to Emotional Speech based on Three-layered Model in Dimensional Approach and Parameterization of Dynamic Features in Prosody | Xue, Yawen; Hamada, Yasuhiro; Akagi, Masato
Mar-2016 | A study on quality improvement of HMM-based synthesized voices using asymmetric bilinear model | Dinh-Anh, Tuan; Morikawa, Daisuke; Akagi, Masato
Mar-2016 | Automatic Speech Emotion Recognition in Chinese Using a Three-layered Model in Dimensional Approach | Li, Xingfeng; Akagi, Masato
Mar-2016 | A study on applying target prediction model to parameterize power envelope of emotional speech | Xue, Yawen; Akagi, Masato
21-Aug-2016 | Effects of speaker's and listener's acoustic environments on speech intelligibility and annoyance | Kubo, Rieko; Morikawa, Daisuke; Akagi, Masato
Oct-2016 | Voice conversion system to emotional speech in multiple languages based on three-layered model for dimensional space | Xue, Yawen; Hamada, Yasuhiro; Elbarougy, Reda; Akagi, Masato
Oct-2016 | Quality improvement of HMM-based synthesized speech based on decomposition of naturalness and intelligibility using non-negative matrix factorization | Dinh, Anh-Tuan; Akagi, Masato
18-Oct-2016 | Optimizing Fuzzy Inference Systems for Improving Speech Emotion Recognition | Elbarougy, Reda; Akagi, Masato
12-Nov-2016 | Quality Improvement of Vietnamese HMM-Based Speech Synthesis System Based on Decomposition of Naturalness and Intelligibility using Non-negative Matrix Factorization | Dinh, Anh-Tuan; Phan, Thanh-Son; Akagi, Masato
2017 | Acoustical Analyses of Tendencies of Intelligibility in Lombard Speech with Different Background Noise Levels | Ngo, Thuan Van; Kubo, Rieko; Morikawa, Daisuke; Akagi, Masato
2-Mar-2017 | Acoustical analyses of Lombard speech by different background noise levels for tendencies of intelligibility | Ngo, Thuan Van; Kubo, Rieko; Morikawa, Daisuke; Akagi, Masato
2-Mar-2017 | Articulatory Characteristics of Expressive Speech in Activation-Evaluation Space | Asai, Takuya; Suemitsu, Atsuo; Akagi, Masato
1-Jun-2017 | Construction of a Story Teller System using a human speech simulator [in Japanese] | Akagi, Masato
26-Oct-2017 | Weighted Robust Principal Component Analysis with Gammatone Auditory Filterbank for Singing Voice Separation | Li, Feng; Akagi, Masato
1-Nov-2017 | Feature Selection Method for Real-time Speech Emotion Recognition | Elbarougy, Reda; Akagi, Masato
15-Dec-2017 | Speech Emotion Recognition Using Multichannel Parallel Convolutional Recurrent Neural Networks based on Gammatone Auditory Filterbank | Peng, Zhichao; Zhu, Zhi; Unoki, Masashi; Dang, Jianwu; Akagi, Masato
2018 | Unsupervised Singing Voice Separation Based on Robust Principal Component Analysis Exploiting Rank-1 Constraint | Li, Feng; Akagi, Masato
2018 | A Three-Layer Emotion Perception Model for Valence and Arousal-Based Detection from Multilingual Speech | Li, Xingfeng; Akagi, Masato
5-Mar-2018 | Perceptual grouping with prosodic features in Japanese dialects | Zhang, Ling; Akagi, Masato
6-Mar-2018 | Study on differences between perceptions of Japanese and Chinese emotional speech by Japanese and Chinese listeners | Zhang, Chenyi; Akagi, Masato
7-Mar-2018 | Non-parallel training dictionary-based voice conversion with Variational Autoencoder | Vu, Ho-Tuan; Akagi, Masato
7-Mar-2018 | Synthesis of expressive singing voice by F0, amplitude envelope and spectral feature conversion | Nguyen, Thi-Hao; Akagi, Masato
7-Mar-2018 | Estimation of glottal source waveform and vocal tract shape for singing-voice analysis | Takahashi, Kyoko; Akagi, Masato
19-Jul-2018 | Voice conversion for emotional speech: Rule-based synthesis with degree of emotion controllable in dimensional space | Xue, Yawen; Hamada, Yasuhiro; Akagi, Masato
25-Jul-2018 | Nonparallel Dictionary-Based Voice Conversion Using Variational Autoencoder with Modulation-Spectrum-Constrained Training | Ho, Tuan Vu; Akagi, Masato
26-Jul-2018 | Auditory-Inspired End-to-End Speech Emotion Recognition Using 3D Convolutional Recurrent Neural Networks Based on Spectral-Temporal Representation | Peng, Zhichao; Zhu, Zhi; Unoki, Masashi; Dang, Jianwu; Akagi, Masato
22-Aug-2018 | Contributions of the glottal source and vocal tract cues to emotional vowel perception in the valence-arousal space | Li, Yongwei; Li, Junfeng; Akagi, Masato
11-Sep-2018 | Commonalities of Glottal Sources and Vocal Tract Shapes Among Speakers in Emotional Speech | Li, Yongwei; Sakakibara, Ken-Ichi; Morikawa, Daisuke; Akagi, Masato
15-Nov-2018 | Maximal Information Coefficient and Predominant Correlation-Based Feature Selection Toward A Three-Layer Model for Speech Emotion Recognition | Li, Xingfeng; Akagi, Masato
15-Nov-2018 | Estimation of glottal source waveforms and vocal tract shape for singing voices with wide frequency range | Takahashi, Kyoko; Akagi, Masato
15-Nov-2018 | Unsupervised Singing Voice Separation Using Gammatone Auditory Filterbank and Constraint Robust Principal Component Analysis | Li, Feng; Akagi, Masato
2019 | The Contribution of Acoustic Features Analysis to Model Emotion Perceptual Process for Language Diversity | Li, Xingfeng; Akagi, Masato
6-Mar-2019 | Study on Perception of Speaker Age by Semantic Differential Method | Li, Yang; Kobayashi, Maori; Akagi, Masato
6-Mar-2019 | Variation of Formant Amplitude and Frequencies in Vowel Spectrum uttered under Various Noisy Environments | Matsumoto, Shumpei; Akagi, Masato
6-Mar-2019 | Study on Relationship between Degree of Emphasis and Acoustic Feature for Synthesizing Emphasized Speech | Ohtani, Yasuhiro; Akagi, Masato
6-Mar-2019 | Relationship between discomfort sound and its physical correlates | Takahashi, Yumiko; Akagi, Masato
7-Mar-2019 | Study on Nonlinear Relationships between Semantic Primitives and Emotional Dimensions for Improving Three-layered Model | Liu, Xingyu; Elbarougy, Reda Elsaid; Akagi, Masato
7-Mar-2019 | Study on Relations between Emotion Perception and Acoustic Features using Speech Morphing Techniques | Wang, Zi; Kobayashi, Maori; Akagi, Masato
3-Apr-2019 | Improving multilingual speech emotion recognition by combining acoustic features in a three-layer model | Li, Xingfeng; Akagi, Masato
17-Apr-2019 | Blind Monaural Singing Voice Separation Using Rank-1 Constraint Robust Principal Component Analysis and Vocal Activity Detection | Li, Feng; Akagi, Masato
6-May-2019 | Estimation of glottal source waveforms and vocal tract shapes from speech signals based on ARX-LF model | Li, Yongwei; Sakakibara, Ken-Ichi; Akagi, Masato
16-Jul-2019 | Speech Emotion Recognition Based on Speech Segment Using LSTM with Attention Model | Atmaja, Bagus Tris; Akagi, Masato
19-Nov-2019 | Non-parallel Voice Conversion with Controllable Speaker Individuality using Variational Autoencoder | Ho, Tuan Vu; Akagi, Masato
19-Nov-2019 | Speech Emotion Recognition Using Speech Feature and Word Embedding | Atmaja, Bagus Tris; Shirai, Kiyoaki; Akagi, Masato
19-Nov-2019 | Evaluation of the Lombard Effect Model on Synthesizing Lombard Speech in Varying Noise Level Environments with Limited Data | Ngo, Thuan Van; Kubo, Rieko; Akagi, Masato
19-Nov-2019 | Dimensional Emotion Recognition from Speech Using Modulation Spectral Features and Recurrent Neural Network | Peng, Zhichao; Zhu, Zhi; Unoki, Masashi; Dang, Jianwu; Akagi, Masato
20-Nov-2019 | Monaural Singing Voice Separation Using Fusion-Net with Time-Frequency Masking | Li, Feng; Qian, Kaizhi; Hasegawa-Johnson, Mark; Akagi, Masato
14-Dec-2019 | Combining F0 and non-negative constraint robust principal component analysis for singing voice separation | Li, Feng; Akagi, Masato
23-Dec-2019 | Simultaneous Estimation of Glottal Source Waveforms and Vocal Tract Shapes from Speech Signals Based on ARX-LF Model | Li, Yongwei; Sakakibara, Ken-Ichi; Akagi, Masato
20-Jan-2020 | Speech Emotion Recognition Using 3D Convolutions and Attention-Based Sliding Recurrent Networks With Auditory Front-Ends | Peng, Zhichao; Li, Xingfeng; Zhu, Zhi; Unoki, Masashi; Dang, Jianwu; Akagi, Masato
22-Jan-2020 | Effect of articulatory and acoustic features on the intelligibility of speech in noise: an articulatory synthesis study | Ngo, Thuanvan; Akagi, Masato; Birkholz, Peter
20-Feb-2020 | Study on relationship between warmness of speech and valence, activation or dominance | Miyagawa, Natsumi; Akagi, Masato
20-Feb-2020 | Influence of auditory feedback on uttering vowel speech in noisy environment | Nishigaki, Tomoya; Akagi, Masato
May-2020 | Multitask Learning and Multistage Fusion for Dimensional Audiovisual Emotion Recognition | Atmaja, Bagus Tris; Akagi, Masato
1-May-2020 | Mimicking Lombard Effect: An Analysis and Reconstruction | Ngo, Thuan Van; Kubo, Rieko; Akagi, Masato
25-May-2020 | The Effect of Silence Feature in Dimensional Speech Emotion Recognition | Atmaja, Bagus Tris; Akagi, Masato
27-May-2020 | Dimensional speech emotion recognition from speech features and word embeddings by using multitask learning | Atmaja, Bagus Tris; Akagi, Masato
1-Jul-2020 | A Two-Stage Phase-Aware Approach for Monaural Multi-Talker Speech Separation | Yin, Lu; Li, Junfeng; Yan, Yonghong; Akagi, Masato
Oct-2020 | Segment-level Effects of Gender, Nationality and Emotion Information on Text-independent Speaker Verification | Li, Kai; Akagi, Masato; Wu, Yibo; Dang, Jianwu
Oct-2020 | Comparison of glottal source parameter values in emotional vowels | Li, Yongwei; Tao, Jianhua; Liu, Bin; Erickson, Donna; Akagi, Masato
9-Oct-2020 | Acoustic and articulatory analysis and synthesis of shouted vowels | Xue, Yawen; Marxen, Michael; Akagi, Masato; Birkholz, Peter
30-Oct-2020 | Non-parallel Voice Conversion based on Hierarchical Latent Embedding Vector Quantized Variational Autoencoder | Ho, Tuan Vu; Akagi, Masato
1-Nov-2020 | Continuous Audiovisual Emotion Recognition Using Feature Selection and LSTM | Elbarougy, Reda; Atmaja, Bagus Tris; Akagi, Masato
6-Nov-2020 | Improving Valence Prediction in Dimensional Speech Emotion Recognition Using Linguistic Information | Atmaja, Bagus Tris; Akagi, Masato
18-Nov-2020 | On The Differences Between Song and Speech Emotion Recognition: Effect of Feature Sets, Feature Types, and Classifiers | Atmaja, Bagus Tris; Akagi, Masato
19-Nov-2020 | Predicting Valence and Arousal by Aggregating Acoustic Features for Acoustic-Linguistic Information Fusion | Atmaja, Bagus Tris; Hamada, Yasuhiro; Akagi, Masato
19-Nov-2020 | Two-stage dimensional emotion recognition by fusing predictions of acoustic and text networks using SVM | Atmaja, Bagus Tris; Akagi, Masato
2-Mar-2021 | Cross-Lingual Voice Conversion With Controllable Speaker Individuality Using Variational Autoencoder and Star Generative Adversarial Network | Ho, Tuan Vu; Akagi, Masato
1-Oct-2021 | Increasing speech intelligibility and naturalness in noise based on concepts of modulation spectrum and modulation transfer function | Ngo, Thuanvan; Kubo, Rieko; Akagi, Masato
15-Oct-2021 | F0-Noise-Robust Glottal Source and Vocal Tract Analysis Based on ARX-LF Model | Li, Yongwei; Tao, Jianhua; Erickson, Donna; Liu, Bin; Akagi, Masato
Dec-2021 | Hierarchical Prosody Analysis Improves Categorical and Dimensional Emotion Recognition | Li, Xingfeng; Guo, Taiyang; Hu, Xinhui; Xu, Xinkang; Dang, Jianwu; Akagi, Masato
Dec-2021 | Study on Simultaneous Estimation of Glottal Source and Vocal Tract Parameters by ARMAX-LF Model for Speech Analysis/Synthesis | Li, Kai; Unoki, Masashi; Li, Yongwei; Dang, Jianwu; Akagi, Masato
Dec-2021 | Automatic Naturalness Recognition from Acted Speech Using Neural Networks | Atmaja, Bagus Tris; Sasou, Akira; Akagi, Masato
6-Mar-2022 | Acoustic features correlated to perceived urgency in evacuation announcements | Kobayashi, Maori; Hamada, Yasuhiro; Akagi, Masato
26-Mar-2022 | Survey on bimodal speech emotion recognition from acoustic and linguistic information fusion | Atmaja, Bagus Tris; Sasou, Akira; Akagi, Masato
7-Jul-2022 | Speech Emotion and Naturalness Recognitions With Multitask and Single-Task Learnings | Atmaja, Bagus Tris; Sasou, Akira; Akagi, Masato
Sep-2022 | Speak Like a Professional: Increasing Speech Intelligibility by Mimicking Professional Announcer Voice with Voice Conversion | Ho, Tuan Vu; Kobayashi, Maori; Akagi, Masato
Sep-2022 | Data Augmentation Using McAdams-Coefficient-Based Speaker Anonymization for Fake Audio Detection | Li, Kai; Li, Sheng; Lu, Xugang; Akagi, Masato; Liu, Meng; Zhang, Lin; Zeng, Chang; Wang, Longbiao; Dang, Jianwu; Unoki, Masashi
Sep-2022 | Vector-quantized Variational Autoencoder for Phase-aware Speech Enhancement | Ho, Tuan Vu; Nguyen, Quoc Huy; Akagi, Masato; Unoki, Masashi
26-Jun-2023 | Music Theory-inspired Acoustic Representation for Speech Emotion Recognition | Li, Xingfeng; Shi, Xiaohan; Hu, Desheng; Li, Yongwei; Zhang, Qingchen; Wang, Zhengxia; Unoki, Masashi; Akagi, Masato
31-Oct-2023 | Increasing Speech Intelligibility by Mimicking Professional Announcers' Voices and Its Physical Correlates | Tran, Dung Kim; Akagi, Masato; Unoki, Masashi

 


Contact: Japan Advanced Institute of Science and Technology, Research Promotion Division, Library Information Section