JAIST Repository: Combining F0 and non-negative constraint robust principal component analysis for singing voice separation

Please use this identifier to cite or link to this item: http://hdl.handle.net/10119/18018

Title:	Combining F0 and non-negative constraint robust principal component analysis for singing voice separation
Authors:	Li, Feng; Akagi, Masato
Keywords:	Singing voice separation; Robust principal component analysis; Non-negative rank-1 constraint; F0
Issue Date:	2019-12-14
Publisher:	Elsevier
Journal:	Signal Processing
Volume:	170
Start Page:	107432
DOI:	10.1016/j.sigpro.2019.107432
Abstract:	Separating singing voice from a musical mixture remains an important task in the field of music information retrieval. Recent studies on singing voice separation have shown that robust principal component analysis (RPCA) with a rank-1 constraint can improve separation quality. However, separation performance is limited because the vocal part cannot be described well by the separated matrix. Therefore, prior information such as fundamental frequency (F0) should be considered. F0 can significantly improve separation performance by removing the spectral components of non-repeating instruments (e.g., bass and guitar). In this paper, we propose a novel singing voice separation algorithm that combines prior information with non-negative constraint RPCA: it incorporates F0 and a non-negative rank-1 constraint on the minimization of singular values in RPCA instead of minimizing the nuclear norm. In addition, we use original phase recovery in estimating the spectral components of the separated singing voice. Experimental results on the iKala and MIR-1K datasets show that the proposed algorithm achieves higher separation accuracy than state-of-the-art methods.
Rights:	Copyright (C)2019, Elsevier. Licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International license (CC BY-NC-ND 4.0). [http://creativecommons.org/licenses/by-nc-nd/4.0/] NOTICE: This is the author’s version of a work accepted for publication by Elsevier. Changes resulting from the publishing process, including peer review, editing, corrections, structural formatting and other quality control mechanisms, may not be reflected in this document. Changes may have been made to this work since it was submitted for publication. A definitive version was subsequently published in Feng Li and Masato Akagi, Signal Processing, 170, 2019, 107432, http://dx.doi.org/10.1016/j.sigpro.2019.107432
URI:	http://hdl.handle.net/10119/18018
Material Type:	author
Appears in Collections:	b10-1. Journal Articles
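
Two ideas named in the abstract, using F0 to decide which spectral components belong to the voice and reusing the original (mixture) phase for the separated voice, can be illustrated with a minimal sketch. This is not the authors' algorithm: the RPCA step with the non-negative rank-1 constraint is omitted entirely, and the function names, the binary harmonic mask, and the `width_bins` parameter are assumptions made only for illustration.

```python
import numpy as np


def harmonic_mask(f0_hz, n_bins, sample_rate, n_fft, width_bins=1):
    """Hypothetical helper: binary mask (n_bins x n_frames) that keeps
    only the frequency bins close to integer multiples of each frame's F0."""
    f0_hz = np.asarray(f0_hz, dtype=float)
    mask = np.zeros((n_bins, f0_hz.size), dtype=bool)
    bin_hz = sample_rate / n_fft  # frequency spacing between STFT bins
    for t, f0 in enumerate(f0_hz):
        if f0 <= 0:  # unvoiced frame (F0 given as 0): keep nothing
            continue
        # Harmonic frequencies f0, 2*f0, ... below the Nyquist frequency.
        harmonics = np.arange(f0, sample_rate / 2, f0)
        centers = np.rint(harmonics / bin_hz).astype(int)
        for c in centers:
            lo = max(c - width_bins, 0)
            hi = min(c + width_bins + 1, n_bins)
            mask[lo:hi, t] = True
    return mask


def apply_with_mixture_phase(vocal_magnitude, mixture_stft):
    """Hypothetical helper: pair an estimated vocal magnitude spectrogram
    with the phase of the original mixture STFT."""
    return vocal_magnitude * np.exp(1j * np.angle(mixture_stft))
```

In a sketch like this, the mask would be multiplied element-wise with a vocal magnitude estimate before the mixture phase is attached; how the paper actually combines F0 with the RPCA output is described in the full text, not in this record.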

Files in This Item:

File	Description	Size	Format
3063.pdf		494Kb	Adobe PDF	View/Open

All items stored in this system are protected by copyright.

Contact: Japan Advanced Institute of Science and Technology, Research Promotion Division, Library Information Section (ir-sys[at]ml.jaist.ac.jp)