Information Integration for Neuroscience Team (PI: Kazuyoshi Yoshii)

November 16, 2021 09:01

Description

Sound Scene Understanding Team (https://aip.riken.jp/labs/goalorient_tech/sound_scene_understand/) at RIKEN AIP

Speaker 1: Kazuyoshi Yoshii (45 min)
Title: The Unified Theory of Blind Source Separation Based on Independence, Nonnegativity, and Low-rankness
Abstract: We comprehensively review blind source separation (BSS) methods from a unified theoretical point of view. Independence, nonnegativity, and low-rankness inherent in sound sources, which are the three major clues of BSS, have been used for formulating a probabilistic model of observed mixture signals consisting of a source model representing the time-frequency structures of sources and a spatial model representing the inter-channel covariance structures of the sources. Nonnegative matrix factorization (NMF) is the most basic technique used for single-channel BSS and has successfully been extended to deal with the time, frequency, and/or spatial covariance structures of sources for single- and multi-channel BSS. We organize existing BSS methods in terms of covariance complexities (full-rank, rank-1, and jointly diagonalizable models) over the time, frequency, and channel dimensions. Among them, FastMNMF is considered as the state-of-the-art versatile BSS method with excellent separation performance and computational efficiency in practice.

Speaker 2: Kouhei Sekiguchi (15 min)
Title: Joint Blind Source Separation and Dereverberation with ARMA Models
Abstract: We explain an extension of FastMNMF (ARMA-FastMNMF) for joint blind source separation and dereverberation. The probabilistic model of ARMA-FastMNMF is obtained by integrating the source and spatial models of FastMNMF with AR and MA models representing the early reflections and late reverberations of sources, respectively.

Speaker 3: Mathieu Fontaine (15 min)
Title: Robust Blind Source Separation with Heavy-Tailed Models
Abstract: We comprehensively review heavy-tailed extensions of FastMNMF from a unified theoretical point of view.

Speaker 4: Yoshiaki Bando (15 min)
Title: Semi-supervised Source Separation with Deep Source Models
Abstract: We explain a DNN-based extension of the source model for semi-supervised source separation.

Speaker 5: Aditya Arie Nugraha (15 min)
Title: Unsupervised Source Separation with Deep Spatial Models
Abstract: We explain a DNN-based extension of the spatial model for unsupervised BSS.

Related Laboratories

last updated on April 9, 2026 15:57Laboratory

Sound Scene Understanding Team (2017/4--2026/3)

Sunday	Monday	Tuesday	Wednesday	Thursday	Friday	Saturday
			1st	2nd	3rd	4th
5th	6th	7th	8th	9th	10th	11th
12th	13th	14th	15th	Link to the event page for the 16th	17th	18th
19th	20th	21th	Link to the event page for the 22th	Link to the event page for the 23th	24th	25th
26th	27th	28th	29th	Link to the event page for the 30th	31th

Center for Advanced Intelligence Project

Videos