School of Electrical Engineering, KAIST, Daejeon, Republic of Korea
{cdy3773, jwoo}@kaist.ac.kr
[Paper Link] | [Paper PDF]
Abstract
Model Architecture
Performance Comparisons
t-SNE Trajectories
Five Audio Samples
Click a toggle to listen to samples.
All spectograms are the first channel of microphone array.
Note: Audio and images may take a moment to load.
Extraction of each audio from the mixture including 4 sources
in a cuboid room of size [width, length, height] = [5.57, 5.20, 3.79] m with an RT60 of 0.32 s
SI-SNRi Contour Maps
Input Mixture and Sum of Direct/Reverb Components from Direct/Reverb Audio Decoder
Extraction of each audio from the mixture including 5 sources
in a cuboid room of size [width, length, height] = [5.85, 5.89, 3.56] m with an RT60 of 0.44 s
SI-SNRi Contour Maps
Input Mixture and Sum of Direct/Reverb Components from Direct/Reverb Audio Decoder
Extraction of each audio from the mixture including 5 sources
in a cuboid room of size [width, length, height] = [5.39, 7.95, 3.07] m with an RT60 of 0.41 s
SI-SNRi Contour Maps
Input Mixture and Sum of Direct/Reverb Components from Direct/Reverb Audio Decoder
Extraction of each audio from the mixture including 5 sources
in a cuboid room of size [width, length, height] = [7.15, 5.25, 3.68] m with an RT60 of 0.29 s
SI-SNRi Contour Maps
Input Mixture and Sum of Direct/Reverb Components from Direct/Reverb Audio Decoder
Extraction of each audio from the mixture including 5 sources
in a cuboid room of size [width, length, height] = [7.66, 7.39, 3.07] m with an RT60 of 0.25 s
SI-SNRi Contour Maps
Input Mixture and Sum of Direct/Reverb Components from Direct/Reverb Audio Decoder
Poster presented at ICASSP 2026