Abstract: Environmental Sound Recognition (ESR) is an essential task in audio analysis, involving the identification and classification of sounds from various environmental contexts. This study ...
Abstract: In this work, we propose CleanMel, a single-channel Mel-spectrogram denoising and dereverberation network for improving both speech quality and automatic speech recognition (ASR) performance ...
All the datasets must be located in the datasets folder. This folder should contain the following subfolders after downloading the datasets: GTZAN Speech_Music: Contains the GTZAN Speech Music dataset ...
CATS is a signal processing technique and framework for detecting and denoising sparse signals in the time-frequency domain. It is particularly useful for processing earthquake data. This work is still in ...