A study published in the journal Information Sciences introduces a novel framework for speech emotion recognition using dual-channel spectrograms and optimized deep features. Their proposed ...
Abstract: In this paper, a novel endpoint detection algorithm based on spectrogram row self-correlation is proposed. First, the original speech signals are converted into a speech spectrogram. In ...
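As a rough illustration of the first step this abstract mentions (converting a speech signal into a spectrogram), here is a minimal sketch. It assumes a Hann-windowed short-time Fourier transform computed with NumPy; the function name `spectrogram`, the frame length, the hop size, and the synthetic test tone are all hypothetical choices for illustration, not taken from the paper. The row self-correlation step is not shown because the snippet does not describe it.

```python
import numpy as np


def spectrogram(signal, frame_len=256, hop=128):
    """Magnitude spectrogram from a Hann-windowed short-time Fourier transform.

    Returns an array of shape (frame_len // 2 + 1, n_frames):
    rows are frequency bins, columns are time frames.
    Parameter defaults are illustrative assumptions, not values from the paper.
    """
    signal = np.asarray(signal, dtype=float)
    if signal.size < frame_len:
        raise ValueError("signal is shorter than one frame")
    window = np.hanning(frame_len)
    n_frames = 1 + (signal.size - frame_len) // hop
    # Slice the signal into overlapping frames and apply the window to each.
    frames = np.stack(
        [signal[i * hop : i * hop + frame_len] * window for i in range(n_frames)]
    )
    # Real FFT of each frame, then transpose so that rows are frequency bins.
    return np.abs(np.fft.rfft(frames, axis=1)).T


# Synthetic stand-in for speech: one second of a 1 kHz tone sampled at 8 kHz.
sample_rate = 8000
t = np.arange(sample_rate) / sample_rate
tone = np.sin(2 * np.pi * 1000 * t)
spec = spectrogram(tone)
peak_bin = int(spec.mean(axis=1).argmax())
```

With these assumed settings each frequency row spans `sample_rate / frame_len` Hz, so the energy of the test tone concentrates in a single row of `spec`.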
Abstract: In this study, we extend the capability of the method of relative-to-maximum masking (RMM) in speech enhancement by further leveraging the importance of each time-frequency unit in the ...
This study presents a valuable advance in reconstructing naturalistic speech from intracranial ECoG data using a dual-pathway model. The evidence supporting the claims of the authors is solid, ...
We incorporate effective components of TasNet into a frequency-domain separation method. We introduce a solution for directly optimizing the separation criterion in frequency-domain networks. Our exp ...
Recent studies have shown how style transfer can map images from one domain to another. In this project, we attempt to use this technique to embed emotions in spectrogram images.