Conference paper. Year: 2024

Trainable signal encoders that are robust against noise

Abstract

Within the deep learning paradigm, finite impulse response (FIR) filters are often used to encode audio signals, yielding flexible and adaptive feature representations. We show that a stabilization of FIR filterbanks with fixed filter lengths (convolutional layers with 1-D filters) leads to encoders that are optimally robust against noise and can be inverted with perfect reconstruction by their transposes. To maintain their flexibility as regular neural network layers, we implement the stabilization via a computationally efficient regularizing term in the objective function of the learning problem. In this way, the encoder keeps its expressive power and remains optimally stable and noise-robust throughout the whole learning procedure. We show, in a denoising task where noise is present both in the input and in the encoder representation, that the proposed stabilization of the trainable filterbank encoder is decisive: it significantly increases the signal-to-noise ratio of the denoised signals compared to a model with a naively trained encoder.
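
The stabilization idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a stride-1 filterbank acting by circular convolution, and all function names (`frame_bounds`, `tightness_penalty`, `encode`, `decode_transpose`) are hypothetical. Under these assumptions the frame bounds A and B are the minimum and maximum over frequency of the summed squared magnitude responses, and the penalty B/A - 1 is used here as an assumed stand-in for the paper's regularizing term. It vanishes exactly when the filterbank is tight, and when A = B = 1 the transpose reconstructs the input perfectly.

```python
import cmath


def dft(h, n):
    """Naive n-point DFT of the filter h, zero-padded to length n."""
    padded = list(h) + [0.0] * (n - len(h))
    return [
        sum(padded[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]


def frame_bounds(filters, n):
    """Frame bounds (A, B) of a stride-1 circular filterbank on length-n signals.

    Assumption: with stride 1 and circular convolution, the frame operator is
    diagonal in the Fourier basis, with entries sum_j |h_j^(k)|^2.
    """
    responses = [dft(h, n) for h in filters]
    energy = [sum(abs(r[k]) ** 2 for r in responses) for k in range(n)]
    return min(energy), max(energy)


def tightness_penalty(filters, n):
    """Hypothetical regularizer: condition number B/A minus 1 (zero iff tight)."""
    a, b = frame_bounds(filters, n)
    if a == 0.0:
        return float("inf")  # not a frame: some frequency is lost entirely
    return b / a - 1.0


def encode(filters, x):
    """Circular convolution of x with every filter (one output channel each)."""
    n = len(x)
    return [
        [sum(h[t] * x[(i - t) % n] for t in range(len(h))) for i in range(n)]
        for h in filters
    ]


def decode_transpose(filters, coeffs):
    """Apply the transpose (adjoint) of `encode` to the channel coefficients."""
    n = len(coeffs[0])
    return [
        sum(
            h[t] * c[(i + t) % n]
            for h, c in zip(filters, coeffs)
            for t in range(len(h))
        )
        for i in range(n)
    ]


# A two-filter example whose squared magnitude responses sum to 1 everywhere.
tight = [[0.5, 0.5], [0.5, -0.5]]
# A perturbed version that is no longer tight.
loose = [[1.0, 0.5], [0.5, -0.5]]

signal = [1.0, -2.0, 3.0, 0.5, 0.0, 4.0, -1.0, 2.0]
reconstruction = decode_transpose(tight, encode(tight, signal))
```

In a training loop, a term of this kind would be added to the task loss with a small weight, so that the filters stay free to adapt while being pushed toward tightness. The weight and the exact form of the term are design choices not specified in the abstract.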

Domains

Sound [cs.SD]
Main file
balazs2024internoise.pdf (212.89 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-04697953, version 1 (14-09-2024)

Identifiers

  • HAL Id: hal-04697953, version 1

Cite

Peter Balazs, Daniel Haider, Vincent Lostanlen, Felix Perfler. Trainable signal encoders that are robust against noise. Inter-Noise, Aug 2024, Nantes, France. ⟨hal-04697953⟩