Conference paper, Year: 2022

Latent and Adversarial Data Augmentation for Sound Event Detection and Classification

Abstract

Invariance-based learning is a promising approach in deep learning. Among other benefits, it can mitigate the lack of diversity of available datasets and increase the interpretability of trained models. To this end, practitioners often use a consistency cost penalizing the sensitivity of a model to a set of carefully selected data augmentations. However, there is no consensus about how these augmentations should be selected. In this paper, we study the behavior of several augmentation strategies. We consider the task of sound event detection and classification for our experiments. In particular, we show that transformations operating on the internal layers of a deep neural network are beneficial for this task.
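
To make the idea of a consistency cost concrete, here is a minimal sketch, not the authors' implementation. It assumes a toy two-layer network and additive Gaussian noise as the perturbation applied to the internal (latent) layer; the function names, layer sizes, and the `noise_std` parameter are all hypothetical choices made for illustration.

```python
import numpy as np


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def encode(x, w1):
    """First layer: map input features to a latent representation."""
    return np.tanh(x @ w1)


def classify(h, w2):
    """Output layer: map latent features to per-class probabilities."""
    return sigmoid(h @ w2)


def consistency_cost(x, w1, w2, rng, noise_std=0.1):
    """Mean squared gap between predictions on clean and perturbed latents."""
    h = encode(x, w1)
    # Assumed latent augmentation: additive Gaussian noise on the hidden layer.
    h_aug = h + noise_std * rng.standard_normal(h.shape)
    p_clean = classify(h, w2)
    p_aug = classify(h_aug, w2)
    return float(np.mean((p_clean - p_aug) ** 2))


rng = np.random.default_rng(0)
x = rng.standard_normal((4, 8))           # 4 hypothetical clips, 8 features each
w1 = rng.standard_normal((8, 16)) * 0.1   # hypothetical encoder weights
w2 = rng.standard_normal((16, 3)) * 0.1   # 3 hypothetical sound classes
cost = consistency_cost(x, w1, w2, rng)
print(cost)
```

In a training loop, a term like this would typically be added to the supervised loss with a weighting factor, so that the model is penalized when its predictions change under the chosen augmentation.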
Main file
dcase.pdf (178.96 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-03782827, version 1 (21-09-2022)

Identifiers

  • HAL Id: hal-03782827, version 1

Cite

David Perera, Slim Essid, Gaël Richard. Latent and Adversarial Data Augmentation for Sound Event Detection and Classification. International Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE), Nov 2022, Nancy, France. ⟨hal-03782827⟩
206 views
151 downloads
