Semi-supervised Sound Event Detection using Random Augmentation and Consistency Regularization

Li, Xiaofei

Full-text links:

Download:

Current browse context:

eess.AS

< prev | next >

new | recent | 2102

Electrical Engineering and Systems Science > Audio and Speech Processing

Title: Semi-supervised Sound Event Detection using Random Augmentation and Consistency Regularization

Authors: Xiaofei Li

(Submitted on 30 Jan 2021)

Abstract: Sound event detection is a core module for acoustic environmental analysis. Semi-supervised learning technique allows to largely scale up the dataset without increasing the annotation budget, and recently attracts lots of research attention. In this work, we study on two advanced semi-supervised learning techniques for sound event detection. Data augmentation is important for the success of recent deep learning systems. This work studies the audio-signal random augmentation method, which provides an augmentation strategy that can handle a large number of different audio transformations. In addition, consistency regularization is widely adopted in recent state-of-the-art semi-supervised learning methods, which exploits the unlabelled data by constraining the prediction of different transformations of one sample to be identical to the prediction of this sample. This work finds that, for semi-supervised sound event detection, consistency regularization is an effective strategy, especially the best performance is achieved when it is combined with the MeanTeacher model.

Subjects:	Audio and Speech Processing (eess.AS); Sound (cs.SD)
Cite as:	arXiv:2102.00154 [eess.AS]
	(or arXiv:2102.00154v1 [eess.AS] for this version)

Submission history

From: Xiaofei Li [view email]
[v1] Sat, 30 Jan 2021 05:22:13 GMT (50kb)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> eess > arXiv:2102.00154

Download:

Current browse context:

Change to browse by:

References & Citations

Bookmark

Electrical Engineering and Systems Science > Audio and Speech Processing

Title: Semi-supervised Sound Event Detection using Random Augmentation and Consistency Regularization

Submission history