Sobieraj, Iwona, Kong, Qiuqiang and Plumbley, Mark D. (2017) Masked Non-negative Matrix Factorization for Bird Detection Using Weakly Labelled Data. In: 25th European Signal Processing Conference, 2017 (EUSIPCO-2017), 28 Aug- 2 Sep 2017, Kos Island, Greece.

Abstract Acoustic monitoring of bird species is an increasingly important field in signal processing. Many available bird sound datasets do not contain exact timestamp of the bird call but have a coarse weak label instead. Traditional Non-negative Matrix Factorization (NMF) models are not well designed to deal with weakly labeled data. In this paper we propose a novel Masked Non-negative Matrix Factorization (Masked NMF) approach for bird detection using weakly labeled data. During dictionary extraction we introduce a binary mask on the activation matrix. In that way we are able to control which parts of dictionary are used to reconstruct the training data. We compare our method with conventional NMF approaches and current state of the art methods. The proposed method outperforms the NMF baseline and offers a parsimonious model for bird detection on weakly labeled data. Moreover, to our knowledge, the proposed Masked NMF achieved the best result among non-deep learning methods on a test dataset used for the recent Bird Audio Detection Challenge.

Link to full paper ⤧  Next post DCASE 2017 challenge success ⤧  Previous post Kong et al. (2017a)