Deep Learning and Domain Transfer for Orca Vocalization Detection

Paul Best,Maxence Ferrari,Marion Poupard,Sébastien Paris,Ricard Marxer,Helena Symonds,Paul Spong,Hervé Glotin

Deep Learning and Domain Transfer for Orca Vocalization Detection

2020

In this paper, we study the difficulties of domain transfer when training deep learning models, on a specific task that is orca vocalization detection. Deep learning appears to be an answer to many sound recognition tasks in human speech analysis as well as in bioacoustics. This method allows to learn from large amounts of data, and find the best scoring way to discriminate between classes (e.g. orca vocalization and other sounds). However, to learn the perfect data representation and discrimination boundaries, all possible data configurations need to be processed. This causes problems when those configurations are ever changing (e.g. in our experiment, a change in the recording system happened to considerably disturb our previously well performing model). We thus explore approaches to compensate on the difficulties faced with domain transfer, with two convolutionnal neural networks (CNN) architectures, one that works in the time-frequency domain, and one that works directly on the time domain.

Keywords:

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations