Windy events detection in big bioacoustics datasets using a pre-trained Convolutional Neural Network.

Bioacoustics Deep learning Ecoacoustics Passive Acoustic Monitoring Soundscape ecology Wind-noise

Journal

The Science of the total environment
ISSN: 1879-1026
Titre abrégé: Sci Total Environ
Pays: Netherlands
ID NLM: 0330500

Informations de publication

Date de publication:
19 Jul 2024
Historique:
received: 06 03 2024
revised: 16 07 2024
accepted: 16 07 2024
medline: 22 7 2024
pubmed: 22 7 2024
entrez: 21 7 2024
Statut: aheadofprint

Résumé

Passive Acoustic Monitoring (PAM), which involves using autonomous record units for studying wildlife behaviour and distribution, often requires handling big acoustic datasets collected over extended periods. While these data offer invaluable insights about wildlife, their analysis can present challenges in dealing with geophonic sources. A major issue in the process of detection of target sounds is represented by wind-induced noise. This can lead to false positive detections, i.e., energy peaks due to wind gusts misclassified as biological sounds, or false negative, i.e., the wind noise masks the presence of biological sounds. Acoustic data dominated by wind noise makes the analysis of vocal activity unreliable, thus compromising the detection of target sounds and, subsequently, the interpretation of the results. Our work introduces a straightforward approach for detecting recordings affected by windy events using a pre-trained convolutional neural network. This process facilitates identifying wind-compromised data. We consider this dataset pre-processing crucial for ensuring the reliable use of PAM data. We implemented this preprocessing by leveraging YAMNet, a deep learning model for sound classification tasks. We evaluated YAMNet as-is ability to detect wind-induced noise and tested its performance in a Transfer Learning scenario by using our annotated data from the Stony Point Penguin Colony in South Africa. While the classification of YAMNet as-is achieved a precision of 0.71, and recall of 0.66, those metrics strongly improved after the training on our annotated dataset, reaching a precision of 0.91, and recall of 0.92, corresponding to a relative increment of >28 %. Our study demonstrates the promising application of YAMNet in the bioacoustics and ecoacoustics fields, addressing the need for wind-noise-free acoustic data. We released an open-access code that, combined with the efficiency and peak performance of YAMNet, can be used on standard laptops for a broad user base.

Identifiants

pubmed: 39034006
pii: S0048-9697(24)05017-4
doi: 10.1016/j.scitotenv.2024.174868
pii:
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

174868

Informations de copyright

Copyright © 2024. Published by Elsevier B.V.

Auteurs

Francesca Terranova (F)

Department of Life Sciences and Systems Biology, University of Turin, Turin, Italy. Electronic address: francesca.terranova@unito.it.

Lorenzo Betti (L)

Department of Network and Data Science, Central European University, Vienna, Austria.

Valeria Ferrario (V)

Department of Life Sciences and Systems Biology, University of Turin, Turin, Italy; Chester Zoo, Caughall Road, Chester, UK.

Olivier Friard (O)

Department of Life Sciences and Systems Biology, University of Turin, Turin, Italy.

Katrin Ludynia (K)

Southern African Foundation for the Conservation of Coastal Birds (SANCCOB), Cape Town, South Africa; Department of Biodiversity and Conservation Biology, University of the Western Cape, Robert Sobukwe Road, Bellville, South Africa.

Gavin Sean Petersen (GS)

Southern African Foundation for the Conservation of Coastal Birds (SANCCOB), Cape Town, South Africa.

Nicolas Mathevon (N)

ENES Bioacoustics Research Lab, CRNL, University of Saint-Etienne, CNRS, Inserm, Saint-Etienne, France; Institut universitaire de France, Ministry of Higher Education, Research and Innovation, 1 rue Descartes, CEDEX 05, Paris, France; Ecole Pratique des Hautes Etudes, CHArt lab, PSL University, Paris, France.

David Reby (D)

ENES Bioacoustics Research Lab, CRNL, University of Saint-Etienne, CNRS, Inserm, Saint-Etienne, France; Institut universitaire de France, Ministry of Higher Education, Research and Innovation, 1 rue Descartes, CEDEX 05, Paris, France.

Livio Favaro (L)

Department of Life Sciences and Systems Biology, University of Turin, Turin, Italy; Stazione Zoologica Anton Dohrn, Naples, Italy.

Classifications MeSH