The final, formatted version of the article will be published soon.
METHODS article
Front. Ecol. Evol.
Sec. Population, Community, and Ecosystem Dynamics
Volume 12 - 2024 |
doi: 10.3389/fevo.2024.1409407
The use of BirdNET embeddings as a fast solution to find novel sound classes in audio recordings
Provisionally accepted- James Cook University, Townsville, Australia
Passive acoustic monitoring has emerged as a useful technique for monitoring vocal species and contributing to biodiversity monitoring goals. However, finding target sounds for species without pre-existing recognisers still proves challenging. Here, we demonstrate how the embeddings from the large acoustic model BirdNET can be used to quickly and easily find new sound classes outside the original model's training set. We outline the general workflow, and present three case studies covering a range of ecological use cases that we believe are common requirements in research and management: monitoring invasive species, generating species lists, and detecting threatened species. In all cases, a minimal amount of target class examples and validation effort was required to obtain results applicable to the desired application. The demonstrated success of this method across different datasets and different taxonomic groups suggests a wide applicability of BirdNET embeddings for finding novel sound classes. We anticipate this method will allow easy and rapid detection of sound classes for which no current recognisers exist, contributing to both monitoring and conservation goals.
Keywords: acoustic recognition, bioacoustics, Biodiversity, conservation, deep learning, Passive acoustic monitoring
Received: 30 Mar 2024; Accepted: 08 Nov 2024.
Copyright: © 2024 Allen-Ankins, Hoefer, Bartholomew, Brodie and Schwarzkopf. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence:
Slade Allen-Ankins, James Cook University, Townsville, Australia
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.