Abstract
Machine learning has the potential to revolutionize passive acoustic monitoring (PAM) for ecological assessments. However, high annotation and computing costs limit the field's adoption. Generalizable pretrained networks can overcome these costs, but high-quality pretraining requires vast annotated libraries, limiting their current development to data-rich bird taxa. Here, we identify the optimum pretraining strategy for data-deficient domains, using tropical reefs as a representative case study. We assembled ReefSet, an annotated library of 57 000 reef sounds taken across 16 datasets, though still modest in scale compared to annotated bird libraries. We performed multiple pretraining experiments and found that pretraining on a library of bird audio 50 times the size of ReefSet provides notably superior generalizability on held-out reef datasets, with a mean area under the receiver operating characteristic curve (AUC-ROC) of 0.881 (±0.11), compared to pretraining on ReefSet itself or unrelated audio, with a mean AUC-ROC of 0.724 (±0.05) and 0.834 (±0.05), respectively. However, our key findings show that cross-domain mixing, where bird, reef and unrelated audio are combined during pretraining, provides superior transfer learning performance, with an AUC-ROC of 0.933 (±0.02). SurfPerch, our optimum pretrained network, provides a strong foundation for automated analysis of tropical reef and related PAM data with minimal annotation and computing costs.
This article is part of the theme issue 'Acoustic monitoring for tropical ecology and conservation'.
| Original language | English |
|---|---|
| Article number | 20240280 |
| Number of pages | 10 |
| Journal | Philosophical Transactions of the Royal Society B: Biological Sciences |
| Volume | 380 |
| Issue number | 1928 |
| DOIs | |
| Publication status | Published - 12 Jun 2025 |
Bibliographical note
Publisher Copyright: © 2025 The Authors.
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
- SDG 14 Life Below Water
Keywords
- bioacoustics
- coral reef
- deep learning
- machine learning
- marine
- passive acoustic monitoring
- soundscape
Fingerprint
Dive into the research topics of 'Using tropical reef, bird and unrelated sounds for superior transfer learning in marine bioacoustics'. Together they form a unique fingerprint.