Using tropical reef, bird and unrelated sounds for superior transfer learning in marine bioacoustics

Ben Williams*, Bart Van Merriënboer, Vincent Dumoulin, Jenny Hamer, Abram B. Fleishman, Matthew McKown, Jill Munger, Aaron N. Rice, Ashlee Lillis, Clemency White, Catherine Hobbs, Tries Razak, David Curnick, Kate E. Jones, Tom Denton

*Corresponding author for this work

Research output: Contribution to journal > Article (Academic Journal) > peer-review

10 Citations (Scopus)

Abstract

Machine learning has the potential to revolutionize passive acoustic monitoring (PAM) for ecological assessments. However, high annotation and computing costs limit the field's adoption. Generalizable pretrained networks can overcome these costs, but high-quality pretraining requires vast annotated libraries, limiting their current development to data-rich bird taxa. Here, we identify the optimum pretraining strategy for data-deficient domains, using tropical reefs as a representative case study. We assembled ReefSet, an annotated library of 57 000 reef sounds taken across 16 datasets, though still modest in scale compared to annotated bird libraries. We performed multiple pretraining experiments and found that pretraining on a library of bird audio 50 times the size of ReefSet provides notably superior generalizability on held-out reef datasets, with a mean area under the receiver operating characteristic curve (AUC-ROC) of 0.881 (±0.11), compared to pretraining on ReefSet itself or unrelated audio, with a mean AUC-ROC of 0.724 (±0.05) and 0.834 (±0.05), respectively. However, our key findings show that cross-domain mixing, where bird, reef and unrelated audio are combined during pretraining, provides superior transfer learning performance, with an AUC-ROC of 0.933 (±0.02). SurfPerch, our optimum pretrained network, provides a strong foundation for automated analysis of tropical reef and related PAM data with minimal annotation and computing costs.

This article is part of the theme issue 'Acoustic monitoring for tropical ecology and conservation'.
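
The AUC-ROC figures quoted in the abstract can be read as the probability that a randomly chosen positive clip receives a higher score than a randomly chosen negative clip. The following is a minimal sketch of that calculation, not the authors' evaluation code: the function names `auc_roc` and `mean_auc`, and the toy labels and scores, are hypothetical and chosen only for illustration.

```python
def auc_roc(labels, scores):
    """AUC-ROC via the pairwise (Mann-Whitney) identity.

    labels: iterable of 0/1 ground-truth values (1 = sound present).
    scores: iterable of classifier scores, higher = more confident.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")

    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0   # positive ranked above negative
            elif p == n:
                wins += 0.5   # ties count as half a win
    return wins / (len(pos) * len(neg))


def mean_auc(datasets):
    """Unweighted mean AUC-ROC over several (labels, scores) pairs."""
    values = [auc_roc(labels, scores) for labels, scores in datasets]
    return sum(values) / len(values)


# Toy data (hypothetical): two small held-out "datasets".
toy_a = ([1, 1, 0, 0], [0.9, 0.4, 0.6, 0.1])
toy_b = ([1, 1, 0, 0], [0.9, 0.8, 0.2, 0.1])
```

A value of 0.5 corresponds to chance-level ranking and 1.0 to perfect separation. The pairwise loop is quadratic in the number of clips, so a rank-based or library implementation is preferable for large evaluation sets.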
Original language: English
Article number: 20240280
Number of pages: 10
Journal: Philosophical Transactions of the Royal Society B: Biological Sciences
Volume: 380
Issue number: 1928
DOIs
Publication status: Published - 12 Jun 2025

Bibliographical note

Publisher Copyright:
© 2025 The Authors.

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 14 - Life Below Water

Keywords

  • bioacoustics
  • coral reef
  • deep learning
  • machine learning
  • marine
  • passive acoustic monitoring
  • soundscape
