Skip to main content

ORIGINAL RESEARCH article

Front. Mar. Sci.
Sec. Marine Fisheries, Aquaculture and Living Resources
Volume 12 - 2025 | doi: 10.3389/fmars.2025.1476616
This article is part of the Research Topic Challenges in Fishery Assessment Methodologies View all 8 articles

Automatic detection, identification and counting of deep-water snappers on underwater baited video using deep learning

Provisionally accepted
Florian Baletaud Florian Baletaud 1,2,3*Sébastien Villon Sébastien Villon 1Antoine Gilbert Antoine Gilbert 2Jean-Marie Côme Jean-Marie Côme 3Sylvie Fiat Sylvie Fiat 1Corina Iovan Corina Iovan 1Laurent Vigliola Laurent Vigliola 1
  • 1 ENTROPIE, Institut de Recherche pour le Développement (IRD), UR, UNC, IFREMER, CNRS, Centre IRD de Nouméa, 98000 Noumea, New Caledonia
  • 2 Ginger Soproner, Noumea, New Caledonia
  • 3 Burgeap, groupe GINGER, Lyon, France

The final, formatted version of the article will be published soon.

    Deep-sea demersal fisheries in the Pacific have strong commercial, cultural, and recreational value, especially snappers (Lutjanidae) which make the bulk of catches. Yet, managing these fisheries is challenging due to the scarcity of data. Stereo-Baited Remote Underwater Video Stations (BRUVS) can provide valuable quantitative information on fish stocks, but manually processing large amounts of videos is time-consuming and sometimes unrealistic. To address this issue, we used a Regionbased Convolutional Neural Network (Faster R-CNN), a deep learning architecture to automatically detect, identify and count deep-water snappers in BRUVS. Videos were collected in New Caledonia (South Pacific) at depths ranging from 47 to 552 m. Using a dataset of 12,100 an-notations from 11 deep-water snapper species observed in 6,364 images extracted from BRUVS, we obtained good model performance (F-measures >0.7, up to 0.87) for the 6 species with sufficient annotations (Fmeasures >0.7, up to 0.87). The correlation between automatic and manual estimates of fish MaxN abundance in videos was high (0.72 -0.9), but the Faster R-CNN showed an underestimation bias at higher abundances. A semi-automatic protocol where our model supported manual observers in processing BRUVS footage improved performance with a correlation of 0.96 with manual counts and a perfect match (R=1) for some key species.

    Keywords: Deep-water snapper fisheries, artificial intelligence, semi-automatic, BRUVS, Faster R-CNN

    Received: 06 Aug 2024; Accepted: 20 Jan 2025.

    Copyright: © 2025 Baletaud, Villon, Gilbert, Côme, Fiat, Iovan and Vigliola. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

    * Correspondence: Florian Baletaud, ENTROPIE, Institut de Recherche pour le Développement (IRD), UR, UNC, IFREMER, CNRS, Centre IRD de Nouméa, 98000 Noumea, New Caledonia

    Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.