Skip to main content

ORIGINAL RESEARCH article

Front. Mar. Sci.
Sec. Coastal Ocean Processes
Volume 12 - 2025 | doi: 10.3389/fmars.2025.1539210
This article is part of the Research Topic Climate Change and Human Impact: Assessing Vulnerability and Intensification of Hazards in Estuarine and Coastal Zones View all 9 articles

Marine object detection in forward-looking sonar images via semantic-spatial feature enhancement

Provisionally accepted
  • 1 Electronic Information, Xijing University, Xi'an, China
  • 2 Geography and Remote Sensing, Hohai University, Nanjing, Jiangsu Province, China

The final, formatted version of the article will be published soon.

    Forward-looking sonar object detection plays a vital role in marine applications such as underwater navigation, surveillance, and exploration, serving as an essential underwater acoustic detection method. However, the challenges posed by seabed reverberation noise, complex marine environments, and varying object scales significantly hinder accurate detection of diverse object categories. To overcome these challenges, we propose a novel semantic-spatial feature enhanced detection model, namely YOLO-SONAR, tailored for marine object detection in forwardlooking sonar imagery. Specifically, we introduce the competitive coordinate attention mechanism (CCAM) and the spatial group enhance attention mechanism (SGEAM), both integrated into the backbone network to effectively capture semantic and spatial features within sonar images, while feature fusion is employed to suppress complex marine background noise. To address the detection of small-scale marine objects, we develop a context feature extraction module (CFEM), which enhances feature representation for tiny object regions by integrating multi-scale contextual information. Furthermore, we adopt the Wise-IoUv3 loss function to mitigate the issue of class imbalance within marine sonar datasets and stabilize the model training process.Experimental evaluations conducted on real-world forward-looking sonar datasets, MDFLS and WHFLS, demonstrate that the proposed detection model outperforms other state-of-the-art methods, achieving an average precision (mAP) of 81.96% on MDFLS and 82.30% on WHFLS, which are improvements of 7.65% and 12.89%, respectively, over the best-performing existing methods. These findings highlight the potential of our approach to significantly advance marine object detection technologies, facilitating more efficient underwater exploration and monitoring.

    Keywords: Marine object detection, Forward-looking sonar, semantic-spatial feature enhancement, attention mechanism, Feature fusion

    Received: 04 Dec 2024; Accepted: 21 Jan 2025.

    Copyright: © 2025 Wang, Guo, Zhang and Xu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

    * Correspondence: Nan Xu, Geography and Remote Sensing, Hohai University, Nanjing, 210098, Jiangsu Province, China

    Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.