AUTHOR=Zhang Peng, Yu Hong, Li Haiqing, Zhang Xin, Wei Sixue, Tu Wan, Yang Zongyi, Wu Junfeng, Lin Yuanshan TITLE=MSGNet: multi-source guidance network for fish segmentation in underwater videos JOURNAL=Frontiers in Marine Science VOLUME=10 YEAR=2023 URL=https://www.frontiersin.org/journals/marine-science/articles/10.3389/fmars.2023.1256594 DOI=10.3389/fmars.2023.1256594 ISSN=2296-7745 ABSTRACT=
Fish segmentation in underwater videos provides basic data for fish measurements, vital information that supports fish habitat monitoring and fishery resource surveys. However, because of water turbidity and insufficient lighting, fish segmentation in underwater videos suffers from low accuracy and poor robustness. Most previous work has relied on static fish appearance information while ignoring fish motion in underwater videos. Because motion carries additional detail, this paper proposes a method that combines appearance and motion information to guide fish segmentation in underwater videos. First, underwater videos are preprocessed to highlight fish in motion and to obtain high-quality underwater optical flow. Then, a multi-source guidance network (MSGNet) is presented to segment fish in complex underwater videos with degraded visual features. To enhance both fish appearance and motion information, a non-local-based multiple co-attention guidance module (M-CAGM) is applied in the encoder stage, in which the appearance and motion features from the intra-frame salient fish and the moving fish in video sequences are reciprocally enhanced. In addition, a feature adaptive fusion module (FAFM) is introduced in the decoder stage to avoid the accumulation of errors across video sequences caused by blurred fish or inaccurate optical flow. Experiments on three publicly available datasets were designed to test the performance of the proposed model. The mean pixel accuracy (