AUTHOR=Prior Jack H. , Campbell Matthew D. , Dawkins Matthew , Mickle Paul F. , Moorhead Robert J. , Alaba Simegnew Y. , Shah Chiranjibi , Salisbury Joseph R. , Rademacher Kevin R. , Felts A. Paul , Wallace Farron TITLE=Estimating precision and accuracy of automated video post-processing: A step towards implementation of AI/ML for optics-based fish sampling JOURNAL=Frontiers in Marine Science VOLUME=10 YEAR=2023 URL=https://www.frontiersin.org/journals/marine-science/articles/10.3389/fmars.2023.1150651 DOI=10.3389/fmars.2023.1150651 ISSN=2296-7745 ABSTRACT=
Increased necessity to monitor vital fish habitat has resulted in proliferation of camera-based observation methods and advancements in camera and processing technology. Automated image analysis through computer vision algorithms has emerged as a tool for fisheries to address big data needs, reduce human intervention, lower costs, and improve timeliness. Models have been developed in this study with the goal to implement such automated image analysis for commercially important Gulf of Mexico fish species and habitats. Further, this study proposes adapting comparative otolith aging methods and metrics for gauging model performance by comparing automated counts to validation set counts in addition to traditional metrics used to gauge AI/ML model performance (such as mean average precision - mAP). To evaluate model performance we calculated percent of stations matching ground-truthed counts, ratios of false-positive/negative detections, and coefficient of variation (CV) for each species over a range of filtered outputs using model generated confidence thresholds (CTs) for each detected and classified fish. Model performance generally improved with increased annotations per species, and false-positive detections were greatly reduced with a second iteration of model training. For all species and model combinations, false-positives were easily identified and removed by increasing the CT to classify more restrictively. Issues with occluded fish images and reduced performance were most prevalent for schooling species, whereas for other species lack of training data was likely limiting. For 23 of the examined species, only 7 achieved a CV less than 25%. Thus, for most species, improvements to the training library will be needed and next steps will include a queried learning approach to bring balance to the models and focus during training. Importantly, for select species such as Red Snapper (