AUTHOR=Roelfsema Chris M. , Lyons Mitchell , Murray Nicholas , Kovacs Eva M. , Kennedy Emma , Markey Kathryn , Borrego-Acevedo Rodney , Ordoñez Alvarez Alexandra , Say Chantel , Tudman Paul , Roe Meredith , Wolff Jeremy , Traganos Dimosthenis , Asner Gregory P. , Bambic Brianna , Free Brian , Fox Helen E. , Lieb Zoe , Phinn Stuart R. TITLE=Workflow for the Generation of Expert-Derived Training and Validation Data: A View to Global Scale Habitat Mapping JOURNAL=Frontiers in Marine Science VOLUME=8 YEAR=2021 URL=https://www.frontiersin.org/journals/marine-science/articles/10.3389/fmars.2021.643381 DOI=10.3389/fmars.2021.643381 ISSN=2296-7745 ABSTRACT=

Our ability to completely and repeatedly map natural environments at a global scale has increased significantly over the past decade. These advances stem from the delivery of a range of online global satellite image archives and global-scale processing capabilities, along with satellite imagery of improved spatial and temporal resolution. Accurately training and validating these global-scale mapping programs from what we will call “reference data sets” is challenging due to a lack of coordinated financial and personnel resourcing, and of standardized methods to collate reference data sets at global spatial extents. Here, we present an expert-driven approach for generating training and validation data on a global scale, with a view to mapping the world’s coral reefs. Global reefs were first stratified into approximate biogeographic regions; then, per region, reference data sets were compiled that included existing point data or maps at various levels of accuracy. These reference data sets were compiled from new field surveys, literature review of published surveys, and individually sourced contributions from coral reef monitoring and management agencies. Reference data were overlaid on high spatial resolution satellite image mosaics (3.7 m × 3.7 m pixels; Planet Dove) for each region. Additionally, thirty to forty satellite image tiles (20 km × 20 km) were selected for which reference data and/or expert knowledge was available and which covered a representative range of habitats. The satellite image tiles were segmented into interpretable groups of pixels, which were manually labeled with a mapping category via expert interpretation. The labeled segments were used to generate points to train the mapping models, and to validate or assess accuracy. The workflow for desktop reference data creation that we present expands and up-scales traditional approaches of expert-driven interpretation for both manual habitat mapping and map training/validation.
We apply the reference data creation methods in the context of global coral reef mapping, though our approach is broadly applicable to any environment. Transparent processes for training and validation are critical for usability, as big data provide more opportunities for managers and scientists to use global mapping products for science and conservation of vulnerable and rapidly changing ecosystems.
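
The step in which labeled segments are turned into training and validation points can be illustrated with a minimal sketch. This is not the authors' implementation: the segment structure, the function name `sample_reference_points`, the class names, the number of points per segment, and the 30% validation share are all assumptions made for illustration. The sketch also assumes that whole segments are held out for validation so that no segment contributes points to both sets; the abstract does not state how the split was made.

```python
import random
from collections import defaultdict


def sample_reference_points(segments, points_per_segment=3,
                            validation_fraction=0.3, seed=0):
    """Draw points from expert-labeled segments and split them by segment.

    segments: list of dicts with hypothetical keys
        "id" (str), "label" (str), "pixels" (list of (row, col) tuples).
    Returns (train, validation), each a list of (segment_id, label, pixel).
    """
    rng = random.Random(seed)

    # Group segments by mapping category so every class is split separately.
    by_label = defaultdict(list)
    for seg in segments:
        by_label[seg["label"]].append(seg)

    train, validation = [], []
    for label in sorted(by_label):
        segs = list(by_label[label])
        rng.shuffle(segs)

        # Hold out whole segments (assumption) so that training and
        # validation points never come from the same segment.
        if len(segs) > 1:
            n_val = int(round(len(segs) * validation_fraction))
            n_val = min(max(n_val, 1), len(segs) - 1)
        else:
            n_val = 0  # a single segment can only be used for training

        for i, seg in enumerate(segs):
            k = min(points_per_segment, len(seg["pixels"]))
            points = rng.sample(seg["pixels"], k)
            target = validation if i < n_val else train
            target.extend((seg["id"], label, p) for p in points)

    return train, validation


if __name__ == "__main__":
    # Toy input: two hypothetical classes, four segments each.
    demo = [
        {"id": f"{label}-{n}", "label": label,
         "pixels": [(n, c) for c in range(5)]}
        for label in ("coral/algae", "sand")
        for n in range(4)
    ]
    train_pts, val_pts = sample_reference_points(demo)
    print(len(train_pts), "training points;", len(val_pts), "validation points")
```

Splitting by segment rather than by point is one way to reduce the chance that neighboring pixels from the same segment inflate the accuracy estimate; other designs, such as holding out whole image tiles, would follow the same pattern.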