AUTHOR=Fourrier Marine, Coppola Laurent, Claustre Hervé, D’Ortenzio Fabrizio, Sauzède Raphaëlle, Gattuso Jean-Pierre TITLE=A Regional Neural Network Approach to Estimate Water-Column Nutrient Concentrations and Carbonate System Variables in the Mediterranean Sea: CANYON-MED JOURNAL=Frontiers in Marine Science VOLUME=7 YEAR=2020 URL=https://www.frontiersin.org/journals/marine-science/articles/10.3389/fmars.2020.00620 DOI=10.3389/fmars.2020.00620 ISSN=2296-7745 ABSTRACT=

A regional neural network-based method, “CANYON-MED”, is developed to estimate nutrients and carbonate system variables over the water column, specifically in the Mediterranean Sea, from pressure, temperature, salinity, and oxygen together with geolocation and date of sampling. Six neural network ensembles were developed, one for each variable (i.e., three macronutrients: nitrates (NO₃⁻), phosphates (PO₄³⁻), and silicates (Si(OH)₄); and three carbonate system variables: pH on the total scale (pHT), total alkalinity (AT), and dissolved inorganic carbon or total carbon (CT)), and trained using a specific quality-controlled dataset of reference “bottle” data in the Mediterranean Sea. This dataset is representative of the peculiar conditions of this semi-enclosed sea, as opposed to the global ocean. For each variable, the neural networks were trained on 80% of the data chosen randomly and validated using the remaining 20%. CANYON-MED retrieved the variables with good accuracy (root mean squared error): 0.78 μmol kg⁻¹ for NO₃⁻, 0.043 μmol kg⁻¹ for PO₄³⁻, 0.71 μmol kg⁻¹ for Si(OH)₄, 0.014 units for pHT, 13 μmol kg⁻¹ for AT, and 12 μmol kg⁻¹ for CT. A second validation on the independent ANTARES time series confirmed the method’s applicability in the Mediterranean Sea. When compared with other existing methods for estimating nutrients and carbonate system variables, CANYON-MED stood out as the most robust when using the aforementioned inputs. Applying CANYON-MED to Mediterranean Sea data from autonomous observing systems (an integrated network of Biogeochemical-Argo floats, Eulerian moorings, and ocean gliders measuring hydrological properties together with oxygen concentration) could serve a wide range of purposes. These include data quality control and filling gaps in time series, as well as biogeochemical data assimilation and/or the initialization and validation of regional biogeochemical models that still lack crucial reference data.
Matlab and R code are available at https://github.com/MarineFou/CANYON-MED/.
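
The validation scheme described in the abstract (a random 80%/20% split, with accuracy reported as root mean squared error) can be illustrated with a minimal Python sketch. This is not the authors' Matlab or R code: the function names `split_80_20` and `rmse`, the fixed seed, and the rounding of the split point are assumptions made here for illustration only.

```python
import math
import random


def split_80_20(n_samples, seed=0):
    """Randomly split sample indices into 80% training and 20% validation.

    Hypothetical helper: the seed and the rounding rule are illustrative choices.
    """
    indices = list(range(n_samples))
    rng = random.Random(seed)   # local generator, so the split is reproducible
    rng.shuffle(indices)        # random order, so the selection is random
    cut = int(round(0.8 * n_samples))
    return indices[:cut], indices[cut:]


def rmse(predicted, observed):
    """Root mean squared error between two equal-length sequences."""
    if len(predicted) != len(observed) or len(predicted) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    squared_errors = [(p - o) ** 2 for p, o in zip(predicted, observed)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))
```

In such a setup, a model would be fitted on the samples indexed by the first list returned by `split_80_20`, and `rmse` would be evaluated between the model estimates and the reference bottle values for the samples indexed by the second list.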