AUTHOR=Li Runwei , MacDonald Gibson Jacqueline TITLE=Predicting the occurrence of short-chain PFAS in groundwater using machine-learned Bayesian networks JOURNAL=Frontiers in Environmental Science VOLUME=10 YEAR=2022 URL=https://www.frontiersin.org/journals/environmental-science/articles/10.3389/fenvs.2022.958784 DOI=10.3389/fenvs.2022.958784 ISSN=2296-665X ABSTRACT=

In the past two decades, global manufacturing of per- and polyfluoroalkyl substances (PFAS) has shifted from long-chain compounds to short-chain alternatives in response to evidence of the health hazards of long-chain formulations. However, accumulating data indicate that short-chain PFAS also pose health risks and are highly mobile and persistent in the environment. Because short-chain PFAS are relatively new chemicals, comprehensive knowledge needed to predict their environmental fate is lacking. This study evaluated the capacity of machine-learned Bayesian networks (BNs) to predict risks of exposure to short-chain PFAS in a Minnesota region affected by PFAS releases from the 3M Cottage Grove facility. Models were trained using long-term monitoring data provided by the Minnesota Department of Health (n = 12,406), which we coupled to a comprehensive dataset created by curating 88 other variables that describe potential PFAS sources, soil and hydrogeologic characteristics, and land use. Model performance was assessed using the area under the receiver-operating characteristic curve (AUC), a common measure of the accuracy of machine-learned classification algorithms. In addition, exposure risks were visualized spatially by coupling model predictions to a geographic information system. We found that machine-learned BN models had robust predictive performance, with AUCs above 0.96 in cross-validation. Significant risk factors identified by the BNs include distance to the 3M factory, distance to a former landfill, and areal extent of wetlands and developed land. We also found that risks of exposure to and the areal extent of perfluorosulfonic acids were greater than for perfluorocarboxylic acids with the same carbon number. The results suggest that machine-learned BNs could provide a promising screening tool for assessing short-chain PFAS exposure risks in groundwater.