AUTHOR=Chen Brian H. TITLE=Minimum standards for evaluating machine-learned models of high-dimensional data JOURNAL=Frontiers in Aging VOLUME=3 YEAR=2022 URL=https://www.frontiersin.org/journals/aging/articles/10.3389/fragi.2022.901841 DOI=10.3389/fragi.2022.901841 ISSN=2673-6217 ABSTRACT=
The maturation of machine learning and technologies that generate high dimensional data have led to the growth in the number of predictive models, such as the “epigenetic clock”. While powerful, machine learning algorithms run a high risk of overfitting, particularly when training data is limited, as is often the case with high-dimensional data (“large