AUTHOR=Olivetti Emanuele , Greiner Susanne , Avesani Paolo TITLE=ADHD diagnosis from multiple data sources with batch effects JOURNAL=Frontiers in Systems Neuroscience VOLUME=6 YEAR=2012 URL=https://www.frontiersin.org/journals/systems-neuroscience/articles/10.3389/fnsys.2012.00070 DOI=10.3389/fnsys.2012.00070 ISSN=1662-5137 ABSTRACT=
The Attention Deficit Hyperactivity Disorder (ADHD) affects the school-age population and has large social costs. The scientific community is still lacking a pathophysiological model of the disorder and there are no objective biomarkers to support the diagnosis. In 2011 the ADHD-200 Consortium provided a rich, heterogeneous neuroimaging dataset aimed at studying neural correlates of ADHD and to promote the development of systems for automated diagnosis. Concurrently a competition was set up with the goal of addressing the wide range of different types of data for the accurate prediction of the presence of ADHD. Phenotypic information, structural magnetic resonance imaging (MRI) scans and resting state fMRI recordings were provided for nearly 1000 typical and non-typical young individuals. Data were collected by eight different research centers in the consortium. This work is not concerned with the main task of the contest, i.e., achieving a high prediction accuracy on the competition dataset, but we rather address the proper handling of such a heterogeneous dataset when performing classification-based analysis. Our interest lies in the clustered structure of the data causing the so-called