AUTHOR=Bayer Johanna M. M. , Thompson Paul M. , Ching Christopher R. K. , Liu Mengting , Chen Andrew , Panzenhagen Alana C. , Jahanshad Neda , Marquand Andre , Schmaal Lianne , Sämann Philipp G. 

TITLE=Site effects how-to and when: An overview of retrospective techniques to accommodate site effects in multi-site neuroimaging analyses

JOURNAL=Frontiers in Neurology

VOLUME=Volume 13 - 2022

YEAR=2022

URL=https://www.frontiersin.org/journals/neurology/articles/10.3389/fneur.2022.923988

DOI=10.3389/fneur.2022.923988

ISSN=1664-2295

ABSTRACT=<p>Site differences, or systematic differences in feature distributions across multiple data-acquisition sites, are a known source of heterogeneity that may adversely affect large-scale meta- and mega-analyses of independently collected neuroimaging data. They influence nearly all multi-site imaging modalities and biomarkers, and methods to compensate for them can improve reliability and generalizability in the analysis of genetics, omics, and clinical data. The origins of statistical site effects are complex and involve both <italic>technical</italic> differences (scanner vendor, head coil, acquisition parameters, imaging processing) and differences in <italic>sample characteristics</italic> (inclusion/exclusion criteria, sample size, ancestry) between sites. In an age of expanding international consortium research, there is a growing need to disentangle technical site effects from sample characteristics of interest. Numerous statistical and machine learning methods have been developed to control for, model, or attenuate site effects – yet to date, no comprehensive review has discussed the benefits and drawbacks of each for different use cases. Here, we provide an overview of the different existing statistical and machine learning methods developed to remove unwanted site effects from independently collected neuroimaging samples. We focus on linear mixed effect models, the ComBat technique and its variants, adjustments based on image quality metrics, normative modeling, and deep learning approaches such as generative adversarial networks. For each method, we outline the statistical foundation and summarize strengths and weaknesses, including their assumptions and conditions of use. We provide information on software availability and comment on the ease of use and the applicability of these methods to different types of data. We discuss validation and comparative reports, mention caveats and provide guidance on when to use each method, depending on context and specific research questions.</p>