Site effects how-to and when: An overview of retrospective techniques to accommodate site effects in multi-site neuroimaging analyses
Publication year
2022Source
Frontiers in Neurology, 13, (2022), article 923988ISSN
Publication type
Article / Letter to editor
Display more detailsDisplay less details
Organization
Cognitive Neuroscience
Journal title
Frontiers in Neurology
Volume
vol. 13
Subject
Radboudumc 7: Neurodevelopmental disorders DCMN: Donders Center for Medical Neuroscience; Cognitive Neuroscience - Radboud University Medical CenterAbstract
Site differences, or systematic differences in feature distributions across multiple data-acquisition sites, are a known source of heterogeneity that may adversely affect large-scale meta- and mega-analyses of independently collected neuroimaging data. They influence nearly all multi-site imaging modalities and biomarkers, and methods to compensate for them can improve reliability and generalizability in the analysis of genetics, omics, and clinical data. The origins of statistical site effects are complex and involve both technical differences (scanner vendor, head coil, acquisition parameters, imaging processing) and differences in sample characteristics (inclusion/exclusion criteria, sample size, ancestry) between sites. In an age of expanding international consortium research, there is a growing need to disentangle technical site effects from sample characteristics of interest. Numerous statistical and machine learning methods have been developed to control for, model, or attenuate site effects - yet to date, no comprehensive review has discussed the benefits and drawbacks of each for different use cases. Here, we provide an overview of the different existing statistical and machine learning methods developed to remove unwanted site effects from independently collected neuroimaging samples. We focus on linear mixed effect models, the ComBat technique and its variants, adjustments based on image quality metrics, normative modeling, and deep learning approaches such as generative adversarial networks. For each method, we outline the statistical foundation and summarize strengths and weaknesses, including their assumptions and conditions of use. We provide information on software availability and comment on the ease of use and the applicability of these methods to different types of data. We discuss validation and comparative reports, mention caveats and provide guidance on when to use each method, depending on context and specific research questions.
This item appears in the following Collection(s)
- Academic publications [244228]
- Electronic publications [131195]
- Faculty of Medical Sciences [92893]
- Open Access publications [105201]
Upload full text
Use your RU credentials (u/z-number and password) to log in with SURFconext to upload a file for processing by the repository team.