Newlin, Nancy R.; Kim, Michael E.; Kanakaraj, Praitayini; McMaster, Elyssa M.; Cho, Chloe; Gao, Chenyu; Hohman, Timothy J.; Beason-Held, Lori L.; Resnick, Susan M.; O’Bryant, Sid E.; Phillips, Nicole R.; Barber, Robert Clinton; Bennett, David Alan; Barnes, Lisa Laverne; Biber, Sarah A.; Johnson, Sterling C.; Archer, Derek B.; Li, Zhiyuan; Zuo, Lianrui; Moyer, Daniel C.; & Landman, Bennett Allan. (2025).听.听Journal of Medical Imaging,听12(6), 64001.听
Diffusion magnetic resonance imaging data collected across different studies often vary because of differences in scanners, software, and acquisition protocols, which can introduce unwanted technical effects that interfere with biological analysis. This is especially challenging in large multi site studies of Alzheimer鈥檚 disease, where both technical variation and true disease related changes are present. In this study, the authors developed a harmonization approach that learns low dimensional representations of brain structural connectivity that are insensitive to imaging site, scanner type, and acquisition settings, while still preserving information related to cognitive status. They used a conditional variational autoencoder, a type of deep learning model that compresses data into a latent space while controlling which information is retained or removed. The model was trained on diffusion imaging data from 6956 individuals across 9 cohorts and 38 imaging sites, totaling nearly 12,000 scans, including participants with normal cognition, mild cognitive impairment, and Alzheimer鈥檚 disease. The learned representations successfully removed statistically significant site related effects across 12 brain network connectivity measures and improved the accuracy of predicting cognitive diagnosis from 68 percent to 73 percent. The method showed consistent performance across multiple data configurations, demonstrating that representation learning can effectively reduce scanner related confounding while strengthening biologically meaningful signals in large neuroimaging studies.
Fig.听1
A total of 11,927 connectomes are pooled together across 9 cohorts and 35 unique imaging acquisitions (for a total of 38 鈥渟ites鈥). (a)聽If we naively combine all data and create聽t-SNE plots, there are obvious clusters driven by site. We compare cognitive diagnosis differences in one network connectivity measure, brain modularity, computed from these uncorrected connectomes. (b)聽Site-wise differences drive the detected differences and result in false findings. (c)聽In comparison, the聽t-SNE features of our learned representation have decreased site information, and the (d)聽resulting modularity values are harmonized and provide clear, significant (p<0.001) trends with cognitive diagnosis.
