Bootstrapped Sparse Canonical Correlation Analysis: Mining Stable Imaging and Genetic Associations With Implicit Structure Learning. Mining Stable Imaging and Genetic Associations With Implicit Structure Learning.

Jingwen Yan, Lei Du, Sungeun Kim, Shannon L. Risacher, Heng Huang, Mark Inlow, Jason H. Moore, Andrew J. Saykin, Li Shen

Research output: Chapter in Book/Report/Conference proceedingChapter


Sparse canonical correlation analysis (SCCA) based on lasso and structured lasso has been widely studied to explore the complex associations between brain imaging and genetics features. Although those based on lasso have a better control of overall sparsity, they capture only a small portion of signals because of competition within correlated features. Advanced structure-based models provide a partial solution, but final patterns mostly depend on the prior structures applied. In this work, we propose a new framework, bootstrapped sparse canonical correlation analysis (BoSCCA), to explore the stable associations between correlated imaging and genetic data sets and to implicitly reconstruct the hidden structures. We compare the performances of BoSCCA and traditional SCCA using both synthetic and real data. In synthetic data, BoSCCA outperforms traditional SCCA in both association identification and group structure extraction, especially when the signal proportion goes below 5%. In real data, BoSCCA better captures the group structure within regions of interest and linkage disequilibrium blocks among single-nucleotide polymorphisms and yielded more biologically meaningful results.

Original languageEnglish (US)
Title of host publicationImaging Genetics
PublisherElsevier Inc.
Number of pages17
ISBN (Electronic)9780128139691
ISBN (Print)9780128139684
StatePublished - Jan 1 2018



  • Bootstrap sampling
  • Feature selection
  • Imaging genetics
  • Selection stability
  • Sparse canonical correlation analysis
  • Structure learning

ASJC Scopus subject areas

  • Medicine (miscellaneous)

Cite this