Pattern Discovery in Brain Imaging Genetics via SCCA Modeling with a Generic Non-convex Penalty

Du, Lei; Liu, Kefei; Yao, Xiaohui; Yan, Jingwen; Risacher, Shannon L.; Han, Junwei; Guo, Lei; Saykin, Andrew J.; Shen, Li

doi:10.1038/s41598-017-13930-y

Download PDF

Article
Open access
Published: 25 October 2017

Pattern Discovery in Brain Imaging Genetics via SCCA Modeling with a Generic Non-convex Penalty

Scientific Reports volume 7, Article number: 14052 (2017) Cite this article

2284 Accesses
9 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Brain imaging genetics intends to uncover associations between genetic markers and neuroimaging quantitative traits. Sparse canonical correlation analysis (SCCA) can discover bi-multivariate associations and select relevant features, and is becoming popular in imaging genetic studies. The L1-norm function is not only convex, but also singular at the origin, which is a necessary condition for sparsity. Thus most SCCA methods impose ${\ell }_{{\bf{1}}}$-norm onto the individual feature or the structure level of features to pursuit corresponding sparsity. However, the ${\ell }_{{\bf{1}}}$-norm penalty over-penalizes large coefficients and may incurs estimation bias. A number of non-convex penalties are proposed to reduce the estimation bias in regression tasks. But using them in SCCA remains largely unexplored. In this paper, we design a unified non-convex SCCA model, based on seven non-convex functions, for unbiased estimation and stable feature selection simultaneously. We also propose an efficient optimization algorithm. The proposed method obtains both higher correlation coefficients and better canonical loading patterns. Specifically, these SCCA methods with non-convex penalties discover a strong association between the APOE e4 rs429358 SNP and the hippocampus region of the brain. They both are Alzheimer’s disease related biomarkers, indicating the potential and power of the non-convex methods in brain imaging genetics.

Transcriptome-wide association analysis of brain structures yields insights into pleiotropy with complex neuropsychiatric traits

Article Open access 17 May 2021

Gene-SGAN: discovering disease subtypes with imaging and genetic signatures via multi-view weakly-supervised deep clustering

Article Open access 08 January 2024

An adaptive and robust method for multi-trait analysis of genome-wide association studies using summary statistics

Article 26 May 2023

Introduction

By identifying the associations between genetic factors and brain imaging measurements, brain imaging genetics intends to model and understand how genetic factors influence the structure or function of human brain^{1,2,3,4,5,6,7,8,9,10,11,12,13,14}. Both genetic biomarkers such as single nucleotide polymorphisms (SNPs), and brain imaging measurements such as imaging quantitative traits (QTs) are multivariate. To address this problem, bi-multivariate association models, such as multiple linear regression¹⁵, reduced rank regression^16,17,18, parallel independent component analysis¹⁹, partial least squares regression^20,21, canonical correlation analysis (CCA)²² and their sparsity-inducing variants²³, have been widely used to uncover the joint effect of multiple SNPs on one or multiple QTs. Among them, SCCA (Sparse CCA), which can discover bi-multivariate relationships and extract relevant features, is becoming popular in brain imaging genetics.

The CCA technique has been introduced for several decades²⁴. CCA can only perform well when the number of observations is larger than the combined feature number of the two views. Unfortunately, the problem usually is a large-p-small-n problem in the biomedical and biology studies. And it gets even worse because in CCA we are facing a large-(p + q)-small-n problem. In order to overcome this limitation, sparse CCA (SCCA)^{25,26,27,28,29,30,31,32,33,34,35,36} employs a sparsity inducing regularization term to select a small set of relevant features and has received increasing attention. The ${\ell }_{1}$-norm based SCCA method²⁵ has gained great success for its sparsity pursuing capability. After that, there are many SCCA variants based on the ${\ell }_{1}$-norm. For examples, the fused lasso penalty imposes the ${\ell }_{1}$-norm onto the ordered pairwise features²⁵, and the group lasso penalty imposes the ${\ell }_{1}$-norm onto the group of features^29,32. Further, the graph lasso or the graph guided lasso can be viewed as imposing the ${\ell }_{1}$-norm onto the pairwise features defined by an undirected graph²⁹.

However, the ${\ell }_{1}$-norm penalty shows the conflict of optimal prediction and consistent feature selection³⁷. In penalized least squares modeling, Fan and Li³⁸ showed that a good penalty function should meet three properties. First, the penalty function should be singular at the origin to produce sparse results. Second, it should produce continuous models for stable model selection, and third, the penalty function should not penalize large coefficients to avoid estimation bias. The ${\ell }_{1}$-norm penalty is successful in feature selection because it is singular at the origin. On the contrary, the ${\ell }_{1}$-norm penalty over-penalizes large coefficients, and thus it may be suboptimal with respect to the estimation risk^39,40. The ${\ell }_{0}$-norm function which only involves the number of nonzero features is an ideal sparsity-inducing penalty. However, it is neither convex nor continuous, and thus solving ${\ell }_{0}$-norm constrained problem is NP-hard⁴¹.

A number of non-convex penalties are proposed as the surrogate of the ${\ell }_{0}$-norm to handle this issue. These penalties includes the ${\ell }_{\gamma }$-norm (0 < γ < 1) penalty⁴², the Geman penalty⁴³, the Smoothly Clipped Absolute Deviation (SCAD) penalty³⁸, the Laplace penalty⁴⁴, the Minimax Concave Penalty (MCP)⁴⁵, the Exponential-Type Penalty (ETP)⁴⁶ and the Logarithm penalty⁴⁷. These non-convex functions have attractive theoretical properties for they all are singular at the origin and leave those larger coefficients unpenalized. Though they have gained great success in generalized linear models (GLMs), it is an unexplored topic to apply them to the SCCA models for achieving sparsity and unbiased prediction simultaneously.

Therefore, it is essential and of great interest to investigate performances of various SCCA models based on these non-convex penalties. A major challenge of non-convex function is the computational complexity. The local quadratic approximation (LQA) technique is introduced to solve the SCAD penalizing problem³⁸. LQA approximates the objective by a locally quadratic expression which can be solved like a ridge constrained problem. Inspired by this, in this paper, we propose a generic non-convex SCCA models with these non-convex penalties, and propose a unified optimization algorithm based on the LQA technique and the Alternate Convex Search (ACS) method⁴⁸. Using both synthetic data and real imaging genetic data, the experimental results show that with appropriate parameters, the non-convex SCCA methods have better performance on both canonical loading patterns and correlation coefficients estimation than the ${\ell }_{1}$-norm based SCCA methods.

Methods

Throughout this paper, scalars are denoted as italic letters, column vectors as boldface lowercase letters, and matrices as boldface capitals. The $\Vert {\bf{u}}\Vert $ denotes the Euclidean norm of a vector u.

Preliminaries

Sparse Canonical Correlation Analysis (SCCA)

Let ${\bf{X}}\in { {\mathcal R} }^{n\times p}$ be a matrix representing the SNP biomarkers data, where n is the number of participants and p is the number of SNPs. Let ${\bf{Y}}\in { {\mathcal R} }^{n\times q}$ be the QT data with q being the number of imaging measurements. A typical SCCA model is defined as

$$\mathop{{\rm{\min }}}\limits_{{\bf{u}},{\bf{v}}}-{{\bf{u}}}^{{\rm{{\rm T}}}}{{\bf{X}}}^{{\rm{{\rm T}}}}{\bf{Y}}{\bf{v}}$$

(1)

$$s\mathrm{.}t\mathrm{.\ \ }{\Vert {\bf{X}}{\bf{u}}\Vert }^{2}\le 1,{\Vert {\bf{Y}}{\bf{v}}\Vert }^{2}\le 1,{\rm{\Omega }}({\bf{u}})\le {c}_{1},{\rm{\Omega }}({\bf{v}})\le {c}_{2},$$

where Xu and Yv are the canonical variables, u and v are the corresponding canonical vectors we desire to estimate, and c ₁, c ₂ are the tuning parameters that control the sparsity level of the solution. The penalty function could be the ${\ell }_{1}$-norm penalty, or its variants such as the fused lasso, group lasso and graph lasso^{25,27,29,32,34}.

Non-convex Penalty Functions for SCCA

In this paper, we investigate seven non-convex surrogate penalties of ${\ell }_{0}$-norm in the SCCA model. They are singular at the origin, which is essential to achieve sparsity in the solution. And they do not overly penalize large coefficients. In order to facilitate a unified description, we denote the non-convex penalty as

$${{\rm{\Omega }}}_{{\rm{nc}}}({\bf{u}})=\sum _{i=1}^{p}{P}_{\lambda ,\gamma }(|{u}_{i}|),$$

(2)

where λ and γ are nonnegative parameters, and P _λ,γ(|u _i|) is a non-convex function. We absorb λ into the penalty because it cannot be decoupled from several penalties, such as the SCAD function³⁸. We here have seven penalties and they are described in Table 1 and visualized in Fig. 1, where for clarity we have dropped the subscript i in u _i. There is a sharp point at the origin for each of them, indicating that they are singular at the origin. This is essential to achieve sparseness in the solution. Besides, these curves are concave in |u _i| and monotonically decreasing on (−∞, 0], and monotonically increasing on [0, ∞). Therefore, though these penalties are not convex, they are piecewise continuously differentiable and their supergradients exist on both (−∞, 0] and [0, ∞)⁴⁹. Table 1 also shows their supergradients P′_λ,γ(|u _i|) with respect to |u _i|.

Table 1 The seven non-convex penalty functions and their supergradients.

Full size table

The Proposed Non-convex SCCA Model and Optimization Algorithm

Replacing the ${\ell }_{1}$-norm constraints in the SCCA model, we define the unified non-convex SCCA model as follows

$$\mathop{{\rm{\min }}}\limits_{{\bf{u}},{\bf{v}}}-{{\bf{u}}}^{{\rm{{\rm T}}}}{{\bf{X}}}^{{\rm{{\rm T}}}}{\bf{Y}}{\bf{v}}+{{\rm{\Omega }}}_{{\rm{nc}}}({\bf{u}})+{{\rm{\Omega }}}_{{\rm{nc}}}({\bf{v}})$$

(3)

$$s\mathrm{.}t\mathrm{.\ \ }{\Vert {\bf{X}}{\bf{u}}\Vert }^{2}\le \mathrm{1,}\,{\Vert {\bf{Y}}{\bf{v}}\Vert }^{2}\le \mathrm{1,}$$

where Ω_nc(u) and Ω_nc(v) can be any of the non-convex functions listed in Table 1.

To solve the non-convex SCCA problem, we use the Lagrangian method,

$$ {\mathcal L} {\boldsymbol{(}}{\bf{u}}{\boldsymbol{,}}{\bf{v}}{\boldsymbol{)}}=-{{\bf{u}}}^{{\rm{{\rm T}}}}{{\bf{X}}}^{{\rm{{\rm T}}}}{\bf{Y}}{\bf{v}}+{{\rm{\Omega }}}_{{\rm{nc}}}({\bf{u}})+{{\rm{\Omega }}}_{{\rm{nc}}}({\bf{v}})+\frac{{\alpha }_{1}}{2}({\Vert {\bf{X}}{\bf{u}}\Vert }^{2}-1)+\frac{{\alpha }_{2}}{2}({\Vert {\bf{Y}}{\bf{v}}\Vert }^{2}-1),$$

(4)

which is equivalent to

$$ {\mathcal L} {\boldsymbol{(}}{\bf{u}}{\boldsymbol{,}}{\bf{v}}{\boldsymbol{)}}=-{{\bf{u}}}^{{\rm{{\rm T}}}}{{\bf{X}}}^{{\rm{{\rm T}}}}{\bf{Y}}{\bf{v}}+{{\rm{\Omega }}}_{{\rm{nc}}}({\bf{u}})+{{\rm{\Omega }}}_{{\rm{nc}}}({\bf{v}})+\frac{{\alpha }_{1}}{2}{\Vert {\bf{X}}{\bf{u}}\Vert }^{2}+\frac{{\alpha }_{2}}{2}{\Vert {\bf{Y}}{\bf{v}}\Vert }^{2}$$

(5)

from the point of view of optimization. α ₁, α ₂, λ ₁, λ ₂ and γ are nonnegative tuning parameters. Next we will show how to solve this non-convex problem.

The first term −u ^Τ X ^Τ Yv on the right of equation (5) is biconvex in u and v. ${\Vert {\bf{X}}{\bf{u}}\Vert }^{2}$ is convex in u, and ${\Vert {\bf{Y}}{\bf{v}}\Vert }^{2}$ is convex in v. It remains to approximate both Ω_nc(u) and Ω_nc(v) and transform them into convex ones.

The local quadratic approximation (LQA) technique was introduced to quadratically expresses the SCAD penalty³⁸. Based on LQA, we here show how to represent these non-convex penalties in a unified way. First, we have the first-order Taylor expansion of ${P}_{{\lambda }_{1},\gamma }(\sqrt{\mu })$ at μ ₀ P _λ,γ((μ)^1/2) at μ ₀

$${P}_{\lambda ,\gamma }(\sqrt{\mu })\approx {P}_{\lambda ,\gamma }(\sqrt{{\mu }_{0}})+{P^{\prime} }_{\lambda ,\gamma }(\sqrt{{\mu }_{0}})\frac{1}{2\sqrt{{\mu }_{0}}}(\mu -{\mu }_{0}),$$

(6)

where μ ₀ and μ are neighbors, e.g., the estimates at two successive iterations during optimization. Substituting $\mu ={u}_{i}^{2}$ and ${\mu }_{0}={({u}_{i}^{t})}^{2}$ into (6), we have

$${P}_{\lambda ,\gamma }(|{u}_{i}|)\approx {P}_{\lambda ,\gamma }(|{u}_{i}^{t}|)+{P^{\prime} }_{\lambda ,\gamma }(|{u}_{i}^{t}|)\frac{1}{\mathrm{2|}{u}_{i}^{t}|}({u}_{i}^{2}-{({u}_{i}^{t})}^{2})$$

(7)

with ${P^{\prime} }_{\lambda ,\gamma }(|{u}_{i}^{t}|)$ being the supergradient of ${P}_{\lambda ,\gamma }(|{u}_{i}^{t}|)$ (as shown in Table 1) at $|{u}_{i}^{t}|$.

Then we obtain a quadratic approximation to Ω_nc(u):

$${{\rm{\Omega }}}_{{\rm{nc}}}({\bf{u}})=\sum _{i\mathrm{=1}}^{p}{P}_{\lambda ,\gamma }(|{u}_{i}|)\approx \sum _{i\mathrm{=1}}^{p}\frac{{P^{\prime} }_{\lambda ,\gamma }(|{u}_{i}^{t}|)}{\mathrm{2|}{u}_{i}^{t}|}{u}_{i}^{2}+{C}_{{\bf{u}}},$$

(8)

where

$${C}_{{\bf{u}}}=\sum _{i\mathrm{=1}}^{p}[{P}_{\lambda ,\gamma }(|{u}_{i}^{t}|)-\frac{1}{2}{P^{\prime} }_{\lambda ,\gamma }(|{u}_{i}^{t}|)|{u}_{i}^{t}|]$$

is not a function of u and thus will not contribute to the optimization.

In a similar way, we can construct a quadratic approximation to Ω_nc(v)

$${{\rm{\Omega }}}_{{\rm{nc}}}({\bf{v}})=\sum _{j\mathrm{=1}}^{q}{P}_{\lambda ,\gamma }(|{v}_{j}|)\approx \sum _{j\mathrm{=1}}^{q}\frac{{P^{\prime} }_{\lambda ,\gamma }(|{v}_{j}^{t}|)}{\mathrm{2|}{v}_{j}^{t}\mathrm{|}}{v}_{j}^{2}+{C}_{{\bf{v}}},$$

(9)

where

$${C}_{{\bf{v}}}=\sum _{j\mathrm{=1}}^{q}[{P}_{\lambda ,\gamma }(|{v}_{j}^{t}|)-\frac{1}{2}{P^{\prime} }_{\lambda ,\gamma }(|{v}_{j}^{t}|)|{v}_{j}^{t}|]$$

is not a function of v and makes no contribute towards the optimization.

Denote the estimates of u and v in the t-th iteration as u ^t and v ^t, respectively. To update the estimates of u and v in the (t + 1)-th iteration, we substitute the approximate functions of Ω_nc(u) and Ω_nc(v) in equations (8) and (9) into $ {\mathcal L} {\boldsymbol{(}}{\bf{u}}{\boldsymbol{,}}{\bf{v}}{\boldsymbol{)}}$ in 5, and solve the resultant approximate version of the original problem:

$$\begin{array}{rcl}{\rm{\arg }}\,{\rm{\min }}\, {\mathcal L} {\boldsymbol{(}}{\bf{u}}{\boldsymbol{,}}{\bf{v}}{\boldsymbol{)}} & = & {\rm{\arg }}\,{\rm{\min }}\,-{{\bf{u}}}^{{\rm{{\rm T}}}}{{\bf{X}}}^{{\rm{{\rm T}}}}{\bf{Y}}{\bf{v}}+\sum _{i\mathrm{=1}}^{p}\frac{{P^{\prime} }_{{\lambda }_{1},\gamma }(|{u}_{i}^{t}|)}{\mathrm{2|}{u}_{i}^{t}|}{u}_{i}^{2}\\ & & +\,\sum _{j\mathrm{=1}}^{q}\frac{{P^{\prime} }_{{\lambda }_{1},\gamma }(|{v}_{j}^{t}|)}{\mathrm{2|}{v}_{j}^{t}|}{v}_{j}^{2}+\frac{{\alpha }_{1}}{2}||{\bf{X}}{\bf{u}}{||}^{2}+\frac{{\alpha }_{2}}{2}||{\bf{Y}}{\bf{v}}{||}^{2}\end{array}$$

(10)

Obviously, the equation (10) is a quadratical expression, and is biconvex in u and v. This means it is convex in terms of u given v, and vice versa. Then according to the alternate convex search (ACS) method which is designed to solve biconvex problems⁴⁸, the (t + 1)-th estimation of u and v can be calculated via

$$\begin{array}{rcl}{{\bf{u}}}^{t+1} & = & {\rm{\arg }}\,\mathop{{\rm{\min }}}\limits_{{\bf{u}}}-{{\bf{u}}}^{{\rm{{\rm T}}}}{{\bf{X}}}^{{\rm{{\rm T}}}}{\bf{Y}}{{\bf{v}}}^{t}+\sum _{i\mathrm{=1}}^{p}\frac{{P^{\prime} }_{{\lambda }_{1},\gamma }(|{u}_{i}^{t}|)}{\mathrm{2|}{u}_{i}^{t}|}{u}_{i}^{2}+\frac{{\alpha }_{1}}{2}||{\bf{X}}{\bf{u}}{||}^{2},\\ {{\bf{v}}}^{t+1} & = & {\rm{\arg }}\,\mathop{{\rm{\min }}}\limits_{{\bf{v}}}-{({{\bf{u}}}^{t+1})}^{{\rm{{\rm T}}}}{{\bf{X}}}^{{\rm{{\rm T}}}}{\bf{Y}}{\bf{v}}+\sum _{j\mathrm{=1}}^{q}\frac{{P^{\prime} }_{{\lambda }_{2},\gamma }(|{v}_{j}^{t}|)}{\mathrm{2|}{v}_{j}^{t}|}{v}_{j}^{2}+\frac{{\alpha }_{2}}{2}||{\bf{Y}}{\bf{v}}{||}^{2}\mathrm{.}\end{array}$$

(11)

Both equations above are quadratic, and thus their closed-form solutions exist. Taking the partial derivative of $ {\mathcal L} {\boldsymbol{(}}{\bf{u}}{\boldsymbol{,}}{\bf{v}}{\boldsymbol{)}}$ in (5) with respect to u and v and setting the results to zero, we have

$${\bf{0}}\in -{{\bf{X}}}^{{\rm{{\rm T}}}}{\bf{Y}}{\bf{v}}+({{\bf{D}}}_{1}^{t}+{\alpha }_{1}{{\bf{X}}}^{{\rm{{\rm T}}}}{\bf{X}}){\bf{u}},$$

(12)

$${\bf{0}}\in -{{\bf{Y}}}^{{\rm{{\rm T}}}}{\bf{X}}{\bf{u}}+({{\bf{D}}}_{2}^{t}+{\alpha }_{2}{{\bf{Y}}}^{{\rm{{\rm T}}}}{\bf{Y}}){\bf{v}},$$

(13)

where ${{\bf{D}}}_{1}^{t}$ is a diagonal matrix with the i-th diagonal entry as $\frac{{P^{\prime} }_{{\lambda }_{1},\gamma }(|{u}_{i}^{t}|)}{|{u}_{i}^{t}|}$ (i∈[1, p]). It can be calculated by taking the partial derivative of equation (7) with respect to u _i. ${{\bf{D}}}_{2}^{t}$ is also a diagonal matrix with the j-th diagonal entry as $\frac{{P^{\prime} }_{{\lambda }_{1},\gamma }(|{v}_{j}^{t}|)}{|{v}_{j}^{t}|}$ (j∈[1, q]), and can be computed similarly. However, the i-th element of ${{\bf{D}}}_{1}^{t}$ does not exist if ${u}_{i}^{t}=0$. According to perturbed version of LQA⁵⁰, we address this by adding a slightly perturbed term. Then the i-th element of ${{\bf{D}}}_{1}^{t}$ is

$${{\bf{D}}}_{1}^{t}(i,i)=\frac{{P^{\prime} }_{{\lambda }_{1},\gamma }(|{u}_{i}|)}{|{u}_{i}|+\zeta }$$

(14)

where ζ is a tiny positive number. Hunter and Li⁵⁰ showed that this modification guarantees optimizing the equation (11). Then we have the updating expressions at the (t + 1)-th iteration

$${{\bf{u}}}^{t+1}=({{\bf{D}}}_{1}^{t}+{\alpha }_{1}{{\bf{X}}}^{{\rm{{\rm T}}}}{\bf{X}}{)}^{-1}{{\bf{X}}}^{{\rm{{\rm T}}}}{\bf{Y}}{{\bf{v}}}^{t},$$

(15)

$${{\bf{v}}}^{t+1}=({{\bf{D}}}_{2}^{t}+{\alpha }_{2}{{\bf{Y}}}^{{\rm{{\rm T}}}}{\bf{Y}}{)}^{-1}{{\bf{Y}}}^{{\rm{{\rm T}}}}{\bf{X}}{{\bf{u}}}^{t+1}\mathrm{.}$$

(16)

We alternate between the above two equations to graduate refine the estimates for u and v until convergence. The pseudo code of the non-convex SCCA algorithm is described in Algorithm 1.

Computational Analysis

In Algorithm 1, Step 3 and Step 6 are linear in the dimension of u and v, and are easy to compute. Step 4 and Step 7 are the critical steps of proposed algorithm. Since we have closed-form updating expressions, they can be calculated via solving a system of linear equations with quadratic complexity which avoids computing the matrix inverse with cubic complexity. Step 5 and 8 are the re-scale step and very easy to calculate. Therefore, the whole algorithm is efficient.

Data Availability

The synthetic data sets generated in this work are available from the corresponding authors’ web sites, http://www.escience.cn/people/dulei/code.html and http://www.iu.edu/ shenlab/tools/ncscca/. The real data set is publicly available in the Alzheimer’s Disease Neuroimaging Initiative (ADNI) database repository, http://adni.loni.usc.edu.

Experiments and Datasets

Data Description

Synthetic Dataset

There are four data sets with sparse true signals for both u and v, i.e., only a small subset of features are nonzero. The number of features of both u and v are larger than the observations to simulate a large-(p + q)-small-n task. The generating process is as follows. We first generate u and v with most feature being zero. After that, the latent variable z is constructed from Gaussian distribution N(0, I _{n × n}). Then we create the data X from ${{\bf{x}}}_{i}\sim N({z}_{i}{\bf{u}},{\sum }_{x})$ and data ${{\bf{y}}}_{i}\sim N({z}_{i}{\bf{v}},{\sum }_{y})$, where (∑_x)_jk = exp(−|u _j − u _k|) and (∑_y)_jk = exp(−|v _j − v _k|). The first three sets have 250 features for u and 600 ones for v, but they have different correlation coefficients. There are 500 features and 900 features in u and v respectively for the last data set. We show the true signal of every data set in Fig. 2 (top row).

Real Neuroimaging Genetics Dataset

Data used in the preparation of this article were obtained from the ADNI database (adni.loni.usc.edu). The ADNI was launched in 2003 as a public-private partnership by the National Institute on Aging (NIA), the National Institute of Biomedical Imaging and Bioengineering (NIBIB), the Food and Drug Administration (FDA) etc, led by Principal Investigator Michael W. Weiner, MD. The primary goal of ADNI has been to test whether serial magnetic resonance imaging (MRI), positron emission tomography (PET), other biological markers, and clinical and neuropsychological assessment can be combined to measure the progression of mild cognitive impairment (MCI) and early Alzheimer’s disease (AD). For up-to-date information, see www.adni-info.org. The study protocols were approved by the Institutional Review Boards of all participating centers (Northwestern Polytechnical University, Indiana University and ADNI (A complete list of ADNI sites is available at http://www.adni-info.org/)) and written informed consent was obtained from all participants or authorized representatives. All the analyses were performed on the de-identified ADNI data, and were determined by Indiana University Human Subjects Office as IU IRB Review Not Required.

The real neuroimaging genetics dataset were collected from 743 participants, and the details was presented in Table 2. There were 163 candidate SNP biomarkers from the AD-risk genes, e.g., APOE, in the genotyping data. The structural MRI scans were processed with voxel-based morphometry (VBM) in SPM8^51,52. Briefly, scans were aligned to a T1-weighted template image, segmented into gray matter (GM), white matter (WM) and cerebrospinal fluid (CSF) maps, normalized to MNI space, and smoothed with an 8mm FWHM kernel. We subsampled the whole brain and generated 465 voxels spanning the whole brain ROIs. The regression technique was employed to remove the effects of the baseline age, gender, education, and handedness for these VBM measures. The aim of this study is to evaluate the correlation between the SNPs and the VBM measures, and further identify which SNPs and ROIs are associated.

Table 2 Participant characteristics.

Full size table

Experimental Setup

Benchmarks

In this paper, we are mainly interested in whether these non-convex SCCA methods could enhance the performance of ${\ell }_{1}$-SCCA method based on our motivation. It is reasonable to employ the ${\ell }_{1}$-norm based methods in comparison. Therefore, the structure-aware SCCA methods such as^28,29,32,34 are not contained here as benchmark. Based on different mathematical techniques, there are three different ${\ell }_{1}$-SCCA algorithms. They are the singular value decomposition based method²⁵, the primal-dual based method²⁹ and the LQA based method³². Though the latter two are proposed for capturing group or network structure, they can be easily reformulated to the ${\ell }_{1}$-norm constrained methods, such as setting the parameters associated with the structure penalty to zero²⁹. Therefore, to make the comparison fair and convincing, we choose all of them as benchmarks. With a slight abuse of notation, we use the penalty name to refer a non-convex SCCA method, e.g. ETP for ETP based SCCA method. For the ${\ell }_{1}$-norm based methods, we call them L1-SCCA²⁵, L1-S2CCA³², and L1-NSCCA²⁹.

Parameter Tuning

There are four parameters λ _i(i = 1, 2) and α _i(i = 1, 2) associated with the non-convex SCCA methods, and one pivotal parameter γ. According to their equations, these non-convex penalties can approximate the ${\ell }_{0}$-norm by providing an appropriate γ. In this situation, the λ _i and α _i play a very weak role because theoretically the ${\ell }_{0}$-norm penalized problem does not rely on the parameters. Based on this consideration, we here only tune the γ other than tuning λ _i and α _i by a grid search strategy. This reduces the time consumption dramatically but does not affect the performance significantly. Further, we observe that two γ's perform similarly if they are not significantly different. Thus the tuning range of γ is not continuous. Besides, we set γ = 3.7 for SCAD penalty since³⁸ suggested that this is a very reasonable choice. The details of tuning range for each penalty are contained in Table 3. For λ _i and α _i, we simply set them to 1 in this study.

Table 3 The searching range of optimal γ for each non-convex penalty.

Full size table

Termination Criterion

We use ${{\rm{\max }}}_{i}|{u}_{i}^{t+1}-{u}_{i}^{t}|\le \varepsilon $ and ${{\rm{\max }}}_{j}|{v}_{j}^{t+1}-{v}_{j}^{t}|\le \varepsilon $ as the termination condition for Algorithm 1, where ε is the user defined error bound. In this study, we set ε = 10⁻⁵ according to experiments. All methods use the same setup, i.e., the same partition of the five-fold cross-validation, running on the same platform.

Results on Synthetic Data

Figure 2 shows the heat maps of canonical loadings estimated from all SCCA methods, where each row corresponds to an experimental method. We clearly observe that the non-convex SCCA methods and L1-SCCA correctly identify the identical signal positions to the ground truth across four data sets. Besides true signals, L1-SCCA introduces several undesired signals which makes it be inferior to our methods. As a contrast, L1-NSCCA finds out an incomplete proportion of the ground truth, and L1-S2CCA performs unstably as it fails on some folds. Moreover, we also prioritize these methods using the AUC (area under ROC) criterion in Table 4, where a higher value indicates a better performance. The results exhibit that the non-convex SCCA methods have the highest score at almost every case. L1-SCCA scores similarly to the proposed methods, but later we can see it pays the price at a reduced prediction ability. Table 5 presents the estimated correlation coefficients on both training and testing data, where the best values are shown in boldface. The proposed SCCA methods alternatively gain the best value, and the Log method wins out for the most times. This demonstrates that the non-convex methods outperform ${\ell }_{1}$-norm based SCCA methods in terms of the prediction power. In summary, the proposed methods identify accurate and sparse canonical loading patterns and obtain high correlation coefficients simultaneously, while those ${\ell }_{1}$-norm based SCCA methods cannot.

Table 4 Performance comparison on synthetic data sets. The AUC (area under the curve) values (mean ± std) of estimated canonical loadings u and v.

Full size table

Table 5 Training and testing correlation coefficients (mean ± std) of 5-fold cross-validation synthetic data sets. The best values are shown in boldface.

Full size table

Results on Real Neuroimaging Genetics Data

In this real data study, the genotyping data is denoted by X, and the imaging data is denoted by Y. The u is a vector of weights of all SNPs, and v is a vector of weights of all imaging markers.The canonical correlation coefficients are defined as Pearson correlation coefficient between Xu and Yv, i.e., ${({\bf{X}}{\bf{u}})}^{{\rm{{\rm T}}}}{\bf{Y}}{\bf{v}}/(\Vert {\bf{X}}{\bf{u}}\Vert \Vert {\bf{Y}}{\bf{v}}\Vert )$.

Figure 3 presents the heat maps regrading the canonical loadings generated from the training set. In this figure, each row shows two weights of a SCCA method, where a larger weight stands for a more importance. The weight associated with the SNPs is on the left panel, and that associated with the voxels is on the right. The proposed non-convex SCCA methods obtain very clean and sparse weights for both u and v. The largest signal on the genetic side is the APOE e4 SNP rs429358, which has been previously reported to be related to AD⁵³. On the right panel, the largest signal for all SCCA methods comes from the hippocampus region. This is one of the most notable biomarkers as an indicator of AD, since atrophy of hippocampus has been shown to be related to brain atrophy and neuron loss measured with MRI in AD cohort⁵³. In addition, the L1-S2CCA and SCAD methods identify a weak signal from the parahippocampal gyrus, which is previously reported as an early biomarker of AD⁵⁴. On some folds, the Log method also finds out the lingual region, parahippocampal gyrus, vermis region. Interestingly, all the three regions have shown to be correlated to AD, and could be further considered as an indicating biomarker that can be observed prior to a dementia diagnosis. For example, Sjöbeck and Englund reported that molecular layer gliosis and atrophy in the vermis are clearly severer in AD patients than in the health controls⁵⁵. This is meaningful since the non-convex SCCA methods identify the correct clue for further investigation. On this account, both L1-SCCA and L1-NSCCA are not good choices since they identify too many signals, which may misguide subsequent investigation. The figure shows that L1-S2CCA could be an alternative choice for sparse imaging genetics analysis, but it performs unstably across the five folds. And, the non-convex methods is more consistent and stable than those ${\ell }_{1}$-SCCA methods. To show the results more clearly, we map the canonical weights (averaged across 5 folds) regarding the imaging measurements from each SCCA method onto the brain in Fig. 4. The figure confirms that the L1-SCCA and L1-NSCCA find out many signals that are not sparse. The L1-S2CCA identifies fewer signals than both L1-SCCA and L1-NSCCA, but more than all these non-convex SCCA methods. All the non-convex SCCA only highlights a small region of the whole brain. This again reveals that the proposed methods have better canonical weights which reduces the effort of further investigation.

Besides, we include both training and testing correlation coefficients in Table 6, where their mean and standard deviation are shown. The training results of all methods are similar, with the Log method gains the highest value of 0.33 ± 0.03. As for the testing results, which is our primary interest, all the non-convex SCCA methods obtain better values than these ${\ell }_{1}$-SCCA methods. Besides, the difference between the training and testing performance of the proposed methods is much smaller than that of three ${\ell }_{1}$-SCCA methods. This means that the non-convex methods have better generalization performance as they are less likely to fall into overfitting issue. The result of this real imaging genetics data reveals that the proposed SCCA methods can extract more accurate and sparser canonical weights for both genetic and imaging biomarkers, and obtain higher correlation coefficients than those ${\ell }_{1}$-SCCA methods.

Table 6 Performance comparison on real data set. Training and testing correlation coefficients (mean ± std) of 5-fold cross-validation are shown. The best value is shown in boldface.

Full size table

Conclusion

We have proposed a unified non-convex SCCA model and an efficient optimization algorithm using a family of non-convex penalty functions. These penalties are concave and piecewise continuous, and thus piecewise differentiable. We approximate these non-convex penalties by an ${\ell }_{2}$ function via the local quadratic approximation (LQA)³⁸. Therefore, the proposed algorithm is effective and runs fast.

We compare the non-convex methods with three state-of-the-art ${\ell }_{1}$-SCCA methods using both simulation data and real imaging genetics data. The simulation data have different ground truth structures. The results on the simulation data show that the non-convex SCCA methods identify cleaner and better canonical loadings than the three ${\ell }_{1}$-SCCA methods, i.e. L1-SCCA²⁵, L1-S2CCA³², and L1-NSCCA²⁹. These non-convex methods also recover higher correlation coefficients than ${\ell }_{1}$-SCCA methods, demonstrating that ${\ell }_{1}$-SCCA methods have suboptimal prediction capability as they may over penalize large coefficients. The results on the real data show that the proposed methods discover a pair of meaningful genetic and brain imaging biomarkers, while the ${\ell }_{1}$-SCCA methods return too many irrelevant signals. The correlation coefficients show that the non-convex SCCA methods hold better testing values. This verifies our motivation that the non-convex penalty can improve the prediction ability, and thus has better generalization capability. Obviously, the parameter γ plays a key role in these non-convex penalties. In the future work, we will investigate how to choose a reasonable γ; and explore how to incorporate structure information into the model as structure information extraction is an important task for brain imaging genetics as well as biology studies.

References

Hibar, D. P., Kohannim, O., Stein, J. L., Chiang, M.-C. & Thompson, P. M. Multilocus genetic analysis of brain images. Frontiers in Genetics 2, 73 (2011).
Article PubMed PubMed Central Google Scholar
Hariri, A. R., Drabant, E. M. & Weinberger, D. R. Imaging genetics: perspectives from studies of genetically driven variation in serotonin function and corticolimbic affective processing. Biological psychiatry 59, 888–897 (2006).
Article CAS PubMed Google Scholar
Viding, E., Williamson, D. E. & Hariri, A. R. Developmental imaging genetics: challenges and promises for translational research. Development and Psychopathology 18, 877–892 (2006).
Article PubMed Google Scholar
Mattay, V. S., Goldberg, T. E., Sambataro, F. & Weinberger, D. R. Neurobiology of cognitive aging: insights from imaging genetics. Biological psychology 79, 9–22 (2008).
Article PubMed PubMed Central Google Scholar
Bigos, K. L. & Weinberger, D. R. Imaging genetics - days of future past. Neuroimage 53, 804–809 (2010).
Article CAS PubMed Google Scholar
Scharinger, C., Rabl, U., Sitte, H. H. & Pezawas, L. Imaging genetics of mood disorders. Neuroimage 53, 810–821 (2010).
Article CAS PubMed PubMed Central Google Scholar
Potkin, S. G. et al. Genome-wide strategies for discovering genetic influences on cognition and cognitive disorders: methodological considerations. Cognitive neuropsychiatry 14, 391–418 (2009).
Article PubMed PubMed Central Google Scholar
Kim, S. et al. Influence of genetic variation on plasma protein levels in older adults using a multi-analyte panel. PLoS One 8, e70269 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Shen, L. et al. Whole genome association study of brain-wide imaging phenotypes for identifying quantitative trait loci in MCI and AD: A study of the ADNI cohort. Neuroimage 53, 1051–63 (2010).
Article CAS PubMed PubMed Central Google Scholar
Winkler, A. M. et al. Cortical thickness or grey matter volume? the importance of selecting the phenotype for imaging genetics studies. Neuroimage 53, 1135–1146 (2010).
Article PubMed Google Scholar
Meda, S. A. et al. A large scale multivariate parallel ica method reveals novel imaging–genetic relationships for alzheimer’s disease in the adni cohort. Neuroimage 60, 1608–1621 (2012).
Article PubMed PubMed Central Google Scholar
Nho, K. et al. Whole-exome sequencing and imaging genetics identify functional variants for rate of change in hippocampal volume in mild cognitive impairment. Molecular psychiatry 18, 781 (2013).
Article CAS PubMed PubMed Central Google Scholar
Shen, L. et al. Genetic analysis of quantitative phenotypes in AD and MCI: imaging, cognition and biomarkers. Brain imaging and behavior 8, 183–207 (2014).
Article CAS PubMed Google Scholar
Saykin, A. J. et al. Genetic studies of quantitative MCI and AD phenotypes in ADNI: Progress, opportunities, and plans. Alzheimer’s & Dementia 11, 792–814 (2015).
Article Google Scholar
Wang, H. et al. Identifying quantitative trait loci via group-sparse multitask regression and feature selection: an imaging genetics study of the ADNI cohort. Bioinformatics 28, 229–237 (2012).
Article PubMed CAS Google Scholar
Vounou, M., Nichols, T. E. & Montana, G. Discovering genetic associations with high-dimensional neuroimaging phenotypes: A sparse reduced-rank regression approach. NeuroImage 53, 1147–59 (2010).
Article PubMed PubMed Central Google Scholar
Vounou, M. et al. Sparse reduced-rank regression detects genetic associations with voxel-wise longitudinal phenotypes in alzheimer’s disease. Neuroimage 60, 700–716 (2012).
Article PubMed Google Scholar
Zhu, X., Suk, H.-I., Huang, H. & Shen, D. Structured sparse low-rank regression model for brain-wide and genome-wide associations. In International Conference on Medical Image Computing and Computer-Assisted Intervention, 344–352 (Springer, 2016).
Liu, J. et al. Combining fmri and snp data to investigate connections between brain function and genetics using parallel ica. Human brain mapping 30, 241–255 (2009).
Article ADS PubMed PubMed Central Google Scholar
Geladi, P. & Kowalski, B. R. Partial least-squares regression: a tutorial. Analytica chimica acta 185, 1–17 (1986).
Article CAS Google Scholar
Grellmann, C. et al. Comparison of variants of canonical correlation analysis and partial least squares for combined analysis of mri and genetic data. NeuroImage 107, 289–310 (2015).
Article PubMed Google Scholar
Hardoon, D., Szedmak, S. & Shawe-Taylor, J. Canonical correlation analysis: An overview with application to learning methods. Neural Computation 16, 2639–2664 (2004).
Article PubMed MATH Google Scholar
Hardoon, D. R. & Shawe-Taylor, J. Sparse canonical correlation analysis. Machine Learning 83, 331–353 (2011).
Article MATH MathSciNet Google Scholar
Hotelling, H. Relations between two sets of variates. Biometrika 28, 321–377 (1936).
Article MATH Google Scholar
Witten, D. M., Tibshirani, R. & Hastie, T. A penalized matrix decomposition, with applications to sparse principal components and canonical correlation analysis. Biostatistics 10, 515–34 (2009).
Article PubMed PubMed Central Google Scholar
Witten, D. M. & Tibshirani, R. J. Extensions of sparse canonical correlation analysis with applications to genomic data. Statistical applications in genetics and molecular biology 8, 1–27 (2009).
Article MATH CAS MathSciNet Google Scholar
Parkhomenko, E., Tritchler, D. & Beyene, J. Sparse canonical correlation analysis with application to genomic data integration. Statistical Applications in Genetics and Molecular Biology 8, 1–34 (2009).
Article MATH MathSciNet Google Scholar
Chen, X., Liu, H. & Carbonell, J. G. Structured sparse canonical correlation analysis. In International Conference on Artificial Intelligence and Statistics, 199–207 (2012).
Chen, X. & Liu, H. An efficient optimization algorithm for structured sparse cca, with applications to EQTL mapping. Statistics in Biosciences 4, 3–26 (2012).
Article Google Scholar
Chen, J. & Bushman, F. D. et al. Structure-constrained sparse canonical correlation analysis with an application to microbiome data analysis. Biostatistics 14, 244–258 (2013).
Article PubMed Google Scholar
Lin, D., Calhoun, V. D. & Wang, Y.-P. Correspondence between fMRI and SNP data by group sparse canonical correlation analysis. Medical image analysis 18, 891–902 (2014).
Article PubMed Google Scholar
Du, L. et al. A novel structure-aware sparse learning algorithm for brain imaging genetics. In International Conference on Medical Image Computing and Computer Assisted Intervention, 329–336 (2014).
Yan, J. et al. Transcriptome-guided amyloid imaging genetic analysis via a novel structured sparse learning algorithm. Bioinformatics 30, i564–i571 (2014).
Article CAS PubMed PubMed Central Google Scholar
Du, L. et al. Structured sparse canonical correlation analysis for brain imaging genetics: An improved graphnet method. Bioinformatics 32, 1544–1551 (2016).
Article CAS PubMed PubMed Central Google Scholar
Du, L. et al. Sparse canonical correlation analysis via truncated l ₁-norm-norm with application to brain imaging genetics. In IEEE International Conference on Bioinformatics and Biomedicine, 707–711 (IEEE, 2016).
Du, L. et al. Identifying associations between brain imaging phenotypes and genetic factors via a novel structured scca approach. In International Conference on Information Processing in Medical Imaging, 543–555 (Springer, 2017).
Meinshausen, N. & Bühlmann, P. High-dimensional graphs and variable selection with the lasso. The annals of statistics 1436–1462 (2006).
Fan, J. & Li, R. Variable selection via nonconcave penalized likelihood and its oracle properties. Journal of the American Statistical Association 96, 1348–1360 (2001).
Article MATH MathSciNet Google Scholar
Zou, H. The adaptive lasso and its oracle properties. Journal of the American Statistical Association 101, 1418–1429 (2006).
Article CAS MATH MathSciNet Google Scholar
Shen, X., Pan, W. & Zhu, Y. Likelihood-based selection and sharp parameter estimation. Journal of the American Statistical Association 107, 223–232 (2012).
Article PubMed PubMed Central MATH MathSciNet Google Scholar
Fung, G. & Mangasarian, O. Equivalence of minimal l ₀-and l _p-norm solutions of linear equalities, inequalities and linear programs for sufficiently small p. Journal of optimization theory and applications 151, 1–10 (2011).
Article MATH MathSciNet Google Scholar
Frank, L. E. & Friedman, J. H. A statistical view of some chemometrics regression tools. Technometrics 35, 109–135 (1993).
Article MATH Google Scholar
Geman, D. & Yang, C. Nonlinear image recovery with half-quadratic regularization. IEEE Transactions on Image Processing 4, 932–946 (1995).
Article ADS CAS PubMed Google Scholar
Trzasko, J. & Manduca, A. Highly undersampled magnetic resonance image reconstruction via homotopic l ₁-minimization. IEEE Transactions on Medical imaging 28, 106–121 (2009).
Article PubMed Google Scholar
Zhang, C. Nearly unbiased variable selection under minimax concave penalty. Annals of Statistics 38, 894–942 (2010).
Article MATH MathSciNet Google Scholar
Gao, C., Wang, N., Yu, Q. & Zhang, Z. A feasible nonconvex relaxation approach to feature selection. In AAAI, 356–361 (2011).
Friedman, J. H. Fast sparse regression and classification. International Journal of Forecasting 28, 722–738 (2012).
Article Google Scholar
Gorski, J., Pfeuffer, F. & Klamroth, K. Biconvex sets and optimization with biconvex functions: a survey and extensions. Mathematical Methods of Operations Research 66, 373–407 (2007).
Article MATH MathSciNet Google Scholar
Lu, C., Tang, J., Yan, S. & Lin, Z. Generalized nonconvex nonsmooth low-rank minimization. In IEEE Conference on Computer Vision and Pattern Recognition, 4130–4137 (2014).
Hunter, D. R. & Li, R. Variable selection using mm algorithms. Annals of statistics 33, 1617 (2005).
Article PubMed PubMed Central MATH MathSciNet Google Scholar
Ashburner, J. & Friston, K. J. Voxel-based morphometry–the methods. Neuroimage 11, 805–21 (2000).
Article CAS PubMed Google Scholar
Risacher, S. L. & Saykin, A. J. et al. Baseline MRI predictors of conversion from MCI to probable AD in the ADNI cohort. Current Alzheimer Research 6, 347–61 (2009).
Article CAS PubMed PubMed Central Google Scholar
Hampel, H. et al. Core candidate neurochemical and imaging biomarkers of alzheimer’s disease. Alzheimer’s & Dementia 4, 38–48 (2008).
Article CAS Google Scholar
Echavarri, C. et al. Atrophy in the parahippocampal gyrus as an early biomarker of alzheimer’s disease. Brain Structure and Function 215, 265–271 (2011).
Article CAS PubMed Google Scholar
Sjöbeck, M. & Englund, E. Alzheimer’s disease and the cerebellum: a morphologic study on neuronal and glial changes. Dementia and geriatric cognitive disorders 12, 211–218 (2001).
Article PubMed Google Scholar

Download references

Acknowledgements

Data collection and sharing for this project was funded by the Alzheimer’s Disease Neuroimaging Initiative (ADNI) (National Institutes of Health Grant U01 AG024904) and DOD ADNI (Department of Defense award number W81XWH-12-2-0012). ADNI is funded by the National Institute on Aging, the National Institute of Biomedical Imaging and Bioengineering, and through generous contributions from the following: AbbVie, Alzheimer’s Association; Alzheimer’s Drug Discovery Foundation; Araclon Biotech; BioClinica, Inc.; Biogen; Bristol-Myers Squibb Company; CereSpir, Inc.; Cogstate; Eisai Inc.; Elan Pharmaceuticals, Inc.; Eli Lilly and Company; EuroImmun; F. Hoffmann-La Roche Ltd and its affiliated company Genentech, Inc.; Fujirebio; GE Healthcare; IXICO Ltd.; Janssen Alzheimer Immunotherapy Research & Development, LLC.; Johnson & Johnson Pharmaceutical Research & Development LLC.; Lumosity; Lundbeck; Merck & Co., Inc.; Meso Scale Diagnostics, LLC.; NeuroRx Research; Neurotrack Technologies; Novartis Pharmaceuticals Corporation; Pfizer Inc.; Piramal Imaging; Servier; Takeda Pharmaceutical Company; and Transition Therapeutics. The Canadian Institutes of Health Research is providing funds to support ADNI clinical sites in Canada. Private sector contributions are facilitated by the Foundation for the National Institutes of Health (www.fnih.org). The grantee organization is the Northern California Institute for Research and Education, and the study is coordinated by the Alzheimer’s Therapeutic Research Institute at the University of Southern California. ADNI data are disseminated by the Laboratory for Neuro Imaging at the University of Southern California. L. Du was supported by the National Natural Science Foundation of China (61602384); the Natural Science Basic Research Plan in Shaanxi Province of China (2017JQ6001); the China Postdoctoral Science Foundation (2017M613202); and the Fundamental Research Funds for the Central Universities (3102016OQD0065) at Northwestern Polytechnical University. This work was also supported by the National Institutes of Health R01 EB022574, R01 LM011360, U01 AG024904, P30 AG10133, R01 AG19771, UL1 TR001108, R01 AG 042437, R01 AG046171, R01 AG040770; the Department of Defense W81XWH-14-2-0151, W81XWH-13-1-0259, W81XWH-12-2-0012; the National Collegiate Athletic Association 14132004 at Indiana University.

Author information

Authors and Affiliations

School of Automation, Northwestern Polytechnical University, Xi’an, 710072, China
Lei Du, Junwei Han & Lei Guo
Radiology and Imaging Sciences, Indiana University School of Medicine, Indianapolis, IN, 46202, USA
Kefei Liu, Xiaohui Yao, Jingwen Yan, Shannon L. Risacher, Andrew J. Saykin, Li Shen, Tatiana M. Foroud, Kelley Faber, Sungeun Kim, Kwangsik Nho, Martin R. Farlow, Ann Marie Hake, Brandy R. Matthews, Jared R. Brosch & Scott Herring
University of California, San Francisco, USA
Michael W. Weiner, Norbert Schuff, Howard J. Rosen, Bruce L. Miller & David Perry
University of Southern California, Los Angeles, USA
Paul Aisen, Arthur W. Toga, Karen Crawford, Scott Neu, Lon S. Schneider, Sonia Pawluczyk, Mauricio Beccera, Liberty Teodoro & Bryan M. Spann
Mayo Clinic, Rochester, Minnesota, USA
Ronald Petersen, Clifford R. Jack, Matthew Bernstein, Bret Borowski, Jeff Gunter, Matt Senjem, Prashanthi Vemuri, David Jones, Kejal Kantarci, Chad Ward, Sara S. Mason, Colleen S. Albers, David Knopman & Kris Johnson
University of California, Berkeley, Berkeley, USA
William Jagust & Susan Landau
University of Pennsylvania, Philadelphia, USA
John Q. Trojanowki, Leslie M. Shaw, Virginia Lee, Magdalena Korecka, Michal Figurski, Steven E. Arnold, Jason H. Karlawish, David A. Wolk & Christopher M. Clark
University of California, Davis, Davis, USA
Laurel Beckett, Danielle Harvey, Charles DeCArli, Evan Fletcher, Pauline Maillard, John Olichney, Charles DeCarli & Owen Carmichael
Brigham and Women’s Hospital/Harvard Medical School, Boston, USA
Robert C. Green, Reisa A. Sperling, Keith A. Johnson & Gad Marshall
Washington University St. Louis, St. Louis, USA
John Morris, Marc Raichle, David Holtzman, Nigel J. Cairns, Erin Franklin, Lisa Taylor-Reinwald, Beau Ances, Maria Carroll, Mary L. Creech, Erin Franklin, Mark A. Mintun, Stacy Schneider & Angela Oliver
Prevent Alzheimer’s Disease, 2020, Rockville, USA
Zaven Khachaturian
Siemens, Munich, Germany
Greg Sorensen
Alzheimer’s Association, Illinois, USA
Maria Carrillo
University of Pittsburgh, Pennsylvania, USA
Lew Kuller, Chet Mathis, Oscar L. Lopez, Mary Ann Oakley & Donna M. Simpson
Cornell University, New York, USA
Steven Paul, Norman Relkin, Gloria Chaing, Michael Lin & Lisa Ravdin
Albert Einstein College of Medicine of Yeshiva University, New York, USA
Peter Davies
AD Drug Discovery Foundation, New York, USA
Howard Fillit
Acumen Pharmaceuticals, California, USA
Franz Hefti
Northwestern University, Illinois, USA
M. Marcel Mesulam, Emily Rogalski, Kristine Lipowski, Sandra Weintraub, Borna Bonakdarpour, Diana Kerwin, Chuang-Kuo Wu & Nancy Johnson
National Institute of Mental Health, Maryland, USA
William Potter
Brown University, Rhode Island, USA
Peter Snyder
Eli Lilly, Indiana, USA
Adam Schwartz
University of Washington, Washington, USA
Tom Montine
University of California, San Diego, California, USA
Ronald G. Thomas, Michael Donohue, Sarah Walter, Devon Gessert, Tamie Sather, Gus Jiminez, Archana B. Balasubramanian, Jennifer Mason, Iris Sim, Leon Thal, James Brewer, Helen Vanderswag & Adam Fleisher
University of London, London, UK
Nick Fox
University of California, Los Angeles, California, USA
Paul Thompson, Liana Apostolova, Kathleen Tingus, Ellen Woo, Daniel H. S. Silverman, Po H. Lu & George Bartzokis
University of Michigan, Michigan, USA
Robert A. Koeppe, Judith L. Heidebrink & Joanne L. Lord
University of Utah, Utah, USA
Norm Foster
Banner Alzheimer’s Institute, Arizona, USA
Eric M. Reiman, Kewei Chen, Pierre Tariot, Anna Burke, Ann Marie Milliken, Nadira Trncic, Adam Fleisher & Stephanie Reeder
University of California, Irvine, California, USA
Steven Potkin, Adrian Preda & Dana Nguyen
National Institute on Aging, Maryland, USA
Neil Buckholtz & John Hsiao
Johns Hopkins University, Maryland, USA
Marilyn Albert, Chiadi Onyike, Daniel D’Agostino & Stephanie Kielb
Richard Frank Consulting, New Hampshire, USA
Richard Frank
Oregon Health and Science University, Oregon, USA
Jeffrey Kaye, Joseph Quinn, Lisa Silbert, Betty Lind, Raina Carter & Sara Dolen
Baylor College of Medicine, Texas, USA
Rachelle S. Doody, Javier Villanueva-Meyer, Valory Pavlik, Victoria Shibley, Munir Chowdhury, Susan Rountree & Mimi Dang
Columbia University Medical Center, New York, USA
Yaakov Stern, Lawrence S. Honig & Karen L. Bell
University of Alabama-Birmingham, Alabama, USA
Daniel Marson, David Geldmacher, Marissa Natelson Love, Randall Griffith, David Clark, John Brockington & Erik Roberson
Mount Sinai School of Medicine, New York, USA
Hillel Grossman & Effie Mitsis
Rush University Medical Center, Rush University, Illinois, USA
Raj C. Shah & Leyla deToledo-Morrell
Wien Center, Florida, USA
Ranjan Duara, Maria T. Greig-Custo & Warren Barker
New York University, New York, USA
Martin Sadowski, Mohammed O. Sheikh, Anaztasia Ulysse & Mrunalini Gaikwad
Duke University Medical Center, North Carolina, USA
P. Murali Doraiswamy, Jeffrey R. Petrella, Salvador Borges-Neto, Terence Z. Wong & Edward Coleman
University of Kentucky, Kentucky, USA
Charles D. Smith, Greg Jicha, Peter Hardy, Partha Sinha, Elizabeth Oates & Gary Conrad
University of Rochester Medical Center, New York, USA
Anton P. Porsteinsson, Bonnie S. Goldstein, Kim Martin, Kelly M. Makino, M. Saleem Ismail & Connie Brand
University of Texas Southwestern Medical School, Texas, USA
Kyle Womack, Dana Mathews & Mary Quiceno
Emory University, Georgia, USA
Allan I. Levey, James J. Lah & Janet S. Cellar
University of Kansas, Medical Center, Kansas, USA
Jeffrey M. Burns, Russell H. Swerdlow & William M. Brooks
Mayo Clinic, Jacksonville, Florida, USA
Neill R Graff-Radford, Francine Parfitt & Kim Poki-Walker
Yale University School of Medicine, Connecticut, USA
Christopher H. van Dyck, Richard E. Carson, Martha G. MacAvoy & Pradeep Varma
McGill University, Montreal-Jewish General Hospital, Quebec, Canada
Howard Chertkow, Howard Bergman & Chris Hosein
Sunnybrook Health Sciences, Ontario, Canada
Sandra Black, Bojana Stefanovic & Curtis Caldwell
U.B.C. Clinic for AD & Related Disorders, British Columbia, Canada
Ging-Yuek Robin Hsiung, Benita Mudge, Vesna Sossi, Howard Feldman & Michele Assaly
Cognitive Neurology-St. Joseph’s, Ontario, Canada
Elizabeth Finger, Stephen Pasternack, Irina Rachisky, John Rogers, Dick Trost & Andrew Kertesz
Cleveland Clinic Lou Ruvo Center for Brain Health, Ohio, USA
Charles Bernick & Donna Munic
Premiere Research Inst (Palm Beach Neurology), Florida, USA
Carl Sadowsky & Teresa Villena
Georgetown University Medical Center, Washington D.C, USA
Raymond Scott Turner, Kathleen Johnson & Brigid Reynolds
Stanford University, California, USA
Jerome Yesavage, Joy L. Taylor, Barton Lane, Allyson Rosen & Jared Tinklenberg
Banner Sun Health Research Institute, Arizona, USA
Marwan N. Sabbagh, Christine M. Belden, Sandra A. Jacobson & Sherye A. Sirrel
Boston University, Massachusetts, USA
Neil Kowall, Ronald Killiany, Andrew E. Budson, Alexander Norbash & Patricia Lynn Johnson
Howard University, Washington D.C, USA
Thomas O. Obisesan, Saba Wolday & Joanne Allard
Case Western Reserve University, Ohio, USA
Alan Lerner, Paula Ogrocki, Curtis Tatsuoka & Parianne Fatica
Neurological Care of CNY, New York, USA
Smita Kittur
Parkwood Hospital, Pennsylvania, USA
Michael Borrie, T.-Y. Lee & Rob Bartha
University of Wisconsin, Wisconsin, USA
Sterling Johnson, Sanjay Asthana & Cynthia M. Carlsson
Dent Neurologic Institute, New York, USA
Vernice Bates, Horacio Capote & Michelle Rainka
Ohio State University, Ohio, USA
Douglas W. Scharre, Maria Kataki & Brendan Kelly
Albany Medical College, New York, USA
Earl A. Zimmerman, Dzintra Celmins & Alice D. Brown
Hartford Hospital, Olin Neuropsychiatry Research Center, Connecticut, USA
Godfrey D. Pearlson, Karen Blank & Karen Anderson
Dartmouth-Hitchcock Medical Center, New Hampshire, USA
Laura A. Flashman, Marc Seltzer, Mary L. Hynes & Robert B. Santulli
Wake Forest University Health Sciences, North Carolina, USA
Kaycee M. Sink, Leslie Gordineer, Jeff D. Williamson, Pradeep Garg & Franklin Watkins
Rhode Island Hospital, Rhode Island, USA
Brian R. Ott, Geoffrey Tremont & Lori A. Daiello
Butler Hospital, Rhode Island, USA
Stephen Salloway, Paul Malloy & Stephen Correia
Medical University South Carolina, Carolina, USA
Jacobo Mintzer, Kenneth Spicer & David Bachman
Nathan Kline Institute, New York, USA
Nunzio Pomara, Raymundo Hernando & Antero Sarrael
University of Iowa College of Medicine, Iowa, USA
Susan K. Schultz, Karen Ekstam Smith, Hristina Koleva, Ki Won Nam & Hyungsub Shim
USF Health Byrd Alzheimer’s Institute, University of South Florida, Florida, USA
Amanda Smith, Balebail Ashok Raj & Kristin Fargher

Authors

Lei Du
View author publications
You can also search for this author in PubMed Google Scholar
Kefei Liu
View author publications
You can also search for this author in PubMed Google Scholar
Xiaohui Yao
View author publications
You can also search for this author in PubMed Google Scholar
Jingwen Yan
View author publications
You can also search for this author in PubMed Google Scholar
Shannon L. Risacher
View author publications
You can also search for this author in PubMed Google Scholar
Junwei Han
View author publications
You can also search for this author in PubMed Google Scholar
Lei Guo
View author publications
You can also search for this author in PubMed Google Scholar
Andrew J. Saykin
View author publications
You can also search for this author in PubMed Google Scholar
Li Shen
View author publications
You can also search for this author in PubMed Google Scholar

Consortia

Alzheimer’s Disease Neuroimaging Initiative

Michael W. Weiner
, Paul Aisen
, Ronald Petersen
, Clifford R. Jack
, William Jagust
, John Q. Trojanowki
, Arthur W. Toga
, Laurel Beckett
, Robert C. Green
, John Morris
, Leslie M. Shaw
, Zaven Khachaturian
, Greg Sorensen
, Maria Carrillo
, Lew Kuller
, Marc Raichle
, Steven Paul
, Peter Davies
, Howard Fillit
, Franz Hefti
, David Holtzman
, M. Marcel Mesulam
, William Potter
, Peter Snyder
, Adam Schwartz
, Tom Montine
, Ronald G. Thomas
, Michael Donohue
, Sarah Walter
, Devon Gessert
, Tamie Sather
, Gus Jiminez
, Archana B. Balasubramanian
, Jennifer Mason
, Iris Sim
, Danielle Harvey
, Matthew Bernstein
, Nick Fox
, Paul Thompson
, Norbert Schuff
, Charles DeCArli
, Bret Borowski
, Jeff Gunter
, Matt Senjem
, Prashanthi Vemuri
, David Jones
, Kejal Kantarci
, Chad Ward
, Robert A. Koeppe
, Norm Foster
, Eric M. Reiman
, Kewei Chen
, Chet Mathis
, Susan Landau
, Nigel J. Cairns
, Erin Franklin
, Lisa Taylor-Reinwald
, Virginia Lee
, Magdalena Korecka
, Michal Figurski
, Karen Crawford
, Scott Neu
, Tatiana M. Foroud
, Steven Potkin
, Kelley Faber
, Sungeun Kim
, Kwangsik Nho
, Leon Thal
, Neil Buckholtz
, Marilyn Albert
, Richard Frank
, John Hsiao
, Jeffrey Kaye
, Joseph Quinn
, Lisa Silbert
, Betty Lind
, Raina Carter
, Sara Dolen
, Lon S. Schneider
, Sonia Pawluczyk
, Mauricio Beccera
, Liberty Teodoro
, Bryan M. Spann
, James Brewer
, Helen Vanderswag
, Adam Fleisher
, Judith L. Heidebrink
, Joanne L. Lord
, Sara S. Mason
, Colleen S. Albers
, David Knopman
, Kris Johnson
, Rachelle S. Doody
, Javier Villanueva-Meyer
, Valory Pavlik
, Victoria Shibley
, Munir Chowdhury
, Susan Rountree
, Mimi Dang
, Yaakov Stern
, Lawrence S. Honig
, Karen L. Bell
, Beau Ances
, Maria Carroll
, Mary L. Creech
, Erin Franklin
, Mark A. Mintun
, Stacy Schneider
, Angela Oliver
, Daniel Marson
, David Geldmacher
, Marissa Natelson Love
, Randall Griffith
, David Clark
, John Brockington
, Erik Roberson
, Hillel Grossman
, Effie Mitsis
, Raj C. Shah
, Leyla deToledo-Morrell
, Ranjan Duara
, Maria T. Greig-Custo
, Warren Barker
, Chiadi Onyike
, Daniel D’Agostino
, Stephanie Kielb
, Martin Sadowski
, Mohammed O. Sheikh
, Anaztasia Ulysse
, Mrunalini Gaikwad
, P. Murali Doraiswamy
, Jeffrey R. Petrella
, Salvador Borges-Neto
, Terence Z. Wong
, Edward Coleman
, Steven E. Arnold
, Jason H. Karlawish
, David A. Wolk
, Christopher M. Clark
, Charles D. Smith
, Greg Jicha
, Peter Hardy
, Partha Sinha
, Elizabeth Oates
, Gary Conrad
, Oscar L. Lopez
, Mary Ann Oakley
, Donna M. Simpson
, Anton P. Porsteinsson
, Bonnie S. Goldstein
, Kim Martin
, Kelly M. Makino
, M. Saleem Ismail
, Connie Brand
, Adrian Preda
, Dana Nguyen
, Kyle Womack
, Dana Mathews
, Mary Quiceno
, Allan I. Levey
, James J. Lah
, Janet S. Cellar
, Jeffrey M. Burns
, Russell H. Swerdlow
, William M. Brooks
, Liana Apostolova
, Kathleen Tingus
, Ellen Woo
, Daniel H. S. Silverman
, Po H. Lu
, George Bartzokis
, Neill R Graff-Radford
, Francine Parfitt
, Kim Poki-Walker
, Martin R. Farlow
, Ann Marie Hake
, Brandy R. Matthews
, Jared R. Brosch
, Scott Herring
, Christopher H. van Dyck
, Richard E. Carson
, Martha G. MacAvoy
, Pradeep Varma
, Howard Chertkow
, Howard Bergman
, Chris Hosein
, Sandra Black
, Bojana Stefanovic
, Curtis Caldwell
, Ging-Yuek Robin Hsiung
, Benita Mudge
, Vesna Sossi
, Howard Feldman
, Michele Assaly
, Elizabeth Finger
, Stephen Pasternack
, Irina Rachisky
, John Rogers
, Dick Trost
, Andrew Kertesz
, Charles Bernick
, Donna Munic
, Emily Rogalski
, Kristine Lipowski
, Sandra Weintraub
, Borna Bonakdarpour
, Diana Kerwin
, Chuang-Kuo Wu
, Nancy Johnson
, Carl Sadowsky
, Teresa Villena
, Raymond Scott Turner
, Kathleen Johnson
, Brigid Reynolds
, Reisa A. Sperling
, Keith A. Johnson
, Gad Marshall
, Jerome Yesavage
, Joy L. Taylor
, Barton Lane
, Allyson Rosen
, Jared Tinklenberg
, Marwan N. Sabbagh
, Christine M. Belden
, Sandra A. Jacobson
, Sherye A. Sirrel
, Neil Kowall
, Ronald Killiany
, Andrew E. Budson
, Alexander Norbash
, Patricia Lynn Johnson
, Thomas O. Obisesan
, Saba Wolday
, Joanne Allard
, Alan Lerner
, Paula Ogrocki
, Curtis Tatsuoka
, Parianne Fatica
, Evan Fletcher
, Pauline Maillard
, John Olichney
, Charles DeCarli
, Owen Carmichael
, Smita Kittur
, Michael Borrie
, T.-Y. Lee
, Rob Bartha
, Sterling Johnson
, Sanjay Asthana
, Cynthia M. Carlsson
, Pierre Tariot
, Anna Burke
, Ann Marie Milliken
, Nadira Trncic
, Adam Fleisher
, Stephanie Reeder
, Vernice Bates
, Horacio Capote
, Michelle Rainka
, Douglas W. Scharre
, Maria Kataki
, Brendan Kelly
, Earl A. Zimmerman
, Dzintra Celmins
, Alice D. Brown
, Godfrey D. Pearlson
, Karen Blank
, Karen Anderson
, Laura A. Flashman
, Marc Seltzer
, Mary L. Hynes
, Robert B. Santulli
, Kaycee M. Sink
, Leslie Gordineer
, Jeff D. Williamson
, Pradeep Garg
, Franklin Watkins
, Brian R. Ott
, Geoffrey Tremont
, Lori A. Daiello
, Stephen Salloway
, Paul Malloy
, Stephen Correia
, Howard J. Rosen
, Bruce L. Miller
, David Perry
, Jacobo Mintzer
, Kenneth Spicer
, David Bachman
, Nunzio Pomara
, Raymundo Hernando
, Antero Sarrael
, Susan K. Schultz
, Karen Ekstam Smith
, Hristina Koleva
, Ki Won Nam
, Hyungsub Shim
, Norman Relkin
, Gloria Chaing
, Michael Lin
, Lisa Ravdin
, Amanda Smith
, Balebail Ashok Raj
& Kristin Fargher

Contributions

L.D., L.G. and L.S. conceived and designed the research. L.D., K.L. and J.H. carried out the study analysis. X.Y., J.Y, S.L.R. and A.J.S. collected the data from ADNI database. L.D., K.L., L.S. and A.J.S. analyzed the results and wrote the paper. Data used in preparation of this article were obtained from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) database (adni.loni.usc.edu). As such, the investigators within the ADNI contributed to the design and implementation of ADNI and/or provided data but did not participate in analysis or writing of this report.

Corresponding authors

Correspondence to Lei Du or Li Shen.

Ethics declarations

Competing Interests

The authors declare that they have no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

A comprehensive list of consortium members appears at the end of the paper

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Du, L., Liu, K., Yao, X. et al. Pattern Discovery in Brain Imaging Genetics via SCCA Modeling with a Generic Non-convex Penalty. Sci Rep 7, 14052 (2017). https://doi.org/10.1038/s41598-017-13930-y

Download citation

Received: 05 May 2017
Accepted: 02 October 2017
Published: 25 October 2017
DOI: https://doi.org/10.1038/s41598-017-13930-y

This article is cited by

Machine Learning for Brain Imaging Genomics Methods: A Review
- Mei-Ling Wang
- Wei Shao
- Dao-Qiang Zhang
Machine Intelligence Research (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Transcriptome-wide association analysis of brain structures yields insights into pleiotropy with complex neuropsychiatric traits

Gene-SGAN: discovering disease subtypes with imaging and genetic signatures via multi-view weakly-supervised deep clustering

An adaptive and robust method for multi-trait analysis of genome-wide association studies using summary statistics

Introduction

Methods

Preliminaries

Sparse Canonical Correlation Analysis (SCCA)

Non-convex Penalty Functions for SCCA

The Proposed Non-convex SCCA Model and Optimization Algorithm

Computational Analysis

Data Availability

Experiments and Datasets

Data Description

Synthetic Dataset

Real Neuroimaging Genetics Dataset

Experimental Setup

Benchmarks

Parameter Tuning

Termination Criterion

Results on Synthetic Data

Results on Real Neuroimaging Genetics Data

Conclusion

References

Acknowledgements

Author information

Authors and Affiliations

Consortia

Alzheimer’s Disease Neuroimaging Initiative

Contributions

Corresponding authors

Ethics declarations

Competing Interests

Additional information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Machine Learning for Brain Imaging Genomics Methods: A Review

Comments

Search

Quick links