Introduction

Alzheimer’s disease (AD) is the most prevalent age-related neurodegenerative disease, and is characterized by progressive brain atrophy and neuronal cell loss leading to cognitive impairment [1,2,3]. The deposition of β-amyloid (Aβ) plaques and the accumulation of neurofibrillary tangles formed by hyperphosphorylated tau in the brain were recognized as the main neuropathological hallmarks of AD [3, 4]. It is widely accepted that genetic factors play an important role in the pathogenesis of AD [5, 6]. Over 100 mutations in three main genes, amyloid precursor protein (APP), presenilin 1 (PSEN1), and presenilin 2 (PSEN2) have been reported to cause familial AD [5, 6]. However, familial AD constitutes only 5% of total AD incidence. For the remaining 95%, genome-wide association studies (GWASs) in recent years have identified several AD susceptibility genes in populations of European ancestry [7], and the associations of part of these genes with AD were validated in Chinese populations in our previous study (including APOE, PICALM, BIN1, CLU, and MS4A4E [8]) and others [9]. However, the genes identified by GWAS can only explain about 16% of the total phenotypic variance, indicating there is missing heritability for AD [10]. Therefore, additional AD-related genes and variants are yet to be identified.

The mitochondrion is the center of cellular energy production, and its dysfunction contributes to the pathogenesis of neurodegenerative diseases [11,12,13]. An increasing amount of evidence has demonstrated that mitochondrial defects, including deficiency in oxidative phosphorylation (OXPHOS), increased reactive oxygen species (ROS) level, reduced ATP production, impaired balance of mitochondrial fission and fusion, are early events occurring in the brain of AD patients and animal models [11, 14,15,16,17,18,19,20,21,22,23,24]. On the other hand, the proteins of AD pathological pathways, including APP and Aβ, have been found to target mitochondria and disrupt mitochondrial function [11, 23, 25,26,27]. One recent study has found the enhancement of mitochondrial proteostasis could be helpful in reducing the toxic effect of Aβ [28].

Defects in the complexes of the mitochondrial respiratory chain, particularly those decreasing the efficiency of complex IV (also called cytochrome c oxidase (COX)), have been widely reported in AD patients and animal models [19, 20, 29,30,31,32]. In mammals, the COX is composed of 13 subunits [33, 34], and of these, 3 subunits are encoded by the mitochondrial genome (COX1, COX2, and COX3) and are the catalytic core of COX, while the other 10 subunits (COX4, COX5a, COX5b, COX6a, COX6b, COX6c, COX7a, COX7b, COX7c, and COX8) are nuclear-encoded and are structural subunits surrounding the catalytic core [33, 34]. Furthermore, there is another COX subunit NDUFA4, albeit with some controversy [35], newly identified in a recent study [36]. The expression levels of the nuclear-encoded COX subunits were found to be significantly reduced in brains and platelets of AD patients [20, 30, 37,38,39,40,41,42,43]; whereas, the mitochondrial-encoded 3 subunits were not significantly changed or even increased in AD patients [20, 30]. In addition to the subunits, there are many proteins participating in the assembling of COX, which are all nuclear-encoded proteins and are important regulators for the COX structure and function [33, 34, 44]. Among these assembly factors, mutations in COX10, COX15, SURF1, LRPPRC, SCO1, and SCO2 were identified as contributing to disorders of COX deficiency [44, 45]. One study in an Italian population, albeit with a limited sample size, has identified variants in the COX10 and COX15 genes to be associated with AD [46]. In general, the genetic association between the COX-related genes and AD has seldom been observed in human populations.

Considering the fact that decreased COX activity has been regarded as a characteristic of AD, we hypothesize that variants in the COX-related genes may confer genetic risk to AD. However, as yet there is no clear genetic evidence to support the association between the COX-related genes and AD. In this study, we analyzed variants in 17 COX-related genes (11 subunits and 6 assembly factors) in 713 AD patients and 859 controls from Han Chinese populations to look for their potential association with AD. We also performed an integrative analysis using publically available genetic data, dealing with gene expression and AD pathology, to clarify the potential role of the COX-related genes in AD. We were able to provide evidence supporting an active involvement of the COX-related genes in the development of AD.

Materials and methods

Subjects

A two-stage case-control study was performed to analyze the association between common variants in the COX-related genes and AD in this study (Fig. 1a). All the samples were of Han Chinese origin and were analyzed for the APOE and other AD-related genes in our previous studies [8, 47,48,49,50]. In brief, the discovery stage (stage I) was based on samples collected in Southwest China from 333 AD cases (62.8% females, age 74.08 ± 8.89 years, 24.4% were APOE ε4 carriers) and 435 healthy individuals (50.6% females, 17.7% were APOE ε4 carriers). The validation stage (stage II) was based on samples collected in East China from 380 AD cases (54.2% females, age 76.18 ± 9.63 years, 38.8% were APOE ε4 carriers) and 424 healthy controls (71.0% females, 17.8% were APOE ε4 carriers).

Fig. 1
figure 1

The strategy for investigating the association between the COX-related genes and AD. a The workflow in this study. b Protein–protein interactions (PPI) of 17 proteins encoded by the COX-related genes. The PPI analysis was performed using the STRING database (https://string-db.org/cgi/input.pl), which presents known and predicted PPI [94]. Two clusters were obtained, one cluster (red nodes) included 11 subunits of COX; the other cluster (blue and green nodes) contained 6 assembly factors of COX

In addition, we took advantage of the exome sequencing data of 107 genetically unrelated AD patients (authors’ unpublished data) that were selected using an extreme phenotype sampling strategy [51], to identify potential disease-causal/rare/functional variants in the exonic regions of the COX-related genes by comparing to 368 control subjects (including 160 in-house non-dementia controls [52], Han Chinese in Beijing (CHB, N = 103) and Southern Han Chinese (CHS, N = 105) from the 1000 Genomes Project phase 3 [53, 54]). These genetically unrelated AD patients (46.7% females, age 64.6 ± 10.29 years, 37.4% were APOE ε4 carriers) either had an early onset age <= 55 years old and/or a positive family history (presence of disease in one or more first-degree family members). Several patients were identified to carry potentially pathogenic mutations in PSEN1 (one patient), PSEN2 (three patients), and GRN (one patient), but no patient had potentially pathogenic mutation in APP and MAPT (authors’ unpublished data). We focused on these patients with early onset AD and/or family history of the disease, instead of sporadic or late-onset AD, as these patients might be more susceptible to genetic influence and be easier to show the potential consequence of the COX-related gene mutations. This sampling strategy was different from that of the IGAP dataset where all cases with age-at-onset <65 years of age were excluded [7], but it could be used as an initial study for identifying potentially pathogenic variants to be validated in sporadic AD patients.

The AD patients were diagnosed by at least two clinical psychiatrists according to the revised National Institute of Neurological and Communicative Disorders and Stroke and the Alzheimer’s Disease and Related Disorders Association (NINCDS-ADRDA) criteria [55,56,57], and Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition (DSM-IV). Sample collection complied with the declaration of Helsinki, with informed consent being obtained from all the subjects. This study was approved by the institutional review board of the Kunming Institute of Zoology, Chinese Academy of Sciences.

Genotyping and sequencing of genes encoding subunits and assembly factors of COX

Genomic DNA was extracted from peripheral blood by using AxyPrep Blood Genomic DNA Miniprep Kit (Axygen, USA). Seventeen genes encoding the subunits (11 genes) and assembly factors (6 genes) of the COX were selected to analyze the common variants and variations in the exonic regions (Fig. 1b and Supplementary Table S1). The linkage disequilibrium (LD) pattern of the −10 ~ +10 kb region of the targeted genes were analyzed by using Haploview software [58] to pick up SNPs that are capable of tagging more SNPs in the targeted region. A total of 41 SNPs each with a minor allele frequency (MAF) >10% in the CHB population in the International HapMap Project database (HapMap, http://hapmap.ncbi.nlm.nih.gov/ [59]) were selected (Supplementary Table S1). According to the LD pattern of Asian population in the 1000 Genomes Project phase 3 (http://www.1000genomes.org) [53, 54], these 41 SNPs are capable of tagging a total of 815 SNPs (r2 > 0.8) in the gene region (Supplementary Table S1). The genotyping analysis was performed by using the SNaPshot assay, as described in our previous studies [8, 47, 49, 50]. In brief, multiplex PCR was performed by using 41 pairs of primers (Supplementary Table S2). After cleaning up with SAP and ExoI (TaKaRa Biotechnology Co. Ltd., Dalian, China), the PCR products were subjected to single-base extension reaction by using respective extension primers (Supplementary Table S2) in a total volume of 10 μL containing 4 μL multiplex PCR products, 5 μL SNaPshot Multiplex Ready Reaction Mix and 0.4–0.8 μM pooled SNP-specific oligonucleotide primers (Supplementary Table S2). After purification, a mixture of 4 μL purified products and 6 μL ddH2O was analyzed by the 3730xl DNA analyzer (Applied Biosystems) at the Kunming Biodiversity Large-Apparatus Regional Center, Kunming Institute of Zoology. We used GeneMarker software (http://www.softgenetics.com/GeneMarker.html) to read the genotyping results.

For sequencing the exon and flanking regions of the COX-related genes, we used the next generation sequencing technologies as described in our previous studies [47, 52, 60]. In brief, the paired-end reads were generated on the HiSeq 4000 sequencer (Illumina). High-quality sequence reads were aligned with the reference genome hg19 by Burrows-Wheeler Aligner (BWA, http://bio-bwa.sourceforge.net/) [61] and Samtools (http://samtools.sourceforge.net/) [62, 63]. The Genome Analysis Toolkit (GATK, https://www.broadinstitute.org/gatk/) [64] was applied for SNP calling.

Association analyses of the variants

The frequency of variants was searched in publically available databases: the 1000 Genomes Project phase 3 (http://www.1000genomes.org) [53, 54] and the Asian samples in the Exome Aggregation Consortium (ExAC: http://exac.broadinstitute.org) [65]. Data from the International Genomics of Alzheimer’s Project (IGAP) [7] was retrieved to investigate whether the variants identified in this study are associated with AD in populations of European ancestry. The potential pathogenicity of each variant was estimated by using the Combined Annotation Dependent Depletion (CADD) database [66]. Quanto software [67] was used to evaluate the statistical power of the studied sample. The deviation from the Hardy–Weinberg equilibrium (HWE) for each SNP was calculated using the PLINK software [68]. The LD plot of the identified variants was checked in LocusZoom (http://locuszoom.org/) [69]. The allele and genotype frequencies of all the SNPs in AD patients and controls were analyzed by using the two-tailed Fisher’s exact test.

The association between the genotype and mRNA expression was analyzed by using the available brain expression quantitative trait loci (eQTL) database: BRAINEAC (http://www.braineac.org/) [70], BrainCloud (http://braincloud.jhmi.edu/) [71] and the Genotype-Tissue Expression project (GTEx: http://www.gtexportal.org/home/) [72].

Data-mining of mRNA expression in AD patients

The expression level of the COX-related genes in AD was retrieved from the AlzData database (www.alzdata.org) [73], in which we have integrated and normalized the original microarray data of 684 AD brain tissues and 562 control tissues from Gene Expression Omnibus (GEO: http://www.ncbi.nlm.nih.gov/sites/GDSbrowser) [19, 74,75,76,77,78,79,80,81,82,83,84,85,86]. We used the stage I data as defined in Xu et al. [73] that contains expression data of 269 AD brain tissues and 271 control brain tissues from four brain regions (entorhinal cortex, hippocampus, temporal cortex, and frontal cortex) to investigate the alteration of mRNA levels of the COX-related genes in AD, by using a two-tailed Student’s t-test with the GraphPad Prism software (GraphPad Software, La Jolla, CA, USA). The expression levels of the COX-related genes were further investigated in the stage II data as defined in Xu et al. [73], which contained the microarray data of 415 AD brain tissues and 291 control tissues from temporal cortex and frontal cortex.

Correlation between mRNA levels of the COX-related genes and pathology level in AD mouse models

The spatial-temporal expression data and pathological features of AD transgenic and wide-type (WT) mice were retrieved from Mouseac (www.mouseac.org) [87], including 219 brain tissues from five kinds of AD mouse models with familial AD mutations and 114 brain tissues of age-matched WT mice. The expression levels of the COX-related genes in the hippocampus and cortex tissues from mice at four life stages (2, 4, 8, and 18 months) were compared between AD mouse models and WT mice. The pathology level of Aβ plaques was quantified. The correlation between mRNA levels of the COX-related genes and pathology level was measured using the Pearson correlation test with the GraphPad Prism software (GraphPad Software, La Jolla, CA, USA).

Convergent functional genomics (CFG) for the COX-related genes in AD

A CFG approach was applied to integrate multiple lines of evidence for the association between the COX-related genes and AD [73, 88, 89]. The CFG score for each gene was estimated based on the following evidence: (1) variant(s) in the gene was associated with AD in the combined Han Chinese population (P < 0.05); (2) variant(s) in the gene was associated with AD in the IGAP population (P < 0.05) [7]; (3) the AD-associated SNP(s) was eQTLs that related to altered gene expression in brain (P < 0.05) [70,71,72]; (4) the gene showed an differential expression in AD patients (P < 0.05) [73]; (5) the gene showed an differential expression in AD mice (P < 0.05) [87]; (6) the mRNA level was correlated with AD pathology in AD mice (P < 0.05) [87]. When any of the above mentioned features were present, one point was assigned, leading to a CFG score ranging from 0 to 6 points. Note that the actual inclusion criteria for the above evidence for the CFG analysis were somewhat arbitrary, as the cutoff of significant P-value was set based on the manner of P-value distribution for each line of evidence [73]. For instance, evidence from the current genetic association analysis and from IGAP was given equal weight despite a much higher likelihood of false-positive association in the IGAP data without correction for multiple hypothesis testing [7]. Therefore, some targets with equally weighted significance at single dimension could be false positive. Nonetheless, the truly disease-related gene would be expected to be cross-validated by multi-dimension of evidence, regardless of the level of significance at single-dimension [73].

Results

Genetic variants in the COX-related genes were associated with AD in Han Chinese

In order to perform a comprehensive analysis of the genetic association between the COX-related genes and AD in Chinese, we selected 17 COX-related genes (11 subunit genes and 6 assembly factors) and screened for common variants in a two-stage case-control study of 1572 Han Chinese subjects (Supplementary Table S1). All of the 41 SNPs were in Hardy–Weinberg equilibrium (Supplementary Table S1). The MAF values of these SNPs ranged from 9.2 to 49.3%. The power to detect the odds ratio (OR) value as 1.25 for risk allele was expected to range from 46.22 to 84.33% under a log additive model, with a false positive rate being under 5%.

In the Stage I study of the cohort from Southwest China, SNPs in NDUFA4, SURF1, COX10, COX6B1, COX5B, COX7C, and COX7A2 were associated with genetic risk of AD (Table 1 and Supplementary Table S3). In the Stage II study of an independent Han Chinese population from East China, the trends of the SNPs in NDUFA4, SURF1, COX10, and COX6B1 were validated, although they did not reach a statistically significant level (Table 1 and Supplementary Table S4). As the sample size of both study cohorts were relatively small, we pooled these samples together to increase the statistical power. In the combined Chinese AD cohorts, SNPs in NDUFA4 (rs12671052, OR = 0.746, P = 0.006; rs12667441, OR = 0.738, P = 0.005), SURF1 (rs571549, OR = 1.230, P = 0.007), COX10 (rs9891988, OR = 0.851, P = 0.026; rs2530377, OR = 0.836, P = 0.018; rs11078233, OR = 1.181, P = 0.023) and COX6B1 (rs10420252, OR = 0.802, P = 0.042) remained to be associated with AD (Table 2 and Supplementary Table S5). Considering the fact that the potential population stratification of the Han regional samples [90] might affect this result, the data of the Han Chinese in Beijing (CHB, N = 103) and Southern Han Chinese (CHS, N = 105) were retrieved from 1000 Genomes Project phase 3 [53, 54] and were combined with the Han Chinese controls in this study as a control of the general populations. We found that the association between SNPs in NDUFA4, SURF1, COX10, COX6B1, and AD remained to be significant in the combined population (Table 2). Note that the significance did not survive the Bonferroni test, in part due to the relatively small sample size.

Table 1 SNPs in the COX-related genes that were associated with AD in the discovery (I) and validation (II) stages
Table 2 SNPs in the COX-related genes found to be associated with AD in the combined Han Chinese population

Exons sequencing of the COX-related genes identified variants associated with AD risk

In order to further explore whether there are potential mutations or undetected rare variants in the COX-related genes to be associated with AD, the coding and flanking regions of the COX-related genes were sequenced in 107 AD cases with extreme phenotypes (onset age <= 55 years old and/or with family history of AD), and were compared with a combined control population containing 368 individuals [52,53,54]. The genotyping data from 4327 East Asians from the ExAC (http://exac.broadinstitute.org) [65] was used as another control. A total of 72 variants were identified in the exons and flanking regions of these genes (Table 3 and Supplementary Table S6). Among them, 9 variants in COX6B1, COX6B2, COX10, COX4I1, COX15, and SURF1 showed significantly different frequencies between the cases and the controls (Table 3). Analysis of the LD pattern revealed that these variants were not linked with those AD-associated SNPs identified in the SNaPshot assay (Supplementary Figure S1). Among these variants, rs7991 in the COX6B1 gene was associated with AD risk (Table 3). Interestingly, the association between rs7991 and AD was also observed in the IGAP dataset [7], though the association was not significant after multiple testing correction, suggesting a potential risk effect of this variant in different populations (Table 3). Pathogenicity assessments of these variants by using the CADD database [66] indicated that rs7991 (COX6B1), rs11190255 (COX15), NC_000017.10:g.14110383C>T (COX10) and rs62637580 (SURF1) belong to the top 10% most deleterious variants in the human genome (Table 3). To sum up, through sequencing the coding and flanking regions of the COX-related genes in AD patients with a high probability of hereditarily transmitted AD, we identified several potentially pathogenic variants in the COX-related genes associated with AD risk.

Table 3 Variants identified in the exon and flanking regions of the COX-related genes in 107 genetically unrelated patients with a high probability of hereditarily transmitted AD

mRNA expression levels of the COX-related genes were significantly decreased in brains of AD patients

Considering the fact that the significantly decreased activity of COX is a characteristic of AD [20, 30, 37,38,39,40,41,42,43], we compared mRNA expression profiles of 269 AD brain tissues and 271 control brain tissues from four brain regions (entorhinal cortex, hippocampus, temporal cortex, and frontal cortex), which were integrated and normalized by the AlzData database (www.alzdata.org) ([73] and references therein), to investigate whether the mRNA expression levels of the COX-related genes were altered in AD brains. Among the 17 analyzed genes, the mRNA levels of 13 genes were significantly decreased in AD patients (Table 4). In particular, we observed a remarkable downregulation of mRNA levels of the COX5B, COX7C, COX7A2, NDUFA4, COX6C, COX8A, COX5A, COX4I1, COX6A1, and COX6B1 genes in the entorhinal cortex and hippocampus tissues of AD patients (Table 4 and Fig. 2), which were the most affected brain regions in AD [2]. Furthermore, the mRNA levels of COX5B, NDUFA4, COX6C, COX8A, COX5A, COX4I1, and COX6B1 were also significantly downregulated in the temporal cortex of AD patients, and COX5B and COX8A were significantly decreased in all four brain tissues of AD patients (Table 4). It is of note that all of the above significantly regulated genes in entorhinal cortex and hippocampus tissues of AD patients were subunits of COX [33], while the mRNA expression level of assembly factors were not consistently altered in AD brains. Further analysis of the Stage II data of the Alzdata database ([73] and references therein) (containing 415 AD brain tissues and 291 control tissues from temporal cortex and frontal cortex) gave similar results and revealed one more gene, COX10, to be significantly decreased in the frontal cortex of AD patients (Supplementary Table S7).

Table 4 Comparison of mRNA levels of COX-related genes between 269 AD brain and 271 control brain tissues
Fig. 2
figure 2

The mRNA levels of the COX-related genes were significantly decreased in entorhinal cortex and hippocampus tissues of AD patients. aj The mRNA expression data of the COX-related genes in entorhinal cortex and hippocampus tissues were retrieved from the NCBI Gene Expression Omnibus (GEO, https://www.ncbi.nlm.nih.gov/geo) and were normalized and analyzed in the AlzData database (www.alzdata.org) [73]. The expression data for the entorhinal cortex came from 39 AD patients and 39 controls in the 4 GEO datasets: GSE26927 [75], GSE48350 [80], GSE26972 [84], and GSE5281 [19]. The expression data for the hippocampus came from 74 AD patients and 65 controls in the 5 GEO datasets: GSE28146 [74], GSE48350 [80], GSE5281 [19], GSE29378 [78], and GSE36980 [76]. The difference between AD patients and controls was quantified by using a two-tailed Student’s t-test. Error bars represent the standard error of the mean. *P-value <0.05; **P-value <0.01; ***P-value <0.001; ****P-value <0.0001

Taken together, though a systematic analysis of mRNA expression profiles of AD brain tissues, we found that most of the subunits of the COX were downregulated at the mRNA levels in AD brain tissues and this was consistent with the reduced COX activity in AD brains as reported in previous studies [20, 37, 39,40,41].

The AD risk SNPs in the COX-related genes were associated with altered gene expression level

Based on the finding that the mRNA levels of the COX-related genes were significantly downregulated in AD brains, we investigated whether the AD risk SNPs in these genes would account for the altered gene expression. Through data-mining of the brain eQTL data from the BRAINEAC database (http://www.braineac.org/) [70], which provides the brain eQTL data across 10 brain tissues of 134 neurological normal individuals, we found that the carriers of allele A of SNP rs10420252 have significantly decreased COX6B1 mRNA expression level in the brain (Table 2 and Fig. 3a). Investigation of the BrainCloud database (http://braincloud.jhmi.edu/; [71]) identified that the AD risk genotype CC of rs10778233 was associated with a reduced COX10 mRNA expression level in prefrontal cortex (Fig. 3b). An analysis of the eQTL data in the CTEx project [72] (GTEx, http://www.gtexportal.org/home/) revealed that the AD risk genotype CC of rs571549 was prominently associated with an increased SURF1 mRNA expression level in hippocampus and cortex tissues (Fig. 3c), which was consistent with the upregulated SURF1 mRNA level in the frontal cortex tissues of AD patients (Table 4 and Fig. 3d). Collectively, these results suggested that variants in the COX-related genes may be associated with AD by affecting gene expression.

Fig. 3
figure 3

The AD-associated SNPs in the COX-related genes were eQTLs and affected mRNA expression in brain tissues. a The carriers of the A allele of SNP rs10420252 showed a significantly decreased COX6B1 mRNA expression level. Data were retrieved from the brain eQTL database BRAINEAC [70], and was analyzed by using a two-tailed Student’s t-test. b Genotype CC of SNP rs10778233 was associated with a decreased COX10 mRNA expression level in cortex. Data were retrieved from the BrainCloud database (http://braincloud.jhmi.edu/) [71] and was analyzed by using a linear regression test. c Genotype CC of SNP rs571549 was prominently associated with an increased SURF1 mRNA expression level in frontal cortex, cortex, and hippocampus. Data were retrieved from the CTEx project (GTEx, http://www.gtexportal.org/home/) [72] and was analyzed by using a linear regression test. d The mRNA expression level of SURF1 was increased in frontal cortex tissues from AD patients relative to controls. The data were retrieved from the 7 datasets of GEO: GSE48350 [80], GSE66333 [81], GSE53890 [77], GSE5281 [19], GSE36980 [76], GSE12685 [86], and GSE15222 [85], and were normalized in the AlzData database (www.alzdata.org) [73]. Error bars represent the standard error of the mean. *P-value <0.05; **P-value <0.01; two-tailed Student’s t-test

The mRNA expression levels of the COX-related genes were correlated with the level of Aβ plaques in AD mouse models

In order to explore whether the mRNA expression levels of the COX-related genes were associated with AD pathology in species other than in man, we retrieved and analyzed the gene expression data and pathology data of AD mouse models from Mouseac (www.mouseac.org) [87]. By analyzing the gene expression levels in hippocampus and cortex tissues of wild type and HO _TASTPM mice (with homogenous APP K670N/M671L and PSEN1M146V mutations) at different life stages, we found that most of the COX-related genes were downregulated in hippocampus at the late life stage of the AD mouse models, though only some of them (Cox6a1, Cox7a2, Cox7c, Surf1, and Lrpprc) reached a significant level (Fig. 4). An analysis of the mRNA expression data in the cortex tissues gave similar results (Supplementary Figure S2). Strong negative correlations between the mRNA expression levels of Lrpprc, Cox7a2, Ndufa4, Cox5a, and Cox10 and the level of Aβ plaques were observed in hippocampus and cortex tissues of mouse models with the APP K670N/M671L mutation, while the Cox6b2 gene exhibited a significantly positive correlation (Fig. 4 and Supplementary Figure S2). Overall, these observations further supported the association between COX-related genes and AD, especially at the pathological level.

Fig. 4
figure 4

The mRNA expression levels of the COX-related genes were correlated with the level of Aβ plaque in hippocampus tissues of AD mouse models. The gene expression data and pathology data were retrieved from Mouseac (www.mouseac.org) [87]. Data shown were the age-related mRNA expression (left) and the correlation between mRNA level and Aβ plaque (right) of each of 15 genes (ao). Sco1 and Sco2 had no data available in Mouseac [87]. The age-related mRNA expression level was measured in hippocampus tissues of wild type (n = 7) and HO _TASTPM mice (with homogenous APP K670N/M671L and PSEN1M146V mutations, n = 3) at different life stages (2, 4, 8, and 18 months). Error bars represent the standard error of the mean. *P-value <0.05; **P-value <0.01; ****P-value <0.0001; ns, not significant; two-tailed Student’s t-test. The correlation between mRNA level and Aβ plaques in mice with APPK670N/M671L mutation (n = 44) was measured using the Pearson correlation test. A P-value <0.05 was considered as statistically significant, ns: not significant

CFG analysis prioritized COX6B1, COX10, NDUFA4, and SURF1 genes as being associated with AD

In order to identify the highly AD-relevant genes, the COX-related genes were subjected to a CFG analysis [73, 88, 89]. By using multiple lines of evidence (including the genetic association in Han Chinese and in IGAP populations [7], eQTLs [70,71,72], gene expression in AD patients [73] and AD mice, and pathology correlation in AD mice [87]), each gene was scored and ranked (Table 5). COX6B1 and COX10 showed the highest CFG score of 5, followed by NDUFA4 and SURF1 with a CFG score of 4, indicating that these genes were possibly the most significant of the 17 COX-related genes considered to be associated with AD (Table 5).

Table 5 Convergent functional genomics (CFG) for the COX-related genes in AD

Discussion

Despite the fact that the downregulation of the COX-related genes and defects of the COX activity have been recognized as characteristics of AD [20, 30, 37,38,39,40,41,42,43], there has been little evidence produced to show the genetic association between the COX-related genes and AD [46], especially in populations of Asian origin. We present here an association study exploring the involvement of the COX-related genes in contributing genetic risk to AD. By screening for the common variants (Tag SNPs) and variations in the exon regions of 17 COX-related genes in Han Chinese populations, followed by an integrative analysis with data of eQTL, expression and pathology, we have been able to show an active role of the COX-related genes in AD. To our knowledge, this is the first extensive study evaluating the association of the COX-related genes with AD.

Among the 11 subunit genes of COX, the COX6A1, COX6B1, and NDUFA4 genes showed associations with AD genetic risk (Table 2). Further analysis of the exon regions in patients with a high probability of hereditarily transmitted AD also identified variants in COX6B1, COX6B2, and COX4I1 to be associated with increased AD risk (Table 3). Most of these genes, except for COX6B2, were prominently downregulated in the entorhinal cortex and hippocampus tissues of AD patients (Table 4 and Fig. 2). NDUFA4, a newly recognized subunit of COX [36], exhibited a similar mRNA expression pattern with the other COX subunits (Fig. 2 and Table 4), and was negatively correlated with Aβ plaque burden in AD mouse models (Fig. 4). The data further supported the previous reports for decreased mRNA levels of nuclear-encoded subunits of COX and impaired COX activity in AD brains [20, 37, 39,40,41]. It is of note that three of the subunit genes, COX6A1, COX6B1, and COX6B2, identified as being risk genes in the genetic analysis were members of COX subunit 6. The mRNA expression levels of COX6A1 and COX6B1 were decreased in AD patients and AD mouse models (Table 4, Figs. 2 and 4). However, eQTL analysis showed that the risk genotype of rs10420252 was associated with an increased COX6B1 mRNA level in AD brain (Fig. 3). Interestingly, COX6B2 exhibited a remarkably increased mRNA expression level in the hippocampus tissues of AD patients and mouse models, and there was a positive correlation between its mRNA expression level and the pathological level of Aβ plaque burden in hippocampus and cortex tissues of AD mouse models (Figs. 2 and 4, and Supplementary Figure S2). The increased mRNA expression level of COX6B2 in AD was in contrast to those of COX6A1 and other COX subunits, suggesting that there might be underlying mechanisms to regulate the balance of expression of COX subunits in AD.

Assembly factors are key regulators for the COX structure and function [33, 34, 44]. Many diseases, including Leigh syndrome, myopathy, encephalopathy, to name a few, are caused by mutations of the COX assembly factors [44]. In this study, we screened variants in LRPPRC, SURF1, SCO1, SCO2, COX15, and COX10, which were reported to be mutational hotspots for the COX-related diseases [44]. Variants in SURF1, COX10, and COX15 were associated with AD in Han Chinese population (Table 2 and Table 3). However, the mRNA levels of the three genes were not consistently altered in AD patients. An increased mRNA expression level of the SURF1 gene was observed in the frontal cortex tissues of AD patients (Table 4 and Fig. 3). Consistently, the risk genotype of rs571549 in the SURF1 gene was found to be associated with a significantly increased mRNA level of SURF1 in frontal cortex, cortex, and hippocampus tissues (Fig. 3). The mRNA expression of COX10 was decreased in the frontal cortex of AD patients (Supplementary Table S7). The mRNA level of this gene was also decreased in the hippocampus and cortex tissues from the AD mouse models, and was negatively correlated with Aβ plaque burden in both hippocampus and cortex tissues (Fig. 4 and Supplementary Figure S2). Accordingly, carriers with the risk genotype of rs11078233 had a decreased mRNA level of COX10 in brain tissues (Fig. 3). All of the data indicated that variants in the COX-related genes may be associated with AD through affecting the mRNA expression.

By using an integrative analysis of mRNA expression data of AD patients ([73] and references therein) and mouse models [87], we obtained a more comprehensive dataset (684 AD brain tissues and 562 control tissues; and the brain expression and pathology data of 44 AD mice) to explore the mRNA expression alterations of the COX-related genes in AD. Almost all of the COX-related genes, except for COX15, SURF1, and COXB2, which have been discussed above, were significantly downregulated in AD patients (Table 4 and Supplementary Table S7). In particular, Cox7a2 and Lrpprc were significantly downregulated in mouse models, and we observed a negative correlation of the mRNA expression levels and Aβ plaque burden levels in hippocampus and cortex tissues (Fig. 4 and Supplementary Figure S2). Although, we found no association of genetic variants in these two genes with AD risk in this study, we could not exclude a possibility of rare functional variants or SNPs in the regulatory regions of these genes that would confer genetic risk to AD. Further investigation is needed to clarify this important issue.

Among all the identified variants in this study, only rs7991 in the COX6B1 gene showed an association with AD in the IGAP dataset [7], but the association could not survive after multiple testing correction. The inconsistence of the associations of other SNPs and genes with AD between our sample and the IGAP dataset [7] could be explained by different genetic structures between populations of Asian and European origins that may have different AD susceptible genes/variants [8]. Note that none of the identified common variants showed an association with AD after Bonferroni correction if we followed a stringent P-value <0.001 (0.05/41), partially because of the limited sample analyzed here. However, the association of these variants with AD could not be simply neglected, as we did identify some variants that could alter gene expression and showed different frequencies between cases and controls. These results provided genetic evidence to further support the critical role of COX in AD as reported by previous studies [20, 30, 37,38,39,40,41,42,43].

Whether defects of mitochondria contribute to the pathology of AD is still under debate [20, 91]. Converging lines of evidence have demonstrated that mitochondrial dysfunction occurs before the development of AD pathology [16, 21, 22, 24, 30, 31, 92]. In the early stages of AD, downregulation of many of the OXPHOS related genes has been observed [19, 30,31,32]. Furthermore, oxidative stress as a result of OXPHOS defects has contributed to neuronal death in AD brains [12, 93]. On the other hand, there are multiple lines of direct evidence showing that mitochondria are the direct target of Aβ, thus the mitochondrial defects observed in AD should be the consequence of Aβ toxicity [23, 25,26,27]. A focused functional characterization of the risk COX-related genes in the context of the above studies is essential to define the roles of mitochondrial genes and mitochondrial dysfunction in AD pathology.

Finally, by using a CFG approach which integrating multiple lines of evidence, we have highlighted COX6B1, COX10, NDUFA4, and SURF1 as highly AD-relevant genes. Taken together, through an extensive association analysis of the COX-related genes at the genetic, expressional, and pathological levels, we have provided comprehensive evidence to support the important role of the COX-related genes in the development of AD. Further studies with independent populations for validation and functional assays are necessary to confirm the present conclusion and to characterize the mechanisms of the COX-related genes in AD pathogenesis.