Exome sequencing identifies rare damaging variants in ATP8B4 and ABCA1 as risk factors for Alzheimer’s disease

Holstege, Henne; Hulsman, Marc; Charbonnier, Camille; Grenier-Boley, Benjamin; Quenez, Olivier; Grozeva, Detelina; van Rooij, Jeroen G. J.; Sims, Rebecca; Ahmad, Shahzad; Amin, Najaf; Norsworthy, Penny J.; Dols-Icardo, Oriol; Hummerich, Holger; Kawalia, Amit; Amouyel, Philippe; Beecham, Gary W.; Berr, Claudine; Bis, Joshua C.; Boland, Anne; Bossù, Paola; Bouwman, Femke; Bras, Jose; Campion, Dominique; Cochran, J. Nicholas; Daniele, Antonio; Dartigues, Jean-François; Debette, Stéphanie; Deleuze, Jean-François; Denning, Nicola; DeStefano, Anita L.; Farrer, Lindsay A.; Fernández, Maria Victoria; Fox, Nick C.; Galimberti, Daniela; Genin, Emmanuelle; Gille, Johan J. P.; Le Guen, Yann; Guerreiro, Rita; Haines, Jonathan L.; Holmes, Clive; Ikram, M. Arfan; Ikram, M. Kamran; Jansen, Iris E.; Kraaij, Robert; Lathrop, Marc; Lemstra, Afina W.; Lleó, Alberto; Luckcuck, Lauren; Mannens, Marcel M. A. M.; Marshall, Rachel; Martin, Eden R.; Masullo, Carlo; Mayeux, Richard; Mecocci, Patrizia; Meggy, Alun; Mol, Merel O.; Morgan, Kevin; Myers, Richard M.; Nacmias, Benedetta; Naj, Adam C.; Napolioni, Valerio; Pasquier, Florence; Pastor, Pau; Pericak-Vance, Margaret A.; Raybould, Rachel; Redon, Richard; Reinders, Marcel J. T.; Richard, Anne-Claire; Riedel-Heller, Steffi G.; Rivadeneira, Fernando; Rousseau, Stéphane; Ryan, Natalie S.; Saad, Salha; Sanchez-Juan, Pascual; Schellenberg, Gerard D.; Scheltens, Philip; Schott, Jonathan M.; Seripa, Davide; Seshadri, Sudha; Sie, Daoud; Sistermans, Erik A.; Sorbi, Sandro; van Spaendonk, Resie; Spalletta, Gianfranco; Tesi, Niccolo’; Tijms, Betty; Uitterlinden, André G.; van der Lee, Sven J.; Visser, Pieter Jelle; Wagner, Michael; Wallon, David; Wang, Li-San; Zarea, Aline; Clarimon, Jordi; van Swieten, John C.; Greicius, Michael D.; Yokoyama, Jennifer S.; Cruchaga, Carlos; Hardy, John; Ramirez, Alfredo; Mead, Simon; van der Flier, Wiesje M.; van Duijn, Cornelia M.; Williams, Julie; Nicolas, Gaël; Bellenguez, Céline; Lambert, Jean-Charles

doi:10.1038/s41588-022-01208-7

Download PDF

Letter
Open access
Published: 21 November 2022

Exome sequencing identifies rare damaging variants in ATP8B4 and ABCA1 as risk factors for Alzheimer’s disease

Nature Genetics volume 54, pages 1786–1794 (2022)Cite this article

23k Accesses
67 Citations
183 Altmetric
Metrics details

Subjects

Abstract

Alzheimer’s disease (AD), the leading cause of dementia, has an estimated heritability of approximately 70%¹. The genetic component of AD has been mainly assessed using genome-wide association studies, which do not capture the risk contributed by rare variants². Here, we compared the gene-based burden of rare damaging variants in exome sequencing data from 32,558 individuals—16,036 AD cases and 16,522 controls. Next to variants in TREM2, SORL1 and ABCA7, we observed a significant association of rare, predicted damaging variants in ATP8B4 and ABCA1 with AD risk, and a suggestive signal in ADAM10. Additionally, the rare-variant burden in RIN3, CLU, ZCWPW1 and ACE highlighted these genes as potential drivers of respective AD-genome-wide association study loci. Variants associated with the strongest effect on AD risk, in particular loss-of-function variants, are enriched in early-onset AD cases. Our results provide additional evidence for a major role for amyloid-β precursor protein processing, amyloid-β aggregation, lipid metabolism and microglial function in AD.

Whole-genome sequencing reveals novel ethnicity-specific rare variants associated with Alzheimer’s disease

Article Open access 10 March 2022

Exome-wide age-of-onset analysis reveals exonic variants in ERN1 and SPPL2C associated with Alzheimer’s disease

Article Open access 26 February 2021

Rare variants in IFFO1, DTNB, NLRC3 and SLC22A10 associate with Alzheimer’s disease CSF profile of neuronal injury and inflammation

Article Open access 16 February 2022

Main

Beyond autosomal-dominant early-onset AD (<1% of all AD cases, onset at ≤65 years), the common complex form of AD has an estimated heritability of approximately 70%¹. Using genome-wide association studies (GWAS), 75 mostly common genetic risk factors/loci have been associated with AD risk in populations with European ancestry; however, individually these common variants have low effect sizes². Using DNA sequencing strategies, rare (allele frequency <1%) damaging missense or loss-of-function (LOF) variants in the TREM2, SORL1 and ABCA7 genes were identified to also contribute to the heritability of AD, with substantially higher effect sizes than individual GWAS hits^3,4,5,6,7,8. To detect additional genes for which rare variants are associated with AD risk, it is necessary to compare genetic sequencing data from thousands of AD cases and controls. In a large collaborative effort, we harmonized sequencing data of studies from Europe and the USA and applied a multistage gene burden analysis (Fig. 1a) (for sample descriptions, see Supplementary Table1 and Extended Data Figs. 1 and 2). We observed site-specific technical biases, since data were generated at multiple centers, using heterogeneous methods (Supplementary Table 2). To account for these batch effects, we designed and applied comprehensive quality control (QC) procedures (Methods and Supplementary Tables 3–5).

After sample QC, we first compared gene-based rare-variant burdens between 12,652 AD cases, consisting of 4,060 early-onset AD cases (EOAD, age at onset ≤65 years) and 8,592 late-onset AD cases (LOAD, age at onset >65 years) and 8,693 controls (stage 1 analysis; Supplementary Table 3). We detected 7,543,193 variants after sample and variant QC and annotated LOF variants with LOFTEE and missense variants with the Rare Exome Variant Ensemble Learner (REVEL) score and selected variants with a minor allele frequency (MAF) < 1% (Supplementary Table 4). We defined 4 deleteriousness thresholds by incrementally including variants with lower levels of predicted deleteriousness: LOF (n = 57,543), LOF + REVEL ≥ 75 (n = 111,755), LOF + REVEL ≥ 50 (n = 211,665) and LOF + REVEL ≥ 25 (n = 409,733), respectively. Of the 19,822 autosomal protein-coding genes, we analyzed the 13,222 genes that had a cumulative minor allele count (cMAC) ≥ 10 for the lowest deleterious threshold LOF + REVEL ≥ 25 (Methods); 9,168 genes for the LOF + REVEL ≥ 50 threshold, 5,694 for the LOF + REVEL ≥ 75 threshold and 3,120 genes for the LOF-only threshold (Fig. 1b). For these different deleteriousness thresholds, this analysis has an estimated power of 41, 22, 11 and 4%, respectively to attain a signal with P < 1 × 10⁻⁶ in stage 1, assuming that for a gene, the differential variant burden between cases and controls is associated with an odds ratio (OR) of 10.0 in EOAD and 3.33 in LOAD (Supplementary Table 6). Therefore, this analysis has only the power to discover genes for which either the differential variant burden is associated with a large effect size, and/or genes for which large numbers of damaging variant carriers are observed (Fig. 1b). Using ordinal logistic regression, 31,204 burden tests were performed across 13,222 genes in stage 1 (single genes were tested with up to 4 thresholds). Statistical inflation of test results was negligible (𝝀 = 1.046; Fig. 1c). Of all the burden tests performed, 13 tests, covering 6 genes, indicated a differential rare-variant burden between AD cases and controls (false discovery rate (FDR) < 0.1): SORL1, TREM2, ABCA7, ATP8B4, ADAM10 and ABCA1 (Table 1)).

Table 1 Stages 1 and 2 and meta-analysis AD association statistics

Full size table

To confirm these signals, we applied an analysis model consistent with stage 1 to an independent stage 2 dataset, which after QC, consisted of 3,384 cases and 7,829 controls (Supplementary Table 3–5) and also with negligible P value inflation (𝝀 = 1.016; Extended Data Fig. 3). The effect was tested in the direction observed in stage 1 (one-sided test). All genes selected in stage 1 reached P < 0.05 (Table 1, stage 2). The stage 2 effect sizes of these genes correlated with those observed in stage 1 (Pearson’s r on log odds = 0.91). We then meta-analyzed stage 1 + stage 2 across the 13 tests using a fixed-effect inverse variance method and corrected for the 31,204 tests performed in stage 1 (Holm–Bonferroni) (Table 1). This confirmed the AD association of rare damaging variants in the SORL1, TREM2, ABCA7, ATP8B4 and ABCA1 genes. The association signal of the ADAM10 gene was not significant exome-wide, presumably because prioritized variants in this gene are extremely few and rare, such that the signal can be confirmed only in larger datasets.

Strikingly, most of these genes also map to GWAS loci (SORL1, TREM2, ABCA7, ABCA1 and ADAM10). This led us to perform a focused analysis on GWAS loci, aiming to identify potential driver genes. To maximize statistical power, we merged the full exomes from the stage 1 and stage 2 samples into one mega-sample, again with negligible P value inflation (𝝀 = 1.025; Extended Data Fig. 4). We interrogated genes that were previously prioritized to drive the AD association in the 75 loci identified in the most recent GWAS² (Supplementary Table 7 and Methods). In 67 genes, we observed sufficient prioritized variants (cMAC ≥ 10) to test the burden signal in at least 1 deleteriousness category (a total of 187 tests). In addition to the genes mentioned above, our analysis indicated a suggestive signal of increased AD risk in RIN3, CLU, ZCWPW1 and ACE (FDR < 0.05) (Table 2 and Supplementary Table 8); these signals will have to be confirmed in a larger dataset. Nevertheless, the AD associations in these genes persisted when focusing on the burden of only the very rare variants (MAF < 0.1%), suggesting that the rare-variant burden is not in linkage with, and thus independent from, the GWAS sentinel variant.

Table 2 GWAS-targeted analysis in a mega-dataset without exome extracts

Full size table

Together, the newly associated genes provide additional evidence for a central role for APP processing, lipid metabolism, amyloid-β (Aβ) aggregation and neuroinflammatory processes in AD pathophysiology. Like ABCA7, ATP8B4 encodes a phospholipid transporter. Rare variants in this gene have been associated with the risk of developing systemic sclerosis, an autoimmune disease⁹. In the brain, ATP8B4 is predominantly expressed in microglia. Interestingly, GWAS indicated a potential association of ATP8B4 with AD², mainly through the rare missense variant that was most recurrent in our study (G395S). Of note, the OR point estimate for ATP8B4 LOF variants was close to 1, allowing for the possibility that the missense variants that drive the ATP8B4 association do not depend on a LOF effect. ABCA1 also encodes a phospholipid transporter; it lipidates apolipoprotein E (APOE)¹⁰ and poor ABCA1-dependent lipidation of APOE-containing lipoprotein particles increases Aβ deposition and fibrillogenesis¹¹. In line with this, the rare N1800H LOF variant in ABCA1 was previously associated with low plasma levels of APOE and evidence suggested an association with increased risk of AD and cerebrovascular disease¹². The α-secretase ADAM10 plays a major role in non-amyloidogenic APP metabolism¹³. Evidence for the AD association of rare variants in ADAM10 has remained suggestive until now: two rare missense variants in ADAM10 were reported before to incompletely segregate with LOAD in a few families¹⁴ (these variants did not associate with AD in our study; Supplementary Data) and a nonsense variant in the ADAM10 gene segregated with AD but in a small pedigree¹⁵. RIN3 has been associated with endosomal dysfunction and APP trafficking/metabolism^16,17. CLU (also known as APOJ) affects Aβ aggregation and clearance¹⁸ and ACE is suggested to have a role in Aβ degradation¹⁹. Thus far, the role of the histone methylation reader ZCWPW1 is unclear.

To better comprehend how these genes associate with AD, we analyzed the characteristics of rare damaging variants that contributed to the burden using the mega-sample (Fig. 2 and Table 3). For damaging variants in most genes, we observed increased carrier frequencies in younger cases and larger effect sizes were associated with an earlier age at onset (P = 0.0001) (Supplementary Table 9 and Extended Data Fig. 5). Yet the variants also contributed to an increased risk of LOAD (Fig. 2a,b and Table 3). The largest effect sizes were measured for LOF variants in SORL1, ADAM10, CLU and ZCWPW1; carriers of such variants had the lowest median age at onset, implying a key role for these genes in AD etiology (Table 3 and Extended Data Fig. 6). Moderate variant effect sizes were observed for LOF variants in TREM2, ABCA1 and RIN3, while the smallest variant effects were observed in ABCA7, ATP8B4 and ACE (Fig. 3 and Table 3).

**Fig. 2: Characterization of gene-specific variant features based on the mega-sample.**

Table 3 Mega-analysis: carrier frequency, effect sizes, median age at onset and attributable fraction

Full size table

**Fig. 3: ORs according to age at onset and variant pathogenicity.**

Extremely rare variants contributed more to large effect sizes than less rare variants (P = 0.03; Supplementary Table 10). Indeed, for SORL1, the variants with the lowest variant frequencies had the largest effect sizes (Fig. 2c and Supplementary Table 11) and damaging variants in ADAM10, CLU and ZCWPW1 were all extremely rare (Fig. 2d). Conversely, we observed that rare but recurrent variants contributed to the AD association of TREM2, ABCA7, ATP8B4 and RIN3 (Fig. 2d). The effect sizes of rare coding variant burdens were large compared to the effect sizes of the GWAS sentinel SNPs (Supplementary Tables 7 and 8). Up to 18% EOAD and 14% LOAD cases carried at least 1 predicted damaging variant in 1 of the 10 genes, compared to 9% of the controls (Supplementary Table 12). The fractions of EOAD cases in our sample that could be attributed to a rare variant in a specific gene ranged between 0.1 and 2.4% (approximately 2%: SORL1, TREM2, ABCA7; approximately 1%: ATP8B4, ABCA1, RIN3; and <0.5% for the remaining genes); for LOAD cases, this ranged between 0 and 1.3% (Table 3 and Extended Data Fig. 7).

We performed an age-matched sensitivity analysis to investigate possible effects from other age-related conditions, which supported a role in AD for all ten identified genes (Extended Data Fig. 8). Since APOE status was used as the selection criterion in several contributing datasets, burden tests were not adjusted for APOE-ε4 dosage; in a separate analysis we observed no interaction effects between the rare-variant AD association and APOE-ε4 dosage (Supplementary Table 13 and Methods). Also, the rare-variant burden association was not confounded by somatic mutations due to age-related clonal hematopoiesis (Supplementary Table 14).

Together, we report ATP8B4 and ABCA1 as new AD risk factors with exome-wide significance and we report suggestive evidence for the association of rare variants in the ADAM10 gene with AD risk. Furthermore, we identified RIN3, CLU, ZCWPW1 and ACE as potential drivers in GWAS loci, illustrating how analyses of rare protein-modifying variants can solve this drawback of GWAS studies²⁰. Larger datasets will be required to further confirm these signals. Given the association of LOF variants with increased AD risk, we suggest that the GWAS risk alleles in the respective loci might also be associated with reduced activity of the gene, which will have to be evaluated in further experiments. We observed an increased burden of rare damaging genetic variants in individuals with an earlier age at onset. Nevertheless, damaging variants (including APOE-ε4/ε4) were observed in only 30% of the EOAD cases (Supplementary Table 12), suggesting that additional damaging variants are yet to be discovered (Fig. 1b). Further, the effect of structural variants such as copy number variants and repetitive sequences will need to be investigated in future analyses. The associated genes strengthen our current understanding of AD pathophysiology. When treatment options become available in the future, identification of damaging variants in these genes will be of interest to clinical practice.

Methods

In-depth descriptions of all methods are described in Methods section of the Supplementary Note.

Sample processing, genotype calling and QC

We collected the exome, whole genome sequencing (WGS) or exome extract sequencing data of a total of 52,361 individuals, brought together by the Alzheimer Disease European Sequencing (ADES) consortium, the Alzheimer’s Disease Sequencing Project (ADSP)²¹ and several independent study cohorts (Supplementary Table 1). Exome extract samples only contained the raw reads that cover the ten genes identified in stage 1. Across all cohorts, AD cases were defined according to National Institute on Aging-Alzheimer’s Association criteria²² for possible or probable AD or according to National Institute of Neurological and Communicative Disorders and Stroke-Alzheimer’s Disease and Related Disorders Association criteria²³ depending on the date of diagnosis. When possible, supportive evidence for an AD pathophysiological process was sought (including cerebrospinal fluid biomarkers) or the diagnosis was confirmed by neuropathological examination (Supplementary Table 1). AD cases were annotated with the age at onset or age at diagnosis (2,014 samples); otherwise, samples were classified as late-onset AD (366 samples). Controls were not diagnosed with AD. All contributing datasets were sequenced using a paired-end Illumina platform; different exome capture kits were used and a subset of the sample was sequenced using WGS (Supplementary Table 2).

A uniform pipeline was used to process both the stage 1 and stage 2 datasets. Raw sequencing data from all studies were processed relative to the GRCh37 reference genome, the read alignments of possible chimeric origin were filtered and a GATK-based pipeline was used to call variants, while correcting for estimated sample contamination percentages. Samples were included in the datasets after they passed a stringent QC pipeline: samples were removed when they had high missingness, high contamination, a discordant genetic sex annotation, non-European ancestry, high numbers of new variants (with reference to dbSNP v.150), deviating heterozygous/homozygous or transition/transversion ratios. Further, we removed family members up to the third degree and individuals who carried a pathogenic variant in PSEN1, PSEN2, APP or in other genes causative for Mendelian dementia diseases (stage 1-only) or when there was clinical information suggestive of non-AD dementia. Variants considered in the analysis also passed a stringent QC pipeline: multiallelic variants were split into biallelic variants; variants that were in complete linkage and near each other were merged. Further, we removed variants that had indications of an oxo-G artifact, were located in short tandem repeat and/or low copy repeat regions, had a discordant balance between reads covering the reference and alternate allele, had a low depth for alternate alleles, deviated significantly from Hardy–Weinberg equilibrium, were considered false positives based on GATK variant quality score recalibration or were estimated to have a batch effect. Variants with >20% genotype missingness (read depth < 6) and differential missingness between the EOAD, LOAD and control groups were removed. To account for uncertainties resulting from variable read coverage between samples, we analyzed variants according to genotype posterior likelihoods, that is, the likelihood of being homozygous for the reference allele and heterozygous or homozygous for the alternate allele. To account for genotype uncertainty, the burden test was performed multiple times with independently sampled genotypes and the average P value across these tests is reported.

Variant prioritization and thresholds

We selected variants in autosomal protein-coding genes that were part of the Ensembl basic set of protein-coding transcripts (Gencode v.19/v.29 (ref. ²⁴); Supplementary Note) and that were annotated by the Variant Effect Predictor v.94.542 (ref. ²⁵). Only protein-coding missense and LOF variants were considered (LOF: nonsense, splice acceptor/donor or frameshifts). Missense and LOF variants were required to have a ‘moderate’ and ‘high’ variant effect predictor impact classification, respectively. Then, missense variants were prioritized using REVEL²⁶, annotation obtained from dbNSFP4.1a²⁷ and LOF variants were prioritized using LOFTEE v.1.0.2 (ref. ²⁸). For the analysis, we considered only missense variants with a REVEL score ≥ 25 (score range 0–100) and LOF variants were annotated as ‘high confidence’ by LOFTEE. Variants were required to have at least 1 carrier (that is, at least 1 sample with a posterior dosage >0.5) and an MAF < 1%, both in the considered dataset and the Genome Aggregation Database v.2.1 populations (nonneurological set).

Gene burden testing

The burden analysis was based on four deleteriousness thresholds by incrementally including variants from categories with lower levels of predicted variant deleteriousness: LOF; LOF + REVEL ≥ 75; LOF + REVEL ≥ 50; and LOF + REVEL ≥ 25, respectively. This allowed us to identify the variant threshold providing maximum evidence for a differential burden signal. To infer any dependable signal for a specific deleterious threshold, a minimum of 10 damaging alleles appertaining to this deleteriousness threshold was required, that is, a cMAC ≥ 10. Multiple testing correction was performed across all performed tests (up to four per gene). Burden testing was implemented using ordinal logistic regression. This enabled burden testing to particularly weight EOAD cases since previous findings indicated that high-impact variants are enriched in early-onset (EOAD) cases relative to late-onset (LOAD) cases⁸. This implies that the burden of high-impact deleterious genetic variants is ordered according to burden_EOAD > burden_LOAD > burden_control. Ordinal logistic regression enabled optimal identification of such signals, while also allowing the detection of EOAD-specific burdens (burden_EOAD > burden_LOAD ~ burden_control) and regular case-control signals (burden_EOAD ~ burden_LOAD > burden_control). For protective burden signals, the order of the signals is reversed, that is, burden_EOAD < burden_LOAD < burden_control. We considered an additive model while correcting for six population covariates, estimated after removal of population outliers. P values were estimated using a likelihood-ratio test. Genes were selected for confirmation in stage 2 if the FDR for AD association was <0.1 in stage 1 (Benjamini–Hochberg procedure²⁹). For the GWAS-targeted analysis, a more stringent threshold was used (FDR < 0.05) due to the absence of a separate confirmation stage. For the meta-analysis, genes were considered significantly associated with AD when the corrected P was <0.05 after family-wise correction using the Holm–Bonferroni procedure³⁰. Effect sizes (ORs) of the ordinal logistic regression can be interpreted as weighted averages of the OR being an AD case versus control and the OR being an early-onset AD case or not. To aid interpretation, we additionally estimated ‘standard’ case/control ORs across all samples per age category (EOAD versus controls and LOAD versus controls) and for age-at-onset categories ≤65 (EOAD), 65–70, 70–80 and >80 using multinomial logistic regression, while correcting for 6 PCA covariates.

GWAS driver gene identification

For the 75 loci identified in the most recent GWAS², genes were selected for burden testing based on earlier published gene prioritizations. First, gene prioritizations were obtained from Schwarzentruber et al.³¹ for 33 known loci. For 28 remaining loci, we obtained the tier 1 prioritization from Bellenguez et al.²; for loci without prioritization candidates (14 loci), we selected the nearest gene. In total, 81 protein-coding genes were selected (Supplementary Table 7), of which 67 genes had sufficient damaging allele carriers to be tested for at least 1 variant selection threshold. Gene burden testing was performed as described above and multiple testing correction to identify potential driver genes was performed using the Benjamini–Hochberg procedure, with a cutoff of 5%.

Validation of variant selection

We validated the REVEL variant impact prediction for missense and the LOFTEE impact prediction for LOF variants for all variants with an MAF < 1%, for which there were at least 15 damaging allele carriers. For protein-modifying variants that were not in the most significant burden selection of a gene due to a low predicted impact, we investigated whether they, nevertheless, showed a significant AD association (based on a case/control analysis using logistic regression). Vice versa, for variants that were in the burden selection, we investigated whether their effect size was significantly reduced or oppositely directed from other missense or LOF variants in the burden selection (Fisher’s exact test). Individual variant effects were analyzed in the stage 1 dataset, followed by a confirmation analysis in the stage 2 dataset. Multiple testing correction was performed per gene, with an FDR < 0.1 used as the threshold for stage 1 and Holm–Bonferroni (P < 0.05) for stage 2.

Descriptive measures

A variant carrier was defined as an individual for whom the summed dosage of all the variants in the considered variant deleteriousness category is ≥0.5 (see Methods section in the Supplementary Note). Carrier frequencies (CFs) were determined as the number of carriers/number of total samples. Attributable fraction for cases in an age group was estimated as the probability of a case with an age at onset in the age window i being exposed to a specific gene burden\(\left( {{\mathrm{CF}_{\mathrm{case,gene}},i}} \right)\), multiplied by an estimate of the attributable fraction among the exposed for these cases: \(\left( {\frac{{\mathrm{OR}_{\mathrm{gene},i} - 1}}{{\mathrm{OR}_{\mathrm{gene},i}}}} \right)\) (with the OR being an approximation of the relative risk)^32,33. For large effect sizes, this estimate approaches the difference in carrier frequency between cases and controls: \(\left( {\mathrm{CF}_{\mathrm{case,gene},i}} \right) - \left( {\mathrm{CF}_{\mathrm{control,gene}}} \right)\).

Sensitivity analyses

We determined if the observed effects could be explained by age differences between cases and controls. We constructed an age-matched sample, dividing samples into strata based on age/age at onset, with each stratum covering 2.5 years. Case/control ratios in all strata were kept between 0.1 and 10 by downsampling controls or cases, respectively. Subsequently, samples were weighted using the ‘propensity weighting within strata method’ (Supplementary Note). Finally, a case-control logistic regression was performed both on the unweighted and weighted case-control labels and estimated ORs and confidence intervals (CIs) were compared (Extended Data Fig. 8) Also, we determined if somatic mutations due to age-related clonal hematopoiesis could have confounded the results. We calculated for all heterozygous calls in the burden selection the balance between reference and alternate reads and compared these to reference values (Supplementary Table 14). While APOE was not included as a confounder, we performed a separate APOE interaction analysis (Supplementary Table 13) through a likelihood-ratio test between a model \({{{\mathrm{label}}}}\sim {{{\mathrm{gene}}}}\_{{{\mathrm{burden}}}}\_{{{\mathrm{score}}}} + {{{\mathrm{APOE}}}}\_{{{\mathrm{e}}}}4\_{{{\mathrm{dosage}}}}\) and an interaction model \({{{\mathrm{label}}}}\sim {{{\mathrm{gene}}}}\_{{{\mathrm{burden}}}}\_{{{\mathrm{score}}}} + {{{\mathrm{APOE}}}}\_{{{\mathrm{e}}}}4\_{{{\mathrm{dosage}}}} + {{{\mathrm{APOE}}}}\_{{{\mathrm{e}}}}4\_{{{\mathrm{dosage}}}}\) \(\times {{{\mathrm{gene}}}}\_{{{\mathrm{burden}}}}\_{{{\mathrm{score}}}}\) . This test was performed on a reduced dataset, from which datasets in which APOE status was used as the selection criterion were removed.

Power analysis

Power calculations were performed for ordinal and Firth logistic regression (case-control and EOAD versus rest; Fig. 1b and Supplementary Table 6). Given the ORs for the EOAD and LOAD cases, and the cMAC per gene, we sampled the number of alleles in the EOAD cases, LOAD cases and controls according to a multinomial distribution. We randomized these allele carriers across the dataset and performed the burden test as described above. The power for genes with a cMAC < 10 was set to 0 since these genes were not analyzed.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

The genetic variants analyzed in this study are listed in the Supplementary Data attached to this article. Summary statistics of the stage 1 analysis are publicly available at Zenodo (https://doi.org/10.5281/zenodo.6818051)³⁴ and they can also be downloaded from https://holstegelab.eu/data/. For all tests with a cMAC ≥10, this includes Ensembl gene ID, gene name, variant category, cMAC, P value, beta and s.e.m. The ADSP dataset, which includes the ADNI dataset used in this analysis, is publicly available on request from https://dss.niagads.org/datasets/. The accession numbers of the data used in this analysis are: ADSP DBGap: phs000572.v7.p4 (stage 1); ADSP NIAGADS: https://dss.niagads.org/datasets/ng00067-v2/ (stage 2). Source data to Figs. 2 and 3 are published alongside this paper.

Code availability

The software and algorithms used in the analysis are described in the Supplementary Note attached to this Letter. Self-contained code v.0.1.0 can be accessed at https://github.com/holstegelab/shortread_seq_analysis and Zenodo (https://doi.org/10.5281/zenodo.6827458)³⁵.

References

Gatz, M. et al. Role of genes and environments for explaining Alzheimer disease. Arch. Gen. Psychiatry 63, 168–174 (2006).
Article Google Scholar
Bellenguez, C. et al. New insights on the genetic etiology of Alzheimer’s and related dementias. Nat. Genet. 54, 412–436 (2022).
Article CAS Google Scholar
Holstege, H. et al. Characterization of pathogenic SORL1 genetic variants for association with Alzheimer’s disease: a clinical interpretation strategy. Eur. J. Hum. Genet. 25, 973–981 (2017).
Article CAS Google Scholar
Nicolas, G. et al. SORL1 rare variants: a major risk factor for familial early-onset Alzheimer’s disease. Mol. Psychiatry 21, 831–836 (2016).
Article CAS Google Scholar
Cuyvers, E. et al. Mutations in ABCA7 in a Belgian cohort of Alzheimer’s disease patients: a targeted resequencing study. Lancet Neurol. 14, 814–822 (2015).
Article CAS Google Scholar
Jonsson, T. et al. Variant of TREM2 associated with the risk of Alzheimer’s disease. N. Engl. J. Med. 368, 107–116 (2013).
Article CAS Google Scholar
Guerreiro, R. et al. TREM2 variants in Alzheimer’s disease. N. Engl. J. Med. 368, 117–127 (2013).
Article CAS Google Scholar
Bellenguez, C. et al. Contribution to Alzheimer’s disease risk of rare variants in TREM2, SORL1, and ABCA7 in 1779 cases and 1273 controls. Neurobiol. Aging 59, 220 e1-220.e9 (2017).
Article Google Scholar
Gao, L. et al. Identification of rare variants in ATP8B4 as a risk factor for systemic sclerosis by whole-exome sequencing. Arthritis Rheumatol. 68, 191–200 (2016).
Article CAS Google Scholar
Wahrle, S. E. et al. Overexpression of ABCA1 reduces amyloid deposition in the PDAPP mouse model of Alzheimer disease. J. Clin. Invest. 118, 671–682 (2008).
CAS Google Scholar
Koldamova, R., Staufenbiel, M. & Lefterov, I. Lack of ABCA1 considerably decreases brain ApoE level and increases amyloid deposition in APP23 Mice. J. Biol. Chem. 280, 43224–43235 (2005).
Article CAS Google Scholar
Nordestgaard, L. T., Tybjaerg-Hansen, A., Nordestgaard, B. G. & Frikke-Schmidt, R. Loss-of-function mutation in ABCA1 and risk of Alzheimer’s disease and cerebrovascular disease. Alzheimers Dement. 11, 1430–1438 (2015).
Article Google Scholar
Saftig, P. & Lichtenthaler, S. F. The alpha secretase ADAM10: a metalloprotease with multiple functions in the brain. Prog. Neurobiol. 135, 1–20 (2015).
Article CAS Google Scholar
Kim, M. et al. Potential late-onset Alzheimer’s disease-associated mutations in the ADAM10 gene attenuate α-secretase activity. Hum. Mol. Genet. 18, 3987–3996 (2009).
Article CAS Google Scholar
Agüero, P. et al. α-Secretase nonsense mutation (ADAM10 Tyr167*) in familial Alzheimer’s disease. Alzheimers Res. Ther. 12, 139 (2020).
Article Google Scholar
Shen, R. et al. Upregulation of RIN3 induces endosomal dysfunction in Alzheimer’s disease. Transl. Neurodegener. 9, 26 (2020).
Article CAS Google Scholar
Shen, R. & Wu, C. RIN3 binds to BIN1 and CD2AP to increase APP‐CTFS in early endosomes. Alzheimers Dement. 16, e047161 (2020).
Article Google Scholar
Foster, E. M., Dangla-Valls, A., Lovestone, S., Ribe, E. M. & Buckley, N. J. Clusterin in Alzheimer’s disease: mechanisms, genetics, and lessons from other pathologies. Front. Neurosci. 13, 164 (2019).
Article Google Scholar
Hu, J., Igarashi, A., Kamata, M. & Nakagawa, H. Angiotensin-converting enzyme degrades Alzheimer amyloid β-peptide (Aβ); retards Aβ aggregation, deposition, fibril formation; and inhibits cytotoxicity. J. Biol. Chem. 276, 47863–47868 (2001).
Article CAS Google Scholar
Backman, J. D. et al. Exome sequencing and analysis of 454,787 UK Biobank participants. Nature 599, 628–634 (2021).
Article CAS Google Scholar
Bis, J. C. et al. Whole exome sequencing study identifies novel rare and common Alzheimer’s-associated variants involved in immune response and transcriptional regulation. Mol. Psychiatry 25, 1859–1875 (2020).
Article CAS Google Scholar
McKhann, G. M. et al. The diagnosis of dementia due to Alzheimer’s disease: recommendations from the National Institute on Aging-Alzheimer’s Association workgroups on diagnostic guidelines for Alzheimer’s disease. Alzheimers Dement. 7, 263–269 (2011).
Article Google Scholar
McKhann, G. et al. Clinical diagnosis of Alzheimer’s disease: report of the NINCDS-ADRDA Work Group under the auspices of Department of Health and Human Services Task Force on Alzheimer’s Disease. Neurology 34, 939–944 (1984).
Article CAS Google Scholar
Frankish, A. et al. GENCODE reference annotation for the human and mouse genomes. Nucleic Acids Res. 47, D766–D773 (2019).
Article CAS Google Scholar
McLaren, W. et al. The Ensembl Variant Effect Predictor. Genome Biol. 17, 122 (2016).
Article Google Scholar
Ioannidis, N. M. et al. REVEL: an ensemble method for predicting the pathogenicity of rare missense variants. Am. J. Hum. Genet. 99, 877–885 (2016).
Article CAS Google Scholar
Liu, X., Li, C., Mou, C., Dong, Y. & Tu, Y. dbNSFP v4: a comprehensive database of transcript-specific functional predictions and annotations for human nonsynonymous and splice-site SNVs. Genome Med. 12, 103 (2020).
Article CAS Google Scholar
Karczewski, K. J. et al. The mutational constraint spectrum quantified from variation in 141,456 humans. Nature 581, 434–443 (2020).
Article CAS Google Scholar
Benjamini, Y. & Hochberg, Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J. R. Stat. Soc. Series B Stat. Methodol. 57, 289–300 (1995).
Google Scholar
Holm, S. A simple sequentially rejective multiple test procedure. Scand. J. Stat. 6, 65–70 (1979).
Google Scholar
Schwartzentruber, J. et al. Genome-wide meta-analysis, fine-mapping and integrative prioritization implicate new Alzheimer’s disease risk genes. Nat. Genet. 53, 392–402 (2021).
Article CAS Google Scholar
Cole, P. & MacMahon, B. Attributable risk percent in case-control studies. Br. J. Prev. Soc. Med. 25, 242–244 (1971).
CAS Google Scholar
LaMorte, W.W. in Measures of Association (Boston University School of Public Health, 2018). https://sphweb.bumc.bu.edu/otlt/mph-modules/ep/ep713_association/EP713_Association8.html
Holstege, H. et al. Summary statistics for “Exome sequencing identifies rare damaging variants in ATP8B4 and ABCA1 as risk factors for Alzheimer’s Disease”. Zenodo (2022) https://doi.org/10.5281/zenodo.6818051
Hulsman, M. & Holstege, H. Software (v.0.1.0) used in “Exome sequencing identifies rare damaging variants in ATP8B4 and ABCA1 as risk factors for Alzheimer’s Disease”. Zenodo (2022) https://doi.org/10.5281/zenodo.6827458

Download references

Acknowledgements

We thank all the study participants, their families, the participating medical staff, general practitioners, pharmacists and all laboratory personnel involved in patient diagnosis, blood collection, blood biobanking, DNA preparation and sequencing. The work in this manuscript was carried out on the Cartesius supercomputer, which is embedded in the Dutch national e-infrastructure with the support of the SURF Cooperative. Computing hours were granted in 2016, 2017, 2018 and 2019 to H. Holstege by the Dutch Research Council (project name: 100-plus; project nos. 15318 and 17232). This research was conducted using the funding obtained by the following study cohorts: ADES-FR, AgeCoDe-UKBonn; Barcelona SPIN; AC-EMC; ERF and Rotterdam; ADC-Amsterdam; 100-plus study; EMIF-90+; Control Brain Consortium; PERADES; StEP-AD; Knight-ADRC; UCSF/NYGC/UAB; UCL-DRC EOAD; ADSP. Data used in preparation of this article were obtained from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) database (https://adni.loni.usc.edu/). The investigators within ADNI are listed as supplementary authors and can be found in Section 5 of the Supplementary Note. Full consortium acknowledgements and funding sources are listed in Section 4 of the Supplementary Note.

Author information

These authors contributed equally: Henne Holstege, Marc Hulsman, Camille Charbonnier, Gaël Nicolas, Céline Bellenguez, Jean-Charles Lambert.

Authors and Affiliations

Genomics of Neurodegenerative Diseases and Aging, Human Genetics, Vrije Universiteit Amsterdam, Amsterdam UMC location VUmc, Amsterdam, the Netherlands
Henne Holstege, Marc Hulsman, Niccolo’ Tesi & Sven J. van der Lee
Alzheimer Center Amsterdam, Neurology, Vrije Universiteit Amsterdam, Amsterdam UMC location VUmc, Amsterdam, the Netherlands
Henne Holstege, Marc Hulsman, Femke Bouwman, Iris E. Jansen, Afina W. Lemstra, Philip Scheltens, Niccolo’ Tesi, Betty Tijms, Sven J. van der Lee, Pieter Jelle Visser & Wiesje M. van der Flier
Amsterdam Neuroscience, Neurodegeneration, Amsterdam, the Netherlands
Henne Holstege, Marc Hulsman, Femke Bouwman, Iris E. Jansen, Afina W. Lemstra, Philip Scheltens, Niccolo’ Tesi, Sven J. van der Lee & Wiesje M. van der Flier
Delft Bioinformatics Lab, Delft University of Technology, Delft, the Netherlands
Henne Holstege, Marc Hulsman, Marcel J. T. Reinders, Niccolo’ Tesi & Sven J. van der Lee
Université Rouen Normandie, INSERM U1245 and CHU Rouen, Department of Genetics and CNRMAJ, Rouen, France
Camille Charbonnier, Olivier Quenez, Dominique Campion, Anne-Claire Richard, Stéphane Rousseau & Gaël Nicolas
Université Lille, INSERM, Centre Hospitalier Universitaire Lille, Institut Pasteur de Lille, U1167-RID-AGE facteurs de risque et déterminants moléculaires des maladies liées au vieillissement, Lille, France
Benjamin Grenier-Boley, Philippe Amouyel, Céline Bellenguez & Jean-Charles Lambert
Medical Research Council Centre for Neuropsychiatric Genetics and Genomics,, Division of Psychological Medicine and Clinical Neuroscience, School of Medicine, Cardiff University, Cardiff, UK
Detelina Grozeva, Rebecca Sims, Lauren Luckcuck, Rachel Marshall, Salha Saad & Julie Williams
Department of Neurology, Erasmus Medical Centre, Rotterdam, the Netherlands
Jeroen G. J. van Rooij, Merel O. Mol & John C. van Swieten
Department of Internal Medicine, Erasmus Medical Centre, Rotterdam, the Netherlands
Jeroen G. J. van Rooij, Robert Kraaij, Fernando Rivadeneira & André G. Uitterlinden
Department of Epidemiology, Erasmus Medical Centre, Rotterdam, the Netherlands
Shahzad Ahmad, Najaf Amin, M. Arfan Ikram, M. Kamran Ikram & Cornelia M. van Duijn
Leiden Academic Centre for Drug Research, Leiden, the Netherlands
Shahzad Ahmad
Nuffield Department of Population Health Oxford University, Oxford, UK
Najaf Amin & Cornelia M. van Duijn
Medical Research Council Prion Unit at University College London, University College London Institute of Prion Diseases, London, UK
Penny J. Norsworthy, Holger Hummerich & Simon Mead
Department of Neurology, II B Sant Pau, Hospital de la Santa Creu i Sant Pau, Universitat Autònoma de Barcelona, Barcelona, Spain
Oriol Dols-Icardo, Alberto Lleó & Jordi Clarimon
Biomedical Research Networking Center on Neurodegenerative Diseases, National Institute of Health Carlos III, Madrid, Spain
Oriol Dols-Icardo, Alberto Lleó, Pascual Sanchez-Juan & Jordi Clarimon
Division of Neurogenetics and Molecular Psychiatry, Department of Psychiatry and Psychotherapy, Faculty of Medicine and University Hospital Cologne, University of Cologne, Cologne, Germany
Amit Kawalia & Alfredo Ramirez
The John P. Hussman Institute for Human Genomics, University of Miami, Miami, FL, USA
Gary W. Beecham, Eden R. Martin & Margaret A. Pericak-Vance
Université Montpellier, INSERM, Institute for Neurosciences of Montpellier, Montpellier, France
Claudine Berr
Cardiovascular Health Research Unit, Department of Medicine, University of Washington, Seattle, WA, USA
Joshua C. Bis
Université Paris-Saclay, Commissariat à l’Énergie Atomique et aux Énergies Alternatives, Centre National de Recherche en Génomique Humaine Evry, Gif-sur-Yvette, France
Anne Boland & Jean-François Deleuze
Experimental Neuro-psychobiology Laboratory, Department of Clinical and Behavioral Neurology, Istituto di Ricovero e Cura a Carattere Scientifico Santa Lucia Foundation, Rome, Italy
Paola Bossù
Department of Neurodegenerative Science, Van Andel Institute, Grand Rapids, MI, USA
Jose Bras & Rita Guerreiro
Division of Psychiatry and Behavioral Medicine, Michigan State University College of Human Medicine, Grand Rapids, MI, USA
Jose Bras & Rita Guerreiro
HudsonAlpha Institute for Biotechnology, Huntsville, AL, USA
J. Nicholas Cochran & Richard M. Myers
Department of Neuroscience, Catholic University of Sacred Heart, Fondazione Policlinico Universitario A. Gemelli Istituto di Ricovero e Cura a Carattere Scientifico, Rome, Italy
Antonio Daniele
Université Bordeaux, INSERM, Bordeaux Population Health Research Center, Bordeaux, France
Jean-François Dartigues & Stéphanie Debette
Department of Neurology, Bordeaux University Hospital, Bordeaux, France
Stéphanie Debette
UKDRI Cardiff, School of Medicine, Cardiff University, Cardiff, UK
Nicola Denning, Alun Meggy & Rachel Raybould
Department of Biostatistics, Boston University School of Public Health, Boston, MA, USA
Anita L. DeStefano & Lindsay A. Farrer
Framingham Heart Study, Framingham, MA, USA
Anita L. DeStefano & Sudha Seshadri
Department of Neurology, Boston University School of Medicine, Boston, MA, USA
Anita L. DeStefano, Lindsay A. Farrer & Sudha Seshadri
Department of Epidemiology, Boston University, Boston, MA, USA
Lindsay A. Farrer
Department of Medicine (Biomedical Genetics), Boston University, Boston, MA, USA
Lindsay A. Farrer
Neurogenomics and Informatics Center, Washington University School of Medicine, St Louis, MO, USA
Maria Victoria Fernández & Carlos Cruchaga
Psychiatry Department, Washington University School of Medicine, St Louis, MO, USA
Maria Victoria Fernández & Carlos Cruchaga
Hope Center for Neurological Disorders, Washington University School of Medicine, St Louis, MO, USA
Maria Victoria Fernández & Carlos Cruchaga
Dementia Research Centre, University College London Queen Square Institute of Neurology, London, UK
Nick C. Fox, Natalie S. Ryan & Jonathan M. Schott
Fondazione Istituto di Ricovero e Cura a Carattere Scientifico Ca’ Granda, Ospedale Policlinico, Milan, Italy
Daniela Galimberti
University of Milan, Milan, Italy
Daniela Galimberti
Université Brest, INSERM, Etablissement Français du Sang, Centre Hospitalier Universitaire Brest, Unité Mixte de Recherche 1078, GGB, Brest, France
Emmanuelle Genin
Genome Diagnostics, Department of Human Genetics, VU University, AmsterdamUMC (location VUmc), Amsterdam, the Netherlands
Johan J. P. Gille, Daoud Sie, Erik A. Sistermans & Resie van Spaendonk
Department of Neurology and Neurological Sciences, Stanford University, Stanford, CA, USA
Yann Le Guen, Valerio Napolioni & Michael D. Greicius
Department of Epidemiology and Biostatistics, Case Western Reserve University, Cleveland, OH, USA
Jonathan L. Haines
Clinical and Experimental Science, Faculty of Medicine, University of Southampton, Southampton, UK
Clive Holmes
Department of Complex Trait Genetics, Center for Neurogenomics and Cognitive Research, Amsterdam Neuroscience, Vrije University, Amsterdam, the Netherlands
Iris E. Jansen
McGill University and Genome Quebec Innovation Centre, Montreal, Quebec, Canada
Marc Lathrop
Department of Human Genetics, Amsterdam UMC, University of Amsterdam, Amsterdam Reproduction and Development Research Institute, Amsterdam, the Netherlands
Marcel M. A. M. Mannens
Dr. John T. Macdonald Foundation Department of Human Genetics, University of Miami, Miami, FL, USA
Eden R. Martin & Margaret A. Pericak-Vance
Institute of Neurology, Catholic University of the Sacred Heart, Rome, Italy
Carlo Masullo
Taub Institute on Alzheimer’s Disease and the Aging Brain, Department of Neurology, Columbia University, New York, NY, USA
Richard Mayeux
Gertrude H. Sergievsky Center, Columbia University, New York, NY, USA
Richard Mayeux
Institute of Gerontology and Geriatrics, Department of Medicine and Surgery, University of Perugia, Perugia, Italy
Patrizia Mecocci
Human Genetics, School of Life Sciences, University of Nottingham, Nottingham, UK
Kevin Morgan
Department of Neuroscience, Psychology, Drug Research and Child Health University of Florence, Florence, Italy
Benedetta Nacmias & Sandro Sorbi
IRCCS Fondazione Don Carlo Gnocchi, Florence, Italy
Benedetta Nacmias & Sandro Sorbi
Penn Neurodegeneration Genomics Center, Department of Biostatistics, Epidemiology, and Informatics, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
Adam C. Naj
Penn Neurodegeneration Genomics Center, Department of Pathology and Laboratory Medicine, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
Adam C. Naj, Gerard D. Schellenberg & Li-San Wang
Genomic and Molecular Epidemiology Laboratory, School of Biosciences and Veterinary Medicine, University of Camerino, Camerino, Italy
Valerio Napolioni
Université Lille, INSERM, Centre Hospitalier Universitaire Lille, UMR1172, Resources and Research Memory Center (MRRC) of Distalz, Licend, Lille, France
Florence Pasquier
Fundació Docència i Recerca MútuaTerrassa and Movement Disorders Unit, Department of Neurology, University Hospital MútuaTerrassa, Barcelona, Spain
Pau Pastor
Memory Disorders Unit, Department of Neurology, Hospital Universitari Mutua de Terrassa, Barcelona, Spain
Pau Pastor
Université de Nantes, Centre Hospitalier Universitaire Nantes, Centre National de la Recherche Scientifique, INSERM, l’institut du Thorax, Nantes, France
Richard Redon
Institute of Social Medicine, Occupational Health and Public Health, University of Leipzig, Leipzig, Germany
Steffi G. Riedel-Heller
Neurology Service, Marqués de Valdecilla University Hospital (University of Cantabria and IDIVAL), Santander, Spain
Pascual Sanchez-Juan
Laboratory for Advanced Hematological Diagnostics, Department of Hematology and Stem Cell Transplant, Lecce, Italy
Davide Seripa
Department of Psychiatry and Glenn Biggs Institute for Alzheimer’s and Neurodegenerative Diseases, San Antonio, TX, USA
Sudha Seshadri & Alfredo Ramirez
Laboratory of Neuropsychiatry, Department of Clinical and Behavioral Neurology, Istituto di Ricovero e Cura a Carattere Scientifico Santa Lucia Foundation, Rome, Italy
Gianfranco Spalletta
Department of Neurodegenerative Diseases and Geriatric Psychiatry, University Hospital Bonn, Medical Faculty, Bonn, Germany
Michael Wagner & Alfredo Ramirez
German Center for Neurodegenerative Diseases, Bonn, Germany
Michael Wagner & Alfredo Ramirez
Université Rouen Normandie, INSERM U1245 and CHU Rouen, Department of Neurology and CNRMAJ, Rouen, France
David Wallon & Aline Zarea
Memory and Aging Center, Department of Neurology, University of California, San Francisco, CA, USA
Jennifer S. Yokoyama
Reta Lila Weston Research Laboratories, Department of Molecular Neuroscience, University College London Institute of Neurology, London, UK
John Hardy
Cluster of Excellence Cellular Stress Responses in Aging-Associated Diseases, University of Cologne, Cologne, Germany
Alfredo Ramirez

Authors

Henne Holstege
View author publications
You can also search for this author in PubMed Google Scholar
Marc Hulsman
View author publications
You can also search for this author in PubMed Google Scholar
Camille Charbonnier
View author publications
You can also search for this author in PubMed Google Scholar
Benjamin Grenier-Boley
View author publications
You can also search for this author in PubMed Google Scholar
Olivier Quenez
View author publications
You can also search for this author in PubMed Google Scholar
Detelina Grozeva
View author publications
You can also search for this author in PubMed Google Scholar
Jeroen G. J. van Rooij
View author publications
You can also search for this author in PubMed Google Scholar
Rebecca Sims
View author publications
You can also search for this author in PubMed Google Scholar
Shahzad Ahmad
View author publications
You can also search for this author in PubMed Google Scholar
Najaf Amin
View author publications
You can also search for this author in PubMed Google Scholar
Penny J. Norsworthy
View author publications
You can also search for this author in PubMed Google Scholar
Oriol Dols-Icardo
View author publications
You can also search for this author in PubMed Google Scholar
Holger Hummerich
View author publications
You can also search for this author in PubMed Google Scholar
Amit Kawalia
View author publications
You can also search for this author in PubMed Google Scholar
Philippe Amouyel
View author publications
You can also search for this author in PubMed Google Scholar
Gary W. Beecham
View author publications
You can also search for this author in PubMed Google Scholar
Claudine Berr
View author publications
You can also search for this author in PubMed Google Scholar
Joshua C. Bis
View author publications
You can also search for this author in PubMed Google Scholar
Anne Boland
View author publications
You can also search for this author in PubMed Google Scholar
Paola Bossù
View author publications
You can also search for this author in PubMed Google Scholar
Femke Bouwman
View author publications
You can also search for this author in PubMed Google Scholar
Jose Bras
View author publications
You can also search for this author in PubMed Google Scholar
Dominique Campion
View author publications
You can also search for this author in PubMed Google Scholar
J. Nicholas Cochran
View author publications
You can also search for this author in PubMed Google Scholar
Antonio Daniele
View author publications
You can also search for this author in PubMed Google Scholar
Jean-François Dartigues
View author publications
You can also search for this author in PubMed Google Scholar
Stéphanie Debette
View author publications
You can also search for this author in PubMed Google Scholar
Jean-François Deleuze
View author publications
You can also search for this author in PubMed Google Scholar
Nicola Denning
View author publications
You can also search for this author in PubMed Google Scholar
Anita L. DeStefano
View author publications
You can also search for this author in PubMed Google Scholar
Lindsay A. Farrer
View author publications
You can also search for this author in PubMed Google Scholar
Maria Victoria Fernández
View author publications
You can also search for this author in PubMed Google Scholar
Nick C. Fox
View author publications
You can also search for this author in PubMed Google Scholar
Daniela Galimberti
View author publications
You can also search for this author in PubMed Google Scholar
Emmanuelle Genin
View author publications
You can also search for this author in PubMed Google Scholar
Johan J. P. Gille
View author publications
You can also search for this author in PubMed Google Scholar
Yann Le Guen
View author publications
You can also search for this author in PubMed Google Scholar
Rita Guerreiro
View author publications
You can also search for this author in PubMed Google Scholar
Jonathan L. Haines
View author publications
You can also search for this author in PubMed Google Scholar
Clive Holmes
View author publications
You can also search for this author in PubMed Google Scholar
M. Arfan Ikram
View author publications
You can also search for this author in PubMed Google Scholar
M. Kamran Ikram
View author publications
You can also search for this author in PubMed Google Scholar
Iris E. Jansen
View author publications
You can also search for this author in PubMed Google Scholar
Robert Kraaij
View author publications
You can also search for this author in PubMed Google Scholar
Marc Lathrop
View author publications
You can also search for this author in PubMed Google Scholar
Afina W. Lemstra
View author publications
You can also search for this author in PubMed Google Scholar
Alberto Lleó
View author publications
You can also search for this author in PubMed Google Scholar
Lauren Luckcuck
View author publications
You can also search for this author in PubMed Google Scholar
Marcel M. A. M. Mannens
View author publications
You can also search for this author in PubMed Google Scholar
Rachel Marshall
View author publications
You can also search for this author in PubMed Google Scholar
Eden R. Martin
View author publications
You can also search for this author in PubMed Google Scholar
Carlo Masullo
View author publications
You can also search for this author in PubMed Google Scholar
Richard Mayeux
View author publications
You can also search for this author in PubMed Google Scholar
Patrizia Mecocci
View author publications
You can also search for this author in PubMed Google Scholar
Alun Meggy
View author publications
You can also search for this author in PubMed Google Scholar
Merel O. Mol
View author publications
You can also search for this author in PubMed Google Scholar
Kevin Morgan
View author publications
You can also search for this author in PubMed Google Scholar
Richard M. Myers
View author publications
You can also search for this author in PubMed Google Scholar
Benedetta Nacmias
View author publications
You can also search for this author in PubMed Google Scholar
Adam C. Naj
View author publications
You can also search for this author in PubMed Google Scholar
Valerio Napolioni
View author publications
You can also search for this author in PubMed Google Scholar
Florence Pasquier
View author publications
You can also search for this author in PubMed Google Scholar
Pau Pastor
View author publications
You can also search for this author in PubMed Google Scholar
Margaret A. Pericak-Vance
View author publications
You can also search for this author in PubMed Google Scholar
Rachel Raybould
View author publications
You can also search for this author in PubMed Google Scholar
Richard Redon
View author publications
You can also search for this author in PubMed Google Scholar
Marcel J. T. Reinders
View author publications
You can also search for this author in PubMed Google Scholar
Anne-Claire Richard
View author publications
You can also search for this author in PubMed Google Scholar
Steffi G. Riedel-Heller
View author publications
You can also search for this author in PubMed Google Scholar
Fernando Rivadeneira
View author publications
You can also search for this author in PubMed Google Scholar
Stéphane Rousseau
View author publications
You can also search for this author in PubMed Google Scholar
Natalie S. Ryan
View author publications
You can also search for this author in PubMed Google Scholar
Salha Saad
View author publications
You can also search for this author in PubMed Google Scholar
Pascual Sanchez-Juan
View author publications
You can also search for this author in PubMed Google Scholar
Gerard D. Schellenberg
View author publications
You can also search for this author in PubMed Google Scholar
Philip Scheltens
View author publications
You can also search for this author in PubMed Google Scholar
Jonathan M. Schott
View author publications
You can also search for this author in PubMed Google Scholar
Davide Seripa
View author publications
You can also search for this author in PubMed Google Scholar
Sudha Seshadri
View author publications
You can also search for this author in PubMed Google Scholar
Daoud Sie
View author publications
You can also search for this author in PubMed Google Scholar
Erik A. Sistermans
View author publications
You can also search for this author in PubMed Google Scholar
Sandro Sorbi
View author publications
You can also search for this author in PubMed Google Scholar
Resie van Spaendonk
View author publications
You can also search for this author in PubMed Google Scholar
Gianfranco Spalletta
View author publications
You can also search for this author in PubMed Google Scholar
Niccolo’ Tesi
View author publications
You can also search for this author in PubMed Google Scholar
Betty Tijms
View author publications
You can also search for this author in PubMed Google Scholar
André G. Uitterlinden
View author publications
You can also search for this author in PubMed Google Scholar
Sven J. van der Lee
View author publications
You can also search for this author in PubMed Google Scholar
Pieter Jelle Visser
View author publications
You can also search for this author in PubMed Google Scholar
Michael Wagner
View author publications
You can also search for this author in PubMed Google Scholar
David Wallon
View author publications
You can also search for this author in PubMed Google Scholar
Li-San Wang
View author publications
You can also search for this author in PubMed Google Scholar
Aline Zarea
View author publications
You can also search for this author in PubMed Google Scholar
Jordi Clarimon
View author publications
You can also search for this author in PubMed Google Scholar
John C. van Swieten
View author publications
You can also search for this author in PubMed Google Scholar
Michael D. Greicius
View author publications
You can also search for this author in PubMed Google Scholar
Jennifer S. Yokoyama
View author publications
You can also search for this author in PubMed Google Scholar
Carlos Cruchaga
View author publications
You can also search for this author in PubMed Google Scholar
John Hardy
View author publications
You can also search for this author in PubMed Google Scholar
Alfredo Ramirez
View author publications
You can also search for this author in PubMed Google Scholar
Simon Mead
View author publications
You can also search for this author in PubMed Google Scholar
Wiesje M. van der Flier
View author publications
You can also search for this author in PubMed Google Scholar
Cornelia M. van Duijn
View author publications
You can also search for this author in PubMed Google Scholar
Julie Williams
View author publications
You can also search for this author in PubMed Google Scholar
Gaël Nicolas
View author publications
You can also search for this author in PubMed Google Scholar
Céline Bellenguez
View author publications
You can also search for this author in PubMed Google Scholar
Jean-Charles Lambert
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

H.Holstege, G.N. and J.-C.L. jointly supervised the research. H.Holstege, M.H., C.Charbonnier, B.G.-B., O.Q., G.N., C.Bellenguez and J.-C.L. were the core writing and analysis group. H.Holstege, M.H., C.Charbonnier, B.G.-B., O.Q., D.Grozeva, J.G.J.v.R., R.S., S.A., N.A., P.J.N., O.D.-I., H.Hummerich2, A.K., J.C., J.C.v.S., J.H., A.R., S.M. W.M.v.d.F., C.M.v.D, J.W., G.N., C.Bellenguez and J.-C.L. were the ADES cohort working group. M.H., S.J.v.d.L., M.J.T.R., N.T. and H.Holstege represented the 100-plus study and Netherlands Brain Bank cohorts and contributed to sample collection. P.J.V. represented the EMIF-AD-90 study cohort and contributed to sample collection. J.G.J.v.R., M.O.M., J.C.v.S. represented the AC-EMC cohort and contributed to sample collection. M.H., S.J.v.d.L., F.B., B.T., A.W.L., I.E.J., W.M.v.d.F., P.S. and H.Holstege represented the ADC-Amsterdam cohort and contributed to sample collection. A.K., S.G.R.-H., M.W. and A.R. represented the AgeCoDe-UKBonn cohort and contributed to sample collection. C.Charbonnier, O.Q., D.W., A.Z., D.C., A.-C.R., S.R., G.N., A.B., J.-F.Deleuze, M.L., F.P., E.G., J.-F.Dartigues, R.R., S.D., B.G.-B., C.Berr, C.Bellenguez, J.-C.L. and P.A. represented the ADES-France cohort and contributed to sample collection. A.C.N., L.A.F., J.L.H., R.Mayeux, M.A.P.-V., J.C.B., L.-S.W., G.W.B., A.L.D.S., E.R.M., S.Seshadri and G.D.S. represented the ADSP cohort and contributed to sample collection. M.H., S.J.v.d.L., E.A.S., D.Sie, J.J.P.G., M.M.A.M.M., R.v.S. and H.Holstege represented the Amsterdam-UMC cohort and contributed to sample collection. O.D.-I., A.L. and J.C. represented the Barcelona SPIN cohort and contributed to sample collection. N.C.F., J.B., R.G. and J.H. represented the Control Brain Consortium cohort and contributed to sample collection. M.V.F. and C.Cruchaga represented the Knight-ADRC cohort and contributed to sample collection. D.Grozeva, R.R., S.Saad, N.D., A.M., R.Marshall, L.L., A.D., B.N., P.B., C.M., C.H., D.Galimberti, D.Seripa, P.M., S.Sorbi, G.S., K.M., P.S.-J., P.P., R.S. and J.W. represented the PERADES cohort and contributed to sample collection. S.A., N.A., R.K., A.G.U., M.A.I., F.R., M.K.I. and C.M.v.D. represented the Rotterdam and ERF cohorts and contributed to sample collection. Y.L.G., V.N., K.M. and M.D.G. represented the StEP-AD cohort and contributed to sample collection. H.Hummerich, P.J.N., N.S.R., J.M.S. and S.M. represented the UCL-DRC EOAD cohort. J.N.C., R.M.M. and J.S.Y. represented the UCSF/NYGC/UAB cohort and contributed to sample collection.

Corresponding authors

Correspondence to Henne Holstege, Marc Hulsman, Gaël Nicolas or Jean-Charles Lambert.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Genetics thanks James Lupski and Mike Nalls for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data

Extended Data Fig. 1 Age, gender, APOE genotype distribution.

Age, gender and APOE genotype distribution of all samples, stratified by case/control status.

Extended Data Fig. 2 PCA: Sample population compared to 1,000 G population samples.

Sample population compared to 1,000 G population samples. First two PCA components of the study samples used for the Stage 1 and Stage 2 analysis, shown in context of the 1000 Genomes samples for reference (see Supplementary Note section 1.3.4). Samples in red are considered population outliers. Samples with only exome-extracts were not included in this analysis.

Extended Data Fig. 3 P value inflation in Stage-2 analysis.

P value inflation in Stage-2 analysis: Quantile-quantile plot for Stage-2 (without exome-extract samples), based on a ordinal logistic burden test (see Methods). Results are shown for all burden tests (n = 20,681) for which at least 10 damaging alleles were present in this dataset (based on 4 different variant deleteriousness thresholds per gene). While not used in this analysis, the threshold for multiple testing correction based on FDR < 0.1 is shown for reference. The genomic p-value inflation was 1.016. Note that causative mutations were not separately removed in Stage-2, as we focused on a specific set of genes.

Extended Data Fig. 4 P value inflation in the mega-analysis dataset.

P value inflation in the mega-analysis dataset: Quantile-quantile plot for the mega-analysis dataset (without exome-extract samples) based on a ordinal logistic burden test (see Methods). Results are shown for all burden tests (n = 37,710) for which at least 10 damaging alleles were present in this dataset (based on 4 different variant deleteriousness thresholds per gene). For reference, the threshold for multiple testing correction based on a false discovery rate threshold of 0.1 is shown. P values for the mega-analysis are shown in Supplementary Table 15. The genomic p-value inflation was 1.025.

Extended Data Fig. 5 Variant carrier frequency in controls by age last seen.

Variant carrier frequency in controls by age last seen: Carrier frequency in controls by age last seen for the variant selection threshold with the strongest association, as observed in the mega-analysis (n = 31,905 unique individuals); RIN3, CLU, ZCWPW1, ACE (n = 29,727 unique individuals; that is without exome-extracts) (Table 3, refined). Black: genes significant in the meta-analysis. Grey: genes not significant in meta-analysis. Blue: genes detected in the GWAS targeted analysis.

Extended Data Fig. 6 Age-at-onset by variant deleteriousness category.

Age-at-onset by variant deleteriousness category: Age-at-onset (median and IQR) in the mega-analysis (n = 31,905 unique individuals); RIN3, CLU, ZCWPW1, ACE (n = 29,727 unique individuals; that is without exome-extracts). Samples in variant deleteriousness categories with <10 samples are shown individually. The median age at onset and IQR for the complete mega-analysis dataset is shown on the right. Black: genes significant in the meta-analysis. Grey: genes not significant in meta-analysis. Blue: genes detected in the GWAS targeted analysis.

Extended Data Fig. 7 Attributable fraction per gene and age-at-onset category.

Attributable fraction per gene and age-at-onset category: Attributable fractions as derived based on the mega-analysis in the mega-analysis (n = 31,905 unique individuals); RIN3, CLU, ZCWPW1, ACE (n = 29,727 unique individuals; that is without exome-extracts). The attributable fraction of a gene is an estimate of the fraction of AD cases in a specific age group that have become part of this dataset due to carrying a rare damaging variant in the respective gene (Methods). This estimate accounts only for variants in the burden selection. Black: genes significant in the meta-analysis. Grey: genes not significant in meta-analysis. Blue: genes detected in the GWAS targeted analysis.

Extended Data Fig. 8 Sensitivity Analysis: AD vs Age association.

AD vs Age association: Sensitivity analysis of the gene burden tests (for the most significant deleteriousness thresholds, Table 2) for the mega-analysis dataset (RIN3, CLU, ZCWPW1, ACE: without exome-extracts) (respectively n = 31,905 and n = 29,727 unique individuals). Comparison of the case/control odds ratio of an age-matched and a non-age-matched analysis. Age-matching was performed as described in the methods. Based on the confidence intervals, we cannot exclude that the signals in ACE, ADAM10 and ZCWPW1 are affected by other age-related conditions. Note however, that the signals in ADAM10 and ZCWPW1 are based on very few variants, such that confidence intervals are expected to be wide.

Supplementary information

Supplementary Information

(1) Supplementary Methods. (2) Detailed gene discussion. (3) Supplementary Figs. 1–15 and Tables 1–16. (4) Acknowledgements. (5) Supplementary authors.

Reporting Summary

Peer Review File

Supplementary Data 1

List of variants considered in the burden analysis.

Source data

Source Data Fig. 2

Statistical source data underlying Fig. 2.

Source Data Fig. 3

Statistical source data underlying Fig. 3.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Holstege, H., Hulsman, M., Charbonnier, C. et al. Exome sequencing identifies rare damaging variants in ATP8B4 and ABCA1 as risk factors for Alzheimer’s disease. Nat Genet 54, 1786–1794 (2022). https://doi.org/10.1038/s41588-022-01208-7

Download citation

Received: 31 July 2021
Accepted: 19 September 2022
Published: 21 November 2022
Issue Date: December 2022
DOI: https://doi.org/10.1038/s41588-022-01208-7

This article is cited by

Characterizing dysregulations via cell-cell communications in Alzheimer’s brains using single-cell transcriptomes
- Che Yu Lee
- Dylan Riffle
- Jing Zhang
BMC Neuroscience (2024)
Gut microbiota-host lipid crosstalk in Alzheimer’s disease: implications for disease progression and therapeutics
- Ya-Xi Luo
- Ling-Ling Yang
- Xiu-Qing Yao
Molecular Neurodegeneration (2024)
APP dyshomeostasis in the pathogenesis of Alzheimer’s disease: implications for current drug targets
- Sònia Sirisi
- Érika Sánchez-Aced
- Alberto Lleó
Alzheimer's Research & Therapy (2024)
Exome-wide analysis implicates rare protein-altering variants in human handedness
- Dick Schijven
- Sourena Soheili-Nezhad
- Clyde Francks
Nature Communications (2024)
The path to next-generation disease-modifying immunomodulatory combination therapies in Alzheimer’s disease
- Marie Sarazin
- Julien Lagarde
- Guillaume Dorothée
Nature Aging (2024)

Subjects

Abstract

Similar content being viewed by others

Main

Methods

Sample processing, genotype calling and QC

Variant prioritization and thresholds

Gene burden testing

GWAS driver gene identification

Validation of variant selection

Descriptive measures

Sensitivity analyses

Power analysis

Reporting summary

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Extended data

Supplementary information

Source data

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Quick links