Abstract
Rupture of an intracranial aneurysm (IA) leads to aneurysmal subarachnoid haemorrhage (ASAH), a severe type of stroke. Some rare variants that cause IA in families have been identified, but still, the majority of genetic causes, as well as the biological mechanisms of IA development and rupture, remain unknown. We aimed to identify rare, damaging variants for IA in three large Dutch families with multiple affected members with IA (Nā=ā9, 11, and 6). By combining linkage analysis and genome sequencing (GS), we identified six rare and damaging variants for which all cases within one of the families were heterozygous. These variants were p.Tyr87Cys in SYCP1, p.Phe1077Leu in FMNL2, p.Thr754Lys in TBC1D2, p.Arg321His in ZNF782, p.Arg979Trp in CCDC180, and p.Val125Met in NCBP1. None of the variants showed association with IA status in a large cohort of 937 patients from the general IA patient population and 1046 controls. Gene expression in IA and cerebral artery tissue further prioritized FMNL2 and TBC1D2 as potential important players in IA pathophysiology. Further studies are needed to characterize the functional consequences of the identified variants and their role in the biological mechanisms of IA.
Similar content being viewed by others
Introduction
An intracranial aneurysm (IA) is a dilation of a cerebral blood vessel which are found in about 3% of the general population [1]. Rupture of an IA leads to aneurysmal subarachnoid haemorrhage (ASAH), a severe type of stroke with an incidence of 6.1 per 100,000 person-years [2]. ASAH is a life-threatening event with substantial morbidity and mortality with one in three patients dying [3]. Of the patients who survive ASAH, approximately half remain dependent on help from others in daily life [3]. Endovascular or surgical treatment of IA can prevent ASAH, thus avoiding its devastating consequences. However, currently, no clinically relevant risk models are available to identify persons at risk of developing and rupturing of IA.
The most important risk factors for IA and ASAH are age, female sex, hypertension, smoking, and a positive family history for ASAH [1, 2]. In persons with two or more first-degree family members with ASAH the life-time risk of ASAH is 50 times higher than in persons without such a family history [1, 4]. Moreover, individuals with an IA and a positive family history for ASAH have an almost 3-fold increased risk of rupture compared to individuals with IAs without such a family history (i.e., sporadic IAs) [5]. The heritability of ASAH has been estimated at 41% [6] and genome-wide association studies (GWAS) of large cohorts of IA cases have identified many common genetic risk factors with small effects, which combined explain roughly half of the total heritability [7]. Besides GWAS, sequencing studies in families with multiple affected family members with IA have identified rare IA-causing genetic variants with a presumable large effect located in the following genes: ADAMTS15 [8], THSD1 [9], ANGTL6 [10], PCNT [11], and ARHGEF17 [12]. For the genes THSD1 and ADAMTS15, linkage analyses had previously already highlighted regions containing these genes [13, 14], showing that linkage analysis with subsequent sequencing is a viable method to identify risk genes for IA.
Despite recent findings of IA risk genes and their associated biological functions, the primary causes of IA remain unknown. Here, we aim to identify rare, damaging genetic variants segregating with IA in three large Dutch families with multiple affected first-degree family members with an unruptured IA or ASAH using a combination of linkage analysis and genome sequencing.
Methods
A flowchart of the methods is shown in Fig.Ā 1.
Participants
Three large Dutch families with multiple affected first-degree relatives with an unruptured IA or an ASAH were selected: pedigree 1 included nine affected relatives in one generation, pedigree 2 included eleven relatives in three successive generations, while pedigree 3 included six in three successive generations (Fig.Ā 2). Genealogical research showed no shared ancestor between these families for at least six generations back. For the variants identified in these families we further determined their role in the general IA patient population, using a cohort of 937 Dutch IA patients, of whom 160 had a positive family history for ASAH, from the University Medical Center Utrecht (TableĀ 1). For the control cohort, GS genotypes from 1046 persons recruited within Project MinE were used [15, 16]. Presence of IA was determined with computed tomography, magnetic resonance, or digital subtraction angiogram. Families or persons with Loeys-Dietz syndrome, Ehlers-Danlos disease, Marfan Syndrome, or autosomal dominant polycystic kidney disease were excluded.
Linkage analysis
The following number of family members were included for genotyping: nine affecteds and five unaffecteds from pedigree 1; six affecteds and 12 unaffecteds from pedigree 2; and four affecteds and eight unaffected from pedigree 3 (Fig.Ā 2). DNA was extracted from blood with the Chemagic DNA Blood10k kit (PerkinElmer, Inc., Waltham, MA, USA). Single nucleotide polymorphism (SNP) genotyping was performed with the Global Screening Array (Illumina, San Diego, CA, USA). Detailed methods on the SNP quality control are described in the Supplementary Note. Two linkage methods were performed. First, we performed non-parametric linkage (NPL) analysis as it is an effective method to detect segments of DNA that are identical-by-descent between affected family members. Due to the age-dependent penetrance of IA, unaffected family members are may also be heterozygous for a pathogenic variant, making the use of unaffected family members in NPL analysis ineffective. Therefore, the second method, parametric linkage analysis, was used to specify disease penetrance values depending on age categories and/or other variables, thereby allowing the inclusion of unaffected family members.
Non-parametric linkage analysis was performed using MORGAN lm_ibdtest [17]. MORGAN uses a Markov Chain Monte Carlo approach to estimate inheritance vectors: a representation of allele sharing by descent. In this iterative process, the first 5,000 iterations were discarded, before saving every 30th iteration from a total of 90,000 iterations for analysis. NPL scores Spairs and Srobdom were calculated [18, 19]. Spairs can detect both recessive and dominant inheritance loci, while Srobdom has the most power to detect linkage in a dominant model [19].
Parametric linkage analysis was performed in three steps: (1) obtaining inheritance vectors using MORGAN gl_auto, (2) determining pedigree baseline log likelihoods with MORGAN base_trait_lods, and (3) calculating logarithm of odds (LOD) scores with MORGAN gl_lods [20]. The gl_auto option was run with the same setting to estimate inheritance vectors as non-parametric linkage analysis. Four inheritance models; two dominant inheritance models and two recessive models, with either low or high penetrance, were used to calculate LOD-scores, using the liability class-specific penetrance matrix option of MORGAN. Phenocopy rate was set at the population prevalence of 3% [1]. Prevalence of IA tends to increase with age until, but the precise interaction between age and penetrance of high impact genetic variants is unknown [1]. Therefore, we selected two age-dependent penetrance models, reaching a maximum penetrance at age 70 (95% for the high penetrance models, 80% for the low penetrance models), and decreasing by 15% each younger decade until reaching the phenocopy rate (Supplementary Note). Expected allele frequency of the trait locus was set at 0.1% for dominant models and 1% for recessive models.
Regions with statistically significant linkage were defined as having LODāā„ā3.3, or an equivalent NPL-scoreāā„ā3.90, derived from [21]:
Or equivalent:
Regions with suggestive statistically significant linkage were defined as LODāā„ā1 or NPL-scoreāā„ā2.15.
Genome sequencing
From each family, two or three of the affected family members were selected for genome sequencing (GS), totaling seven patients (Fig.Ā 2). Library preparation and sequencing was performed by Macrogen Inc. (Seoul, South Korea). The TruSeq DNA PCR-Free Shotgun Library was used on 350ābp fragments. Sequencing was done on the Illumina HiSeq system. Detailed methods about mapping, variants calling and quality control of GS data are given in the Supplementary Note.
Variant selection
After variant calling, variants located on regions identified by linkage analysis with suggestive statistical significance (LODāā„ā1 or NPL-scoreāā„ā2.15) and for which all affected members of the family in which the region was identified were heterozygous, were retained. We also included variants located on previously identified linkage loci (loci from ref. [22]. and specified in Supplementary TableĀ 1). SnpEff was used to annotate variants (more details in the Supplementary Note) [23]. Variants in loci with suggestive significance were selected if they (1) had MAFāā¤ā1% in non-Finnish Europeans in ExAC [24], Dutch controls from Project MinE [16], and Dutch controls from GoNL [25], or were not reported in any of these datasets, and (2) were disruptive of non-disruptive missense, which included any of the following SnpEff annotations: exon loss, frameshift, splice acceptor, splice donor, start site loss, stop site gain, stop site loss or transcript ablation, rare amino acid variant, inframe deletion, inframe insertion, or missense.
Sanger sequencing
In order to validate variants found using GS, and to test segregation in additional affected and unaffected family members, we used Sanger sequencing. In pedigree 1 an additional seven affected family members (resulting in nine affected family members with genotyping information in total) and five unaffected members were included; in pedigree 2 an additional three affected (six in total) and twelve unaffected, and in pedigree 3 an additional two affected (four in total) and eight unaffected members were included. PCR primers were designed using Primer3Plus, or if not possible by manual selection. Surrounding sequences (500ābp on both ends) were obtained from Ensembl v99. In order to design primers and probes that only bind to positions without genetic variants, the genetic variants with MAFāā„ā0.01% were converted to an N before uploading in the TaqMan assay design tool. An overview of the primers is shown in Supplementary TableĀ 2. Primer annealing temperature was optimized for each primer pair, by selecting the temperature resulting in the strongest specific amplification, as determined by gel electrophoresis. Genotypes were called by visually inspecting the Sanger sequencing chromatograms.
TaqMan assay
Segregating variants identified by linkage analysis and GS and validated using Sanger sequencing were additionally genotyped in a large cohort of 937 IA patients (TableĀ 1) using TaqMan assay (Thermo Fisher Scientific Inc. Waltham, MA, USA). Details on assay design and optimization are given in the Supplementary Note. Quantitative polymerase chain reaction was conducted using the QuantStudio 6 Flex Real-Time PCR system (Applied biosystems). Polymerase chain reaction protocol was as follows: 10āmin at 95āĀ°C, 40 cycles of [15ās at 92āĀ°C, 1āmin at 60āĀ°C], and 1āmin at 60āĀ°C. Fluorescence was read by the QuantStudio 6 Flex Real-Time PCR system and analyzed using the QuantStudio 6 and 7 system software. Genotypes of Dutch controls from Project MinE were obtained by GS as described before [15]. Association of the variants with IA was determined using two-tailed Firth logistic regression in PLINK v2.0. Sex and the first 10 genetic principal components projected on the HapMap v3 reference dataset were used as covariates. Only European ancestry individuals were included. These were defined by a maximum of five standard deviations distance from the mean of the genetically Dutch control samples from Project MinE in principal components 1ā4. Since only persons homozygous for the ancestral allele, or heterozygous, were observed, a dominant inheritance model was selected. The analysis was also performed including only cases with positive family history versus controls. A P-value of 0.05 divided by the number of genes included with at least one observed mutated allele (Nā=ā5) was considered statistically significant.
Rare variant association and burden analysis
To assess if additional rare variants in the genes containing potentially pathogenic variants segregating with IA, may be associated with IA, we performed two analyses using exome sequencing data from the UK Biobank: a rare variant burden analysis and a variant-level association analysis of potentially pathogenic variants. Variants were annotated based on predicted functional impact using SNPEff [23]. The association between IA and the number of rare, coding, minor alleles of the selected genes was calculated by two-tailed Firth logistic regression. For the variant-level association analysis, variants predicted to have a high functional impact, and a minor allele frequency below 1% were selected. Association was assessed using two-tailed Firth logistic regression. Statistical significance for the variant-level association analysis was defined as a P-value below 0.05 divided by the number of variants tested for that gene. More details on both analyses are given in the Supplementary Note.
Gene expression analysis
For the genes in which segregating variants were found, we performed a lookup of the gene expression, and of differentially expressed genes between IA tissue and control cerebral artery tissue. We used RNA sequencing read counts, and differential expression data for IA tissue and control cerebral artery tissue analyzed and described before [26]. Differential expression with a false discovery rate below 0.05 was considered statistically significant.
Results
Linkage analysis
After genotype quality control 162,641 SNPs and all 19 cases and 25 controls remained for linkage analysis. Non-parametric linkage analysis identified 12 regions with NPL-scoreāā„ā3.90 (equivalent to LODāā„ā3.3; statistically significant linkage) and 51 regions with NPL-scoreāā„ā2.15 (LODāā„ā1.0; suggestive statistically significant linkage) across the three families (TableĀ 2 and Supplementary TableĀ 3 for genome-wide NPL scores and full linkage statistics). All statistically significant linkage regions were identified using a dominant inheritance model (the Srobdom statistic), while one of these regions was also detected using a recessive model (the Spairs statistic). Parametric linkage analysis also identified five of these regions with at least suggestive evidence for linkage (2q23.3ā24.1 and 2q35 in pedigree 1, 10q22.1ā22.2 and 14q12ā13.1 in pedigree 2, and 5q14.2ā14.3 in pedigree 3), but did not identify any additional regions of interest (Supplementary TableĀ 4). A total of 39 regions with suggestive statistical significance (LODāā„ā1.0) were found using parametric linkage analysis.
Rare variant discovery by genome sequencing
Eleven variants on four distinct regions with at least suggestive evidence for linkage (Supplementary TablesĀ 3 and 4) passed the selection criteria for potential disease-causing variants in any of the families: one variant on locus 1p13.1ā31.1 identified in pedigree 3, two on 2q23.3ā24.1 identified in pedigree 1, one on 8p22.2 (and previously identified by Kim et al. [27]) and found in pedigree 1, and seven on 9q22.2ā22.33 identified in pedigree 1 (Supplementary TableĀ 5). On region 9q22.2ā22.33, variant COL15A1 p.Lys1001Arg was found in all affected members of pedigree 2, while the region was identified by linkage in pedigree 1. Sanger sequencing in the remaining family members confirmed segregation of six of the 11 variants among all affected members, while for the other five, not all cases among one family were heterozygous, and these variants were excluded from further analysis (See TableĀ 3 for both segregating and not segregating variants). The six variants that remained were: SYCP1 p.Tyr87Cys on 1p13.1ā31.1 in pedigree 3, FMNL2 p.Phe1077Leu on 2q23.3ā24.1 in pedigree 1, and four variants on 9q22.2ā22.33 in pedigree 1: TBC1D2 p.Thr754Lys, ZNF782 p.Arg321His, CCDC180 p.Arg979Trp, and NCBP1 p.Val125Met. All affected family members in pedigree 1 were heterozygous for both FMNL p.Phe1077Leu and TBC1D2 p.thr754Lys. Among the unaffected members of pedigree 1, one was heterozygous for both variants. Despite this genotype, this male member (II-8) had a negative screen for IA at age 65. Three unaffected family members were not heterozygous for either variant (II-4, II-7, and II-12), and one for neither (II-9).
Association of segregating variants in the general IA patient population
In the cohort of 937 IA patients (including 777 sporadic and 160 familial patients), for all variants except TBC1D2 p.Thr754Lys, at least one person was heterozygous for the risk allele among both cases and controls (TableĀ 4), but none of the variants showed a statistically significant association with IA. When analysing only cases with a positive family history for ASAH also no association was found. In the UK Biobank dataset, no statistically significant rare variant burden was found for gene containing one of the segregating variants (Supplementary TableĀ 6), and no additional potentially pathogenic variants with a statistically significant association with IA status were found (Supplementary TableĀ 7).
Gene expression in intracranial aneurysm and cerebral artery tissue
The genes FMNL2 and TBC1D2 had high median expression levels (among top 20% highest expressed genes with non-zero median expression) in IA tissue as well as control cerebral artery tissue (median RPKMā=ā11.1 [rangeā=ā1.18ā38.5] and 10.3 [1.55ā61.9], respectively in the combined tissues), whereas NCBP1 and ZNF782 showed a lower expression and SYCP1 was expressed in too low amounts to be reliably detected (Supplementary TableĀ 8). Differential expression analysis of these genes did not show difference in expression between aneurysms and control tissue for any of the genes.
Discussion
Using genome-wide linkage analysis combined with GS in large families with multiple affected members with IA, we identified six rare, damaging variants located in the genes SYCP1, FMNL2, TBC1D2, ZNF782, CCDC180, and NCBP1 which segregated among all cases of one of these families. All genes except SYCP1 were expressed in intracranial aneurysm and control cerebral artery tissue, while FMNL2 and TBC1D2 showed high gene expression in these tissues, further prioritizing these genes. Genotyping the identified variants in a large cohort of IA patients did not show additional evidence that these variants play a role in IA patients beyond the families in which they were discovered.
Of the rare, damaging variants we identified, five were located on either of two regions with strong evidence (2q23.3ā24.1 and 9q22.2ā22.33), and one on a region with suggestive evidence (1p13.1ā31.1). By Sanger sequencing we confirmed that all affected family members of the respective families were heterozygous for the identified variants. For SYCP1 p.Tyr87Cys on 1p13.1ā31.1, all four affected family members of pedigree 3 and only one of eight unaffected members of this family was heterozygous for the mutated allele. This is a reasonably strong segregation, but the number of affected members included remains too low to consider the genetic cause in this family resolved. For pedigree 1, all affected family members were heterozygous for variants on two linkage loci, 2q23.3ā24.1 and 9q22.2ā22.33. This means that all affected family members were heterozygous for both haplotypes. Whether this means that a genetic variant on both loci is required to develop and IA, or whether one of the loci is a false positive cannot be determined.
Variant FMNL2 p.Phe1077Leu is located in the diaphanous autoregulatory domain of Formin-like protein 2. The protein interacts with Rho-GTPases and plays a role in nonbranched actin filament elongation and stability, especially in lamellipodia and filopodia [28,29,30]. Similar to IA-associated Rho-GTPase ARHGEF17, of which pathogenic variants cause intracranial haemorrhage in zebrafish, this could indicate that IA formation or rupture is mediated through actin-dependent smooth muscle cell action [10, 12]. The protein product of TBC1D2 also influences GTPase activity, through activation of the GTPase of Ras-related protein Rab-7a, and thereby reducing cell-cell adhesion. Furthermore, it is involved in cadherin-dependent cell-cell adhesion [28, 31]. Variant TBC1D2 p.Thr754Lys is located in the Rab-GTPase-TBC domain. In the rat, genetic variation in a large region surrounding ortholog Tbc1d2 was strongly associated with ruptures of the internal elastic lamina of the aorta and iliac arteries [32]. The role of this gene is largely unclear, but could indicate that it provides IA risk through impaired elastic lamina integrity. Variant NCBP1 p.Val125Met is located in nuclear cap-binding protein subunit 1 gene in the domain āmiddle domain of eukaryotic initiation factor 4Gā (MIF4G). The protein product is part of the mRNA cap-binding complex and involved in a range of mRNA processes [28, 33]. It is unclear how this gene might contribute to IA pathophysiology. The protein products of SYCP1 and ZNF782 are involved in DNA binding, without apparent implications in blood vessel or aneurysm biology [28]. The role of CCDC180, also known as BDAG1, is not well understood.
Several of the regions identified by linkage analysis in our study were previously already implicated to play a role in IA: some were identified in former genetic analyses of IA and some in genetic analyses of related diseases. The 9q22.2 locus identified in pedigree 1 maps to a locus identified by a GWAS of IA in the Korean population [34]. The locus was further linked to coronary artery disease by linkage analyses in 24 large and in 428 multiplex families (Two-point NPL-scores of 4.54 and 3.72, respectively) [35, 36]. The locus contains TGFBR2, a gene of which pathogenic variants are the cause for 25% of cases of Loeys Dietz syndrome, which predisposes mainly to thoracic aortic aneurysms but also to IA [37,38,39]. Here, no potential disease-causing variants in TGFBR2 were found. Instead, we found four rare, damaging variants on this locus that segregated with disease status in pedigree 1: TBC1D2 p.Thr754Lys, ZNF782 p.Arg321His, CCDC180 p.Arg979Trp, and NCBP1 p.Val125Met. In the same family, linkage was also found on locus 2q23.3ā24.1. This locus was associated with IA in a GWAS of a Finnish population isolate [40] and has been linked to chronic hypertension [41,42,43]. Another locus identified in this study, 10q22.1ā22.2, contains the ACTA2 gene, of which pathogenic variants are the cause 14% of familial thoracic aortic aneurysms and dissection [44]. Here, we were not able to identify rare, damaging variants in ACTA2 or any other gene in this locus. The 5q14.2ā14.3 locus contains the EDIL3 gene. An exome sequencing study on a family of three members with ASAH found that EDIL3 p.Cys128Tyr was shared by all cases. In a cohort of 35 unrelated IA cases, the researchers found another variant in EDIL3 (p.Lys387Gln) in 2 cases [45]. Here, we did not find any rare, damaging variants in EDIL3, nor in other genes in this locus. Locus 9p21.3 is a well-known GWAS risk locus for IA and several other (cerebro-)vascular diseases [7, 46, 47], and locus 6q16.1 was also been identified as risk variant in GWAS of IA [7]. Both regions identified in this study are locatedāĀ±ā3 megabases away from the lead GWAS variants, which could mean that these are located too far away to be driven by the same causal variants as the GWAS loci. No rare, damaging variants were found in these regions.
This study has considerable strengths. First, the combination of linkage analysis with genome sequencing in large families provides an opportunity to detect rare damaging variants on segregating genomic regions. Second, using GS instead of exome sequencing provides a more uniform coverage of the genome and can detect more coding variants than exome sequencing [48], which was used in previous IA sequencing studies [8,9,10,11,12]. Another strength is the use of MORGAN for the linkage analysis. This allows joint analysis of the full pedigrees and leverages dense SNP information to provide high-resolution allele sharing information.
Some limitations apply to this study. First, the genetic association between variants discovered in this study and IA does not prove a causal relationship. Functional follow-up studies are needed to further prioritize these variants. Second, most regions of linkage were identified through an NPL statistic optimized for dominant inheritance (Srobdom), while only one and five of those were also identified using the Spairs statistic or parametric linkage analysis, respectively. Srobdom uses an exponential function that increases quickly with the number of affected persons who share an allele identical-by-descent [19]. This can lead to very high Srobdom in large pedigrees with many closely related persons such as pedigree 1 in this study. To deal with this, we used parametric and non-parametric linkage analysis, allowing for both dominant and recessive inheritance (by defining penetrance categories for parametric linkage analysis, and Spairs and Srobdom for NPL) for the purpose of detecting regions of linkage, and subsequently validated segregation among all affected members of a family using Sanger sequencing. Thereby we identified regions truly shared identical-by-descent among all affected family members, and thus eliminating this potential limitation. Another limitation is that we identified multiple variants of interest within a single locus of linkage, in particular 9q22.2ā22.33. It can be expected that only a single causal variant is present on this locus, indicating the majority of variants segregates with IA simply by being in close proximity to this variant and are not causal. Including additional family members to increase resolution, identifying structural genetic variants, and performing functional experiments are ways to further prioritize variants in this locus.
In summary, we conducted a genome-wide linkage analysis paired with GS in three large families with IA and ASAH patients. This led to the identification of six potential IA-causing variants in families. We did not find evidence that these variants also contribute to IA in the general IA population. Regions of linkage showed overlap with previous genetic studies of IA and cerebrovascular and connective tissue disorders, providing insight in the pathogenic mechanisms of IA. As a future direction of research, we suggest functional follow-up studies to discover the roles of the IA-associated genes, in particular FMNL2 and TBC1D2 which are highly expressed in cerebral artery and intracranial aneurysm tissue, and to discover how the damaging variant in those genes contribute to IA pathophysiology.
Data availability statement
The datasets generated and/or analyzed during the current study are available from the corresponding author on reasonable request.
References
Vlak MH, Algra A, Brandenburg R, Rinkel GJ. Prevalence of unruptured intracranial aneurysms, with emphasis on sex, age, comorbidity, country, and time period: a systematic review and meta-analysis. Lancet Neurol. 2011;10:626ā36.
Etminan N, Chang HS, Hackenberg K, de Rooij NK, Vergouwen MDI, Rinkel GJE, et al. Worldwide incidence of aneurysmal subarachnoid hemorrhage according to region, time period, blood pressure, and smoking prevalence in the population: a systematic review and meta-analysis. JAMA Neurol. 2019;76:588ā97. https://doi.org/10.1001/jamaneurol.2019.0006.
Nieuwkamp DJ, Setz LE, Algra A, Linn FH, de Rooij NK, Rinkel GJ. Changes in case fatality of aneurysmal subarachnoid haemorrhage over time, according to age, sex, and region: a meta-analysis. Lancet Neurol. 2009;8:635ā42.
Bor AS, Rinkel GJ, Adami J, Koffijberg H, Ekbom A, Buskens E, et al. Risk of subarachnoid haemorrhage according to number of affected relatives: a population based case-control study. Brain. 2008;131:2662ā5. https://doi.org/10.1093/brain/awn187.
Mensing LA, Greving JP, Verhoeff TA, Rinkel GJE, Ruigrok YM. Comparison of rupture risk of intracranial aneurysms between familial and sporadic patients. Stroke. 2019;50:1380ā3. https://doi.org/10.1161/STROKEAHA.118.023783.
Korja M, Silventoinen K, McCarron P, Zdravkovic S, Skytthe A, Haapanen A, et al. Genetic epidemiology of spontaneous subarachnoid hemorrhage: Nordic Twin Study. Stroke. 2010;41:2458ā62.
Bakker MK, van der Spek RAA, van Rheenen W, Morel S, Bourcier R, Hostettler IC, et al. Genome-wide association study of intracranial aneurysms identifies 17 risk loci and genetic overlap with clinical risk factors. Nat Genet. 2020;52:1303ā13. https://doi.org/10.1038/s41588-020-00725-7.
Yan J, Hitomi T, Takenaka K, Kato M, Kobayashi H, Okuda H, et al. Genetic study of intracranial aneurysms. Stroke. 2015;46:620ā6.
Santiago-Sim T, Fang X, Hennessy ML, Nalbach SV, DePalma SR, Lee MS, et al. THSD1 (Thrombospondin Type 1 domain containing Protein 1) mutation in the pathogenesis of intracranial aneurysm and subarachnoid hemorrhage. Stroke. 2016;47:3005ā13.
Bourcier R, Le Scouarnec S, Bonnaud S, Karakachoff M, Bourcereau E, Heurtebise-ChrĆ©tien S, et al. Rare coding variants in ANGPTL6 are associated with familial forms of intracranial aneurysm. Am J Hum Genet. 2018;102:133ā41.
Lorenzo-Betancor O, Blackburn PR, Edwards E, Vazquez-do-Campo R, Klee EW, Labbe C, et al. PCNT point mutations and familial intracranial aneurysms. Neurology. 2018;91:e2170āe2181. https://doi.org/10.1212/WNL.0000000000006614.
Yang X, Li J, Fang Y, Zhang Z, Jin D, Chen X, et al. Rho guanine nucleotide exchange factor arhgef17 is a risk gene for intracranial aneurysms. Circ Genom Precis Med. 2018;11:e002099 https://doi.org/10.1161/CIRCGEN.117.002099.
Ozturk AK, Nahed BV, Bydon M, Bilguvar K, Goksu E, Bademci G, et al. Molecular genetic analysis of two large kindreds with intracranial aneurysms demonstrates linkage to 11q24-25 and 14q23-31. Stroke. 2006;37:1021ā7. https://doi.org/10.1161/01.STR.0000206153.92675.b9.
Santiago-Sim T, Depalma SR, Ju KL, McDonough B, Seidman CE, Seidman JG, et al. Genomewide linkage in a large Caucasian family maps a new locus for intracranial aneurysms to chromosome 13q. Stroke. 2009;40:S57ā60. https://doi.org/10.1161/STROKEAHA.108.534396.
van der Spek RAA, van Rheenen W, Pulit SL, Kenna KP, van den Berg LH, Veldink JH, et al. The project MinE databrowser: bringing large-scale whole-genome sequencing in ALS to researchers and the public. Amyotroph Lateral Scler Frontotemporal Degener. 2019;20:432ā40. https://doi.org/10.1080/21678421.2019.1606244.
Project MinE ALS Sequencing Consortium. Project MinE: study design and pilot analyses of a large-scale whole-genome sequencing study in amyotrophic lateral sclerosis. Eur J Hum Genet. 2018;26:1537ā46. https://doi.org/10.1038/s41431-018-0177-4.
Basu S, Di Y, Thompson EA. Exact trait-model-free tests for linkage detection in pedigrees. Ann Hum Genet. 2008;72:676ā82. https://doi.org/10.1111/j.1469-1809.2008.00451.x.
Whittemore AS, Halpern J. A class of tests for linkage using affected pedigree members. Biometrics. 1994;50:118ā27.
McPeek MS. Optimal allele-sharing statistics for genetic mapping using affected relatives. Genet Epidemiol. 1999;16:225ā49. 10.1002/(SICI)1098-2272(1999)16:3<225::AID-GEPI1>3.0.CO;2-#
Thompson E. The structure of genetic linkage data: from LIPED to 1M SNPs. Hum Hered. 2011;71:86ā96. https://doi.org/10.1159/000313555.
Abreu PC, Greenberg DA, Hodge SE. Direct power comparisons between simple LOD scores and NPL scores for linkage analysis in complex diseases. Am J Hum Genet. 1999;65:847ā57. https://doi.org/10.1086/302536.
Zhou S, Dion PA, Rouleau GA. Genetics of intracranial aneurysms. Stroke. 2018;49:780ā7. https://doi.org/10.1161/STROKEAHA.117.018152.
Cingolani P, Platts A, Wang le L, Coon M, Nguyen T, Wang L, et al. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly. 2012;6:80ā92. https://doi.org/10.4161/fly.19695.
Karczewski KJ, Francioli LC, Tiao G, Cummings BB, Alfoldi J, Wang Q, et al. The mutational constraint spectrum quantified from variation in 141,456 humans. Nature. 2020;581:434ā43. https://doi.org/10.1038/s41586-020-2308-7.
Genome of the Netherlands Consortium. Whole-genome sequence variation, population structure and demographic history of the Dutch population. Nat Genet. 2014;46:818ā25. https://doi.org/10.1038/ng.3021.
Kleinloog R, Verweij BH, van der Vlies P, Deelen P, Swertz MA, de Muynck L, et al. RNA Sequencing Analysis of Intracranial Aneurysm Walls Reveals Involvement of Lysosomes and Immunoglobulins in Rupture. Stroke. 2016;47:1286ā93. https://doi.org/10.1161/STROKEAHA.116.012541.
Kim CJ, Park SS, Lee HS, Chung HJ, Choi W, Chung JH, et al. Identification of an autosomal dominant locus for intracranial aneurysm through a model-based family collection in a geographically limited area. J Hum Genet. 2011;56:464ā6. https://doi.org/10.1038/jhg.2011.27.
UniProt Consortium. UniProt: a worldwide hub of protein knowledge. Nucleic Acids Res. 2019;47:D506āD515. https://doi.org/10.1093/nar/gky1049.
Block J, Breitsprecher D, Kuhn S, Winterhoff M, Kage F, Geffers R, et al. FMNL2 drives actin-based protrusion and migration downstream of Cdc42. Curr Biol. 2012;22:1005ā12. https://doi.org/10.1016/j.cub.2012.03.064.
Ostberg, NP, Zafar, MA, Ziganshin, BA & Elefteriades, JA, The genetics of thoracic aortic aneurysms and dissection: a clinical perspective. Biomolecules, 2020;10:182.
Frasa MA, Maximiano FC, Smolarczyk K, Francis RE, Betson ME, Lozano E, et al. Armus is a Rac1 effector that inactivates Rab7 and regulates E-cadherin degradation. Curr Biol. 2010;20:198ā208. https://doi.org/10.1016/j.cub.2009.12.053.
Smith JR, Hayman GT, Wang SJ, Laulederkind SJF, Hoffman MJ, Kaldunski ML, et al. The year of the rat: the rat genome database at 20: a multi-species knowledgebase and analysis platform. Nucleic Acids Res. 2020;48:D731āD742. https://doi.org/10.1093/nar/gkz1041.
Izaurralde E, Lewis J, McGuigan C, Jankowska M, Darzynkiewicz E, Mattaj IW. A nuclear cap binding protein complex involved in pre-mRNA splicing. Cell. 1994;78:657ā68. https://doi.org/10.1016/0092-8674(94)90530-4.
Hong, EP, Kim, BJ, Cho, SS, Yang, JS, Choi, HJ, Kang, SH et al. Genomic variations in susceptibility to intracranial aneurysm in the Korean population. J Clin Med 8, https://doi.org/10.3390/jcm8020275 (2019).
Guo Y, Wang F, Li L, Gao H, Arckacki S, Wang IZ, et al. Genome-wide linkage analysis of large multiple multigenerational families identifies novel genetic loci for coronary artery disease. Sci Rep. 2017;7:5472 https://doi.org/10.1038/s41598-017-05381-2.
Gao H, Li L, Rao S, Shen G, Xi Q, Chen S, et al. Genome-wide linkage scan identifies two novel genetic loci for coronary artery disease: in GeneQuest families. PLoS One. 2014;9:e113935 https://doi.org/10.1371/journal.pone.0113935.
Kim ST, Brinjikji W, Kallmes DF. Prevalence of intracranial aneurysms in patients with connective tissue diseases: a retrospective study. Am J Neuroradiol. 2016;37:1422ā6. https://doi.org/10.3174/ajnr.A4718.
Loeys BL, Schwarze U, Holm T, Callewaert BL, Thomas GH, Pannu H, et al. Aneurysm syndromes caused by mutations in the TGF-beta receptor. N. Engl J Med. 2006;355:788ā98. https://doi.org/10.1056/NEJMoa055695.
Pomianowski P, Elefteriades JA. The genetics and genomics of thoracic aortic disease. Ann Cardiothorac Surg. 2013;2:271ā9. https://doi.org/10.3978/j.issn.2225-319X.2013.05.12.
Kurki MI, Gaal EI, Kettunen J, Lappalainen T, Menelaou A, Anttila V, et al. High risk population isolate reveals low frequency variants predisposing to intracranial aneurysms. PLoS Genet. 2014;10:e1004134 https://doi.org/10.1371/journal.pgen.1004134.
Kalmyrzaev B, Aldashev A, Khalmatov M, Polupanov A, Jumagulova A, Mamanova L, et al. Genome-wide scan for premature hypertension supports linkage to chromosome 2 in a large Kyrgyz family. Hypertension. 2006;48:908ā13. https://doi.org/10.1161/01.HYP.0000244107.13957.2b.
Williams PJ, Broughton Pipkin F. The genetics of pre-eclampsia and other hypertensive disorders of pregnancy. Best Pr Res Clin Obstet Gynaecol. 2011;25:405ā17. https://doi.org/10.1016/j.bpobgyn.2011.02.007.
Mein CA, Caulfield MJ, Dobson RJ, Munroe PB. Genetics of essential hypertension. Hum Mol Genet. 2004;13 Spec No 1:R169ā175. https://doi.org/10.1093/hmg/ddh078.
Guo DC, Pannu H, Tran-Fadulu V, Papke CL, Yu RK, Avidan N, et al. Mutations in smooth muscle alpha-actin (ACTA2) lead to thoracic aortic aneurysms and dissections. Nat Genet. 2007;39:1488ā93. https://doi.org/10.1038/ng.2007.6.
Sauvigny T, Alawi M, Krause L, Renner S, Spohn M, Busch A, et al. Exome sequencing in 38 patients with intracranial aneurysms and subarachnoid hemorrhage. J Neurol. 2020;267:2533ā45. https://doi.org/10.1007/s00415-020-09865-6.
Nikpay M, Goel A, Won HH, Hall LM, Willenborg C, Kanoni S, et al. A comprehensive 1,000 Genomes-based genome-wide association meta-analysis of coronary artery disease. Nat Genet. 2015;47:1121ā30. https://doi.org/10.1038/ng.3396.
Malik R, Chauhan G, Traylor M, Sargurupremraj M, Okada Y, Mishra A, et al. Multiancestry genome-wide association study of 520,000 subjects identifies 32 loci associated with stroke and stroke subtypes. Nat Genet. 2018;50:524ā37. https://doi.org/10.1038/s41588-018-0058-3.
Belkadi A, Bolze A, Itan Y, Cobat A, Vincent QB, Antipenko A, et al. Whole-genome sequencing is more powerful than whole-exome sequencing for detecting exome variants. Proc Natl Acad Sci USA. 2015;112:5473ā8. https://doi.org/10.1073/pnas.1418631112.
Acknowledgements
We like to thank Dr. V. Guryev for providing genetic variant information from the GoNL data resource. This research was conducted using the UK Biobank Resource under application 2532. This study was supported by the Netherlands Cardiovascular Research Initiative: An initiative with support of the Dutch Heart Foundation, CVON2015-08 ERASE. This project has received funding from the European Research Council (ERC) under the European Unionās Horizon 2020 research and innovation programme (PRYSM, grant agreement No. 852173).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Ethical approval
All participants of the study signed informed consent.
Additional information
Publisherās note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the articleās Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the articleās Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Bakker, M.K., Cobyte, S., Hennekam, F.A.M. et al. Genome-wide linkage analysis combined with genome sequencing in large families with intracranial aneurysms. Eur J Hum Genet 30, 833ā840 (2022). https://doi.org/10.1038/s41431-022-01059-0
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1038/s41431-022-01059-0