Introduction

Cardiovascular disease, specifically atherosclerotic vascular disease, is emerging as a major comorbidity in HIV-infected (HIV+) individuals, who are living longer with effective treatments.1 By the year 2015, it is estimated that over half of HIV+ individuals in the US will be aged 50 or greater. The risk of cardiovascular disease events is higher in HIV+ individuals than in HIV-negative individuals of the same age group. Previous studies have found similar levels of carotid intima-media thickness (cIMT), a subclinical measure of atherosclerosis, in HIV+ individuals and patients with coronary artery disease. HIV infection appears to increase the risk of atherosclerosis primarily through chronic immune hyperactivation and chronic inflammation.2, 3, 4, 5 In addition, cIMT progression in the common carotid artery (CCA) is known to be accelerated among HIV+ individuals compared with HIV-negative controls.6, 7 Several studies have shown that CCA cIMT is a reliable and valid subclinical measure of early atherosclerosis, and is a strong predictor of incident cardiovascular events in HIV-negative individuals.8, 9 A systematic review and meta-analysis confirmed this strong relationship between CCA cIMT and future vascular events including coronary artery disease.10, 11 Thus, factors contributing to cIMT would provide insights to elucidate biological mechanisms underlying atherosclerosis.

Heritable factors account for 21–38% of the variance in CCA cIMT.12, 13 Two single-nucleotide polymorphisms (SNPs; rs2229116 and rs7177922) in the ryanodine receptor 3 gene (RYR3), encoding a calcium ion (Ca2+) channel mainly expressed in the endoplasmic reticulum of cardiac and endothelial cells, were significantly associated with CCA cIMT in a genome-wide association study (GWAS) in subjects from the Fat Redistribution and Metabolic Change in HIV Infection (FRAM) study.14 The association of one of these non-synonymous SNPs (rs2229116) was replicated in white men in the Multicenter AIDS Cohort Study (MACS).15 Recently other variants in RYR3 were associated with CCA cIMT in HIV+ women in Women’s Interagency HIV Study (WIHS).16 In addition, variants in RYR3 increased risk of atherosclerotic cardiovascular outcomes, specifically heart failure, among hypertensive patients randomized to a Ca2+ channel blocker versus other antihypertensive drugs in the Genetics of Hypertension Associated Treatment (GenHAT) study.17 All of these studies indicate an important role of RYR3, specifically hypothesized through its differential function in Ca2+ mobility in the endoplasmic reticulum that can lead to apoptosis of macrophages in atherosclerotic plaques in HIV patients.5

In this report, we explore the genetic diversity of the RYR3 gene in 96 HIV+ white males in FRAM, and investigate the associations of all the alleles with extreme common cIMT measurements. Sequencing of the extremes confers a substantial efficiency gain in association studies, because an excess of rare genetic variants among individuals at one extreme compared with the other extreme underscores strong evidence of association between total mutational load and cIMT.18

Materials and methods

Source population and sequencing study design

FRAM is a multi-center (n=16) cohort study of 1183 HIV+ men and women and 297 healthy adult subjects examined from 2000 to 2002.19 cIMT measurements in FRAM were obtained by centrally trained and certified ultrasonograpers with B-mode ultrasound in the common and internal carotid arteries using a standardized protocol developed by Dr Daniel O’Leary.20 For sequencing of the RYR3 gene, 48 white males (average age=44.3±7.1 years) with the lowest common cIMT measurements (average common cIMT=0.729±0.075 mm) and 48 white males (average age=54.6±8.7 years) with the highest common cIMT measurements (average common cIMT=1.043 ±0.161 mm) within the FRAM participants were selected for the sequencing study.

The parent FRAM study and this sub-study conformed to the procedures for informed consent approved by institutional review boards at all participating sites and the genetic study at UAB and to human-experimentation guidelines set forth by the United States Department of Health and Human Services.

Fragment libraries

A bait library for the Agilent SureSelect in-solution capture reagent was developed covering all the 104 exonic and 2 kb flanking region sequence of RYR3 gene (a total of 339 480 bp), located at chr15:33 601 161–34 159 307 of hg19 version of the human genome. Following the manufacturer’s protocol for the SureSelect Target Enrichment System (Agilent Technologies, Santa Clara, CA, USA) and as previously described,21 2 μg of genomic DNA was fragmented by sonication using the Covaris S2 (Covaris, Woburn, MA, USA) to achieve a uniform distribution of fragments with a mean size of 150 bp. The sonicated DNA was purified using Agencourt’s AMPure XP Solid Phase Reversible Immobilization paramagnetic bead (Beckman Coulter, Brea, CA, USA) followed by polishing of the DNA ends by removing the 3′ overhangs and filling in the 5′ overhangs, resulting from sonication using T4 DNA polymerase and Klenow fragment (New England Biolabs, Ipswich, MA, USA). Following end polishing, a single ‘A’-base was added to the 3′ end of the DNA fragments using Klenow fragment (3′ to 5′ exo minus). This prepared the DNA fragments for ligation to specialized adapters that have a ‘T’-base overhang at their 3′ ends. The end-repaired DNA with a single ‘A’-base overhang was ligated to the Illumina paired-end adapters (Illumina, San Diego, CA, USA) in a standard ligation reaction using T4 DNA ligase and 3 μM final adapter concentration. Following ligation, the samples were purified using Agencourt’s AMPure XP Solid Phase Reversible Immobilization paramagnetic beads, quality controlled by assessment on the Agilent Bioanalyzer (Agilent Technologies) and then amplified by six cycles of PCR in two identical reactions for each sample to maintain complexity and to avoid bias owing to amplification.

Hybrid selection

Five-hundred nanograms of purified DNA (DNA library) was prepared for hybridization by adding the DNA library to Agilent blocking reagents, denaturing at 95 °C and incubating at 65 °C. All subsequent steps are performed at 65 °C. Hybridization buffer was added to the prepared library and the entire mix was then added to an aliquot of the Agilent SureSelect Capture Library and mixed (Agilent Technologies). The DNA library and biotin-labeled Capture Library were hybridized by incubation at 65 °C for 24 h. Following hybridization, streptavidin-coated magnetic beads were used to purify the RNA-DNA hybrids formed during hybridization. The RNA capture material was digested via acid hydrolysis, followed by elution from the purification beads. The neutralized captured DNA was amplified for 12 cycles of PCR using Herculase II Fusion DNA polymerase (Agilent Technologies). The libraries were purified following amplification and the library assessed using the Agilent Bioanalyzer and quantified via a real-time PCR reaction. As previously described,22 48 libraries, each barcoded with a different sequence during library preparation, were pooled in molar equivalents to generate a 10 nM final stock. The final library stock was then used in paired-end cluster generation at a final concentration of 6–8 pM to achieve a cluster density of ~600 000 mm2 (Illumina HiSeq2000 instrument, Illumina). The flow-cells were sequenced at a paired-end 50 nt condition to an average coverage of ~250x per sample. The RYR3 sequencing data set has been deposited in the NCBI SRA with accession ID SRP047147.

Bioinformatics analysis

Reads were aligned to the hg19 reference sequence using BWA.23 SNPs and small insertion/deletions (indels) were called using Genome Analysis Toolkit (GATK) software package (Broad Institute, Cambridge, MA, USA).24 wANNOVAR (accessed on 19 January 2014) was used to annotate the genetic variants, amino acid change, SIFT scores, PolyPhen2 scores and identify variants in dbSNP.25, 26 Linkage disequilibrium between SNPs present in the HapMap CEU and 1000 genomes project samples was retrieved from the SNAP server (http://www.broadinstitute.org/mpg/snap/, v2.2).

Association analysis

While individuals with the lowest and highest cIMTs were chosen for sequencing, Alf and Abrahams27 showed that analyzing data as continuous variables from extremes instead of treating them qualitatively as a case-control analysis provides better statistical power. Thus, linear regression model was used in PLINK (version 1.07) (http://pngu.mgh.harvard.edu/purcell/plink/) to analyze genetic associations of individual variants with quantitative cIMT. This is a valid analytical strategy as the normality assumption in linear models is for residuals rather than trait measurements. Statistical association analyses with cIMT were performed using the Genotypic (2 df) test, with adjustments for age, duration of d4t and genetic principle component 1. SNP-set (Sequence) Kernel Association Test (SKAT)28 with default parameters and the same covariates was used to test the association of multiple variants with the continuous trait of cIMT in sliding windows along the gene, with a window size of 4 kb and a skip step of 2 kb.

Results

Description of sequencing data (with function annotations)

Of the 3015 variants identified in the RYR3 gene region, 2710 variants (2414 SNPs and 296 indels) passed the standard GATK filtering with an overall Ts/Tv ratio of 2.1, and 2707 of them were annotated by ANNOVAR (Table 1). A vast majority of these SNPs were intronic, with only 39 in the exons. Each individual had a mean count of 736 SNPs (ranging from 528 to 1032) and 170 indels (ranging from 128 to 214), of which there was an average of 7.6 private variants (including singletons and private doubletons) per person (ranging from 0 to 57). Novel variants (516) were noted for upstream (P=0.0001, χ2 test), non-synonymous exonic (P=0.0009), and 3′ untranslated region variants (P<0.0001).

Table 1 Overall characterization of sequence variants in RYR3 gene

Single variant association analysis

Association analysis was conducted with cIMT for 1861 SNPs and indels with minimal minor allele frequency (MAF) of 1% (excluding singletons with a variant in only one chromosome of an individual). The regional plot for association is shown in Figure 1a. The top significantly associated variants (P<0.001, corresponding to top 30 non-redundant association signals) are shown in Table 2. Not surprisingly, one of the top hits (rs2229116) was the GWAS hit, previously reported in the larger cohort, confirming both genotyping and true association in the extreme study design. In addition, immediately around the GWAS hit, there are multiple variants within about 30 kb downstream, with the strongest signals (P=5.74E−10) located about 7–8 kb outside of the region with strong linkage disequilibrium (LD) to the GWAS hit. There are two regions that showed statistically significant associations with cIMT. The first region was upstream of this SNP, spanning chr15:33762337–33768118 bp with the strongest P-value 2.04E−4. This region includes the second exon, but no exonic variants were identified. The second region is downstream of this SNP spanning chr15:34123709–34126522 bp with the strongest P-value 3.76E−4.

Figure 1
figure 1

Ryanodine receptor 3 (RYR3) gene regional plot of genetic associations with carotid intima-media thickness (cIMT) extremes in Fat Redistribution and Metabolic Change in HIV Infection study (FRAM; −log P-value). (a) Single variant association P-values; (b) P-values from single-nucleotide polymorphism (SNP)-set (Sequence) Kernel Association Test (SKAT) using 4 kb sliding window. A full color version of this figure is available at the Journal of Human Genetics journal online.

Table 2 List of top 30 variants in RYR3 gene region associated with cIMT

Coding and functional variant analysis

Among the 39 exonic variants, 15 were missense mutations and are listed in Table 3. Among the four non-singletons, the previously identified rs2229116, causing I731V mutation, was significantly associated with cIMT at P<7.9 × 10−9. None of the SNPs in the promoter were associated with cIMT after Bonferroni's correction (data not shown).

Table 3 List of known and novel non-synonymous variants in RYR3 gene identified from deep sequencing and associated with cIMT

Multivariate analysis

SKAT28 was used to test the association of multiple variants in sliding windows along the gene, with a window size of 4 kb and a skip step of 2 kb. As seen in Figure 1b, the pattern of the association was largely consistent with single-variant association, although the region around the previous GWAS hit rs2229116 (position 33905410) did not show any significant associations with cIMT after stringent Bonferroni's correction. In addition, some peaks were found in the first intron, but no known genomic features were annotated.

In addition, we investigated the effect of rare variants by comparing the frequencies of private variants (singletons) of individuals with high versus low cIMT. Interestingly, although there is no overall difference (P=0.14, t-test) over the entire length of the sequenced segment, the middle part of the gene Chr15:33.75M–33.95M, had enriched for high cIMT specific private variants (confidence band of LOESS regression is above 0; Figure 2).

Figure 2
figure 2

Fat Redistribution and Metabolic Change in HIV Infection study (FRAM) high carotid intima-media thickness (cIMT) extremes are enriched for singleton mutations in chr15:33.75M–33.95M. Raw mutation frequencies are plotted as dots at ±0.0208(=1/48). LOESS smoothed line are shown in blue with confidence band in gray. A full color version of this figure is available at the Journal of Human Genetics journal online.

Discussion

The objective of this study was to comprehensively evaluate the sequence variations present in the exonic and flanking regions of the RYR3 gene and to determine their associations with cIMT. The genetic architecture of the RYR3 gene was examined in 96 HIV+ white males with extremes of cIMT measurements. These results expand on the previously consistently reported SNP, but the region is not fully explained for functional genomics. While the previously reported GWAS hit (rs2229116) was validated in a different study design (P<7.93 × 10−9), multiple independent association signals were also identified throughout the 500 kb RYR3 gene region, including the most significant association with rs28501179, rs10519841 and rs7168848 (P<5.74 × 10−10).

We report 2707 variants in the RYR3 gene region, including 516 (19%) novel ones that have not been reported in the known genetic-variant database (dbSNP). A majority of the novel variants were rare (MAF<0.01). Because the sample size of our study is n=96, the rare mutation is really singletons (1/192). To date, as there are 672 variants (5 frame shift, 414 missense, 3 nonsense and 250 synonymous) within the coding regions in RYR3 reported in dbSNP in NCBI, only 12 have MAF>0.05, of which four are non-synonymous. In our study, only 15 non-synonymous variants were observed, including the four with MAF>0.05 and only one rare SNP is novel (Table 3). Some of these variants reported, which still lack data in different populations including in the dbSNP, might be specific to ethnic groups (for example, white versus other races), or uniquely found within the extremes of common cIMT, or associated with an unknown phenotype in these HIV+ white men. Interestingly, high cIMT individuals were enriched for private (singleton) variants in the middle part of the RYR3 gene. This is intriguing as the overall load of private mutations was not significantly different between low and high cIMT individuals. We used LOESS regression which is a commonly used non-parametric smoothing method but it is rarely used in the context of rare variant association. It captures the density of private mutations throughout the length of the RYR3 gene, which spans over 500 kb region. Our results capture the enrichment signal but it can be tricky to claim any statistical significance. However, all of the rare variants in the coding regions, specifically the ones that are private to the extremes and their concentration in specific region need to be investigated in a larger population, both HIV+ and HIV-negative general population.

This study has several important limitations. The associations of rs2229116 (only exonic in the top 30—Table 2) and other variants around it in the region are still not biologically explained. Evaluations in larger cohorts need to be conducted to examine if these are the true causal variants or are tagging multiple rare functional variants that we observed. Further, genetic associations of these variants need to be assessed in functional studies. The SKAT analysis did not show significant associations beyond Bonferroni's corrections, but this might be the result of low power of SKAT in identifying causal high frequency variants. The sample size was relatively small and we may have missed the identification of certain rare functional variants. However, a study design of extreme cIMT was optimized to provide a cost-effective method to discover causal genetic variations. Allelic heterogeneity is a possibility and thus individuals with extreme cIMT might have rare and/or low frequency functional RYR3 variants not previously reported in the literature. Lanktree et al.29 reported that ‘to obtain similar statistical power, the number needed to genotype is four times greater when using a population-based approach compared to an approach based on targeted genotyping of individuals at the extremes of unexplained atherosclerosis and unexplained protection'. We did not examine other clinical outcomes in these individuals; however, cIMT is a valid subclinical surrogate for atherosclerosis.

In summary, this is the first study to comprehensively examine all the genetic variants through target deep sequencing of RYR3 gene and evaluate their associations with cIMT in HIV+ patients. The results from this study lend further strong support to the role of RYR3 gene in atherosclerosis and provide detailed sequence information of the gene, which can be potentially insightful in understanding the function in the etiology of high cIMT in HIV patients. Even with the small sample size, several significant associations were observed; however, these SNPs need to be followed up in larger sample size and also tested with clinical atherosclerotic outcomes.