Introduction

Lipoprotein (a) (Lp(a)) is an independent risk factor for cardiovascular disease.1, 2 Genetic variants in LPA are strongly associated with both an increased level of Lp(a) and an increased risk of coronary disease,3 suggesting a causal role of Lp(a) in coronary disease. Lp(a) is a low-density lipoprotein (LDL)-like particle that consists of an apolipoprotein(a) covalently linked to apolipoprotein B100 by a disulfide bond.4 Genetic factors have a large impact on the variation of Lp(a) levels and ~70–90% of the total variance of Lp(a) can be attributed to variation within the LPA locus across worldwide populations.5, 6 A copy number variant, the Kringle(IV) type 2 domain, accounts for approximately half of the variance explained by the LPA locus.7, 8 Recent genome-wide association studies (GWAS) in subjects of European descent have identified multiple polymorphisms spanning 12.5 Mb on chromosome 6q26–27, which includes LPA, that are significantly associated with Lp(a) levels independent of each other and of the Kringle IV size polymorphism in LPA (P<5 × 10−8).7 A candidate gene study on multiethnic populations suggested both SNPs at 6q26–27 and the Kringle IV copy number variant were genomic determinants of Lp(a) level, and the proportion of total variance explained by each determinant differs across ethnic groups.9

Lp(a) levels in populations of African ancestry are much higher (2–4-fold) compared with that in populations of European ancestry.10 A genome-wide admixture study on a population of African Americans (AAs) suggested that local ancestry at 6q25.3 was significantly associated with Lp(a) after adjustment for the Kringle IV copy number variant.11 However, so far, no genome- or exome-wide association studies of common and uncommon-coding variants, respectively, have been conducted in populations of African ancestry to assess the importance of other genomic regions on Lp(a) levels.

Here, we present a genome- and an exome-wide association study of Lp(a) among AAs participating in the Jackson Heart Study (JHS). We observed numerous SNPs at the well-established LPA locus and a single SNP in APOE significantly associated with Lp(a) (P<5 × 10−8). A high burden of coding variants in LPA and APOE were also associated with higher Lp(a) levels.

Materials and methods

Study subjects and phenotypes

This study included 1106 AA male and 1789 AA female participants with measured Lp(a) levels and available genome-wide genotype data from the JHS, a longitudinal, population-based cohort from Jackson, Mississippi, USA.12 The design, recruitment and initial characterization of this study was described in detail elsewhere.13 Serum Lp(a) levels (mg dl−1) were measured using a Diasorin nephelometric assay on a Roche Cobas FARA analyzer (Roche Diagnostics Corporation, Indianapolis, IN, USA).14 Fasting LDL, high-density lipoprotein (HDL), triglyceride (TG) and total cholesterol (TC) were measured as described previously.15 For each individual treated with lipid-lowering therapies, the observed lipid value was multiplied by a correction factor (1.352 for LDL, 0.949 for HDL, 1.210 for TG and 1.271 for TC).16 The study protocol was approved by the University of Mississippi Medical Center Institutional Review Board, and written informed consent was obtained from all JHS participants. Descriptive characteristics of the JHS participants in this study were summarized in Table 1.

Table 1 Descriptive statistics of JHS participants

Genome-wide genotype data and genotype imputation

A total of 3030 JHS participants were genotyped using the Affymetrix Genome-Wide Human SNP Array 6.0 (Affymetrix, Santa Clara, CA, USA). Genotyping quality control was conducted using PLINK v.1.07.17 A total of 874 712 SNPs with a call rate >0.95, minor allele frequency (MAF) >0.01 and Hardy–Weinberg equilibrium P-value >1 × 10−6 were included in the genotype imputation target panel. Thirty-eight million SNPs were imputed, using MACH 2.0,18 based on a reference panel consisting of the complete sample of 1000 Genome Project participants (November 2010, Version 3); only SNPs with imputation quality of r2 >0.3 were included in further analysis. The Kringle IV copy number variant was not available in JHS participants.

Exome array

A total of 2790 JHS participants, including 2448 with Affymetrix 6.0 genotype data, were genotyped using the Illumina Human Exome Beadchip (version 12v1_rev5; Illumina Inc., San Diego, CA, USA), consisting of >200 000 putative functional variants selected from >12 000 individual exome and whole-genome sequences across diverse populations and a range of common complex traits.

Statistical analyses

Association of local ancestry estimates

Ethnic differences in Lp(a) levels have been observed between subjects of European and African descent, with some populations of AAs having, on average, almost fourfold higher Lp(a) levels compared with European Americans.10 To assess the impact of genetic admixture within our AA population on Lp(a) levels, we first estimated the average, or global, African ancestry proportion across the genome for all subjects using the software ADMIXTURE,19 assuming K=2 contributing populations. We then tested whether this estimated proportion was associated with natural-log-transformed Lp(a) levels after adjusting for age, sex and body mass index (BMI). Next, ANCESTRYMAP20 was used to estimate local ancestry (probabilities of whether an individual has 0, 1 or 2 alleles of Caucasian ancestry) at 738 831 autosomal SNPs across the genome for each participant in JHS as described previously.15 In brief, local ancestry was inferred using hidden Markov models based on the genotypes from a panel of densely spaced markers with highly differential allele frequencies between African and European populations. We performed admixture mapping across the whole genome by regressing natural-log-transformed Lp(a) levels, adjusting for age, sex, BMI and global ancestry, on the local ancestry estimate at each SNP location. The LOD score for association, defined as the log base10 ratio of the likelihood of the data under a model including local ancestry divided by the likelihood of the data under the model excluding local ancestry, was computed at each of these local ancestry informative marker locations across the genome. For regions showing association of increased African ancestry with higher levels of Lp(a), the LOD scores were assigned positive values, and for regions showing association of increased African ancestry with lower levels of Lp(a), the LOD scores were assigned negative values. LOD scores were plotted across the whole genome, and an LOD score of 5 was assumed to be the threshold of statistical significance.20

Genome-wide association analysis

Lp(a) levels were naturally log-transformed to approximate normality of residuals after accounting for age, sex and BMI. The association between Lp(a) and imputed SNPs were tested using multivariable linear regression models in MACH2QTL v.1.08,18 adjusting for age, sex, BMI and the first 10 principal components generated from EIGENSOFT21 based on a linkage disequilibrium pruned set of SNPs with MAF >0.05. An additive mode-of-inheritance model was assumed for genotype; β-coefficients, representing the estimated change in transformed trait value associated with each additional copy of the effect allele, and their corresponding standard errors were reported

Manhattan plots were made to illustrate the association results across the genome. Quantile–quantile (Q–Q) plots of observed versus expected −log10(P-values) were made to assess any systematic inflation of the regression test statistics across the genome before and after removing the SNPs in the 6q25.3 region widely reported to be significantly associated with Lp(a). The genomic inflation factor (λ), defined as observed median value of the χ2 statistic divided by 0.456, was calculated excluding 6q25.3 SNP results. The observed χ2 statistics were divided by λ to obtain the corrected χ2 statistics and corrected P-values. In regions with significant evidence for association, multivariable ‘conditional’ regression models that included the imputed genotype data of the most strongly associated SNP as an additional covariate were performed to assess the evidence for multiple independently associated SNPs in the region. If a second signal also reached genome-wide significance after conditioning on the top variant, multivariable regression models were repeated to include the genotypes of both SNPs as covariates. Region-specific plots were made to show the magnitude of association between all SNPs and Lp(a) levels, as well as the estimated LD between each SNP in the region and the most strongly associated SNP. Finally, to control our association results for possible confounding due to ancestry, for any observed associated SNP, we identified the genetic position of the most strongly associated SNP in the associated region, selected the local ancestry estimate at the location closest to that SNP and performed multivariable regression models as described above, but now including estimated local ancestry proportion as an additional covariate.

Identifying the most important 6q associated SNPs using LASSO-based resample model averaging

To identify the SNPs that most likely contribute to the observed association at 6q, we applied the LASSO local automatic regularization resample model averaging (LLARRMA) method, a method that combines LASSO variable shrinkage and selection with resample model averaging and multiple imputation,22 to estimate the probability of each SNP to be included in a multi-SNP model that best explains the Lp(a) outcome across alternative realizations of the data. We first extracted the genotypes for SNPs on the Affy 6.0 array that mapped within the 1 Mb region centered around the top genotyped SNP (rs9457986) identified in the initial GWAS scan. Then, we used fastPHASE23 to impute the missing genotypes for SNPs in this region, which failed genotyping on Affy 6.0. Finally, we fitted the LASSO models by regressing the residuals of Lp(a) adjusting for age, gender, BMI, 10 principal components and the local ancestry estimate of rs9457986 on the genotypes using the LLARRMA package in R, and calculated a resample model inclusion probability (RMIP) score for each SNP in this 1 Mb region on chromosome 6. LD statistics (R2 and D′ based on 1000G YRI subjects) between variants with RMIP >0.75 were calculated using GOLD24 and plotted using Haploview.25

Haplotype analyses

Haplotype analyses were conducted using the ‘haplo.stats’ R package,26 to examine specific combinations of allelic variants and whether the observed association signal is likely attributable to a less common unmeasured genetic variant. Haplotypes were constructed among the five genotyped variants in the LPA gene with RMIP score >0.75. The ‘haplo.glm’ function implemented in the ‘haplo.stats’ R package was used to calculate effect coefficients (β), standard errors (SE) and P-values for each haplotype relative to the most common reference haplotype. The ‘haplo.score’ function was used to calculate the global score statistic to test the overall association between haplotypes and log-transformed Lp(a). The same set of covariates used in the genotype analyses were used in the haplotype analyses.

Single variant and gene-based analysis of SNPs on exome array

Single variant and multivariant, gene-set or ‘gene-burden’, association analyses were performed for variants appearing on the exome array. Gene-burden tests performed included the Madsen–Browning test27 and the SKAT-Optimal (SKAT-O)28 test, which picks the ‘best’ combination of the SKAT29 and a Madsen–Browning27 test for gene-based testing. We performed three levels of gene-based analyses: (level 1) the combination of stop-loss, stop-gain and splice-site regardless of MAF; (level 2) the combination of SNPs in level 1 and all variants, which are predicted to be ‘damaging’ using PolyPhen30 and SIFT31 regardless of MAF; (level 3) the combination of stop-loss, stop-gain, splice-site and nonsynonymous variants with MAF upper limits of 3%.

To investigate how much common or rare SNPs associated with Lp(a) in the single-variant analyses explain the gene-based test results, conditional analyses were performed by including the allele count at these lead SNPs as covariates. To control for the possible confounding of Lp(a) association results because of an association between both SNP and Lp(a) with other lipid traits (HDL, LDL, TG and TC), we performed regression models as described above but now including the lipid traits an additional covariate.

Statistical significance

A significance threshold of P<5 × 10−8 was used to define genome-wide significance for all individual SNP results (both GWAS and exome array). Test statistics for all individual SNP results, both GWAS and Exome-chip, were adjusted by the genome inflation factor (λ) before calculating significance (P-values) to account for any possible systematic bias in results. A gene-based association result was defined to be significant if P<0.05 per number of genes. The number of genes for level 1, level 2 and level 3 inclusion criteria were 4752, 13 658 and 15 963, resulting in significance thresholds of 1.1 × 10−5, 3.7 × 10−6 and 3.1 × 10−6, respectively.

Results

Descriptive statistics of the JHS participants in this study were summarized in Table 1.

Admixture mapping for determinants of Lp(a)

Consistent with observed higher levels of Lp(a) in AAs versus EAs, higher levels of estimated global, or average, African ancestry was significantly associated with higher levels of log-transformed Lp(a) (β=0.76, P=1.3 × 10−9). The estimated global proportion of African ancestry obtained from ADMIXTURE was highly correlated with the first principal component from EIGENSTRAT (correlation=0.998). Admixture mapping showed a highly significant association between increased African ancestry at chromosome 6q25.3 and increased Lp(a) levels (Supplementary Figure 1) after adjusting for the global proportion of African ancestry. The estimated global proportion of African ancestry became nonsignificant after including the estimated local ancestry at 6q25.3 in the regression models. SNP rs505000, upstream of SLC22A3, was the local ancestry informative SNP most strongly associated with Lp(a) levels (P=8.2 × 10−27, LOD=24.95).

GWAS SNPs associated with Lp(a) at P<5 × 10−8

Q–Q plots revealed evidence for systematic inflated association results likely because of left-censoring of the Lp(a) measures in a subset of JHS participants (Supplementary Figure 2a). After excluding all the SNPs on chromosome 6, the distribution of the remaining P-values across the genome still demonstrated inflation (Supplementary Figure 2b). After controlling for the genomic inflation factor (λ=1.14), Q–Q plots revealed no substantial evidence for inflation (Supplementary Figures 2c and d). The genomic inflation factor was applied to all individual SNP results. Only the chromosome 6q region reached genome-wide significance for Lp(a) levels (Supplementary Figure 2e) after adjusting for the inflation factor. Overall, 804 SNPs reached genome-wide significance (P<5 × 10−8) (all on chromosome 6), and the top SNPs that reached genome-wide significance are listed in Table 2.

Table 2 Top SNPs that reached genome-wide significance (P<5 × 10−8) after controlling for genomic inflation factor

Lp(a) GWAS results on chromosome 6

All 804 significant SNPs on chromosome 6 mapped to the 6q region, spanning from 153 917 144 to 163 745 411 bp and containing more than 10 genes (Figure 1 and Supplementary Table 1). The strongest signal (rs115848955, P=3.1 × 10−55, MAF=0.05) mapped to the LPA gene. LPA encodes a modified form of low-density lipoprotein, in which a large glycoprotein (Apo(a)) is covalently bound to apolipoprotein B by a disulfide bridge,32 and structurally, the Apo(a) chain contains a region homologous with plasminogen, which gives Lp(a) antifibrinolysis activity by competing with plasminogen’s binding to fibrin. After adjusting for the top SNP (rs115848955), the top associated variant was rs9355814 (P=7.8 × 10−21) (Figure 2 and Supplementary Table 2) and 469 SNPs in the 6q region spanning from 159 092 125 to 163 745 411 bp remained genome-wide significant. However, after adjusting for the local ancestry informative marker closest to rs115848955 (i.e. rs6923917), only 406 SNPs in a relatively narrow region spanning from 160 633 560 to 161 342 219 bp remained significant (P<5 × 10−8) (Supplementary Table 3). The top SNP in the conditional analysis including local ancestry was rs138429428 (P=4.2 × 10−50, Figure 3). The local ancestry estimate closest to rs115848955 explained an estimated 4.4% of the total variation of Lp(a) after accounting for age, gender, BMI and the first 10 PCs. SNP rs115848955 explained an estimated 8.8% of the total variation of Lp(a) after accounting for age, gender, BMI, the first 10 PCs, and the local ancestry estimate nearest this location. In a series of iterative conditional (forward step-wise) analyses that also contained the local ancestry estimate at rs6923917, 11 SNPs remained genome-wide significant (P<5 × 10−8) in the LPA region (see Supplementary Table 4 for the 11 SNP model results). In aggregate, these 11 SNPs plus the local ancestry estimate at rs6923917 explained 28.8% of the total variation in Lp(a) after accounting for age, gender, BMI and the first 10 PCs. Many more LPA region SNPs remained nominally significant (P<0.05) in conditional models including these 11 SNPs.

Figure 1
figure 1

Regional plot of the −log10(P) values for the single-nucleotide polymorphisms (SNPs) in the chromosome 6q region for Lp(a). The X axis shows the human genome build 19 coordinates (Mb) and the genes in the region. The Y axis shows the −log10 association P-values of SNPs on the left, and recombination rates in cM per Mb on the right. Different colors of shading indicate the strength of linkage disequilibrium (LD) (R2) between the top SNP and the other SNPs tested in the region.

Figure 2
figure 2

Regional plot of the −log10(P) values for the single-nucleotide polymorphisms (SNPs) in the chromosome 6q region for Lp(a) after adjusting for the top SNP rs115848955 at this locus. The top SNP was rs9355814 after adjusting for rs115848955.

Figure 3
figure 3

Regional plot of the −log10(P) values for the single-nucleotide polymorphisms (SNPs) in the chromosome 6q region for Lp(a) after adjusting for the local ancestry estimate at rs6923917. The top SNP was rs138429428, and only SNPs in a relatively narrow region spanning from 160 633 560 to 161 342 219 bp remained significant (P<5 × 10−8).

Evidence for multiple associated SNPs in the LPA region on chromosome 6

Fifteen directly genotyped SNPs in the 1 Mb region surrounding the top associated SNP (rs9457986) on chromosome 6 were identified as having stable, independent associations with Lp(a) (RMIP score >0.75) in multivariant models using LLARRMA, as shown in Supplementary Table 5. The linkage disequilibrium estimates between these SNPs were not high, as shown in Supplementary Figure 3. Five of the SNPs (rs6415084 (RMIP=0.93), rs3798221 (RMIP=1), rs9457986 (RMIP=1), rs1367211 (RMIP=0.97) and rs1406888 (RMIP=1)) were in the LPA gene. The five Lp(a) SNPs together explained an estimated 7.3% of the total variance of Lp(a) after accounting for age, gender, BMI, the first 10 PCs and local ancestry estimate at rs9457986. These five SNPs were used to construct haplotypes and to estimate the effect on Lp(a) levels for each additional copy of a particular haplotype, compared with the reference haplotype (Supplementary Table 6). There was significant evidence for an overall association between haplotypes and Lp(a) (global P=1.12 × 10−55); six individual haplotypes were minimally nominally associated (P<0.05) with Lp(a) levels.

Single variant and gene-based analysis for exome array

Seven SNPs on the exome array were associated (P<5 × 10−8) with Lp(a) (Table 3) after adjustment for the genomic inflation factor, six of them were in the chromosome 6q LPA region, and the other was the widely reported chromosome 19 APOE SNP rs7412. Five of the six associated chromosome 6q SNPs remained genome-wide significant after adjusting for top GWAS SNP rs115848955 (Table 3). Five of the six SNPs (excepting rs41272114) were intronic or in intergenic regions. Of the seven SNPs, only rs7412 was not significant in the 1000G imputed GWAS results. Rs7412 (P=3.3 × 10−8) was poorly imputed based on 1000G data and was not included in the original GWAS analyses. In the gene-based analyses, the LPA gene reached exome-wide significance for all three levels of SNP inclusion when using the Madsen–Browning test, as well as for level 2 and level 3 SNPs in the SKAT-O test. The APOE gene reached exome-wide significance for level 2 SNPs using the Madsen–Browning test (Table 4). Q–Q plots for the single-variant (Supplementary Figure 4) and gene-based analyses (Supplementary Figure 5) showed no evidence for systematic inflation in results.

Table 3 SNPs on Exome-chip that reached exome-wide significance (P<5 × 10−8) after controlling for genomic inflation factor
Table 4 Genes on Exome-chip associated with Lp(a) in gene-based analysis

Uncommon functional variants in LPA and APOE

All splice-site, stop altering and nonsynonymous SNPs in LPA and APOE on the exome array are listed in Supplementary Table 7 and Supplementary Table 8, respectively. The most significant individual SNP in LPA on the exome array was rs41272114 (P=6.5 × 10−12, MAF=0.01), which is a splicing-altering variant. LPA SNPs were significantly associated with Lp(a) levels in the gene-burden tests (P=1.2 × 10−21 for level 3 SNPs using Madsen–Browning test). The level 3 gene-based Madsen–Browning test remained significant after removing the top two individual SNP results (i.e. rs41272114 and rs41272110) (Supplementary Table 9). Only three APOE SNPs, including the aforementioned rs7412, were successfully genotyped and informative on the exome array. A second nonsynonymous APOE SNP (rs769455, MAF=0.02), which is specific to populations of African descent, demonstrated strong nominal results that were independent of rs7412 (P=2.0 × 10−5 before adjustment for rs7412 and P=3.5 × 10−5 after adjustment for rs7412). Taken together, rs7412 and rs769455 explain ~2.3% of the total variation of Lp(a) after adjusting for age, gender, BMI and the first 10 PCs.

As APOE is a strong risk factor for lipid traits (LDL and TC), we investigated whether the association between APOE SNPs and Lp(a) were attenuated when adjusting for lipid traits. After adjusting for the untreated LDL and TC levels, we found that the evidence for association decreased, but the signal did not totally disappear (Supplementary Table 8, P-value for rs7412 dropped from 3.2 × 10−8 to 7.6 × 10−4).

Discussion

We conducted genome- and exome-wide association studies for Lp(a) in 2896 AA participating in the JHS. A higher level of estimated global African ancestry was significantly associated with a higher level of Lp(a), and this association of global ancestry was largely explained by the association between Lp(a) and local ancestry in the chromosome 6q25.3 region. We observed significant (P<5 × 10−8) associations for hundreds of SNPs spanning ~10 Mb region on 6q surrounding the LPA gene. Interestingly, after adjusting for local ancestry, the region containing significantly associated SNPs got much narrower and was centered over the LPA gene (<1 Mb). Significant haplotypic effects were also detected in the LPA region that implicates numerous causal variants. A single APOE SNP, rs7412 on the exome array, also reached genome-wide significance. Gene-burden tests found significant associations between Lp(a) and aggregate collections of SNPs in LPA and APOE.

Previously, Deo et al.11 performed a targeted study of the Lp(a) region, using haplotype tagging SNPs, in 4464 JHS participants and 1726 AA participants from the Dallas Heart Study.11 This study also performed a genome-wide admixture analyses based on a panel of 1447 ancestry informative markers, including subjects in the upper and lower quintile of the Lp(a) distribution. Herein, we markedly expand the coverage of both common and rare variants across the entire LPA region; estimate local genetic admixture at 738 831 autosomal SNP locations and perform admixture mapping including all subjects with Lp(a) measures. Eight-hundred and four SNPs in the 6q region (spanning ~10 Mb) were significantly associated with Lp(a) levels (P<5 × 10−8). After adjusting for the most strongly associated SNP rs115848955, multiple signals at 6q region spanning ~5 Mb remained significantly associated with Lp(a). After adjusting for the local ancestry at 6q25.3, the region that harbors SNPs significantly associated with Lp(a) (P<5 × 10−8) became much narrower (from 9.8 to 0.7 Mb) and was centered around the three genes SLC22A, LPL2 and LPA. This result suggests confounding between local ancestry and SNPs spanning the larger 6q region identified to be associated with Lp(a). Given the relatively recent admixture in the AA population, local ancestry can confound associations across a relatively large region surrounding the population-specific or population-enriched causal variant(s).33, 34 The observation that the associations in and near LPA remains robust after adjustment for local ancestry at LPA, whereas the evidence for association further away markedly declines, suggests that the ancestry-specific (or highly-enriched) causal risk variant(s) resides in or near LPA and that most, if not all, of the observed associations outside this narrower region are spurious associations. Interestingly, a similar extended region of association with Lp(a) surrounding LPA has been observed in more homogeneous European populations. An obvious candidate to explain some of the differences in association results between European and African populations is the Kringle IV polymorphism, which has not been measured in JHS participants. Deo et al.11 demonstrated that the Kringle IV polymorphism explains some of the ancestry effect differences, but noted that several associated common SNPs, with strong allele frequency differences between populations of African and European ancestry, in and around LPA explain the majority of the population differences in Lp(a) levels. To our knowledge, there is no evidence suggesting the allele frequency differences between populations at these, or other nearby LPA variants, are due to selective pressures. Future functional studies will be necessary to better understand the significance of these allele frequency differences.

A common nonsynonymous variant at APOE on exome array, rs7412, was identified to be significantly associated with Lp(a) in single-variant analysis (MAF=0.11, leading to an Arg to Cyc substitution, P=3.2 × 10−8). Another low-frequency non-synonymous variant at APOE, rs769455, was also nominally associated with Lp(a) (MAF=0.02, leading to an Arg to Cyc substitution, P=2.0 × 10−5). Prior studies have investigated the relationship between APOE genotypes and Lp(a) levels, but the results were inconsistent. Some studies reported no impact of APOE genotypes on Lp(a) levels,35, 36, 37, 38, 39 whereas others reported significant associations between them.40, 41, 42, 43, 44, 45 A recent study found that among AAs, lower Lp(a) levels were observed in APOE ɛ2 carriers, and this association was only observed in subjects with large apoA size (defined as >26 Kringle IV repeats) but not in the subjects with small apoA size.46 Another study on Caucasian males also reported that the effect of APOE on Lp(a) levels was only observed in subjects of largest quartile of apoA size, but with lower Lp(a) levels for APOE ɛ 4 carriers.42 In our study, APOE ɛ2 genotype was associated with lower Lp(a) levels, which is consistent with the prior study on AAs. We found no evidence supporting any such interaction between APOE genotype and any of our top LPA region SNPs (data not shown).

In summary, we observed that local ancestry at 6q25.3 was an important risk factor for Lp(a) in AA, and that SNPs at the well-established LPA locus were significantly associated with Lp(a) (P<5 × 10−8) after adjusting for the local ancestry at 6q25.3. Before covariate adjustment for local ancestry at 6q25.3, the observed region containing associated SNPs spanned ~10 Mb. After covariate adjustment, the associated region was only 700 kb. We also observed a significant association for a nonsynonymous variant in APOE. Future large multiethnic studies which include high-coverage sequence data, and the Kringle IV polymorphism, would be ideally suited to better understand the complex genetic architecture of the LPA region that leads to strong population differences in Lp(a) levels.