Introduction

Cardiovascular disease (CVD) ranks as the leading cause of morbidity and mortality globally1, and increases extraordinarily in the developing country2. Cardiometabolic risk3,4 especially lipid metabolism dysfunction5 represents a key event in atherosclerosis, a pathogenesis of CVD. High total cholesterol (TC), triglyceride (TG), low-density lipoprotein cholesterol (LDL-C) and apolipoprotein (Apo) B concentrations, as well as low high-density lipoprotein cholesterol (HDL-C), ApoA1 levels and the ApoA1/ApoB ratio (A1/B) are considered as complex traits to which both genetic and environmental factors contribute6,7.

Despite hundreds of genome-wide hits from genome-wide association studies (GWAS), a large portion of variations in lipid metabolism attributable to heritability remains unexplained8. Because of stringent statistical cutoffs necessary in the GWAS methodology, it is argued many common variants with an appreciable effect on phenotypic variations are reported as false negatives and dismissed9. To correct away the hidden heritability, fine mapping follow from high-density replicated GWAS data need only use the tag single nucleotide variants (SNVs) and regions of linkage disequilibrium (LD) independent of annotation or relationship to nearby genes10.

Recently, the compelling genes for modifying lipid metabolism emerged from very large replicated GWAS: the ATP/GTP binding protein-like 4 gene (AGBL4 [MIM 616476]), the LDL receptor related protein 8 gene (LRP8 [MIM 602600]) and the proprotein convertase subtilisin/kexin type 9 gene (PCSK9 [MIM 607786])11,12,13,14,15,16. The objective of this study was to perform association analysis to identify integrative mutations, haplotypes and gene-by-gene (G × G) interactions of the AGBL4 (rs320017 A > G, rs320018 A > G and rs320019 G > A), LRP8 (rs6694764 G > A, rs1288519 A > C, rs872315 G > A, rs1288520 C > T and rs1288521 C > T) and PCSK9 (rs533375 C > T, rs584626 A > G, rs585131 A > G and rs540796 G > A) associated with lipid phenotypic variations in the Jing and Han populations. Furthermore, we wanted to test if the association analysis of these loci based on haplotype clusters and G × G interactions increase power over single-locus tests.

Results

Study participants

Demographic, epidemiological and clinical characteristics of the 2, 552 analyzed study subjects are summarized in Table 1. The values of body mass index (BMI), waist circumference (WC) and the percentage of individuals whom consumed alcohol were higher, as well as the level of systolic blood pressure (SBP) was lower in Jing than Han (P < 0.05–0.001). For plasma lipid phenotypic variations, there were higher plasma TC and TG levels, as well as lower A1/B in Jing (P < 0.001, for each). However, no difference was noted in fasting plasma glucose, HDL-C and LDL-C levels between the two ethnic groups (P > 0.05 for all).

Table 1 Demographic, epidemiological and clinical characteristics.

Single-mutation association

The detected 12 mutations in this motif are located in a closely genomic region of chromosome 1 (Fig. 1). As shown in Tables 2 and 3, the genotype and allele frequencies of these variants were different between the two populations (P < 0.05–0.001). All mutations exhibit the Hardy-Weinberg equilibrium (HWE, P > 0.05 for all). We tested each mutation individually for association with plasma lipid levels separately in each population. We discovered the association of the AGBL4, LRP8 and PCSK9 mutations with TC (rs320017, rs320018, rs320019 and rs533375), TG (rs320017, rs320018, rs320019, rs6694764, rs872315, rs1288520, rs1288521, rs533375, rs584626, rs585131 and rs540796) and HDL-C (rs6694764, rs1288519, rs872315, rs1288520, rs1288521 and rs585131) in Jing minority. However, in the Hans, with TG (rs320017, rs320018, rs320019, rs1288519, rs872315, rs1288521, rs533375, rs584626, rs585313 and rs540796), HDL-C (rs6694764 and rs584626), LDL-C (rs6694764 and rs1288520), ApoA1 (rs6694764, rs1288519, rs1288520, rs1288521, rs533375 and rs584626), ApoB (rs320019 and rs5333375) and A1/B (rs320017, rs320018, rs320019 and rs533375). (P < 0.05–0.001; Fig. 2).

Table 2 Prevalence of genotype frequencies in the Jing and Han populations [n (%)].
Table 3 Prevalence of allele frequencies in the Jing and Han populations [n(%)].
Figure 1
figure 1

The positions of the AGBL4, LRP8 and PCSK9 mutations.

Figure 2
figure 2

Single-mutation association with lipid phenotypic variations.

Haplotype-based association

Multiple-locus linkage disequilibrium (LD) elucidated the detected sites were not statistically independent separately in each population. Figures 3 and 4 show the LD blocks and the haplotypes for blocks separately in the Jing and Han ethnic groups. As shown in Table 4, the commonest haplotypes were AGBL4 A-A-G, LRP8 G-A-G-C-C and PCSK9 C-A-A-G (>50% of the samples). The frequencies of the AGBL4 A-A-G, AGBL4 G-G-A, LRP8 A-A-G-T-C, LRP8 A-C-A-T-T, LRP8 G-A-G-C-C, PCSK9 C-A-A-G, PCSK9 C-G-A-A and PCSK9 T-G-G-A haplotypes were quantitative significantly different between the Jing and Han populations (P < 0.05–0.001). We confirmed that the AGBL4, LRP8 and PCSK9 haplotypes were associated with TC (AGBL4 G-G-A and PCSK9 C-G-A-A), TG (AGBL4 G-G-A, LRP8 G-A-G-C-C and PCSK9 C-A-A-G), HDL-C (AGBL4 A-A-G), ApoA1 (PCSK9 C-A-A-G), and A1/B (PCSK9 C-A-A-G) in Jing minority. However, they were associated with TG (AGBL4 G-G-A, LRP8 G-A-G-C-C and PCSK9 C-A-A-G), HDL-C (LRP8 A-A-G-T-C), LDL-C (LRP8 A-A-G-T-C), ApoA1 (PCSK9 C-G-A-A), ApoB (AGBL4 G-G-A) and A1/B (LRP8 A-C-A-T-T and PCSK9 C-A-A-G) in Han Chinese. (P < 0.05–0.001; Fig. 5).

Table 4 Prevalence of haplotype frequencies in the Jing and Han populations [n (frequency)].
Figure 3
figure 3

The LD plot represents pair-wise r2 and haplotypes frequency in the Jing population.

Figure 4
figure 4

The LD plot represents pair-wise r2 and haplotypes frequency in the Han population.

Figure 5
figure 5

Haplotype-based association with lipid-related traits.

G × G interaction-based association

As shown in Table 5, the commonest G × G interaction was A-A-G-G-A-G-C-C-C-A-A-G (>50% of the samples). The frequencies of the A-A-G-A-A-G-T-C-C-A-A-G, A-A-G-G-A-G-C-C-C-A-A-G, A-A-G-G-A-G-C-C-C-G-A-A, G-G-A-A-C-A-T-T-T-G-G-A and G-G-A-A-C-G-T-T-T-G-G-A G × G interactions were significantly different between Jing and Han populations (P < 0.05–0.001). We identified that the G × G interactions among the detected mutations of AGBL4, LRP8 and PCSK9 were related with TC (G-G-A-A-C-A-T-T-T-G-G-A), TG (A-A-G-G-A-G-C-C-C-A-A-G and A-A-G-G-A-G-C-C-C-G-A-A), HDL-C (A-A-G-A-A-G-T-C-C-A-A-G) and ApoB (A-A-G-A-A-G-T-C-C-A-A-G) in Jing minority. However, in the Hans, with TG (A-A-G-G-A-G-C-C-C-A-A-G and A-A-G-G-A-G-C-C-C-G-A-A) and LDL-C (A-A-G-A-A-G-T-C-C-A-A-G). (P < 0.05–0.001; Fig. 6).

Table 5 Prevalence of G × G interaction frequencies in the Jing and Han populations [n (frequency)].
Figure 6
figure 6

G × G interaction-based association with plasma lipid levels.

Integrative association analysis of mutation, haplotype and G × G interaction

Table 6 depicts the integrative association analysis of mutation, haplotype and G × G interaction of AGBL4, LRP8 and PCSK9 with lipid phenotypic variations separately in the two ethnic groups. Generalized linear models adjusted for age, gender, BMI, WC, SBP, DBP, pulse pressure, cigarette smoking, alcohol consumption and fasting plasma glucose level demonstrated mutations, haplotypes and G × G interactions of AGBL4, LRP8 and PCSK9 quantitative significantly correlated with lipid-related traits. (P < 0.05–0.001). Furthermore, the association analysis based on haplotype clusters and G × G interactions probably increased power over single-locus tests especially for TG.

Table 6 Association of integrative AGBL4, LRP8 and PCSK9 mutations, haplotypes and G × G interactions with lipid-related traits in the Jing and Han populations.

Discussion

The main finding of the present study encompass (i) it elucidated the frequencies of mutation, haplotype and the G × G inter-locus interaction among AGBL4, LRP8 and PCSK9 genes in the Jing ethnic minority and Han population, which may be proposed as an potential supplement to the 1000 Genomes database (ii) it gave integrative mutation, haplotype and G × G interaction evidence to prove there are possible interaction between the AGBL4, LRP8 and PCSK9 genes and serum lipid concentrations; and (iii) it demonstrated association analysis based on haplotype clusters and G × G interactions probably increased power over single-locus tests especially for TG.

Aspects of primary prevention differ in some respects in ethnic minority groups when compared with general population17. Jing, as a group of migrants from Vietnam to south of China, maintains the higher cardiometabolic risk especially higher TC and TG, and lower A1/B ratio than local Han population living in the same natural and social environments. It is important to recognize that definitions of cardiometabolic risk especially dyslipidemia derived in local Han population perhaps inappropriate for ethnic minority groups. Resulting disease risks may remain difference in second and third generation migrants, even though blood pressure, fasting plasma glucose level and cigarette smoking lifestyle are converging towards those of the general Han population. Our present study pronounces differences in genetics values. The challenge now is to ensure that prevention and treatment services are ready to respond to these demographic and ethnic structure. Epidemiological survey has revealed that the Jing ethnic minority maintains genetic homogeneity. In the present study, all of the mutations satisfied with HWE separately in each population. It has been proved that the Jing and Han populations have different genetic ancestry from a statistical point of view. Our results showed that there was quantitative significantly different distributions of the detected 12 mutations of AGBL4, LRP8 and PCSK9 genes, their haplotypes and their G × G inter-locus interactions between the Jing and Han populations. These genetic heterogeneity may be correlated with the heterogeneousness of cardiometabolic risk especially dyslipidemia between the Jing and Han populations.

Environmental exposures cannot be ignored. We summarized the values of weight, BMI and WC were significantly different between the two populations. Maybe they are related with the custom of fish intake. Jing is an oceanic ethnic minority like Kinh populations in North Vietnam, survival relying on fishing18. Maybe there are differences in saturated fatty acid (SFA), polyunsaturated tatty acid (PUFA; n-3 PUFA and n-6 PUFA), and monounsaturated fatty acid (MUFA)19 intake to compare with the local Han population in their diet structure. Unfortunately it is only a hypothesis, because lack of dietary intake data. Consensus exists pertaining to the scientific evidence regarding effects of various those bad dietary fatty acids rich in fish on cardiometabolic risk including lipid phenotypic variations reported in a previous study20. What’s more, the cardiometabolic risk is known to be lower in light-to-moderate alcohol drinkers than in abstainers21. The effects of alcohol on lipid metabolism, especially the HDL cholesterol-elevating effects, are thought to greatly contribute to the cardio-protective action of alcohol22. On the other hand, excessive alcohol consumption has been shown to cause hypertriglyceridemia23,24, which is a prevalent risk factor for CVD. With regard to mechanisms underlying the effects of alcohol on lipid metabolism25,26,27, alcohol consumption has been shown to increase the activity of lipoprotein lipase and decrease the activity of cholesteryl ester transfer protein, resulting in elevation of HDL cholesterol28. Hypertriglyceridemia induced by excessive alcohol drinking may be mainly due to an increase in the synthesis of large very low-density lipoprotein (VLDL) particles in the liver. Consistently, the % of participants who consumed alcohol was different between the two groups. Wine culture plays a pivotal role in the history of China Han ethnic group. Many Han populations are good at alcohol consumption, especially in festivals.

Our data come from nuclear family and pedigree data, unfortunately, pedigree information were not documented. Heritability is a measure of familial resemblance29. Estimating the heritability of a trait represents one of the first steps in the gene mapping process. Or we can estimate heritability for quantitative traits from nuclear and pedigree data using the ASSOC program in the Statistical Analysis for Genetic Epidemiology (S.A.G.E.) software package. Estimating heritability rests on the assumption that the total phenotypic variance of a quantitative trait can be partitioned into independent genetic and environmental components30.

A number of clinical studies have demonstrated that inhibition of PCSK9 alone and in addition to statins potently reduces lipid phenotypic variation concentrations31,32. Plasma lipid phenotypic variation especially plasma TG level is heritable and modifiable33. Several groups have successfully to identify signals for TG and other lipid traits, including HDL-C, LDL-C, and TC34. However, the lead GWAS signals may not themselves be functional rather in LD with the actual underlying susceptibility mutations. The limitation in GWAS derives from the fact that the human genome is superficially screened using single independently tag SNVs. It is acknowledged that complex disease is not caused by or associated with one single variant. The functional mutation often acts through regional gene mutations, including haplotypes and G × G interactions. Therefore, GWAS, epigenome-wide association studies (EWAS) and transcriptome-wide association studies (TWAS) are only a starting and require subsequent fine mapping and functional validation to identify the actual susceptibility variants and gene interactions. AGBL4, LRP8 and PCSK9 genes are neighbors. Integrative mutations, haplotypes and G × G interactions evidence connects AGBL4, LRP8 and PCSK9 gene to lipid phenotypic variations perhaps can further elaborate the clinical application of PCSK9 inhibitors.

There are several limitations in our study. Firstly, the number of participants available for minor allele frequency (MAF) of some mutations was not high enough to calculate a strong power as compared with many previous GWAS and replication studies. Secondly, as an association analysis and observation study, inherent methodologic limitations that generate bias and confounding mean that causal inferences cannot reliably be drawn. Thirdly, take into consideration the randomized clinical trials (RCTs) provide the best opportunity to control for confounding and avoid certain biases. Consequently, well-designed, high-quality further therapeutic intervention study, including prophylactic agent, treatment, surgical approach, or diagnostic test is needed. Moreover, there are still many unmeasured environmental and genetic factors including TFA, SFA, PUFA (including n-3 PUFA and n-6 PUFA) and MUFA that needed to be considered. In addition, the relevance of this finding has to be defined in further high caliber of studies including incorporating the genetic information of AGBL4, LRP8 and PCSK9 gene mutations, haplotypes and G × G interactions in vivo and vitro functional studies to confirm the impact of a variant on a molecular level including transcription and expression. The last but not the least, discussion of race and ethnicity in medicine must rigorously avoid polarization and the further perpetuation of disparate health care.

In summary, there are potential interaction between the AGBL4, LRP8 and PCSK9 genes and serum lipid concentrations. And the association analysis based on haplotype clusters and G × G interactions probably increased power over single-locus tests especially for TG. These genetic heterogeneity may be correlated with the heterogeneousness of cardiometabolic risk between the Jing and Han populations. Differences in lipid phenotypic variations between the two populations might partially attribute to AGBL4, LRP8 and PCSK9 gene mutations, haplotypes and G × G interactions.

Materials and Methods

Ethical approval

The study were carried out following the rules of the Declaration of Helsinki of 1975 (http://www.wma.net/en/30publications/10policies/b3/), revised in 2008. All participants from contributing populations gave written informed consent to participate in epidemiologic investigation and genetic analysis. All study protocols in this motif have approval from the Ethics Committee of the First Affiliated Hospital, Guangxi Medical University (No: Lunshen-2011-KY-Guoji-001; Mar. 7, 2011).

Subjects

Two groups of study population including 1272 unrelated participants of Jing (624 males, 49.06% and 648 females, 50.94%) and 1280 unrelated subjects of Han (636 males, 49.69% and 644 females, 50.31%) were randomly selected from our previous stratified randomized samples35. All participants were rural fishery (Jing) and/or agricultural (Han) workers from the three islands of Wanwei, Wutou and Shanxin in the county of Fangchenggang in the province of Guangxi, China, near the Sino-Vietnamese border. The participants’ age ranged from 18 to 80 years with a mean age of 57.27 ± 12.85 years in Jing and 56.85 ± 13.32 years in Han; respectively. The gender ratio and age distribution were matched between the two groups. All participants were essentially healthy with no history of coronary artery disease, stroke, diabetes, hyper- or hypo-thyroids, and chronic renal disease. They were free from medications known to affect lipid profiles.

Epidemiological survey

The epidemiological survey was carried out using internationally standardized method, following a common protocol36. Information on demographics, socioeconomic status, and lifestyle factors were collected with standardized questionnaires. Cigarette smoking status was categorized into groups of cigarettes per day: ≤20 and >2037. Alcohol consumption was categorized into groups of grams of alcohol per day: ≤25 and >2538. Several parameters such as blood pressure, height, weight and WC were measured, while BMI (kg/m2) was calculated. BMI was categorized into four groups: underweight (BMI < 18.5), normal weight (18.5 ≤ BMI < 24), overweight (24 ≤ BMI < 28) and Obesity (28 ≤ BMI)39. Likewise, WC was categorized into groups including normal group (WC ≤ 85 for male and WC ≤ 80 for female) and abdominal obesity (WC > 85 for male and WC > 80 for female)40.

Biochemical measurements

A fasting venous blood sample of 5 ml was drawn from the participants. The levels of fasting plasma TC, TG, HDL-C and LDL-C in the samples were determined by enzymatic methods with commercially available kits. Fasting plasma ApoA1 and ApoB levels were assessed by the immuneturbidimetric immunoassay.

Diagnostic criteria

The normal values of fasting plasma TC, TG, HDL-C, LDL-C, ApoA1 and ApoB levels, as well as the A1/B ratio in our Clinical Science Experiment Center were 3.10–5.17, 0.56–1.70, 1.16–1.42, 2.70–3.10 mmol/L, 1.20–1.60, 0.80–1.05 g/L, and 1.00–2.50; respectively41.

Mutation selection

We selected 12 mutations in the AGBL4, LRP8 and PCSK9 with the following assumption: (i) tag SNVs, which were established by Haploview (Broad Instituteof MIT and Harvard, Cambridge, MA, USA, version 4.2); (ii) functional mutations (http://snpinfo.niehs.nih.gov/snpinfo/snpfunc.htm) in functional areas of the gene fragment from NCBI dbSNP Build 132 (http://www-ncbi-nlm-nih-gov.ezp-prod1.hul.harvard.edu/SNP/); (iii) a known minor allele frequency (MAF) higher than 1% in European ancestry from the Human Genome Project Database; and (iiii) mutations might be associated with the lipid-related traits or cardiometabolic risk in the latest studies.

Genotyping

Genomic DNA was extracted from leucocytes of venous blood using the phenol-chloroform method. Genotyping of 12 mutations was performed by PCR and Sanger sequencing. The characteristics of each mutation and the details of each primer pair, annealing temperature, length of the PCR products are summarized in Supplemental Tables 1 and 2. The PCR products of the samples were sequenced with a sequencer ABI Prism 3100 Genetic Analyzer (Applied Biosystems, International Equipment Trading Ltd., Vernon Hills, IL, USA) in Shanghai Sangon Biological Engineering Technology & Services Co. Ltd., Shanghai China.

Statistical Analyses

The statistical analysis were performed with the statistical software SPSS 21.0 (SPSS Inc., Chicago, IL, USA). Quantitative variables were presented as the mean ±SD for those, that are normally distributed, whereas the medians and interquartile ranges for TG, which is not normally distributed. General characteristics between the two groups were compared by the ANCOVA. The distributions of the genotype, allele, haplotype and G × G interaction between the two groups were analyzed by the chi-squared test; The HWE, Pair-wise LD, frequencies of haplotype and G × G interaction comprising the mutations were calculated using Haploview (version 4.2; Broad Institute of MIT and Harvard). The association of the genotypes, haplotypes and G × G interactions with lipid phenotypic variations was tested by the Univariant. Any variants associated with the lipid phenotypic variations at a value of P < 0.05 were considered statistically significant. Generalized linear models were used to assess the association of the genotypes (common homozygote genotype = 1, heterozygote genotype = 2, rare homozygote genotype = 3), alleles (the minor allele non-carrier = 1, the minor allele carrier = 2), haplotypes (the haplotype non-carrier = 1, the haplotype carrier = 2) and G × G interactions (the G × G interaction non-carrier = 1, the G × G interaction carrier = 2) with lipid phenotypic variations. The model of age, gender, BMI, WC, SBP, DBP, pulse pressure, cigarette smoking, alcohol consumption and fasting plasma glucose level were adjusted for the statistical analysis. The pattern of pair-wise LD between the selected mutations was measured by D′ and r2 using the Haploview software.

Additional Information

How to cite this article: Guo, T. et al. Integrative mutation, haplotype and G × G interaction evidence connects ABGL4, LRP8 and PCSK9 genes to cardiometabolic risk. Sci. Rep. 6, 37375; doi: 10.1038/srep37375 (2016).

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.