Main

The paraoxonase 1 (PON1) gene belongs to the paraoxonase gene cluster on 7q21.3–22 and codes for an enzyme with broad substrate specificity.1 The PON1 enzyme has lactonase and esterase activity and thus is able to catalyze the hydrolysis of lipid peroxides and organophosphate pesticides.1,2 Although its physiologic function has not been fully elucidated, the PON1 enzyme attaches to high-density lipoprotein particles in serum and has been shown to inhibit low-density lipoprotein oxidation, suggesting that PON1 may play a role in atherogenesis.2 Two nonsynonymous PON1 polymorphisms with possible regulatory effects on enzyme activity,2 namely rs662 (c.575A>G or p.Gln192Arg) and rs854560 (c.163T>A or p.Leu55Met) have been extensively investigated as potential risk factors for atherosclerosis-related phenotypes, including coronary artery disease, peripheral arterial disease, and ischemic stroke.25

Two previously published systematic reviews suggested that the G allele of rs662 is associated with a small increase (per-allele odds ratio [OR] = 1.12) in the risk of coronary artery disease, whereas no such association was found for rs854560.4,5 Because cerebrovascular and coronary artery disease share many pathophysiologic mechanisms, it is plausible that rs662 could also be a risk factor for ischemic stroke.68 However, most published studies investigating the relationship between PON1 polymorphisms and ischemic stroke are small in sample size and inconclusive in their results. We therefore performed a meta-analysis to mitigate their shortcomings and summarize the totality of available published evidence on the association between the aforementioned PON1 polymorphisms and ischemic stroke.

METHODS

Search strategy

We searched the MEDLINE and SCOPUS databases, and the Human Genome Epidemiology Network Literature Finder (last search June 30th, 2009) to identify English-language studies investigating the association between the PON1 rs662 or rs854560 polymorphisms and ischemic stroke. Search terms included combinations of terms such as “paraoxonase,” “PON1,” “rs662,” “rs854560,” “Gln192Arg,” “Leu55Met,” “Q192R,” “L55M,” “stroke,” “cerebrovascular disease,” “cerebral infarction” and their synonyms. The exact search is available on request by the authors. We did not consider other PON1 polymorphisms or polymorphisms in other members of the PON family of genes (PON2, PON3) because the available evidence on them is limited. We perused the reference lists of all retrieved articles and relevant reviews. We also searched the online archives of Stroke, Annals of Neurology, and Cerebrovascular Diseases, three journals that have published several genetic association studies in ischemic stroke. Eligible studies were those that used case-control, nested case-control, or cohort designs and validated genotyping methods to investigate the frequency of the two polymorphisms in unrelated ischemic stroke patients and unaffected individuals. We did not consider narrative reviews, editorials, and letters to the editor or other articles not reporting primary research results. Family-based studies were excluded because their design and analysis is different from that of population association studies.

Data abstraction

One investigator abstracted detailed information from each publication regarding study design, matching and ascertainment of controls, demographics, ethnicity of participants (Caucasian continental ancestry, East Asian or other), definition of ischemic stroke, genotyping methods, disease stage, family history, and counts of genotypes and alleles in stroke cases and controls/unaffected individuals. We relied on the definitions used in each study to subclassify stroke. When studies reported genotype distributions per stroke subtype, we also extracted data for each subtype separately, to use in subgroup analyses. We excluded from main analyses studies where patients with ischemic and hemorrhagic stroke were merged together but examined their impact on the meta-analysis results in sensitivity analyses.

Deviations from Hardy-Weinberg equilibrium in the controls

For each study, we examined whether the distribution of the genotypes in the control group deviated from the Hardy-Weinberg equilibrium (HWE) predicted proportions using an exact test.9 For studies that did not provide genotype counts, but reported allele frequencies only, we relied on the authors' assessment of deviations from the HWE in the controls.

Meta-analysis

Main analyses compared allele frequencies of the variant and common allele (G vs. A for rs662 and A vs. T for rs854560) between cases and controls. We also evaluated dominant (variant allele carriers versus homozygotes for the common allele) and recessive (homozygotes for the variant allele versus all others) genetic models, for both polymorphisms. We used the OR as a metric of choice. We calculated summary ORs and their 95% confidence intervals (95% CI) using the DerSimonian and Laird random effects model.10 We tested for between-study heterogeneity with Cochran's Q statistic (considered statistically significant at P < 0.10) and assessed its extent with the I2 statistic.11,12 I2 ranges between 0 and 100% and expresses the proportion of between study variability that is attributed to heterogeneity rather than chance. Larger I2 values imply more extensive heterogeneity.

Subgroup and sensitivity analyses

We performed prespecified subgroup analyses by source of controls (healthy versus diseased), ethnicity, use of imaging to confirm the diagnosis of stroke, HWE in the control group, and stroke subtype (atherothrombotic versus cardioembolic). We tested for “small-study effects” (differential magnitude of effects in large versus small studies)13 with the Harbord modification of the Egger test.14 These tests are often erroneously referred to as “publication bias tests.”13,15 We used random effects meta-regression to compare the OR of the first study with the summary OR of subsequent studies or for comparisons between subgroups, as suggested elsewhere.16,17 We did not perform adjustments for deviations from the HWE in control genotypes because the necessary genotype counts were not available in a substantial number of studies.9,18

Assessing the probability of “false positive” findings

Associations with P-values <0.05 are conventionally referred to as “formally statistically significant.” Such associations can nevertheless be spurious “false positives,” as the end result of chance or bias. Letting biases aside, the probability that a given formally significant result is a “false positive” increases with diminishing prior data in support of the association, decreasing (true) strength of the association, and decreasing statistical power to detect it.19,20 We calculated the probability that associations with P-values lower than 0.05 are “false” as suggested by Wacholder et al.19,20 To this end, we assumed that the true OR was 1.05 (very small effect, similar to those detected in large meta-analyses of genomewide studies), 1.20 or 1.50 (modest effect, expected in most common diseases21), and that the prior probabilities of genuine association ranged from 10−8 (a conservative prior, akin to a variant of genomewide significance) to 0.10 (an indicator of strong prior support, appropriate for a functional polymorphism in a gene with strong biological plausibility).20 We considered probabilities of “false” association smaller than 0.20 indicative of an important finding.

Software

Analyses were performed using Stata (version 11/SE, Stata Corp., College Station, TX) and MetaAnalyst (version 3.0 beta, Tufts Medical Center, Boston, MA).22 For all tests, except those for heterogeneity, P-values were two sided and statistical significance was defined as P < 0.05. We did not perform any adjustments for multiple comparisons.

RESULTS

Literature flow

Our search identified 1040 citations of which 37 were considered potentially eligible and were retrieved in full text. Of those, 19 were excluded (four assessed PON1 activity but not genotypes, nine did not report on ischemic stroke, four were reviews/editorials, and two23,24 reported on overlapping populations with already included studies). Finally, 22 studies (i.e., 22 independent case-control strata) reported in 17 publications were eligible for the main analyses (Fig. 1).2339

Fig. 1
figure 1

Literature flow. *Xu et al.39 reported information on five independent case-control groups that were considered as separate “studies” in the analyses. Wang et al.38 reported information on five independent case-control groups that were considered as separate “studies” in the analyses.

Study characteristics

Detailed study characteristics are presented in Table 1. All 22 studies (7384 cases/11,074 controls total) evaluated the rs662 variant and 16 of them (5518 cases/8951 controls total) reported genotyping for the rs854560 variant as well. Sample sizes ranged from 48 to 2092 (median 339). Eleven studies included populations of Caucasian, 10 of East Asian, and 1 of Hispanic descent. For most studies (n = 16), there was no information on ischemic stroke subtypes. Three studies reported separate genotype distributions for atherothrombotic and cardioembolic strokes and three included only atherothrombotic strokes. The control groups in four studies for rs662 and one for rs854560 were not in HWE. Notably, only one study reported that genotyping was conducted blinded to the case/control status of participants and no study used genotyping quality-control procedures.

Table 1 Characteristics of eligible studies

Meta-analysis of rs662

Figure 2 shows the forest plot for rs662. The summary random effects OR was 1.10 per G allele copy (95% CI, 1.04–1.17; P = 0.001), with no evidence for statistical heterogeneity (PQ = 0.12, I2 = 27%). The association remained significant under a dominant model (OR = 1.31; 95% CI, 1.05–1.63; P = 0.02) and a recessive model (OR = 1.22; 95% CI, 1.05–1.43; P = 0.01), with evidence of between-study heterogeneity in the dominant model (P < 0.001, Table 2). These results remained unchanged in terms of magnitude and significance, when the two additional studies that included a minority of hemorrhagic strokes were also included (24 case-control strata, 8008 cases/13,810 controls; per-allele OR = 1.12; 95% CI, 1.05–1.20; P = 0.001), but there was evidence of between-study heterogeneity (P = 0.02).

Fig. 2
figure 2

Forest plot of PON1 rs662 and ischemic stroke. Meta-analysis of studies investigating the association of PON1 rs662 with ischemic stroke using a random effects model. Each study is shown by the point estimate of the OR (square proportional to the weight of each study) and 95% CI for the OR (extending lines); the summary OR and 95% CIs by random effects calculations is depicted as a diamond. Values higher than 1 indicate that the G allele is associated with increased ischemic stroke risk.

Table 2 Main analysis, subgroup, and sensitivity analysis for studies investigating the association between PON1 rs662 and ischemic stroke risk

Meta-analysis of rs854560

Regarding rs854560, there was no evidence of an association with ischemic stroke risk (OR = 0.97 per A allele copy; 95% CI, 0.90–1.04; P = 0.37) (Fig. 3). Between-study heterogeneity was nonsignificant (P = 0.23; I2 = 20%). Alternative genetic models also did not reveal evidence of an association (Table 3).

Fig. 3
figure 3

Forest plot of PON1 rs854560 and ischemic stroke. Meta-analysis of studies investigating the association of PON1 rs854560 with ischemic stroke using a random effects model. Values higher than one indicate that the A allele is associated with increased ischemic stroke risk. Layout similar to Figure 2.

Table 3 Main analysis, subgroup, and sensitivity analysis for studies investigating the association between PON1 rs854560 and ischemic stroke risk

Additional analyses

Overall, subgroup analysis results were consistent with the main analyses. Tables 4 and 5 present meta-regression analyses for the study-level covariates that we investigated for rs622 and rs854560, respectively. Overall, the factors we explored did not significantly affect the effect size of the genetic associations investigated.

Table 4 Meta-regression results for PON1 rs662
Table 5 Meta-regression results for PON1 rs854560

Figure 4 depicts the calculated probability of false association between rs622 and ischemic stroke (not applicable for rs854560 where an association was not found). The figure demonstrates that unless there is a strong prior belief in the association between rs622 and ischemic stroke, the probability that the association is a false positive finding is relatively high, for all possible ORs examined.

Fig. 4
figure 4

Probability of “false positive” association between PON1 rs622 and ischemic stroke. Estimation of the probability of “false positive” association between rs622 and ischemic stroke plotted against a wide range of prior probabilities for a genuine association. Calculations as described in Wacholder et al. 2004.20 The horizontal dashed line indicates a threshold of a relatively low probability of “false positive” association, operationally set at 20%.

There was no evidence that smaller studies had systematically different results compared with larger studies (Harbord test P = 0.14 for rs662 and P = 0.75 for rs854560). For rs662, the OR of the first study (1.57, 95% CI, 1.22–2.03) was statistically significantly different and more extreme compared to the pooled OR of all subsequent studies (interaction test P = 0.005). Omitting the first study reduced between study heterogeneity (PQ = 0.42; I2 = 3%) and the association remained significant (OR = 1.08; 95% CI, 1.03–1.14; P = 0.002). Including versus excluding the first study resulted in no appreciable changes for rs854560.

DISCUSSION

In this systematic review and meta-analysis of epidemiological studies, we found a statistically significant association between the PON1 variant (arginine-encoding, G) allele at rs662 and ischemic stroke. The magnitude of this association is small, as expected for common variants and common diseases. In addition, we found no evidence for an association between rs854560 and the same phenotype. The meta-analyses results were robust in subgroup and sensitivity analyses; however, these analyses may not be powered to detect modest between-subgroup differences because of the relatively small number of studies per subgroup. Although there is a biologically plausible role for PON1 in the pathogenesis of ischemic stroke, we estimated that the probability of the association between rs662 and ischemic stroke being “false-positive” is relatively high for a wide range of assumptions.

The PON1 enzyme attaches to high-density lipoprotein particles and prevents low-density lipoprotein oxidation1 and may therefore have a role in the pathophysiology of cardiovascular disease development.40,41 There is evidence that polymorphisms in the PON1 gene influence PON1 activity. Rs662 modifies PON1 enzymatic activity in a substrate-dependent manner, and rs854560 is in linkage disequilibrium with functional promoter polymorphisms.41,42 Yet, PON1 polymorphisms explain only a fraction of the variability in PON1 serum activity,2,43,44 and it is likely that additional genetic and environmental influences contribute to the ischemic stroke phenotype.

The identified association between rs622 and ischemic stroke is consistent with the modest relationship between rs622 and coronary artery disease risk, which has a summary OR of 1.12 per G allele copy in recent meta-analyses of a different set of studies than those summarized here.4,5 Such congruent levels of risk for ischemic stroke and coronary artery disease conferred by the same variant have been also described for other genetic associations.45 The fact that these two correlated atherosclerotic phenotypes are associated with the rs622 variant has several alternative explanations.

First, it is possible that the associations of rs662 with both phenotypes are genuine and independent of each other, in which case further laboratory investigation is required to elucidate the underlying mechanisms. Another possible explanation is that the rs622 variant primarily affects an intermediate or surrogate phenotype common to the pathophysiology of both coronary artery disease and ischemic stroke, such as dyslipidemia,34,46 increased carotid intima-media thickness,6,46 or inflammation.47 If this is true, the direct effect of rs622 on the hypothesized intermediate phenotype is expected to be quite larger than the observed associations on the downstream clinical phenotypes of ischemic stroke and coronary artery disease (both around 1.10 per copy of G allele).

Conversely, it may be that only one of the associations is true or that both are spurious. If only one of the two phenotypes was truly associated with rs622, then the other would also appear to be associated as well, because coronary artery disease and ischemic stroke tend to occur together, i.e., they are correlated. However, in that case, the OR of the phenotype that is not independently associated with the PON1 genotype would be expected to be closer to the null (even for strong between-phenotype correlations).

The association of rs662 and ischemic stroke seems to be consistent with the “winner's curse” phenomenon, where the first publication on a gene-disease association reports a spurious or exaggerated effect size that is not replicated by subsequent research.16 “Winner's curse” may be the result of various selection biases, such as time-lag bias, shortcomings in the design, and conduct of individual studies, or chance. Briefly, time lag bias exists when the order of study publication depends on study results, e.g., with statistically significant studies being published first, and nonsignificant studies published subsequently. In its extreme form, nonsignificant studies remain unpublished (publication bias). Publication bias likely exists in genetic and genomic topics but cannot be measured directly. This is because most of the so-called “publication bias diagnostics” simply test for systematic differences between more and less precise studies, for which publication bias is only one of many possible explanations.13,15

Further, although there are no validated quality characteristics to distinguish association studies with higher versus lower risk of bias, one can use criteria with substantial face validity. In our topic, only one study explicitly mentioned blinding of investigators to the case/control status of participants, no studies reported using genotyping quality control procedures, and in several studies, the genotypic frequencies in the control groups deviated from those expected under HWE. Finally, as shown in Figure 4, the probability that the association is due to chance (“falsely” positive) is relatively high for a wide range of assumptions. It remains to be defined whether functional evidence on rs622 and pathophysiological evidence implicating PON1 in ischemic stroke are supportive of strong biological plausibility. In such case, further studies would be required to disentangle the mechanistic effects of this genetic variant and confirm the findings of our meta-analysis.

In conclusion, we found evidence of a weak association between rs662 and ischemic stroke risk, similar in magnitude to the corresponding association of the variant with coronary disease. Genetic variation in the paraoxonase gene cluster merits further investigation, preferably using haplotyping approaches to comprehensively assess its relationship with atherosclerotic disease risk and elucidate the molecular basis of the observed genetic effects.