Introduction

Cerebral infarction (CI) is a leading cause of disability and death worldwide. Vascular neurologists have recently put forward a new ‘universal’ tissue definition of CI.1 This review proposes that CI be defined as brain or retinal cell death because of prolonged ischemia. It is generally accepted that atherosclerosis of the cerebral vessels induces cerebral ischemia. Some studies2 have shown that the proportion of causative factors for CI explained by genetic factors may be as large as 20 to 40%, although diet, exercise and smoking remain critical risk factors. A number of polymorphisms in candidate genes are associated with atherosclerosis in certain populations and have been of considerable help in identifying CI susceptibility loci.3

Recently, the purinergic system has been shown to have an undisputed and crucial role in the modulation of vascular tone, and to be of similar importance as a regulator of the inflammatory response occurring in atherosclerosis.4 Purinergic receptors are ubiquitously expressed throughout the human body and are classified as P1 receptors and P2 receptors. P2 receptors include P2X and P2Y receptors, and the latter are seven-membrane span receptors coupled through G proteins. At least eight P2Y receptor subtypes (P2Y1, P2Y2, P2Y4, P2Y6, P2Y11, P2Y12, P2Y13 and P2Y14) have been identified5 in pharmacological and molecular cloning studies. The P2Y2 receptor, which is also referred to as P2RY2 (official name: purinergic receptor P2Y, G-protein coupled, 2), stimulates the growth of vascular smooth muscle cells (SMCs), endothelial cells and blood cells, and is involved in platelet aggregation and coagulation of regulation and inflammation, all of which are associated with the development of CI.6

Extracellular nucleotides, including adenosine triphosphate (ATP) and uridine triphosphate, that are released from a variety of arterial and blood cells can bind to P2RY2 on the cell surface. Activation of P2RY2 has been shown to induce both the proliferation and migration of vascular SMCs and apoptosis, a process involved in the evolution of atherosclerotic plaques.7 Some evidence suggests that P2RY2 is implicated in the development of vascular disease.8 Interestingly, a more recent study found that high shear stress, associated with vascular diseases, can selectively upregulate P2Y2 receptors in perfused arterial SMCs.9 It has been proposed that upregulation of P2Y receptors is a potential diagnostic indicator for the early stages of atherosclerosis.10 A direct pathological role of P2RY2 is reinforced by recent evidence11 showing that the upregulation and activation of P2RY2 in rabbit arteries mediate the intimal hyperplasia that accompanies atherosclerosis. Guns et al.12 found that in the aorta of P2Y2-knockout mice, endothelium-dependent relaxation by ATP was inhibited, showing the role of P2RY2 in endothelial dysfunction. These studies show that the P2RY2 gene has an important role in the process of atherosclerosis, the main cause of CI.

To our knowledge, there have been no earlier studies on the association between the human P2RY2 gene and CI. This study was a retrospective haplotype-based case–control study. The aim was to investigate the association between the human P2RY2 gene and CI by using single nucleotide polymorphisms (SNPs) in conjunction with separate analyses of data pertaining to gender groups.

Methods

Subjects

The study group consisted of 237 CI patients (mean age, 63.2±12.8 years, m/f ratio=1.69). Subtypes of CI included thrombotic and lacunar stroke. Diagnosis was based on neurological examination and on the findings of computed tomography or magnetic resonance imaging, or both. In addition, all patients had neurological deficit ratings greater than grade 3 on the modified Rankin Scale. The study also enrolled 254 Japanese subjects as the control 1 group (mean age, 77.8±4.2 years, m/f ratio=0.90). All subjects were members of the New Elder Citizen Movement in Japan and resided in the Greater Tokyo Metropolitan Area. Although some of the subjects in the control 1 group had vascular risk factors such as hypertension, hypercholesterolemia and diabetes mellitus, none had a history of CI. All of the subjects in the control 1 group were confirmed to have grade 0 on the modified Rankin Scale of neurological deficits. Individuals with atrial fibrillation were excluded from both the CI and control groups. Participants with cancer or autoimmune disease, including antiphospholipid antibody syndrome, were also excluded.13 A total of 255 sex-matched healthy individuals (mean age, 51.4±10.0 years, m/f ratio=1.66) were also enrolled as the control 2 group. None of the subjects in the control 2 group had a history of hypertension and CI, and all had systolic blood pressure <130 mm Hg and diastolic blood pressure <85 mm Hg. The patients and two control groups were taking no medications influencing the adenylate cyclase pathway. Informed consent was obtained from each subject in accordance with the protocol approved by the Human Studies Committee of Nihon University.14 Sample sizes were considered to be appropriate for a case–control study.15

Genotyping

The human P2RY2 gene is located on chromosome 11q13.5–q14. 1, spans approximately 18.1 kb, and contains three exons. There are 104 SNPs for the human P2RY2 gene listed in the National Center for Biotechnology Information SNP database Build 129 (http://www.ncbi.nlm.nih.gov/SNP). We screened the data for Tag SNPs on the International HapMap Project website (http://www.hapmap.org/index.html.ja) using a cutoff level of r20.5. For minor allele frequencies, we used a cutoff level of 0.2. According to the above criteria, we selected rs4944831 (SNP1, C-12034890-10, registration number by Applied Biosystems), rs1783596 (SNP2, C-8893575-10), rs4944832 (SNP3, C-27987464-10), rs4382936 (SNP4, C-1830488-20) and rs10898909 (SNP5, C-1830487-10) for this study (Figure 1). SNP1 was located in intron 1, whereas SNP2 was located in the coding region of exon 3. SNP3, SNP4 and SNP5 were located in the 3′-flanking region.

Figure 1
figure 1

Structure of the human P2RY2 gene. The gene consists of three exons (boxes) separated by two introns. Lines show introns and intergenic regions. Filled box shows the coding region. Arrows indicate the locations of single nucleotide polymorphisms (SNPs). kbp, kilobase pairs.

Blood samples were collected from all participants and genomic DNA was extracted from peripheral blood leukocytes by phenol and chloroform extraction.

Genotyping was performed using the TaqMan SNP Genotyping Assay (Applied Biosystems, Foster City, CA, USA). TaqMan SNP Genotyping Assays were performed by Taq amplification.16 In the 5′-nuclease assay, discrimination occurs during the polymerase chain reaction (PCR) because of allele-specific fluorogenic probes that, when hybridized to the template, are cleaved by the 5′-nuclease activity of Taq polymerase. The probes contain a 3′-minor groove-binding group that hybridizes to single-stranded targets with greater sequence-specificity than ordinary DNA probes. This reduces nonspecific probe hybridization, and results in low background fluorescence in the 5′-nuclease PCR assay (TaqMan; Applied Biosystems). Cleavage results in increased emission of a reporter dye. Each 5′-nuclease assay requires two unlabeled PCR primers and two allele-specific probes. Each probe is labeled with two reporter dyes at the 5′-end. In this study, VIC and FAM were used as reporter dyes. The primers and probes used in the TaqMan SNP Genotyping Assays (Applied Biosystems) were chosen on the basis of the information available on the Applied Biosystems website (http://www3.appliedbiosystems.com/AB_Home/index.htm).

PCR amplification was performed using 2.5 μl of TaqMan Universal Master Mix, No. AmpErase UNG (2 × ) (Applied Biosystems) in a 5-μl final reaction volume containing 2 ng DNA, 0.046 μl of TaqMan SNP Genotyping Assay Mix (40 × ), primers at a concentration of 331.2 nmol l−1 each, and probes at a final concentration of 73.6 nmol l−1 each. Thermal cycling conditions consisted of 95 °C for 10 min, and then 50 cycles of 92 °C for 15 s and 60 °C for 1 min in a GeneAmp 9700 system (Applied Biosystems).

Each 96-well plate contained 80 DNA samples of an unknown genotype and four reaction mixtures containing reagents but no DNA (control samples). The control samples without DNA are a necessary part of the Sequence Detection System (SDS) 7700 signal processing, as outlined in the TaqMan Allelic Discrimination Guide (Applied Biosystems). Plates were read on the SDS 7700 instrument with the end-point analysis mode of the SDS version 1.6.3 software package (Applied Biosystems). Genotypes were determined visually on the basis of the dye-component fluorescent emission data depicted in the X–Y scatter-plot of the SDS software. Genotypes were also determined automatically by the signal processing algorithms of the software. The results of each scoring method were saved in two separate output files for later comparison.

Biochemical analysis

We measured serum concentrations of creatinine using the standard methods used by the Clinical Laboratory Department of Nihon University Hospital.

Statistical analysis

All continuous variables were expressed as means±s.d. Differences in continuous variables between the CI patients and controls were analyzed using the Mann–Whitney U-test. Differences in categorical variables were analyzed using Fisher’s exact test. The Hardy–Weinberg equilibrium was assessed by χ2 analysis. Differences in the distributions of genotypes and alleles between the CI patients and controls were analyzed using Fisher’s exact test. On the basis of the genotype data of the genetic variations, we performed linkage disequilibrium analysis and haplotype-based case–control analysis, using the expectation maximization algorithm17 and the software SNPAlyze version 3.2 (Dynacom, Yokohama, Japan). Pairwise linkage disequilibrium analysis was performed using five SNPs. We used D′ values greater than 0.5 to assign SNP locations to one haplotype block. SNPs with an r2 value less than 0.5 were selected as tagged. In the haplotype-based case–control analysis, the frequency distribution of the haplotypes and P-values were calculated by χ2 analysis. In addition, logistic regression analysis was performed to assess the contribution of major risk factors. Statistical significance was established at P-values less than 0.05. Statistical analyses were performed using SPSS software for Windows (version 12; SPSS, Chicago, IL, USA).

Results

Table 1 shows the clinical characteristics of the study participants. The mean age of the control 1 group was higher than that of the CI patients. In humans, the use of the so-called ‘super controls’ has been widely accepted in case–control studies for common diseases that appear later in life.13 Here, we used a ‘super control’ group, as healthy elderly subjects have been found to be more suitable than young or middle-aged subjects when determining phenotypes of cerebrovascular diseases related to aging. CI is an age-influenced disease, and the use of a ‘super control’ group rather than an age-matched control group is therefore better for increasing statistical power in these types of experiments. For the three groups, all subjects, men and women, the following values were significantly higher in the CI patients than in the two control groups: systolic blood pressure, diastolic blood pressure, and prevalence of hypertension and diabetes mellitus. For all subjects and men, serum creatinine and the prevalence of smoking and drinking and hyperlipidemia were significantly higher for the CI patients than for the control 1 group.

Table 1 Characteristics of study participants

Table 2 shows the distributions of genotypes and alleles of the five SNPs. As the elderly subjects in the ‘super-control’ group may have had protective SNPs, and the patients and the control 1 group were not sex-matched, we added another group that was sex-matched and had a younger age (control 2 group). The genotype distribution of each SNP did not show significant differences from the Hardy–Weinberg equilibrium values in the control 1 and control 2 groups (data not shown). For women, the distribution of the dominant form of rs4944832 (GG vs. GA+AA) differed significantly between the CI patients and the control 1 group (P=0.043), and between the CI patients and the control 2 group (P=0.029). There were also significant differences in the allelic distribution of rs4944832 between the female CI patients and the control 1 group (P=0.039), and between the female CI patients and the control 2 group (P=0.017). Dominant and recessive models were defined by their overall frequency among controls. As similar results were seen in the control 2 group, we believe that the present ‘super-elderly’ subjects had no protective SNPs.

Table 2 Genotype and allele distributions in CI patients and two control groups

Table 3 shows the results of logistic regression analysis. As the relationship between rs4944832 and CI in women in Table 2 was not clear after using Bonferroni's correction,18 we examined this relationship by logistic regression analysis to confirm the results. For women, logistic regression was performed using the following parameters: GG genotype of rs4944832, pulse rate, creatinine with or without hypertension, diabetes mellitus or drinking frequency, because these confounding factors showed significant differences among women in Table 1. The GG genotype of rs4944832 was found to be significantly higher in female CI patients than in the control 1 group (P=0.021) and the control 2 group (P=0.005), which further showed the relationship between rs4944832 and CI in women.

Table 3 Odds ratios and 95% confidence intervals for each risk factor, and GG genotype of rs4944832 associated with cerebral infarction in female subjects

Figure 2 shows the patterns of linkage disequilibrium for the P2RY2 gene in the control 1 group, with their D′ and r2 values. Four SNPs (SNP2, SNP3, SNP4 and SNP5) were located in one haplotype block, as the D′ values were beyond 0.25. However, SNP1 was not located in this haplotype block. As the r2 values for SNP2–SNP3 were greater than 0.5, at least one SNP from SNP2 and SNP3 was suitable for the haplotype-based case–control study. Therefore, given that the minor allele frequency of SNP2 was larger than that of SNP3, we constructed the haplotypes using SNP2, SNP4 and SNP5.

Figure 2
figure 2

Pairwise linkage disequilibrium (LD) in the human P2RY2 gene, as evaluated by D′ and r2 values. Pairwise LD was computed for the five marker pairs that were studied in the human P2RY2 gene. Pairs in LD (D′0.25 or r20.5) are shown as shaded values.

In the haplotype-based case–control study, there were four combinations: SNP2–SNP4, SNP2–SNP5, SNP4–SNP5 and SNP2–SNP4–SNP5 (Table 4). For all subjects, the overall distribution of the haplotype established by SNP2–SNP4–SNP5 was significantly different between the CI patients and the control 1 group (P=0.027). For all subjects, the frequency of the T-A-G haplotype (SNP2–SNP4–SNP5) was also significantly higher (P=0.031), whereas the frequency of the T-C-G haplotype (SNP2–SNP4–SNP5) was significantly lower (P=0.029) for the CI patients than for the control 1 group. For women, the frequency of the T-A-A haplotype (SNP2–SNP4–SNP5) was significantly higher for the CI patients than for the control 1 group (P=0.012).

Table 4 Haplotype analysis in CI patients and control 1

Discussion

In 1994, Parr et al.19 first reported the sequence and functional expression of cDNA-encoding P2RY2 cloned from airway epithelial cells, whereas in 1996, Dasari et al.20 mapped the human P2RY2 gene to chromosome 11q13.5–q14.1 using a PCR analysis of human–rodent hybrid cell line DNAs. Earlier studies in mice have identified important roles for P2RY2 in a variety of processes, including nucleotide-regulated Ca2+ signaling in lung fibroblasts and airway epithelial cells, nucleotide-stimulated Cl secretion in the trachea and gallbladder, neuronal growth, stimulation of K+ secretion in the colon and neutrophil chemotaxis.21 P2RY2 may participate in various physiological responses by the activation of nucleotides. Extracellular ATP or UTP binds to P2RY2, and this combination activates phospholipase C and/or adenylate cyclase.22 In turn, phospholipase C activation generates inositol 1,4,5-triphosphate, a mediator of Ca2+ release from intracellular stores, and diacylglycerol, an activator of protein kinase C, whereas adenylate cyclase generates cyclic adenosine monophosphate, an activator of protein kinase A.

Atherosclerosis related to CI is an immunoinflammatory process that involves complex interactions between the vessel wall and blood components and is thought to be initiated by endothelial dysfunction.23 From a vascular biology perspective, the processes of cellular adhesion, monocyte and macrophage attachment, and transmigration of immune cells across the endothelium are crucial steps in early atherogenesis and in the later stages of mature plaque rupture.24 Inflammatory cells express P2RY2 with multiple effects. P2RY2 is expressed on T lymphocytes and macrophages and has been suggested to be important in atherosclerosis.25 Recent studies have found an important role for P2RY2 in the development of intimal hyperplasia in rabbit carotid arteries.11 With regard to cerebral vessels, increased stroke risk has been associated with an increased rate of atherosclerosis progression in carotid vessels. Various evidence26, 27 has also shown that P2Y2 receptors in the smooth muscle cell mediate the dilation of rat cerebral arterioles. Marrelli et al.28 found that P2Y2 receptors have an important role in the dilation of the rat’s middle cerebral artery after ischemia reperfusion. These studies indicate that the P2RY2 gene has a close relationship with CI.

On the basis of the results of these animal studies, we planned to assess the association between the P2RY2 gene and CI using a haplotype-based case–control study. In this study, we found that the distribution of the dominant model of rs4944832 (GG vs. GA+AA) differed significantly between the female CI patients and the two control groups. The present results indicate that the risk of CI is increased in subjects with the GG genotype of rs4944832, which is associated with CI for Japanese women. We also found a significant difference in the allelic distribution of rs4944832 between the female CI patients and the two control groups. In this study, logistic regression analysis indicated that for women, the GG genotype of rs4944832 was significantly more common in the CI patients than in the two control groups. This indicates that the GG genotype is a genetic marker for CI in Japanese women.

Morris et al.29 found that for genes with multiple susceptibilities, analysis based on haplotypes has advantages over analysis based on individual SNPs, particularly when linkage disequilibria between the SNPs are weak. Consequently, in this study, we successfully established haplotypes for the P2RY2 gene from the various combinations of the three selected SNPs. For the present haplotype analysis, although there were significant differences in the frequency of the T-A-A haplotype (SNP2–SNP4–SNP5) between the female CI patients and the control 1 group, we believe that this difference is not particularly important, as the overall distribution of the combination (SNP2–SNP4–SNP5) for women did not significantly differ between the CI patients and the control 1 group (P=0.076). In this study, for all subjects, the frequency of the T-A-G haplotype was significantly higher for the CI patients than for the control 1 group, and the T-A-G haplotype can be regarded as a susceptibility haplotype for CI in Japanese subjects. The frequency of the T-C-G haplotype was significantly lower for the CI patients than for the control 1 group, and the T-C-G haplotype can be regarded as a resistance haplotype for CI in Japanese subjects.

In this study, the GG genotype of rs4944832 showed a gender-specific significant difference (for women only). Although being male is recognized as a risk factor for CI, some basic and clinical studies have shown the opposite results.30, 31 ATP, the ligand for P2RY2, has been shown to increase the production of progesterone and estradiol in human granulosa–luteal cells.32 Several clinical and experimental studies have suggested that estrogen is not universally neuroprotective in experimental cerebral ischemia 31 and may not be beneficial for ischemic preconditioning in experimental ischemia models.33 Recent clinical studies have suggested that there are unanticipated and paradoxical effects of estrogen and hormone replacement therapy relative to stroke risk, as it is currently administered to women.34 One study has suggested that women on hormone replacement therapy have a higher risk of perioperative stroke.35 Therefore, depending on the ischemic conditions present and the level of ATP, estrogen has the ability to harm ischemically compromised brain tissue.

In conclusion, the T-A-G haplotype of the human P2RY2 gene may be a susceptibility haplotype for CI in Japanese subjects, and the GG genotype could be a genetic marker for CI, particularly in Japanese women. This study also shows that the T-C-G haplotype is a resistance marker for CI in Japanese subjects. Further studies are needed to isolate functional mutations in the P2RY2 gene that modulate the process of atherosclerosis, and to evaluate the function of P2RY2 variants that are involved in the metabolism of sex hormones.