Introduction

Prostate cancer ([Mendelian Inheritance in Man 176807]) is the most commonly diagnosed nonskin malignancy and the second leading cause of cancer-related death among men in United States and first leading cause of death among Hispanics/Latinos1,2; and in Asian countries, especially in China, the incidence of prostate cancer is increasing3. Worldwide, the disease is the second most common cancer in men after lung cancer4. Prostate cancer is a complicated and multifactorial disease. The precise etiology and pathological mechanism of prostate cancer remains unclear. Age, family history, and ethnicity are the most consistently addressed risk factors associated with prostate cancer. However, age and inherited factors are estimated to be responsible for 5% to 9% percentage of prostate cancer5. Therefore, identifying a preventable cause of prostate cancer would produce an important influence of public health.

The substantial differences aforementioned in incidence of prostate cancer worldwide may be due to ethnic variation6. Therefore, certain researchers indicated that different levels of androgens across varying ethnicity may contribute to these differences6,7. The exact mechanism through which androgen is involved in the etiology of prostate cancer remains unclear. The androgen receptor gene [Mendelian Inheritance in Man 313700] is located at Xq11.2-q12, and the length of androgen receptor gene is more than 90 kb8. The androgen receptor gene is comprised of eight exons that encode four functional domains, which include the transactivation domain, the DNA binding domain, a hinge region, and the carboxyl-terminal ligand binding domain9. There are two main polymorphisms including CAG and GGN repeats in the androgen receptor gene. Moreover, CAG was associated with the transcriptional activity of the AR in response to ligand binding. Therefore, the correlation between these polymorphisms of androgen receptor and risk of prostate cancer has received much attention. Three published meta-analyses6,10,11 and several narrative reviews12,13,14,15 have addressed the association between the repeat polymorphisms and prostate cancer susceptibility. However, the conclusions of these previous meta-analyses were not consistent and the narrative reviews could not quantify the estimate. Additionally, more studies have been published since the most recent meta-analysis. Therefore, we performed the present meta-analysis aimed to provide a more precise and comprehensive result for the relationship between CAG and GGN repeat polymorphisms of androgen receptor gene and prostate cancer susceptibility.

Methods

Eligible criteria

For inclusion in this meta-analysis, the publication had to meet the following eligible criteria: (1) the exposure was androgen receptor gene CAG and GGN repeat polymorphisms; (2) populations were men with prostate cancer (cases) without prostate cancer (controls); (3) the outcome was incident of prostate cancer; (4) the study design was retrospective or prospective (i.e. nested) case-control study; (5) study provided distribution of genotype, odds ratio (OR) and corresponding 95% confidence interval (CI), mean difference (MD) and corresponding standard error (SE), and mean repeats in case and control groups with related SE. For duplicated publication, we included the most recent or that providing the most information. If one publication provided different groups of ethnicity, we considered the each group as a separate study. We conducted the meta-analysis according to Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) statement in reporting meta-analysis. The protocol (registration number: CRD42016036971) of the meta-analysis was published in the PROSPERO register (http://www.crd.york.ac.uk/PROSPERO/).

Search strategy

A comprehensively literature search was performed in PubMed, Embase, CBM, CNKI and Wanfang databases up to March, 2016, without restriction to regions, publication types, or languages. The search strategy was as following: (“polymorphism” AND “prostate cancer” AND “androgen receptor”). In addition, references in the recent reviews or meta-analysis and included articles were identified for any further potential related studies.

Data extraction

Data from the included studies were extracted and summarized independently by two authors (HW and XT-Z). Any disagreement was resolved by discussion of which data should be extracted. The following information was extracted: last name of first author, publication year, country of study, ethnicity, study design, control status, sample size, age of the cases and controls, percentage of advanced prostate cancer cases (T3-T4, M0; T0-T4, M1), the repeat cutpoint of polymorphisms, mean number of repeats in case and control groups with related SE, dichotomous data (short versus long repeats), and estimate with corresponding 95% CI (including OR for dichotomous data and continuous data). We defined the long CAG and GGN repeats as ≥22 and >16 repeats as previously published6, respectively. Otherwise, <22 and ≤16 were short CAG and GGC repeats, respectively.

Statistical analysis

We calculated ORs and 95% CIs for short CAG repeats (<22) versus long CAG repeats (≥22) and short GGN repeats (≤16) versus long GGN repeats (>16) using dichotomous data6. We summarized ORs and corresponding 95% CIs for per decrement of CAG and GGN repeats. We also summarized the MDs in number of repeats between cases and controls. In this meta-analysis, all pooled analyses were performed with random-effects model using the method of DerSimonian-Laird, with the estimate of heterogeneity being taken from the from the Mantel-Haenszel model. Subgroup analyses were also performed according to ethnicity (Caucasian, Asian, Africa, or Hispanic), study design (prospective, i.e. nested or retrospective case-control study), control status, and histology grade of prostate cancer (localized and advanced). In addition, meta-regression analysis was also performed for interaction of between-group. Sensitivity analysis was performed by removing each study at a time. Publication bias was detected using contour-enhanced funnel plot and Egger’s linear regression method. Statistical analysis was performed using Stata 12.0 software. A two-sided P value of 0.05 was used, except for heterogeneity test (0.1).

Results

Study characteristics

A total of 717 relevant publications were identified from the electronic literature search. The PRISMA flow diagram was presented in Fig. 1, which shows the detail of inclusion and exclusion of studies. Ultimately, 51 publications16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65,66 were included in the meta-analysis, in which 51 publications16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65,66 with 61 case-control studies (14 803 cases and 18 888 controls, Fig. 2) for CAG repeats and 14 publications16,18,20,21,23,24,32,33,35,47,53,54,56,57 with 16 case-control studies (2986 cases and 3705 controls, Fig. 3) for GGN repeats. The characteristics of included studies were shown in Table 1.

Figure 1
figure 1

Flow chart for this meta-analysis.

Figure 2
figure 2

Sample size of the CAG repeat polymorphism.

Figure 3
figure 3

Sample size of the GGN repeat polymorphism.

Table 1 Characteristics of included studies in the meta-analysis.

Association between CAG repeats polymorphism and prostate cancer risk

Fifty-one case-control studies conveyed data on the short versus long CAG repeats. The pooled analysis showed that men with short CAG repeats carried higher risk of prostate cancer than long CAG repeats (OR = 1.31, 95% CI 1.16 to 1.47; I2 = 74.9%, P for heterogeneity <0.01; Fig. 4). Thirty-three case-control studies presented the data for per one CAG decrement and the summarized OR was 1.04 (95% CI 1.02 to 1.07; I2 = 83.4%, P for heterogeneity <0.01; Fig. 5) for men with per one CAG decrement. The aggregated analysis suggested that prostate cancer cases seemed to have on average 0.85 fewer CAG repeat length than controls (MD = −0.85, 95% CI −1.28 to −0.42; I2 = 88.7%, P for heterogeneity <0.01; Fig. 6) with 23 case-control studies.

Figure 4
figure 4

Forest plot of short CAG repeats versus long CAG repeats.

Figure 5
figure 5

Forest plot of per one CAG repeat decrement and risk of prostate cancer risk.

Figure 6
figure 6

Forest plot of difference in number of CAG repeat length between cases and controls.

Association between GGN repeats polymorphism and prostate cancer risk

Sixteen case-control studies provided data on the short versus long GGN repeats. The pooled results showed that men with short GGN repeats carried higher risk of prostate cancer than long GGN repeats (OR = 1.38, 95% CI 1.05 to 1.82; I2 = 69.1%, P for heterogeneity <0.01; Fig. 7). Six case-control studies presented the data for per one GGN decrement and the summarized OR was 0.99 (95% CI 0.95 to 1.03; I2 = 0.0%, P for heterogeneity = 0.93; Fig. 8) per GGN. The summarized MD of GGN repeats showed no significant difference between the prostate cancer cases and controls (MD = 0.05, 95% CI −0.09 to 0.18; I2 = 0.0%, P for heterogeneity = 0.95; Fig. 9) with six case-control studies.

Figure 7
figure 7

Forest plot of short GGN repeats versus long GGN repeats.

Figure 8
figure 8

Forest plot of per one GGN repeat decrement and risk of prostate cancer risk.

Figure 9
figure 9

Forest plot of difference in number of GGN repeat length between cases and controls.

Haplotype analysis of CAG and GGN repeat polymorphisms

Six case-control studies provided data for haplotype analysis. The estimated ORs were 2.06 (95% CI 1.29 to 3.29; I2 = 69.3%, P for heterogeneity = 0.006), 1.79 (95% CI 1.08 to 2.96; I2 = 75.8%, P for heterogeneity = 0.001), and 1.21 (95% CI 0.94 to 1.56; I2 = 0, P for heterogeneity = 0.99) for haplotypes CAG <22/GGN ≤16, CAG <22/GGN >16, and CAG ≥22/GGN ≤16 compared with CAG ≥22/GGN >16 (Fig. 10).

Figure 10
figure 10

Haplotype analysis of CAG and GGN repeat polymorphisms and risk of prostate cancer.

Subgroup, meta-regression and sensitivity analysis

Subgroup analyses were conducted according to ethnicity, study design, control status, and histology grade of prostate cancer. The results of subgroup analyses showed that the elevated risk of prostate cancer in both CAG and GGN repeat polymorphisms were more predominant among Caucasian populations (Tables 2, 3 and 4) and the increased risk of prostate cancer of long GGN repeats were more predominant in advanced prostate cancer cases (Fig. 11). Meta-regression analysis did not detect any significant difference between subgroups (Tables 2, 3 and 4). Subgroup analysis showed that the result of CAG repeat length and risk of prostate cancer was robust (Fig. 12).

Table 2 The results of overall and subgroup analyses of the association between CAG repeats and prostate cancer risk.
Table 3 Results of length of CAG repeats and risk of prostate cancer.
Table 4 Results of the association between GGN repeats and prostate cancer.
Figure 11
figure 11

Subgroup analysis of histology grade of prostate cancer.

Figure 12
figure 12

Sensitivity analysis of CAG repeat decrement and risk of prostate cancer risk.

Publication bias

Publication bias was detected using contour-enhanced funnel plot and Egger’s linear regression method. Contour-enhanced funnel plots showed that publication bias might exist for the short versus long CAG repeat polymorphism (Fig. 13) and no publication bias existed in the short versus long GGN repeat polymorphism (Fig. 14). Egger’s linear regression method supported the aforementioned conclusion (P = 0.004 for CAG repeats; P = 0.07 for GGN repeats).

Figure 13
figure 13

Contour-enhanced funnel plot of CAG repeat polymorphism.

Figure 14
figure 14

Contour-enhanced funnel plot of GGN repeat polymorphism.

Discussion

The present meta-analysis summarizes the evidence to date regarding the association between CAG and GGN repeat polymorphisms of androgen receptor and the risk of prostate cancer. The results suggested that short CAG and GGN repeats in the androgen receptor gene were associated with increased risk of prostate cancer, especially in Caucasians.

The short CAG repeats (<22) and short GGN repeats (≤16) carry a roughly 1.31- and 1.38-fold higher risk of developing prostate cancer compared with subjects with long CAG (≥22) repeats and long GGN repeats (>16), respectively. Each decrement in CAG repeat presented 1.04-fold higher risk of developing prostate cancer. Prostate cancer cases presented an average 0.85 fewer CAG repeats than controls. In Caucasians, the aforementioned elevated risk was increased. This could be due to that more studies conducted in Caucasians, which provided greater statistical power for detecting small gene effect. Specifically, the prostate cancer cases in Caucasian population carried an average 1.09 fewer CAG repeats than controls. This difference might yield certain measurable biological impact in prostate carcinogenesis, such as early diagnosis and gene therapy.

An interaction between CAG and GGN repeat polymorphisms in increasing the prostate cancer susceptibility was documented by our meta-analysis. Haplotype analysis showed that short CAG and short GGN repeats carriers presented 2.06-fold higher risk of developing prostate cancer compared with long CAG and long GGN repeats carriers. Moreover, the short CAG repeats and long GGN repeats carriers presented 1.79-fold higher risk of developing prostate cancer compared with long CAG and long GGN repeats carriers.

In 2004, Zeegers et al.6 published the first meta-analysis regarding the association between CAG and GGN repeat length polymorphisms in the androgen receptor gene and prostate cancer risk, in which included 23 articles with 19 retrospective case-control studies and 5 prospective case-control studies, comprising a total of 4274 cases and 5275 controls. They found that the presence of shorter repeats seemed to be modestly associated with prostate cancer risk. However, they did not found any significant difference in number of repeats between cases and controls. In 2012, Gu et al.10 aggregated 27 articles to evaluate the relationship between CAG repeat polymorphism and prostate cancer risk. Their meta-analysis demonstrated that the CAG repeat polymorphism in androgen receptor gene with more than 20 repeats might confer a protective effect among the prostate cancer cases among men 45 years or older only. In 2013, Sun et al.11 carried out another meta-analysis regarding the association between CAG repeat polymorphism and prostate cancer risk, which included 47 studies with 13 346 cases and 15 172 controls. They suggested that a short CAG repeat polymorphism might increase the risk of prostate cancer compared with the longer CAG repeat, especially in Caucasians and Asians. Compared with the previous meta-analysis6,10,11, our meta-analysis was more comprehensively searched and our meta-analysis included 51 case-control studies (14 803 cases and 18 888 controls) for CAG repeats and 16 case-control studies (2986 cases and 3705 controls) for GGN repeats. In addition, our meta-analysis performed haplotype analysis and suggested that there exists an interaction between CAG and GGN repeat polymorphisms in increasing the prostate cancer susceptibility. Moreover, we found a significant difference in number of CAG repeat length between cases and controls, and the absolute difference in more than 1 repeat in Caucasians.

The present retrospective analysis has some limitations. First, the evidence of between study heterogeneity was apparent, and the heterogeneity might distort the conclusion of the current meta-analysis67,68,69. Additionally, the meta-regression analysis failed to identify the source of heterogeneity. Second, the standard of cutpoint of repeat length polymorphisms varied in different studies. This might in part contribute to the between study heterogeneity. Third, the screening policy of prostate cancer also varies between countries. Especially in United States, the prostate-specific antigen screening of the general population is more commonly used than other countries6. These different screening policies might also be responsible for the between study heterogeneity. Fourth, the publication bias was detected in the present meta-analysis for the association between CAG repeat polymorphism and risk of prostate cancer. The existing publication bias indicated that certain studies with negative results for the association between CAG repeat polymorphism and prostate cancer risk are under-represented in the literature. The publication bias also might distort the conclusion of the present meta-analysis. Ultimately, the meta-analysis is a secondary analysis; therefore, we could not handle the problem of between study heterogeneity.

In summary, our meta-analysis indicated that short CAG and GGN repeats in androgen receptor gene were associated with increased risk of prostate cancer, especially in Caucasians.

Additional Information

How to cite this article: Weng, H. et al. Androgen receptor gene polymorphisms and risk of prostate cancer: a meta-analysis. Sci. Rep. 7, 40554; doi: 10.1038/srep40554 (2017).

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.