Introduction

Breast cancer, as a multifactorial disease, is the most common cancer in the world1. The major influences on breast cancer appear to be environmental and genetic factors2,3. Previous studies indicate that genetic influences account for about 27% of the breast cancer risk4. A few genes including BRCA1, BRCA2 and ATM have been known to be associated with the risk of breast cancer5. In the past 30 years, the incidence of breast cancer has been increasing rapidly in many countries. According to the data of International Agency for Research on Cancer (IARC), there are additional 1.68 million breast cancer patient in 2012, which is 25% of the women with tumor6.

It’s reported that genetic factors play an important role in the development of breast cancer. With the completion of the human genome project (HGP) in 2001, single nucleotide polymorphisms (SNPs) come into view as the essential factor in the development of diseases 7,8,9,10. In addition to the highly penetrant (BRCA1, BRCA2 and TP53) and moderately penetrant (CHEK2, BRIP1, ATM and PALB2) genetic variants, breast cancer has been associated with low penetrant risk (FGFR2, TNRC9, MAP3K1 and LSP1)11. About 5% of breast cancer incidence is attributable to high-penetrance mutations12. Therefore, it’s important to evaluate the association between low penetrant and breast cancer risk.

Recent studies have identified the single nucleotide polymorphism rs3803662 associated with breast cancer risk7,13,14,15,16,17,18,19,20,21,22,23,24. In 2010, Rulla M Tamimi et al. and TV Gorodnova et al. found significant association in Swedish and Russian, respectively15,17. In 2011, Martha L. Slattery et al. found the association in Hispanic subsequently22. In 2014, Isabel Elematore et al. considered that rs3803662 was associated with breast cancer risk in Chilean24. In 2016, Yaning He et al. also find the association in Chinese Han population23. However, the sample size of most studies was too small. One of the aims of Meta-Analysis is leading to a higher statistical power and more robust result. Single study does not have significant result statistically sometimes. It is due to that the sample size was so small that statistical power was low. We can improve statistical power by means of combining some samples with small sample size. Therefore, we conducted a meta-analysis of the previously published studies involving SNP rs3803662 and breast cancer to get a more comprehensive result.

Results

Literature search

A flow diagram for the study selection process is shown in Fig. 1. A total of 95 articles were identified by the search strategy. 77 articles were removed because they don’t evaluate the association between rs3803662 and breast cancer risk or have sufficient data to calculate the ORs with 95% CIs. Afterwards, 4 articleswere excluded due to that the control group doesn’t meet the Hardy-Weinberg Equilibrium (HWE) and 1 article was excluded because that the study population is male, not female.

Figure 1
figure 1

The flow diagram showing the study selection process.

A total of 95 articles were identified by the search strategy. Because don’t meet the qualifications, 77 articles were removed and 18 articles remained for further screening. Afterwards, since the control groups don’t meet the Hardy-Weinberg Equilibrium (HWE), 4 articles were excluded. Besides, 1 article was excluded due to that the study population is male, not female. Finally, 13 studies were included in the meta-analysis.

Study characteristics

The primary characteristics of the 13 studies are summarized in Table 1. A total of 29405 participants with 14306 cases and 15099 controls were included in this study. The studies were divided into two subgroups according to the ethnicity of their participants: East-Asian subgroup with 4 studies and Caucasian subgroup with 8 studies and others subgroup with 1 study.

Table 1 The primary characteristics of the thirteen studies.

Association between rs3803662 polymorphism and breast cancer risk

The forest plot concerning the association between the rs3803662 polymorphism and the risk of breast cancer is shown in Fig. 2. Figure 2A–D are forest plots of dominant model, recessive model, additive model and Allele frequency, respectively.

Figure 2
figure 2

The forest plot of different model.

(A) dominant model; (B) recessive model; (C) additive model; (D) allele frequency.

In Caucasian subgroup, dominant model (TT + CT vs CC): OR = 1.17 (1.06, 1.29); recessive model (TT vs CT + CC): OR = 1.25 (1.13, 1.39); additive model (TT + CC vs CT): OR = 0.93 (0.84, 1.03); allele frequency (T vs C): OR = 1.15 (1.08, 1.22).

In East Asian subgroup, dominant model (TT + CT vs CC): OR = 0.86 (0.63, 1.18); recessive model (TT vs CT + CC): OR = 0.96 (0.74, 1.25); additive model (TT+CC vs CT): OR = 1.01 (0.87, 1.17); allele frequency (T vs C): OR = 0.95 (0.77, 1.16).

Assessment of publication bias

Funnel plot were carried out to assess publication bias. There are many reasons leading to publication bias: low quality research with small sample, heterogeneity and negative result. Figure 3A–D are funnel plots of dominant model, recessive model, additive model and Allele frequency, respectively. We found publication bias in Fig. 3B (recessive model). In this study, it mainly comes from few researches with small sample and heterogeneity.

Figure 3
figure 3

The funnel plot of different model.

(A) dominant model; (B) recessive model; (C) additive model; (D) allele frequency.

Discussion

Our meta-analyses, including 14306 cases and 15099 controls group numbers from 13 case-control studies, explored the association between the rs3803662 polymorphism and the risk of breast cancer. The result indicates that rs3803662 is significant associated with breast cancer risk in Caucasian women. And we did not find the association Asian women.

The SNP rs3803662 is significant associated with breast cancer risk in Caucasian women which is consistent with results of previous studies15,16,17,18,19,20,21,22. But in Asian women, we get inconsistent result. That can be attributed to the following reasons. First, only 4 studies with 5237 cases and 5130 controls were included in the meta-analysis according to the study selection criteria7,13,14,23. The inadequate participant results in that rs3803662 is not significantly associated with breast cancer risk in Asian women statistically. The more the studies were included in meta-analysis, the more accurate the result will be. More independent studies concentrating on the association in Asian women should be added into this analysis and we will focus on the latest research. Second, we hypothesize that genetic background varies among different ethnic populations. Furthermore, this variation leads to change in susceptibility to some cancers. The hypothesis needs a huge amount of samples to validate whether it is true or not.

The SNP rs3803662 lies 8 kb upstream of TNRC9 which is located on the chromosome 16q12 and consists of seven exons25. Rs3803662 has been confined to estrogen receptor-positive tumors26. Though we are not familiar with the function of TNRC9, this gene has been found to be relevant to bone metastasis in breast cancer27. TNRC9 is not only expressed in brain, but also expressed in breast with higher expression level in breast cancer compared to that in normal tissue28,29. And the minor allele of rs3803662 was associated with lower mRNA expression of TNRC9 gene30. James Owain Jones et al. found that TNRC9 maps to a known breast cancer susceptibility locus and hypothesized that TNRC9 could be a candidate tumor suppressor gene in 16q31.

There are some limitations in this study. First, we conducted subgroup analysis according to the ethnicity of participants. Compared with Caucasian subgroup, only 4 studies with 5237 cases and 5130 controls were included in the study. Second, the variation in genetic background may have effects on the susceptibility to breast cancer. More studies focusing on the same population should be included.

In conclusion, rs3803662 polymorphism is significantly associated with breast cancer risk in Caucasian women and we did not find the association in Asian women.

Methods

Search strategy

A comprehensive literature searches for suitable studies published up to 2016 was conducted in the PubMed, EMbase and Web of Science database. Studies that investigated the rs3803662 and breast cancer were included in this meta-analysis. Studies should be published as a full paper. The search was conducted using the following keywords: “rs3803662” and “breast cancer”.

Study selection criteria

Two independent reviewers first screened the titles and abstracts to identify the relevant investigations. Then, full articles were read to include the eligible studies that met the following criteria: (1) used a case-control study design, (2) evaluated the association between rs3803662 and the risk of breast cancer, (3) provided the number of genotypes in case-controls groups for calculating ORs, (4) the control group has to satisfy HWE.

Data extraction

For every eligible study, the following data were extracted by two independent reviewers: name of the first author, publication date, the ethnicity of study population and the number of genotype in case-control group. In addition, the P value of HWE in control group was calculated.

Statistical analysis

All statistical analysis was conducted by STATA version 14.0 (STATA Corporation, College Station, TX, USA). The Odds Ratios (ORs) with 95% confidence intervals (CIs) was calculated to evaluate the association between the rs3803662 polymorphism and breast cancer risk.

The heterogeneity means the variation between different researches in systematic review. It has two major types: the clinical heterogeneity and the heterogeneity of methodology. The former comes from the difference of participants, interventions and outcome among studies. The latter derives from distinction of experimental design. We use Q and I2 statistics to assess the heterogeneity. If I2 < 50% or the P-value of heterogeneity >0.10, we use fixed effects model. Otherwise, the random effects model was selected.

The Begg’s rank correlation test and Egger’s linear regression test were used to assess the publication bias. The genetic models we use here include dominant model, recessive mode, additive model and allele frequency32,33.

Additional Information

How to cite this article: Yang, Y. et al. Association of single nucleotide polymorphism rs3803662 with the risk of breast cancer. Sci. Rep. 6, 29008; doi: 10.1038/srep29008 (2016).