Introduction

The latest GLOBOCAN estimates of cancer incidence have revealed that female breast cancer is the most commonly diagnosed cancer worldwide, with around 2.3 million cases in 2020, which equates to 11.7% of all cancer cases1. Thus, female breast cancer is a critical component of the global health burden and this disease is particularly problematic for China, because Chinese women have the highest age-standardized rates of cancer incidence worldwide (39.10 per 100,000 population)1,2 and breast cancer was the most commonly diagnosed type of tumor in women living in China in the year 20152,3.

Despite recent improvements in 5-year patient survival rates (from 73% between 2003–2005 to 82% in 2012–2015)4, increasing numbers of women who are detected early in the disease process, as well as increased access to effective treatments for breast cancer and greater amounts of government funding for cancer control1,3,4, breast cancer remains a major health issue among Chinese women3. The current situation regarding the high burden of malignant breast neoplasms in China leaves much room for improvement.

The precise etiology of breast cancer remains unclear, although the main risk factors have been identified as female gender, older age and having a family history of breast cancer5. Several genome-wide association studies have identified gene-environment interactions and breast cancer susceptibility single nucleotide polymorphisms (SNPs) that are associated with breast cancer risk6,7. Thus, exploring risk-associated SNPs may help to estimate individual risks for developing breast cancer and assist with its earlier diagnosis.

Angiopoietin-2 (ANGPT2) is a protein coding gene found on chromosome 8 that is capable of serving as a permissive angiogenic signal in the presence of the angiogenic inducer, vascular endothelial growth factor (VEGF), as this combination may facilitate endothelial cell migration and proliferation, whereas in the absence of VEGF, ANGPT2 induces endothelial cell apoptosis and subsequent vascular regression8,9. Interestingly, a case–control study involving Turkish patients with gynecological cancers found no evidence in support of DNA sequence variations in VEGF and ANGPT2 genes and their contribution to ovarian, cervical and endometrial cancers10. In contrast, in a study involving 421 critically ill patients with acute kidney injury (AKI) who were of European ancestry, a genetic variant (rs2920656) near the ANGPT2 gene significantly lowered the risk of developing the AKI sub-phenotype AKI-SP2 and lowered plasma Ang2 levels11. In a study that enrolled Chinese Han subjects, those who had the ANGPT2 SNP rs12674822 were at increased risk for rheumatoid arthritis12, while another investigation that also involved Chinese Han subjects reported that two ANGPT2 SNPs, rs12674822 and rs11137037, were correlated with the development of lung cancer and its progression13. These findings suggest that it is worth exploring a possible correlation between ANGPT2 gene polymorphisms and breast cancer diagnosis.

This case–control study is based on blood samples obtained from 539 healthy, cancer-free women and from 464 women with malignant breast neoplasms. It sought to determine whether Chinese Han women with ANGPT2 SNPs are susceptible to developing a malignant breast neoplasm and whether these SNPs correlate with clinical disease status.

Materials and methods

Study participants

Between 2013 and 2018, we enrolled 464 female patients (cases) diagnosed with a malignant neoplasm of the breast at Dongyang People's Hospital (Dongyang, Zhejiang, China) and 539 healthy, cancer-free females (controls), each of whom provided one 5-mL blood sample. The control group inclusion criteria specified that the enrolled patients underwent physical examination in the same hospital. A healthy status was defined as an absence of cancer history or other chronic diseases. Fully informed written consent was obtained from all study participants before study entry. The trial was performed in accordance with the principles of the Declaration of Helsinki14, Good Clinical Practices, and all local laws. The Ethics Committee of Dongyang People's Hospital approved the study protocol (Approval No.2020-YX-117). Clinicopathological data included age, cigarette smoking status, alcohol consumption, clinical stage, pathological grade, estrogen receptor (ER), progesterone receptor (PR), human epidermal growth factor receptor 2 (HER2) and Ki-67 status. Pathohistological diagnosis relied on the World Health Organization breast tumor classification criteria and tumors were graded by the Scarff-Bloom-Richardson method15,16. Malignant breast neoplasms were classified into the following 4 subgroups: Luminal A (ER-positive [+] and/or PR+, HER2-negative [−], Ki-67 < 14%); Luminal B (ER+ and/or PR+, HER2, Ki-67 ≥ 14%, ER+ and/or PR+, HER2+); HER2-enriched (ER, PR, HER2+), or as triple-negative breast cancer (TNBC; ER, PR, HER2)17,18,19. A standardized questionnaire at study entry collected sociodemographic data and electronic medical records provided clinicopathological information.

Selection of ANGPT2 polymorphisms

The ANGPT2 SNPs selected for this study were identified from multi-allelic copy number variation (CNV) profiles encompassing the q23 region of chromosome 8 containing ANGPT2 genes. We screened some of these ANGPT2 SNPs with minor allele frequencies > 5% from the records held on the 1000 Genome Projects website (https://www.internationalgenome.org/), to ensure the accuracy of our genotyping. Finally, nonsynonymous SNPs rs2442598, rs734701, rs1823375, 11,137,037, and rs12674822 were selected and validated by the Single Nucleotide Polymorphism database (dbSNP) (https://www.ncbi.nlm.nih.gov/snp/), as previously described20.

Genomic DNA extraction and genotyping by real-time PCR

Total genomic DNA was extracted from peripheral blood leukocytes using a QIAamp DNA blood mini kit (Qiagen, CA, USA), according to the manufacturer’s instructions20. Extracted DNA was stored at − 20 °C and prepared for genotyping by polymerase chain reaction (PCR). DNA was dissolved in TE buffer (10 mM Tris pH 7.8, 1 mM EDTA) and stored at − 20 °C until quantitative PCR analysis. Five ANGPT2 SNP probes were purchased from Thermo Fisher Scientific Inc. (USA), and allelic discrimination analysis of ANGPT2 SNPs was assessed using a QuantStudio™ 5 Real-Time PCR system (Applied Biosystems, CA, USA), according to the manufacturer’s instructions. Data were further analyzed with QuantStudio™ Design & Analysis Software (Applied Biosystems)21,22,23. Genotyping PCR was carried out in a total volume of 10 μL containing 20–70 ng genomic DNA, 1 U Taqman Genotyping Master Mix (Applied Biosystems) and 0.25 μL probes. The protocol included an initial denaturation step at 95 °C for 10 min, followed by 40 cycles of 95 °C for 15 s and 60 °C for 1 min.

Statistical analysis

Testing for Hardy–Weinberg equilibrium was performed to examine the genotypic frequencies of ANGPT2 rs2442598, rs734701, rs1823375, rs11137037 and rs12674822 in the cancer-free controls (2 degrees of freedom [df2]). Between-group differences were considered significant if p-values were less than 0.05. Since the data were independent and normally distributed, chi-square analysis or the Fisher’s exact test was used to compare relationships between ANGPT2 SNPs and clinicopathological parameters. Odds ratios (ORs) with their 95% confidence intervals (CIs) calculated associations between genotype frequencies and the risk of a malignant breast neoplasm. To further exclude the impact of confounding variables, adjusted odds ratios (AORs) with 95% CIs were estimated by multiple logistic regression models that controlled for age. Analysis of variance (ANOVA) was used to compare the age distribution of the cases and controls, and the Scheffé test was applied for post hoc analysis. The linkage disequilibrium (LD) haplotype analysis was observed using Haploview software (version 4.2) analysis.

Results

All of the study participants were of Chinese Han ethnicity. Table 1 presents their demographic characteristics and clinical parameters. The mean age was 55.05 ± 11.75 years in the breast cancer cohort and 52.89 ± 11.36 years in the controls (p < 0.05). The majority of study participants did not smoke cigarettes or consume alcohol; among those who did, smoking was significantly more common in the control cohort than in the case cohort (2.2% vs 0.2%; p < 0.05), while alcohol consumption was more common among cases than controls (7.1% vs 3.9%; p > 0.05). Most cases were diagnosed with clinical stage I/II disease (78.7%), but pathological grade II/III (90.3%). Tumors were mostly ER+ (71.8%), PR+ (59.7%), or HER2 (66.6%) (Table 1).

Table 1 Demographic and clinicopathological characteristics of healthy controls and patients with malignant breast neoplasms.

ANGPT2 genotyping results of distribution frequencies for all blood samples are listed in Table 2. The genotypic frequency of the ANGPT2 genetic variant rs2442598 conformed to the Hardy–Weinberg equilibrium (χ2 value, 0.386, p = 0.825; df2) in controls. The frequencies of rs734701, rs1823375, rs11137037 and rs12674822 also satisfied the Hardy–Weinberg equilibrium (χ2 value, 4.679, p = 0.096; χ2 value, 0.889, p = 0.641; χ2 value, 0.494, p = 0.781 and χ2 value, 2.473, p = 0.290; respectively). In cases and controls, most of those with the rs2442598 SNP and rs11137037 SNP were homozygous for the AA genotype, most of those with the rs734701 SNP were homozygous for the TT genotype, most of those with the variant rs1823375 were homozygous for CC, while most of those with the variant rs12674822 were homozygous for GG (Table 2). Distributions of ANGPT2 variants rs2442598, rs734701, rs1823375, rs11137037 and rs12674822 did not differ significantly between cases and controls, even after controlling for age (Table 2).

Table 2 Distribution frequencies of ANGPT2 genotypes in controls and patients with malignant breast neoplasms.

Next, ANGPT2 alleles in patients with malignant breast neoplasms were examined to clarify the role of ANGPT2 polymorphisms in clinical stage, primary tumor size, lymph node metastasis, distant metastasis, and pathological grade. For the 464 patients with malignant breast neoplasms, a significant correlation between rs1823375 homozygous variants (CC vs GG) and clinical stage (OR: 0.274, 95% CI: 0.081–0.927, p < 0.05) was observed, which persisted when estimated by multiple logistic regression models (AOR: 0.255, 95% CI: 0.074–0.876, p < 0.05) (Table 3).

Table 3 Odds ratios (ORs) and 95% confidence intervals (CIs) of clinical status and ANGPT2 rs1823375 genotypic frequencies in 464 patients with malignant breast neoplasms.

We further analyzed the correlation between clinical parameters and rs2442598, rs734701, rs1823375, rs11137037 and rs12674822 genotyping frequencies for the different breast cancer subtypes (Luminal A, Luminal B, HER2-enriched and TNBC) (Table 4). Among patients with the HER2-enriched subtype, those carrying the CG or the CG+GG allele at rs1823375 were significantly less likely than CC allele carriers to be of lymph node status N1–N3 (OR: 0.325; 95% CI: 0.112–0.940 and 0.313; 0.114–0.862, respectively) (Table 4).

Table 4 Odds ratios (ORs) and 95% confidence interval (CIs) of Ang2 genotypic frequencies and clinical subtypes in patients with malignant breast neoplasms.

We further explored the haplotypes to evaluate the combined effects of the ANGPT2 polymorphisms on malignant breast neoplasm susceptibility. The most common haplotype in the control cohort was A-C-C-A-G (21.4%), which was therefore chosen as the reference. Compared with the reference, the T-T-C-A-T ANGPT2 haplotype significantly increased the risk for developing a malignant breast neoplasm by 1.385-fold (95% CI: 1.025–1.871; p < 0.05) (Table 5). A reconstructed linkage disequilibrium (LD) plot of the five ANGPT2 SNPs is shown in Fig. 1. Our reconstructed results illustrate one haploblock in which rs734701 and rs2442598 showed 69% LD (Fig. 1).

Table 5 Distribution frequencies of ANGPT2 haplotypes in healthy controls and patients with malignant breast neoplasms.
Figure 1
figure 1

Linkage disequilibrium patterns of five single nucleotide polymorphisms in the ANGPT2 gene.

Discussion

Several genome-wide association studies have used genetic SNPs as potential markers for early diagnosis and targeted therapy in breast cancer24,25,26. A recent investigation attributed 18% of the familial risk of breast cancer risk to SNPs6, although any potential correlation between ANGPT2 gene polymorphisms and malignant breast neoplasms has not been explored so far. Previous studies have reported finding similar genetic characteristics in breast tumors for Chinese Han women and women of European ancestry27,28. ANGPT1, ANGPT2 and FGF2 have been linked in a US study to pathological complete responses to bevacizumab, the first approved recombinant humanized monoclonal antibody for breast cancer29, although most investigations into breast cancer genetic polymorphisms have focused on BRCA1/230,31. Our previous work found nonsynonymous ANGPT2 gene polymorphisms in lung and colorectal cancers13,20, which suggests that ANGPT2 SNPs might also be associated with a risk for breast cancer. It is therefore important and appropriate to investigate the impacts of ANGPT2 genetic variants upon the risk of malignant breast neoplasms among women of Chinese Han ethnicity.

Based on blood samples from 539 healthy, cancer-free women and 464 women with malignant breast neoplasms, we found no significant between-group distribution frequencies for any of the five ANGPT2 SNPs. We therefore sought to determine whether any of the ANGPT2 SNPs correlated with clinical status or the pathological grade of the tumors. In analyses adjusted for confounding factors, patients carrying the GG allele at rs1823375 were at lower risk for developing clinically staged breast cancer. Interestingly, our previous research revealed that this variant rs1823375 increased the risk of colorectal cancer in a Chinese Han cohort20. A possible explanation for this difference in risk between different cancer types is that gender has disparate effects, but this requires further research.

We also found that the variant rs1823375 lowered the risk of developing lymph node metastasis in patients with HER2+ disease. A recent study reported that stromal overexpression of the SNAIL transcription factor SNAI2 is linked to poor prognosis in patients with Luminal B HER2+ breast cancer, while the absence of SNAI2 in the stroma of Luminal B HER2+ breast tumor cells is associated with lower levels of plasma Ang232. Another study has indicated that ANGPT2 variants rs2515409 and rs13269021 are associated with pathological complete responses to bevacizumab treatment in breast cancer29. Accordingly, ANGPT2 mRNA levels and survival rates in women with breast cancer deserve more investigation in future clinical trials.

LD in the human genome is critical for genetic variation and features in the detection and treatment of disease33. Haplotype analyses can provide significant statistical power for clarifying the contribution of genes to disease susceptibility34. To analyze the common haplotypes, we used Haploview software and the PHASE program. As shown in Fig. 1, we identified three LD haploblocks (rs2442598, rs734701 and rs12674822) that were minimally associated with the risk of malignant breast neoplasms; other SNPs were outside the haploblock. Details of the underlying mechanisms require further investigations. When we analyzed the contributions of different haplotype combinations of the five ANGPT2 variants investigated in this study upon the risk of a malignant breast neoplasm, we found that the TTCAT haplotype increased the risk. It is possible that the ANGPT2 TTCAT haplotype is in LD with other functional polymorphisms that increase the susceptibility for a malignant breast neoplasm.

The main strength of our study is its systematic analysis of ANGPT2 SNPs and susceptibility to malignant breast neoplasms in our study population. Potential limitations include the possibility that the study observations merely reflect a cross-sectional relationship instead of actual causality. Moreover, the study participants were all enrolled from only one hospital, which gives rise to possible selection bias. Lastly, our modest sample size requires a larger, independent cohort study to confirm our findings.

Conclusion

In conclusion, our systematic genotyping results demonstrate an association between ANGPT2 gene variants and susceptibility for a malignant breast neoplasm and its progression among Chinese Han women. In this study, the ANGPT2 rs1823375 polymorphism appeared to be protective against clinically staged breast cancer and also lymph node disease, while the T-T-C-A-T ANGPT2 haplotype appeared to significantly increase the risk for developing a malignant breast neoplasm. These findings suggest that the rs1823375 polymorphism can serve as a diagnostic marker for malignant breast neoplasms.