Introduction

Glioblastoma multiforme (GBM) represents the most common and lethal form of primary brain tumor with an annual incidence of 5.26 per 100,000 people1,2 and it stands for more than 60% of all brain tumors in adults3. Although a significant number of modern therapies against GBM is available, it is still a deadly disease with a poor prognosis4. Precise chromosomal segregation in dividing cancer cells as well as disturbances during the spindle assembly checkpoint can contribute to malignant transformation5. Genetic modifications in mitotic genes could increase sensitivity to neoplastic transformation through alterations of gene expression profiles6,7. Aurora kinases are members of serine-threonine kinases family which are of great importance for the cell cycle control8. Aurora kinase A (AURKA) is involved in proper functioning of a few oncogenic signaling processes such as mitotic entry, spindle assembly, centrosome functioning, chromosome alignment and/or segregation and cytokinesis9,10,11. Aurora kinase B (AURKB) is a component of chromosomal passenger complex and mediates in chromatin modification, spindle checkpoint regulation, cytokinesis and correct kinetochore/microtubule attachment9,12. Aurora kinase C (AURKC) is also a member of chromosomal passenger complex which takes part in mitotic events such as accurate centrosome functioning13, and is required to regulate chromosome segregation during meiosis I14. Polo-like kinase 1 (PLK1) is engaged in several cellular processes including centrosome maturation, mitotic checkpoint activation and spindle assembly, kinetochore/microtubule binding, cytokinesis and cellular proliferation15,16,17. PLK1 overexpression is proved to be associated with poor prognosis in several cancer entities18. Moreover, it has been demonstrated that polymorphisms in PLK1 affect its expression, thus possess the ability to potentially influence the risk of disease onset and progression18.

In our case–control study, we evaluated the impact of single nucleotide polymorphisms rs1047972, rs2273535, rs8173 and rs911160 (AURKA), rs2289590 and rs2241909 (AURKB), rs11084490 and rs758099 (AURKC) and rs42873 (PLK1) in mitotic checkpoint genes on glioblastoma multiforme development in Bosnia and Herzegovina population. Using bioinformatic analysis of genetic variants, we estimated the impact of the polymorphic DNA sequences in introns and untranslated regions (UTRs) within AURKA, AURKB, AURKC and PLK1 genes on transcription factors binding sites.

Methods

Design of the study and study groups

Our study group consisted of 129 patients with diagnosed glioblastoma multiforme (GBM) at the Clinical Pathology and Cytology at the University Clinical Center Sarajevo, Bosnia and Herzegovina. Of that, 68 were men and 61 women, with a mean age of 58 years at the moment of diagnosis (data were missing in 4 cases) (Table 1). The formalin fixed paraffin embedded (FFPE) cancer tissue sections were collected in the course of surgical procedures. Written informed consents, which allow the use of samples in this study, were obtained from all patients prior to the surgery. On the other side, 203 healthy blood donors (ethnicity matched to the cases), upon regular medical examinations, were randomly selected and signed up as a control group for the present study. Control samples had no history of neoplastic formation, were not related to the patients and/or to each other. Three milliliters of blood were taken from each control individual and kept at − 80 °C. An informed written consent was obtained from the participants, with personal and medical information being enciphered in order to ensure maximum anonymity in compliance with the World Medical Association’s Declaration of Helsinki. This study was approved by the University Clinical Centre Sarajevo Ethical Committee (No. 0302-36765).

Table 1 General information for glioblastoma multiforme patients.

DNA extraction

Genomic DNA from glioblastoma multiforme (GBM) formalin fixed paraffin embedded tissues was extracted using the Chemagic FFPE DNA Kit special (PerkinElmer Inc., Waltham, MA, USA). DNA washing and elution was performed on Chemagic Magnetic Separation Module I robot (PerkinElmer Inc., Waltham, MA, USA), following manufacturer’s recommendations. All sample transfers were conducted with the four-eye principle to avoid mixing errors of the samples. DNA from peripheral blood lymphocytes (controls) was isolated using the Promega™ Wizard™ Genomic DNA Purification Kit Protocol (Promega Corp., Fitchburg, WI, USA) in concordance with the manufacturer’s instructions. The qualitative/quantitative analysis of the extracted DNA was performed on the DropSense96 photometer (Trinean, Gentbrugge, Belgium) and Synergy™ 2 Multi Mode Reader (BioTek, Inc., Winooski, VT, USA).

SNP selection

In total, nine single nucleotide polymorphisms (SNPs) in segregation genes, precisely rs1047972, rs2273535, rs8173 and rs911160 (AURKA), rs2289590 and rs2241909 (AURKB), rs11084490 and rs758099 (AURKC) and rs42873 (PLK1) were chosen. The locations of the selected variants in mitotic genes are shown in Fig. 1, whereby gene structures were obtained from the Research Collaboratory for Structural Bioinformatics (RCSB) Protein Data Bank (PDB)19. The parameters described below are used for the selection of the genetic variants: (1) previously established association related to certain tumors; (2) minor allele frequency (MAF) fewer or equal to 10% in the population of Utah residents with Northern and Western European ancestry (CEU) as highlighted by the Phase 3 1000 Genomes; and (3) tagging single nucleotide polymorphisms (tagSNPs) status, which was computed in silico using LD Tag SNP Selection (tagSNP) (https://snpinfo.niehs.nih.gov)20. To predict tagSNPs status, following parameters were used: (a) 1 kb of the upstream–downstream sequences from gene; (b) linkage disequilibrium (LD) lower threshold of 0.8; and (c) minor allele frequency range from 0.05 to 0.5 for the CEU subpopulation (Table 2; Fig. 2).

Figure 1
figure 1

The positions of rs1047972, rs2273535, rs8173 and rs911160 (AURKA), rs2289590 and rs2241909 (AURKB), rs11084490 and rs758099 (AURKC) and rs42873 (PLK1) genetic variants within mitotic checkpoint genes. White boxes represent untranslated regions (UTRs). Blue boxes refer to protein coding regions (exons). The black lines connecting the boxes indicate non-coding regions (introns). The gene structures were downloaded from the Research Collaboratory for Structural Bioinformatics (RCSB) Protein Data Bank (PDB), GRCh38 Genome Assembly.

Table 2 Basic characteristics of the studied genetic variants.
Figure 2
figure 2

Minor allele frequencies (MAFs) for polymorphisms rs1047972, rs2273535, rs8173 and rs911160 (AURKA), rs2289590 and rs2241909 (AURKB), rs11084490 and rs758099 (AURKC) and rs42873 (PLK1) in various populations. ALL all individuals from 1000 Genome Project Phase 3 release, Ctrl studied control population, CEU Utah residents with Northern and Western European ancestry, EUR European population, GBM studied glioblastoma multiforme group, MAF minor allele frequency.

Genotyping of SNPs

Genotyping of the studied variants was performed using TaqMan SNP genotyping assays (Applied Biosystems, Foster City, CA), whose ID numbers are shown in Table 2. The polymerase chain reaction (PCR) mixtures (5 µl for GBM samples and 10 µl for the control samples) consisted of 20X TaqMan® assay along with 2X Master Mix (Applied Biosystems, Foster City, CA), and 20 nanograms of genomic DNA. PCR profile was conducted following manufacturer’s recommendations, hence initial denaturation at 95 °C for 10 min, 45 cycles at 92 °C for 15 s and 60 °C for 90 s, using the ViiA 7 Real Time PCR System (Applied Biosystems, Foster City, CA). At least two negative controls were included in each plate. The results of the PCR reaction were analyzed using TaqMan® Genotyper Software (Applied Biosystems, Foster City, CA, USA).

Statistical analysis

The genotype frequencies of the polymorphisms, for both case and control populations were tested for Hardy–Weinberg equilibrium (HWE) using Michael H. Court’s online HWE calculator (http://www.tufts.edu)21. Significance of the differences in genotype frequencies between GBM patients and controls was determined by use of the Chi-square test or Fisher’s exact test. Multinomial logistic regression was used to test the association between investigated genetic variants and the GBM risk. In this regard, odds ratio (OR) with 95% confidence interval (CI) were calculated to evaluate the relative risk. Statistical analyses were performed using SPSS 20.0 software package (SPSS, Chicago, IL, USA). P ≤ 0.05 was chosen as a threshold significance value. Minor allele frequency (MAF) plot was created in R22 using ggplot2 R package23.

Analysis of haplotypes

To determine the haplotype block structure and perform haplotype analysis, which included corrections for multiple comparisons by 10,000 permutations, Haploview software, version 4.224 and SNP tools V1.80 (MS Windows, Microsoft Excel) were used. In order to create the haplotype block, solid spine of the linkage disequilibrium (LD) algorithm with a minimum Lewontin’s D′ value of 0.8 was chosen.

In silico analysis of polymorphisms

Effects of the polymorphic DNA sequences [polymorphisms in non-coding and untranslated regions (UTRs)] on transcription factors binding sites (TFBSs) were assessed in silico. Bioinformatic functional assessment was conducted using PROMO (ALGGEN) software, which is using data from TRANSFAC database V8.325,26. FASTA sequences for the studied variants were extracted from Ensembl release 98 (http://www.ensembl.org/index.html)27. Identification of TFBSs was determined in concordance with the following criteria: human species, all sites and factors.

Results

Genotypes frequencies for studied SNPs

For all the investigated polymorphisms, rs1047972 (AURKA), rs2273535 (AURKA), rs8173 (AURKA), rs911160 (AURKA), rs2289590 (AURKB), rs2241909 (AURKB), rs11084490 (AURKC), rs758099 (AURKC) and rs42873 (PLK1) was determined to be in Hardy–Weinberg equilibrium (HWE) in both, case and control groups (P > 0.05) (Table 3). After Chi-square test and Fisher exact test were performed to calculate distribution at genotype level (results summarized in Table 3), a significant difference in genotypes frequencies between GBM patients and controls for rs2289590 in AURKB (P = 0.038) was detected.

Table 3 Genotypes frequencies and Hardy–Weinberg equilibrium for the studied polymorphisms.

Impact of polymorphisms on glioblastoma multiforme risk

Patients with rs2289590 (AURKB) heterozygous AC genotype had a lower risk of glioblastoma multiforme (GBM) development in comparison with the reference AA genotype (OR = 0.54, 95% CI = 0.33–0.88, P = 0.015) (Table 4). Furthermore, the rs11084490 (AURKC) CG genotype was also associated with a decreased GBM risk in comparison with the reference CC genotype (OR = 0.57, 95% CI = 0.34–0.95, P = 0.031).

Table 4 Risk of glioblastoma multiforme associated with the studied genetic variants.

On the other side, no significant effects on GBM susceptibility were revealed for rs1047972, rs2273535, rs8173 and rs911160 in AURKA, rs2241909 in AURKB, rs758099 in AURKC and rs42873 in PLK1 (P > 0.05).

Haplotype analysis

After collecting raw genotyping data for the investigated SNPs in AURKA, namely rs1047972, rs2273535, rs8173 and rs911160, we carried out haplotype analysis using the Haploview software. The outcome of this analysis revealed that no haplotype block was created with an average Lewontin’s D < 0.8 (Fig. 3), therefore no haplotypes were accessible for the examination of their potential association with GBM risk.

Figure 3
figure 3

Linkage disequilibrium among single nucleotide polymorphisms in the AURKA gene. The color plot represents Lewontin's D′ values and logarithm of odds (LOD). Blue squares, LOD < 2 and D′ < 1; Green squares, LOD ≥ 2 and D′ < 1. The values within the squares refer to the Lewontin's D′ × 100.

Bioinformatic analysis of the polymorphisms

In silico analysis revealed that polymorphic sequences in transcription factors binding sites (TFBSs), within non-coding and untranslated regions (UTRs) of AURKA, AURKB, AURKC and PLK1 genes, bind various transcription factors (TFs). Our results showed that the region comprising G allele of rs911160 (AURKA) was linked with C/EBPalpha, C/EBPbeta and GR-beta proteins, while for the C allele, extra binding sites for NF-Y, NFI-CTF and NF-1 were recognized (Table 5). As for rs2289590 (AURKB), an additional motif for YY1 binding was identified when C allele was taken into account. In the case of rs11084490 (AURKC), there were no observed differences in transcription factor binding site motif (XBP-1), when different alleles, either C or G, were present. The region including C allele of rs758099 (AURKC) was related with binding sites for NF-1, NF-Y, XBP-1, ENKTF-1, CTF, PEA3 and POU2F1, while for the region surrounding T allele, NF-1, NF-Y, GATA-1 and TFII-I transcription factors were detected. For the polymorphic sequence which include the G allele of rs42873 (PLK1) was demonstrated to be linked with an additional recognition motif for c-Jun DNA-binding factor.

Table 5 Bioinformatic analysis of the studied genetic variants.

Discussion

Our study focused on the assessment of an association between polymorphisms rs1047972, rs2273535, rs8173 and rs911160 (AURKA), rs2289590 and rs2241909 (AURKB), rs11084490 and rs758099 (AURKC) and rs42873 (PLK1), and a risk of glioblastoma multiforme (GBM) development in the population of Bosnia and Herzegovina.

Aurora kinase B (AURKB) is a part of chromosomal passenger complex (CPC), which covers processes such as the segregation of chromatids, cytokinesis and histone modifications28 and for which has been proven to be overexpressed in different types of cancers including brain, prostate and thyroid29. Furthermore, it has been suggested that aurora B overexpression induces abnormalities in chromosome segregation, aneuploidy and tumor development30. We examined the rs2289590 polymorphism in AURKB, and after Chi-square and Fisher exact tests were performed, a significant difference in genotypes frequencies between GBM patients and control group was observed. Additionally, a protective role of the rs2289590 AC genotype against higher GBM risk was found. In silico analysis of rs2289590 polymorphic region detected additional binding site for the Yin-Yang 1 (YY1) transcription factor, in the presence of C allele.

The YY1 transcription factor is implicated in the regulation of basic processes such as development, cell growth and differentiation, cell cycle progression and apoptosis whereby, it has been demonstrated that YY1 overexpression is linked to an uncontrolled cell proliferation, resistance to apoptotic stimuli and metastasis, thus influencing the process of carcinogenesis itself31,32. Transcription factors (TFs) are crucial gene regulators with unique roles during the cell cycle and when their expression is impaired, they fail to provide accurate cellular functioning and stability, which could lead to neoplastic transformation32,33. Single nucleotide polymorphisms (SNPs) in regulatory domains can disturb gene expression profile through potential disruption of sequence specific DNA-binding motifs (removing existing and/or creating new ones), therefore altering the binding of correct TFs34,35. Moreover, it has been suggested that introns, especially long ones, carrying more functional cis-acting elements could accommodate several TFs binding sites, and consequently affect transcription regulation36. Our results for the rs2289590 intron variant in AURKB suggested that binding of an extra YY1 transcription factor when C allele is present, could alter AURKB expression, which might result in lower susceptibility to GBM occurrence. Roles of introns in transcription regulation have been reported in cell cycle and apoptotic genes, emphasizing the significance of intronic genetic variants in carcinogenesis37. In addition to this, SNPs in introns can be used as molecular markers for disease susceptibility and/or as targets in the development of new therapeutics38.

Aurora kinase C (AURKC) is a member of a chromosomal passenger complex, similarly as Aurora kinase B, which plays important role in mitotic events, segregation and centrosome functioning during meiotic events13,14. In cancer cells, the subcellular localization of AURKC is the same as that of AURKB suggesting that they could have similar functions39. AURKC overexpression has been observed in malignant thyroid cell lines and tissues40. Further, it has been demonstrated that AURKC overexpression stimulate centrosome amplification, multinucleation and that its aberrant expression in somatic cells has an oncogenic potential41. In this study, we assessed potential relationship between rs11084490 in AURKC and GBM risk. A link between heterozygous CG genotype and decreased GBM risk was observed. Polymorphism rs11084490 is located within the AURKC 5′ untranslated region. Untranslated regions (UTRs) play role in posttranscriptional regulation of gene expression by a modulation of mRNA stability, nucleo-cytoplasmatic transport, subcellular localization and translation efficiency, thus are involved in fine control of protein product and may affect the quantity and quality of the protein encoded42. Several eukaryotic 5′UTR elements/structures, such as RNA G-quadruplexes (RG4s), hairpins, upstream open reading frames (uORFs) and start codons, Kozak sequences around the initiation codons, iron responsive elements (IREs) and internal ribosome entry sites (IRESs), highly affect mRNA translation43. It has been shown that 5′ uORF-altering SNPs and mutations, by disrupting motifs within 5′UTR alter downstream protein expression, thus are capable of causing modified effects in terms of susceptibility to certain diseases such as esophageal cancer, multiple myeloma and many others44,45. Hence, observed association of the rs11084490 (AURKC) polymorphism with a decreased GBM risk in our study, could be due to altered AURKC translation mediated by heterozygous (CG) genotype affecting some of the above-mentioned functional motifs in AURKC 5′UTR.

Conclusion

The results of the present study demonstrated that AURKB (rs2289590) and AURKC (rs11084490) polymorphisms reduce the risk of glioblastoma multiforme development. These findings undoubtedly indicate the existence of the possible positive roles of genetic variations in AURKB and AURKC genes during brain carcinogenesis. Our data could be beneficial to the future assessments of the functional impact of these polymorphisms. However, our study is based on a reduced number of cases which in a way represents its limitation, and it is therefore necessary that larger prospective studies confirm these allegations.