Introduction

Thyroid cancer of follicular origin is the most common endocrine malignancy, with significantly increasing incidence. The majority of tumors are further classified in well-differentiated carcinomas (WDTCs) including papillary thyroid carcinomas (PTC; incidence, 80–90%) and follicular thyroid carcinomas (FTC; incidence, 10–15%). These are distinguished from poorly differentiated (PDTC; incidence, 1–6%) and considerably rarer anaplastic thyroid carcinomas (ATC; incidence, 1–2%) [1, 2]. The majority of WDTCs have a good prognosis, and PDTC an intermediate prognosis. ATC, however, present a nearly uniformly fatal disease, accounting for the majority of thyroid cancer-associated deaths [3, 4].

Diagnosis of an ATC, either arising de novo or as aggravation of a WDTC or PDTC, demands for urgent and radical surgical interventions due to the local aggressive behavior and early metastasis formation, but still therapy options remain palliative. Therefore, the possible accurate identification, even of already microscopic foci of ATC within lower-grade thyroid cancer, would bring a benefit to any patient in sufficient time [1, 3, 5].

Although, histological classification of thyroid cancer remains the gold standard for diagnosis, massive-parallel sequencing results are increasingly being considered. Mutations in BRAF, (H/K/N) RAS, and other genes are found to a certain degree in all thyroid cancers of follicle epithelium, including ATC. However, these insights have failed to greatly improve patient survival [3, 6, 7]. An FDA-approved therapy option targeting BRAF-mutated ATC by applying dabrafenib and trametinib [8] changed the therapeutic landscape of the disease, but continued dismal outcomes highlight the need to identify selective markers and reveal targetable genes expressed in ATC. To further point at specific marker-driven approaches for ATC therapy and diagnosis improvement, the investigation of the ATC transcriptome should be instrumental. To date, microarrays and RNA-sequencing (RNA-seq) datasets on thyroid carcinomas are available [6, 9], but did not elucidate factors exclusive to ATC. Notably, ATCs are not included in TCGA transcriptomic datasets limiting avenues for biomarker and drug development.

The oncofetal IGF2 mRNA binding protein 1 (IGF2BP1) is a bona fide oncofetal protein, upregulated in some advanced solid cancers. The protein promotes the expression of oncogenes like MYC, LIN28B, and SRF by impairing their mRNA decay [10,11,12]. Consistent with its role supporting oncogene expression, IGF2BP1 was reported as a posttranscriptional driver of tumor cell proliferation, migration, metastatic potential, and therapy resistance [13, 14]. In that respect, it represents a prognostic marker for low survival probability in ovarian and neuroblastic cancers [15, 16].

Here, we reveal that IGF2BP1 and MAGEA3 are the first reliable protein and RNA markers of ATC, specifically distinguishing this malignancy from any other thyroid cancer of follicular origin, including PDTC. Besides their diagnostic value, therapeutic targeting of both may provide a promising future perspective for the treatment of ATC, independent of mutational status.

Methods

Patient samples

For the test cohort ten human primary ATC, six PTC and six FTC samples were collected from 1999 to 2012 at the University clinic of Halle, Germany. In addition, six nonmalignant thyroid tissue samples (NT) samples served as healthy controls. Specimens were formalin-fixed and paraffinized for immunohistochemistry, or snap frozen in liquid nitrogen and stored at −80 °C. All samples were re-evaluated histologically and with review of patient records by two pathologists (NP and CW).

An independent in-house tissue microarray (TMA I) contained 147 primary thyroid cancer samples (20 ATC, 18 PDTC, 82 PTC, and 29 FTC) and 108 paired normal thyroid tissue samples of all entities. A commercial microarray (TMA II, TH8010a, Biomax) contained 6 primary ATC, 44 PTC, 20 FTC, and 10 unpaired normal tissue samples.

The clinical characteristics of the tumor cohorts are summarized in Table 1.

Table 1 Clinical characteristics of tumor cohorts.

Immunohistochemistry

Immunohistochemistry was performed on 3 µm thick, consecutive sections of formalin-fixed, paraffin-embedded samples with the Bond Polymer refine detection Kit (Leica, DS9800), according to the manufacturer’s instructions on a fully automated immunohistochemistry stainer (Leica Bond). Sections were imaged with an Olympus BX50/51 microscope. Two pathologists (US and MB), independently and blinded to the clinical data, scored all samples by using a Histoscore, as described previously [17]. In brief, the relative amount of tumor cells being positively stained (%) was multiplied by their intensity from 0 (negative), 1 (weak), 2 (moderate), to 3 (intense). Expression classified into absent (0), low (1–100), intermediate (101–200), or strong (201–300) overall expression. Antibodies are summarized in Supplementary Table S2.

Western blotting

For Western blotting, cells were lysed in lysis buffer (50 mM Tris-HCl (pH 7.4), 50 mM NaCl, 2 mM MgCl2, 1% SDS). Protein expression was analyzed by Western blotting with indicated antibodies (Supplementary Table 2), by an infrared scanner (LICOR).

Databases

For Kaplan–Meier analysis, patient survival was analyzed by using the cBioPortal platform (http://cbioportal.org), combining patient data from the TCGA for PTC and FTC and from the MSKCC (Memorial Sloan Kettering Cancer Center) for ATC.

For the analysis of microarray data derived from Landa et al. (GEO Series accession number GSE76039; [6]), GEO2R was used.

Deep-sequencing and differential gene expression

Total RNA was isolated from fresh frozen tissues by using the miRNeasy Kit (Qiagen), according to the manufacturer’s instructions. Total RNA-sequencing library preparation and sequencing was performed at the IKFZ (Leipzig, Germany). For total RNA-seq low-quality read ends as well as remaining parts of sequencing adapters were clipped using Cutadapt (v 1.4.2 or 1.6). Subsequently, reads were aligned to the human genome (UCSC GRCh19) using TopHat (v 2.0.12; [18]) or Bowtie2 (V 2.2.4; [19]), respectively. FeatureCounts (v 1.4.6; [20]) was used for summarizing gene-mapped reads. Ensembl (GRCh37.75; [21]) was used for annotations. Differential gene expression (DE) was determined by the R package edgeR (v 3.12.1; [22]) using TMM normalization, essentially as described previously [23].

Presented data have been deposited in NCBI’s Gene Expression Omnibus and are accessible through GEO Series accession number GSE126729. RNA-seq data are also available via the R2: Genomics Analysis and Visualization Platform (http://r2.amc.nl; datasets: “Tumor Thyroid Carcinoma – Huettelmaier”) for interactive use.

GSEA analysis

Gene set enrichment analyses (GSEA) were performed as described previously [24]. The gene set hallmarks collection (H) was used for a list of all protein-coding genes ranked according to fold changes. The respective data were visualized by using the R package clusterProfiler [25].

Statistics

Statistical analysis was performed using GraphPad Prism software (V7.0). Statistical significance was determined by using nonparametric Mann–Whitney test. Positive/negative predictive values (PPV, NPV) and diagnostic odds ratios (DOR) were determined by using MedCalc (V19.1.3). A principal component analysis was performed by using the R package pcaExplorer [26].

Results

IGF2BP1 is de novo expressed in ATC

The ATC is the most fatal thyroid malignancy (Supplementary Fig. S1a), but with the exception of few analyzed samples, comprehensive transcriptome analyses aiming to identify selective biomarkers are still rare [6, 9, 27, 28]. To identify novel markers of ATC, thyroid carcinoma gene expression was analyzed by RNA-seq in a test cohort (Table 1; clinical characteristics of the studied tumor cohorts). The protein-coding transcriptome of ten ATCs was compared with gene expression of six PTCs, six FTCs, and six NTs. A principle component analysis of transcriptome data illustrated that primary ATC samples cluster well together and are strikingly distinct from PTC, FTC and NT (Fig. 1a). In detail, a comparison of numbers on differentially expressed genes to NT revealed approx. 8000 differentially expressed genes (FDR ≤ 0.01) for ATC, but only approx. 500 and 100 for PTC and FTC, respectively (Supplementary Fig. S1b).

Fig. 1: IGF2BP1 is de novo expressed in ATC.
figure 1

a Principle component (PC) analysis on RNA-seq data derived from the test cohort, including 10 ATC, 6 PTC, 6 FTC, and 6 NT samples. b Dot plot presentation of cancer hallmark gene sets upon a GSEA for differentially expressed genes for 10 ATCs vs 18 noATCs (6 PTC, 6 FTC, and 6 NT samples), investigated in (a). NES = normalized enrichment score. c Volcano plot of log2 mRNA fold changes plotted against the −log10 FDR (false discovery rate) for 10 ATCs versus 18 noATCs, investigated in (a). Horizontal dashed line indicates threshold (false discovery rate, FDR ≤ 0.01). Indicated in red are the top 20 ATC-exclusive genes, identified as described in the text.

Cancer hallmark gene set enrichment analysis (GSEA) comparing ATC to PTC, FTC and NT, collectively referred to as “noATC,” dissected the molecular pathology aside frequently described mutations (Fig. 1b). These gene sets classify differentially expressed genes into well-defined hallmark pathways of cancer. Thus, essentially deregulated genes represented the invasive/pro-metastatic potential (epithelial-to-mesenchymal transition (EMT)), supported by dissociation of the apical junctional complexes, but also represented high rates of proliferation by ensuring fast cell cycle progression (G2M checkpoint; E2F targets). In support, genes directly activated by the MYC oncogene are upregulated, which also drive EMT and proliferation. ATCs are further distinguished by an increased expression of markers for inflammatory response and a severe alteration of metabolic processes, most prominently elevated glycolysis [29]. Pro-mesenchymal dedifferentiation of ATC was further supported by the reduced expression of thyroid markers like TSHR, epithelial markers like E-cadherin (CDH1) and the upregulation of stemness and EMT-associated markers MYC, SNAI2, TWIST1, OCT3/4 (POU5F1), LIN28B and NANOG (Supplementary Fig. S1c) [30,31,32]. Collectively, this indicated severe deregulation of the protein-coding transcriptome in ATC and suggested protein markers distinguishing this malignancy.

To identify exclusive protein markers of ATC, we assessed the de novo expression (mean FPM in noATC samples < 1; fold change in ATC against noATC samples > 50) of transcripts and evaluated the consistency of mRNA expression in ATC by ranking genes by increasing relative standard deviation (RSD) of expression in ATC (Fig. 1c; Table 2). The top 20-ranked de novo expressed protein-coding genes with low RSD of expression in ATC demonstrated the most consistent de novo expression for IGF2BP1 (Fig. 1c — top 20 genes in red). Interestingly, 12 of the 20 protein-coding genes, including IGF2BP1 and MAGEA (melanoma-associated antigen) proteins, are reported testis antigens [33]. These genes are of advanced interest in the focus of immunotherapy.

Table 2 Top 20 identified ATC-exclusive markers.

As our test cohort did not include PDTC samples, mRNA expression was analyzed in an independent microarray dataset, comparing the transcriptomes of ATC and PDTC [6]. Reinvestigation of this study revealed that IGF2BP1 mRNA was reliably observed in 40% (8/20) of ATC samples. In sharp contrast, IGF2BP1 mRNA remained at background levels in all 17 PDTC samples included in the study (Supplementary Fig. S1d). Similar findings could be drawn for the MAGEA representative MAGEA3, with 40% (8/20) positive ATC and 5.9% (1/17) PDTC samples (Supplementary Fig. S1e). These findings provided independent support of the gene expression analysis in the here presented test cohort suggesting that IGF2BP1, but also MAGEA3, are selective markers of ATC to distinguish this malignancy even from PDTC.

Exclusive expression of IGF2BP1 and MAGEA3 in ATC was initially validated by Western blotting in two ATC-derived cell lines (C643 and 8305C) and the individual samples of each thyroid cancer subtype comprised in the test cohort (Fig. 2a, b). The sharp and exclusive upregulation of both proteins in ATC samples was associated with enhanced expression of MYC, as well as the loss of CDH1 expression, confirming transcriptome studies. This suggested that IGF2BP1 provides a robust, positive marker for ATC at the mRNA as well as protein level. To evaluate this in further detail, representative tumor samples of the test cohort were analyzed by immunohistochemistry, confirming the selective de novo expression of IGF2BP1 protein in paraffinized ATC tissue (Fig. 2c; Supplementary Fig. S2). Importantly, in none of the other samples of the test cohort, IGF2BP1 protein expression was observed. This suggested IGF2BP1 as the first positive marker of ATC.

Fig. 2: IGF2BP1 protein expression is detectable by Western blot and IHC in ATC samples.
figure 2

a Representative Western blot analysis of indicated proteins in two ATC-derived cell lines (C643 and 8305C) and protein lysates of individual samples analyzed by RNA-seq in (a). VCL served as loading control. b Scatter dot plot presentation of quantified log2 protein expression, determined for test cohort samples, investigated in (a). c IGF2BP1 expression analyzed by immunohistochemistry in representative samples investigated in (a). HE, hematoxylin eosin staining. Scale bars, 100 µm. Statistical significance was determined by Mann–Whitney test in (d) (***p ≤ 0.001; *p ≤ 0.05; n.s. not significant).

IGF2BP1 de novo expression is unlikely a consequence of chromosomal aberrations

Recently, the IGF2BP1 gene locus, located on the long arm of chromosome 17 (17q21.32), was found to be commonly gained and associated with poor survival probability in breast cancer and neuroblastoma [16, 34]. To further elucidate, whether alterations in copy numbers could be associated with de novo expression of IGF2BP1, shallow whole genome sequencing (sWGS) of the ATC samples from the initial test cohort was performed. However, copy numbers of IGF2BP1 gene locus remained unchanged in 90% (9/10) of all ATC samples from the test cohort (Supplementary Fig. S3). Remarkably, in one tumor we detected a breakpoint at the IGF2BP1 locus (sample #5).

IGF2BP1 distinguishes ATC from other thyroid carcinoma of follicular origin

To investigate the potential use of IGF2BP1 as a diagnostic marker of ATC, two independent thyroid carcinoma validation cohorts, one previously assembled in-house tissue microarray (TMA I: 20 ATC, 147 tumor samples total and 108 paired NT samples) and a commercial tissue microarray (TMA II: 6 ATC, 70 tumor samples total, and 10 unpaired NT samples) were analyzed for IGF2BP1 protein expression by immunohistochemistry (Table 3). In addition, the analysis of MAGEA3, as a second promising marker, and MYC, as a well-known upregulated gene in high-grade thyroid carcinoma [35], was considered to be included into the study. IGF2BP1 protein expression, determined by Histoscores, was observed in 70% (14/20) of analyzed ATC in TMA I and 50% (3/6) in the TMA II (Fig. 2a, b; Supplementary Fig. S4a–c). Less stringent and consistent upregulation was also observed for MAGEA3 in 35% (7/20) ATC in TMA I, whereas expression in TMA II could only be detected in 16.7% (1/6) of ATC samples. Further, MYC expression was detectable in the majority of ATC samples (TMA I: 75%, 15/20; TMA I: 33.3%, 2/6). However, MYC expression was, to a lower extend, also observed in all other types of thyroid carcinoma, excluding this oncogene as a selective marker.

Table 3 Detectable expression of indicated proteins from the tumor cohorts.

Importantly, IGF2BP1 protein expression could not be observed in any other tested thyroid tissue/tumor sample, except for 5.6% (1/18) of PDTC samples with a Histoscore < 100. MAGEA3 protein expression was detectable in 0% (0/18) and MYC in 11.1% (2/18) of all PDTC samples.

To investigate IGF2BP1, but also MAGEA3 and MYC detection toward usability for diagnostic applications, we determined PPV and NPV, as well as DOR by testing with binary classification for the combination of the test cohort, TMA I and II. In sum, 75% (27/36) ATC samples were positive for IGF2BP1, whereas 5.6% (1/18) PDTC, 0% (0/132) of PTC and 0% (0/55) FTC samples revealed detectable IGF2BP1 protein expression. Diagnostic tests revealed overall PPV and NPV of ~100% and an exceptional overall DOR of 612 (95% CI: 74.6–5021) to identify ATC by IGF2BP1 detection (Fig. 3b, c). For MAGEA3 comparable PPV and NPV, but also a DOR of 411 (95% CI: 23.8–7098.7) were determined. Some detectable MYC protein expression in PDTC, PTC, and FTC led to a lower PPV of 61.9% (95% CI: 49.3–73.1) and a DOR of 30.7 (95% CI: 12.6–74.8). In conclusion, this indicated the potential use of IGF2BP1, but also MAGEA3, expression for discriminating ATC from other thyroid malignancies, including PDTC.

Fig. 3: IGF2BP1 specifically identifies ATC of any other thyroid carcinoma of follicular origin by IHC.
figure 3

a IGF2BP1 expression analyzed by immunohistochemistry in representative samples derived from tissue microarray (TMA) I. HE, hematoxylin eosin staining. Scale bars, 100 µm. b Percentage view of IGF2BP1, MAGEA3 and MYC-histoscores for TMA I. Sample numbers are indicated.

This was strikingly supported by investigating IGF2BP1 protein expression in a patient-derived ATC sample with PTC content (Fig. 4c). In support of the ATC-selective expression of IGF2BP1, de novo expression was exclusively observed in the ATC area.

Fig. 4: IGF2BP1 and MAGEA3 perform well as positive markers for ATC diagnosis.
figure 4

Plots of positive/negative predictive values (PPV/NPV) (a) and diagnostic odds ratio (DOR) (b) values for ATC diagnosis determined for indicated proteins by including all patient samples from the test cohort, tissue microarray I and II. Error bars indicate 95% confidence intervals (95% CI). c Detection of IGF2BP1 protein expression analyzed by immunohistochemistry on a patient-derived sample with ATC and PTC content.

Discussion

ATC is the most lethal malignancy of the thyroid, still lacking robust positive markers. In contrast to WDTC, the ATC is characterized by a rapid invasive growth, early metastasis and severe therapy resistance. Therefore, surgery in a limited stage is often the only potentially curative option [1, 2]. Aiming at a specific marker-driven approach to improve early ATC diagnosis, we combined a comparative RNA-seq analysis of distinct thyroid carcinomas of follicular origin and immunohistochemistry within a single methodological pipeline. This revealed robust and exclusive de novo expression of IGF2BP1, providing the first positive marker of this malignancy suitable for diagnosis on the mRNA and protein level.

Comparative transcriptome analyses and a subsequent cancer hallmark GSEA clearly dissected the molecular pathology of this distinct thyroid carcinoma from other subtypes, besides its frequently described mutations [7]. The GSEA outlines the molecular causes of histologic features for high rates of proliferation, but also an invasive behavior and in sum its high grade of dedifferentiation from its thyroid origin.

Further, the RNA-seq indicated de novo expression of testis antigens including MAGEA3 and IGF2BP1 as outstanding markers of ATC.

By sWGS we demonstrated that the de novo expression of IGF2BP1 in ATC can not be explained by gene gain, as previously observed in other tumors [16, 34]. Copy numbers of the IGF2BP1 gene locus remained unchanged in 90% (9/10) amongst ATC samples. But the here presented observation fits previous reports on copy number alterations in ATC, showing that the chromosomal region 17q21 is not frequently gained or lost [7, 9, 36]. This suggested that the de novo expression of IGF2BP1 in ATC largely results from epigenetic and/or transcriptional deregualtion.

In two independent TMAs immunohistochemistry analyses demonstrated that the majority of ATC samples were IGF2BP1 positive. This rate likely will be improved by optimizing the sensitivity of immunostaining, since IGF2BP1 mRNA expression was observed in all analyzed ATC from the initial test cohort. IGF2BP1 also performed well against MAGEA3 and MYC, for which less pronounced intensities were observed in fewer ATC and also some WDTC samples. Supporting a differential ATC diagnosis from PDTC, only 5.6% (1/18) of PDTC samples identified with a low IGF2BP1 histoscore by immunohistochemistry. Its mRNA expression was completely absent in an independent microarray analysis. Comparable findings were made in terms of MAGEA3 detection.

Accordingly, the protein detection of IGF2BP1 and MAGEA3 appeared to be suitable for the diagnosis of ATC with exceptional results from diagnostic tests, including PPV, NPV, and DOR. Finally MYC immunoreactivity could be excluded to be useful for differential diagnosis of ATC, although it was recently reported to correlate with dedifferentiation in thyroid neoplasias [35]. Still, IGF2BP1 revealed the highest consistency.

Further, IGF2BP1-positive samples identified in TMA I showed distinct WDTC/PDTC or just ATC content (Supplementary Table S1). Thus, we can largely exclude that IGF2BP1 expression in ATC is dependent on the disease origin. Nevertheless, our study stresses that IGF2BP1 immunohistochemistry has the potential to help not only defining a diagnosis but also to identify early dedifferentiation in areas of solid histoarchitecture in case of WDTC or even PDTC to prevent underestimation of tumor severity.

Notably, the histological diagnosis of ATC can be challenging due to heterogeneous histological appearance and similarity to cancers like the undifferentiated lung adenocarcinoma, lymphoma, or thyroid angiosarcoma [1, 37]. One should consider that IGF2BP1 was reported to be de novo expressed in several high-grade malignancies [14, 38]. Thus, IGF2BP1 detection could potentially lead to false-positive diagnosis in the case of a rarely observed lung cancer-derived metastasis to the thyroid [1, 39]. Nonetheless, established markers for immunostaining of ATC samples are cytokeratins and the only retained thyroid-specific transcription factor PAX8. These markers can often present with weak and focal immunoreactivity [1, 40]. Thus, positive markers will, again, help defining a diagnosis.

In view of the probably mutation-independent and sharp upregulation of IGF2BP1 in ATC, as well as its frequently reported expression in other aggressive cancers [14, 38], our study strongly suggests expediting the clinical evaluation and also improvement of IGF2BP1-directed inhibitors in cancer therapy. In this respect, the IGF2BP1-specific inhibitor BTYNB has recently been developed, which could prove promising in preclinical investigations [41].

Interestingly, we identify MAGEA proteins, including MAGEA3, which is induced in a variety of metastatic cancers and has been targeted most recently in a phase-II clinical trial [42]. However, it failed in an extensive phase-III clinical trial in immunotherapy, but remains as a promising candidate for novel targeted treatment opportunities of ATC [43].

Conclusively, our study provides new insights into the whole transcriptomic landscape of this highly aggressive neoplasia, which is distinct from other thyroid carcinomas, besides its frequently reported characteristic mutational burden, including TP53 or TERT-promoter mutations [1, 6, 7]. In consequence IGF2BP1, but also MAGEA3 are promising new candidates for fast clarification of disease severity, other than the established markers, although the here presented data requires additional confirmation by staining of whole sections and the conduction of extended studies in future.