Revisiting the transcriptional analysis of primary tumours and associated nodal metastases with enhanced biological and statistical controls: application to thyroid cancer

Tarabichi, M; Saiselet, M; Trésallet, C; Hoang, C; Larsimont, D; Andry, G; Maenhaut, C; Detours, V

doi:10.1038/bjc.2014.665

Download PDF

Full Paper
Open access
Published: 12 May 2015

Revisiting the transcriptional analysis of primary tumours and associated nodal metastases with enhanced biological and statistical controls: application to thyroid cancer

M Tarabichi¹,
M Saiselet¹,
C Trésallet²,
C Hoang²,
D Larsimont³,
G Andry⁴,
C Maenhaut^1,5^na1 &
…
V Detours¹^na1

British Journal of Cancer volume 112, pages 1665–1674 (2015)Cite this article

2163 Accesses
45 Citations
1 Altmetric
Metrics details

Subjects

This article has been updated

Abstract

Background:

Transcriptome profiling has helped characterise nodal spread. The interpretation of these data, however, is not without ambiguities.

Methods:

We profiled the transcriptomes of papillary thyroid cancer nodal metastases, associated primary tumours and primary tumours from N0 patients. We also included patient-matched non-cancerous thyroid and lymph node samples as controls to address some limits of previous studies.

Results:

The transcriptomes of patient-matched primary tumours and metastases were more similar than those of unrelated metastases/primary pairs, as previously reported in other organ systems. This similarity partly reflected patient background. Lymphoid tissues in the metastases confounded the comparison of patient-matched primary tumours and metastases. We circumvented this with an original data adjustment, revealing a differential expression of stroma-related gene signatures also regulated in other organs. The comparison of N0 vs N+ primary tumours uncovered a signal irreproducible across independent data sets. This signal was also detectable when comparing the non-cancerous thyroid tissues adjacent to N0 and N+ tumours, suggesting a cohort-specific bias also likely present in previous similarly sized studies. Classification of N0 vs N+ yielded an accuracy of 63%, but additional statistical controls absent in previous studies revealed that this is explainable by chance alone. We used large data sets from The Cancer Genome Atlas: N0 vs N+ classification was not better than random for most cancers. Yet, it was significant, but of limited accuracy (<70%) for thyroid, breast and head and neck cancers.

Conclusions:

The clinical potential of gene expression to predict nodal metastases seems limited for most cancers.

Comprehensive bioinformatics analysis unveils THEMIS2 as a carcinogenic indicator related to immune infiltration and prognosis of thyroid cancer

Article Open access 08 April 2024

Single-cell transcriptomic analysis of the tumor ecosystems underlying initiation and progression of papillary thyroid carcinoma

Article Open access 18 October 2021

Expressed prognostic biomarkers for primary prostate cancer independent of multifocality and transcriptome heterogeneity

Article 22 February 2022

Main

Thyroid cancers are the most frequent endocrine malignancies. Eighty per cent of them are of the papillary subtype (PTC). In these cancers, regional lymph node metastasis is correlated with poorer survival (Lundgren et al, 2006). Therapeutic dissection of lymph nodes within the central neck area improves disease outcome (Nixon and Shaha, 2013; Venkat and Guerrero, 2013) and is recommended by current international guidelines (Pacini et al, 2006; American Thyroid Association (ATA) Guidelines Taskforce on Thyroid Nodules and Differentiated Thyroid Cancer et al, 2009), whereas the benefit of prophylactic dissection remains a matter of heated debate (Nixon and Shaha, 2013; Venkat and Guerrero, 2013). The biology underlying the nodal dissemination of PTC is poorly understood.

A number of cancer studies in other organ systems have explored with transcriptome profiling the relationship between primary tumours and their associated nodal metastases and the characteristics of those primary tumours producing nodal metastases (Huang et al, 2003; Kikuchi et al, 2003; Nagata et al, 2003; Hao et al, 2004; Takada et al, 2004; Tamoto et al, 2004; Croner et al, 2005; Hoang et al, 2005; O’Donnell et al, 2005; Weigelt et al, 2005; Xi et al, 2005; Bidus et al, 2006; Roepman et al, 2006a; Wang et al, 2006a; Feng et al, 2007; Inamura et al, 2007; Méndez et al, 2007, 2011; Nguyen et al, 2007; Pei et al, 2007; Suzuki and Tarin, 2007; Kashiwazaki et al, 2008; Kroon et al, 2008; Vecchi et al, 2008; Ellsworth et al, 2009; Wong et al, 2009; Xie et al, 2010; Edfeldt et al, 2011; Smeets et al, 2011). A recurrent finding was that molecular phenotypes of nodal metastases are similar to those of the primary tumours they originate from (Perou et al, 2000; Hao et al, 2004; Hoang et al, 2005; O’Donnell et al, 2005; Weigelt et al, 2005; Roepman et al, 2006a; Feng et al, 2007; Inamura et al, 2007; Vecchi et al, 2008; Ellsworth et al, 2009). This has been interpreted as evidence that most cells in the primary tumour have the potential to seed metastases (Hao et al, 2004; Hoang et al, 2005; Inamura et al, 2007; Ellsworth et al, 2009), in contrast with the classical view that a few cells acquire metastatic capacity within the primary tumour (Fidler and Kripke, 1977). Furthermore, several studies defined the biological context of nodal spread by pointing out transcription of genes related to the immune response (Weigelt et al, 2005; Wang et al, 2006a; Vecchi et al, 2008; Ellsworth et al, 2009; Xie et al, 2010) and the stroma (Hao et al, 2004; O’Donnell et al, 2005; Weigelt et al, 2005; Roepman et al, 2006a; Feng et al, 2007; Suzuki and Tarin, 2007; Vecchi et al, 2008; Ellsworth et al, 2009) differing between primary and metastatic samples. Finally, the prediction of nodal spread from primary tumour yielded conflicting results ranging from no classification (Xi et al, 2005; Kroon et al, 2008) to perfect classification of N0 (without apparent nodal spread) and N+ (with nodal spread) diseases (Kikuchi et al, 2003; Tamoto et al, 2004; O’Donnell et al, 2005; Roepman et al, 2005; Inamura et al, 2007; Pei et al, 2007; Kashiwazaki et al, 2008; Méndez et al, 2011). Whether these discrepancies reflect technical issues or a cancer type-specific classification potential remains unclear.

Data presented in these studies are not without ambiguities. Patient’s genetic and physiological background could both contribute to the transcriptional similarity between primary tumours and patient-matched nodal metastases. The immune-related signals reported in nodal metastasis could, as noted by several authors, reflect the presence of lymphoid tissues alongside the metastasis. Laser capture microdissection has been proposed to address this ambiguity in some studies (Hoang et al, 2005; Wang et al, 2006a; Inamura et al, 2007; Nguyen et al, 2007; Ellsworth et al, 2009), whereas others have advocated the analysis of bulk tissues because they better reflect the wider context of metastasis (Hoang et al, 2005; O’Donnell et al, 2005; Méndez et al, 2007). The N0 vs N+ classification problem has been approached with a very wide range of statistical methods and validation procedures in addition to differences in lymph node screening protocols.

This study applies the transcriptomic approach to the analysis of nodal metastatic spread in PTCs. Numerous biological and computational controls have been included to address the ambiguities of previous studies.

Materials and methods

Samples and patients

Samples derived from 31 patients diagnosed for PTC were obtained from the Jules Bordet Institute (Brussels, Belgium) and La Pitié-Salpêtrière Hospital (Paris, France). They were selected on the basis of RNA quality rather than clinicopathologic parameters. Protocols were approved by the ethics committees of the institutions.

Extraction and quality assessment

RNA was extracted with RNeasy Mini Kit columns (Qiagen, Hilden, Germany). The histology of each sample was verified using haematoxylin- and eosin-stained sections. Percentages of tumour cells were estimated and, when present, the percentages of adjacent non-tumour cells or lymphocyte infiltration were quantified. Tumours with <70% cancer cells were discarded. RNA concentrations were measured using NanoDrop ND-1000 spectrophotometer (Life Technologies, Grand Island, NY, USA). RNA integrity was assessed using an automated gel electrophoresis system (Experion; Bio-Rad, Hercules, CA, USA), yielding a score from 1 (fully degraded) to 10 (intact RNA): the RNA Quality Index. Only samples with a score of 7 or above were considered.

Microarrays hybridisation

Genome-wide mRNA profiles were obtained through the hybridisation of samples onto Affymetrix GeneChip Human Genome U133 Plus 2.0 Arrays (Affymetrix, Santa Clara, CA, USA). RNA amplification, hybridisation and image scanning were performed according to standard Affymetrix protocols.

Mutational status

We assessed the mutational status of BRAF, RET/PTC1, RET/PTC3, NRAS, HRAS and KRAS in the PTC samples. After a DNase treatment with DNaseI Amplification Grade Kit (Invitrogen, Carlsbad, CA, USA), 1 μg of total RNA was used for reverse transcription using hexamers (3.6 μg μl⁻¹; Roche, Basel, Switzerland) and reverse transcriptase (SuperscriptII RNase H ReverseTranscriptase Kit; Invitrogen). Polymerase chain reactions were performed on 2 μl of cDNA using the recombinant Taq DNA Polymerase Kit (Invitrogen) and appropriate primer pairs (primer sequences and PCR conditions provided in a previous study; Saiselet et al, 2012). Polymerase chain reaction products were purified with the QIAquick PCR Purification Kit (Qiagen) according to the manufacturer’s instructions. Sequencing was performed with the BigDye Terminator V3.1 Cycle Sequencing Kit (Applied Biosystems, Foster City, CA, USA) with the sequencer ABI PRISM 3130 (Applied Biosystems) and the genetic analysis program 3130-XI.

Microarray analysis

All analyses and images were performed in R v.3.1.0 (R Core Team, 2013).

CEL files were normalized with fRMA (McCall et al, 2010) v.1.6.0 with default parameters and hgu133plus2frmavecs v.1.1.8 annotation. The arrays used contain 54 613 probe sets (probes) representing 20 027 genes, annotated with R packages annotate v.1.32.3 (Gentleman, 2014) and hgu133plus2.db v2.6.3 (Carlson et al, 2014).

Hierarchical clustering was performed using the function hclust with ‘method=ward’. The distance between samples was defined as 1−|ρ_S|, where ρ_S is Spearman’s correlation coefficient. Principal component analysis was computed with prcomp.

Differential expression was assessed with Rank Product (Breitling et al, 2004), a nonparametric method related to the fold-change method (Hong and Breitling, 2008).

The four normal lymph node (NLN) profiles were used to assess the contaminant fraction in nodal metastasis profiles (LNM) as follows:

where ge(tissue) is the expression profile of a tissue; x is the fraction of NLN contaminant and δ is an unknown signal associated with metastasis and measurement errors. Taking the ratios of expression:

Thus, the fraction of contaminant x can be estimated from the slope between LNM/PTC and NLN/PTC ratios. This method was assessed with a data set of tissue mixtures of known contaminant fraction (Supplementary Figure S3). Estimated and real contaminant fractions were similar and highly correlated (Spearman’s ρ=0.94; P=1.16 × 10⁻⁷⁸). To deconvolute the contaminant signal, for each log₂-transformed LNM-PTC ratio, a loess normalization was performed with regard to their NLN-PTC ratio.

Gene set enrichment analysis was run with 1000 label permutations to derive a P-value and its associated false discovery rate for each gene set (Subramanian et al, 2005).

After balancing the number of N0 and N+ samples, nested cross-validation was performed using support vector machine as the classification method (Chang and Lin, 2011) and top scoring pairs (Leek, 2009) for the feature selection. The data sets were divided randomly into five parts of equal size and equal number of samples from each class. One part was set aside as the test set, whereas the training set consisted of the four remaining parts. The training set was used in the inner fivefold cross-validation to select the best number of probes N ∈ {8, 16, 32, 64}, that is, the number of probes maximising the average accuracy of the predictors.

False discovery rates, a confidence measure that accounts for multiple testing, <0.05 were considered significant for differential expression and <0.25 for gene set analysis, as recommended by Subramanian et al (2005).

This microarray data set is available from the Gene Expression Omnibus (Edgar et al, 2002) under accession number GSE60542.

Results

No strong global transcriptional difference between N0 primary tumours, N+ primary tumours and nodal metastases

We profiled the transcriptomes of 11 primary PTCs with no detectable nodal invasion, 17 primary PTCs with nodal invasion and 17 patient-matched nodal metastases. We also profiled a number of control samples. These included 24 patient-matched non-cancerous thyroid tissues (11 from N0, 13 from N+ patients), and 4 normal lymph nodes, and technical and biological replicates including additional nodal metastasis for 3 patients, adjacent blocks for 5 primary tumours, 4 non-cancerous thyroid tissues and 1 nodal metastasis. The non-cancerous thyroid tissues were taken from the thyroid tissues adjacent to the tumor, and in the contralateral thyroid lobe whenever possible.

To compare the global molecular phenotypes of these samples, we projected them on the two first principal components, representing together 46% of the total variance of the data set (Figure 1). Samples were grouped into three clusters: tumour tissues, non-cancerous thyroid tissues and normal lymph nodes. We eliminated from further analysis five presumably mislabelled ‘tumour’ samples with global phenotypes akin to non-cancerous tissues (Figure 1, red labels). The tumour cluster included N0, N+ primary tumours and nodal metastases demonstrating the absence of a strong global transcriptional difference between these tissues.

The transcriptomes of primary tumours and patient-matched lymph node metastases are highly correlated owing to both patient and tumour common backgrounds

Several publications reported that the global transcriptomes of primary tumours are more related to those of their associated metastasis than to the metastasis of other primary tumours from other patients (Perou et al, 2000; Hao et al, 2004; O’Donnell et al, 2005; Weigelt et al, 2005; Roepman et al, 2006a; Feng et al, 2007; Inamura et al, 2007; Vecchi et al, 2008; Ellsworth et al, 2009). To verify if this applied to our data, we ran a hierarchical clustering on all pairs of nodal metastases and matched primary tumours (Figure 2A). Samples clustered by patient for 11 in 13 pairs, confirming earlier studies.

The present and previous analyses provided little quantitative insight and did not investigate the contribution of patient and tumour backgrounds in determining sample similarities. Nodal samples were significantly more correlated with primary tumours from the same patients than with unrelated primary tumours (Figure 2B). The correlations between matched primary/nodal samples and between primary tumour replicates were not significantly different.

Patient-matched non-cancerous and primary tumour samples were significantly more correlated than unmatched primary and non-cancerous samples. Thus, part of the primary tumour phenotype is related to patient background. The former correlation, however, was significantly lower than the correlations between tumour replicates and between non-cancerous tissue replicates. Patient-matched nodal metastasis and non-cancerous thyroid tissues were not more correlated than equivalent unmatched samples. We concluded that the observed pairing of the patient-matched primary and nodal transcriptomes was owing to both tumour- and patient-specific backgrounds.

Normal lymphoid and non-cancerous thyroid tissues likely confounded the expression differences between primary tumours and nodal metastases

The transcriptomes of primary tumours and nodal metastases were undistinguishable on a global scale, yet defined gene expression signature may set them apart and provide insights into the metastatic process. We performed a supervised search for genes differentially expressed between N+ primary tumours and their matched metastases (details in Material and Methods) – an approach widely used in previous studies (Huang et al, 2003; Kikuchi et al, 2003; Nagata et al, 2003; Tamoto et al, 2004; Croner et al, 2005; O’Donnell et al, 2005; Roepman et al, 2005; Xi et al, 2005; Bidus et al, 2006; Inamura et al, 2007; Méndez et al, 2007, 2011; Nguyen et al, 2007; Pei et al, 2007; Kashiwazaki et al, 2008; Kroon et al, 2008; Wong et al, 2009; Smeets et al, 2011).

We detected 1074 probes more expressed in metastases than in primary tumours (Supplementary Table S3). To gain functional insight, we ran a gene set enrichment analysis (GSEA) (Subramanian et al, 2005) using a gene set database composed of the Gene Ontology biological processes based on categories, signatures of primary tumour vs nodal metastasis from previous studies (Nagata et al, 2003; Hao et al, 2004; Hoang et al, 2005; Roepman et al, 2005; Xi et al, 2005; Bidus et al, 2006; Finak et al, 2006; Wang et al, 2006b; Feng et al, 2007; Inamura et al, 2007; Nguyen et al, 2007; Suzuki and Tarin, 2007; Casey et al, 2009; Ellsworth et al, 2009; Edfeldt et al, 2011) and a signature of normal thyroid differentiation (Tomás et al, 2012). Gene sets from previous studies and gene sets associated with the immune system were upregulated in nodal metastasis (Figure 3A). This signal could have resulted from the trivial fact that we compared tissues with two different backgrounds, that is, thyroid and lymphoid. To address this confounding variable problem, we compared the fold changes of probes obtained when comparing nodal metastases vs primary tumours with the fold changes obtained when comparing normal lymph nodes vs non-cancerous thyroid (Figure 3B). The two variables were highly correlated, ρ_S=0.39 (P<2 × 10⁻¹⁶), demonstrating that the 1074 upregulated probes and the gene sets were not informative about the metastatic process, but were due to the confounding effect of normal lymphoid cells.

A lower number of probes, 27, were downregulated in metastases compared with primary tumours (Supplementary Table S3). Gene set enrichment analysis detected the thyroid differentiation signature as the unique downregulated gene set (Figure 3A). This raised the possibility that expression negatively associated with nodal metastases was confounded by the absence of normal, differentiated, thyroid cells in lymph nodes.

Correcting for contaminant reveals an increased stromal signal in nodal metastases compared with matched primary

The normal lymph node transcriptomes enabled estimating lymphoid cell fraction in the nodal metastasis (Material and Methods and Supplementary Figure S1). It was below the 30% limit as determined by the pathologic examination in all but two samples (Figure 4A). These two samples were precisely those not clustering with their matched primary tumour in Figure 2A, strongly suggesting that these two exceptions to the general clustering pattern were caused by low tumour content.

Expression data were adjusted for lymphoid cell fraction (Materials and Methods and Supplementary Figure S1) and gene set enrichment was investigated as described in the previous section (Figure 4B; see Supplementary Table S3 for the list of differentially expressed genes). Two signatures of nodal metastasis from breast cancer studies (Suzuki and Tarin, 2007; Ellsworth et al, 2009), and one from gastric cancer using laser capture microdissection (Wang et al, 2006b), were upregulated in PTC nodal metastases. Two signatures obtained by comparing stroma with other cell types (Finak et al, 2006; Casey et al, 2009) were also upregulated in the PTC metastases.

Surprisingly, the three cancer-derived signatures were downregulated in nodal metastases in the original publications (Wang et al, 2006b; Suzuki and Tarin, 2007; Ellsworth et al, 2009). To further investigate why the same signatures had an opposite regulation in PTC metastases, we estimated their expression in primary tumour RNA-seq data from The Cancer Genome Atlas (TCGA). All three signatures had a significantly higher median expression in breast than thyroid tumours (Supplementary Figure S2). Thus, the referential defining of ‘up-’ and ‘nodal up-’ and ‘downregulation’ is different in these two types of cancer. An equal expression of stroma-related genes in nodal metastases could lead to opposite senses of differential expression when comparing with related primary tumours. We noted, however, that this tentative explanation could not apply to gastric cancer because it had the lowest stroma-related expression among all TCGA cancers.

Differential expression between N0 and N+ tumours is reproducible in patient-matched non-cancerous tissues, but not in independent PTC data sets

The unsupervised analysis (Figure 1) revealed no global difference between the transcriptomes of N0 and N+ primary tumours. Again, we searched for specific signatures of nodal status with a supervised analysis.

A preliminary analysis determined that nodal status was not confounded with any other recorded clinical variables in our samples (variables listed in Supplementary Table S1). Mutational status was not associated with nodal status: 16 out of 26, that is ∼60% of PTC were BRAF mutated (8 N0 and 8 N+). We detected 129 probes more expressed in N+ primary tumours compared with N0 and 11 less expressed (Supplementary Table S3). Gene set enrichment analysis uncovered a single significant gene set, a signature upregulated in nodal metastases in a previous breast cancer study (Suzuki and Tarin, 2007), but downregulated in N+ samples in our data set. As we will see below, such discrepancy is typical.

The same analysis was performed with the non-cancerous tissues adjacent to the N+ and N0 tumours. Eight hundred and forty-seven probes were detected as upregulated and 8 as downregulated in non-cancerous tissues adjacent to N+ tumours (Supplementary Table S3). Three gene sets reached significance with GSEA: calcium-independent cell–cell adhesion was upregulated; two signatures of upregulated genes in nodal metastases elsewhere (Feng et al, 2007; Suzuki and Tarin, 2007) were downregulated. Hence, the categories associated with nodal infiltration seemed similar in the tumour and matched non-cancerous tissues. To confirm this impression, we estimated with GSEA the expression of the tumour-derived N+ signature in the non-cancerous tissues (Figure 5A and B), and conversely, of the non-cancerous tissue-derived N+ signature in the tumours (Figure 5C and D). Both had a consistent expression in the alternate patient-matched tissue. In other words, there was high similarity in transcriptional signals associated with nodal status in the non-cancerous and tumour tissues. This is a straightforward consequence of the established fact (Figure 2) that transcriptomes of a tumour and its matched non-cancerous tissues were correlated on a global scale.

To validate these results, we considered the 129 and 11 genes sets as signatures and evaluated their expression in three published PTC data sets with nodal status information. These signatures were not consistently regulated. To assess comprehensively the discrepancies between data sets, we selected the 200 top genes ordered by their fold change of expression and associated with nodal status in each one of the three data sets and in our data set. We then tested all these signatures in all data sets (Figure 5C). No signature showed consistent expression across data sets.

Taken together, these results suggest that the signals associated with nodal status are cohort-specific – all the PTC cohorts are relatively small, except TCGA – and are associated with patient rather than tumour background.

Nodal invasion can be predicted from primary tumour transcriptomes for some, but not all cancer types

A number of studies present predictors of nodal status from primary tumour gene expression (Kikuchi et al, 2003; Tamoto et al, 2004; O’Donnell et al, 2005; Roepman et al, 2005; Inamura et al, 2007; Méndez et al, 2007; Pei et al, 2007; Kashiwazaki et al, 2008). Although the previous section ruins any hopes to find a relevant signal in our data, we conducted a comparable analysis to illustrate a widely overlooked statistical point. Using state-of-the-art machine learning methods and a nested cross-validation protocol immune to feature and model selection biases (Material and Methods), nodal status could be predicted with accuracy of 63%.

However, error estimation is variable depending on the specific split of the data into training and validation samples (Ambroise and McLachlan, 2002) – an issue rarely controlled for. We reran the above analysis 100 times on our data, but each time the N0/N+ labels were shuffled randomly among the samples. Twelve per cent of the runs produced accuracy estimates >63% (Figure 6A) in our data set, that is, a 63% accuracy has a high probability to occur by chance. We extended this analysis to other published PTC data sets and obtained similar results (Figure 6A), except for the TCGA thyroid data set.

This problem is more likely with small data sets. With 340 PTC samples, the TCGA made it possible to overcome the small size of our data set. We ran the classification analysis, including the random shuffling control, on the TCGA PTC data set and all other TCGA data sets with nodal status information (Figure 6B). The N0 vs N+ classification accuracy was significant in thyroid, breast and head and neck, but not high (<70%). No evidence for N0 vs N+ classification accuracy better than random was found in 11 other cancers. Note that significant classification accuracies were obtained for large data sets. Thus, data set size and cancer type influenced the classification potential of expression data.

Discussion

Nodal metastases tended to be more similar to their patient-matched primary tumours than to unrelated primary tumours (Figure 2). This trend, also present in previous studies (Perou et al, 2000; Hao et al, 2004; O’Donnell et al, 2005; Weigelt et al, 2005; Roepman et al, 2006a; Feng et al, 2007; Inamura et al, 2007; Vecchi et al, 2008; Ellsworth et al, 2009), has been interpreted in the context of the stepwise and the ab initio models of metastatic progression. In contrast to the stepwise model, the ab initio model posits that metastatic potential is acquired early during tumour expansion and is detectable in bulk samples. It is also compatible with a small or no difference between primary and nodal tumours (Inamura et al, 2007; Ellsworth et al, 2009). However, the evidence provided by the expression profile comparison is limited, as it may depend on the cell types present in the samples (Roepman et al, 2006b). Here, we found that all the mismatched nodal samples were highly contaminated by lymphoid tissues. Another potential confounder is the presence of multiple primary foci (Weigelt et al, 2005). The interpretation of similarity results is further complicated by our findings that primary tumours are more similar to their contralateral non-cancerous thyroid tissues than to unrelated non-cancerous thyroid tissues. Thus, the stronger similarity between primary and nodal tumours is owing to both acquired tumour-specific features and patient-specific genetics and biology.

The global transcriptional profiles of primary PTCs and nodal metastases were very similar (Figure 1). Yet, a supervised analysis revealed hundreds of differentially expressed genes, most of them suggesting an important immune component in the metastases (Figure 3). Again, this result echoes studies conducted in other cancer types (Weigelt et al, 2005; Wang et al, 2006b; Vecchi et al, 2008; Ellsworth et al, 2009; Xie et al, 2010). A number of these studies have addressed or discussed the contamination of nodal samples by lymphoid tissues (Hoang et al, 2005; O’Donnell et al, 2005; Weigelt et al, 2005; Suzuki and Tarin, 2007; Vecchi et al, 2008; Ellsworth et al, 2009; Méndez et al, 2011). We proposed an original approach to the problem. We demonstrated that in our data fold changes were correlated to those obtained from the comparison of normal lymph nodes and non-cancerous thyroids (Figure 3), suggesting an overwhelming confounding effect of lymphoid tissues. Adjusting for lymphoid tissue content revealed the upregulation of five published stroma-related signatures in other tissues (Finak et al, 2006; Wang et al, 2006b; Suzuki and Tarin, 2007; Casey et al, 2009; Ellsworth et al, 2009). Four of five of these signatures were derived from the transcriptional profiles of laser capture microdissected primary and nodal tumours (Wang et al, 2006b; Ellsworth et al, 2009), or epithelium and stroma (Finak et al, 2006; Casey et al, 2009), suggesting that our statistical adjustment removed lymphoid contamination, but not the biologically relevant signal. Three of 9 and 8 of 13 genes were shared between two studies of microdissected lung and breast primary, respectively, with associated nodal tumours (Hoang et al, 2005; Ellsworth et al, 2009) and our unadjusted gene list, which mostly reflects immune infiltration. These overlaps could reflect imperfect microdissection or a small-amplitude immune signal unrelated to contamination. Finally, non-cancerous thyroid cells present in the primary tissue blocks could have produced the spurious downregulation of thyroid differentiation genes (Figure 3). Because the adjustment for the lymphoid signal also removed the thyroid signal (Figure 4B), the latter was probably an indirect consequence of lymphoid contamination: the transcriptomes of bulk tissues contaminated by lymphoid tissues are obviously less thyroid-like.

We compared the expression profiles of primary PTCs, which are associated or not with nodal metastasis. We found a signal associated with nodal invasion, in agreement with studies in other cancer types (Kikuchi et al, 2003; Tamoto et al, 2004; O’Donnell et al, 2005; Roepman et al, 2005; Inamura et al, 2007; Méndez et al, 2007; Pei et al, 2007; Kashiwazaki et al, 2008). To further gauge the significance of these results, we compared the signatures associated with N0/N+ obtained from four thyroid cancer data sets. The signature obtained from one data set was typically not associated with N0/N+ in the three others (Figure 5). Thus, our N0/N+ signature and, presumably, those published in other studies of comparable size, are not general markers of nodal invasion. Anecdotal comparisons of signatures already hinted at this conclusion (Méndez et al, 2007). Intriguingly, the same N0/N+ transcriptional signal was also present in the patient-matched non-cancerous tissues in our data (Figure 5). To the best of our knowledge, no other study presented this control. It suggests that the signature captures patient, rather than tumour-specific characteristics. This could arise from a predisposition to develop nodal metastases. However, the inconsistency of the signals suggests a cohort-specific bias. Only 2 (Takada et al, 2004; Kroon et al, 2008) among more than 20 studies we surveyed tested whether the N0/N+ status was associated with other available clinical variables. We searched for such association in our data, but did not find any. Yet, association with unobserved patient characteristics cannot be ruled out and is not unlikely given the limited sample size that characterizes the studies at hand.

Accuracy of N0 apart from N+ primary tumours classification was 63% in our thyroid cancer data set. However, a random permutation control demonstrated that an equal or higher accuracy might result from chance alone with a probability of 0.12. None of the studies we surveyed presented this control. It is particularly needed for small studies because accuracy estimates have a high variance (Figure 6). Published N0/N+ classification accuracies range from near perfect to near random. Several factors may account for this variation, including the type of cancer investigated, the comprehensiveness of lymph node investigation and technical details of the statistical procedure. Feature selection bias is an endemic flaw in the ‘omic’ literature that results in near-perfect – but grossly inflated – classification accuracies (Ambroise and McLachlan, 2002). It occurs when the marker genes are determined using the entire data set and then when the model is adjusted and ‘validated’ with classical cross-validation. Such approach breaks the statistical principle of separation of training and validation data. Five studies reporting accuracies of 89–100% presented this flaw (Takada et al, 2004; Tamoto et al, 2004; Inamura et al, 2007; Méndez et al, 2007, 2011). Studies without feature selection bias reported less successful classification. Focusing on the classification of N0 vs N+ with more than 10 nodes invaded in breast primary tumours, Huang et al (2003) obtained an accuracy of 90%. However, a less contrasted class definition yielded a modest 62% accuracy (Smeets et al, 2011), in line with our analysis of the TCGA data (57%, P=0/100, N=606; Figure 6) and in line with the fact that nodal status is a strong survival predictor that is independent of gene expression-based prognostic signatures in this disease (Van de Vijver et al, 2002; Van’t Veer et al, 2002; Ramaswamy et al, 2003; Wang et al, 2005; Ivshina et al, 2006). In penile cancer accuracy was 54%, that is, near random (Kroon et al, 2008). It was 62–67% in colon cancer (Croner et al, 2005), but significance could not be reached in TCGA (56%, P=6/100, N=136). Excluding studies with feature selection bias (Takada et al, 2004; Inamura et al, 2007) and a study with only four validation samples (Kikuchi et al, 2003), N0/N+ lung cancers could not be classified (Xi et al, 2005). We did not reach significance with TCGA lung adenocarcinomas (54%, P=6/100, N=230) and small-cell carcinoma (56%, P=11/100, N=184). In oral cancer, Roepman et al (2005) reported a 77% correct classification and Nguyen et al (2007) reported a 92% correct classification using microdissected samples. The Cancer Genome Atlas head and neck cancer samples could be classified but with lower accuracy (62%, P=0/100, N=198). Overall, our TCGA analyses show that N0 and N+ tumours cannot be classified for most cancers: accuracies are typically in the range 50–60% and not significantly better than permutation controls. Papillary thyroid carcinomas are among the most amenable to N0/N+ classification, but accuracy is still limited (67%, P=0/100, N=340).

We have presented new biological and statistical controls that expose some of the limits of published studies of the relationship between primary tumours and nodal metastases. The potential clinical relevance of gene expression to predict nodal metastases seems limited for most cancers.

Accession codes

Accessions

Gene Expression Omnibus

GSE60542

Change history

12 May 2015
This paper was modified 12 months after initial publication to switch to Creative Commons licence terms, as noted at publication

References

Ambroise C, McLachlan GJ (2002) Selection bias in gene extraction on the basis of microarray gene-expression data. Proc Natl Acad Sci USA 99: 6562–6566.
Article CAS Google Scholar
American Thyroid Association (ATA) Guidelines Taskforce on Thyroid Nodules and Differentiated Thyroid Cancer, Cooper DS, Doherty GM, Haugen BR, Hauger BR, Kloos RT, Lee SL, Mandel SJ, Mazzaferri EL, McIver B, Pacini F, Schlumberger M, Sherman SI, Steward DL, Tuttle RM (2009) Revised American Thyroid Association management guidelines for patients with thyroid nodules and differentiated thyroid cancer. Thyroid 19: 1167–1214.
Article Google Scholar
Bidus MA, Risinger JI, Chandramouli GVR, Dainty LA, Litzi TJ, Berchuck A, Barrett JC, Maxwell GL (2006) Prediction of lymph node metastasis in patients with endometrioid endometrial cancer using expression microarray. Clin Cancer Res 12: 83–88.
Article CAS Google Scholar
Breitling R, Armengaud P, Amtmann A, Herzyk P (2004) Rank products: a simple, yet powerful, new method to detect differentially regulated genes in replicated microarray experiments. FEBS Lett 573: 83–92.
Article CAS Google Scholar
Carlson M, Falcon S, Pages H, Li N (2014) hgu133plus2.db: Affymetrix Human Genome U133 Plus 2.0 Array annotation data (chip hgu133plus2), v2.14.0, http://www.bioconductor.org/packages/release/data/annotation/html/hgu133plus2.db.html.
Casey T, Bond J, Tighe S, Hunter T, Lintault L, Patel O, Eneman J, Crocker A, White J, Tessitore J, Stanley M, Harlow S, Weaver D, Muss H, Plaut K (2009) Molecular signatures suggest a major role for stromal cells in development of invasive breast cancer. Breast Cancer Res Treat 114: 47–62.
Article CAS Google Scholar
Chang C-C, Lin C-J (2011) LIBSVM: a library for support vector machines. ACM Trans Intell Syst Technol 2: 27 1–27:27.
Article Google Scholar
Croner RS, Peters A, Brueckl WM, Matzel KE, Klein-Hitpass L, Brabletz T, Papadopoulos T, Hohenberger W, Reingruber B, Lausen B (2005) Microarray versus conventional prediction of lymph node metastasis in colorectal carcinoma. Cancer 104: 395–404.
Article CAS Google Scholar
Delys L, Detours V, Franc B, Thomas G, Bogdanova T, Tronko M, Libert F, Dumont JE, Maenhaut C (2007) Gene expression and the biological phenotype of papillary thyroid carcinomas. Oncogene 26: 7894–7903.
Article CAS Google Scholar
Detours V, Delys L, Libert F, Weiss Solís D, Bogdanova T, Dumont JE, Franc B, Thomas G, Maenhaut C (2007) Genome-wide gene expression profiling suggests distinct radiation susceptibilities in sporadic and post-Chernobyl papillary thyroid cancers. Br J Cancer 97: 818–825.
Article CAS Google Scholar
Dom G, Tarabichi M, Unger K, Thomas G, Oczko-Wojciechowska M, Bogdanova T, Jarzab B, Dumont JE, Detours V, Maenhaut C (2012) A gene expression signature distinguishes normal tissues of sporadic and radiation-induced papillary thyroid carcinomas. Br J Cancer 107: 994–1000.
Article CAS Google Scholar
Edfeldt K, Björklund P, Åkerström G, Westin G, Hellman P, Stålberg P (2011) Different gene expression profiles in metastasizing midgut carcinoid tumors. Endocr Relat Cancer 18: 479–489.
Article CAS Google Scholar
Edgar R, Domrachev M, Lash AE (2002) Gene Expression Omnibus: NCBI gene expression and hybridization array data repository. Nucl Acids Res 30: 207–210.
Article CAS Google Scholar
Ellsworth RE, Seebach J, Field LA, Heckman C, Kane J, Hooke JA, Love B, Shriver CD (2009) A gene expression signature that defines breast cancer metastases. Clin Exp Metast 26: 205–213.
Article CAS Google Scholar
Feng Y, Sun B, Li X, Zhang L, Niu Y, Xiao C, Ning L, Fang Z, Wang Y, Zhang L, Cheng J, Zhang W, Hao X (2007) Differentially expressed genes between primary cancer and paired lymph node metastases predict clinical outcome of node-positive breast cancer patients. Breast Cancer Res Treat 103: 319–329.
Article CAS Google Scholar
Fidler IJ, Kripke ML (1977) Metastasis results from preexisting variant cells within a malignant tumor. Science 197: 893–895.
Article CAS Google Scholar
Finak G, Sadekova S, Pepin F, Hallett M, Meterissian S, Halwani F, Khetani K, Souleimanova M, Zabolotny B, Omeroglu A, Park M (2006) Gene expression signatures of morphologically normal breast tissue identify basal-like tumors. Breast Cancer Res 8: R58.
Article Google Scholar
Gentleman R (2014) annotate: Annotation for microarrays, v1.42.1, http://www.bioconductor.org/packages/release/bioc/html/annotate.html.
Hao X, Sun B, Hu L, Lähdesmäki H, Dunmire V, Feng Y, Zhang S-W, Wang H, Wu C, Wang H, Fuller GN, Symmans WF, Shmulevich I, Zhang W (2004) Differential gene and protein expression in primary breast malignancies and their lymph node metastases as revealed by combined cDNA microarray and tissue microarray analysis. Cancer 100: 1110–1122.
Article CAS Google Scholar
Hoang CD, Guillaume TJ, Engel SC, Tawfic SH, Kratzke RA, Maddaus MA (2005) Analysis of paired primary lung and lymph node tumor cells: a model of metastatic potential by multiple genetic programs. Cancer Detect Prev 29: 509–517.
Article CAS Google Scholar
Hong F, Breitling R (2008) A comparison of meta-analysis methods for detecting differentially expressed genes in microarray experiments. Bioinformatics 24: 374–382.
Article CAS Google Scholar
Huang E, Cheng SH, Dressman H, Pittman J, Tsou MH, Horng CF, Bild A, Iversen ES, Liao M, Chen CM, West M, Nevins JR, Huang AT (2003) Gene expression predictors of breast cancer outcomes. Lancet 361: 1590–1596.
Article CAS Google Scholar
Inamura K, Shimoji T, Ninomiya H, Hiramatsu M, Okui M, Satoh Y, Okumura S, Nakagawa K, Noda T, Fukayama M, Ishikawa Y (2007) A metastatic signature in entire lung adenocarcinomas irrespective of morphological heterogeneity. Hum Pathol 38: 702–709.
Article Google Scholar
Ivshina AV, George J, Senko O, Mow B, Putti TC, Smeds J, Lindahl T, Pawitan Y, Hall P, Nordgren H, Wong JEL, Liu ET, Bergh J, Kuznetsov VA, Miller LD (2006) Genetic reclassification of histologic grade delineates new clinical subtypes of breast cancer. Cancer Res 66: 10292–10301.
Article CAS Google Scholar
Kashiwazaki H, Hassan NMM, Hamada J-I, Moriuchi T, Yamazaki Y, Tei K, Totsuka Y, Inoue N (2008) Gene expression profile changes correlated with lymph node metastasis in oral squamous cell carcinoma. Odontology 96: 38–43.
Article CAS Google Scholar
Kikuchi T, Daigo Y, Katagiri T, Tsunoda T, Okada K, Kakiuchi S, Zembutsu H, Furukawa Y, Kawamura M, Kobayashi K, Imai K, Nakamura Y (2003) Expression profiles of non-small cell lung cancers on cDNA microarrays: identification of genes for prediction of lymph-node metastasis and sensitivity to anti-cancer drugs. Oncogene 22: 2192–2205.
Article CAS Google Scholar
Kroon BK, Leijte JAP, van Boven H, Wessels LFA, Velds A, Horenblas S, van’t Veer LJ (2008) Microarray gene-expression profiling to predict lymph node metastasis in penile carcinoma. BJU Int 102: 510–515.
Article CAS Google Scholar
Leek JT (2009) The tspair package for finding top scoring pair classifiers in R. Bioinformatics 25: 1203–1204.
Article CAS Google Scholar
Lundgren CI, Hall P, Dickman PW, Zedenius J (2006) Clinically significant prognostic factors for differentiated thyroid carcinoma: a population-based, nested case-control study. Cancer 106: 524–531.
Article Google Scholar
McCall MN, Bolstad BM, Irizarry RA (2010) Frozen robust multiarray analysis (fRMA). Biostatistics 11: 242–253.
Article Google Scholar
Méndez E, Fan W, Choi P, Agoff SN, Whipple M, Farwell DG, Futran ND, Weymuller EA Jr, Zhao L-P, Chen C (2007) Tumor-specific genetic expression profile of metastatic oral squamous cell carcinoma. Head Neck 29: 803–814.
Article Google Scholar
Méndez E, Lohavanichbutr P, Fan W, Houck JR, Rue TC, Doody DR, Futran ND, Upton MP, Yueh B, Zhao LP, Schwartz SM, Chen C (2011) Can a metastatic gene expression profile outperform tumor size as a predictor of occult lymph node metastasis in oral cancer patients? Clin Cancer Res 17: 2466–2473.
Article Google Scholar
Nagata M, Fujita H, Ida H, Hoshina H, Inoue T, Seki Y, Ohnishi M, Ohyama T, Shingaki S, Kaji M, Saku T, Takagi R (2003) Identification of potential biomarkers of lymph node metastasis in oral squamous cell carcinoma by cDNA microarray analysis. Int J Cancer 106: 683–689.
Article CAS Google Scholar
Nguyen ST, Hasegawa S, Tsuda H, Tomioka H, Ushijima M, Noda M, Omura K, Miki Y (2007) Identification of a predictive gene expression signature of cervical lymph node metastasis in oral squamous cell carcinoma. Cancer Sci 98: 740–746.
Article CAS Google Scholar
Nixon IJ, Shaha AR (2013) Management of regional nodes in thyroid cancer. Oral Oncol 49: 671–675.
Article Google Scholar
O’Donnell RK, Kupferman M, Wei SJ, Singhal S, Weber R, O’Malley B, Cheng Y, Putt M, Feldman M, Ziober B, Muschel RJ (2005) Gene expression signature predicts lymphatic metastasis in squamous cell carcinoma of the oral cavity. Oncogene 24: 1244–1251.
Article Google Scholar
Pacini F, Schlumberger M, Dralle H, Elisei R, Smit JWA, Wiersinga W European Thyroid Cancer Taskforce (2006) European consensus for the management of patients with differentiated thyroid carcinoma of the follicular epithelium. Eur J Endocrinol 154: 787–803.
Article CAS Google Scholar
Pei H, Zhu H, Zeng S, Li Y, Yang H, Shen L, Chen J, Zeng L, Fan J, Li X, Gong Y, Shen H (2007) Proteome Analysis and Tissue Microarray for Profiling Protein Markers Associated with Lymph Node Metastasis in Colorectal Cancer. J Proteome Res 6: 2495–2501.
Article CAS Google Scholar
Perou CM, Sørlie T, Eisen MB, van de Rijn M, Jeffrey SS, Rees CA, Pollack JR, Ross DT, Johnsen H, Akslen LA, Fluge O, Pergamenschikov A, Williams C, Zhu SX, Lønning PE, Børresen-Dale AL, Brown PO, Botstein D (2000) Molecular portraits of human breast tumours. Nature 406: 747–752.
Article CAS Google Scholar
Ramaswamy S, Ross KN, Lander ES, Golub TR (2003) A molecular signature of metastasis in primary solid tumors. Nat Genet 33: 49–54.
Article CAS Google Scholar
R Core Team (2013) R: A Language and Environment for Statistical Computing.
Roepman P, de Jager A, Koerkamp MJAG, Kummer JA, Slootweg PJ, Holstege FCP (2006a) Maintenance of Head and Neck Tumor Gene Expression Profiles upon Lymph Node Metastasis. Cancer Res 66: 11110–11114.
Article CAS Google Scholar
Roepman P, de Koning E, van Leenen D, de Weger RA, Kummer JA, Slootweg PJ, Holstege FCP (2006b) Dissection of a metastatic gene expression signature into distinct components. Genome Biol 7: R117.
Article Google Scholar
Roepman P, Wessels LFA, Kettelarij N, Kemmeren P, Miles AJ, Lijnzaad P, Tilanus MGJ, Koole R, Hordijk G-J, Vliet PC, van der, Reinders MJT, Slootweg PJ, Holstege FCP (2005) An expression profile for diagnosis of lymph node metastases from primary head and neck squamous cell carcinomas. Nat Genet 37: 182–186.
Article CAS Google Scholar
Saiselet M, Floor S, Tarabichi M, Dom G, Hébrant A, van Staveren WCG, Maenhaut C (2012) Thyroid cancer cell lines: an overview. Front Endocrinol (Lausanne) 3: 133.
Article CAS Google Scholar
Smeets A, Daemen A, Vanden Bempt I, Gevaert O, Claes B, Wildiers H, Drijkoningen R, Van Hummelen P, Lambrechts D, De Moor B, Neven P, Sotiriou C, Vandorpe T, Paridaens R, Christiaens MR (2011) Prediction of lymph node involvement in breast cancer from primary tumor tissue using gene expression profiling and miRNAs. Breast Cancer Res Treat 129: 767–776.
Article CAS Google Scholar
Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, Paulovich A, Pomeroy SL, Golub TR, Lander ES, Mesirov JP (2005) Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci USA 102: 15545–15550.
Article CAS Google Scholar
Suzuki M, Tarin D (2007) Gene expression profiling of human lymph node metastases and matched primary breast carcinomas: clinical implications. Mol Oncol 1: 172–180.
Article Google Scholar
Takada M, Tada M, Tamoto E, Kawakami A, Murakawa K, Shindoh G, Teramoto K, Matsunaga A, Komuro K, Kanai M, Fujiwara Y, Shirata K, Nishimura N, Miyamoto M, Okushiba S, Kondo S, Hamada J, Katoh H, Yoshiki T, Moriuchi T (2004) Prediction of lymph node metastasis by analysis of gene expression profiles in non-small cell lung cancer1. J Surg Res 122: 61–69.
Article CAS Google Scholar
Tamoto E, Tada M, Murakawa K, Takada M, Shindo G, Teramoto K, Matsunaga A, Komuro K, Kanai M, Kawakami A, Fujiwara Y, Kobayashi N, Shirata K, Nishimura N, Okushiba S, Kondo S, Hamada J, Yoshiki T, Moriuchi T, Katoh H (2004) Gene-expression profile changes correlated with tumor progression and lymph node metastasis in esophageal cancer. Clin Cancer Res 10: 3629–3638.
Article CAS Google Scholar
Tomás G, Tarabichi M, Gacquer D, Hébrant A, Dom G, Dumont JE, Keutgen X, Fahey TJ 3rd, Maenhaut C, Detours V (2012) A general method to derive robust organ-specific gene expression-based differentiation indices: application to thyroid cancer diagnostic. Oncogene 31: 4490–4498.
Article Google Scholar
Vecchi M, Confalonieri S, Nuciforo P, Viganò MA, Capra M, Bianchi M, Nicosia D, Bianchi F, Galimberti V, Viale G, Palermo G, Riccardi A, Campanini R, Daidone MG, Pierotti MA, Pece S, Di Fiore PP (2008) Breast cancer metastases are molecularly distinct from their primary tumors. Oncogene 27: 2148–2158.
Article CAS Google Scholar
Van’t Veer LJ, Dai H, van de Vijver MJ, He YD, Hart AAM, Mao M, Peterse HL, van der Kooy K, Marton MJ, Witteveen AT, Schreiber GJ, Kerkhoven RM, Roberts C, Linsley PS, Bernards R, Friend SH (2002) Gene expression profiling predicts clinical outcome of breast cancer. Nature 415: 530–536.
Article Google Scholar
Venkat R, Guerrero MA (2013) Recent advances in the surgical treatment of differentiated thyroid cancer: a comprehensive review. Scientific World J 2013: 425136.
Article Google Scholar
Van de Vijver MJ, He YD, van ’t Veer LJ, Dai H, Hart AAM, Voskuil DW, Schreiber GJ, Peterse JL, Roberts C, Marton MJ, Parrish M, Atsma D, Witteveen A, Glas A, Delahaye L, van der Velde T, Bartelink H, Rodenhuis S, Rutgers ET, Friend SH, Bernards R (2002) A gene-expression signature as a predictor of survival in breast cancer. N Engl J Med 347: 1999–2009.
Article CAS Google Scholar
Wang L, Zhu J-S, Song M-Q, Chen G-Q, Chen J-L (2006a) Comparison of gene expression profiles between primary tumor and metastatic lesions in gastric cancer patients using laser microdissection and cDNA microarray. World J Gastroenterol 12: 6949–6954.
Article CAS Google Scholar
Wang L, Zhu J-S, Song M-Q, Chen G-Q, Chen J-L (2006b) Comparison of gene expression profiles between primary tumor and metastatic lesions in gastric cancer patients using laser microdissection and cDNA microarray. World J Gastroenterol 12: 6949–6954.
Article CAS Google Scholar
Wang Y, Klijn JGM, Zhang Y, Sieuwerts AM, Look MP, Yang F, Talantov D, Timmermans M, Meijer-van Gelder ME, Yu J, Jatkoe T, Berns EMJJ, Atkins D, Foekens JA (2005) Gene-expression profiles to predict distant metastasis of lymph-node-negative primary breast cancer. Lancet 365: 671–679.
Article CAS Google Scholar
Weigelt B, Wessels LFA, Bosma AJ, Glas AM, Nuyten DSA, He YD, Dai H, Peterse JL, van’t Veer LJ (2005) No common denominator for breast cancer lymph node metastasis. Br J Cancer 93: 924–932.
Article CAS Google Scholar
Wong F-H, Huang C-YF, Su L-J, Wu Y-C, Lin Y-S, Hsia J-Y, Tsai H-T, Lee S-A, Lin C-H, Tzeng C-H, Chen P-M, Chen Y-J, Liang S-C, Lai J-M, Yen C-C (2009) Combination of microarray profiling and protein–protein interaction databases delineates the minimal discriminators as a metastasis network for esophageal squamous cell carcinoma. Int J Oncol 34: 117–128.
CAS PubMed Google Scholar
Xie HL, Li ZY, Gan RL, Li XJ, Zhang QL, Hui M, Zhou XT (2010) Differential gene and protein expression in primary gastric carcinomas and their lymph node metastases as revealed by combined cDNA microarray and tissue microarray analysis. J Dig Dis 11: 167–175.
Article CAS Google Scholar
Xi L, Lyons-Weiler J, Coello MC, Huang X, Gooding WE, Luketich JD, Godfrey TE (2005) Prediction of lymph node metastasis by analysis of gene expression profiles in primary lung adenocarcinomas. Clin Cancer Res 11: 4128–4135.
Article CAS Google Scholar

Download references

Acknowledgements

We thank Françoise Lallemand, Sandy Haussy, Samira Majjaj for their contribution, Chantal Degraef for her technical assistance, Andrea Karambelas for final proofreading and Jacques E Dumont for his advice. VD was funded by FNRS Grants 14657170 and 19556102. MT and MS were supported by FRIA/FNRS grant. Other supports include Welbio, Fondation Contre le Cancer, Fonds de la Recherche Scientifique Médicale (FRSM), Fonds National de la Recherche Scientifique (FNRS), Plan Cancer (action 29), Télévie, Les Amis de l’Institut Bordet and Fondation Van Buuren.

Author contributions

CM conceived the experiments; CM, MS, CT, CH, DL and GA collected the data and carried out experiments; MT analysed the data and generated the figures; MT, CM and VD interpreted the results; MT and VD wrote the manuscript.

Author information

C Maenhaut and V Detours: These authors contributed equally to this work.

Authors and Affiliations

IRIBHM, Université Libre de Bruxelles (ULB), Route de Lennik, Brussels, 808-1070, Belgium
M Tarabichi, M Saiselet, C Maenhaut & V Detours
Department of General and Digestive Surgery and Pathology, Pitié-Salpêtrière Hospital, Université Pierre and Marie Curie, 47-83, boulevard de l'Hôpital, Paris, France,
C Trésallet & C Hoang
Department of Pathology, Jules Bordet Institute, Université Libre de Bruxelles (ULB), Boulevard de Waterloo, Brussels, 125-1000, Belgium
D Larsimont
Thoracic Surgery, Jules Bordet Institute, Université Libre de Bruxelles (ULB), Boulevard de Waterloo, Brussels, 125-1000, Belgium
G Andry
WELBIO, Route de Lennik, Brussels, 808-1070, Belgium
C Maenhaut

Authors

M Tarabichi
View author publications
You can also search for this author in PubMed Google Scholar
M Saiselet
View author publications
You can also search for this author in PubMed Google Scholar
C Trésallet
View author publications
You can also search for this author in PubMed Google Scholar
C Hoang
View author publications
You can also search for this author in PubMed Google Scholar
D Larsimont
View author publications
You can also search for this author in PubMed Google Scholar
G Andry
View author publications
You can also search for this author in PubMed Google Scholar
C Maenhaut
View author publications
You can also search for this author in PubMed Google Scholar
V Detours
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to C Maenhaut or V Detours.

Ethics declarations

Competing interests

The authors declare no conflict of interest.

Additional information

This work is published under the standard license to publish agreement. After 12 months the work will become freely available and the license terms will switch to a Creative Commons Attribution-NonCommercial-Share Alike 4.0 Unported License.

Supplementary Information accompanies this paper on British Journal of Cancer website

Supplementary information

Supplementary Legneds (DOC 27 kb)

Supplementary Figure 1 (JPG 193 kb)

Supplementary Figure 2 (JPG 134 kb)

Supplementary Figure 3 (JPG 86 kb)

Supplementary Table 1 (XLS 41 kb)

Supplementary Table 2 (XLS 28 kb)

Supplementary Table 3 (XLS 494 kb)

Rights and permissions

From twelve months after its original publication, this work is licensed under the Creative Commons Attribution-NonCommercial-Share Alike 4.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-sa/4.0/

Reprints and permissions

About this article

Cite this article

Tarabichi, M., Saiselet, M., Trésallet, C. et al. Revisiting the transcriptional analysis of primary tumours and associated nodal metastases with enhanced biological and statistical controls: application to thyroid cancer. Br J Cancer 112, 1665–1674 (2015). https://doi.org/10.1038/bjc.2014.665

Download citation

Received: 23 September 2014
Revised: 12 November 2014
Accepted: 18 November 2014
Published: 12 May 2015
Issue Date: 12 May 2015
DOI: https://doi.org/10.1038/bjc.2014.665

Keywords

This article is cited by

Adding pieces to the puzzle of differentiated-to-anaplastic thyroid cancer evolution: the oncogene E2F7
- Mila Gugnoni
- Eugenia Lorenzini
- Gloria Manzotti
Cell Death & Disease (2023)
Upregulation of dendrocyte-expressed seven transmembrane protein is associated with unfavorable outcomes in differentiated thyroid cancer
- Jie-Jen Lee
- Yi-Chiung Hsu
- Shih-Ping Cheng
Endocrine (2023)
LncRNA H19 is a potential biomarker and correlated with immune infiltration in thyroid carcinoma
- Yunus Sahin
Clinical and Experimental Medicine (2022)
The molecular and gene/miRNA expression profiles of radioiodine resistant papillary thyroid cancer
- Carla Colombo
- Emanuela Minna
- Maria Grazia Borrello
Journal of Experimental & Clinical Cancer Research (2020)
Identification of hub genes in papillary thyroid carcinoma: robust rank aggregation and weighted gene co-expression network analysis
- Yang Liu
- Ting-Yu Chen
- Chao Zhang
Journal of Translational Medicine (2020)