Cancers are heterogeneous diseases. Substantial inter-tumor heterogeneity of many cancers has been well documented [1]. Heterogeneity also exists within individual tumors, as a single tumor can display multiple different molecular features within it, a phenomenon termed intra-tumor heterogeneity. Genomic intra-tumor heterogeneity has been reported in both hematologic and solid tumors and may be associated with treatment failure and emergence of drug resistance [2]. Gerlinger and colleagues [3, 4] demonstrated profound genomic intra-tumor heterogeneity in renal cell carcinomas using a multiregion whole-exome sequencing approach. Using similar approach, our group [5] and de Bruin et al. [6] have independently shown clear evidence of genomic intra-tumor heterogeneity of non-small cell lung cancers, but to a much less extent compared to that in renal cell carcinomas, suggesting different cancers may have different genomic intra-tumor heterogeneity architecture.

RNA and protein expression intra-tumor heterogeneity has also been studied, but only at single gene and limited gene panel levels. For example, spatial heterogeneity of estrogen receptor and human epidermal growth factor receptor 2 (HER-2) protein was reported in breast cancers [7, 8] and HER-2 protein expression was found to be discordant between different regions within the same gastric cancers [9]. Gyanchandani and colleagues [10] analyzed five RNA expression-based prognostic panels on 181 tumor samples from 71 estrogen receptor-positive breast cancers and revealed discordant risk of recurrence by using different regions from the same tumors, suggesting that a single biopsy could either under- or over-estimate recurrence risks of breast cancer due to the intra-tumor heterogeneity. To the best of our knowledge, comprehensive gene expression intra-tumor heterogeneity is not well defined in any cancer types. Here, we present our analyses of gene expression profiles of 35 spatially separated regions from 10 patients with resected non-small cell lung cancer.

Materials and methods

Multiregion sampling and clinical information

We collected 35 tumor samples from 10 patients with non-small cell lung cancer (three or four regions/tumor) including lung adenocarcinoma (n = 6), squamous cell carcinoma (n = 2), large-cell carcinoma (n = 1), and pleomorphic carcinoma (n = 1). Multiregion sampling was done as illustrated in Supplementary Figure 1. Detailed clinical information for every patient, such as tumor size, stage, prognosis, and treatment history, can be found in Supplementary Table 1.

Data preprocessing and normalization

The gene expression data were generated by using the Affymetrix GeneChip® Human Gene 1.0 ST arrays. The platform contains 33,252 transcript clusters (corresponding to 20,009 unique genes). The R package, aroma.affymetrix, was used to process the expression measurements. Briefly, the CEL formatted files were loaded first, and then the robust multiarray analysis method was used for data processing and background adjustment. The background-corrected measurements were then logarithm transformed (base 2) and quantile normalization was applied.

Gene annotation to transcript cluster

Once the data were processed, we updated the annotation based on the most recent annotation file (HuGene-1_0-st-v1.na36.hg19.transcript.csv, 7/6/16) obtained from the NetAffx™ Analysis Center of Affymetrix, and annotated transcript clusters with NCBI RefSeq genes. For a cluster with multiple unique gene symbols, we aligned the sequence of the cluster with each of the candidate genes using bl2seq (BLAST 2 Sequences) in BLAST package (v2.2.12). A gene with the highest bit-score was considered as the gene for the cluster. If multiple genes show the same best score for a cluster, we assigned them all to the cluster.

Unique gene-level expression profiling

If a gene is assigned with only one transcript cluster, the cluster represents the expression level of the gene. If multiple transcript clusters point to the same gene, we first calculated Pearson correlation coefficient between them and took the average if they are correlated (r > 0.7). Otherwise, individual sequences from the clusters were compared with the gene sequence with bl2seq, and the best matching cluster was selected to represent expression level for the gene. If there are multiple transcript clusters reaching the same best score, we selected one with the highest average expression level across all samples.

Molecular subtyping and epithelial–mesenchymal transition score calculation

To calculate molecular subtype scores, we obtained subtype predictor centroids for lung adenocarcinoma and squamous cell carcinoma [11, 12]. For lung adenocarcinoma, we renamed subtypes as proposed by The Cancer Genome Atlas [13]: bronchioid to terminal respiratory unit, magnoid to proximal proliferative, and squamoid to proximal inflammatory. Molecular subtype scores were calculated with the method previously used [14]. Briefly, we used the following procedure: (i) extracting expression values of the genes overlapped with the gene lists of subtype centroids, (ii) mean centering within the specific histological group (adenocarcinoma or squamous), (iii) averaging the expression values if there are multiple probes indicating the same gene, (iv) calculating Spearman rank correlation between values of a given sample and centroids, and (v) selecting the subtype showing the highest correlation coefficient with p value <0.01 as a predicted molecular subtype. We did not calculate subtype scores for large-cell and pleomorphic carcinoma samples.

For epithelial–mesenchymal transition score calculation, we used the 76-gene epithelial–mesenchymal transition signature panel [15] with the average-based method introduced previously [16]. First, expression values of each gene were standardized by the mean and standard deviation. For each sample, an epithelial–mesenchymal transition score was calculated as the average expression values for mesenchymal genes minus the average expression values for epithelial genes.

Prognostic gene signature analysis

We queried published studies on mRNA microarray-based prognostic signatures for non-small cell lung cancer [17] and selected two frequently cited prognostic signatures for further analyses [18, 19]. The first signature consisting of six genes [18] was originally developed using RT-PCR and validated on four independent public microarray data sets. For each of the six genes, expression values were subtracted by the median of the gene. We skipped housekeeping gene normalization, which was done for the RT-PCR data in the original paper. Euclidean distances to the training cluster medians (high- or low-risk cluster provided by the original paper) were used to predict prognostic subgroups. The prediction was performed only when the ratio of two distances to the training cluster medians is less than 0.9. The second signature consisting of 15 genes [19] uses supervised principal component analysis to calculate risk scores. For each of the 15 genes, expression values were transformed to Z-scores by centering to the mean and scaling to the standard deviation of the gene. In the original paper, the risk scores of four independent published microarray data sets for validation were dichotomized at −0.1, the median risk score determined in the training data set. With the same principal component analysis rotation matrix and Cox regression coefficients, we derived risk scores for our samples and classified them based on the same cutoff −0.1.

Calculation of scores for innate anti-PD-1 resistance content signatures

We used the gene set variation analysis program with default settings to calculate the enrichment scores for the 22 innate anti-PD-1 resistance content signatures reported by the previous publication [20]. The definition files of these signatures were downloaded either from the Molecular Signature Database (MSigDB) [21] or the supplementary file of the publication. For each signature, the enrichment scores across all samples were converted to Z-scores. To compute the innate anti-PD-1 resistance “co-enrichment” score for a given sample, we averaged Z-scores for the sample across the 22 innate anti-PD-1 resistance content signatures.

Impact of tumor microenvironment on gene expression heterogeneity

Using a gene expression-based algorithm ESTIMATE [22], we calculated tumor microenvironment scores (stromal score, immune score, and corresponding tumor purity) for each sample. Then, we correlated them with epithelial–mesenchymal transition, innate anti-PD-1 resistance, and lung adenocarcinoma subtype scores. Overall gene expression profiles were represented by the first principal components and correlated with the tumor microenvironment scores. Pearson correlation coefficient was used to measure the degree of correlation.

Results

Gene expression heterogeneity between and within non-small cell lung cancers

To determine the variation of gene expression patterns between different regions of the individual tumors and across the different tumors, we performed unsupervised hierarchical clustering. When comparing all samples across the different patients, different tumor regions from the same patients clustered together for all patients except two tumor samples Pa3.R2 and Pa5.R1, which clustered with tumor samples from other patients (Fig. 1a). The principal component analysis also showed a similar pattern with the two tumor samples mixing with tumors from different patients (intraclass correlation coefficient of the first principal component: 0.96, 95% CI (0.9, 0.98), Fig. 1b). These results suggest marked inter-individual gene expression heterogeneity and to a less extent intra-tumor heterogeneity. However, certain tumors may have high level of gene expression intra-tumor heterogeneity that is comparable to the heterogeneity between different tumors.

Fig. 1
figure 1

a Heat map generated by unsupervised hierarchical clustering with all transcript clusters (n = 21,052). The 1-correlation distance and ward.D2 linkage were used for clustering. The rows and columns represent transcript clusters and tumor samples, respectively. Samples from the same patients were illustrated in the same colors below the column dendrogram. b Principal component analysis plot with the two dimensions (PC1 and PC2) along which the data are the largest spread. Samples from the same patients were shown in the same colors. Two samples (Pa3.R2 and Pa5.R1), which clustered with tumor samples from other patients, are indicated

Varied molecular subtypes within the same tumors

Molecular subtyping has become a critical component of precision oncology and gene expression profiling has been used for molecular subtyping in many types of cancers including breast cancer [23], glioblastoma [24], colorectal cancer [25], and lung cancer [11,12,13, 26]. Lung adenocarcinomas can be classified into terminal respiratory unit, proximal proliferative, and the proximal inflammatory subtypes [13], while lung squamous cell carcinomas can be classified into classical, basal, secretory, and primitive subtypes [11]. In the previous studies, the molecular subtyping has been based on a single tumor sample. Since we observed evidence of gene expression intra-tumor heterogeneity, we next investigated whether the molecular subtype is concordant between different tumor regions within the same tumors. As shown in Fig. 2 and Supplementary Table 2, heterogeneous molecular subtype scores were observed between different tumor regions within the same tumors. For example, Pa1.R1 had the highest score for proximal inflammatory subtype, while the other two tumor regions of Pa1 demonstrated the highest scores for proximal proliferative subtype. In Pa3, terminal respiratory unit is the predicted subtype in Pa3.R2 while all three other samples from Pa3 are predicted to be proximal proliferative subtype.

Fig. 2
figure 2

Heat maps for adenocarcinoma subtype scores (top heat map) and for squamous cell carcinoma subtype scores (bottom heat map). The predicted molecular subtypes are shown with corresponding abbreviations (a “dot” represents no statistically significant molecular subtype)

Varied prognostic gene signatures within the same tumors

Numerous gene expression-based profiles of non-small cell lung cancer have been generated over the past 20 years and many diagnostic and prognostic gene signatures have been proposed. Given the substantial gene expression intra-tumor heterogeneity observed in our cohort, we next investigated whether different tumor regions would show different gene signatures that are associated with prognosis of lung cancer patients.

The first signature we looked at was epithelial–mesenchymal transition. Epithelial–mesenchymal transition is a biological process, by which epithelial cells lose cell adhesion molecules and become more like mesenchymal cells with high migratory and invasive capabilities. Epithelial–mesenchymal transition was reported to be a potential prognostic and predictive marker in patients with lung cancer [27,28,29,30]. Using our previously developed 76-gene epithelial–mesenchymal transition signature panel [15], we computed an epithelial–mesenchymal transition score for each sample (Fig. 3a and Supplementary Table 3). Consistent with previous studies [31, 32], different tumors demonstrated different epithelial–mesenchymal transition scores and the poorly differentiated large-cell carcinoma, pleomorphic carcinoma, and squamous cell carcinomas showed higher (i.e., more mesenchymal) scores than adenocarcinomas (p value: 0.003). Different regions of the same tumors demonstrated similar epithelial–mesenchymal transition scores for the most part (intraclass correlation coefficient of epithelial–mesenchymal transition score: 0.71, 95% CI (0.43, 0.90)); however, considerable intra-tumor heterogeneity of epithelial–mesenchymal transition scores were observed in some tumors. For example, within tumor Pa6, only Pa6.R4 clearly demonstrated overexpression of mesenchymal-specific genes and corresponding high epithelial–mesenchymal transition score, while all the other three tumor regions showed very low epithelial–mesenchymal transition scores (Fig. 3a). In addition, although the expression levels of epithelial–mesenchymal transition-related genes were mostly correlated between different regions within the same tumors, the correlation was weak for many tumor pairs further suggesting the spatially heterogeneous expression of epithelial–mesenchymal transition-related genes (Fig. 3b and Supplementary Figure 2).

Fig. 3
figure 3

a Epithelial–mesenchymal transition scores for all tumor samples. Pa1 to Pa6: adenocarcinoma, Pa7 and Pa8: squamous cell carcinoma, Pa9: large-cell carcinoma and Pa10: pleomorphic carcinoma. b All pairwise epithelial–mesenchymal transition-related gene expression correlations for Pa6. The Pearson correlation coefficient is shown within each plot

Next, we selected two other frequently cited mRNA microarray-based prognostic signatures [18, 19] and calculated the risk scores in each tumor region. As expected, the risk score varied between different tumor regions within the same tumors (Supplementary Table 4). Interestingly, when every tumor region was classified into high- and low-risk prognostic subgroups based on the same criteria in the original studies, discordant prognostication derived from different tumor regions within the same tumor was observed in 1 of the 10 patients using the 6-gene signature and in 7 of the 10 patients using the 15-gene signature, respectively. These results suggest that gene expression-based prognostic signature from a single biopsy could either under- or over-estimate the risk of recurrence of patients with non-small cell lung cancer due to the gene expression intra-tumor heterogeneity.

Varied innate anti-PD-1 resistance signatures within the same tumor

Recent studies have shown that gene expression-based molecular subtypes were associated with response to immune checkpoint inhibition in bladder cancer [33] and melanoma [20]. Hugo et al. reported innate anti-PD-1 resistance transcriptomic signature in melanoma and other cancers including lung cancer [20]. Using the same innate anti-PD-1 resistance signature, we characterized our tumor samples (Fig. 4a). For a given sample, we computed the innate anti-PD-1 resistance co-enrichment score to measure the level of co-enrichment of the 22 innate anti-PD-1 resistance content signatures (Supplementary Table 5). In general, innate anti-PD-1 resistance co-enrichment score was similar within the same tumor (intraclass correlation coefficient of innate anti-PD-1 resistance co-enrichment score: 0.77, 95% CI (0.52, 0.93)). However, spatial heterogeneity was observed in multiple tumors. When we dichotomized samples by the innate anti-PD-1 resistance co-enrichment score cutoff (0.35) that was used in the original literature, 4 in 10 patients harbored at least one discordant tumor region. For example, in tumor Pa6, Pa6.R4 is considered as an anti-PD-1-resistant tumor by showing the high innate anti-PD-1 resistance co-enrichment score of 0.94. However, the other three tumor regions (Pa6.R1, Pa6.R2, and Pa6.R3) showed low innate anti-PD-1 resistance co-enrichment scores of −0.94, 0.06, and −0.29, respectively. Our data suggest that the intra-tumor heterogeneity may need to be taken into consideration if gene expression-associated signatures were to be used as biomarkers for immune checkpoint blockade therapy.

Fig. 4
figure 4

a Gene set variation analysis enrichment scores for 22 innate anti-PD-1 resistance content signatures were calculated for each sample. The rows represent signatures and the columns represent samples. Innate anti-PD-1 resistance co-enrichment status was determined by the innate anti-PD-1 resistance co-enrichment score cutoff 0.35 and shown on top of the heat map. b Correlation of tumor microenvironment scores with various gene expression signatures examined in this study. Pearson correlation coefficients are shown and the ones with p value <0.01 are marked with an asterisk

Varied tumor microenvironment

Cancers are not homogenous masses of malignant cells. Many other cells are recruited to the tumor tissue including fibroblasts, blood and lymphatic vessel endothelial cells, and infiltrating immune cells, which constitute the tumor microenvironment. Each cell type may have distinct gene expression profile of its own and the overall gene expression profile of a given tumor sample reflects the amount of each cell type and the interactions between cancer cells and non-transformed cells in that tumor microenvironment. To understand the impact of tumor microenvironment on gene expression intra-tumor heterogeneity, we inferred the proportions of infiltrating stromal and immune cells for each tumor region using a gene expression-based algorithm ESTIMATE [22] and correlated tumor microenvironment scores with the gene expression signatures examined in this study. Our results showed that different tumor regions have different proportions of cancer cells, infiltrating stromal and immune cells (Supplementary Table 6). As shown in Fig. 4b and Supplementary Figure 3a, the overall gene expression variation was only weakly correlated with stromal score, immune score, or tumor purity with Pearson correlation coefficient of −0.41, −0.31, and 0.39, respectively. However, we observed high level of correlation (absolute Pearson correlation coefficient: 0.58–0.87) between epithelial–mesenchymal transition/innate anti-PD-1 resistance score and stromal/immune score (Fig. 4b and Supplementary Figure 3b–c). Among the lung adenocarcinoma molecular subtypes, proximal inflammatory subtype was positively correlated with stromal and immune scores (Fig. 4b and Supplementary Figure 3f).

Discussion

Using an approach of multiregion sampling and comprehensive gene expression analysis of 10 non-small cell lung cancers, we demonstrated clear evidence of gene expression intra-tumor heterogeneity in all tumors studied. Two tumor samples showed substantial level of global intra-tumor heterogeneity comparable to heterogeneity between different patients (Fig. 1). In addition to the global gene expression profile, substantial intra-tumor heterogeneity was also observed for molecular subtypes and various clinically relevant gene expression signatures including epithelial–mesenchymal transition signature, a recently reported immune therapy response signature and two frequently cited prognostic signatures (Figs. 24 and Supplementary Tables 25). Interestingly, global intra-tumor heterogeneity did not always correlate with intra-tumor heterogeneity observed in the signatures with selected genes.

Programmed cell death protein 1 (PD-1) immune checkpoint blockade has shown unprecedented durable clinical responses in patients with various cancer types including metastatic non-small cell lung cancer [34]; however, the response rate is suboptimal [35, 36]. There are currently no reliable biomarkers to predict response to immune checkpoint blockade. Expression of programmed death-ligand 1 (PD-L1) protein was reported to be predictive of benefit from immune checkpoint blockade treatment, but robust responses seen in patients with low PD-L1 argues against its value as an exclusionary predictive biomarker [37]. Gene expression-based signatures have been recently reported to correlate with response to immune checkpoint inhibition in various cancer types and proposed as potential predictive markers [20, 33]. Using a recently published innate anti-PD-1 resistance signature analysis, our results showed 4 out of the 10 patients harbored substantially different innate anti-PD-1 resistance scores in different tumor regions within the same tumors that may lead to discordant prediction of response to anti-PD-1 checkpoint inhibition. When correlating to the tumor microenvironment, innate anti-PD-1 resistance score was found to correlate negatively with tumor purity, and positively with immune score and stromal score, which is as expected because it is well known that tumor infiltrating lymphocyte profile, particularly CD8+ T cell infiltrate is associated with response to anti-PD-1 therapy [38]. What is interesting is that the correlation of innate anti-PD-1 resistance score with stromal score was higher than that with the immune score (correlation coefficient of 0.87 versus 0.67), indicating that in addition to immune cells, other stromal cells may also contribute to impact on the anti-PD-1 response signature.

Cancer tissues are composed of both cancer cells and non-cancerous stromal cells that form the tumor microenvironment. Our data demonstrated heterogeneous tumor microenvironments in spatially separated tumor regions within the same tumor (Supplementary Table 6). This heterogeneous tumor microenvironment is closely correlated to some characteristics of tumor represented by the gene expression signatures explored in this study (Supplementary Figure 3). However, these correlations were only marginal at overall gene expression profile. In addition, different tumor regions from the same patients almost always clustered together in spite of different tumor purity, stromal or immune scores (Fig. 1). Taken together, our data suggest that the gene expression intra-tumor heterogeneity observed in this study may be attributed to the different tumor microenvironments as well as spatial difference in gene expression profiles of lung cancer cells.

Many prognostic gene signatures have been developed by diverse analytical techniques. Commercially available and clinically applicable prognostic tests like Pervenio™ Lung RS [39] are often based on small prognostic and predictive gene sets with RT-PCR protocol. Future studies include the investigation on the intra-tumor heterogeneity of those signatures with the same techniques by which they were developed. Moreover, higher-resolution technologies, such as RNA sequencing on a larger cohort of patients, ideally supported by genomics and epigenetic data will help us to better understand the gene expression intra-tumor heterogeneity in lung cancer and its impact on cancer biology and clinical outcomes.

In summary, our study is the first study, to the best of our knowledge, on comprehensive gene expression intra-tumor heterogeneity of any cancer type. We observed clear evidence of gene expression intra-tumor heterogeneity from all tumors in this cohort for both the overall gene expression profile and the expression signatures with selected genes. Since many of these gene expression signatures have been reported to be associated with clinical outcomes of lung cancers and proposed as potential prognostic and/or predictive biomarkers, our results suggest that gene expression intra-tumor heterogeneity should be taken into consideration when evaluating gene expression-based biomarkers.