Genomic profiling of subcutaneous patient-derived xenografts reveals immune constraints on tumor evolution in childhood solid cancer

He, Funan; Bandyopadhyay, Abhik M.; Klesse, Laura J.; Rogojina, Anna; Chun, Sang H.; Butler, Erin; Hartshorne, Taylor; Holland, Trevor; Garcia, Dawn; Weldon, Korri; Prado, Luz-Nereida Perez; Langevin, Anne-Marie; Grimes, Allison C.; Sugalski, Aaron; Shah, Shafqat; Assanasen, Chatchawin; Lai, Zhao; Zou, Yi; Kurmashev, Dias; Xu, Lin; Xie, Yang; Chen, Yidong; Wang, Xiaojing; Tomlinson, Gail E.; Skapek, Stephen X.; Houghton, Peter J.; Kurmasheva, Raushan T.; Zheng, Siyuan

doi:10.1038/s41467-023-43373-1

Download PDF

Article
Open access
Published: 22 November 2023

Genomic profiling of subcutaneous patient-derived xenografts reveals immune constraints on tumor evolution in childhood solid cancer

Funan He ORCID: orcid.org/0000-0002-4592-4034^1,2^na1,
Abhik M. Bandyopadhyay¹^na1,
Laura J. Klesse ORCID: orcid.org/0000-0003-1323-7720^3,4,5,
Anna Rogojina¹,
Sang H. Chun⁶,
Erin Butler^3,4,5,
Taylor Hartshorne ORCID: orcid.org/0009-0009-2036-9757³,
Trevor Holland¹,
Dawn Garcia¹,
Korri Weldon¹,
Luz-Nereida Perez Prado¹,
Anne-Marie Langevin^7,8,
Allison C. Grimes^1,7,
Aaron Sugalski⁷,
Shafqat Shah⁷,
Chatchawin Assanasen^7,8,
Zhao Lai^1,8,9,
Yi Zou¹,
Dias Kurmashev¹,
Lin Xu ORCID: orcid.org/0000-0001-5815-4457^3,4,10,
Yang Xie^4,10,11,
Yidong Chen^1,2,8,
Xiaojing Wang ORCID: orcid.org/0000-0002-2631-4708^1,2,8,
Gail E. Tomlinson^1,7,8,
Stephen X. Skapek^3,4,5,
Peter J. Houghton^1,8,9,
Raushan T. Kurmasheva ORCID: orcid.org/0000-0003-3212-2363^1,8,9 &
…
Siyuan Zheng ORCID: orcid.org/0000-0002-1031-9424^1,2,8

Nature Communications volume 14, Article number: 7600 (2023) Cite this article

2085 Accesses
17 Altmetric
Metrics details

Subjects

Abstract

Subcutaneous patient-derived xenografts (PDXs) are an important tool for childhood cancer research. Here, we describe a resource of 68 early passage PDXs established from 65 pediatric solid tumor patients. Through genomic profiling of paired PDXs and patient tumors (PTs), we observe low mutational similarity in about 30% of the PT/PDX pairs. Clonal analysis in these pairs show an aggressive PT minor subclone seeds the major clone in the PDX. We show evidence that this subclone is more immunogenic and is likely suppressed by immune responses in the PT. These results suggest interplay between intratumoral heterogeneity and antitumor immunity may underlie the genetic disparity between PTs and PDXs. We further show that PDXs generally recapitulate PTs in copy number and transcriptomic profiles. Finally, we report a gene fusion LRPAP1-PDGFRA. In summary, we report a childhood cancer PDX resource and our study highlights the role of immune constraints on tumor evolution.

PERCEPTION predicts patient response and resistance to treatment using single-cell transcriptomics of their tumors

Article 18 April 2024

Feasibility of functional precision medicine for guiding treatment of relapsed or refractory pediatric cancers

Article Open access 11 April 2024

Discovery of tumor-reactive T cell receptors by massively parallel library synthesis and screening

Article 23 April 2024

Introduction

Childhood cancers represent about 1% of newly diagnosed cancer cases in the US. Though rare, cancer is the leading cause of disease-related death in children¹. More than 60% of childhood cancer cases are solid tumors. The average five-year survival rate for children with solid cancers exceeds 80%, but survival for patients with metastatic or refractory tumors is still poor. Further, multimodality treatments cause long-term health problems and increase the risk of secondary cancer^2,3. Molecularly targeted therapies and immunotherapy can improve overall patient outcomes, but their development requires faithful preclinical models and a better understanding of antitumor immunity.

Patient-derived xenografts (PDXs) are an important model in cancer research. They are crucial for preclinical and mechanistic studies of rare cancers such as pediatric solid tumor because they can preserve tumor tissue in vivo^4,5,6,7. PDXs are established by engrafting tumor tissue either subcutaneously or orthotopically into immunocompromised mice. Compared with orthotopic PDXs, subcutaneous PDXs are easier to establish and monitor tumor size. Preclinical testing studies with subcutaneous PDXs showed that they can robustly inform drug activity in patients^8,9.

A fundamental question about PDXs is how well they recapitulate the patient tumors (PTs). In adult cancers, PDXs were found to recapitulate PTs in histology, genetics, and pharmacokinetics^10,11,12,13. However, genomic profiling of large PDX cohorts found evidence of clonal evolution during engraftment and passaging, leading to debates over model fidelity^14,15,16. In childhood solid cancers, similar genomic profiling efforts were undertaken, but often without matched PTs or germline samples¹⁷. Other studies were focused on single cancer types^18,19,20, or orthotopic models^21,22, or with a limited sample size^23,24,25. Moreover, many rare childhood cancers such as hepatoblastoma were often not included. Importantly, both adult and childhood cancer studies have found PDXs that showed poor mutational similarities with PTs^18,21,26, but the underlying mechanism leading to the disparity remains obscure.

Here, we report genomic profiling of 68 solid childhood cancer subcutaneous PDXs. These models were established from 65 pediatric solid tumors across 16 cancer types.

Results

Overview of patient samples, PDXs, and genomic data

We generated 90 subcutaneous PDXs from 194 fresh solid tumor samples using a previously published protocol²⁷ (“Methods”). All patients were younger than 18 years old at the time of tumor collection, with both biological sexes represented (Male:Female, 1.2:1). Of the patients with treatment information available, 38% received prior treatment, primarily chemotherapy.

We observed high engraftment rates in clear cell sarcoma (100%) and Wilms tumor (85%), and lower rates in neuroblastoma (26%) and brain tumors (23%) (Fig. 1a). The engraftment rates for neuroblastoma and Wilms tumor were similar to that of previously published orthotopic models²¹, but the rate for osteosarcoma in our cohort was higher (67% vs. 48%). The average time from tumor implantation (P0) to PDX harvest (P1) also varied from 30 weeks for neuroblastoma to 13 weeks for hepatoblastoma (Fig. 1b).

**Fig. 1: Overview of PDXs and sequencing data.**

We performed low pass whole genome sequencing (WGS), whole exome sequencing (WES), and mRNA sequencing (RNAseq) on 68 PDXs (Fig. 1c). Among the 68 PDXs, 27 (40%) had the matched patient tumor (PT) and 40 (59%) had normal germline DNA. Isogeneity of the matched samples was confirmed using DNA and RNA sequencing data (Supplementary Fig. 1). The PDX cohort comprised 14 Wilms tumors, 13 hepatoblastomas, 12 osteosarcomas, 10 germ cell tumors, and 19 others. These PDXs were derived from tumor tissues of 65 patients, all younger than 18 years (median 6.5; mean 7.7). Male to female ratio (1.3:1) was similar to the overall patient cohort (n = 90, 1.2:1). Thirty patients were of Hispanic ancestry, and this was confirmed using ancestry informative markers²⁸ (Supplementary Fig. 2). Treatment information was collected for 62 patients, 23 of whom received prior treatment (37%). Sixty-two models were derived from primary tumors and five from metastatic tumors. Clinical data of the samples were summarized in Supplementary Data 1. To help disseminate this resource, we have built an intuitive online portal (pediatric solid tumor PDX portal, https://pstPDX.streamlit.app). Requests for PDX materials can also be made on the site.

We observed higher tumor purity in PDXs than in PTs (p = 2.1 × 10⁻³, two-sided t test; Supplementary Fig. 3a, b). Immune and stromal cell signature scores were lower in PDXs (Supplementary Fig. 3c–e). Interestingly, the signature scores of stromal cells in PDXs were positively correlated with those in PTs, suggesting early passage PDXs still retained some stromal cells from the PT (rho = 0.66, p = 0.003; Spearman correlation). We did not observe significant correlation for immune scores (Supplementary Fig. 3f).

Mutational similarity between PT and PDX

We used multiple tools to detect somatic mutations and indels (insertions and deletions) (Methods). In total, we identified 1786 mutations and 161 indels from WES data. Ninety-two percent of the point mutations were validated in RNAseq or low pass WGS. The unvalidated mutations had lower variant allele fractions (VAFs, mean 0.13 vs. 0.34 for validated mutations). Deep sequencing of a PT/PDX pair yielded a 100% validation rate on 60 mutations that were captured in the assay (Supplementary Data 2). We did not observe significant differences in mutation rate between PTs and PDXs except in Wilms tumor, or tumors with and without a germline control except in germ cell tumor (Supplementary Fig. 4a, b). Across the cancer types, Wilms tumor showed the lowest mutation rate (median: 0.18 mutations/Mb), and osteosarcoma showed the highest (median: 0.56 mutations/Mb) (Fig. 2a). These mutation rates agree with results from recent pan-pediatric cancer analyses^29,30.

**Fig. 2: Mutation rates and mutational signatures.**

Few cancer genes showed recurrent mutations across the cohort, consistent with the overall low mutation rate of childhood cancers (Supplementary Fig. 4c). The exception was CTNNB1, which was mutated in 7 of 11 (64%) hepatoblastomas with exome sequencing data. Mutation rates of known driver genes from our dataset were generally consistent with the literature (Supplementary Data 3). Pan-cancer analysis with MutsigCV³¹ identified only CTNNB1 and TP53 as significant mutated genes across the PDXs (FDR < 0.1) (Supplementary Data 4).

We observed significantly higher mutation rates in prior treated PTs or their derived PDXs than in treatment-naïve samples (p = 2.3 × 10⁻⁴, Wilcoxon rank sum test; Fig. 2a). To corroborate the association between higher mutation rates and chemotherapy, we deconvoluted mutations into mutational signatures. Such deconvolution can identify mechanisms that cause mutations in the cancer genome³². We in total analyzed 14 samples with at least 20 mutations (Fig. 2b). Among the 10 samples that were derived from patients who had received chemotherapy, we found evidence of chemotherapy related mutational signatures in 8 (2 PTs and 6 PDXs), seven associated with the platinum drug related signatures SBS31 and SBS35, and one with SBS86, a signature currently associated with unknown chemotherapy treatment. The samples demonstrating signature SBS31 or SBS35 were derived from six patients. Except for an osteosarcoma patient (560-LM) who received unspecified chemo-treatment, all the other five patients received cisplatin, a platinum-based drug. This data supports the mutational signature analysis. The sample not exhibiting chemotherapy signatures (1981_PDX) showed SBS15, a signature associated with microsatellite instability (MSI). Consistently, the sample showed a high MSI score (42.4% vs. average of all others, 1.6%).

For the two PTs that exhibited high mutation rates and chemotherapy signatures (1792_PT, 1957_PT), their corresponding PDXs also exhibited the same signatures (Fig. 2b). Another PT sample, 585_PT, was dropped from mutational signature analysis due to its low mutation count (n = 6); however, the matched PDX exhibited SBS31 and SBS35. The consistency in demonstrating chemotherapy signatures was not necessarily driven by shared mutations between PTs and PDXs. For 585 and 1957, PTs and PDXs had little overlap in somatic mutations (Fig. 2c). Using PDX-specific mutations yielded the same chemotherapy signatures for the two samples (Supplementary Fig. 4d). Thus, these data suggest the related mutations in these PDXs were inherited from the seeding PTs.

Next, we examined mutational similarities between PTs and PDXs using 25 PT/PDX pairs (Fig. 2c). We defined mutational similarity as the fraction of shared mutations over all mutations found in each pair. Overall, 78% of mutations were shared between PTs and PDXs. The median mutational similarity was 0.52, higher than those observed in recently published pediatric cancer PDX cohorts^18,21 but lower than that in adult tumors²⁶. Limiting the comparison to cancer genes (Supplementary Data 5) increased the median mutation similarity to 0.95 for the 20 pairs where at least one cancer gene mutation was observed (Supplementary Fig. 4e, f). Oncogenic or likely oncogenic mutations demonstrated a high level of overlap (28/30, 93%) between PTs and PDXs. Five pairs showed low mutational similarity (<0.2), including two (1959, 1979) with no shared mutations. To test if the low mutational similarity was due to sequencing coverage, we performed capture enrichment and deep sequencing on a pair of PT and PDX samples (585_PT and 585_PDX). By WES, six mutations were found in 585_PT, and 58 mutations were found in 585_PDX, 56 of which were not found in 585_PT. Deep sequencing captured 54 mutations found in 585_PDX, all validated. Similarly, all six mutations found in 585_PT were validated by deep sequencing (Supplementary Data 2). None of the PT or PDX specific mutations were found in the matched sample in the deep sequencing data, suggesting limited impact by sequencing depth on the observed PT/PDX mutational similarity in this case.

To understand how intratumoral heterogeneity can impact PT/PDX mutational similarity, we obtained seven additional PDX samples that matched four PTs. Six of the seven PDXs were established from a distinct patient tumor block, and the remainder was a second block of the originally sequenced PDX. Comparison of these additional PDXs with matched PTs demonstrated generally consistent mutation similarities in these samples (Supplementary Fig. 4g).

Distinct evolutionary patterns during engraftment

To explore the clonal dynamics in tumor engraftment, we inferred mutation clonality using a consensus approach for the 25 PT-PDX pairs (“Methods”). Overall, 82% of mutations in PTs and 84% of mutations in PDXs were clonal, but this percentage was highly case specific (Supplementary Fig. 5). While 88% of PT clonal mutations were observed in the PDX, only 22% of PT subclonal mutations were observed in the PDX. This result was consistent with the expectation that clonal mutations more likely pass on than subclonal mutations. To further validate mutation clonality, we examined presence of PT clonal and subclonal mutations in the additional PDXs. For the four PTs with multiple PDXs, all PT clonal mutations (n = 30) that were observed in the original PDX were also observed in the additional PDXs. In contrast, only two of the 17 PT subclonal mutations were observed in the additional PDXs. Notably, 33% of PDX clonal mutations were not found in the PT, suggesting clonal expansion during engraftment (Supplementary Fig. 5a).

We next classified paired samples into distinct evolutionary patterns based on changes in mutation clonality from PTs to PDXs. For this analysis, we excluded the two PT/PDX pairs (1959 and 1979) that showed no mutational overlaps. We observed three patterns. In the first pattern, PDXs retain clonal mutations from the PT and exhibit a similar clonal composition. We call this pattern ‘clone retention’ (Fig. 3a and Supplementary Fig. 5b). This pattern constituted 70% (16/23) of the pairs classified. The second pattern was characterized by expansion of PT subclones in the PDX (Fig. 3b and Supplementary Fig. 5c). This pattern, termed “clone sweeping,” was observed in four pairs (17%). The last pattern was characterized by loss of PT clonal mutations and retention of early mutations in the PDX (Fig. 3c and Supplementary Fig. 5d). This pattern, termed ‘branch seeding’, was observed in three pairs (13%). The loss of PT clonal mutations was not due to copy number deletion in the paired PDX. One example of this pattern was a hepatoblastoma sample (1957); only two of the 41 PT mutations were found in the PDX, one of which was in CTNNB1, an early driver of the cancer type³³ (Fig. 3c).

**Fig. 3: Evolutionary patterns from PTs to PDXs.**

The evolutionary patterns appeared to be reproducible across multiple PDXs. In two samples (1913,1932) that were classified as clone sweeping, the evidence for these classifications were that in both cases, a PT subclonal mutation became a clonal mutation in the PDX (LRP2 for 1913, and BMP4 for 1932). Interestingly, the same LRP2 mutation was also identified in the two additional 1913 PDXs where the mutation appears to be clonal (VAF 0.44 and 0.45, vs. 0.09 in PT). Similarly for 1932, the BMP4 mutation was observed in the two additional PDXs, also with much higher VAFs in the PDXs (0.37 and 0.38 vs. 0.12 in PT).

Both patterns of clone sweeping and branch seeding indicate that a subclone in the PT seeds the PDX, likely by outcompeting other clones. For simplicity, we lump them together as one group (group 2), to compare with samples showing the clone retention pattern (group 1). Unlike continued expansion of the major PT clone in the PDX (group 1), clonal selection observed in group 2 would take longer to establish a major clone. Consistent with this idea, the median time for group 2 models to reach the harvest tumor volume after implantation was 22 weeks, compared to 13 weeks for group 1 models (p = 0.03, Wilcoxon rank sum test; Fig. 3d).

The longer engraftment time could explain the increased number of PDX specific mutations in group 2 (Fig. 4a). To test this possibility, we correlated the two and found no significant correlation (p = 0.18, Spearman correlation test; Supplementary Fig. 5e). This lack of correlation remained after controlling for the PT mutation rate for each PDX (p = 0.13; Supplementary Fig. 5f).

**Fig. 4: Evolutionary pattern is associated with mutational similarity and antitumor immunity in PT.**

To provide further evidence for the distinct evolutionary paths, we analyzed tumor telomere lengths. Telomeres progressively shorten along cell divisions³⁴; thus, continued growth of the same clones from PT to PDX such as in group 1 would likely result in shorter telomeres in the PDX. We estimated average tumor telomere lengths using both WGS and WES data (Methods). The two data types yielded consistent telomere length estimates in PDXs, PTs, and germline samples (Supplementary Fig. 6a and Supplementary Data 6). Across the cancer types, germ cell tumor showed the longest telomeres (Fig. 3e and Supplementary Fig. 6b), likely due to its origin from telomerase-competent germ cells. Among non-germ cell cancers, osteosarcoma showed the highest telomere length, consistent with a recent report³⁵.

PDXs showed overall shorter telomeres than matched PTs (p = 1.5 × 10⁻³, paired t test; Fig. 3f and Supplementary Fig. 6c). The pattern of telomere shortening was more pronounced in group 1 than in group 2 tumors (p = 0.033, Wilcoxon rank sum test; Fig. 3g and Supplementary Fig. 6d). These data provide additional evidence that group 2 tumors underwent a distinct evolutionary path from group 1 tumors.

Clonal selection during engraftment associates with genetic heterogeneity and antitumor immunity in the PT

To provide insights into the three evolutionary patterns, we correlated them with PT/PDX mutational similarity, prior treatment, and tumor genetic heterogeneity defined as the fraction of subclonal mutations in a sample. ‘Clone retention’ tumors showed an average mutational similarity of 0.73, compared to 0.42 for ‘clone sweeping’ and 0.06 for ‘branch seeding’ (group 1 vs. group 2, p = 0.0012, Wilcoxon rank sum test). The decreasing mutational similarity was associated with PT (p = 0.034, Spearman correlation; Supplementary Fig. 7a) but not PDX genetic heterogeneity (p = 0.38, Fig. 4a). Thus, PTs with more complex clonal structures tend to generate genetically more distinct subcutaneous PDXs. We did not find a significant association between chemotherapy and the evolutionary patterns (p = 1, chi-square test).

We next asked what drove the clonal selection in group 2 tumors. For these pairs, we denote the major clone in the PT as C_pt and the major clone in the PDX as C_pdx. Based on their evolutionary pattern, clone C_pdx was a minor clone in the PT. We hypothesized that C_pdx had a growth advantage so that it could overtake C_pt during engraftment. To test this hypothesis, we compared proliferation markers and cell cycle signatures between PTs and PDXs, assuming expression of bulk tumor reflected the major clone’s. Consistent with the hypothesis, expression of proliferation markers and cell cycle signatures was higher in PDXs than in the matched PTs for group 2 tumors, suggesting the PDX major clone was indeed more proliferative in these pairs. In contrast, the opposite pattern was observed in group 1 tumors (Fig. 4b).

If C_pdx was more proliferative, why was it not the major clone in the PT? We reasoned that its expansion could be constrained in the PT, but such constraint was weakened or even nullified in the PDX. Because PDX-host mice have no functional immune system, immune surveillance may contribute to this constraint by preferentially targeting C_pdx.

To test this hypothesis, we first compared mutational load. For both groups, no significant difference in mutational load was observed between PTs and PDXs (Group 1, p = 0.2; Group 2, p = 0.3, paired t test; Supplementary Fig. 7b). We next compared neoantigens (Supplementary Data 7; “Methods”). In group 1, no difference was found in clonal neoantigen load between PTs and PDXs (p = 0.87, paired t test). However, in group 2, PDXs showed significantly more clonal neoantigens than their matched PTs (p = 0.03, paired t test; Fig. 4c). This pattern remained after controlling for the total number of clonal mutations (p = 0.03, paired t test; Supplementary Fig. 7c). Thus, despite being more proliferative, C_pdx also expressed more neoantigens.

There was evidence that subclonal neoantigens are more immunogenic than clonal ones³⁶. Because C_pdx was a subclone in the PT, we sought to find signs of antitumor immunity in group 2 PTs. We compared their expression with those from group 1 PTs. Pathway analysis with GSEA identified inflammasomes as the second highest pathway ranked by normalized enrichment score in group 2 PTs (p = 0.03; Fig. 4d and Supplementary Data 8). Inflammasomes are the receptors and sensors of the innate immune system³⁷. They are assembled in professional antigen-presenting cells (APCs), which are constituents of the innate immunity and bridge the innate and adaptive immune systems³⁸. High inflammasome activity suggests possible activation of the innate immunity. Consistently, gene signatures related to natural killer cells, a major effector cell of the innate immune system, were higher in group 2 PTs (Supplementary Fig. 7d). Expression of HLA genes, which encode major histocompatibility complexes (MHCs) on the surface of APCs, was significantly higher in group 2 PTs than group 1 PTs (p = 0.02, Wilcoxon rank sum test; Fig. 4e). We next examined tumor microenvironment using multiple deconvolution tools (“Methods”). Interestingly, the abundance of cancer associated fibroblast, a known immunosuppressive cell population³⁹, was consistently reported lower in group 2 PTs (Supplementary Fig. 7e).

Taken together, these results suggest in group 2 PT/PDX pairs, a more proliferative but also more immunogenic PT subclone was selected to seed the PDX. In the context of immune deficiency in the host mice and activated immune responses in the PT, these data implicate a role of immune environment changes in fostering this selection during engraftment.

PDXs retain somatic copy number alterations

Whether somatic copy number alterations (SCNAs) undergo PDX specific evolution has been recently debated in adult cancer^14,15,16. To examine SCNA conservation in our PDXs, we inferred copy number profiles using low pass WGS data (Methods). This data provides better resolution than exome sequencing-based estimates. Using these data, we identified 15 amplification and 19 deletion peaks (Supplementary Fig. 8a). Genes located in the amplification peaks included MYC, cell cycle genes CCND3 and CCNE1, chromatin regulators SETDB1 and EZH2, and DNA repair gene XRCC2. Genes located in deletion peaks included TP53, PTEN, DNA repair genes RAD51, FANCA, ATM, CHEK1, POLD1, apoptosis regulators BAX and BCL2, hypoxia regulator HIF1A, and interestingly PD-L1.

On the cancer type level, SCNA profiles were similar between PTs and PDXs, and were consistent with the literature (Supplementary Fig. 8b). For instance, we observed frequent gain of chr1q (57%) and loss of chr11 (30%) in Wilms tumor at a rate similar to previous reports^40,41. Few SCNAs were observed in hepatoblastoma except arm-level gains of 1q (46%), 2q (41%), 20 (41%), and 8q (24%), as previously reported⁴². We observed frequent gain of chromosome 12p (85%), 21 (62%), 7p (54%), and loss of chromosome 4 (46%) and 5 (38%) in germ cell tumor. These rates were also consistent with previous studies^43,44.

To quantify SCNA conservation, we first compared tumor ploidy, a measure of genome wide SCNA. We found high tumor ploidy, likely driven by whole genome doubling (WGD), in 77% of germ cell tumors and 67% of osteosarcomas (Fig. 5a). Tumor ploidy was highly similar between PDXs and PTs, including the group 2 tumors (rho = 0.98, p = 3.2 × 10⁻¹⁵, Spearman correlation; Fig. 5b), suggesting conservation of karyotype in the PDX. However, we did observe two exceptions (8%, patients 1959 and 1979) where drastic change in ploidy was found in the PDX. Consistent with this result, the PT and PDX of the two pairs did not show any overlap in mutations.

**Fig. 5: Conservation of somatic copy number alterations (SCNAs) in PDXs.**

Next, we compared global chromosomal instability using a genomic instability (GI) score (“Methods”; Supplementary Data 9). Osteosarcoma, a cancer characterized by high chromosomal instability⁴⁵, showed the highest scores (Supplementary Fig. 9a). Tumors with relatively quiet genomes like neuroblastoma, clear cell sarcoma, and hepatoblastoma showed the lowest scores. GI scores were positively correlated between PTs and PDXs (rho = 0.75, p = 1.7 × 10⁻⁵, Spearman correlation; Supplementary Fig. 9b).

Finally, we correlated copy number profiles for each PT/PDX pair (Methods). After excluding samples with few SCNAs (total GI score <0.1), we found strong pairwise correlations between PTs and PDXs (Fig. 5c). Limiting this analysis to cancer genes yielded a similar result (Supplementary Fig. 9c). These strong correlations remained between PTs and multiple PDXs that were derived from the same patient tumor (Supplementary Fig. 9d). The correlations were similar between group 1 and group 2 tumors (p = 0.6, t test; Supplementary Fig. 9e), suggesting SCNAs were primarily clonal. We then asked if focal events were retained in the PDX. In total we identified 292 focal events in nine samples, seven of which were sarcomas. The aggregated length of these events ranged from 2 Mb to 462 Mb (Supplementary Fig. 9f). Overall, the overlap of focal events was better than that of mutations, with 86% of them shared between PTs and PDXs (Fig. 5d). Unlike mutations, conservation of PT focal events in the PDX was observed in each pair analyzed. Only one PDX (2035) showed notably more private focal events. Taken together, these data show strong conservation of SCNAs in early passage PDXs.

Transcriptomic analysis shows tissue effect and identifies fusions

We next examined how the PDXs recapitulated PTs in gene expression. Unsupervised clustering grouped samples into tissues of origin except clear cell sarcoma (Fig. 6a). Close analysis showed that clear cell sarcomas were divided into two groups, one consisting of samples collected from the kidney (1754 and 2324), and the other consisting of samples collected from the bone (529). This tissue-of-origin dominated pattern was previously observed in adult cancers⁴⁶.

**Fig. 6: Transcriptomic similarity and gene fusion.**

We observed highly correlated expression profiles of the matched PTs and PDXs (rho range 0.92–1, Spearman correlation; Fig. 6b). To put these correlations in context, we compared PDXs that were derived from two metastatic lesions of the same patient (560 lung and skin metastases), and PDXs derived from different blocks of the same tumor (1939). The correlation between the two PDXs of patient 560 was 0.94, and the correlation between the two PDXs of patient 1939 was 0.97. The high correlation was similar across the three evolutionary patterns (p = 0.87, t test; Supplementary Fig. 10a). Hepatoblastoma and Wilms tumor samples showed significant intra-lineage correlations, corroborating results from unsupervised clustering. These results show that gene expression is highly conserved in PDXs and is dictated by both cancer genetics and tissue of origin.

To identify molecular alterations, we called gene fusions using RNAseq data. We identified 161 high-confidence gene fusions (Supplementary Data 10; “Methods”), including disease-defining fusions such as reciprocal EWSR1-ATF1 in a clear cell sarcoma (patient 529) and BCOR-CCNB3 in a Ewing-like sarcoma (patient 2197). Most of the fusion events (n = 125, 78%) were found in osteosarcoma and clear cell sarcoma, and their distribution across cancer types was generally consistent with that of chromosomal instability (Supplementary Fig. 10b, c).

Paired PTs and PDXs showed significant overlaps in fusions. Of the 18 paired samples with RNAseq data, we identified at least one fusion in six pairs, and 97% (29/30) of the fusions detected in the PT were also found in the PDX (Supplementary Fig. 10d).

Next, we mapped the fusions to kinases and clinically actionable genes (Supplementary Data 10). Of the fusions identified in PDXs, 14 involved a kinase gene and 12 involved a clinically actionable gene, including TAOK1-NTRK3 (Fig. 6c). Inhibition of NTRK fusions showed promising clinical benefits in patients^47,48. Importantly, we observed a fusion, LRPAP1-PDGFRA, in a glioblastoma and a germ cell tumor. The fusion preserved the protein kinase domain of PDGFRA and was associated with high PDGFRA expression (Fig. 6c and Supplementary Fig. 10e). Exon-level expression aligned with the fusion breakpoints. We further validated the fusion in both samples using RT-PCR (Fig. 6d).

Discussion

Solid tumors are rare in children; the rarity poses a significant challenge for building resources at scale. Preserving tumor tissue in rodents is essential for preclinical and mechanistic studies and for resource sharing. Here, we have built a resource of 68 subcutaneous xenografts derived from pediatric solid tumors, including several very rare cancer types. All the PDXs have been molecularly characterized, and the tissue materials are ready to be distributed upon request.

With this resource, we determined conservation of mutations, SCNA, and expression profiles in PDXs. We found that early-passage PDXs faithfully retain expression profiles of the PT, suggesting gene expression is tumor-intrinsic. The conservation of gene expression is not dependent on model mutational similarity; thus, expression can be a more robust tool to transfer preclinical insights from PDXs to PTs. We observed that PDXs generally conserve the SCNA profiles of the PT. The conservation of SCNAs was also observed in adult cancer PDXs¹⁴. These observations are consistent with results from single cell sequencing that suggested most SCNAs are early events during transformation⁴⁹. Further insights in SCNA stability can be gleaned through its characterization over serial PDX passaging^19,50.

We found significant mutation disparity in ~30% of the PDXs, and this disparity was associated with high genetic heterogeneity of the PT. Thus, more heterogenous tumors tend to generate genetically more different PDXs. The association could result from sampling bias, where a tumor block distinct from the PT seeds the PDX. Alternatively, engraftment disrupts the clonal equilibrium of the cancer ecosystem and a subclone outcompetes other clones to become the dominant clone in the PDX. Both mechanisms are possible, but sampling bias is unlikely the major force as it cannot explain the longer engraftment time and higher proliferation of the PDXs that we observe in nearly every genetically disparate pair. Supporting this concept, additional PDXs established from distinct tumor tissue of patients 1913 and 1932, where significant mutation disparity between the PT and the original PDX was observed, demonstrated similar subclonal expansion.

Cancer heterogeneity fosters evolution⁵¹, but evolution is driven by environmental changes. Implantation of cancer cells in immunocompromised mice removes antitumor immunity for cancer cells, thus potentially allowing previously constrained, more immunogenic clones to grow. In support of this idea, we show that genetically disparate PDXs express significantly more clonal neoantigens than their matched PTs. For example, in patient 1957, we found two shared mutations between the PT and PDX. Of the 39 PT-specific mutations, none were predicted to encode a clonal neoantigen. In contrast, 10 clonal neoantigens were predicted out of the 32 PDX-specific mutations. The bulk of these PDX-specific neoantigens were unlikely acquired during PDX production. In mutational signature analysis, when chemo-related mutational signatures were identified in PTs, the same signatures were also identified in the PDXs with all PDX mutations or PDX specific mutations, even though the mice were never treated. Moreover, seven PDX specific clonal mutations from patient 1913 were also found in the two additional PDXs that were established from distinct patient tumor tissue. This observation provides evidence that these mutations preexist in the primary patient tumor, because it is virtually impossible for PDXs grown in different mice to acquire several identical mutations. The preexistence of PDX-specific mutation is also consistent with the low mutation rate of childhood cancers.

Recently, PDX co-clinical trials have been proposed to guide therapy at the time of tumor relapse⁵². Our data suggest such co-trials could misinform treatment when the tumor is highly heterogeneous. In addition, we show in some patients, the immune system, particularly the innate immune system, might have suppressed the more aggressive cancer subclones. Thus, leveraging this antitumor immunity in combination with surgical resection may benefit these patients.

In summary, we build a PDX resource for pediatric solid cancer research and describe evidence that the interplay between intratumor heterogeneity and immune constraints on tumor evolution may underlie the genetic disparity between PTs and PDXs. More studies with larger cohorts are warranted to further validate and extend this finding, including in adult cancers.

Methods

Sample collection and patient consent

This study complies with all relevant ethical regulations. It was approved by the Institutional Review Board (IRB) and the Institutional Animal Care and Use Committees (IACUC, protocol #15015) of University of Texas Southwestern Medical Center (UTSW), Dallas, TX and UT Health San Antonio (UTHSA), TX. The human aspect of this study was deemed minimal risk by the approving IRB. No specific ethics review was therefore required. Individuals with identified solid tumors, both benign and malignant, were approached and offered enrollment on an institutional, IRB approved biorepository prior to standard of care surgical procedures. For patients less than 18 years of age, parents or legally authorized representatives provided consent. Assent of the patient was required for those participants 10–17 years of age. All anatomic sites of disease were eligible and individuals were considered eligible if they were under 30 years of age at the time of consent. Patient race and ethnicity was self-reported. Biorepository consent included collection of medical waste for research, including the tumor utilized here, and consented for the generation of patient derived xenografts. Patients also consented to the collection of germline DNA, collection of basic demographic information and outcome data as part of the biorepository. Tissue from the surgical procedure which was considered excess or not necessary for diagnosis was collected, de-identified, and prepared for shipment and/or injection for development of patient derived xenografts. Tissue samples were kept at 4 °C until prepared for shipment and/or injection. All study procedures were completed after initial consent was obtained. Patient sex was not considered in the study design.

Establishment of solid tumor PDX model

Patient-derived Xenografts (PDX) were generated from childhood cancer patients as described earlier²⁷ with minor modifications described here. Subcutaneous human xenografts from patient derived tumors were generated in a highly immunodeficient NSG (NOD.Cg-Prkdc IL2-Rgnull/Szj) mouse model (Jackson Laboratories, Bar Harbor, ME, USA).

Tumor specimens were collected immediately after biopsy/surgery in antibiotic (2% penicillin/streptomycin) containing M199 medium. The specimens were transported same day to GCCRI from University Hospital or Methodist Hospital, San Antonio or shipped overnight from other institutions (UTSW and APEC14B1 project hospitals) to GCCRI using cold shipping containers and transplantation was performed the same day specimens were received.

Female NSG mice were received at 6–8 weeks of age and allowed a 7–10 day acclimation period. Animal cages are changed out once a week or every other week depending on housing type, e.g., microisolator or individually ventilated cages. The cages are maintained in animal rooms equipped to provide 10–15 air changes per hour. The room temperature is maintained at 21–26 °C, relative humidity between 30 and 70% with a 14:10 day:night light cycle. Transplantation was performed once mice gained an average body weight of 20 g. Hair was removed at the site of incision (above the base of tail over the spine). The transplantation was performed under a biological laminar flow cabinet, under anesthesia (in an induction chamber with the flow of 5% isoflurane at 4 L of oxygen per minute) until mice were unresponsive to a toe pinch.

The tumor fragments (made about 2 × 2 mm) were kept in fresh medium until the mouse was ready for transplantation. The site was swabbed with 70% ethanol, an incision was made (approx. 4 mm) and a pocket was created under the skin using scissors. Using forceps, a tumor piece was dipped briefly into Matrigel (supplemented with VEGF, 100 ng/ml, to enhance angiogenesis) and placed inside the pocket followed by irrigation with a drop of penicillin/streptomycin and the incision was closed by using a small drop of tissue glue (Vetbond).

Collection of PDX passages as viables and snaps

Mice were monitored for xenograft growth and healing of the incision. When tumors reached a size of about 1 × 1 cm, mice were terminated and tumors were collected as viables (in 7.5% DMSO/50% FBS) and kept frozen in a liquid nitrogen vapor tank and also snap frozen in liquid nitrogen to get Snaps for genomic analysis and preserved at −80 °C. Tumors were also further transplanted into additional mice (typically 5 mice) as donors for passaging of PDX. The maximal tumor size permitted by IACUC (800–1600 mm³) was not exceeded.

DNA and RNA sequencing

Genomic DNA was extracted with DNeasy Blood & Tissue Kit (QIAGEN). KAPA HyperPrep kit was utilized to construct DNA libraries for whole genome sequencing and whole-exome sequencing. Approximately 250–500 ng genomic DNA were sheared with Covaris S220 Ultra Sonicator to the average of 200–400 bp fragments for DNA-seq library preparation. Then one proportion of DNA-seq libraries was quantified and pooled together for whole genome sequencing using 150 bp paired end sequencing; other proportion of DNA-seq libraries (around 250 ng) were quantified and pooled to go through two rounds of hybridization to enrich the DNA fragments of exome regions by using IDT xGen Exome Research Panel (V1 and V2). The final WES library was amplified, quantified, and loaded for 100 bp paired end sequencing at UTHSA Genome Sequencing Facility. On average, WES was sequenced to 300× and low-pass WGS was sequenced to 4×.

RNA was isolated using RNeasy Mini Kit (QIAGEN). The quality of Total RNA was checked by Agilent Fragment Analyzer (Agilent Technologies, Santa Clara, CA), and only high-quality RNA samples (RQN > 7) were used for mRNA-seq library preparation and sequencing. Following the Illumina TruSeq stranded mRNA sample preparation guide, we used approximately 500 ng Total RNA for RNA-seq library preparation. After RNA-seq libraries were quantified, they were pooled and subsequently loaded for 100 bp paired read sequencing run on the Illumina HiSeq 3000 platform. An average of 80 million reads were obtained per sample.

Target enrichment and deep sequencing

Based on the mutation calling result from WES data, we designed the probes for unique somatic point mutations found in 585_PT and 585_PDX. Approximate 100 ng whole genome DNA was used for DNA-seq library preparation with Twist Library Preparation EF2.0 Enzymatic Fragmentation Kit (104206, Twist bioscience, San Francisco, CA). The whole genome DNA was sheared by enzymatic fragmentation and the fragmentation time has been optimized to generate the mode fragment length about 200–300 bps. Then following end repair & A-tailing and adapter ligation, SPRI beads size selection was used to ensure the library insert size uniform. After PCR amplification, the final DNA-seq library is cleaned up with SPRI beads and quantified with Qubit and Fragment Analyzer. Then target capture libraries were prepared by following Twist Custom Panel Hybridization Capture of DNA libraries protocol (Twist bioscience, San Francisco, CA), and final libraries were quantified with Qubit and Fragment Analyzer. Final libraries were then loaded on NovaSeq 6000 System with 150 bp paired end sequencing. After the sequencing run, sample demultiplexing is performed to generate FASTQ files for each sample. The average depth of sequencing is 5000–7000×.

Target-capture-based deep sequencing was often used in PT/PDX comparisons. While it can improve detection sensitivity of mutations with low allele fractions, it does not and should not be used to identify mutations that are not detected by whole exome or genome sequencing data. The results from deep sequencing data should also be interpreted with caution. Whereas the detection of a PDX-specific mutation in the PT by deep sequencing suggests pre-existence of the mutation in the parental patient tumor, the absence of the mutation in the PT does not prove the mutation is acquired de novo by the PDX because the PT sample, where the deep sequencing is done, is not the tissue origin of the PDX. Sequencing coverage in this context should be also considered for data interpretation.

Sequencing data preprocessing and quality control

Trim Galore⁵³ (v0.6.7) was applied to raw sequencing data to remove the adapter and poor-quality reads. BWA-MEM⁵⁴ (v0.7.17) and STAR⁵⁵ (v2.7.9a) were used to align DNA and RNA sequencing data to the reference genome. To remove mouse-derived reads in PDXs, we mapped the sequencing data to the human (GRCh38, GENCODE v29) and mouse (GRCm38, GENCODE vM19) reference genomes from GENCODE⁵⁶. Disambiguate⁵⁷ (v1.0) was then employed on the BAM files to remove mouse reads. Notably, for RNA sequencing data, we converted the BAM file of human reads to FASTQ format using Samtools⁵⁸ (v1.14), so that we can merge them with the unmapped reads for gene fusion detection. GATK⁵⁹ best practice workflow was used to deduplicate and recalibrate the aligned BAM files for DNA sequencing data.

PDXs with a mouse contamination rate >50% were excluded from further analyses. Samples with this high contamination rate included one RNA-seq (1853), three WES (1796, 1853, 512) and two WGS PDXs (1796, 1853). We further excluded two WES PDX samples (560-SM, 707) that had low coverage after mouse read removal. NGSCheckMate⁶⁰ (v1.0.0) was applied to ensure matching between PTs and PDXs using both DNA and RNA sequencing data. In addition, RNA-seQC⁶¹ (v2.3.5) and Samtools were applied to RNA and DNA sequencing data to assess mapping quality.

Mutational analysis

MuTect2 (GATK v4.2.3.0), VarScan (v2.4.4), Strelka (v2.9.10) and Pindel (v0.2.5b9)^62,63,64,65 were utilized to identify somatic mutations and indels from the WES data. To filter false positives, DKFZ’s bias filtering (https://github.com/DKFZ-ODCF/DKFZBiasFilter) was used to filter mutations with strand bias or bias toward PCR template strand. We used fpfilter.pl (https://sourceforge.net/projects/varscan/files/scripts) to remove false positives from VarScan output. We excluded mutations in intergenic, intron, or outside capture regions. To remove potential germline variants, we annotated the remaining mutations using population databases (including 1000 genome phase 3, ESP6500, non-TCGA ExAC and gnomAD 3.0)⁶⁶, and only kept variants with MAF < 0.001. We further removed variants that were found in either TCGA panel of normal or the panel of normal generated from this dataset. Then, we filtered out multiallelic mutations and double/triple nucleotide polymorphisms (DNP and TNP), and only included insertions or deletions shorter than 50 bp. Next, we required mutations to have at the minimum tumor depth ≥ 14, normal depth ≥ 8, tumor VAF ≥ 0.05, normal VAF ≤ 0.01 and tumor mutant allele reads ≥ 4. High confidence somatic mutations were identified as those that were called by at least two callers. Notably, for PT-PDX paired sample, if a mutation was detected only in one sample, we rescued the mutation in its paired sample if this mutation was found by any of the tools in the raw outputs. To test if this rescue strategy would miss any mutations, we used bam-readcount⁶⁷ to examine all the 294 PT or PDX private mutations in the matched sample. This supervised approach only found 2 PDX private mutation (<1%) with very low VAFs (0.024 and 0.014) in the matched PT, and found no evidence of PT private mutations in the matched PDX.

Several adaptions were made for tumors without a matched normal. For these tumors, we used MuTect2 tumor-only mode, and the 40 normal samples in our dataset were used as the panel of normal. Pindel was not used because it requires the matched normal. Mutations were considered high confidence if they were detected by all three tools. After rescuing mutations in paired samples, we used SGZ⁶⁸ to predict if a variant was somatic or germline. We excluded the predicted germline or probable germline variants unless the variant is cataloged by the COSMIC database or located in cancer genes⁶⁹, and kept those predicted as somatic or likely somatic. If an identified germline variant was found in one of PT-PDX paired samples, we removed it in both samples.

To test if the multi-caller approach would miss hotspot mutations, we compared mutations downloaded from the MSKCC hotspot database with the mutations that have been filtered out. Of the 3554 somatic mutations that were called by only one caller, only one point mutation (NUP93, E14V) and two indels of CTNNB1 were documented in the MSKCC hotspot database. Thus, the number of missed mutations is negligible. Since CTNNB1 harbors frequent indels in hepatoblastoma, we applied a supervised approach to identify them, see ‘other mutation related analyses’.

To validate the mutation calling, we examined mutations called from WES in RNAseq and low pass WGS sequencing data. Of the 388 mutation sites with coverage ≥10 in either RNAseq or low pass WGS, 356 mutations showed at least one read covering the mutant allele, resulting in a validation rate of 92%.

We used oncoKB-annotator to determine functional consequences of the mutations. In total we identified 46 oncogenic or likely oncogenic mutations, 30 of which were found in PT/PDX pairs. Among the 30, 28 were shared between matched PTs and PDXs.

Tumor purity and ploidy prediction

For samples with a paired normal, tumor purity and ploidy were estimated using Sequenza⁷⁰ (v3.0.0). For samples without a paired normal, tumor purity and ploidy were estimated using PureCN⁷¹ based on CNV segmentation by CNVkit⁷² (v0.9.9). Tumor purity was also estimated from RNAseq data using ESTIMATE⁷³.

Consensus clonality analysis

We applied four methods to characterize mutation clonality. The first method was described by McGranahan et al.⁷⁴. Using the method, we estimated cancer cell fraction (CCF) for each mutation and classified mutations as clonal if their CCF confidence interval overlaps 1, or as subclonal if otherwise. We additionally used PyClone-VI (v0.1.1), CliP (v1.2.1) and Ccube (v1.0)^75,76,77. These methods cluster mutations and then estimate the corresponding CCF of each cluster. Based on the outputs of these methods, we identified clonal and subclonal mutations by the following criteria:

If only one cluster was found, all mutations within the cluster were regarded clonal.
If more than one clusters were found, mutations of the cluster with the highest mean CCF were regarded clonal. For the remaining clusters, if the mean CCF was larger than 0.9, mutations within those clusters were also regarded clonal. This relaxed criterion was used to accommodate uncertainties associated with CCF estimates. The others were regarded subclonal.

The consensus mutational clonality was built on votes from these four approaches. A mutation was identified as consensus clonal if it was identified as such by at least two methods, and similarly for subclonal mutations. Other mutations were treated as ambiguous. Mutations without the needed copy number information to infer clonality were not classified. The clonality flow between PTs and PDXs was plotted by R package ggalluvial⁷⁸ using the consensus calls. We also manually examined VAFs and copy number status of those mutations between each paired PT and PDX to corroborate the evolutionary pattern.

Neoantigen prediction

The 4-digit HLA typing of each sample was predicted using Optitype⁷⁹ (v1.3.5). Based on the non-synonymous mutations and HLA typing, pVACseq⁸⁰ (pVACtools suite v3.0.0) was applied to identify peptides of 8–11 amino acids. The peptide binding affinity to MHC was predicted using NetMHCpan (v4.1), PickPocket (v1.1), SMM (v1.0) and SMMPMBEC (v1.0)^81,82,83,84. Neoantigens were identified as peptides with best MT IC50 ≤ 500 nM. We in total identified 305 neoantigens in PT-PDX paired samples, of which 239 were clonal.

Mutation signature analysis

The known mutational signature matrix (v3.2, GCRh38) was downloaded from COSMIC³². We determined mutation signatures using the R package deconstructSigs⁸⁵ (v1.8.0). For signature analysis, we required a sample to have at least 20 somatic mutations. For visualization, signatures with weight less 0.25 across all samples were excluded. We did not exclude any signatures during deconvolution, and no signature scaling was applied.

Other mutation-related analyses

To calculate microsatellite unstable (MSI) scores, we used MSIsensor⁸⁶ for tumors with a matched control and MSIsensor2⁸⁶ for tumors without a matched control. To identify large in-frame CTNNB1 deletions in hepatoblastoma, we used MANTA⁸⁷ (v1.6.0) to identify the structural breakpoints in exon 3 or 4 of CTNNB1, following a previous study²⁹. By this approach, we further added 4 CTNNB1 deletion back to our mutation calling result. We applied Telseq⁸⁸ (v0.0.1) to both WGS and WES data to estimate the average telomere length of each sample.

Somatic copy number alterations (SCNA)

Copy number (CN) segmentation was calculated from WGS data using CNVkit⁷² (v0.9.9) with default parameters. For tumor-only cases, we generated the copy number reference from 40 normal samples. Absolute copy number was estimated using PureCN⁷¹ (v2.2.0) best practice pipeline with segmentation generated by CNVkit. GISTIC2⁸⁹ was used to call recurrent peaks of all samples with parameter “-conf 0.99 -armpeel 1 -ta 0.3 -td 0.3.”

To compare the copy number profiles between PTs and PDXs, we first divided the genome into 1 Mb window bins using BEDTools⁹⁰ (v2.30.0). After removing segments located in centromeres or telomeres, we calculated the weighted mean of each bin across all samples. Copy number similarity was quantified by Pearson correlation based on the weighted mean matrix. Similarly, for CN similarity of cancer genes, the similarity was calculated using Pearson correlation based on the weighted mean of each gene. The cancer gene list was downloaded from Cancer Gene Census⁶⁹. Here, we only used genes identified as oncogenes or tumor suppressors (listed in Supplementary Data “Cancer genes”).

Focal copy number variations were identified with segment length <50% of the chromosome arm and with copy number ratio >0.3 or <−0.3. Given the potential inconsistency of focal SCNAs breakpoints in PTs and PDXs, if a focal event was amplified or deleted in both PT and PDX samples and the breakpoints of these two segments were within +/−10 kb range, this event was regarded as a shared focal event. Finally, for paired PT or PDX samples, we also rescued focal SCNAs with breakpoints within +/−10 kb range and with copy number ratio >0.1 (for amplification) or <−0.1 (for deletion) in its paired sample. We in total rescued 65 focal SCNAs (22% of all focal events).

The genomic instability (GI) of each chromosome arm was calculated as the proportion of gains (>0.3) or losses (<−0.3). The total CIN of each sample was defined as the mean of arm-level GIs.

Gene expression analysis

Kallisto⁹¹ (v0.46.0) was used to calculate transcript per million (TPM). For unsupervised clustering analysis, UMAP was used based on the top 1500 variable genes identified by median absolute deviation of TPM after removing immune-related, mitochondrial, and ribosomal genes. The immune-related genes were identified as those whose gene expression was positively correlated with ESTIMATE immune score (p < 0.05, Pearson correlation). We removed immune genes because these genes are either low or absent in our PDXs. Including them would bias the clustering of PDXs and PTs. We used the UMAP function implemented in the python package umap-learn (v0.5.1) with parameters “n_neighbors=15, min_dist=0.15”. The similarity between PTs and PDXs was calculated with Spearman’s correlation after excluding immune-related, mitochondrial, and ribosomal genes. HTseq-count⁹² (v0.13.5) was used to generate exon expression of each gene with parameters “-s no -t exon -m union --nonunique all”. RSEM⁹³ was used to estimate raw read counts.

Gene fusion identification

To detect fusions in PDXs, we merged unmapped reads with human-only reads from Disambiguate. The unmapped reads may contain junction spanning reads. Both STAR-Fusion⁹⁴ (v1.10.0) and PRADA2⁴ were applied to detect gene fusions. After removing fusions that were observed in normal samples, i.e., those annotated as “GTEx_recurrent”, we obtained 916 fusions with STAR-Fusion. With PRADA2, we obtained 237 fusions. Fusions identified by both methods were regarded as high confidence. For PT-PDX or PDX-PDX paired samples, we rescued PT or PDX only fusions in its paired sample from the raw fusion pool of STAR-Fusion or PRADA2. We in total rescued 7 fusions (4.3% of all gene fusions). The functional consequence of fusion candidates (in-frame or out-of-frame) was predicted with PRADA2⁴.

To validate the PDX fusion transcript LRPAP1–PDGFRA, we designed a pair of primers located on LRPAP1 and PDGFRA around the predicted fusion site: forward—5’ GCCAAGTATGGTCTGGACGG and reverse—5’ CGGGCAGCACATTCGTAATC, respectively. Product length was 233 bp. Total RNA was isolated from 30 mg of snap frozen PDX tissue using RNeasy mini kit (Qiagen, Cat#74004). Using One-step qRT-PCR kit (Invitrogen, Cat#11732-020) we performed one-step RT-PCR to amplify the predicted fusion gene junction from the same PDX tissue as was used for sequencing: 516_PDX & 892257_PDX. 50 ng of total RNA input was used for RT-PCR reaction. RT-PCR was performed in 50 μl reactions using 0.5 mM dNTPs, 3 mM MgSO4, 0.2 μM each primers and provided mix of SuperScript III RT/Platinum Taq. The RT-PCR reaction was carried out with the following program: 500 C for 30 min, followed by 950 C, 2 min and by 400 cycles of 950 C, 15 s, 550 C, 30 s and 680 C, 1 min. RT-PCR products were analyzed by agarose gel electrophoresis (2%). The result was visualized with SYBR safe DNA gel stain (ThermoFisher Sc, Cat#S33100).

Other analyses

The ssGSEA and GSEA analysis was done using the python package GSEApy (v0.10.4)⁹⁵. MsigDB C2 collection⁹⁶ (c2.all.v2022.1.Hs.symbols.gmt) was used in GSEA analysis to find the significantly differential pathways between Group1 and Group2 patient tumors. To obtain cell proliferation activity, we applied ssGSEA to cell proliferation signatures, including Benporath_Proliferation, REACTOME_Cell_Cycle, and KEGG_Cell_Cycle. We also applied ssGSEA to patient tumors to estimate activity of immune-related gene signatures. The gene signatures were collected from previous studies^97,98. Besides, we used TIMER2.0⁹⁹ to estimate abundance of immune cell infiltration in PT samples.

Statistics and reproducibility

No statistical method was used to predetermine sample size. PDXs were excluded from the analyses when high mouse contamination was detected in the genomic data. The experiments were not randomized.

Data availability

The raw sequencing data generated in this study have been deposited in the European Genome-Phenome Archive database under accession code EGAS00001006710 [https://ega-archive.org/datasets/EGAD00001009863]. The processed genomic data are available at synapse (Synapse ID: syn35811916). PDX clinical information and request forms can be found at the pediatric solid tumor PDX portal [https://pstPDX.streamlit.app]. The processed data generated in this study are provided in the Supplementary Information/Source data file. For the raw sequencing data that are under controlled access on EGA, access information including data access agreement and conditions of data release is provided on the portal site. Data access requests can also be sent to cprit_tpct@uthscsa.edu. Data access will be granted as soon as data requests are approved by the data oversight committee. No restriction is placed on how long the data will be made available for; however, data availability is bound by the scope and duration of the research projects described in the data access agreement. Source data are provided with this paper.

Code availability

The codes used for sequencing data analysis are available on Github at https://github.com/fnhe/PediatricSolidTumorPDX¹⁰⁰.

References

Flores-Toro, J. A. et al. The childhood cancer data initiative: using the power of data to learn from and improve outcomes for every child and young adult with pediatric cancer. J. Clin. Oncol. 41, 4045–4053 (2023).
Article PubMed Central PubMed Google Scholar
Ward, E., DeSantis, C., Robbins, A., Kohler, B. & Jemal, A. Childhood and adolescent cancer statistics, 2014. Ca. Cancer J. Clin. 64, 83–103 (2014).
Article PubMed Google Scholar
Hudson, M. M. et al. Clinical ascertainment of health outcomes among adults treated for childhood cancer. JAMA 309, 2371–2381 (2013).
Article CAS PubMed Central PubMed Google Scholar
Yang, J. et al. PCAT: an integrated portal for genomic and preclinical testing data of pediatric cancer patient-derived xenograft models. Nucleic Acids Res. 49, D1321–D1327 (2021).
Article CAS PubMed Google Scholar
Gao, H. et al. High-throughput screening using patient-derived tumor xenografts to predict clinical trial drug response. Nat. Med. 21, 1318–1325 (2015).
Article CAS PubMed Google Scholar
Hidalgo, M. et al. Patient-derived xenograft models: an emerging platform for translational cancer research. Cancer Discov. 4, 998–1013 (2014).
Article CAS PubMed Central PubMed Google Scholar
Houghton, P. J. et al. The pediatric preclinical testing program: Description of models and early testing results. Pediatr. Blood Cancer 49, 928–940 (2007).
Article PubMed Google Scholar
Murphy, B. et al. Evaluation of alternative in vivo drug screening methodology: a single mouse analysis. Cancer Res. 76, 5798–5809 (2016).
Article CAS PubMed Central PubMed Google Scholar
Kurmasheva, R. T. & Houghton, P. J. Identifying novel therapeutic agents using xenograft models of pediatric cancer. Cancer Chemother. Pharmacol. 78, 221–232 (2016).
Article CAS PubMed Central PubMed Google Scholar
Drapkin, B. J. et al. Genomic and functional fidelity of small cell lung cancer patient-derived xenografts. Cancer Discov. 8, 600–615 (2018).
Article CAS PubMed Central PubMed Google Scholar
Townsend, E. C. et al. The public repository of xenografts enables discovery and randomized phase II-like trials in mice. Cancer Cell 29, 574–586 (2016).
Article CAS PubMed Central PubMed Google Scholar
Izumchenko, E. et al. Patient-derived xenografts effectively capture responses to oncology therapy in a heterogeneous cohort of patients with solid tumors. Ann. Oncol. 28, 2595–2605 (2017).
Article CAS PubMed Central PubMed Google Scholar
Vaubel, R. A. et al. Genomic and phenotypic characterization of a broad panel of patient-derived xenografts reflects the diversity of glioblastoma. Clin. Cancer Res. 26, 1094–1104 (2020).
Article CAS PubMed Google Scholar
Woo, X. Y. et al. Conservation of copy number profiles during engraftment and passaging of patient-derived cancer xenografts. Nat. Genet. 53, 86–99 (2021).
Article CAS PubMed Central PubMed Google Scholar
Hoge, A. C. H. et al. DNA-based copy number analysis confirms genomic evolution of PDX models. npj Precis. Oncol 6, 1–7 (2022).
ADS Google Scholar
Ben-David, U. et al. Patient-derived xenografts undergo mouse-specific tumor evolution. Nat. Genet. 49, 1567–1575 (2017).
Article CAS PubMed Central PubMed Google Scholar
Rokita, J. L. et al. Genomic profiling of childhood tumor patient-derived xenograft models to enable rational clinical trial design. Cell Rep. 29, 1675–1689.e9 (2019).
Article CAS PubMed Central PubMed Google Scholar
Murphy, A. J. et al. Forty-five patient-derived xenografts capture the clinical and biological heterogeneity of Wilms tumor. Nat. Commun. 10, 5806 (2019).
Article ADS CAS PubMed Central PubMed Google Scholar
Braekeveldt, N. et al. Patient-derived xenograft models reveal intratumor heterogeneity and temporal stability in neuroblastoma. Cancer Res. 78, 5958–5969 (2018).
Article CAS PubMed Google Scholar
Nicolle, D. et al. Patient-derived mouse xenografts from pediatric liver cancer predict tumor recurrence and advise clinical management. Hepatology 64, 1121–1135 (2016).
Article CAS PubMed Google Scholar
Stewart, E. et al. Orthotopic patient-derived xenografts of paediatric solid tumours. Nature 549, 96–100 (2017).
Article ADS CAS PubMed Central PubMed Google Scholar
Smith, K. S. et al. Patient-derived orthotopic xenografts of pediatric brain tumors: a St. Jude resource. Acta Neuropathol. 140, 209–225 (2020).
Article CAS PubMed Central PubMed Google Scholar
Brabetz, S. et al. A biobank of patient-derived pediatric brain tumor models. Nat. Med. 24, 1752–1761 (2018).
Article CAS PubMed Google Scholar
Woodfield, S. E. et al. A novel cell line based orthotopic xenograft mouse model that recapitulates human hepatoblastoma. Sci. Rep. 7, 17751 (2017).
Article ADS PubMed Central PubMed Google Scholar
Meyer, W. H. et al. Development and characterization of pediatric osteosarcoma xenografts. Cancer Res. 50, 2781–2785 (1990).
CAS PubMed Google Scholar
Sun, H. et al. Comprehensive characterization of 536 patient-derived xenograft models prioritizes candidates for targeted treatment. Nat. Commun. 12, 5086 (2021).
Article ADS CAS PubMed Central PubMed Google Scholar
Morton, C. L., Papa, R. A., Lock, R. B. & Houghton, P. J. Preclinical chemotherapeutic tumor models of common childhood cancers: solid tumors, acute lymphoblastic leukemia, and disseminated neuroblastoma. Curr. Protoc. Pharmacol. Chapter 14, Unit14.8 (2007).
Wang, L.-J. et al. An ancestry informative marker panel design for individual ancestry estimation of Hispanic population using whole exome sequencing data. BMC Genomics 20, 1007 (2019).
Article PubMed Central PubMed Google Scholar
Hirsch, T. Z. et al. Integrated genomic analysis identifies driver genes and cisplatin-resistant progenitor phenotype in pediatric liver. Cancer Cancer Discov. 11, 2524–2543 (2021).
Article CAS PubMed Google Scholar
Shen, H. et al. Integrated molecular characterization of testicular germ cell tumors. Cell Rep. 23, 3392–3406 (2018).
Article CAS PubMed Central PubMed Google Scholar
Lawrence, M. S. et al. Mutational heterogeneity in cancer and the search for new cancer-associated genes. Nature 499, 214–218 (2013).
Article ADS CAS PubMed Central PubMed Google Scholar
Alexandrov, L. B. et al. The repertoire of mutational signatures in human cancer. Nature 578, 94–101 (2020).
Article ADS CAS PubMed Central PubMed Google Scholar
Koch, A. et al. Childhood hepatoblastomas frequently carry a mutated degradation targeting box of the beta-catenin gene. Cancer Res. 59, 269–273 (1999).
CAS PubMed Google Scholar
Noureen, N. et al. Integrated analysis of telomerase enzymatic activity unravels an association with cancer stemness and proliferation. Nat. Commun. 12, 139 (2021).
Article CAS PubMed Central PubMed Google Scholar
Wang, Z. et al. Molecular mechanism of telomere length dynamics and its prognostic value in pediatric cancers. J. Natl Cancer Inst. 112, 756–764 (2020).
Article PubMed Google Scholar
Jiménez-Sánchez, A. et al. Heterogeneous tumor-immune microenvironments among differentially growing metastases in an ovarian cancer patient. Cell 170, 927.e20–938.e20 (2017).
Article Google Scholar
Guo, H., Callaway, J. B. & Ting, J. P.-Y. Inflammasomes: mechanism of action, role in disease, and therapeutics. Nat. Med. 21, 677–687 (2015).
Article PubMed Central PubMed Google Scholar
Warrington, R. et al. An introduction to immunology and immunopathology. Allergy Asthma Clin. Immunol. 7, S1 (2011).
Mao, X. et al. Crosstalk between cancer-associated fibroblasts and immune cells in the tumor microenvironment: new findings and future perspectives. Mol. Cancer 20, 131 (2021).
Article CAS PubMed Central PubMed Google Scholar
Gadd, S. et al. A Children’s Oncology Group and TARGET initiative exploring the genetic landscape of Wilms tumor. Nat. Genet. 49, 1487–1494 (2017).
Article CAS PubMed Central PubMed Google Scholar
Chagtai, T. et al. Gain of 1q as a prognostic biomarker in Wilms tumors (WTs) treated with preoperative chemotherapy in the International Society of Paediatric Oncology (SIOP) WT 2001 Trial: a SIOP Renal Tumours Biology Consortium Study. J. Clin. Oncol. 34, 3195–3203 (2016).
Article CAS PubMed Central PubMed Google Scholar
Nagae, G. et al. Genetic and epigenetic basis of hepatoblastoma diversity. Nat. Commun. 12, 5423 (2021).
Article ADS CAS PubMed Central PubMed Google Scholar
Litchfield, K. et al. Whole-exome sequencing reveals the mutational spectrum of testicular germ cell tumours. Nat. Commun. 6, 5973 (2015).
Article ADS CAS PubMed Central PubMed Google Scholar
Sheikine, Y. et al. Molecular genetics of testicular germ cell tumors. Am. J. Cancer Res. 2, 153–167 (2012).
CAS PubMed Central PubMed Google Scholar
Martin, J. W., Squire, J. A. & Zielenska, M. The genetics of osteosarcoma. Sarcoma 2012, 627254 (2012).
Article PubMed Central PubMed Google Scholar
Hoadley, K. A. et al. Cell-of-origin patterns dominate the molecular classification of 10,000 tumors from 33 types of cancer. Cell 173, 291.e6–304.e6 (2018).
Article Google Scholar
Cocco, E., Scaltriti, M. & Drilon, A. NTRK fusion-positive cancers and TRK inhibitor therapy. Nat. Rev. Clin. Oncol. 15, 731–747 (2018).
Article CAS PubMed Central PubMed Google Scholar
Hechtman, J. F. NTRK insights: best practices for pathologists. Mod. Pathol. 35, 298–305 (2022).
Article PubMed Google Scholar
Gao, R. et al. Punctuated copy number evolution and clonal stasis in triple-negative breast cancer. Nat. Genet. 48, 1119–1130 (2016).
Article CAS PubMed Central PubMed Google Scholar
Kresse, S. H., Meza-Zepeda, L. A., Machado, I., Llombart-Bosch, A. & Myklebost, O. Preclinical xenograft models of human sarcoma show nonrandom loss of aberrations. Cancer 118, 558–570 (2012).
Article PubMed Google Scholar
McGranahan, N. & Swanton, C. Clonal heterogeneity and tumor evolution: past, present, and the future. Cell 168, 613–628 (2017).
Article CAS Google Scholar
Malaney, P., Nicosia, S. V. & Davé, V. One mouse, one patient paradigm: New avatars of personalized cancer therapy. Cancer Lett. 344, 1–12 (2014).
Article CAS PubMed Google Scholar
Krueger, F., James, F., Ewels, P., Afyounian, E. & Schuster-Boeckler, B. FelixKrueger/TrimGalore: v0.6.7 - DOI via Zenodo. https://doi.org/10.5281/zenodo.5127899 (2021).
Li, H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. Preprint at https://doi.org/10.48550/arXiv.1303.3997 (2013).
Dobin, A. et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21 (2013).
Article CAS PubMed Google Scholar
Harrow, J. et al. GENCODE: the reference human genome annotation for The ENCODE Project. Genome Res. 22, 1760–1774 (2012).
Article CAS PubMed Central PubMed Google Scholar
Ahdesmäki, M. J., Gray, S. R., Johnson, J. H. & Lai, Z. Disambiguate: an open-source application for disambiguating two species in next generation sequencing data from grafted samples. F1000Res 5, 2741 (2016).
Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
Article PubMed Central PubMed Google Scholar
DePristo, M. A. et al. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat. Genet. 43, 491–498 (2011).
Article CAS PubMed Central PubMed Google Scholar
Lee, S. et al. NGSCheckMate: software for validating sample identity in next-generation sequencing studies within and across data types. Nucleic Acids Res. 45, e103 (2017).
Article CAS PubMed Central PubMed Google Scholar
Graubert, A., Aguet, F., Ravi, A., Ardlie, K. G. & Getz, G. RNA-SeQC 2: efficient RNA-seq quality control and quantification for large cohorts. Bioinformatics 37, 3048–3050 (2021).
Article CAS PubMed Central PubMed Google Scholar
Cibulskis, K. et al. Sensitive detection of somatic point mutations in impure and heterogeneous cancer samples. Nat. Biotechnol. 31, 213–219 (2013).
Article CAS PubMed Central PubMed Google Scholar
Koboldt, D. C. et al. VarScan: variant detection in massively parallel sequencing of individual and pooled samples. Bioinformatics 25, 2283–2285 (2009).
Article CAS PubMed Central PubMed Google Scholar
Kim, S. et al. Strelka2: fast and accurate calling of germline and somatic variants. Nat. Methods 15, 591–594 (2018).
Article CAS PubMed Google Scholar
Ye, K., Schulz, M. H., Long, Q., Apweiler, R. & Ning, Z. Pindel: a pattern growth approach to detect break points of large deletions and medium sized insertions from paired-end short reads. Bioinformatics 25, 2865–2871 (2009).
Article CAS PubMed Central PubMed Google Scholar
Wang, K., Li, M. & Hakonarson, H. ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 38, e164 (2010).
Article PubMed Central PubMed Google Scholar
Khanna, A. et al. Bam-readcount - rapid generation of basepair-resolution sequence metrics. J. Open Source Softw. 7, 3722 (2022).
Article ADS Google Scholar
Sun, J. X. et al. A computational approach to distinguish somatic vs. germline origin of genomic alterations from deep sequencing of cancer specimens without a matched normal. PLoS Comput. Biol. 14, e1005965 (2018).
Article PubMed Central PubMed Google Scholar
Sondka, Z. et al. The COSMIC Cancer Gene Census: describing genetic dysfunction across all human cancers. Nat. Rev. Cancer 18, 696 (2018).
Article CAS PubMed Central PubMed Google Scholar
Favero, F. et al. Sequenza: allele-specific copy number and mutation profiles from tumor sequencing data. Ann. Oncol. 26, 64–70 (2015).
Article CAS PubMed Google Scholar
Riester, M. et al. PureCN: copy number calling and SNV classification using targeted short read sequencing. Source Code Biol. Med. 11, 13 (2016).
Article PubMed Central PubMed Google Scholar
Talevich, E., Shain, A. H., Botton, T. & Bastian, B. C. CNVkit: genome-wide copy number detection and visualization from targeted DNA sequencing. PLoS Comput. Biol. 12, e1004873 (2016).
Article ADS PubMed Central PubMed Google Scholar
Yoshihara, K. et al. Inferring tumour purity and stromal and immune cell admixture from expression data. Nat. Commun. 4, 2612 (2013).
McGranahan, N. et al. Clonal status of actionable driver events and the timing of mutational processes in cancer evolution. Sci. Transl. Med. 7, 283ra54 (2015).
Article PubMed Central PubMed Google Scholar
Gillis, S. & Roth, A. PyClone-VI: scalable inference of clonal population structures using whole genome data. BMC Bioinformatics 21, 571 (2020).
Article PubMed Central PubMed Google Scholar
Yuan, K., Macintyre, G., Liu, W., Group, P.-11 working & Markowetz, F. Ccube: a fast and robust method for estimating cancer cell fractions. Preprint at https://doi.org/10.1101/484402 (2018).
Jiang, Y. et al. CliP: subclonal architecture reconstruction of cancer cells in DNA sequencing data using a penalized likelihood model. Preprint at https://doi.org/10.1101/2021.03.31.437383 (2021).
Brunson, J. C. ggalluvial: Layered grammar for alluvial plots. J. Open Source Softw. 5, 2017 (2020).
Article ADS PubMed Central PubMed Google Scholar
Szolek, A. et al. OptiType: precision HLA typing from next-generation sequencing data. Bioinformatics 30, 3310–3316 (2014).
Article CAS PubMed Central PubMed Google Scholar
Hundal, J. et al. pVACtools: a computational toolkit to identify and visualize cancer neoantigens. Cancer Immunol. Res. 8, 409–420 (2020).
Article CAS PubMed Central PubMed Google Scholar
Jurtz, V. et al. NetMHCpan-4.0: improved peptide-MHC class I interaction predictions integrating eluted ligand and peptide binding affinity data. J. Immunol. 199, 3360–3368 (2017).
Article CAS PubMed Google Scholar
Zhang, H., Lund, O. & Nielsen, M. The PickPocket method for predicting binding specificities for receptors based on receptor pocket similarities: application to MHC-peptide binding. Bioinformatics 25, 1293–1299 (2009).
Article CAS PubMed Central PubMed Google Scholar
Nielsen, M., Lundegaard, C. & Lund, O. Prediction of MHC class II binding affinity using SMM-align, a novel stabilization matrix alignment method. BMC Bioinformatics 8, 238 (2007).
Article PubMed Central PubMed Google Scholar
Kim, Y., Sidney, J., Pinilla, C., Sette, A. & Peters, B. Derivation of an amino acid similarity matrix for peptide: MHC binding and its application as a Bayesian prior. BMC Bioinformatics 10, 394 (2009).
Article PubMed Central PubMed Google Scholar
Rosenthal, R., McGranahan, N., Herrero, J., Taylor, B. S. & Swanton, C. deconstructSigs: delineating mutational processes in single tumors distinguishes DNA repair deficiencies and patterns of carcinoma evolution. Genome Biol. 17, 31 (2016).
Niu, B. et al. MSIsensor: microsatellite instability detection using paired tumor-normal sequence data. Bioinformatics 30, 1015–1016 (2014).
Article CAS PubMed Google Scholar
Chen, X. et al. Manta: rapid detection of structural variants and indels for germline and cancer sequencing applications. Bioinformatics 32, 1220–1222 (2016).
Article CAS PubMed Google Scholar
Ding, Z. et al. Estimating telomere length from whole genome sequence data. Nucleic Acids Res. 42, e75 (2014).
Article CAS PubMed Central PubMed Google Scholar
Mermel, C. H. et al. GISTIC2.0 facilitates sensitive and confident localization of the targets of focal somatic copy-number alteration in human cancers. Genome Biol. 12, R41 (2011).
Article PubMed Central PubMed Google Scholar
Quinlan, A. R. & Hall, I. M. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26, 841–842 (2010).
Article CAS PubMed Central PubMed Google Scholar
Bray, N. L., Pimentel, H., Melsted, P. & Pachter, L. Near-optimal probabilistic RNA-seq quantification. Nat. Biotechnol. 34, 525–527 (2016).
Article CAS PubMed Google Scholar
Anders, S., Pyl, P. T. & Huber, W. HTSeq—a Python framework to work with high-throughput sequencing data. Bioinformatics 31, 166–169 (2015).
Article CAS PubMed Google Scholar
Li, B. & Dewey, C. N. RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinformatics. 12, 323 (2011).
Article CAS PubMed Central PubMed Google Scholar
Haas, B. J. et al. Accuracy assessment of fusion transcript detection via read-mapping and de novo fusion transcript assembly-based methods. Genome Biol. 20, 213 (2019).
Article PubMed Central PubMed Google Scholar
Fang, Z., Liu, X. & Peltz, G. GSEApy: a comprehensive package for performing gene set enrichment analysis in Python. Bioinformatics 39, btac757 (2023).
Liberzon, A. et al. Molecular signatures database (MSigDB) 3.0. Bioinformatics 27, 1739–1740 (2011).
Article CAS PubMed Central PubMed Google Scholar
Ju, M. et al. Pan-cancer analysis of NLRP3 inflammasome with potential implications in prognosis and immunotherapy in human cancer. Brief. Bioinform. 22, bbaa345 (2021).
Article PubMed Google Scholar
Thompson, J. C. et al. Gene signature of antigen processing and presentation machinery predicts response to checkpoint blockade in non-small cell lung cancer (NSCLC) and melanoma. J. Immunother. Cancer 8, e000974 (2020).
Article PubMed Central PubMed Google Scholar
Li, T. et al. TIMER2.0 for analysis of tumor-infiltrating immune cells. Nucleic Acids Res. 48, W509–W514 (2020).
Article CAS PubMed Central PubMed Google Scholar
He, F. & Zheng, S. Genomic profiling of subcutaneous patient-derived xenografts reveals immune constraints on tumor evolution in childhood solid cancer. github https://doi.org/10.5281/zenodo.8411315 (2023).

Download references

Acknowledgements

We thank Kathryn Bondra, Fuyang Li, Vanessa DelPozo, Samson Ghilu, and Edward Favors for technical assistance with sample preparation and animal work. We thank UT Health Information Management Systems for IT support. This work was supported by CPRIT (RP160716, RP220599 to P.J.H., RP180319 to S.X.S., RR170055 to Z.S.). Z.L. is supported by NIH NCI R50CA265339. The Sequencing data used in the study were generated at The Greehey Children’s Cancer Research Institute (GCCRI) Genome Sequencing Facility (GSF). GSF is a Mays Cancer Center Next Generation Sequencing Shared Resource (NGSSR) and is supported by NIH-NCI P30 CA054174, NIH Shared Instrument grants S10OD021805 and S10OD030311 (Z.L.), and CPRIT Core Facility Awards RP160732 and RP220662 (Y.C.).

Author information

These authors contributed equally: Funan He, Abhik M. Bandyopadhyay.

Authors and Affiliations

Greehey Children’s Cancer Research Institute, University of Texas Health Science Center, San Antonio, TX, USA
Funan He, Abhik M. Bandyopadhyay, Anna Rogojina, Trevor Holland, Dawn Garcia, Korri Weldon, Luz-Nereida Perez Prado, Allison C. Grimes, Zhao Lai, Yi Zou, Dias Kurmashev, Yidong Chen, Xiaojing Wang, Gail E. Tomlinson, Peter J. Houghton, Raushan T. Kurmasheva & Siyuan Zheng
Department of Population Health Sciences, University of Texas Health Science Center, San Antonio, TX, USA
Funan He, Yidong Chen, Xiaojing Wang & Siyuan Zheng
Department of Pediatrics, Division of Hematology/Oncology, University of Texas Southwestern Medical Center, Dallas, TX, USA
Laura J. Klesse, Erin Butler, Taylor Hartshorne, Lin Xu & Stephen X. Skapek
Harold C. Simmons Comprehensive Cancer Center, University of Texas Southwestern Medical Center, Dallas, TX, USA
Laura J. Klesse, Erin Butler, Lin Xu, Yang Xie & Stephen X. Skapek
Gill Center for Cancer and Blood Disorders, Children’s Health Children’s Medical Center, Dallas, TX, USA
Laura J. Klesse, Erin Butler & Stephen X. Skapek
Department of Biochemistry and Structural Biology, University of Texas Health Science Center, San Antonio, TX, USA
Sang H. Chun
Department of Pediatrics, University of Texas Health Science Center, San Antonio, TX, USA
Anne-Marie Langevin, Allison C. Grimes, Aaron Sugalski, Shafqat Shah, Chatchawin Assanasen & Gail E. Tomlinson
Mays Cancer Center, University of Texas Health Science Center, San Antonio, TX, USA
Anne-Marie Langevin, Chatchawin Assanasen, Zhao Lai, Yidong Chen, Xiaojing Wang, Gail E. Tomlinson, Peter J. Houghton, Raushan T. Kurmasheva & Siyuan Zheng
Department of Molecular Medicine, University of Texas Health Science Center, San Antonio, TX, USA
Zhao Lai, Peter J. Houghton & Raushan T. Kurmasheva
Quantitative Biomedical Research Center, Peter O’Donnell Jr. School of Public Health, University of Texas Southwestern Medical Center, Dallas, TX, USA
Lin Xu & Yang Xie
Department of Bioinformatics, University of Texas Southwestern Medical Center, Dallas, TX, USA
Yang Xie

Authors

Funan He
View author publications
You can also search for this author in PubMed Google Scholar
Abhik M. Bandyopadhyay
View author publications
You can also search for this author in PubMed Google Scholar
Laura J. Klesse
View author publications
You can also search for this author in PubMed Google Scholar
Anna Rogojina
View author publications
You can also search for this author in PubMed Google Scholar
Sang H. Chun
View author publications
You can also search for this author in PubMed Google Scholar
Erin Butler
View author publications
You can also search for this author in PubMed Google Scholar
Taylor Hartshorne
View author publications
You can also search for this author in PubMed Google Scholar
Trevor Holland
View author publications
You can also search for this author in PubMed Google Scholar
Dawn Garcia
View author publications
You can also search for this author in PubMed Google Scholar
Korri Weldon
View author publications
You can also search for this author in PubMed Google Scholar
Luz-Nereida Perez Prado
View author publications
You can also search for this author in PubMed Google Scholar
Anne-Marie Langevin
View author publications
You can also search for this author in PubMed Google Scholar
Allison C. Grimes
View author publications
You can also search for this author in PubMed Google Scholar
Aaron Sugalski
View author publications
You can also search for this author in PubMed Google Scholar
Shafqat Shah
View author publications
You can also search for this author in PubMed Google Scholar
Chatchawin Assanasen
View author publications
You can also search for this author in PubMed Google Scholar
Zhao Lai
View author publications
You can also search for this author in PubMed Google Scholar
Yi Zou
View author publications
You can also search for this author in PubMed Google Scholar
Dias Kurmashev
View author publications
You can also search for this author in PubMed Google Scholar
Lin Xu
View author publications
You can also search for this author in PubMed Google Scholar
Yang Xie
View author publications
You can also search for this author in PubMed Google Scholar
Yidong Chen
View author publications
You can also search for this author in PubMed Google Scholar
Xiaojing Wang
View author publications
You can also search for this author in PubMed Google Scholar
Gail E. Tomlinson
View author publications
You can also search for this author in PubMed Google Scholar
Stephen X. Skapek
View author publications
You can also search for this author in PubMed Google Scholar
Peter J. Houghton
View author publications
You can also search for this author in PubMed Google Scholar
Raushan T. Kurmasheva
View author publications
You can also search for this author in PubMed Google Scholar
Siyuan Zheng
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Project supervision and funding acquisition: P.J.H., S.Z., R.K., S.X.S., G.E.T. and X.Y. PDX production and sample preparation: A.M.B., R.K., A.R., T.H. Fusion validation: A.R. Genomic data analysis and portal development: F.H., S.Z., Y.C., L.X., X.W., D.K., S.H.C. Patient tumor and clinical data: L.J.K., G.T., L.-N.P.P., E.B., T.H., A.-M.L, A.G., A.S., S.S., C.A. Genomic data production: Z.L., Y.Z., D.G., K.W. Manuscript writing, with input from all authors: S.Z., F.H., P.J.H., R.K.

Corresponding authors

Correspondence to Raushan T. Kurmasheva or Siyuan Zheng.

Ethics declarations

Competing interests

L.J.K. consults for Alexion Pharmaceuticals without a fee. The other authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Jo Lynne Rokita and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Description of Additional Supplementary Files

Supplementary Dataset 1

Supplementary Dataset 2

Supplementary Dataset 3

Supplementary Dataset 4

Supplementary Dataset 5

Supplementary Dataset 6

Supplementary Dataset 7

Supplementary Dataset 8

Supplementary Dataset 9

Supplementary Dataset 10

Source data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

He, F., Bandyopadhyay, A.M., Klesse, L.J. et al. Genomic profiling of subcutaneous patient-derived xenografts reveals immune constraints on tumor evolution in childhood solid cancer. Nat Commun 14, 7600 (2023). https://doi.org/10.1038/s41467-023-43373-1

Download citation

Received: 29 March 2023
Accepted: 07 November 2023
Published: 22 November 2023
DOI: https://doi.org/10.1038/s41467-023-43373-1

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.