Integrated genomic analysis reveals mutated ELF3 as a potential gallbladder cancer vaccine candidate

Pandey, Akhilesh; Stawiski, Eric W.; Durinck, Steffen; Gowda, Harsha; Goldstein, Leonard D.; Barbhuiya, Mustafa A.; Schröder, Markus S.; Sreenivasamurthy, Sreelakshmi K.; Kim, Sun-Whe; Phalke, Sameer; Suryamohan, Kushal; Lee, Kayla; Chakraborty, Papia; Kode, Vasumathi; Shi, Xiaoshan; Chatterjee, Aditi; Datta, Keshava; Khan, Aafaque A.; Subbannayya, Tejaswini; Wang, Jing; Chaudhuri, Subhra; Gupta, Sanjiv; Shrivastav, Braj Raj; Jaiswal, Bijay S.; Poojary, Satish S.; Bhunia, Shushruta; Garcia, Patricia; Bizama, Carolina; Rosa, Lorena; Kwon, Wooil; Kim, Hongbeom; Han, Youngmin; Yadav, Thakur Deen; Ramprasad, Vedam L.; Chaudhuri, Amitabha; Modrusan, Zora; Roa, Juan Carlos; Tiwari, Pramod Kumar; Jang, Jin-Young; Seshagiri, Somasekar

doi:10.1038/s41467-020-17880-4

Download PDF

Article
Open access
Published: 24 August 2020

Integrated genomic analysis reveals mutated ELF3 as a potential gallbladder cancer vaccine candidate

Akhilesh Pandey ORCID: orcid.org/0000-0001-9943-6127^1,2,3^na1^na2,
Eric W. Stawiski^4,5,6^na1^na2,
Steffen Durinck^4,5^na1,
Harsha Gowda^1,7^na1,
Leonard D. Goldstein^4,5,
Mustafa A. Barbhuiya^1,8,
Markus S. Schröder^5,9,
Sreelakshmi K. Sreenivasamurthy ORCID: orcid.org/0000-0002-8891-3714¹,
Sun-Whe Kim¹⁰,
Sameer Phalke¹¹,
Kushal Suryamohan⁶,
Kayla Lee⁶,
Papia Chakraborty⁶,
Vasumathi Kode⁶,
Xiaoshan Shi⁶,
Aditi Chatterjee ORCID: orcid.org/0000-0001-8474-5824¹,
Keshava Datta¹,
Aafaque A. Khan¹,
Tejaswini Subbannayya¹,
Jing Wang⁶,
Subhra Chaudhuri⁵,
Sanjiv Gupta¹²,
Braj Raj Shrivastav¹³,
Bijay S. Jaiswal⁵,
Satish S. Poojary¹⁴,
Shushruta Bhunia¹⁴,
Patricia Garcia¹⁵,
Carolina Bizama¹⁵,
Lorena Rosa¹⁶,
Wooil Kwon¹⁰,
Hongbeom Kim ORCID: orcid.org/0000-0002-1595-0135¹⁰,
Youngmin Han ORCID: orcid.org/0000-0003-0456-7824¹⁰,
Thakur Deen Yadav¹⁷,
Vedam L. Ramprasad¹¹,
Amitabha Chaudhuri⁶,
Zora Modrusan⁵,
Juan Carlos Roa¹⁵,
Pramod Kumar Tiwari¹⁴^na2,
Jin-Young Jang ORCID: orcid.org/0000-0003-3312-0503¹⁰^na2 &
…
Somasekar Seshagiri ORCID: orcid.org/0000-0003-4272-6443^5,18^na2

Nature Communications volume 11, Article number: 4225 (2020) Cite this article

9861 Accesses
38 Citations
178 Altmetric
Metrics details

Subjects

Abstract

Gallbladder cancer (GBC) is an aggressive gastrointestinal malignancy with no approved targeted therapy. Here, we analyze exomes (n = 160), transcriptomes (n = 115), and low pass whole genomes (n = 146) from 167 gallbladder cancers (GBCs) from patients in Korea, India and Chile. In addition, we also sequence samples from 39 GBC high-risk patients and detect evidence of early cancer-related genomic lesions. Among the several significantly mutated genes not previously linked to GBC are ETS domain genes ELF3 and EHF, CTNNB1, APC, NSD1, KAT8, STK11 and NFE2L2. A majority of ELF3 alterations are frame-shift mutations that result in several cancer-specific neoantigens that activate T-cells indicating that they are cancer vaccine candidates. In addition, we identify recurrent alterations in KEAP1/NFE2L2 and WNT pathway in GBC. Taken together, these define multiple targetable therapeutic interventions opportunities for GBC treatment and management.

A single-cell atlas enables mapping of homeostatic cellular shifts in the adult human breast

Article Open access 28 March 2024

Austin D. Reed, Sara Pensa, … Walid T. Khaled

Feasibility of functional precision medicine for guiding treatment of relapsed or refractory pediatric cancers

Article Open access 11 April 2024

Arlet M. Acanda De La Rocha, Noah E. Berlow, … Diana J. Azzam

Deep whole-genome analysis of 494 hepatocellular carcinomas

Article 14 February 2024

Lei Chen, Chong Zhang, … Hongyang Wang

Introduction

The gallbladder is an important part of the biliary tract system. Gallbladder cancer (GBC) is the most common of the biliary tract cancers¹. GBC is a highly fatal malignancy with median survival of <1 year^2,3,4. This is primarily due to non-specificity of symptoms during initial stages of the disease with patients generally presenting at an advanced stage of the cancer. GBC often occurs in the setting of gallstones (cholelithiasis) or chronic inflammation (cholecystitis) and is typically detected incidentally in patients undergoing treatment for these conditions¹. The diagnosis is also confounded by the anatomic position of the gallbladder and the non-specificity of the symptoms during the initial stages of the disease¹.

GBC is the 20th most common cancer worldwide with an estimated 178,100 new cases diagnosed annually⁴ (http://globocan.iarc.fr). In 2020, in the United States, an estimated 11,980 new cases and 4,090 deaths due to gallbladder cancer is expected (https://www.cancer.net/cancer-types/gallbladder-cancer). In contrast to the general population in the United States where GBC incidence is low, it is a more common gastrointestinal malignancy in both Southwestern Native Americans and in Mexican Americans⁴. Incidence of GBC is highest in South American countries that include Chile, Bolivia, and Ecuador and Asian countries such as Korea, India, Pakistan and Japan. Interestingly, GBC incidence is lowest in Africa^1,4 (http://globocan.iarc.fr). In addition to race, GBC incidence increases with age and women are affected two to six times more often than men⁴. Other recognized risk factors for GBC development include the occurrence of gallstones and Salmonella infection⁴.

Previous molecular studies on GBC have focused on the assessment of mutations in few candidate genes such as TP53 and KRAS¹. Recent exome sequencing of nine GBC samples of Caucasian origin identified TP53 as a significantly mutated GBC gene⁵. Another study examined exomes from 32 GBC samples of Chinese origin and identified TP53, KRAS, and ERBB3 as significantly mutated GBC genes⁶. Further, exome sequencing and analysis of 28 GBC patients of Japanese origin and targeted sequencing of 51 samples of Chinese origin identified alterations in ERBB family members⁷. A recent follow-up study⁶ reported exome data from an additional 125 additional Chinese patients linking frequent ERBB2/3 mutation and upregulation of PD-L1 in GBC⁸. A recent study reported exome sequencing of 16 GBC samples of Japanese origin and targeted sequencing of 30 GBC samples that included 26 Italian and 4 Japanese patients⁹.

Given that GBC incidence shows strong geographic variation, in this study we perform a comprehensive analysis of 167 GBCs that includes patients from three geographically different regions namely, South Korea (n = 94), India (n = 64) and Chile (n = 9). We analyze 167 tumors from three geographically distinct parts of the world and identify ELF3 to be a significantly mutated gene. Given our large sample size, we find several previously unreported significantly mutated GBC genes. The altered genes include TP53, ELF3, ERBB3, CTNNB1, ARID2, CDNK2A, STK11, SMAD4, ARID1A, EHF, KRAS, NFE2L2, PIK3CA, and PSIP1. We further identify a class of mismatch-repair-deficient gallbladder cancers with elevated mutation rates which are likely candidates for immunotherapy. The ELF3 mutations are predominantly frame-shift alterations that result in several neoantigens that are able to activate CD8+ T-cells, confirming them as potential cancer vaccine candidates.

Results

Genomic analysis of GBC samples

We have performed a comprehensive genomic analysis of 213 samples that included 167 gallbladder (GBC) primary tumors, 7 GBC cell lines, 23 gallbladder tissue from cholecystitis cases (cholecystitis), 14 gallbladder tissue from gall stone cases (stone), and 2 gallbladder polyps (polyp). Overall, we have obtained whole exome (WES) from 206 samples (160 GBC, 23 cholecystitis, 14 stone, 2 polyps and 7 cell lines), RNA-seq from 120 samples (115 GBC and 5 cell lines) and low pass (<5x) whole-genome sequence (WGS) data from 184 samples (146 GBC, 15 cholecystitis, 14 stone, 2 polyps and 7 cell lines; Supplementary Table 1, Supplementary Data 1, and Supplementary Fig. 1).

For 98 of the 167 GBC cases in this study we have obtained exome, RNA-seq and low pass WGS data, making this a comprehensive GBC genomic data set (Supplementary Table 1, Supplementary Data 1 and Supplementary Fig. 1).

GBC mutational profile

We obtained WES data on 160 GBC (152 patient-matched paired tumor/normal GBC and 8 unpaired GBC) from India (60), Korea (91), and Chile (9). Also, we obtained WES data for 7 GBC cell lines. In addition, we surveyed pre-cancerous gallbladder tissue samples from 23 cholecystitis cases, 14 gallbladder stones, and 2 gallbladder polyps by WES (Supplementary Table 2 and Supplementary Data 2–8). Samples were sequenced at an average coverage of 93x and tumor/normal relationships were confirmed using exome sequence data (Supplementary Data 2 and Methods). Principal component analysis (PCA) using the germline variants from the matched normal revealed that the samples clustered into groups based on their population of origin (Fig. 1a and Supplementary Fig. 2). The six-cell lines derived from patients of Japanese ancestry and one from a Korean patient clustered with the Korean samples consistent with a north east-Asian genetic profile (Fig. 1a and Supplementary Fig. 2). Amongst the patient-matched paired tumor-normal GBC samples, a total of 21,439 protein-altering somatic mutations were identified, including 17,475 missense, 1215 nonsense, 26 stop loss, 22 start lost, 419 essential splice-site mutations, and 2282 indels (Supplementary Table 2 and Supplementary Fig. 3). A majority of the mutations (92%; 19,757/21,439) were novel and were not reported in COSMIC.v70¹⁰ (Supplementary Table 2). Using RNA-seq data, we confirmed the expression of 8,706 protein-altering somatic variants identified by WES (Methods, Supplementary Table 2, Supplementary Data 3 and 4).

**Fig. 1: Genetic variation in GBC and non-GBC samples.**

Both cholecystitis and gallstones (cholelithiasis) are believed to lead to precancerous lesions by inducing dysplastic changes in the pathogenesis of GBC¹. We sequenced tissue from areas surrounding the inflamed sites in the gallbladder tissue from patients with chronic cholecystitis (n = 23), or gallbladder stones (n = 14) and gallbladder polyps (n = 2) together designated ‘GB-other’. We found significantly fewer somatic protein-altering mutations with a median of 1 (range 0–90) for cholecystitis and a median of 0.5 (range 0–2) for GB-other compared to a median of 65 (range 0–4,867) in GBCs (Fig. 1b and Supplementary Data 5). We next looked for cancer-associated somatic alterations using a combination of exome and low pass WGS data and found mutations or copy loss in 30% (7/23) of cholecystitis samples (Fig. 1c). The alterations included PIK3R2, CHD1, TP53, and CDKN2A. Consistent with this, we found alterations in TP53, CDKN2A, PIK3R2, and CHD1 in GBC (Supplementary Data 3 and Supplementary Fig. 4). Previously, TP53 and CDKN2A have been implicated as GBC drivers^5,6,7. Though not previously implicated in GBC, CHD1 is a known gastric cancer driver and PIK3R2 is frequently mutated in endometrial cancers^11,12,13. In contrast to cholecystitis samples, somatic alterations in the ‘GBC-other’ group were not detected.

We found the median mutation rate in GBC to be between that of hepatocellular carcinoma (HCC) and colorectal cancers (COADREAD; Fig. 1d). However, we found three outlier samples with very high mutation burden. We tested these samples for microsatellite instability (MSI) using the MANTIS¹⁴ (see Methods) and found that they were positive for MSI. Consistent with this, all the three samples had a high mutation burden (>1000 protein-altering mutations) (Fig. 1e). We confirmed that these samples carried deleterious mutations in known mismatch repair genes (Fig. 1e). Interestingly, the outlier mutational status of the MSI positive samples was similar to that of mismatch repair-deficient colorectal cancer (COADREAD), endometrial carcinoma (UCEC), and stomach adenocarcinoma (STAD) (Fig. 1e).

To understand the mutational processes that contribute to the development of GBC we identified and cataloged the possible 96 base substitution types taking into account the possible eight base pair somatic changes (C > T, T > C, C > A, C > G, T > A, and T > G) and the flanking 5’ and 3’ base context, as previously described¹⁵. The substitution frequencies between GBC samples from India, Chile and Korea were similar and was consistent with the frequency pattern observed in a set of 29 GBCs from Japan⁷ (Supplementary Fig. 5). Using non-negative matrix factorization, as described previously (see Methods), we identified six prominent mutational signatures (Supplementary Figs. 5 and 6a, b). The strongest correlates included age (C > T mutations at NpCpG sites), an APOBEC signature (dominated by C > T and C > G mutations at TpCpN sites), and an MSI signature. These findings are consistent with previously identified processes in gallbladder and ampullary cancers^7,16,17. When we compared the mutation profile to 8 other cancer types, we found it clustered most closely to head and neck squamous cell carcinoma (Supplementary Fig. 7).

Mutated genes and their significance in GBC

Exome sequencing identified protein-altering somatic mutations in 10,224 genes and of these 4750 (46%) were mutated in at least two patients. We found recurrent mutations in 102 chromatin-modifying genes, including NSD1, ARID1A, SETD2, and PBRM1, 187 protein kinases, including TTN, ERBB2, ERBB3, STK11, and LATS1, and 73 G-protein coupled receptors including CELSR1/2/3, CHRM3, and GRM1. Using Polyphen¹⁸, we found that 55% (11,716/21,439) of protein-altering mutations were predicted to be deleterious or high impact mutations. In contrast, only 11% (233,524/2,122,090) of the protein-altering germline variants from this study were predicted to have a functional impact.

We assessed the mutated genes for their significance using a q-score metric¹⁹. Our analysis identified 25 significantly mutated GBC genes that included CTNNB1, ELF3, TP53, ERBB2, ARID2, ERBB3, STK11, CDKN2A, SMAD4, ARID1A, KRAS, EHF, PIK3CA, BRAF, ACVR2A, PSIP1, NFE2L2, CHRM3, ZNF107, SMARCA4, APC, NF1, KAT8, MAP2K4 and HIST1H2AG (Fig. 2a; q-score ≥ 1.1; FDR ≤ 8%; Supplementary Table 2 and Supplementary Fig. 8a–d). This list includes well-known oncogenes, CTNNB1, ERBB2, ERBB3, KRAS, PIK3CA, BRAF, and NFE2L2, tumor suppressors, TP53, ARID2, STK11, CDKN2A, SMAD4, SMARCA4, ARID1A, APC, NF1, and MAP2K4, and less well-established cancer-associated genes such as ELF3, EHF, ACVR2A, PSIP1, CHRM3, HIST1H2AG, KAT8, and ZNF107. Previous studies on GBC reported TP53, KRAS and ERBB3 as significantly mutated GBC genes (SMG)⁶, though low-frequency mutations were observed in other SMG GBC genes reported in this study^6,7,20.

**Fig. 2: Significantly mutated genes in GBC.**

The ELF3, ETS-domain transcription factor, identified as significantly mutated gallbladder cancer gene was altered in 21% of samples (34/160) (Fig. 2b). Previously, ELF3 was reported as a frequently mutated gene in biliary tract (3–9.5%) and ampullary carcinomas (15%)^6,7,9,16,17. ELF3 is also known to be mutated in cervical adenocarcinomas (13%)²¹, bladder cancers (8%)²², gastric cancers (4%)²³, and colorectal cancers (3%)^24,25. In addition to ELF3, EHF²⁶, another member of the ETS transcription factor subfamily, was also found significantly mutated in 4% of the GBC samples (7/160) (Fig. 2c).

A majority of ELF3 mutations we observed were frame-shift, stop gained and essential splice-site mutations (73% 27/37 mutations) and they clustered in the C-terminal ETS-domain (Fig. 2b). In addition, two essential splice-site mutations that result in an RNA transcript that codes for a truncated ELF3 (Fig. 2d) were found. Interestingly, ELF3 mutations were more frequent in Korean (31% 28/91) and Chilean patients (22% 2/9) compared to GBC patients from India (7% 4/60; p value 0.0003 India vs non-Indian; Fisher’s exact test).

We observed that the ELF3 mutations co-occurred significantly with TP53 mutations (p value = 0.01; Fig. 2e). Patients carrying both ELF3 and TP53 mutations had a worse overall survival that showed a trend towards significance (p value 0.0547; Fig. 2f) as opposed to no difference in survival in individuals with mutations in just either one of the genes (Supplementary Fig. 8d).

In addition to the WNT pathway genes CTNNB1 and APC, the significantly mutated genes included the chromatin-modifying gene KAT8, tumor suppressor STK11, oncogene NFE2L2, and ZNF107 that codes for a zinc finger protein. In addition, ACVR2A, a serine-threonine kinase and a member of the TGF-beta superfamily was also found to be mutated. The chromatin-associated protein gene, PSIP1, also showed a distinct mutation pattern with 5 of the 6 mutations (4% of samples) showing high impact frame-shift mutations (n = 4) or a premature stop codon (n = 1). These mutations preserved the PSIP1 H3K36me3 interacting ‘Pro-Trp-Trp-Pro’ (PWWP) domain while leading to the loss of the region coding for the C-terminal ‘integrase binding’ (IBD) domain (Fig. 2f). Interestingly, p52 PISP1 is a well-characterized isoform that lacks the IBD and has been associated with transcriptional activation and alternative splicing²⁷ and its relevance in GBC requires further investigation. Another significantly mutated gene, CHRM3 encodes a GPCR muscarinic cholinergic receptor. Interestingly, CHRM3 was recently shown to mediate gallbladder contraction through a voltage-gated Ca2+ channel²⁸. However, the exact relevance of the CHRM3 mutations in GBC needs further studies.

Recurrence of somatic mutations is an indication of its cancer relevance²⁹. We examined our data for recurrent hotspot mutations and found 89 hotspot mutations across 73 genes (see Methods; Supplementary Data 8). Included in the genes with hotspots were 11 significantly mutated GBC genes. Genes with most hotspots included TP53 (11 in 53 samples), ERBB2 (3 in 18 samples), CTNNB1 (3 in 16 samples), and ELF3 (3 in 9 samples). The ELF3 hotspot mutations included frame-shift mutations at codons 55 (2 samples), 320 (5 samples), and 324 (2 samples, including 1 missense).

To further understand specific mutation patterns, we performed a meta-analysis by comparing all the somatic mutations identified against a list of high confidence hotspot mutations identified in a comprehensive pan-cancer analysis³⁰. We identified 65 meta-hotspot mutations across 22 genes (Supplementary Data 9). The most common genes identified in GBC were TP53 (22 in 40 samples), ERBB2 (5 in 19 samples), and CTNNB1 (5 in 18 samples, primarily concentrated around codons 32–45). Amongst the most common mutations across all cancers that also occurred in GBCs were KRAS G12/G13 (5 samples), PIK3CA H1047/E545/E542 (6 samples) and NRAS Q61 (1 sample). Other mutations of interest included ERBB3 (2 meta-hotspots in 7 samples) carrying activating mutations at V104 and D297³¹. Interestingly, the four BRAF meta-hotspots, G466A (1), G469V (1), D594G (2), G596R (1), observed in 5 samples did not involve the canonical V600 codon. The NFE2L2 (5 meta-hotspots in 5 samples) mutations were primarily concentrated around amino acid positions 29–34. We also found samples containing CDK4 R24C, BCL2L12 R18W, RAC1 A178E, and XPO1 E571K mutations, previously reported in other tumor types³⁰.

Splicing, expression and copy number alterations in GBC

We performed t-SNE analysis of RNA-seq data from 115 GBC and identified two main clusters designated as cluster A and B (Supplementary Fig. 9). Cluster A is characterized by high expression of mitochondrial genes and also showed high levels of apoptosis-related gene such as BAX, BAD, FASTK, and NOXA1. Further, expression of PTEN, SMAD4, NF1, and NF2 was low in cluster A compared to samples in cluster B. Interestingly, samples in cluster B had marked upregulation of oncogenes such as BRAF, KRAS, and CBL. Several histone encoding genes were also upregulated in cluster B. We also found upregulation of NPAT and GONL4 in cluster B. NPAT is a key co-activator of histone transcription and GON4L is involved in biogenesis of the histone locus bodies and a known NPAT binding partner³². Additionally, cell cycle regulators such as SPDYE1 and SPDYE2 were also highly expressed in cluster B. Also upregulated in cluster B were transcripts for TP53 modulators ATM and MDM4.

We performed de novo prediction of splice variants from 115 GBCs and 4 cholecystitis samples to identify tumor-specific splicing events. We considered 835 candidate cancer-associated genes and filtered out splice variants expressed in a dataset of 9155 normal samples³³. We identified 62 candidate protein-altering splice variants in 24 samples. They included recurrent variants in ELF3 (n = 2) (Fig. 2d), an alteration each in KEAP1 and NFE2L2 (exon 2 deletion) (Supplementary Data 10). The exon 2 deletion in NFE2L2 variant was previously described in squamous cell carcinoma³⁴ and is known to result in the loss of interaction with the negative regulator KEAP1, NFE2L2 stabilization, induction of a NFE2L2 transcriptional response, and KEAP1/NFE2L2 pathway dependence.

We performed copy number analysis using low pass WGS data from 146 GBC samples. ERBB2 was frequently amplified in 13% (19/146) of the GBCs. We confirmed overexpression of ERBB2 in 68% (13/19) of the GBCs with amplification (Supplementary Fig. 10). Furthermore, we found one sample (GBC138) with EGFR amplification and corresponding increased EGFR expression (Supplementary Fig. 10). We also found amplification of MET (GBC061), KRAS (GBC009), and NRAS (GBC001). These genes also showed elevated expression in the corresponding samples (Supplementary Fig. 10). Chromosome 12 showed a distinct recurrent amplification in 6 samples involving YEATS4, RAB3IP, and FRS2. We found expression of these genes to be elevated in these samples (Supplementary Figs. 11 and 12). Among genes that showed copy loss were CDKN2A/B (14/146), SMAD4 (3/146), FHIT (11/146), BAP1 (7/146). PBRM1 (3/146) and PTEN (2/146) and this correlated with lower expression of these genes in the corresponding samples (Supplementary Fig. 13).

Gene fusions in GBC

Analysis of RNA-seq data identified 23 gene fusion events in our GBCs (Supplementary Table 3). In one sample, we found a fusion involving PTPRK and RSPO (Supplementary Fig. 14) that led to overexpression of RSPO3. This gene fusion product is known to promote and potentiate WNT signaling²⁴. Also, we found a recurrent fusion involving two patient tumors where exon 1 of GSK3A was fused in-frame to exon 3 of CDC42EP1, resulting in a transcript coding for GSK3A lacking the kinase domain (Supplementary Fig. 15a). A gene fusion involving PTEN and LIPA (LIPA-PTEN) leading to removal of the sequence encoding the PTEN Tensin C2 domain was identified in a patient tumor (Supplementary Fig. 15b). Another GBC sample carried an SLC12A7-TERT fusion that resulted in overexpression of TERT (Supplementary Fig. 15c). We also found an in-frame GRB7-LASP1 fusion resulting in elevated LASP1 expression (Supplementary Fig. 15). Upregulated LASP1 has been linked to malignant phenotype in cholangiocarcinoma³⁵.

Mutated ELF3 neoantigen peptides activate CD8+ T-cells

Recent advances in cancer immunotherapy have led to impressive survival benefit for patients in some cancers^36,37. Understanding the neoantigens arising from somatic mutations and the composition of tumor immune microenvironment will provide opportunities for immunotherapy in GBC. With this as a goal, from a set of 1301 somatic single nucleotide variants (SNVs) and 240 somatic indels expressed in our GBC samples we predicted high-affinity MHC Class I binding neoantigen peptides (IC50 < 500 nm) (Supplementary Data 11). This resulted in an average of 15 (range 0–51) neoantigens per patient (Fig. 3a). Multiple neoantigens corresponding to mutated TP53, ELF3, CTNNB1, ERBB2, ARID1A, and CDKN2A were predicted. These genes were mutated in at least 4% of the GBC patients (range 4–17%, n = 5–19 from 115 exome/RNA-seq samples; Fig. 3b). Among these, ELF3 had the highest number (n = 9) of predicted neoantigens resulting from frame-shift mutations in GBC. A similar trend was observed for TP53 in which 5 of the 9 high affinity HLA binders were frame-shift mutations. Additionally, recurrent mutations in ERBB2 also contributed to potential neoantigenic peptides. The presence of potential antigenic peptides from recurrent somatic mutations in GBC suggests that these peptides can serve as potential common cancer vaccine candidates.

To test the relevance of these predicted neoantigens, we selected 13 mutant peptides and the corresponding wild-type (WT) sequences from ELF3 (6), CTNNB1 (2), ERBB3 (3), and TP53 (2). We tested these peptides for their ability to activate CD8+ T-cells using HLA-matched healthy donor PBMCs. Antigen-specific activation of CD8+ T-cells was assessed by intracellular IFN-γ production using FACS (IFNG-APC (1:100), Biolegend, Cat. No. 502512; Supplementary Fig. 16). Two mutant ERBB2 peptides, S310Y and S310F, and three ELF3 peptides, Y19fs, L73fs and V345fs were found to activate CD8+ T-cells (Fig. 3c, d).

To determine clonotypic changes and activation of CD8+ T-cells in response to the mutant peptides, we perfomed transcriptome-coupled single-cell T-cell receptor (sc-TCR) sequencing. In this experiment, ELF3 Y19fs was expressed as a minigene in dendritic cells and incubated with CD8+ T-cells (see Methods). TCR sequence analysis revealed CD8+ T-cell clonal expansion when incubated with ELF3 Y19fs mutant expressing dendritic cells. However, we did not detect these TCR sequences in the empty vector control treated cells, indicating that they were specific to the ELF3 Y19fs mutant. The clonally expanded T-cells identified contributed to 8% of the total T-cells in the assay (n = 670) (Fig. 3e and f). Overlaying the single-cell transcriptome data on the clonally expanded T-cells showed that ELF3 Y19fs induced 3-fold higher levels of IFN-γ transcript in the expanded T-cells compared to empty vector control (Fig. 3g–j). Though some clonal expansion of T-cells was observed in empty vector control, the most frequent clonal sequence represented 2% of the total T-cells (n = 178) (Fig. 3e and f; Supplementary Data 12). Importantly, the level of IFN-γ was undetectable in these cells (Fig. 3j). Consistent with these findings, the abundance of the CDR3 sequences identified by single-cell sequencing correlated well with those found in the PMBCs treated with the mutant ELF3 peptide (Fig. 3k).

In addition to the immunogenic ELF3 peptides detected in this study, we found TP53 G154V peptide to also be immunogenic as it resulted in a 2.5-fold increase in clonal amplification of T-cells as assessed by TCR sequencing (Supplementary Fig. 17). These findings taken together suggest that the neoantigenic peptides derived from ELF3, ERBB2, and TP53 have the potential for use as cancer vaccines either alone or in combination with checkpoint inhibitors in GBC patients.

MHC I genotype-linked immunoediting is thought to select against specific cancer driver mutations³⁸. We assessed if the differences in ELF3 mutation rates between Korean and Indian samples could in part be explained by immunoediting, given the differences in the MHC I alleles in these populations³⁹. We performed a neoantigen prediction and binding simulation comparing the specific immunoedited alleles in the two populations (see Methods). We found a slightly higher, albeit significant, percentage of ELF3 neoantigen binding amongst the Indian alleles (43%) when compared to the Korean alleles (41%; OR = 1.1, n = 2168 vs n = 1060, FET p value 0.03) suggesting that immune editing may contribute to the observed regional ELF3 mutation rates.

We performed TCR analysis on bulk RNA-seq data (see Methods and Supplementary Fig. 18) to assess the differences in the T-cell repertoire in ELF3, ERBB2, and TP53 mutated GBC samples. We did not observe significant differences in Shannon Entropy of TCR repertoire diversity between samples carrying the specific neoantigens generating mutations that were tested experimentally in the T-cell activation assay (Supplementary Fig. 18a). However, we did observe a trend for decreased Shannon Entropy of TCR repertoire in ELF3 frameshift mutant samples suggestive of TCR selection. To further power this analysis, we included samples with any frame shift mutation (n = 90) versus those without any frameshift (n = 25) mutations. Here we observed a significantly lower Shannon entropy (p = 0.0008) on the TCR repertoires suggesting that perhaps some TCR selection had occurred in these samples (Supplementary Fig. 18b).

GBC immune microenvironment

Tumor-infiltrating lymphocytes (TILs) and macrophages have been proposed as a prognostic marker in patients with different cancer types^40,41. They can also affect the efficacy of checkpoint inhibitors⁴². We used RNA-seq data to analyze for presence of TILs and macrophages using the xCell package⁴³ (Supplementary Data 13; see Methods) and identified 5 distinct clusters (Fig. 4a). Cluster 1 had significantly higher levels of CD8+ T-cells (p value <0.05; Fig. 4b) and expression of LAG-3 (p value <0.05), a T-cell suppressor marker (Fig. 4c). PD-L1 was also higher on average when compared to the other clusters but it was not statistically significant (Supplementary Fig. 19a). Further, the cluster types were not significantly prognostic (Supplementary Fig. 19b; p value = 0.45), in part perhaps because of the small number of samples in the clusters. There were no distinctive mutation patterns amongst the five clusters (Supplementary Fig. 19c). Cluster 4 showed a higher level of endothelial cell signature (Fig. 4d). As only two patients in cluster 4 had survival data available, we examined the endothelial score based on quartiles. Patients in the highest endothelial cell quartile had a significant reduction in survival (p value = 0.024) (Fig. 4e). Concordant with this the microvascular (mv) gene signature, scores for lymphatic (ly), and endothelial cells are highly correlated (R² = 0.9; Fig. 4a).

**Fig. 4: Tumor-infiltrating lymphocyte cluster (TIL) in GBC samples.**

KEAP1/NFE2L2 pathway involvement in GBC

KEAP1/NFE2L2, a cellular pathway for sensing and responding to oxidative stress, is frequently mutated in human cancers. We identified several patients with alterations in the transcription factor NFE2L2 (n = 11) and its negative regulators KEAP1 (n = 3) and CUL3 (n = 3) (Fig. 5a–c). Most NFE2L2 alterations (6/11) were found in the N-terminal region required for interaction with KEAP1. Loss-of-function mutations in KEAP1 or activating mutations in NFE2L2 can result in the activation of 27 NFE2L2 downstream target genes, which can be used as a gene signature summarized in a pathway activation score³⁴. Application of the gene signature to tumors with available RNA-seq data identified a group of samples that showed elevated expression for most target genes, classifying patients into NRF2+ (score >15) and NRF2- patients. Hierarchical clustering based on the 27 signature genes segregated the two groups (Fig. 5d). Patients with mutations in pathway genes NFE2L2, KEAP1, and CUL3 were overrepresented in the NRF2+ group (n = 6/14 NRF2+ vs n = 2/87 NRF2- patients, P < 6 × 10^–5, two-sided Fisher’s Exact Test). To search for additional genes that may be involved in pathway activation, we considered significantly mutated GBC genes or known cancer-associated genes recurrently mutated in our data set and tested for overrepresentation among NRF2+ patients. Among 232 genes tested ARID2 was the most enriched mutated gene (p = 0.00074, FDR = 0.13). A recent report⁴⁴ has shown that KEAP1/NFE2L2 pathway activation, which leads to the reduction of reactive oxygen species, may help suppress macrophage inflammation response. Interestingly, we observed no difference in M1 macrophages but found a significantly higher level of M2 macrophages (Fig. 5e; p value = 0.016) in NRF2+ samples, consistent with a suppressed macrophage environment. We also found that KEAP1/NFE2L2 pathway activation, using RNA-seq, appears to be a significant prognostic predictor of survival (Fig. 5f; p value = 0.049).

**Fig. 5: KEAP1/NFE2L2 pathway activation in GBC.**

Pathway alterations in GBC

We integrated exome, copy number variation and gene fusion data within pathways (Fig. 6a–f) and country of origin. The p53/RB1 pathway was the most commonly altered pathway in GBC (Fig. 6f). The WNT pathway was primarily being driven through activating CTNNB1 mutations (Fig. 6a) though we also found an activating RSPO3 fusion. The SWI/SNF pathway had frequent inactivating mutations in SMARCA4, ARID1A, and ARID2 (Fig. 6b). We found many therapeutically actionable mutations in the RAS/PI3K pathway involving frequent alterations involving ERBB2, ERBB3, BRAF, and PIK3CA (Fig. 6c). We also found frequent inactivation of the ETS family members ELF3 and EHF (Fig. 6d). Our data demonstrate a role for KEAP1/NFE2L2 pathway activation in GBC (Fig. 6e).

**Fig. 6: Integrated analysis of pathway alterations observed in GBC.**

Discussion

We have performed a comprehensive integrative genomic analysis of 167 gallbladder primary tumors, and 7 GBC cell lines. Also, we have analyzed premalignant gallbladder tissue from 23 cholecystitis cases, 14 gallstones, and 2 gallbladder polyps. We found somatic mutations in cholecystitis that were indicative of a premalignant stage. Our study uncovered a class of hypermutated GBC that carried mutations in mismatch repair genes. We report 25 significantly mutated GBC genes that include several targetable driver genes such as ERBB2, ERBB3, KRAS, PIK3CA, and BRAF. Importantly, several of the ERBB2 mutations observed are known to be oncogenic and targetable⁴⁵ and patients with such mutations are candidates for targeted HER2 therapy. Analysis of exome and RNA-seq data identified recurrent alterations in KEAP1/NFE2L2 and WNT pathways. Cancer vaccines or checkpoint inhibitors have not been approved for treating gallbladder cancers. We have identified neoantigens from several mutated GBC genes including ELF3, ERBB2, and TP53 and found that they were capable of T-cell activation indicating that they are potential cancer vaccine candidates. Further, we have identified GBC samples with MSI and they likely are candidates for checkpoint inhibitor therapy⁴⁶. Together these findings provide an opportunity for testing immunotherapy in gallbladder cancer.

Overall, our study significantly expands on previous genomic studies providing a comprehensive genomics view of GBCs. Specifically, we identify actionable alterations in over 20% of our cases (Supplementary Fig. 20 and Supplementary Table 4). There are no targeted therapies approved for GBC and current standard-of-care for GBC involves surgery, chemo- and radiation-therapy. Incorporating genomic analysis as part of GBC patient care in the clinic will help improve outcomes through use of approved targeted therapies. Also, the GBC molecular alterations reported in this study and others^5,6,7,8,9 are an opportunity for development of new therapies.

Methods

Samples, DNA and RNA preps

In this study, we analyzed 167 human primary GBC samples as well as 39 non-GBC samples and the corresponding matched normal tissue in most cases using exome-seq, and/or low-pass whole-genome sequencing and/or RNA-seq (Supplementary Table 1). Fresh frozen samples used in the study were obtained from patients undergoing extirpative surgery for GBC. This study was conducted with IRB approval (Pontificia Universidad Católica de Chile IRB, Institutional Human Ethics Committee of Jiwaji University (India) and Seoul National University Hospital IRB (Seoul)) and written patient informed consent. Human tissue samples were de-identified prior to their shipment and analysis and are not considered human subject research under the US Department of Human and Health Services regulations and related guidance (45 CFR Part 46). Basic demographic information for the patient samples in the study, where available, is included in Supplementary Data 1. Tissue processing as well as simultaneous extraction of high-quality genomic DNA and total RNA from the same samples were performed as previously described⁴⁷. The study also included GBC cell lines TGBC24TKB, TGBC2TKB, G-415 (RIKEN Bio Resource Center, Ibaraki, Japan), OCUG-1 (Health Science Research Resources Bank, Osaka, Japan), SNU-308 (Korean Cell Line Bank, Seoul, Korea), GB-d1 (From Dr. Masao Tanaka’s lab, Japan)⁴⁸.

Exome capture and sequencing

Using the Agilent SureSelect Human All Exome kit (50 Mb), we generated libraries and sequenced them on HiSeq 2500/4000 (Illumina, CA) to generate 2 × 75 bp paired-end data. We obtained a targeted mean coverage of 93x with 93% bases covered at ≥10x (Supplementary Data 2).

RNA-seq

RNA-seq libraries were generated using TruSeq RNA Sample Preparation v2 kit (Illumina, CA) and sequenced on HiSeq 2500 and HiSeq 4000 to obtain ~63 million 2 × 75 bp paired-end (average) reads per sample.

Whole-genome sequencing

Low pass whole-genome sequencing (Illumina, CA) data (an average of 2.3x) for tumors and matched normal samples were obtained using whole-genome libraries were prepared according to manufacture’s instructions (Illumina, CA).

Sequence data processing

We evaluated all sequencing reads for quality using BioConductor ShortRead package⁴⁹. Sample identities were confirmed by comparing exome and RNA-Seq data variants for concordance. We performed an all-against-all sample comparison of germline variants and confirmed the patient tumor-normal pairing.

Variant calling

Sequencing reads were mapped to UCSC human genome (GRCh38) using BWA software⁵⁰ set to default parameters. Local realignment, duplicate marking, and variant calling of germline variants were performed as described previously⁵¹. Strelka⁵² was used for somatic variant calling on tumor and its matched normal BAM file. Known germline variants represented in dbSNP Build 131⁵³ or found in the ExAC database⁵⁴ at an allele frequency ≥0.1% were filtered out for all samples. The effect of all nonsynonymous somatic mutations on gene function was predicted using PolyPhen⁵⁵. Variants were annotated using Ensembl (release 86). TCGA mutation data used in mutation rate comparison across tumors were retrieved using the CGDSR R package from cBioPortal^56,57.

Additional data QC

Sample origins were confirmed using principal component analysis using 5709 common variants⁵⁴ in exomes from this study. Samples were colored according to their known country of origin and clustered using the first and second principal component. Mutant variants with RNA-seq reads count ≥ 1 confirmed their expression. Significantly mutated genes had to have at least one third of their mutations validated by RNA-seq and also have a minimum of half their mutant calls confirmed by the MuTect somatic variant calling algorithm⁵⁸. Recurrently mutated codons that could not be confirmed by RNA-seq were excluded from hotspot analysis, but were retained in the mutation list.

MSI status determination

MANTIS (Microsatellite Analysis for Normal-Tumor InStability) was used to detect microsatellite instability for all paired GBC samples. Microsatellites within the reference genome (GRCh38) were identified using RepeatFinder, which is included in MANTIS, with default options. MANTIS was run with the options recommended for whole-exome data: -mrq 20.0 -mlq 25.0 -mlc 20 -mrr 1 --threads 8. Samples with a score >0.35 were predicted to be MSI.

Evaluation of mutations using simulation

A database of all possible nonsynonymous mutations (~70 million) within our exome targets was generated and classified into one of six mutation types, C:G > G:C, C:G > A:T, C:G > T:A, T:A > A:T, T:A > C:G or T:A > G:C. We assessed the functional impact of each mutation using PolyPhen⁵⁵, SIFT⁵⁹, and Condel⁶⁰. Mutations were classified as deleterious when at least two of the three methods employed showed that it had an adverse functional impact. Monte Carlo simulations were performed to assess if the observed ELF mutations differed from randomly generated mutations as described previously⁶¹.

Mutational signatures

GBC exome sequence data was analyzed for the frequency of the possible 96 mutation types as described recently¹⁵. TCGA exome data for 2437 samples from 8 other cancer from SomaticCancerAlterations Bioconductor package and two small cell lung cancer studies^62,63 was also included in the analysis. We detected a set of six common signatures using Non-Negative Matrix Factorization across the combined data set. Using the mutSignatures (https://cancer.sanger.ac.uk/cosmic/signatures_v2) package⁶⁴ we compared our signatures to that reported in COSMIC¹⁰ version 2. We also repeated the analysis after removing the MSI samples (Supplementary Fig. 5).

Mutational significance and hotspot meta-analysis

We evaluated the mutational significance of genes using MuSIC⁶⁵. Given their outlier mutation rate, MSI samples were excluded from this calculation. Q-scores were calculated by taking the negative log₁₀ of the CRT q-values produced by MuSIC and SMGs were selected with a minimum q-score of 1. Germline variants of interest were considered for the following genes: BRCA1, BRCA2, TP53, MEN1, MLH1, MSH2, MSH6, PMS1, and PMS2. Hotspot meta-analysis was performed as previously described⁶⁶. For the hotspot meta-analysis, we compared GBC mutations to a previous pan-cancer analysis identifying high-confidence recurrent somatic hotspot mutations in cancer³⁰. Hotspot mutations within the data set were matched by codon position within a gene.

RNA-seq data analysis

RNA-seq reads aligned to the human genome version GRCh38 using GSNAP⁶⁷ were used to compute the gene level expression counts. This involved counting the number of reads aligning concordantly within a pair and uniquely to each gene locus using gene models defined by NCBI and Ensembl gene annotations, and RefSeq mRNA database. Variance stabilized expression values for plotting the expression heatmaps were computed using DESeq2⁶⁸. Unsupervised consensus clustering of top 400 most variable genes was performed using the variance stabilized expression values as input to the ConsensusPlus method implemented by the R package ConsensusClusterPlus.

Identification of transcript alterations

Analysis of splice variants was performed using the R/Bioconductor software package SGSeq (1.8.1)⁶⁹. We performed de novo prediction of gene models from aligned RNA-seq reads for 115 tumors and 4 cholecystitis samples using default parameters. Splice variants were identified from gene models and quantified in terms of FPKM and relative usage PSI (percent spliced-in). PSI estimates with denominator < 10 were set to NA. Splice variants detected in gallbladder samples were also quantified in 9155 normal human tissue samples from the Genotype-Tissue expression (GTEx) project³³. To identify transcript alterations, we considered splice variants in 835 candidate genes and selected those with FPKM > 2 and PSI > 0.1 in at least one gallbladder sample, and FPKM = 0 in >99.8% of GTEx samples. Identified variants were called in gallbladder samples for which FPKM > 2. FPKM-based criteria were required at both start and end of the splice variant. Alternative starts, ends and retained introns were excluded. Effect on protein-coding potential was assessed with respect to canonical transcript isoforms.

Low pass whole-genome copy number analysis

The genome was divided in 10 kb bins and the number of reads in each bin provided a count for the genomic bins. This was used to estimate copy number ratio by computing the log2 ratio of the tumor counts with the corresponding normal sample counts and adjusting for total number of reads for each sample. The copy number ratios were then segmented using circular binary segmentation (CBS) and the segments were used to assign a copy number log2 ratio for each gene. Recurrent genomic regions with DNA copy gain and loss were identified using GISTIC2⁷⁰ using log₂ copy number ratio >0.4 and <−0.4 for gains and losses, respectively.

Gene fusion detection and validation

Putative fusions were identified using a computational pipeline we have developed called GSTRUCT-fusion⁶². Only fusion events that had at least 4 reads mapping to the fusion junction were included for further consideration. We then further manually curated the fusion results by removing events that are likely false positives.

Neoantigen prediction and immune editing

The seq2HLA program⁷¹ was used to assign HLA genotypes based on RNA-seq data using a P value cutoff of 0.01. Predicted neoantigens expression was confirmed using the RNA-seq data. The NetMHCcons algorithm from the IEDB software suite⁷² was used to perform predictions on a sliding window of 8–11mers on mutant peptide sequences and the best affinity peptide was chosen as a representative.

We performed immune editing simulations as follows. First, for every Korean and Indian sample, we took HLA genotypes as predicted from the seq2HLA program and recorded the allele on two lists based on origin, HLA-India and HLA-Korea. These lists preserved the observed frequency of HLA alleles observed. Next, we investigated if there were differences in predicted HLA binding affinities based on the HLA alleles observed by origin. To do this, we performed 5000 Monte Carlo Simulations in which we randomly selected a single observed ELF3 mutation as well as a single HLA-Korean allele from their respective list. For each Korean HLA allele and ELF3 mutation we performed neoantigen binding affinity predictions as described above and recorded them as binders (≤500 nm) or non-binders. We then did the same for HLA-India alleles. We then calculated a p value for binders verses non-binders between the two simulations using a two-sided Fisher’s Exact Test.

Testing neoantigen peptide activity

Briefly, 10 µM wild-type and mutant peptides were incubated with 0.5 million PBMCs in the presence of IL-2 and IL-15 and the incubation mixture was replenished with a fresh batch of peptide-cytokine mix every 3-days. On day-22, PBMCs were harvested and stained for CD3 (0.125 ug/100 ul; Invitrogen Cat. No.12003942;), CD8 (0.125 ug/100 ul; Invitrogen Cat. No. 17008742) and intracellular IFN-γ (Biolegend Cat. No. 502512; dilution 1:100). Antigen-specific activation of T-cells was determined by FACS from the frequency of CD3⁺/CD8⁺ T-cells expressing IFN-γ. Antigen-induced TCR repertoire analysis was performed by subjecting a portion of the PBMC to TCR sequencing using Immuno-SEQ assay (Adaptive Biotechnologies, WA). Unique and shared TCR clones were identified by comparing the TCR repertoire of mutant and WT peptides or DMSO controls. Neoantigens were additionally tested for immunogenicity using a minigene assay (OncoPept^TM, MedGenome Inc., CA). Briefly, monocytes prepared from donor PBMCs were differentiated into dendritic cells (DCs)⁷³ and transfected with minigenes encoding five tandem sequences of 9-mer mutant peptides or empty vector as control. After 24 h, transfected DCs were co-cultured with 4-fold excess of purified CD8+ T-cells for 10 days. The co-culture was re-stimulated using autologous PBMCs electroporated with the minigene vectors and 48 h post re-stimulation, cells were stained for CD3, CD8, and intracellular IFN-γ to determine antigen-specific activation of IFN-γ. Approximately 10,000 cells from this assay were collected, washed and subjected to single-cell sequencing with immune profiling to determine the gene expression profile in combination with the TCR repertoire as per manufacturer’s instruction (10x Genomics, CA). Sequencing results were evaluated using Loupe Cell and Loupe V(D)J Browsers (10x Genomics, CA) to assess antigen-specific CD8 T cell clonotype induction and their corresponding functional gene expression profiles.

Estimating cellular content from bulk RNA-seq data

The cellular composition of 115 tumor samples with available RNA-seq data was analyzed using xCell (R package version 1.1.0)⁴³. xCell incorporates a novel method to remove dependencies between similar cell types and utilizes gene signatures for over 60 immune and stroma cell types to estimate the enrichment of each cell type in a tissue sample. We then performed hierarchical clustering using the pvclust R package⁷⁴ using the parallel feature and running hclust with method ward.D2 using euclidean distance measurements. We then cut the tree for 5 clusters after visual inspection of the heatmap.

TCR repertoire analysis

Fastq files were trimmed to remove adapter sequences and low-quality reads using Trimmomatic⁷⁵. The clean fastq files were then analyzed using MiXCR⁷⁶ to identify TCR clonotypes. Based on TCR clones, Shannon index was calculated using R package vegan (https://cran.r-project.org/web/packages/vegan/).

Actionable alterations in GBC

Actionable alterations were identified by comparing the mutations detected in the study against a list of actionable cancer gene alterations in the OncoKB database (https://www.oncokb.org/actionableGenes).

Pathway analysis

Mutational significance of genes from the Reactome^77,78 pathways in the MSigDB v4.0⁷⁹, modified to include NF2, was computed using the MuSIC⁶⁵ path-scan program. Integrated analysis of mutation, gene expression, copy number variation and fusion data, limited to samples with alterations, is shown in the quilt plots. Alteration frequencies calculation included all samples that contained data for a given alteration type.

Data availability

The Exome, RNA-Seq and WGS data are available through the European Genome Archive under accession EGAS00001003004 [TCR sequencing/expression data reported in Fig. 3k, e–j can be obtained by contacting the Institutional data access committee gDAC-gbc@gene.com under an MTA that will allow the use of the TCR sequences for non-commercial research studies. All other relevant data are available in the article, supplementary information, or from the corresponding author upon reasonable request.

References

Wistuba, I. I. & Gazdar, A. F. Gallbladder cancer: lessons from a rare tumour. Nat. Rev. Cancer 4, 695–706 (2004).
CAS PubMed Google Scholar
Torre, L. A. et al. Global cancer statistics, 2012. CA Cancer J. Clin. 65, 87–108 (2015).
PubMed Google Scholar
Siegel, R. L., Miller, K. D. & Jemal, A. Cancer statistics, 2016. CA Cancer J. Clin. 66, 7–30 (2016).
PubMed Google Scholar
Hundal, R. & Shaffer, E. A. Gallbladder cancer: epidemiology and outcome. Clin. Epidemiol. 6, 99–109 (2014).
PubMed PubMed Central Google Scholar
Jiao, Y. et al. Exome sequencing identifies frequent inactivating mutations in BAP1, ARID1A and PBRM1 in intrahepatic cholangiocarcinomas. Nat. Genet. 45, 1470–1473 (2013).
CAS PubMed PubMed Central Google Scholar
Li, M. et al. Whole-exome and targeted gene sequencing of gallbladder carcinoma identifies recurrent mutations in the ErbB pathway. Nat. Genet. 46, 872–876 (2014).
CAS PubMed Google Scholar
Nakamura, H. et al. Genomic spectra of biliary tract cancer. Nat. Genet. 47, 1003–1010 (2015).
CAS PubMed Google Scholar
Li, M. et al. Genomic ERBB2/ERBB3 mutations promote PD-L1-mediated immune escape in gallbladder cancer: a whole-exome sequencing analysis. Gut https://doi.org/10.1136/gutjnl-2018-316039 (2018).
Article PubMed Google Scholar
Wardell, C. P. et al. Genomic characterization of biliary tract cancers identifies driver genes and predisposing mutations. J. Hepatol. 68, 959–969 (2018).
CAS PubMed Google Scholar
Forbes, S. A. et al. COSMIC (the Catalogue of Somatic Mutations in Cancer): a resource to investigate acquired mutations in human cancer. Nucleic Acids Res. 38, D652–D657 (2010).
CAS PubMed Google Scholar
Cancer Genome Atlas Research, N. Comprehensive molecular characterization of gastric adenocarcinoma. Nature 513, 202–209 (2014).
ADS Google Scholar
Cheung, L. W. et al. High frequency of PIK3R1 and PIK3R2 mutations in endometrial cancer elucidates a novel mechanism for regulation of PTEN protein stability. Cancer Disco. 1, 170–185 (2011).
CAS Google Scholar
Cancer Genome Atlas Research, N. et al. Integrated genomic characterization of endometrial carcinoma. Nature 497, 67–73 (2013).
ADS Google Scholar
Kautto, E. A. et al. Performance evaluation for rapid detection of pan-cancer microsatellite instability with MANTIS. Oncotarget 8, 7452–7463 (2017).
PubMed Google Scholar
Alexandrov, L. B., Nik-Zainal, S., Wedge, D. C., Campbell, P. J. & Stratton, M. R. Deciphering signatures of mutational processes operative in human cancer. Cell Rep. 3, 246–259 (2013).
CAS PubMed PubMed Central Google Scholar
Yachida, S. et al. Genomic sequencing identifies ELF3 as a driver of ampullary carcinoma. Cancer Cell 29, 229–240 (2016).
CAS PubMed PubMed Central Google Scholar
Gingras, M. C. et al. Ampullary cancers harbor ELF3 tumor suppressor gene mutations and exhibit frequent WNT dysregulation. Cell Rep. 14, 907–919 (2016).
CAS PubMed PubMed Central Google Scholar
Adzhubei, I. A. et al. A method and server for predicting damaging missense mutations. Nat. Methods 7, 248–249 (2010).
CAS PubMed PubMed Central Google Scholar
Dees, N. D. et al. MuSiC: identifying mutational significance in cancer genomes. Genome Res. 22, 1589–1598 (2012).
CAS PubMed PubMed Central Google Scholar
Tekcham, D. S. et al. Epigenetic regulation of APC in the molecular pathogenesis of gallbladder cancer. Indian J. Med. Res. 143, S82–S90 (2016).
PubMed PubMed Central Google Scholar
Ojesina, A. I. et al. Landscape of genomic alterations in cervical carcinomas. Nature 506, 371–375 (2014).
ADS CAS PubMed Google Scholar
Cancer Genome Atlas Research, N. Comprehensive molecular characterization of urothelial bladder carcinoma. Nature 507, 315–322 (2014).
ADS Google Scholar
Wang, K. et al. Whole-genome sequencing and comprehensive molecular profiling identify new driver mutations in gastric cancer. Nat. Genet. 46, 573–582 (2014).
CAS PubMed Google Scholar
Seshagiri, S. et al. Recurrent R-spondin fusions in colon cancer. Nature 488, 660–664 (2012).
ADS CAS PubMed PubMed Central Google Scholar
TCGA. Comprehensive molecular characterization of human colon and rectal cancer. Nature 487, 330–337 (2012).
ADS Google Scholar
Luk, I. Y., Reehorst, C. M. & Mariadason, J. M. ELF3, ELF5, EHF and SPDEF transcription factors in tissue homeostasis and cancer. Molecules 23, https://doi.org/10.3390/molecules23092191 (2018).
Pradeepa, M. M., Sutherland, H. G., Ule, J., Grimes, G. R. & Bickmore, W. A. Psip1/Ledgf p52 binds methylated histone H3K36 and splicing factors and contributes to the regulation of alternative splicing. PLoS Genet. 8, e1002717 (2012).
CAS PubMed PubMed Central Google Scholar
Lee, M. C., Yang, Y. C., Chen, Y. C. & Huang, S. C. Muscarinic receptor M3 mediates human gallbladder contraction through voltage-gated Ca2+ channels and Rho kinase. Scand. J. Gastroenterol. 48, 205–212 (2013).
CAS PubMed Google Scholar
Lawrence, M. S. et al. Discovery and saturation analysis of cancer genes across 21 tumour types. Nature 505, 495–501 (2014).
ADS CAS PubMed PubMed Central Google Scholar
Chang, M. T. et al. Identifying recurrent mutations in cancer reveals widespread lineage diversity and mutational specificity. Nat. Biotechnol. 34, 155–163 (2016).
CAS PubMed Google Scholar
Jaiswal, B. S. et al. Oncogenic ERBB3 mutations in human cancers. Cancer Cell 23, 603–617 (2013).
CAS PubMed Google Scholar
Yang, X. C. et al. A conserved interaction that is essential for the biogenesis of histone locus bodies. J. Biol. Chem. 289, 33767–33782 (2014).
CAS PubMed PubMed Central Google Scholar
Consortium, G. T. Human genomics. The Genotype-Tissue Expression (GTEx) pilot analysis: multitissue gene regulation in humans. Science 348, 648–660 (2015).
Google Scholar
Goldstein, L. D. et al. Recurrent loss of NFE2L2 Exon 2 is a mechanism for Nrf2 pathway activation in human cancers. Cell Rep. 16, 2605–2617 (2016).
CAS PubMed Google Scholar
Zhang, H. et al. Upregulated LASP-1 correlates with a malignant phenotype and its potential therapeutic role in human cholangiocarcinoma. Tumour Biol. 37, 8305–8315 (2016).
CAS PubMed Google Scholar
Emens, L. A. et al. Cancer immunotherapy: opportunities and challenges in the rapidly evolving clinical landscape. Eur. J. Cancer 81, 116–129 (2017).
CAS PubMed Google Scholar
Hu, Z., Ott, P. A. & Wu, C. J. Towards personalized, tumour-specific, therapeutic vaccines for cancer. Nat. Rev. Immunol. 18, 168–182 (2018).
CAS PubMed Google Scholar
Marty, R. et al. MHC-I genotype restricts the oncogenic mutational landscape. Cell 171, 1272–1283 e1215 (2017).
CAS PubMed PubMed Central Google Scholar
Gonzalez-Galarza, F. F., Christmas, S., Middleton, D. & Jones, A. R. Allele frequency net: a database and online repository for immune gene frequencies in worldwide populations. Nucleic Acids Res 39, D913–D919 (2011).
CAS PubMed Google Scholar
Ujiie, H. et al. The tumoral and stromal immune microenvironment in malignant pleural mesothelioma: a comprehensive analysis reveals prognostic immune markers. Oncoimmunology 4, e1009285 (2015).
PubMed PubMed Central Google Scholar
Cornelissen, R. et al. Ratio of intratumoral macrophage phenotypes is a prognostic factor in epithelioid malignant pleural mesothelioma. PLoS One 9, e106742 (2014).
ADS PubMed PubMed Central Google Scholar
Riaz, N. et al. Tumor and microenvironment evolution during Immunotherapy with Nivolumab. Cell 171, 934–949 e916 (2017).
CAS PubMed PubMed Central Google Scholar
Aran, D., Hu, Z. & Butte, A. J. xCell: digitally portraying the tissue cellular heterogeneity landscape. Genome Biol. 18, 220 (2017).
PubMed PubMed Central Google Scholar
Kobayashi, E. H. et al. Nrf2 suppresses macrophage inflammatory response by blocking proinflammatory cytokine transcription. Nat. Commun. 7, 11624 (2016).
ADS CAS PubMed PubMed Central Google Scholar
Kavuri, S. M. et al. HER2 activating mutations are targets for colorectal cancer treatment. Cancer Disco. 5, 832–841 (2015).
CAS Google Scholar
Boyiadzis, M. M. et al. Significance and implications of FDA approval of pembrolizumab for biomarker-defined disease. J. Immunother. Cancer 6, 35 (2018).
PubMed PubMed Central Google Scholar
Richards, W. G. et al. A microaliquoting technique for precise histological annotation and optimization of cell content in frozen tissue specimens. Biotech. Histochem. 82, 189–197 (2007).
CAS PubMed Google Scholar
Shimura, H., Date, K., Matsumoto, K., Nakamura, T. & Tanaka, M. Induction of invasive growth in a gallbladder cancer cell line by hepatocyte growth factor in vitro. Jpn J. Cancer Res. 86, 662–669 (1995).
CAS PubMed PubMed Central Google Scholar
Morgan, M. et al. ShortRead: a bioconductor package for input, quality assessment and exploration of high-throughput sequence data. Bioinformatics 25, 2607–2608 (2009).
CAS PubMed PubMed Central Google Scholar
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25, 1754–1760 (2009).
CAS PubMed PubMed Central Google Scholar
DePristo, M. A. et al. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat. Genet. 43, 491–498 (2011).
CAS PubMed PubMed Central Google Scholar
Saunders, C. T. et al. Strelka: accurate somatic small-variant calling from sequenced tumor-normal sample pairs. Bioinformatics 28, 1811–1817 (2012).
CAS PubMed Google Scholar
Sherry, S. T. et al. dbSNP: the NCBI database of genetic variation. Nucleic Acids Res. 29, 308–311 (2001).
CAS PubMed PubMed Central Google Scholar
Lek, M. et al. Analysis of protein-coding genetic variation in 60,706 humans. Nature 536, 285–291 (2016).
CAS PubMed PubMed Central Google Scholar
Ramensky, V., Bork, P. & Sunyaev, S. Human non-synonymous SNPs: server and survey. Nucleic Acids Res. 30, 3894–3900 (2002).
CAS PubMed PubMed Central Google Scholar
Cerami, E. et al. The cBio cancer genomics portal: an open platform for exploring multidimensional cancer genomics data. Cancer Disco. 2, 401–404 (2012).
Google Scholar
Gao, J. et al. Integrative analysis of complex cancer genomics and clinical profiles using the cBioPortal. Sci. Signal 6, pl1 (2013).
PubMed PubMed Central Google Scholar
Cibulskis, K. et al. Sensitive detection of somatic point mutations in impure and heterogeneous cancer samples. Nat. Biotechnol. 31, 213–219 (2013).
CAS PubMed PubMed Central Google Scholar
Ng, P. C. & Henikoff, S. Accounting for human polymorphisms predicted to affect protein function. Genome Res. 12, 436–446 (2002).
CAS PubMed PubMed Central Google Scholar
Gonzalez-Perez, A. & Lopez-Bigas, N. Improving the assessment of the outcome of nonsynonymous SNVs with a consensus deleteriousness score, Condel. Am. J. Hum. Genet. 88, 440–449 (2011).
CAS PubMed PubMed Central Google Scholar
Durinck, S. et al. Spectrum of diverse genomic alterations define non-clear cell renal carcinoma subtypes. Nat. Genet. 47, 13–21 (2015).
CAS PubMed Google Scholar
Rudin, C. M. et al. Comprehensive genomic analysis identifies SOX2 as a frequently amplified gene in small-cell lung cancer. Nat. Genet. 44, 1111–1116 (2012).
CAS PubMed PubMed Central Google Scholar
Peifer, M. et al. Integrative genome analyses identify key somatic driver mutations of small-cell lung cancer. Nat. Genet. 44, 1104–1110 (2012).
CAS PubMed PubMed Central Google Scholar
Fantini, D., Vidimar, V., Yu, Y., Condello, S. & Meeks, J. J. MutSignatures: an R package for extraction and analysis of cancer mutational signatures. bioRxiv https://doi.org/10.1101/2020.03.15.992826 (2020).
Article Google Scholar
Dees, N. et al. MuSiC: identifying mutational significance in cancer genomes. Genome Res. 22, 1589–1598 (2012).
Bueno, R. et al. Comprehensive genomic analysis of malignant pleural mesothelioma identifies recurrent mutations, gene fusions and splicing alterations. Nat. Genet. 48, 407–416 (2016).
CAS PubMed Google Scholar
Wu, T. D. & Nacu, S. Fast and SNP-tolerant detection of complex variants and splicing in short reads. Bioinformatics 26, 873–881 (2010).
CAS PubMed PubMed Central Google Scholar
Anders, S. & Huber, W. Differential expression analysis for sequence count data. Genome Biol. 11, R106 (2010).
CAS PubMed PubMed Central Google Scholar
Goldstein, L. D. et al. Prediction and quantification of splice events from RNA-seq data. PLoS One 11, e0156132 (2016).
PubMed PubMed Central Google Scholar
Mermel, C. H. et al. GISTIC2.0 facilitates sensitive and confident localization of the targets of focal somatic copy-number alteration in human cancers. Genome Biol. 12, R41 (2011).
PubMed PubMed Central Google Scholar
Boegel, S. et al. HLA typing from RNA-Seq sequence reads. Genome Med. 4, 102 (2012).
PubMed PubMed Central Google Scholar
Vita, R. et al. The immune epitope database (IEDB) 3.0. Nucleic Acids Res. 43, D405–D412 (2015).
CAS PubMed Google Scholar
Wolfl, M. & Greenberg, P. D. Antigen-specific activation and cytokine-facilitated expansion of naive, human CD8+ T cells. Nat. Protoc. 9, 950–966 (2014).
PubMed PubMed Central Google Scholar
Suzuki, R. & Shimodaira, H. Pvclust: an R package for assessing the uncertainty in hierarchical clustering. Bioinformatics 22, 1540–1542 (2006).
CAS PubMed Google Scholar
Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120 (2014).
CAS PubMed PubMed Central Google Scholar
Bolotin, D. A. et al. MiXCR: software for comprehensive adaptive immunity profiling. Nat. Methods 12, 380–381 (2015).
CAS PubMed Google Scholar
Croft, D. et al. Reactome: a database of reactions, pathways and biological processes. Nucleic Acids Res. 39, D691–D697 (2011).
CAS PubMed Google Scholar
Milacic, M. et al. Annotating cancer variants and anti-cancer therapeutics in reactome. Cancers 4, 1180–1211 (2012).
CAS PubMed PubMed Central Google Scholar
Subramanian, A. et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc. Natl Acad. Sci. USA 102, 15545–15550 (2005).
ADS CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

The authors would like to acknowledge Genentech NGS lab and Bioinformatics groups for their help with the project. We also thank Dr. Ruchi Chaudhary, MedGenome, for bioinformatics support. A.P. was supported by NIH Grant R01CA184165 and NCI’s Clinical Proteomic Tumor Analysis Consortium Initiative U24CA210985. P.K.T was supported by grants from DST-SERB, Govt. of India.

Author information

These authors contributed equally: Akhilesh Pandey, Eric W. Stawiski, Steffen Durinck, Harsha Gowda.
These authors jointly supervised this work: Akhilesh Pandey, Eric W. Stawiski, Pramod Kumar Tiwari, Jin-Young Jang, Somasekar Seshagiri.

Authors and Affiliations

Institute of Bioinformatics, Bangalore, Karnataka, 560066, India
Akhilesh Pandey, Harsha Gowda, Mustafa A. Barbhuiya, Sreelakshmi K. Sreenivasamurthy, Aditi Chatterjee, Keshava Datta, Aafaque A. Khan & Tejaswini Subbannayya
Manipal Academy of Higher Education (MAHE), Manipal, Karnataka, 576104, India
Akhilesh Pandey
Center for Individualized Medicine and Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, MN, 55905, USA
Akhilesh Pandey
Bioinformatics and Computational Biology Department, Genentech Inc, South San Francisco, CA, 94080, USA
Eric W. Stawiski, Steffen Durinck & Leonard D. Goldstein
Molecular Biology Department, Genentech Inc., South San Francisco, CA, 94080, USA
Eric W. Stawiski, Steffen Durinck, Leonard D. Goldstein, Markus S. Schröder, Subhra Chaudhuri, Bijay S. Jaiswal, Zora Modrusan & Somasekar Seshagiri
Research and Development Department, MedGenome Inc, Foster City, CA, 94404, USA
Eric W. Stawiski, Kushal Suryamohan, Kayla Lee, Papia Chakraborty, Vasumathi Kode, Xiaoshan Shi, Jing Wang & Amitabha Chaudhuri
QIMR Berghofer Medical Research Institute, Brisbane, QLD, 4006, Australia
Harsha Gowda
Department of Pathology and Laboratory Medicine, Pennsylvania State University College of Medicine, Hershey, PA, 17033, USA
Mustafa A. Barbhuiya
SciGenom Labs, Cochin, Kerala, 682037, India
Markus S. Schröder
Department of Surgery, Division of Hepatobiliary and Pancreatic Surgery, Seoul National University Hospital, Seoul National University College of Medicine, Seoul, 08826, South Korea
Sun-Whe Kim, Wooil Kwon, Hongbeom Kim, Youngmin Han & Jin-Young Jang
Research and Development Department, MedGenome Labs Pvt. Ltd., Bangalore, Karnataka, 560099, India
Sameer Phalke & Vedam L. Ramprasad
Department of Pathology, Cancer Hospital and Research Institute, Gwalior, Madhya Pradesh, 474009, India
Sanjiv Gupta
Department of Surgical Oncology, Cancer Hospital and Research Institute, Gwalior, Madhya Pradesh, 474009, India
Braj Raj Shrivastav
Jiwaji University, Gwalior, Madhya Pradesh, 474011, India
Satish S. Poojary, Shushruta Bhunia & Pramod Kumar Tiwari
Department of Pathology, Millennium Institute on Immunology and Immunotherapy, School of Medicine, Pontificia Universidad Católica de Chile, Santiago, Chile
Patricia Garcia, Carolina Bizama & Juan Carlos Roa
Applied Molecular and Cellular Biology PhD Program Universidad De la Frontera, Temuco, Chile
Lorena Rosa
Department of Surgery, Postgraduate Institute of Medical Education and Research, Chandigarh, 160012, India
Thakur Deen Yadav
SciGenom Research Foundation, 3rd Floor, Narayana Nethralaya Building, Narayana Health City, #258/A, Bommasandra, Hosur Road, Bangalore, Karnataka, 560099, India
Somasekar Seshagiri

Authors

Akhilesh Pandey
View author publications
You can also search for this author in PubMed Google Scholar
Eric W. Stawiski
View author publications
You can also search for this author in PubMed Google Scholar
Steffen Durinck
View author publications
You can also search for this author in PubMed Google Scholar
Harsha Gowda
View author publications
You can also search for this author in PubMed Google Scholar
Leonard D. Goldstein
View author publications
You can also search for this author in PubMed Google Scholar
Mustafa A. Barbhuiya
View author publications
You can also search for this author in PubMed Google Scholar
Markus S. Schröder
View author publications
You can also search for this author in PubMed Google Scholar
Sreelakshmi K. Sreenivasamurthy
View author publications
You can also search for this author in PubMed Google Scholar
Sun-Whe Kim
View author publications
You can also search for this author in PubMed Google Scholar
Sameer Phalke
View author publications
You can also search for this author in PubMed Google Scholar
Kushal Suryamohan
View author publications
You can also search for this author in PubMed Google Scholar
Kayla Lee
View author publications
You can also search for this author in PubMed Google Scholar
Papia Chakraborty
View author publications
You can also search for this author in PubMed Google Scholar
Vasumathi Kode
View author publications
You can also search for this author in PubMed Google Scholar
Xiaoshan Shi
View author publications
You can also search for this author in PubMed Google Scholar
Aditi Chatterjee
View author publications
You can also search for this author in PubMed Google Scholar
Keshava Datta
View author publications
You can also search for this author in PubMed Google Scholar
Aafaque A. Khan
View author publications
You can also search for this author in PubMed Google Scholar
Tejaswini Subbannayya
View author publications
You can also search for this author in PubMed Google Scholar
Jing Wang
View author publications
You can also search for this author in PubMed Google Scholar
Subhra Chaudhuri
View author publications
You can also search for this author in PubMed Google Scholar
Sanjiv Gupta
View author publications
You can also search for this author in PubMed Google Scholar
Braj Raj Shrivastav
View author publications
You can also search for this author in PubMed Google Scholar
Bijay S. Jaiswal
View author publications
You can also search for this author in PubMed Google Scholar
Satish S. Poojary
View author publications
You can also search for this author in PubMed Google Scholar
Shushruta Bhunia
View author publications
You can also search for this author in PubMed Google Scholar
Patricia Garcia
View author publications
You can also search for this author in PubMed Google Scholar
Carolina Bizama
View author publications
You can also search for this author in PubMed Google Scholar
Lorena Rosa
View author publications
You can also search for this author in PubMed Google Scholar
Wooil Kwon
View author publications
You can also search for this author in PubMed Google Scholar
Hongbeom Kim
View author publications
You can also search for this author in PubMed Google Scholar
Youngmin Han
View author publications
You can also search for this author in PubMed Google Scholar
Thakur Deen Yadav
View author publications
You can also search for this author in PubMed Google Scholar
Vedam L. Ramprasad
View author publications
You can also search for this author in PubMed Google Scholar
Amitabha Chaudhuri
View author publications
You can also search for this author in PubMed Google Scholar
Zora Modrusan
View author publications
You can also search for this author in PubMed Google Scholar
Juan Carlos Roa
View author publications
You can also search for this author in PubMed Google Scholar
Pramod Kumar Tiwari
View author publications
You can also search for this author in PubMed Google Scholar
Jin-Young Jang
View author publications
You can also search for this author in PubMed Google Scholar
Somasekar Seshagiri
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.P. and S.S. conceived the project. A.P. supervised sample processing, data interpretation and manuscript writing. E.W.S performed data analysis and wrote manuscript. S.D. performed RNA-seq, fusion, copy number analysis and co-wrote manuscript. H.G. supervised sample collection, processing, data interpretation and co-wrote manuscript. L.D.G. performed RNA-Seq splicing analysis and co-wrote the manuscript. M.S.S. performed variant analysis, immune microenvironment analysis and data QC. M.A.B. collected tumors and blood samples and extracted DNA and RNA. S.K.S extracted DNA, RNA and sample QC. S.-W. K. acquired cancer and normal tissue samples, clinical data collection. S. P. genomics data collection. K.S. analyzed actionable GBC alterations. K. L. single cell genomics data collection. P.C. single cell genomics data collection. A.C. supervised sample collection, processing and maintenance of cell lines. K.D. collected tumor tissue and blood. A.A.K. sample collection, storage and shipping. T.S. sample collection, storage and shipping. J.W. performed TCR repertoire analysis. S.C. genomics data collection. S.G. macro- and microscopic examination of the tissue samples. B.R.S. patient recruitment, clinical examination, surgical removal of the tumors and cholecystectomy. B.S.J. wet lab support. S.S.P. assisted with tumors and blood sample collection S.B. assisted with tumor and blood sample collection. P.G. patient recruitment and tumor and blood sample collection. C.B. patient recruitment and tumor and blood sample collection. L.R. assisted with sample collection. W.K. provided cancer and normal tissue samples, acquired patients, clinical data collection. H.K. acquired patients, clinical data collection. Y.H. acquired patients, clinical data collection. T.D.Y. sample collection. V.R. data collection. A.C. neoantigen peptide functional analysis and co-wrote the manuscript. Z.M. genomics data collection. J.C.R. macroscopic and microscopic examination of the tissue samples. P.K.T. overall supervision of sample collection, sample storage and study design, co-wrote manuscript. J.-Y. J. defined study concept, provided cancer and normal tissue samples, acquired patients, clinical data collection. S.S. assisted with analysis and co-wrote manuscript, which was reviewed and edited by the other coauthors.

Corresponding authors

Correspondence to Akhilesh Pandey, Eric W. Stawiski, Jin-Young Jang or Somasekar Seshagiri.

Ethics declarations

Competing interests

E.W.S., S.D., L.D.G., S.C., B.S.J., Z.M and S.S hold shares/options in Roche. E.W.S., A.C, and K.S hold options in MedGenome Labs Pvt. The remaining authors have no competing interests.

Additional information

Peer review information Nature Communications thanks the anonymous reviewers for their contribution to the peer review of this work.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Descriptions of Additional Supplementary Files

Supplementary Data 1

Supplementary Data 2

Supplementary Data 3

Supplementary Data 4

Supplementary Data 5

Supplementary Data 6

Supplementary Data 7

Supplementary Data 8

Supplementary Data 9

Supplementary Data 10

Supplementary Data 11

Supplementary Data 12

Supplementary Data 13

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Pandey, A., Stawiski, E.W., Durinck, S. et al. Integrated genomic analysis reveals mutated ELF3 as a potential gallbladder cancer vaccine candidate. Nat Commun 11, 4225 (2020). https://doi.org/10.1038/s41467-020-17880-4

Download citation

Received: 23 January 2020
Accepted: 23 July 2020
Published: 24 August 2020
DOI: https://doi.org/10.1038/s41467-020-17880-4

This article is cited by

Kelch-like proteins in the gastrointestinal tumors
- An-bo Fu
- Sen-feng Xiang
- Mei-dan Ying
Acta Pharmacologica Sinica (2023)
ELF3 promotes gemcitabine resistance through PKMYT1/CDK1 signaling pathway in gallbladder cancer
- Linhua Yang
- Hui Wang
- Yingbin Liu
Cellular Oncology (2023)
Topological reorganization and functional alteration of distinct genomic components in gallbladder cancer
- Guoqiang Li
- Peng Pu
- Yingbin Liu
Frontiers of Medicine (2023)
Single-cell dissection of remodeled inflammatory ecosystem in primary and metastatic gallbladder carcinoma
- Xiang Wang
- Chunliang Liu
- Hongyang Wang
Cell Discovery (2022)
Gallbladder cancer
- Juan C. Roa
- Patricia García
- Jill Koshiol
Nature Reviews Disease Primers (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.