Immunopeptidomics-based identification of naturally presented non-canonical circRNA-derived peptides

Ferreira, Humberto J.; Stevenson, Brian J.; Pak, HuiSong; Yu, Fengchao; Almeida Oliveira, Jessica; Huber, Florian; Taillandier-Coindard, Marie; Michaux, Justine; Ricart-Altimiras, Emma; Kraemer, Anne I.; Kandalaft, Lana E.; Speiser, Daniel E.; Nesvizhskii, Alexey I.; Müller, Markus; Bassani-Sternberg, Michal

doi:10.1038/s41467-024-46408-3

Download PDF

Article
Open access
Published: 15 March 2024

Immunopeptidomics-based identification of naturally presented non-canonical circRNA-derived peptides

Nature Communications volume 15, Article number: 2357 (2024) Cite this article

3713 Accesses
20 Altmetric
Metrics details

Subjects

Abstract

Circular RNAs (circRNAs) are covalently closed non-coding RNAs lacking the 5’ cap and the poly-A tail. Nevertheless, it has been demonstrated that certain circRNAs can undergo active translation. Therefore, aberrantly expressed circRNAs in human cancers could be an unexplored source of tumor-specific antigens, potentially mediating anti-tumor T cell responses. This study presents an immunopeptidomics workflow with a specific focus on generating a circRNA-specific protein fasta reference. The main goal of this workflow is to streamline the process of identifying and validating human leukocyte antigen (HLA) bound peptides potentially originating from circRNAs. We increase the analytical stringency of our workflow by retaining peptides identified independently by two mass spectrometry search engines and/or by applying a group-specific FDR for canonical-derived and circRNA-derived peptides. A subset of circRNA-derived peptides specifically encoded by the region spanning the back-splice junction (BSJ) are validated with targeted MS, and with direct Sanger sequencing of the respective source transcripts. Our workflow identifies 54 unique BSJ-spanning circRNA-derived peptides in the immunopeptidome of melanoma and lung cancer samples. Our approach enlarges the catalog of source proteins that can be explored for immunotherapy.

Integrated proteogenomic deep sequencing and analytics accurately identify non-canonical peptides in tumor immunopeptidomes

Article Open access 10 March 2020

Unannotated proteins expand the MHC-I-restricted immunopeptidome in cancer

Article 18 October 2021

MARS an improved de novo peptide candidate selection method for non-canonical antigen target discovery in cancer

Article Open access 22 January 2024

Introduction

Adoptive T cell-based immunotherapies and cancer vaccines are becoming powerful cancer treatment options¹. They leverage natural anti-cancer immunity by targeting human leukocyte antigen (HLA) bound peptides presented specifically on the surface of malignant cells². Despite unprecedented developments exploring the tumor immunopeptidome repertoire, most studies remain focused on canonical antigens, derived from protein-coding genomic regions, such as mutated neoantigens which are mostly patient-specific^1,3,4. Immunogenic tumor-specific antigens that are shared across patients might provide a more promising approach in terms of treatment effectiveness, notably because several patients can benefit from the same immunotherapy treatment⁵. In recent years, mass spectrometry (MS) based immunopeptidomics coupled with novel proteogenomic approaches identified novel canonical and non-canonical cancer-specific antigens resulting from genetic and epigenetic alterations during cancer progression^6,7, affecting the cellular transcriptome⁸, translatome⁹, proteome^10,11 and the antigen presentation machinery¹². Remarkably, circRNA translation has also been proposed as an unexplored source of antigens in cancer¹³.

circRNAs, initially thought to be a by-product of transcription, are covalently closed sequences of RNA. In humans, such non-polyadenylated transcripts are produced by a non-canonical splicing process, known as back-splicing, between two non-sequential exons where the 3’ end of a downstream exon is fused to the 5’ end of an upstream exon¹⁴. The generated junction is called back-splicing junction (BSJ). The first described circRNAs were composed only of exonic sequences, but the portfolio was expanded to include intronic and exonic-intronic circRNAs, containing introns and exons/introns, respectively. In general, the expression of most circRNAs is low compared to their linear counterparts, but some circRNAs are highly abundant and represent the main transcribed products of the host genes¹⁵. Because of their covalently closed-loop structures, circRNAs are protected from exonucleases. CircRNAs’ inherent stability makes them highly promising biomarkers across diverse human diseases. For example, altered levels of circRNAs can be detected in urine and blood of cancer patients^16,17,18,19. Their biogenesis is regulated by different RNA-binding proteins (RBPs), such as quaking (QKI) and adenosine deaminases acting on RNA (ADARs) proteins, which have been implicated in tumor progression^20,21,22,23.

Functionally, one of the most studied roles of circRNAs is their potential to act as a sponge for microRNAs (miRNAs), sequestering these regulatory molecules so that they no longer target mRNAs for degradation or translation inhibition²⁴. However, circRNAs have additional gene expression regulatory functions. They compete with canonical splicing²⁵, enhance parental gene expression by interacting with the Pol II machinery^26,27 or act epigenetically by regulating DNA methylation and active DNA demethylation²⁸. CircRNAs can also interact and sequester some RBPs, modulating their activity²⁹. Despite being considered “non-coding” transcripts, lacking a 5′ cap and 3′ poly(A) tail, their sequences often harbor regulatory sequences which can promote cap-independent translation, driven by internal ribosome entry sites (IRES)³⁰ or consensus N⁶-methyladenosine-modified motifs^31,32. Extensive analyses of the translation potential of endogenous circRNAs using MS data demonstrated the enrichment of IRES-like short elements in endogenous circRNAs able to initiate their translation³³. Efficient circRNA translation has also been demonstrated in studies using exogenous circRNAs with infinite open reading frames (ORFs) lacking stop codons, undergoing rolling circle translation³⁴. Accumulating evidence has shown the potential of circRNAs to be translated into functional proteins in cancer³⁵. For example, the translation of a CTNNB1 circRNA (hsa_circ_0004194) gives rise to a novel isoform of β-catenin, harboring a distinct, shorter C-terminus via the creation of a new stop codon after circularization. This isoform was implicated in tumor growth by activating the Wnt/β-catenin pathway³⁶, emphasizing the role of CTNNB1 in cancer through non-genetic alterations by the creation and translation of a circRNA, rather than via activating mutations³⁷.

Importantly, circRNAs may encode peptide sequences spanning the BSJ that are distinct from those sequences generated by the canonical splicing process. However, as circRNAs are mostly predicted to have short ORFs, especially those spanning the BSJ, their detectability by MS-based proteomics and by ribosome profiling is challenging^38,39,40,41. The lack of experimental evidence for these short ORFs at the proteome level could be explained by their possible higher instability⁴² resulting in proteasomal degradation³³. However, unstable proteins with short half-lives are ideal sources for HLA class I (HLA-I) peptides, and therefore, circRNAs could be an interesting source of neoantigens that might play a role in tumor immunosurveillance⁴². A combination of ribo-seq profiling and shotgun proteomics MS have been used to predict the circRNA-derived translatome and associated proteome^39,43. Furthermore, in a recent study, the discovery of putative circRNA-encoded proteins was accompanied by the detection of two HLA-I-associated peptides⁴⁴, but neither of them was encoded by the region overlapping the BSJ, therefore, they could potentially derive from the linear transcript. In another study, 13 predicted circRNA-derived antigens from hepatobiliary tumor organoids were validated with MS. However, interpretation of the results is difficult since an unusual sample processing method was used, in which, following HLA-I complex purification, the eluates were submitted to gel electrophoresis (SDS-PAGE) and in-gel trypsin digestion prior to peptide cleanup and MS analysis⁴⁵, steps that are expected to be deleterious for HLA bound peptides and are not typically employed in immunopeptidomics⁴⁶.

In this study we developed an approach to identify HLA-presented circRNA-derived peptides by MS immunopeptidomics. We focused on melanoma and lung cancer as tumor models where aberrant expression of circRNAs has been already documented^47,48. The workflow included the design and generation of a generic reference database containing trimmed circRNA-derived ORFs spanning the BSJ and initiated by the canonical start codon ATG, allowing the MS-based identification of circRNA-derived antigens overlapping the BSJ. Validation of the MS-detected peptides was performed by parallel reaction monitoring (PRM) and the presence of the corresponding source circRNAs transcripts was confirmed by divergent RT-PCR and Sanger sequencing. We identified 29 different circRNA-derived peptide sequences spanning the BSJ in two melanoma samples. After treatment with IFNγ or the proteasome inhibitor MG132, the presentation of circRNA-derived peptides was controlled in a manner comparable to canonical peptides. Furthermore, we discovered 21 unique circRNA-derived HLA-I and HLA-II peptide sequences spanning the BSJ in a cohort of eight lung cancer patient tumors. We discussed challenges associated with the detection of circRNA-derived immunopeptides and the importance of exploring their presentation across tumor and healthy tissues. Our approach enabled the identification of tumor-associated antigens encoded by circRNAs, that have the potential to be promising targets for immunotherapy.

Results

A dedicated workflow for the detection of circRNA-derived HLA peptide candidates

To explore the presentation of unique peptides potentially derived from circRNA sources spanning the BSJ that are not found in any of the translation frames of the canonical linear gene transcripts, we generated a reference fasta file of circRNA sequences present in circBase⁴⁹ (Fig. 1a). circBase is the first repository of circRNAs which merged different datasets of circRNAs, offering an interface with standardized annotations and unique identifiers. Using a targeted approach, we identified and in silico translated all BSJ-containing “stop-to-stop” circRNA fragments that had a canonical translation initiation codon ATG upstream of the BSJ (see Methods section and Supplementary Fig. 1). Stop-to-stop sequences that did not contain this codon were discarded. HLA-I peptides are short (8-15 amino acids, mostly 9 mers), while HLA-II peptides may reach up to 25 amino acids (average length of around 15-16 amino acids). Therefore, where possible, sequences were further trimmed to a length of with up to 49 amino acids covering the transcript position corresponding to at least one BSJ (24 amino acids upstream the BSJ, one amino acid partially encoded by the BSJ and 24 amino acids downstream the BSJ). This made the circRNA-derived BSJ-ORF fasta reference suitable for both HLA-I and HLA-II MS-based immunopeptidomics workflows (Fig. 1b; see Methods section)⁵⁰. The trimmed circRNA-derived putative ORF fasta sequences spanning the BSJ and initiated by the canonical start codon ATG were concatenated with a human UniProt fasta file⁵¹, containing canonical protein sequences, before performing the MS database search. To apply stringent cutoffs and to minimize false identifications, the immunopeptidomics MS raw files were initially searched against the concatenated fasta reference with two search engines, MaxQuant⁵² and Comet⁵³. The NewAnce tool⁶ was used to calculate a group-specific FDR of 0.03, for both MaxQuant and Comet, for peptides derived from the human UniProt entries (namely group protein-coding ‘PC’) and peptides derived uniquely from the circRNA-derived BSJ-ORF fasta reference (group ‘circRNA’). Only peptides identified by both search engines were retained. In this study, we were particularly interested in characterizing circRNA-derived peptides that span the BSJ, named ‘circRNA-BSJ’. Importantly, our searches against the circRNA group resulted in identification of other circRNA-derived peptides that do not overlap the BSJ (specifically named below as ‘circRNA-not-BSJ’ when applicable), that could potentially derive from out-of-frame translation events of the matched linear RNAs (with canonical or alternative translational initiation sites), which also represent a potential source of non-canonical antigens. Their inclusion in the study was not within the study’s intended scope. They were added for the sake of comparison and completeness. Furthermore, an additional step of peptide mapping against the NCBI Standard Protein BLAST database was performed to remove peptides mapping to annotated coding sequences within this much larger reference (Fig. 1c). Experimental validations of peptide identification were carried out by introducing heavy-labeled synthetic peptides into newly generated immunopeptidome samples, followed by PRM analyses (Fig. 1d). Furthermore, the confirmation of the presence of circRNA transcripts generating the back-splicing event was accomplished through divergent RT-PCR, with subsequent direct Sanger sequencing of the amplicons (Fig. 1e).

**Fig. 1: Immunopeptidomics workflow for identification and validation of circRNA-derived HLA-I bound peptides.**

HLA-I peptides eluted from the T1185B melanoma cell line were analyzed by LC-MS/MS with data-dependent acquisition (DDA). In total, we identified 17,770 PC peptides, 122 circRNA peptides, including 19 circRNA-BSJ peptides that overlap the BSJ by one or two amino acids, depending on the relative position between the codons and the BSJ (Table 1, Fig. 2a). PC derived peptides exhibited the expected length distribution for HLA-I peptides (average length of 9.80), while circRNA, and the subset of circRNA-BSJ peptides were overall longer (average length of 10.07 and 10.11, respectively; Fig. 2b). Indeed, extended HLA-I restricted peptides, longer than 11 amino acids, effectively stimulate CD8⁺ T cell responses within the typical range for epitope-specific CD8 + T cells, however, longer, “bulging” peptides may pose challenges for T-cell receptor recognition compared to shorter peptides⁵⁴. 91%, 95% and 89% of the PC, circRNA, and circRNA-BSJ derived peptides, respectively, were predicted to bind any of the HLA-I molecules expressed in T1185B cells with a rank threshold < 2% (Fig. 2c). Overall, circRNA-BSJ spanned the entire range of peptide intensities (Fig. 2d), with 32% of circRNA-BSJ peptides consistently detected in all three biological replicates (Fig. 2e). This percentage is within the range observed for PC (36%) and all peptides derived from circRNA (32%) groups. We observed a global enrichment in HLA-A*68:01 bound PC peptides that was even more profound in the circRNA and circRNA-BSJ groups (Fig. 2f).

Table 1 HLA-I circRNA-derived peptides, overlapping the BSJ encoding region, detected in T1185B and Mel-1

Full size table

**Fig. 2: Validation of circRNA-derived peptides overlapping the back-splice junction.**

Comparable results were obtained when the workflow was applied to another cell line and matched tumor tissue derived from melanoma patient Mel-1. Here, 9,666 PC peptides and 61 circRNA-derived peptides were detected in the Mel-1 cell line, while in the tumor tissues, 15,443 PC and 55 circRNA-derived peptides, respectively, were identified (Supplementary Fig. 2a). In total, 11 unique circRNA-BSJ peptides were identified in the Mel-1 samples (Table 1). The expected peptide length distribution and high percentage of HLA-binders for both PC and circRNA groups confirmed the high quality of the data (Supplementary Fig. 2b, c), and the intensity of circRNA-BSJ peptides was found within the range of all other PC peptides (Supplementary Fig. 2d). With one exception of a peptide predicted to bind HLA-A*02:01 molecule (identified in both cell line and tumor tissue), all other circRNA-BSJ peptides were predicted to bind HLA-A*03:01. The HLA allotype distribution of the peptides predicted to bind the respective HLA molecules was remarkably different between the Mel-1 cell line and the matched tumor tissue (Supplementary Fig. 2e), suggesting a dysregulation of the HLA expression in the expanded primary cell line. Nevertheless, a clear enrichment in HLA-A*03:01 bound circRNA-derived peptides was observed in both sample types.

Validation approaches for candidate circRNA-BSJ derived peptides

The resulting spectrum matches of candidate circRNA-BSJ peptides were manually checked using PDV spectrum visualization tool⁵⁵ (Supplementary Fig. 3). Furthermore, to validate the identification of peptides, we spiked heavy-labeled synthetic peptides corresponding to our candidates into newly generated immunopeptidome samples from T1185B and Mel-1 cell lines. We subjected the cell line samples to PRM analyses and compared the co-elution profile and the fragmentation pattern of the synthetic heavy-labelled and endogenous light peptides. Applying this method, we validated seven and four unique circRNA-BSJ derived peptides in T1185B and Mel-1 cell lines, respectively (Table 1, Supplementary Fig. 4). Furthermore, the presence of circRNA transcripts generating the back-splicing event in the samples was confirmed with a divergent RT-PCR followed by direct Sanger sequencing of the amplicons. In total, the back-splice junctions of eleven and three circRNAs were validated in T1185B and Mel-1, respectively (Table 1, Supplementary Figs. 5, 6).

As an example, the circRNA-BSJ peptide DLYNGSSIVS[R], predicted to bind HLA-A*68:01 was identified in the T1185B cell line. The square brackets in the peptide sequence above represent the amino acid encoded by the codon spanning the BSJ. By PRM we detected the co-elution of the heavy labelled peptide and the endogenous light peptide and the similar MS/MS fragmentation patterns (Fig. 2g and Supplementary Fig. 4). This peptide matched four potential circRNA-derived ORFs: hsa_circ_0015364, hsa_circ_0015366, hsa_circ_0111261 and hsa_circ_0111262, all of them hosted by gene COP1 (also known as RFWD2) (Fig. 2h). Divergent RT-PCR followed by direct Sanger sequencing confirmed that at least two of those circRNAs are expressed in T1185B (Fig. 2i, j), representing two possible sources for this HLA peptide.

The circRNA-BSJ peptide [R]VFEVYHTTVLK, matching the circRNAs hsa_circ_0003137 and hsa_circ_0004030 from CTNNB1 gene, was detected and validated with PRM and Sanger sequencing in both T1185B and Mel-1 samples (Supplementary Fig. 4, Supplementary Fig 5, and Supplementary Fig. 6), and its shorter circRNA-not-BSJ version EVYHTTVLK was also detected. An additional shorter peptide VYHTTVLK and the partially overlapping peptide HTTVLKIQR from the same potential circRNA ORFs, were detected in Mel-1 and T1185B cell lines, respectively. Interestingly, [R]VFEVYHTTVLK and EVYHTTVLK were previously reported to be detected in melanoma as well as benign human tissues and to be derived potentially from translation of a novel upstream ORF in CTNNB1 gene through an alternative translational initiation site⁵⁶. Our results suggest a potential circRNA source, that importantly, contains the canonical translation initiation ATG codon.

Notably, although not directly related, an amplicon obtained through unspecific amplification during RT-PCR of the CTNNB1 hsa_circ_0003137, matched the CTNNB1 circRNA hsa_circ_0004194 (Supplementary Figs. 5, 6) which doesn’t encode for [R]VFEVYHTTVLK. hsa_circ_0004194 was reported to be associated with the translation of a novel isoform of β-catenin with a shorter C-terminus leading to the promotion of cell growth in hepatocellular carcinoma³⁶, suggesting that additional potential peptides may be derived from various novel coding ORFs in the CTNNB1 gene.

Another interesting example is the hsa_circ_0111569 circRNA generated by back-splicing between exon 10 - exon 3 of the CDC73 (NM_024529.5) host gene (Fig. 3a). This circRNA has an ORF of 735 nucleotides with multiple start codons but without any stop codon (Fig. 3b), having the potential for rolling circle translation³⁴. This unique ORF encodes for the circRNA-BSJ peptide [TT]ENIPVVRR, a predicted HLA-A*68:01 ligand, that we identified and validated by PRM in the T1185B immunopeptidome (Fig. 3c and Supplementary Fig. 4). Furthermore, the presence of this junction was validated through RT-PCR and Sanger sequencing in T1185B samples (Fig. 3d, e). Curiously, in the canonical protein, the two exon junctions involved in the back-splicing event are also spanned by two other MS-identified canonical PC peptides (AT)ENIPVVRR and SV(TE)GASAR, where the amino acids encoded by the two codons spanning the linear spliced junctions are represented above within the brackets. These two peptides are also predicted to bind HLA-A*68:01, and we validated them with PRM (Fig. 3c and Supplementary Fig. 4). Both peptides overlap the linear exon-exon junctions, suggesting that this translated region may have some features that promote its processing and presentation.

**Fig. 3: Antigen presentation of *CDC73* gene.**

Comparable presentation levels of circRNA-BSJ and PC peptides post IFNγ treatment or proteasome inhibition

To further explore the biogenesis of circRNA-BSJ derived HLA-I bound peptides, we treated T1185B cells with either IFNγ or the proteasome inhibitor MG132. A data-independent acquisition (DIA) MS acquisition workflow was applied to compare the presentation levels of peptides derived from circRNA and PC sources (Fig. 4a). To properly control false identifications and to properly account for lower likelihood of true identification of non-canonical sequences⁵⁷, we implemented a group-specific FDR control in FragPipe^58,59,60,61. When calculating the FDR, canonical and non-canonical peptides were classified into different groups. The FDR was calculated for each group separately because different groups have different score distributions. Here, we applied a group-specific FDR of 0.03, allowing stringent control of error in the circRNA search space. Furthermore, to increase stringency of our analysis, hybrid DIA analyses were performed also using Spectronaut with global FDR of 0.01, and only peptides identified by both tools were retained for further analysis (Fig. 4b and Supplementary Fig. 7a–d). Using this approach, 25,284 PC and 27 unique circRNA-BSJ peptides were identified (Supplementary Data 1), 16 of them were in common with the DDA analysis described above. Again, the HLA allelic distribution revealed a prominent presentation of HLA-A*68:01 bound peptides in the PC and circRNA peptidomes (Fig. 4b, c). Intensity values of all quantified peptides clustered hierarchically based on the treatment (see Methods section, Supplementary Fig. 7e), therefore, we performed a differential presentation analysis. We detected a highly significant increase in the presentation of the HLA-B*40:01 bound peptides following IFNγ treatment (Fig. 4d and Supplementary Fig. 8), in agreement with previous studies that demonstrated preferential IFNγ-induced upregulation of HLA-B expression⁶². Additionally, gene ontology enrichment analysis comparing the source protein annotation of the enriched peptides with the global distribution (using the Student’s t-test difference between treatment groups) revealed a significantly higher presentation of peptides derived from proteins associated with response to type I interferon following IFNγ treatment (presentation enrichment score: 0.44, Benj. Hoch. FDR: 1.15E-8; Fig. 4e and Supplementary Data 2). The proteasome inhibitor MG132 treatment led to an enrichment of peptides mapping essentially to proteasome subunits (KEGG term: Proteasome, presentation enrichment score: 0.32, Benj. Hoch. FDR: 8.52E-06; Fig. 4f and Supplementary Data 3) and had a broad impact on the peptidome, by increasing and decreasing the presentation of a large fraction of HLA-A*68:01 bound peptides (Supplementary Fig. 8). We found a minor, yet significant, downregulation of circRNA-derived peptides compared to PC peptides, upon IFNγ treatment (One-way ANOVA and Sidak’s multiple comparisons test, adjusted p-value = 0.0178; Fig. 4g), mainly related to the relative increased presentation of HLA-B, and to a lower extent also of HLA-C alleles, that do not mediate presentation of circRNA-BSJ peptides that are associated with HLA-A*68:01. Indeed, by subsequently restricting the analysis only to the ligands of HLA-A*68:01, we found no significant difference in the presentation of circRNA-BSJ peptides following IFNγ or MG132 treatments (One-way ANOVA and Sidak’s multiple comparisons test, adjusted p-value = 0.9976 for IFNγ and adjusted p-value = 0.4064 for MG132 treatments; Fig. 4g, h). Overall, our analysis suggests that circRNA-BSJ and PC peptides are similarly sampled for processing and presentation.

**Fig. 4: IFNγ or MG132 treatments similarly impacted the presentation of circRNA-derived and PC peptides in T1185B cells.**

Identification of potentially unique lung cancer associated circRNA-BSJ derived peptides

CircRNAs-derived peptides, especially those that span the BSJ, are an interesting potential source of neoantigens that can be used to precisely target cancer cells. However, defining their tissue specificity poses a challenge. Therefore, we leveraged recently published large HLA-I and HLA-II immunopeptidomics DIA and DDA datasets of tumoral and adjacent healthy matched multi-region tissues from eight lung cancer patients⁶³ and searched for circRNA-BSJ derived peptides presented specifically, or to a higher extent, in the tumors. With FragPipe, applying a group-specific FDR of 0.03, we built spectral libraries with the DDA data and searched the DIA data against it (Fig. 5a). Concerning HLA-I immunopeptidomes, overall, 119,084 peptide sequences were identified, ranging from 21,887 to 43,566 peptides across the patient’s tumors and 16,637−34,888 peptides in the adjacent healthy tissues. Correspondingly, 42–154 and 32–128 circRNA-derived peptide sequences were identified in the tumor and healthy tissues, of which 19 were circRNA-BSJ (Fig. 5b, c, Supplementary Fig. 9, and Table 2). Fifteen of these circRNA-BSJ peptides were predicted to bind with affinity rank < 2% any of the HLA-I molecules expressed in at least one of the patients (Supplementary Data 4). Although none of these were deemed cancer-related based on their host gene expression in TCGA/GTEx (see Methods), six circRNA-BSJ peptides were uniquely detected in the tumor tissues, including the peptide ILDKKVE[KL]. This peptide is predicted to be encoded by the circRNA hsa_circ_0076651 which is another example of an ORF with putative infinite translation, hosted by the HSP90AB1 gene. Interestingly, the peptide ILDKKVE[KL] was detected in five of the lung cancer patients, exclusively in tumor tissues, and it was not identified in any benign tissue included in the HLA Ligand Atlas⁶⁴. Interestingly, the PC ILDKKVEKV peptide counterpart, that differs by only one amino acid, is ubiquitous and was detected in both tumor and healthy tissues. Both peptides are predicted to be strong binders of HLA-A*02:02 allele molecule with a % rank of 0.029 and 0.020 (NetMHCpan4.1), respectively. Since T cells that bind to ILDKKVEKV within the HLA-A*02:01 context were observed in the human T-cell repertoire⁶⁵, the potential immunogenicity of the ILDKKVE[KL] peptide becomes a compelling subject for further investigation. Furthermore, we searched the paired HLA-II immunopeptidomics data using the same workflow and identified overall 59,063 peptide sequences, ranging from 12,377–23,881 peptides across the patient’s tumors and 9083–19,263 peptides in the adjacent healthy tissues. Nevertheless, only two circRNA-BSJ HLA-II peptides were identified (Supplementary Fig. 9 and Supplementary Data 5), both predicted to be binders (NetMHCIIpan4.1, Supplementary Data 6), in agreement with the previous report that demonstrated very low contribution of non-canonical sources to the HLA-II immunopeptidome⁶³.

**Fig. 5: Identification of potentially lung cancer-associated circRNA-BSJ peptides.**

Table 2 HLA-I circRNA-derived peptides, overlapping the BSJ encoding region, detected in the lung cohort (eight patients, six with paired healthy tissue) through DDA-DIA FragPipe analysis with group-specific FDR calculation

Full size table

Discussion

The human translatome potentially contains many undiscovered ORFs originated from the various translation frames of the linear transcripts, as revealed by the discovery of thousands of novel unannotated open reading frames (nuORFs) which populate the immunopeptidome⁹. Likewise, the expansion of the cancer related translatome repertoire to circRNA-derived ORFs, and especially those spanning the BSJ region represents an additional source of neoantigens that can be presented and potentially improve current immunotherapies. In the current study we developed a workflow to specifically identify HLA-presented circRNA-derived peptides, directing our MS search to peptides derived from predicted ORFs with a canonical initiation start codon and encoded within the region overlapping the BSJ. Our approach of considering only a flanking region with a maximum of 24 amino acids around the BSJ, reduces the search space while allowing the identification of 25-mer peptides having at least one amino acid spanning the BSJ. Thus, the BSJ-focused circRNA reference is also suitable for HLA-II immunopeptidomics studies.

With the MS-based immunopeptidomics approach, we identified circRNA-derived peptides spanning the BSJ region in two melanoma samples and in a cohort of eight multi-region lung cancer tissues. In contrast to previous studies^44,45, we used a stringent and standardized immunopeptidomics workflow by combining different MS search engines and/or applying a group-specific FDR to decrease the number of false identifications. Treatment of T1185B melanoma cells with IFNγ or the proteasome inhibitor MG132 did not alter differently the presentation of PC and circRNA-BSJ derived peptides, suggesting they follow similar routes of antigen processing and presentation. IFNγ treatment induced the upregulation of HLA expression, mainly the HLA-B*40:01, and induced the presentation of IFNγ regulated genes that are often massively upregulated following treatment⁶⁶. Most circRNA-derived peptides we detected in T1185B cells are predicted to bind HLA-A*68:01. In general, their normalized presentation level is comparable to that of the canonical HLA-A*68:01 peptidome, but there is a tendency for lower presentation after treatment. The limited expression of their source proteins could be a factor that impedes their presentation, even when there is an upregulation of HLA expression. Furthermore, proteasome degradation is a main source of peptides for HLA presentation. Therefore, we explored the impact of proteasome inhibition on the presentation of canonical and circRNA-derived peptides in T1185B cells treated with the reversible proteasome inhibitor MG132. The degradation of dysfunctional proteasomes through autophagy is a known phenomenon^67,68 that could potentially result in the increased presentation of the degradation products we observed in the treated cells. Moreover, a positive feedback loop might result in increased expression, albeit at substoichiometric levels, of certain proteasome subunits. Due to misfolding, these subunits could be swiftly degraded, potentially contributing to the immunopeptidome through this mechanism. In our experimental setup, the immunopeptidome derived from circRNAs exhibited a presentation profile that closely resembled the canonical peptidome. This observation suggests that it shares a similar dependence on the proteasomal pathway.

Cancer-specific translation of circRNAs and associated antigen presentation can lead to new therapeutic targets for tumor-immunotherapy. The existence of cancer-specific circRNAs has been shown at the transcriptional level⁶⁹. Interestingly, QKI is upregulated during epithelial-mesenchymal transition (EMT) and boosts the abundance of circRNAs^20,70. In our study we addressed cancer-specificity by analyzing HLA-I immunopeptidomic data derived from a cohort of eight lung cancer patients, comprising tumor and matched healthy tissues. Despite the limited number of patients and the absence of circRNA expression data, we identified a few HLA-presented circRNA-derived peptides exclusively detected in cancer tissues. For example, the circRNA-BSJ derived peptide ILDKKVE[KL] is predicted to be encoded within the hsa_circ_0076651 in an open reading frame lacking stop codons through rolling circle translation. This circRNA is hosted by the HSP90AB1 gene which expression is known to be upregulated in lung cancer and linked with poor overall survival after surgery⁷¹. Remarkably, while peptides derived from HSP90AB1 are often detected in the human immunopeptidome, it has been demonstrated that T cells that bind to ILDKKVEKV presented by HLA-A*02:01 molecules were observed in the human T-cell repertoire and were activated following viral infection⁶⁵. As the circRNA-BSJ ILDKKVE[KL] peptide was identified solely within tumor tissues of five lung cancer patients, the exploration of its potential immunogenicity emerges as a compelling avenue for additional research. Beyond those circRNAs with cancer restricted expression, cancer-specific translation of circRNAs could potentially expand the repertoire of immunopeptides that are uniquely presented in cancer⁷². Hence, circRNA-derived peptides not overlapping the BSJ might also be of interest, as they represent a potential source of translated non-canonical peptides.

In the present work, we based our circRNA space search on the circBase database, while other circRNA public databases, such as riboCirc⁴³, TransCirc³⁹ and CSCD2⁶⁹ could also be used to guide the discovery of circRNA-derived peptides. Moreover, the large search space, which negatively affects the sensitivity of MS search engines⁷³, could be further adapted using prior knowledge about circRNA expression in a specific experimental setting. Newly discovered circRNA sequences obtained through capture sequencing⁷⁴ and nanopore sequencing⁷⁵ could be used as input for the generation of a sample-specific circRNA reference file to potentially identify sample-specific circRNA-derived peptides. Alternatively, in the absence of circRNA expression data, the expression of the linear counterparts might be used to restrict the search space to circRNAs from the same host genes, assuming their co-existence^25,76. In conclusion, we have established a dedicated workflow to identify HLA-presented circRNA-derived peptides. We exemplified its utility in identifying cancer-related peptides that can be further explored as candidates for immunotherapy. Our approach is versatile, and candidate peptides from circRNAs can be investigated in various biological and pathological contexts.

Methods

Patient samples, cell lines and cell culture

An informed consent was given by the participants, according to the requirements of the institutional review board (Ethics Commission, Centre hospitalier universitaire Vaudois, CHUV). Cell line T1185B was derived from non-lymphoid metastasis of a melanoma patient at the Ludwig Institute for Cancer Research, Department of Oncology, University of Lausanne⁷⁷. Cell line was grown in RPMI 1640 Medium GlutaMAX™ Supplement (Gibco) with 0.55 mM L-arginine (Sigma), 0.24 mM L-asparagine (Sigma), 1.5 mM L-Glutamine (Gibco), 10 mM HEPES (Gibco), 10% of heat-inactivated fetal bovine serum (FBS), 100 U/mL penicillin and 100 μg/mL streptomycin (BioConcept). Snap-frozen tumor tissues from different regions of a lymph node of Mel-1 melanoma patient (clinical study: NCT03475134) were collected and stored at −80 °C. A cell line was generated from the same patient’s tumor at the CTE Biobank (CHUV) and grown in RPMI 1640 Medium GlutaMAX™ Supplement (Cat# 61870010, Gibco) with 10% of non-heat inactivated FBS, 100 U/mL penicillin and 100 μg/mL streptomycin. After in vitro expansion, both cell lines were trypsinized, washed twice in PBS and dry pellets containing 1 × 10⁸–2 × 10⁸ cells were collected and stored at −80 °C, before HLA-I immunoprecipitation (HLA-IP) workflow.

Treatment with the proteasome inhibitor MG132 and IFNγ

T1185B cells were treated with different concentrations of MG132 (S2619, Selleckchem): 1 μM, 5 μM and 10 μM or DMSO (MG132 vehicle) for 6 h and 24 h, and with 100 IU/mL IFNγ (130-096-484, Miltenyi Biotec) or water (IFNγ vehicle) for 48 h. After treatments, cells were harvested, and cell pellets were collected for HLA-IP. After purification, immunopeptides from each condition of the MG132 treatment were measured by MS in technical duplicates. Three biological replicates were used in each condition of the IFNγ treatment (one MS injection each).

Purification of HLA-I peptides and LC-MS/MS analysis

HLA-I immunoprecipitation was performed using the Waters Positive Pressure-96 Processor (Waters, Milford, MA)^6,66 and the number of samples and replicates are indicated in Supplementary Data 7. Shortly, protein-A Sepharose 4B (Pro-A) beads (Invitrogen) were used to purify W6/32 monoclonal antibodies from the supernatant of HB95 hybridoma cells (ATCC HB-95). After antibody crosslinking, Pro-A beads were used for immunoaffinity purification of HLA-I complexes from tissue or cell line lysates. HLA-I peptides were then purified using a C18 solid phase extraction (SPE) and dried using vacuum centrifugation (Concentrator plus, Eppendorf). Samples were stored at −80 °C if not immediately submitted to mass spectrometry analysis. Finally, immunopeptides were re-suspended in 2% ACN and 0.1% FA (formic acid). iRT peptides (Biognosis, Schlieren, Switzerland) were spiked into in the samples as indicated in Supplementary Data 7 and analyzed by LC-MS/MS.

Liquid chromatography and mass spectrometry (LC-MS)

The LC-MS system consisted of an Easy-nLC 1200 coupled to Q Exactive HF-X mass spectrometer (ThermoFisher scientific, Bremen, Germany) or to Eclipse tribrid mass spectrometer (ThermoFisher Scientific, San Jose, USA). The peptides were eluted on a 450 mm analytical column (8 μm tip, 75 μm ID) packed with ReproSil-Pur C18 (1.9 μm particle size, 120 A pore size, Dr Maisch, GmbH) and separated at a flow rate of 250 nL/min as described in ref. ⁶⁶. For DDA measurements, the top 20 most abundant precursor ions selection was performed on the Q Exactive as described⁶⁶. For DIA, the Eclipse tribrid mass spectrometer was used to sample ions. The cycle of acquisitions consists of a full MS scan from 300 to 1650 m/z (R = 120,000, ion accumulation time of 60 ms and normalized AGC of 250%) and 22 DIA MS/MS scans in the orbitrap. For each DIA MS/MS scan, a resolution of 30,000, a normalized AGC of 250%, and a stepped normalized collision energy (27, 30, and 32) were used. The maximum ion accumulation was set to auto, the fixed first mass was set to 200 m/z, and the overlap between consecutive MS/MS scans was 1 m/z as described in ref. ⁷⁸.

Parallel reaction monitoring (PRM)

Synthetic heavy labelled peptides were ordered from Thermo Fisher Scientific as crude (PePotec grade 3). After re-suspension in 2% ACN in 0.1% FA, synthetic peptides were individually analyzed by MS to confirm the lack of contaminating light counterpart. In a second step, synthetic peptides were spiked into the endogenous immunopeptides (0.5 or 1 pmol μL⁻¹) and PRM was performed on both light (endogenous) and heavy peptides⁶ (Supplementary Table 1). Collected data was analyzed using Skyline (64-bit, v20.2.0.343), using an ion mass tolerance of 0.05 m/z. The resulting MS/MS spectra were further sequenced with the mass spectral peak labeling tool pLabel^TM (v2.4.3)^79,80. Following manual inspection of the results, peptides were annotated as PRM-validated (+) and PRM non-validated (-). Other tested peptides were defined as ‘failed QC’ (due to too long elution profile), and as ‘inconclusive’ due to noisy signal (Table 1).

Construction of reference FASTA files for the identification of non-canonical circRNA peptides by mass spectrometry

To explore the contribution of circRNA sources to the immunopeptidome, we first generated a reference file of circRNA sequences present in the circBase database (http://www.circbase.org/)⁴⁹. circBase holds more than 140,000 circRNAs from exonic, intronic and intergenic loci, from both coding and non-coding regions, represented by linear sequences. The BSJ and its sequence context was created for each circRNA by joining the 3’ end to the 5’ start of the linear sequence. For illustration purposes, Fig. 1a, b refer to circRNAs composed solely by exons, but the strategy was applied to all putative spliced circRNA sequences from circBase. To facilitate the in silico translation of the BSJ-containing circRNA fragments, we concatenated four copies of the circRNA sequence into a single sequence, termed “4x circRNA” in which the circRNA reading frame changes at each of the three internal BSJs (Supplementary Fig. 1a). In this way, all the possible BSJ reading frame transitions are covered in a single in silico translation of the “4x circRNA” sequence. In circRNA sequences that contain an integral number of codons, the reading frame does not change at the BSJ, so only two concatenated circRNA sequences are required, with each invariable reading frame initiated at nucleotide position 1, 2 or 3 in the “2x circRNA” concatenated sequence (Supplementary Fig. 1b). For each 4x (or 2x) circRNA sequence we assembled a list of the sequence-based nucleotide coordinates specifying all ATG codons, the codon containing each BSJ and all stop codons. This list of elements, sorted by coordinate order, was then traversed to isolate one or more BSJ elements bounded by stops (eg. STP-2350, ATG-2368, BSJ-2722, STP-2770) that have the potential to be translated. These “stop-to-stop” sequences were in silico translated into peptide sequences (excluding the stop codons), and the amino acids N-terminal to the first methionine residue were removed. Stop-to-stop elements that did not contain an ATG upstream the BSJ were discarded. Where possible, peptides were further truncated such that the final sequence contained 24 amino acids flanking the BSJ-containing residue (or 23 amino acids if the BSJ was located between two codons). This made the sequences suitable for both HLA-I and HLA-II MS-based immunopeptidomics workflows. The resulting circRNA-derived BSJ-ORF fasta file contains sequences with up to 49 amino acids covering the transcript position corresponding to at least one BSJ (24 amino acids upstream the BSJ, one amino acid partially encoded by the BSJ and 24 amino acids downstream of the BSJ). These sequences were concatenated with a human UniProt fasta file with isoforms [Reviewed (Swiss-Prot), 42362 entries, downloaded on 2022-03-07]⁵¹ before performing the MS database search. The concatenated fasta file was adapted for compatibility with the group-specific FDR calculation in FragPipe following the structure of UniProt fasta file headers^50,51. Two groups were assigned through the attribution of a different number in the Protein Existence field of the fasta headers (PE). PE = 1 was attributed to the protein group (including common MS protein contaminants) and PE = 4 was attributed to the circRNA group, allowing FragPipe to annotate sequences to the different groups for the group-specific FDR calculation.

Mass spectrometry database search workflow

DDA MS search with group-specific FDR: MS-derived raw files resultant from three biological replicates of T1185B (Supplementary Data 7) were searched using MaxQuant⁵² (version 2.1.0.0) with a PSM FDR of 0.1 and Comet⁵³ against the generated reference fasta file containing both UniProt and the trimmed circRNA-derived putative ORFs around the BSJ and initiated by the canonical start codon ATG. Outputs were then intersected by NewAnce (v1.6) (https://github.com/bassanilab/NewAnce), setting a group-specific FDR of 0.03 for protein- and circRNA-derived peptides. The search was done setting a nonspecific protein digestion cleavage, no fixed modifications, methionine oxidation and protein N-term acetylation as variable modifications, and restricting the peptide length to 8–15 amino acids. Same approach was used for the cell line and tumor tissues of patient Mel-1 patient (two biological replicates and three different lymph node regions, respectively). NewAnce comprised a PDV format output which was used to visualize a representative spectrum/best PSM of the identified circRNA-derived peptides. PDV 1.7.4⁵⁵ was used with a tolerance of 10 ppm.

Hybrid DIA MS search: DIA files corresponding to the immunopeptidomics samples of MG132 and IFNγ treatment of T1185B cells were searched using a hybrid DIA approach using two computational tools, FragPipe (v.20.0) with group-specific FDR calculation^58,61,81 and Spectronaut (v.18.4)⁸², against the generated reference fasta file containing both UniProt and the trimmed circRNA-derived ORFs around the BSJ to which a list of common MS contaminant proteins were added. To increase the coverage of the spectral library, we assembled available DDA raw files from T1185B cells treated or not with IFNγ (from the PRIDE accession PXD013649⁶), the newly generated DDA data, together with the DIA files of T1185B cells treated with MG132 and the new IFNγ treatments (and their respective controls), as indicated in Supplementary Data 7. In FragPipe we applied a group-specific FDR threshold of 0.03 (MSFragger Group variable: Protein evidence from FASTA file) while in Spectronaut we applied a global peptide FDR threshold of 0.01. In both engines, the search was done by applying a FDR threshold of 1 for proteins, nonspecific protein digestion cleavage, no fixed modifications, methionine oxidation and protein N-term acetylation as variable modifications, and restricting the peptide length to 8-15 amino acids. Hybrid spectral libraries were then used to match and quantify peptides from the immunopeptidomics DIA data using a peptide precursor group-specific FDR of 0.03 for FragPipe or global FDR of 0.01 for Spectronaut. Default decoy generation methods were used for each MS search tool, reversed and mutated sequences for FragPipe and Spectronaut, respectively. Data analysis was performed using Fragpipe quantification values after overlapping identified sequences from both FragPipe and Spectronaut MS analysis tools.

DDA-DIA MS search with group-specific FDR calculation: HLA-I and HLA-II raw files of the lung cancer cohort of 8 patients and 52 tumoral and healthy matched tissues⁶³ (Supplementary Data 7) were downloaded from PRIDE PXD034772 and analyzed by FragPipe (v.19.2-build39 for HLA-I and v.20.0 for HLA-II immunopeptidomes) which supported group-specific FDR calculation. Spectral library generation was performed using the DDA immunopeptidomics data. The search was done setting a nonspecific protein digestion cleavage, no fixed modifications, methionine oxidation and protein N-term acetylation as variable modifications, a group-specific FDR threshold of 0.03 for peptides (MSFragger Group variable: Protein evidence from FASTA file), a FDR threshold of 1 for proteins, and restricting the peptide length to 8-15 or to 8-25 amino acids for HLA-I or HLA-II MS searches, respectively. Respective spectral libraries were then used to match and quantify peptides from the immunopeptidomics DIA data using a peptide precursor group-specific FDR of 0.03. DIA immunopeptidomics raw files were used for matching and quantification of peptides. Peptides from canonical and non-canonical circRNA groups were used to calculate the FDR separately because the score distributions are different. Pooling them together would result in underestimated FDR for the circRNA group. HLA-I and HLA-II library generation and peptide identification were performed separately.

For MS/MS prediction, an ecosystem within Prosit⁸³ was used via Oktoberfest⁸⁴. Oktoberfest can calibrate collision energy (CE), rescoring search results and generates spectral libraries from a list of peptides. We used “Prosit_2020_intensity_HCD” and “Prosit_2019_irt” as models for intensity and retention time predictions respectively. For peptide ILDKKVEKL prediction, we used CE = 28 for charge state z = 1 + , z = 2+ and z = 3 + . Comparison of the DDA-measured and the Prosit-predicted spectra of ILDKKVEKL was performed using PDV 1.8.2⁵⁵ with a tolerance of 10 ppm.

Furthermore, to remove potentially ambiguous identifications resulting from PSMs that better fit possible modified PC sequences, the MSMS spectra of candidate circRNA-BSJ peptides were re-searched with COMET (same parameters as above, but no FDR) against the human reference proteome UniProt database concatenated with the list of the circRNA-BSJ peptide sequences, including six common modifications. The variable modifications included were 15.9949 Da for oxidation on M, 42.010565 Da for acetylation on the N-terminus, 79.966331 Da for phosphorylation on STY, 119.004099 Da for cysteinylation, 0.98402 Da for deamidation NQ and 57.021464 Da for carbamidomethyl on C. Ambiguous identifications mapping to modified PC peptides with either higher or equal XCorr (delta score =0) were excluded from downstream analyses.

Identification of peptide candidates uniquely derived from circRNAs

After MS search, common MS contaminants were removed and peptides matching both the protein and circRNA groups were annotated as belonging to the protein-coding space. In addition, circRNA-derived peptide candidates were blasted against the Reference Proteins database (refseq_protein, homo sapiens, 106 K entries) using the online NCBI Standard Protein Blast tool⁸⁵ and peptides with 100% similarity to sequences in this reference were re-annotated as belonging to the protein group. circRNA-derived peptides were further mapped to the original trimmed circRNA-derived ORF sequences to identify those overlapping the BSJ encoding region.

Assessing tumor specificity

Cancer related circRNAs in the lung cancer cohort were defined as having expression of their host gene of at least 2.5 TPM at the 99th percentile in any TCGA cancer type but not higher than 1 TPM in any GTEx⁸⁶ normal tissue samples at the 90th percentile (excluding testis and sun exposed skin). We assumed the co-existence between circRNA and linear counterparts, since the biogenesis of circRNAs compete with regular splicing²⁵.

In addition, we downloaded the MS files of HLA-I immunopeptidomics data from the HLA Ligand Atlas⁶⁴ and searched them against the fasta file containing both UniProt and the trimmed circRNA-derived putative ORFs around the BSJ and initiated by the canonical start codon ATG, to obtain information about their detection in benign tissues. We used Comet and the NewAnce tool, setting a nonspecific protein digestion cleavage, no fixed modifications, methionine oxidation as variable modification, a group-specific FDR threshold of 0.03 for PC and circRNA-derived peptides, a FDR threshold of 1 for proteins, and restricting the peptide length to 8-15 amino acids.

HLA typing

Genomic DNA from T1185B cell line and Mel-1 tumor tissue was extracted with the DNeasy Blood & Tissue Kit (Qiagen). HLA-typing was performed using the TruSight HLA v.2 Sequencing Panel protocol (CareDx). Sequencing was performed on an Illumina® MiniSeq™ System (Illumina) and data was analyzed using the Assign TruSight HLA v.2.1 software (CareDx).

Prediction of HLA binding affinity

NetMHCpan4.1⁸⁷ and NetMHCIIpan4.1⁸⁸ were used to predict the binding affinity of the identified peptides against the patient-specific HLA-I allotypes. Peptide sequences with a % rank lower than 2 were considered as binders. Each binder peptide was then annotated to the best binder allele considering the lowest % rank.

circRNA detection and Sanger sequencing

RNA was extracted with the miRNeasy Tissue/Cells Advanced Mini Kit (Qiagen) according to the manufacturer’s protocol. Purification of total RNA transcripts include a non-enzymatic gDNA removal by a gDNA eliminator column. RNA quality was checked on a Fragment Analyzer using the RNA kit (DNF-471-0500, Agilent Technologies). Total RNA was converted into cDNA using the GoScript™ Reverse Transcriptase kit with random primer hexamers (A2801, Promega). Amplification of the cDNA (equivalent to 100 ng of total RNA) flanking the circRNA BSJ, was performed by a divergent RT-PCR, using oligos designed outside the BSJ, allowing validation of the BJS with Sanger Sequencing (Supplementary Table 2). RT-PCR amplification was performed using the KAPA HiFi HotStart ReadyMix PCR Kit (7958935001, Roche) using manufacturer’s recommendations, except for the number of cycles which was increased to 40 to increment the amount of product to be sequenced. RT-PCR products were run in a 2% agarose gel, bands with the expected size were excised and DNA purified using the NucleoSpin Gel and PCR clean-up kit. Direct Sanger Sequencing of the purified DNA was performed at Microsynth (Balgach, Switzerland).

Data analysis and statistics

Results are shown for peptides of 8-15 and 9-25 amino acids in length for HLA-I or HLA-II datasets, respectively. Statistical analysis of the hybrid DIA MS search addressing MG132 and IFNγ treatments were performed by Perseus 1.5.5.3⁸⁹, after summing the intensities of all precursor charge states for all peptides identified in FragPipe, keeping unmodified and modified peptides separately. Canonical source proteins were annotated with GOBP, GOCC, KEGG and keyword annotations. Common MS contaminants were removed, and the output was split in two datasets analyzed in separate, each one containing only the peptides identified in MG132 or IFNγ treatments with their respective controls. Peptide intensities from FragPipe were log2 transformed and the missing values were imputed from a normal distribution (width=0.3, down shift=1.8), followed by a width adjustment normalization and keeping peptides which sequences were also detected in the Spectronaut MS search, independently on their modifications. For standard hierarchical clustering the intensities of each peptide were z-scored. Otherwise, the log2 intensity matrix was further filtered to retain peptides identified and quantified in at least 80% of the raw files of the associated treatment. Raw files derived from the MG132 treatment were annotated in different categories based on the previous generated hierarchical clustering, Control (6 h and 24 h) and MG132 at 24 h (1 μM, 5 μM and 10 μM at 24 h), while for the IFNγ treatment we annotated the raw files as Control or IFNγ (48 h). The created groups were used to generate volcano plots for each treatment (s0 = 0.75; FDR0.01). The overall effect of the different treatments in the log2 intensity of the identified peptides was checked by performing a two-sample test, using a Student’s t-test with s0 = 0.75 and a permutation-based FDR of 0.01. The Student’s t-test difference between the different groups (per treatment group) were then used to calculate the peptide presentation enrichment score per GOBP, GOCC and KEGG term, and per keyword annotation with 1D annotation enrichment analysis of the associated protein annotations with a Benjamin-Hochberg FDR threshold of 1E-04. The peptide presentation enrichment score per peptide group (PC and circRNAs) and HLA restriction was estimated using a less stringent FDR of 0.02 (Supplementary Data 2 and Supplementary Data 3). GraphPad Prism (version 9.5.1) was used to create a bubble plot representing the enrichment of peptides in each HLA allele and GOBP, GOCC and KEGG categories (peptide number≥ 50, top 12 categories with the higher enrichment score; keyword categories were excluded in this representation). Violin plots showing the difference in the log2 intensity of the peptides associated with their HLA restriction were also generated using GraphPad. DeepVenn (https://www.deepvenn.com) was used to visualize the intersection between FragPipe and Spectronaut. Upset plots showing the intersection of the identified peptide sequences in the three biological replicates of T1185B, annotated in the different PC, circRNA, and circRNA-BSJ groups were generated using Intervene (https://intervene.shinyapps.io/intervene/)⁹⁰.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

MS raw files, reference fasta files and NewAnce, FragPipe and Spectronaut parameters and outputs generated in this study have been deposited in the ProteomeXchange Consortium via the PRIDE partner repository⁹¹ under accession code PXD043989. The lung cancer cohort MS data used in this study are available in the ProteomeXchange Consortium under accession code PXD034772. Publicly available DDA raw files derived from T1185B cells treated or not with IFNγ used in this study are available in the ProteomeXchange Consortium under accession code PXD013649. The circBase database can be found at http://www.circbase.org/. The UniProt database can be found at https://www.uniprot.org. The Cancer Genome Atlas (TCGA) data can be found at https://www.cancer.gov/tcga. GTEx Portal can be accessed through https://www.gtexportal.org/home/⁸⁶. HLA Ligand Atlas can be accessed through https://hla-ligand-atlas.org/welcome. The Human Protein Atlas can be accessed through proteinatlas.org. Source data are provided with this paper.

Code availability

The provided GitHub link contains the necessary code to produce the circRNA-derived BSJ-ORF fasta reference and modify it to align with FragPipe’s group-specific FDR calculation. https://github.com/bassanilab/CircRNA_MS_ref_fasta⁵⁰.

References

Waldman, A. D., Fritz, J. M. & Lenardo, M. J. A guide to cancer immunotherapy: from T cell basic science to clinical practice. Nat. Rev. Immunol. 20, 651–668 (2020).
Article CAS PubMed PubMed Central Google Scholar
Chong, C., Coukos, G. & Bassani-Sternberg, M. Identification of tumor antigens with immunopeptidomics. Nat. Biotechnol. 40, 175–188 (2022).
Article CAS PubMed Google Scholar
Tran, E. et al. Immunogenicity of somatic mutations in human gastrointestinal cancers. Science 350, 1387–1390 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
Arnaud, M. et al. Sensitive identification of neoantigens and cognate TCRs in human solid tumors. Nat. Biotechnol. 40, 656–660 (2022).
Article CAS PubMed Google Scholar
Li, L., Goedegebuure, S. P. & Gillanders, W. Cancer vaccines: shared tumor antigens return to the spotlight. Signal Transduct. Target Ther. 5, 251 (2020).
Article PubMed PubMed Central Google Scholar
Chong, C. et al. Integrated proteogenomic deep sequencing and analytics accurately identify non-canonical peptides in tumor immunopeptidomes. Nat. Commun. 11, 1293 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Hanahan, D. Hallmarks of Cancer: New Dimensions. Cancer Discov. 12, 31–46 (2022).
Article CAS PubMed Google Scholar
Hu, W. et al. Systematic characterization of cancer transcriptome at transcript resolution. Nat. Commun. 13, 6803 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Ouspenskaia, T. et al. Unannotated proteins expand the MHC-I-restricted immunopeptidome in cancer. Nat. Biotechnol. 40, 209–217 (2022).
Article CAS PubMed Google Scholar
Zhou, Y. et al. Proteomic signatures of 16 major types of human cancer reveal universal and cancer-type-specific proteins for the identification of potential therapeutic targets. J. Hematol. Oncol. 13, 170 (2020).
Article CAS PubMed PubMed Central Google Scholar
Abi Habib, J., Lesenfants, J., Vigneron, N. & Van den Eynde, B. J. Functional Differences between Proteasome Subtypes. Cells 11, 421 (2022).
Article CAS PubMed PubMed Central Google Scholar
Balasubramanian, A., John, T. & Asselin-Labat, M. L. Regulation of the antigen presentation machinery in cancer and its implication for immune surveillance. Biochem Soc. Trans. 50, 825–837 (2022).
Article CAS PubMed PubMed Central Google Scholar
Xia, J., Li, S., Ren, B. & Zhang, P. Circular RNAs as a potential source of neoepitopes in cancer. Front Oncol. 13, 1098523 (2023).
Article CAS PubMed PubMed Central Google Scholar
Nigro, J. M. et al. Scrambled exons. Cell 64, 607–613 (1991).
Article CAS PubMed Google Scholar
Jeck, W. R. et al. Circular RNAs are abundant, conserved, and associated with ALU repeats. RNA 19, 141–157 (2013).
Article CAS PubMed PubMed Central Google Scholar
Peter, M. R. et al. Investigating urinary circular RNA biomarkers for improved detection of renal cell carcinoma. Front Oncol. 11, 814228 (2021).
Article CAS PubMed Google Scholar
He, Y. D. et al. A urine extracellular vesicle circRNA classifier for detection of high-grade prostate cancer in patients with prostate-specific antigen 2-10 ng/mL at initial biopsy. Mol. Cancer 20, 96 (2021).
Article CAS PubMed PubMed Central Google Scholar
Zheng, R. et al. Exosomal circLPAR1 functions in colorectal cancer diagnosis and tumorigenesis through suppressing BRD4 via METTL3-eIF3h interaction. Mol. Cancer 21, 49 (2022).
Article CAS PubMed PubMed Central Google Scholar
Roy, S. et al. Diagnostic efficacy of circular RNAs as noninvasive, liquid biopsy biomarkers for early detection of gastric cancer. Mol. Cancer 21, 42 (2022).
Article CAS PubMed PubMed Central Google Scholar
Conn, S. J. et al. The RNA binding protein quaking regulates formation of circRNAs. Cell 160, 1125–1134 (2015).
Article CAS PubMed Google Scholar
Li, J. et al. An alternative splicing switch in FLNB promotes the mesenchymal cell state in human breast cancer. Elife 7, e37184 (2018).
Article PubMed PubMed Central Google Scholar
Ivanov, A. et al. Analysis of intron sequences reveals hallmarks of circular RNA biogenesis in animals. Cell Rep. 10, 170–177 (2015).
Article CAS PubMed Google Scholar
Shen, H. et al. ADARs act as potent regulators of circular transcriptome in cancer. Nat. Commun. 13, 1508 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Hansen, T. B. et al. Natural RNA circles function as efficient microRNA sponges. Nature 495, 384–388 (2013).
Article ADS CAS PubMed Google Scholar
Ashwal-Fluss, R. et al. circRNA biogenesis competes with pre-mRNA splicing. Mol. Cell 56, 55–66 (2014).
Article CAS PubMed Google Scholar
Zhang, Y. et al. Circular intronic long noncoding RNAs. Mol. Cell 51, 792–806 (2013).
Article CAS PubMed Google Scholar
Li, Z. et al. Exon-intron circular RNAs regulate transcription in the nucleus. Nat. Struct. Mol. Biol. 22, 256–264 (2015).
Article PubMed Google Scholar
Chen, N. et al. A novel FLI1 exonic circular RNA promotes metastasis in breast cancer by coordinately regulating TET1 and DNMT1. Genome Biol. 19, 218 (2018).
Article CAS PubMed PubMed Central Google Scholar
Zhou, W. Y. et al. Circular RNA: metabolism, functions and interactions with proteins. Mol. Cancer 19, 172 (2020).
Article CAS PubMed PubMed Central Google Scholar
Pamudurti, N. R. et al. Translation of CircRNAs. Mol. Cell 66, 9–21 e27 (2017).
Article CAS PubMed PubMed Central Google Scholar
Yang, Y. et al. Extensive translation of circular RNAs driven by N(6)-methyladenosine. Cell Res 27, 626–641 (2017).
Article CAS PubMed PubMed Central Google Scholar
Zhou, C. et al. Genome-Wide Maps of m6A circRNAs Identify Widespread and Cell-Type-Specific Methylation Patterns that Are Distinct from mRNAs. Cell Rep. 20, 2262–2276 (2017).
Article CAS PubMed PubMed Central Google Scholar
Fan, X., Yang, Y., Chen, C. & Wang, Z. Pervasive translation of circular RNAs driven by short IRES-like elements. Nat. Commun. 13, 3751 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Abe, N. et al. Rolling Circle Translation of Circular RNA in Living Human Cells. Sci. Rep. 5, 16435 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
Lei, M., Zheng, G., Ning, Q., Zheng, J. & Dong, D. Translation and functional roles of circular RNAs in human cancer. Mol. Cancer 19, 30 (2020).
Article CAS PubMed PubMed Central Google Scholar
Liang, W. C. et al. Translation of the circular RNA circbeta-catenin promotes liver cancer cell growth through activation of the Wnt pathway. Genome Biol. 20, 84 (2019).
Article PubMed PubMed Central Google Scholar
Oules, B. et al. Clinicopathologic and molecular characterization of melanomas mutated for CTNNB1 and MAPK. Virchows Arch. 480, 475–480 (2022).
Article CAS PubMed Google Scholar
Guo, J. U., Agarwal, V., Guo, H. & Bartel, D. P. Expanded identification and characterization of mammalian circular RNAs. Genome Biol. 15, 409 (2014).
Article PubMed PubMed Central Google Scholar
Huang, W. et al. TransCirc: an interactive database for translatable circular RNAs based on multi-omics evidence. Nucleic Acids Res 49, D236–D242 (2021).
Article CAS PubMed Google Scholar
van Heesch, S. et al. The Translational Landscape of the Human Heart. Cell 178, 242–260 e229 (2019).
Article PubMed Google Scholar
You, X. et al. Neural circular RNAs are derived from synaptic genes and regulated by development and plasticity. Nat. Neurosci. 18, 603–610 (2015).
Article CAS PubMed PubMed Central Google Scholar
Ruiz Cuevas, M. V. et al. Most non-canonical proteins uniquely populate the proteome or immunopeptidome. Cell Rep. 34, 108815 (2021).
Article CAS PubMed Google Scholar
Li, H. et al. riboCIRC: a comprehensive database of translatable circRNAs. Genome Biol. 22, 79 (2021).
Article CAS PubMed PubMed Central Google Scholar
Chen, C. K. et al. Structured elements drive extensive circular RNA translation. Mol. Cell 81, 4300–4318 e4313 (2021).
Article CAS PubMed PubMed Central Google Scholar
Wang, W. et al. Tumor-Specific CircRNA-Derived Antigen Peptide Identification for Hepatobiliary Tumors. Engineering 22, 159–170 (2023).
Article CAS Google Scholar
Purcell, A. W., Ramarathinam, S. H. & Ternette, N. Mass spectrometry-based identification of MHC-bound peptides for immunopeptidomics. Nat. Protoc. 14, 1687–1707 (2019).
Article CAS PubMed Google Scholar
Tang, K., Zhang, H., Li, Y., Sun, Q. & Jin, H. Circular RNA as a Potential Biomarker for Melanoma: A Systematic Review. Front Cell Dev. Biol. 9, 638548 (2021).
Article PubMed PubMed Central Google Scholar
Li, J. et al. CircRNAs in lung cancer- role and clinical application. Cancer Lett. 544, 215810 (2022).
Article CAS PubMed Google Scholar
Glazar, P., Papavasileiou, P. & Rajewsky, N. circBase: a database for circular RNAs. RNA 20, 1666–1670 (2014).
Article CAS PubMed PubMed Central Google Scholar
Ferreira H. J. et al. Immunopeptidomics-based identification of naturally presented non-canonical circRNA-derived peptides. Zenodo, https://doi.org/10.5281/zenodo.10598317 (2024).
UniProt, C. UniProt: the Universal Protein Knowledgebase in 2023. Nucleic Acids Res. 51, D523–D531 (2023).
Article Google Scholar
Cox, J. & Mann, M. MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification. Nat. Biotechnol. 26, 1367–1372 (2008).
Article CAS PubMed Google Scholar
Eng, J. K., Jahan, T. A. & Hoopmann, M. R. Comet: an open-source MS/MS sequence database search tool. Proteomics 13, 22–24 (2013).
Article CAS PubMed Google Scholar
Josephs, T. M., Grant, E. J. & Gras, S. Molecular challenges imposed by MHC-I restricted long epitopes on T cell immunity. Biol. Chem. 398, 1027–1036 (2017).
Article CAS PubMed Google Scholar
Li, K., Vaudel, M., Zhang, B., Ren, Y. & Wen, B. PDV: an integrative proteomics data viewer. Bioinformatics 35, 1249–1251 (2019).
Article CAS PubMed Google Scholar
Nelde, A. et al. Upstream open reading frames regulate translation of cancer-associated transcripts and encode HLA-presented immunogenic tumor antigens. Cell Mol. Life Sci. 79, 171 (2022).
Article CAS PubMed PubMed Central Google Scholar
Nesvizhskii, A. I. Proteogenomics: concepts, applications and computational strategies. Nat. Methods 11, 1114–1125 (2014).
Article CAS PubMed PubMed Central Google Scholar
Kong, A. T., Leprevost, F. V., Avtonomov, D. M., Mellacheruvu, D. & Nesvizhskii, A. I. MSFragger: ultrafast and comprehensive peptide identification in mass spectrometry-based proteomics. Nat. Methods 14, 513–520 (2017).
Article CAS PubMed PubMed Central Google Scholar
Yang, K. L. et al. MSBooster: improving peptide identification rates using deep learning-based features. Nat. Commun. 14, 4539 (2023).
Article ADS CAS PubMed PubMed Central Google Scholar
Yu, F. et al. Analysis of DIA proteomics data using MSFragger-DIA and FragPipe computational platform. Nat. Commun. 14, 4154 (2023).
Article ADS CAS PubMed PubMed Central Google Scholar
da Veiga Leprevost, F. et al. Philosopher: a versatile toolkit for shotgun proteomics data analysis. Nat. Methods 17, 869–870 (2020).
Article PubMed PubMed Central Google Scholar
Komov, L. et al. Cell Surface MHC Class I Expression Is Limited by the Availability of Peptide-Receptive “Empty” Molecules Rather than by the Supply of Peptide Ligands. Proteomics 18, e1700248 (2018).
Article PubMed Google Scholar
Kraemer, A. I. et al. The immunopeptidome landscape associated with T cell infiltration, inflammation and immune editing in lung cancer. Nat. Cancer 4, 608–628 (2023).
Article CAS PubMed PubMed Central Google Scholar
Marcu, A. et al. HLA Ligand Atlas: a benign reference of HLA-presented peptides to improve T-cell-based cancer immunotherapy. J. Immunother. cancer 9, e002071 (2021).
Article PubMed PubMed Central Google Scholar
Herberts, C. A. et al. Autoreactivity against induced or upregulated abundant self-peptides in HLA-A*0201 following measles virus infection. Hum. Immunol. 64, 44–55 (2003).
Article CAS PubMed Google Scholar
Chong, C. et al. High-throughput and Sensitive Immunopeptidomics Platform Reveals Profound Interferongamma-Mediated Remodeling of the Human Leukocyte Antigen (HLA) Ligandome. Mol. Cell Proteom. 17, 533–548 (2018).
Article CAS Google Scholar
Goebel, T. et al. Proteaphagy in Mammalian Cells Can Function Independent of ATG5/ATG7. Mol. Cell Proteom. 19, 1120–1131 (2020).
Article Google Scholar
Hoeller, D. & Dikic, I. How the proteasome is degraded. Proc. Natl Acad. Sci. USA 113, 13266–13268 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Feng, J. et al. CSCD2: an integrated interactional database of cancer-specific circular RNAs. Nucleic Acids Res 50, D1179–D1183 (2022).
Article CAS PubMed Google Scholar
Dou, Y. et al. Proteogenomic Characterization of Endometrial Carcinoma. Cell 180, 729–748 e726 (2020).
Article CAS PubMed PubMed Central Google Scholar
Biaoxue, R. et al. Upregulation of Hsp90-beta and annexin A1 correlates with poor survival and lymphatic metastasis in lung cancer patients. J. Exp. Clin. Cancer Res. 31, 70 (2012).
Article PubMed PubMed Central Google Scholar
Silvera, D., Formenti, S. C. & Schneider, R. J. Translational control in cancer. Nat. Rev. Cancer 10, 254–266 (2010).
Article CAS PubMed Google Scholar
Parker, R. et al. The Choice of Search Engine Affects Sequencing Depth and HLA Class I Allele-Specific Peptide Repertoires. Mol. Cell Proteom. 20, 100124 (2021).
Article CAS Google Scholar
Vo, J. N. et al. The Landscape of Circular RNA in Cancer. Cell 176, 869–881 e813 (2019).
Article CAS PubMed PubMed Central Google Scholar
Zhang, J. et al. Comprehensive profiling of circular RNAs with nanopore sequencing and CIRI-long. Nat. Biotechnol. 39, 836–845 (2021).
Article PubMed Google Scholar
Zhou, T. et al. Rat BodyMap transcriptomes reveal unique circular RNA features across tissue types and developmental stages. RNA 24, 1443–1456 (2018).
Article CAS PubMed PubMed Central Google Scholar
Neubert, N. J. et al. A Well-Controlled Experimental System to Study Interactions of Cytotoxic T Lymphocytes with Tumor Cells. Front. Immunol. 7, 326 (2016).
Article PubMed PubMed Central Google Scholar
Pak, H. et al. Sensitive Immunopeptidomics by Leveraging Available Large-Scale Multi-HLA Spectral Libraries, Data-Independent Acquisition, and MS/MS Prediction. Mol. Cell Proteom. 20, 100080 (2021).
Article CAS Google Scholar
Li, D. et al. pFind: a novel database-searching software system for automated peptide and protein identification via tandem mass spectrometry. Bioinformatics 21, 3049–3050 (2005).
Article CAS PubMed Google Scholar
Wang, L. H. et al. pFind 2.0: a software package for peptide and protein identification via tandem mass spectrometry. Rapid Commun. Mass Spectrom. 21, 2985–2991 (2007).
Article ADS CAS PubMed Google Scholar
Demichev, V. et al. dia-PASEF data analysis using FragPipe and DIA-NN for deep proteomics of low sample amounts. Nat. Commun. 13, 3944 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Bruderer, R. et al. Extending the limits of quantitative proteome profiling with data-independent acquisition and application to acetaminophen-treated three-dimensional liver microtissues. Mol. Cell Proteom. 14, 1400–1410 (2015).
Article CAS Google Scholar
Gessulat, S. et al. Prosit: proteome-wide prediction of peptide tandem mass spectra by deep learning. Nat. Methods 16, 509–518 (2019).
Article CAS PubMed Google Scholar
Picciani, M. et al. Oktoberfest: Open-source spectral library generation and rescoring pipeline based on Prosit. Proteomics 6, e2300112 (2023).
Article Google Scholar
Coordinators NR. Database resources of the National Center for Biotechnology Information. Nucleic Acids Res. 41, D8–D20 (2013).
Article Google Scholar
Consortium, G. T. The Genotype-Tissue Expression (GTEx) project. Nat. Genet 45, 580–585 (2013).
Article Google Scholar
Reynisson, B., Alvarez, B., Paul, S., Peters, B. & Nielsen, M. NetMHCpan-4.1 and NetMHCIIpan-4.0: improved predictions of MHC antigen presentation by concurrent motif deconvolution and integration of MS MHC eluted ligand data. Nucleic Acids Res. 48, W449–W454 (2020).
Article CAS PubMed PubMed Central Google Scholar
Reynisson, B. et al. Improved Prediction of MHC II Antigen Presentation through Integration and Motif Deconvolution of Mass Spectrometry MHC Eluted Ligand Data. J. Proteome Res. 19, 2304–2315 (2020).
Article CAS PubMed Google Scholar
Cox, J. & Mann, M. 1D and 2D annotation enrichment: a statistical method integrating quantitative proteomics with complementary high-throughput data. BMC Bioinforma. 13, S12 (2012).
Article CAS Google Scholar
Khan, A. & Mathelier, A. Intervene: a tool for intersection and visualization of multiple gene or genomic region sets. BMC Bioinforma. 18, 287 (2017).
Article Google Scholar
Perez-Riverol, Y. et al. The PRIDE database and related tools and resources in 2019: improving support for quantification data. Nucleic Acids Res. 47, D442–D450 (2019).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

We are thankful to Katja Muehlethaler for the generation of the primary cell line Mel-1 and further technical support, and to Anne-Christine Thierry, Petra Baumgaertner, Noemie Fahr, Aymeric Auger, Julien Schmidt, Philippe Guillaume and Alexandre Harari, from the Department of Oncology UNIL CHUV, for their contributions to the discussion of the manuscript. We are thankful to Gabriel Villamil and Uwe Ohler, from The Berlin Institute for Medical Systems Biology, Max Delbruck Center for Molecular Medicine, for their supportive discussions. This study was supported by the Ludwig Institute for Cancer Research, by the Swiss Cancer Research Foundation, grant KFS-4680-02-2019 (M.B.-S.) and the Swiss National Science Foundation, PRIMA grant PR00P3_193079 (M.B.-S.). Some elements from Fig. 5 and Supplementary Fig. 9 were originally created using BioRender.com.

Author information

Authors and Affiliations

Ludwig Institute for Cancer Research, University of Lausanne, Lausanne, Switzerland
Humberto J. Ferreira, Brian J. Stevenson, HuiSong Pak, Jessica Almeida Oliveira, Florian Huber, Marie Taillandier-Coindard, Justine Michaux, Emma Ricart-Altimiras, Anne I. Kraemer, Lana E. Kandalaft, Markus Müller & Michal Bassani-Sternberg
Department of Oncology, Centre Hospitalier Universitaire Vaudois, Lausanne, Switzerland
Humberto J. Ferreira, HuiSong Pak, Jessica Almeida Oliveira, Florian Huber, Marie Taillandier-Coindard, Justine Michaux, Emma Ricart-Altimiras, Anne I. Kraemer, Lana E. Kandalaft, Daniel E. Speiser, Markus Müller & Michal Bassani-Sternberg
Agora Cancer Research Centre, Lausanne, Switzerland
Humberto J. Ferreira, Brian J. Stevenson, HuiSong Pak, Jessica Almeida Oliveira, Florian Huber, Marie Taillandier-Coindard, Justine Michaux, Emma Ricart-Altimiras, Anne I. Kraemer, Lana E. Kandalaft, Markus Müller & Michal Bassani-Sternberg
SIB Swiss Institute of Bioinformatics, University of Lausanne, Lausanne, Switzerland
Brian J. Stevenson & Markus Müller
Department of Pathology, University of Michigan, Ann Arbor, MI, USA
Fengchao Yu & Alexey I. Nesvizhskii
Center of Experimental Therapeutics, Department of Oncology, Centre Hospitalier Universitaire Vaudois, Lausanne, Switzerland
Lana E. Kandalaft & Michal Bassani-Sternberg
Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
Alexey I. Nesvizhskii

Authors

Humberto J. Ferreira
View author publications
You can also search for this author in PubMed Google Scholar
Brian J. Stevenson
View author publications
You can also search for this author in PubMed Google Scholar
HuiSong Pak
View author publications
You can also search for this author in PubMed Google Scholar
Fengchao Yu
View author publications
You can also search for this author in PubMed Google Scholar
Jessica Almeida Oliveira
View author publications
You can also search for this author in PubMed Google Scholar
Florian Huber
View author publications
You can also search for this author in PubMed Google Scholar
Marie Taillandier-Coindard
View author publications
You can also search for this author in PubMed Google Scholar
Justine Michaux
View author publications
You can also search for this author in PubMed Google Scholar
Emma Ricart-Altimiras
View author publications
You can also search for this author in PubMed Google Scholar
Anne I. Kraemer
View author publications
You can also search for this author in PubMed Google Scholar
Lana E. Kandalaft
View author publications
You can also search for this author in PubMed Google Scholar
Daniel E. Speiser
View author publications
You can also search for this author in PubMed Google Scholar
Alexey I. Nesvizhskii
View author publications
You can also search for this author in PubMed Google Scholar
Markus Müller
View author publications
You can also search for this author in PubMed Google Scholar
Michal Bassani-Sternberg
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

H.J.F. and M.B.-S. conceived and designed the project and interpreted the results. B.J.S. constructed the circRNA fasta reference and interpreted the results. H.S.P. assisted in the LC-MS experiments (DDA, DIA, PRM) and MS search analysis. M.M. developed and implemented the software NewAnce for group-specific FDR calculations. L.E.K. collected study material. H.J.F., J.A.O., J.M., and M.T.-C. conducted immunopeptidomics MS experiments. F.Y., and A.I.N implemented the group-specific FDR in FragPipe. F.H., E.R.-A., A.I.K., and D.E.S. assisted in data interpretation. H.J.F. and M.B.-S. wrote the manuscript with contributions from all authors.

Corresponding author

Correspondence to Michal Bassani-Sternberg.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Rupert Mayer, Zefeng Wang and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Peer Review File

Description of Additional Supplementary Files

Supplementary Data 1

Supplementary Data 2

Supplementary Data 3

Supplementary Data 4

Supplementary Data 5

Supplementary Data 6

Supplementary Data 7

Reporting Summary

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Ferreira, H.J., Stevenson, B.J., Pak, H. et al. Immunopeptidomics-based identification of naturally presented non-canonical circRNA-derived peptides. Nat Commun 15, 2357 (2024). https://doi.org/10.1038/s41467-024-46408-3

Download citation

Received: 21 July 2023
Accepted: 16 February 2024
Published: 15 March 2024
DOI: https://doi.org/10.1038/s41467-024-46408-3

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.