Next-generation proteomics for quantitative Jumbophage-bacteria interaction mapping

Fossati, Andrea; Mozumdar, Deepto; Kokontis, Claire; Mèndez-Moran, Melissa; Nieweglowska, Eliza; Pelin, Adrian; Li, Yuping; Guo, Baron; Krogan, Nevan J.; Agard, David A.; Bondy-Denomy, Joseph; Swaney, Danielle L.

doi:10.1038/s41467-023-40724-w

Download PDF

Article
Open access
Published: 24 August 2023

Next-generation proteomics for quantitative Jumbophage-bacteria interaction mapping

Nature Communications volume 14, Article number: 5156 (2023) Cite this article

4242 Accesses
5 Citations
10 Altmetric
Metrics details

Subjects

Abstract

Host-pathogen interactions are pivotal in regulating establishment, progression, and outcome of an infection. While affinity-purification mass spectrometry has become instrumental in characterizing such interactions, it suffers from limitations in scalability and biological authenticity. Here we present the use of co-fractionation mass spectrometry for high throughput analysis of host-pathogen interactions from native viral infections of two jumbophages (ϕKZ and ϕPA3) in Pseudomonas aeruginosa. This approach enabled the detection of > 6000 unique host-pathogen interactions for each phage, encompassing > 50% of their respective proteomes. This deep coverage provided evidence for interactions between KZ-like phage proteins and the host ribosome, and revealed protein complexes for previously undescribed phage ORFs, including a ϕPA3 complex showing strong structural and sequence similarity to ϕKZ non-virion RNA polymerase. Interactome-wide comparison across phages showed similar perturbed protein interactions suggesting fundamentally conserved mechanisms of phage predation within the KZ-like phage family. To enable accessibility to this data, we developed PhageMAP, an online resource for network query, visualization, and interaction prediction (https://phagemap.ucsf.edu/). We anticipate this study will lay the foundation for the application of co-fractionation mass spectrometry for the scalable profiling of host-pathogen interactomes and protein complex dynamics upon infection.

Structure-guided discovery of anti-CRISPR and anti-phage defense proteins

Article Open access 20 January 2024

Phage proteins target and co-opt host ribosomes immediately upon infection

Article Open access 04 March 2024

Elucidating the network features and evolutionary attributes of intra- and interspecific protein–protein interactions between human and pathogenic bacteria

Article Open access 08 January 2021

Introduction

Protein–protein interactions (PPIs) are the fundamental building blocks of cellular complexity, and their perturbation and rewiring have profound effects on the proteome and cell fate. During an infection, the interactions between host and pathogen proteome are pivotal in regulating pathogen tropism, infection progression, and, ultimately, infection outcome. Host–pathogen interaction (HPI) mapping using affinity-purification mass spectrometry (AP-MS) has been instrumental in identifying host-targeted processes^1,2,3,4,5 and, recently, in predicting potential therapeutic targets during the SARS-CoV-2 pandemic^6,7,8.

Despite the successes of AP-MS for mapping HPIs, the exogenous expression and purification of individual pathogen proteins limit our ability to characterize HPIs under native expression levels and quantify how these interactions are regulated in the context of the full pathogen protein repertoire during infection as well as precluding the detection of downstream rearrangements in protein complexes beyond the viral protein of interest. While some of these limitations have been partially overcome by the introduction of endogenous tagging within the viral genome, this approach has been mostly limited to small viruses⁹. Both endogenous tagging and ectopic expression are labor-intensive processes that require the generation of numerous plasmids and hundreds or thousands of individual purifications to comprehensively probe protein-protein interactions for an entire viral proteome. This limits the scalability of AP-MS for the characterization of HPIs for larger viruses or bacteria which express hundreds or thousands of proteins.

As a result, small eukaryotic viruses have been prioritized in HPI studies^6,10, thus, extensive knowledge on interactions between larger prokaryotic viruses (bacteriophages) and their host is currently missing. This class of bacterial viruses holds great potential for the treatment of multi-drug resistant bacteria, which have increasingly been reported in the last two decades¹¹. However, without a thorough understanding of putative interactions and functions of the phage gene products, it will be challenging to inform the rational design of the next generation of phage therapeutics.

To bridge this gap, here we have applied co-fractionation mass spectrometry using size-exclusion chromatography, coupled with fast data-independent acquisition MS (SEC DIA-MS)¹². In this technique, protein complexes extracted from a native lysate are size-fractionated, and each fraction is analyzed and acquired via mass-spectrometry, resulting in a large matrix of protein intensities over molecular mass. These protein profiles are then utilized as a proxy for the assembly state under the assumption that proteins having identical peak shapes and positions were physically associated at the separation stage. Using SEC-DIA-MS, we generated two systematic phage–bacteria interactomes and measured host PPI rewiring upon phage infection in Pseudomonas aeruginosa. This was done for two KZ-like phages (ϕKZ¹³ and ϕPA3¹⁴), which share 84% nucleotide sequence identity. Both are archetype Jumbophages that possess large genomes (>300 genes), with very limited organization of genes by function, hence lacking synteny. Unique to this family of phages is the presence of a large proteinaceous shell acting analogous to the eukaryotic nucleus, thus decoupling transcription from translation. This structure confers resistance to several bacterial antiphage systems such as CRISPR^15,16 and has a fundamental role in infection establishment¹⁷ and virion production¹⁸. Through the prediction of PPIs using deep learning and structural modeling, we derived system-level maps of Jumbophage infection encompassing a large fraction of the phage and bacterial host proteome. These HPI maps substantially extend previous knowledge on Jumbophage predation and demonstrate the application of co-fractionation mass spectrometry for HPI profiling.

Results

A cross-phage study of the viral infection cycle

To understand HPIs that mediate phage infection, we infected Pseudomonas aeruginosa (strain PAO1) with either the ϕKZ or ϕPA3 bacteriophage for 60 min in a biological duplicate. To control for virion protein complexes (i.e., complexes present within the phage itself), parallel experiments were also performed using a mutant PAO1 strain, dubbed ‘PAO1 control’, that emerged under phage selection (KZ resistant mutant)¹⁹ that resists infection from both phages as shown in Supplementary Fig. S1 (Fig. 1A). This strain expresses significantly lower levels of FliC protein (the major structural unit of the flagellum), a known receptor for ϕKZ¹⁹.

**Fig. 1: High-throughput interaction proteomics for deep host–pathogen interaction mapping.**

Infected cell lysates were fractionated by size-exclusion chromatography, and each fraction (n = 72) was analyzed using data-independent acquisition MS (DIA-MS) coupled to high-throughput liquid chromatography²⁰. To predict HPIs, we used a modified version of the PCprophet toolkit^12,21, where the random forest classifier was replaced with a deep neural network that was trained for PPI prediction using >10 million interactions from various co-fractionation experiments²². Following data processing and replicate integration, we utilized deep learning to predict co-eluting (i.e., interacting) proteins based on their intensity profiles across all measured fractions for a particular condition. To further increase our confidence, we utilized two filters: first, a target-decoy approach was employed to control for randomly coeluting proteins²¹, and then PPIs were filtered to those with a prediction probability of ≥0.75, resulting in a PPI false-discovery rate of less than 5%.

Derived HPI networks have been organized into a user-friendly website, PhageMAP, where users can query proteins of interest to visualize coelution patterns, interactomes, investigate different assembly states of the PAO1 proteome upon phage infection, and export their findings as publication-quality networks or coelution plots (Fig. 1B).

This experimental workflow resulted in the high-throughput and comprehensive coverage of both the bacterial and the phage proteomes. Specifically, we detected 3782 PAO1 proteins, covering 83% of the validated SwissProt entries (i.e., proteins for which experimental evidence of their existence is available) for the Pseudomonas pan-proteome, and 67% of the unreviewed entries (Fig. 1C). Likewise, we detected 280 proteins for ϕKZ and 198 proteins for ϕPA3, covering 75% and 53% of their proteomes, respectively (Fig. 1D).

To test the achievable robustness and resolution of our workflow, we used two benchmarks. First, the robustness of fractionation was assessed by the Pearson R² between the two replicates of a given condition. Each condition showed an average correlation of >0.8 (Fig. 1E), indicating high reproducibility in both phage infection and SEC fractionation, with most of the SEC-profile peaks overlapping within 1–2 fractions (<0.250 μL). To test the resolution achievable with our chromatographic separations, we calculated the number of SEC peaks per protein, which is a direct proxy for how many different complex assemblies a protein participates in. Approximately 45% of the identified proteins were detected in a single SEC-peak in each condition employed (Fig. 1F). While the presence of a single peak can represent detection of only a monomeric protein, we found the majority of these single-peak proteins (90/137 for ϕKZ, 75/110 for ϕPA3 and 1843/2382 for the PAO1 control) are not at their predicted monomeric molecular weight (Supplementary Fig. S2). This suggests that the protein complex assembly state of the PAO1 proteome was preserved during sample preparation and SEC fractionation.

A high-quality interaction dataset for bacterial protein complexes

Next, we sought to investigate the recovery of known protein complexes by leveraging the partial conservation of core molecular assemblies between P. aeruginosa and other bacteria, such as Escherichia coli, for which protein complexes are more extensively annotated²³. To visualize our data, we utilized the KZ-resistant mutant dataset and projected it using t-SNE (Fig. 2A). In this dimensionality reduction approach, neighboring points in the embedded space are derived from proteins sharing similar protein profiles, while distinct profiles results in distant points. However, due to the non-linear nature of the t-SNE algorithm, the distance between clusters and the shape of the global or local embedding cannot be interpreted back to the input data. Smaller enzymes, such as metabolic enzymes, are usually co-expressed within the same operon²⁴ and have been reported to dimerize or multimerize. In line with this, we observed enzymes such as the pyruvate dehydrogenase complex (Fig. 2B) and the oxoglutarate dehydrogenase complex (Fig. 2C), which migrated at an estimated MW of ≈3.5 × 10⁶ Da (expected MW ≈3.75 × 10⁶ Da) and ≈2.4 × 10⁶ Da, respectively. It is important to point that out that the molecular weight estimation for these large assemblies is subject to error due to these peaks being outside the external calibration curve. To achieve MW estimation, we included in the calibration curve a pure SEC-separated 70S ribosome (Supplementary Fig. S3).

**Fig. 2: *Pseudomonas* protein complexes identified in the SEC-MS data.**

Our sample preparation also preserved membrane-bound complexes. As an example, the AAA protease complex, formed by four hexamers of the AAA protease (ftsH) and 12 copies of each single-pass membrane protein (HflK and HflC)²⁵, was recovered at high molecular weight in a broad peak, as shown in Fig. 2D. The large molecular weight range and sensitivity covered by our separation approach were also demonstrated in the recovery of more transient complexes such as the DNA polymerase III (dnaA, dnaE, and dnaQ) loaded with the γ complex (holA and dnaX) which plays a key role at the replication fork²⁶ (Fig. 2E). Finally, heterodimeric complexes such as the succinyl-coA synthetase were also recovered as demonstrated by the coelution plot in (Fig. 2F). Our manual inspections further confirm that prior knowledge can be easily incorporated into SEC-MS data analysis and allows for straightforward identification of protein complexes.

Comparison of host-targeted processes reveals conserved and divergent predation mechanisms

After having demonstrated the proteome depth achieved in our SEC-MS dataset and the recovery of known complexes, we turned our attention to how Jumbophages re-wire P. aeruginosa protein complexes by evaluating differences in SEC profiles upon phage infection. Variation in SEC profiles between conditions can arise from differential assembly state (i.e., a protein profile shifting to higher or lower molecular weight), different stoichiometry within a complex, or global alterations in protein abundance.

To quantify these different cases, we employed a previously described Bayesian analysis module from the PCprophet package¹² to derive marginal likelihoods (SEC differential score) of protein-level SEC changes between ϕKZ and ϕPA3 versus the receptorless infected samples (i.e., PAO1 control). Differential analysis of two SEC profiles using PCprophet provides the SEC differential score, which represents the variation of complex intensity (stoichiometry) or peak position (assembly state). Comparing the SEC-profile differences between phage-infected PAO1 and PAO1 control revealed approximately 600 proteins showing SEC variation upon infection by either phage (Fig. 3A). Notably, there is substantial consistency in which P. aeruginosa proteins are altered between both phages and the degree of change in their individual SEC profiles (Fig. 3B, cor = 0.677), potentially pointing towards common pathways and complexes hijacked by ϕPA3 and ϕKZ for successful predation. When compared to an independent whole-cell lysate-proteome protein abundance measurement of the same cell lysate, we find that most of the changes at the assembly state level do not have a corresponding variation in protein abundance at the global proteome level. Altogether this observation suggests that SEC-MS offers an orthogonal view on the effect of perturbations, such as infection, on the proteome (Fig. 3C).

**Fig. 3: Differential analysis of SEC-MS data.**

To identify conserved KZ-like jumbophage manipulation of the host interactome, we mapped the SEC-derived PAO1 interaction network (Fig. 3D) with the correspondent protein-level differential data derived from the comparison between phage and uninfected samples. Although a large portion of the nodes do not have a functional annotation, we identified several functional classes where their components were significantly altered upon Jumbophage phage infection, as depicted in Fig. 3E. For example, the biofilm formation pathway (KEGG id:pae02025) was enriched in both phage infected samples (q≤0.01). Several prior studies have highlighted the role of phages in regulating the formation of biofilms via modulation of polysaccharide production and perturbation of cell envelope biology^27,28. We identified multiple proteins in this category having significantly decreased abundance in the high molecular weight region compared to their uninfected counterpart (Fig. 3F), suggesting a lower assembly state or complex reduction upon infection. Specifically, we observed >2-fold reduction in pslD, pslE, and pslG in the high-molecular-weight region. These proteins are members of a complex spanning from the inner membrane (pslG) to the outer membrane (pslD), which is required for the biosynthesis of exopolysaccharide²⁹. Additionally, the uncharacterized proteins PA3346, PA2366, and PA1667, display a broad coelution profile across the molecular weight dimension in the control sample, which is typically associated with membrane proteins¹². Notably, these peaks are largely depleted (>3-fold reduction) in the infected condition. Outer membrane proteins were particularly affected by Jumbophage infection with porins and multi-drug efflux proteins (KEGG pae02010: ABC transporters), displaying a significant reduction in interactions. For example, the MexAB–OprM complex, a key efflux pump³⁰, shows an almost complete reduction of the fully assembled complex (Fig. 3G). Importantly the MexAB-OprM complex was previously shown to be targeted by a Jumbophage closely related to ϕKZ, called OMKO1³¹, potentially representing a secondary receptor-binding site for ϕKZ-like Jumbophages.

It is important to point out that changes we observed could either be beneficial for the phage to overcome its host, a host response to limit phage development, or simply be the result of pleiotropic regulators.

Organization of ϕKZ-like Jumbophage viral interactomes

The remodeling of host protein complexes can be the result of indirect rewiring of host cellular processes or direct interactions with phage proteins. Thus, we next investigated interactions directly involving phage proteins, including complexes containing both phage–host and phage–phage interactions.

Following SEC-MS and PPI prediction, we defined high-confidence interactions as those with a probability score of ≥0.75. In total, we identified 292 interactions between pairs of ϕKZ viral proteins and 6550 HPIs between ϕKZ and PA01 proteins. ϕPA3 showed a similar trend with 145 viral-viral and 3979 host–pathogen protein interactions (Fig. 4A). Topological analysis of these networks revealed a scale-free architecture (Fig. 4B), in line with previous reports that SEC-MS-derived networks present the same architectural features as networks derived from literature curated studies and large PPI databases^12,32,33. It has been observed with smaller phages that genes within the same operon are often functionally related³⁴. Accordingly, we evaluated the distribution of our predicted PPIs (by SEC-MS) in phage-infected PAO1 cells as a function of the genomic separation of their corresponding genes. Here, we find a wide variation in the genomic distance between phage proteins that interact with other phage proteins (i.e., phage-phage interactions) (Fig. 4C), with some genes being separated by distances as large as 139 kb. For example, the two RNA polymerases in ϕKZ are both composed of proteins expressed in different operons with a max distance of 112.761 kb (PHIKZ080–PHIKZ180 in the vRNAp). Thereby, our resulting PPI distribution confirms the general lack of synteny within the genomes of ϕKZ-like Jumbophages and shows the SEC-MS approach is a particularly advantageous technique to query phage-encoded protein complexes, agnostic to the overall genome organization (i.e., a guilt-by-association approach at the protein level).

**Fig. 4: Comparative analysis of ϕKZ and ϕPA3 interaction networks.**

Data-driven identification of ϕKZ-like Jumbophage protein complexes

The identified interactions allowed us to recapitulate several known complexes in the Jumbophage proteome, despite a limited number being described at present. For example, we recovered the non-virion-associated RNA-polymerase³⁵ migrating at its expected molecular weight (apparent MW 271 kDa, correct MW ≈ 265 kDa) (Fig. 4D) as well as the virion-associated RNA polymerase³⁶ (apparent MW 300 kDa, correct MW ≈ 297 kDa) as shown in Fig. 4E. Previously described interactions between the phage and the host were also recovered by our approach, such as the interaction of PHIKZ037 with the RNA degradosome, which is involved in the accumulation of viral RNA³⁷ (Fig. 4F).

Building on the recovery of known phage protein complexes and the presence of several phage peak groups at the high-molecular-weight (Supplementary Fig. S4), we sought to probe our dataset for previously undescribed shell-associated proteins. Only two proteins so far have been identified as fundamental for shell formation and function: the major shell protein PhuN^38,39 (gp54 in ϕKZ; gp53 in ϕPA3, gp105 in 201-ϕ2-1), which is the main building block of the shell complex^15,17,40, and the bipolar tubulin spindle protein phuZ which serves to stabilize the shell in the center of the bacterial cell⁴¹. Because the capsid docks on the shell prior to tail attachment and lysis¹⁸, we are unable to differentiate phage proteins contained within the capsid from those that are ejected and associated with the shell by apparent increases in molecular mass alone. To distinguish between these two cases, we performed two orthogonal control experiments using a cesium-purified virion sample and a shell-enriched sample (Supplementary Fig. S5A), which we used to filter the SEC-MS interactors to only proteins enriched in the shell sample and absent in the virion (Supplementary Fig. S5B, C). The six remaining proteins (PHIKZ_p22, PHIKZ036, PHIKZ111, PHIKZ_p64, PHIKZ232, and PHIKZ261) were then tested for association with the shell via fluorescence microscopy using PAO1 expressing mNeonGreen tagged constructs. Overexpression of most constructs resulted in diffuse localization outside of the phage shell (Supplementary Fig. S5D), with gp36-mNG and p64-mNG displaying filaments or puncta that were sometimes peripheral to the phage shell, but time-lapses revealed them to be mobile throughout the cell (see Supplementary Movies 1 and 2). Of note, these results do not conclusively exclude these proteins as potential shell components. Further confirmation of these results would be needed to address potential technical issues, such as disruption of protein localization by tagging or over-expression, shell association at an earlier or later stage of infection, or difficulty in detecting transient interactions. In addition, large (>MDa) and intact shell fragments have been shown to be mostly insoluble³⁹, likely resulting in the loss of many shell fragments prior to SEC and tightly bound shell-associated proteins as a result.

We then turned our attention to interactions between phage and host complexes. Interestingly, several ϕKZ proteins (PHIKZ005, PHIKZ108, PHIKZ285, PHIKZ286, PHIKZ299, and PHIKZ_p51) were predicted by our deep learning tool to be in complex with the fully assembled P. aeruginosa 70S ribosome (Fig. 4G). A recent preprint utilizing a low-resolution fractionation technique (Grad-seq) showed the presence of multiple proteins in the ribosome⁴². Notably, our co-fractionation experiments recovered most of them, albeit at a lower prediction confidence (0.5) than the one utilized to threshold the data (0.75).

To validate these ϕKZ proteins as ribosomal interactors, we performed cross-linking mass spectrometry (XL-MS)⁴³ on a pooled sample from the SEC fractions corresponding to the 70S ribosome (Supplementary Fig. S3). We identified 975 crosslinks in total (202 inter-links and 871 intra-links), covering several previously reported bacterial protein complexes (Supplementary Fig. S6). The XL-MS data recovered 24 P. aeruginosa ribosomal proteins (separated in 30S and 50S) of which 3 showed physical interaction with 5 ϕKZ proteins. Amongst these phage proteins, we recovered PHIKZ285, PHIKZ286, and PHIKZ108, which were predicted from the SEC-MS data to be in complex with the 70S ribosome. Moreover, we identified PHIKZ_p08 and PHIKZ175 as additional ribosomal interactors (Fig. 4H). PHIKZ286 bound the L1 ribosomal stalk (rplL, rplK, and rplJ), which has an important role in tRNA translocation⁴⁴ and is the contact site for several translation factors⁴⁵. PHIKZ_p08 interacted with rplN bound to its ribosome silencing factor rsfS, which slows down or represses translation⁴⁶. Finally, PHIKZ285, PHIKZ175, and PHIKZ108 were bound to rpmC, which is an accessory protein positioned near the exit site and required for triggering nascent polypeptide folding⁴⁷. These findings demonstrate the power of SEC-MS to detect HPIs involved in critical aspects of host biology, however, further mechanistic characterization is needed to determine if such phage proteins manipulate host ribosomes or instead represent the active translation of the phage proteins.

Identification of previously undescribed phage proteins by SEC-MS

The multiplexed nature of DIA allows unbiased sampling of the full precursor space⁴⁸, so we queried our data for the presence of peptides from previously undescribed phage proteins using a custom protein FASTA built with EMBOSS. We detected 4 previously undescribed proteins for ϕKZ (2 forward and 2 reverse ORFs) and 11 for ϕPA3 (8 forward and 3 reverse) (Fig. 5A, B). The authenticity of these previously undescribed proteins is supported by the detection of two or more proteotypic peptides for nearly all proteins (Fig. 5C) and reproducible detection of the same peptides in 15 or more consecutive fractions across independent experiments (Fig. 5D). All of these proteins showed reproducible quantitation between biological duplicate experiments (n = 72 per replicate), with an average protein-level correlation of 0.75 for ϕPA3 proteins and 0.82 for ϕKZ proteins (Fig. 5E). Most of these proteins migrated at a higher molecular weight than their predicted molecular weight, suggesting they may be associated with high-order assemblies (Fig. 5F). Some of the previously undescribed ORFs are further supported by a great degree of sequence overlap with homologs. The most staggering example is the ϕPA3 reverse sense ORF 56450–58417, which shows >70% sequence similarity with previously reported proteins from various Pseudomonas spp. phages (ϕKZ, Psa21, Phabio, 201ϕ2-1, and PA1C) as shown in Supplementary Fig. S7A. Interestingly, all proteins showing ≥50% identity to 56450-58417 are previously reported or proposed phage RNA polymerase components (RNAP), such as PHIKZ074 (non-virion associated RNAp, UniprotID Q8SD88)^36,49,50. To date, there is no experimental evidence of a nvRNAP in ϕPA3. To derive other putative members of this complex, we extracted the predicted interactors of ORF 56450–58417 (Fig. 5G) and performed BLASTp analysis to identify proteins showing homology to other Jumbophage RNA polymerase components. From this analysis, we selected four interactions, namely PHIPA3055, PHIPA3063, the previously undescribed ORFs 53811-55010, and 55491–56444. Recent work also detected these two ORFs, giving increased confidence in our findings⁵¹. Further manual curation of the genome file for ϕPA3 revealed an intron sequence between 53811–55004 and 55455–56444, leading to a single protein. Hence we renamed the 53811–55010 and 55491–56444 ORFs as 53811–56444.

**Fig. 5: Identification of previously undescribed phage proteins.**

The ORF 56450–58417 interacting proteins show >50% conservation with multiple Jumbophage proteins annotated as RNAP components (Supplementary Fig. S7B–D). Specifically, we identified homologs of both the \({\beta }{{\prime} }\) polymerase subunit (PHIPA3055 and ORF 56450–58417) as well as homologs of the β subunit (ORF 53811–56444). The ϕPA3 protein PHIPA3063 displays 57% sequence similarity to PHIKZ068, an essential nvRNAp component that lacks structural similarity to known components of previously reported RNA polymerases⁴⁹. Utilizing the position of the SEC peak, we estimated the nvRNAp MW in ϕPA3 to be ≈321 kDa (Fig. 5G). Assuming the lack of homodimers in the structure, the predicted MW for these four proteins was ≈283 kDa, suggesting a putative missing subunit. Of note, we did not identify 56450–58417 interactors corresponding to PHIKZ123, another β subunit component, which could explain this observation. To explore the possibility of these proteins (PHIPA3055, PHIPA3063, ORF 53811–56444, and ORF 56450–58417) folding into an RNA polymerase-like assembly, we performed structural prediction of this peak group using AlphaFold2 multimer⁵². We aligned the best scoring model (ipTM + pTM = 0.86, Fig. S8) to the reported structures for the ϕKZ nvRNAP (PDB 7OGP https://www.rcsb.org/structure/7OGP and 7OGR https://www.rcsb.org/structure/7OGR)⁵⁰ as depicted in Fig. 5H. We reached a template modeling (TM) score of 0.72 using US-Align⁵³ and an average RMSD of 0.624 Å using MatchMaker⁵⁴ between our proposed ϕPA3 vRNAp and the ϕKZ RNAp (70GR), indicating a shared tertiary structure similarity between these two assemblies. As we obtained low distances for the β and \({\beta }{{\prime} }\) subunits, we set to investigate the misaligned region at the C-term of the polymerase clamp (PHIPA3063 in ϕPA3 and PHIKZ068). Despite showing high sequence homology (68%), these two proteins share a large intrinsically disordered region (IDR) in the middle of the sequence (275-293 aa for PHIKZ068 and 277-301 aa PHIPA3063) as shown in Supplementary Fig. S9. The IDR likely enables flexibility in the central region, resulting in varied orientations for the folded C-term in PHIPA3063 following AlphaFold predictions.

Discovery and validation of previously undescribed injected phage proteins

Although it is well-known that ϕKZ phages guard their genome from nucleolytic host-immune systems by building a proteinaceous shell¹⁵, this structure is only visible after 20 min of infection. Little is known about how the phage genome is protected or packaged prior to shell assembly. To identify phage proteins proximal to the genome, with possible protective functions, we first determined a detailed virion proteome to allow distinction of virion proteins (injected) from newly synthesized proteins. We performed cesium chloride-gradient purification of ϕKZ coupled with deep peptide fractionation and long chromatographic acquisition (see Supplementary Methods for details). The 245 ϕKZ proteins identified in this dataset encompassed ≥90% of previously reported head proteins⁵⁵ (Supplementary Fig. S10A). To account for low-level contamination caused by cesium chloride-fractionation, we compared our enriched virion sample with the previously reported virion proteins to derive a ROC curve, which we used to select an intensity threshold maximizing recall of known virion proteins and minimize the false-positive rate (Fig. 6A). This filtering identified 81 significantly enriched proteins in total. This included 58/61 (95% recall) of those previously reported and added 23 proteins to the virion composition, which are strongly enriched over their corresponding protein abundance in a non-enriched sample (Fig. 6B). This drastic increase in protein number is dependent on the increased sensitivity sequencing speed of the MS utilized for acquisition as well as extensive offline sample fractionation prior to MS acquisition (see Supplementary Fig. S.10B).

**Fig. 6: Data-driven analysis of injected inner body proteins.**

As prior work reported extensive gp175-driven proteolysis of the head and inner body (IB) proteins⁵⁵, we searched our purified virion data for evidence of unexpected termini (i.e., semi-tryptic peptides) with the goal of confirming prior reported cleavages and potentially identifying novel ones. We recovered 63 semi-tryptic peptides, of which 15 could be mapped to prior data⁵⁵ (≈ 40% overlap). Within our semi-triptic peptides, we identified 20 cleavages corresponding to the reported IB proteins (gp93/95/97) and 12 mapping to 9 unreported proteins. Of note, 8 cleavages could be mapped to gp94, gp177, and gp303 (see Supplementary data and Supplementary Fig. S10C). To identify consensus sequences within our set of IB interactions, we performed motif enrichment analysis using STREME⁵⁶. We found that the LSxE consensus motif was enriched (BH-adjusted p = 6e⁻⁵), corroborating the previously reported S/A/G-X-E motif while providing additional specificity in the P2 position (Fig. 6C).

Starting with only the virion proteome, we next queried the ϕKZ interaction network to identify putative injected proteins (Fig. 6D). Building on this data, we selected the interactors of the previously reported proteins (gp94, gp153, gp162, gp163, and gp177) for additional validation using our previously reported assay for evaluating injection¹⁹. In this assay, PAO1 cells expressing the protein of interest with a FLAG tag are infected with wild-type ϕKZ, resulting in phage particles with labeled proteins. These phage particles are then used to infect gentamicin-treated PAO1 cells (i.e., cells where the translation is inhibited) allowing to evaluate injection using a western blot. By further lowering the interaction thresholds to positive predicted interactions (i.e., PPI probability ≥0.5 instead of 0.75 utilized to select high-confidence interactors), we further identified gp184 as an IB interactor and validated it as an injected protein. These experiments confirmed the injection of the previously reported IB proteins (gp93, gp95, gp97) and further validated the injection of most of their interactors (gp94, gp153, gp163, gp177, and gp184) as showcased in Fig. 6E. We note that among the 3xFLAG-tagged virion proteins tested in this study, the injection profile of inner body protein PHIKZ090 is inconsistent with a previous report¹⁹. In this prior study, it was reported that PHIKZ090 tagged at the C-terminus with mNeonGreen is injected into PAO1 cells upon infection by PHIKZ (as detected by fluorescence microscopy). We do not detect injection of PHIKZ090-3xFLAG via western blotting. The reasons for this discrepancy are unclear and may result from weaker expression of this construct (as compared to the controls PHIKZ093-3xFLAG and PHIKZ089-3xFLAG) and lower levels of packaging in the virion. As a consequence, the amount of PHIKZ090-3xFLAG injected into PAO1 cells is reduced, likely to levels that are below the detection limit of the western blot assay. We acknowledge that poor expression of certain constructs is a limitation of this assay. Here, by using a highly sensitive MS of the virion combined with SEC-MS, we identify and validate the injection of eight proteins (three previously reported) that are highly abundant, found in the virion, and interact with the previously reported IB proteins. Overall, these proteins give us a starting point to unravel the interactome of the ejected phage genome and identify proteins that protect the genome from host nucleases.

Discussion

Understanding the dynamics driving host and pathogen interactions and their dynamics upon infection is a crucial component to deepening our knowledge of the mechanisms regulating infection progression and outcome. To date, most proteomics studies of infectious diseases focused on the analysis of a few pathogen proteins by tag/antibody-based purification or the measurement of protein abundance variation in infected samples. Yet it is widely known that the pathogen proteome works as an ensemble through protein–protein interactions to hijack the host cell, which in turn regulates both expression and interaction between host proteins. Hence, a system-wide view of the intrinsic modularity of the pathogen proteome and how it quantitatively regulates host complexes is key to understanding pathogenic mechanisms at the molecular level.

In this study, we demonstrate the application of SEC-MS to systematically investigate pathogen proteome organization and host interactome plasticity upon Jumbophages infection of P. aeruginosa. ϕKZ-like phages (specifically ϕKZ and ϕPA3) are potent killers of P. aeruginosa (with a broad host range), making them timely alternatives to antibiotics with many ϕKZ-like phages already in clinical trials to treat bacterial infections. By obtaining an atlas of these phage interactomes, we can begin to construct a mechanistic understanding of the ϕKZ-like Jumbophage infections, ranging from viral composition to protein injection, transcription, and phage nucleus assembly and growth.

Our ϕKZ-like phage interactomes recapitulated prior evidence for the subdivision of Jumbophage proteomes into distinct assemblies, such as virion and non-virion-associated RNA polymerases, as well as the interaction with key host complexes such as the RNA degradosome. We expanded our knowledge on the phage interactions with essential host processes such as translation, where we identified phage proteins interacting with the ribosomal stem and ribosomal silencing factors. Moreover, while the lack of immediate genome organization hinders the prediction of functions for phage proteins, the deep coverage and unbiased nature of SEC-MS data offers a straightforward approach to identifying previously undescribed complexes and proposing putative functions. As an example, by using SEC-derived interactors of a de-novo predicted ϕPA3 protein (ORF 56450–58417), we identified a heterotetrameric assembly which is predicted to have strong structural homology to the reported nvRNAP in ϕKZ. This suggests that the unbiased nature of SEC-MS data allows for not only the discovery of an uncharacterized protein but also enables to probe of its putative function through the detection of new protein–protein interactions. Identifying such complexes will enable further investigation using structural and biochemical approaches. In addition to the identification of interactions, these maps offer the opportunity to further quantify host interactome remodeling and disentangle variation in expression from the assembly state. By comparing the P. aeruginosa interactome between infected and uninfected, we observed a large degree of changes during infection, with perturbation of similar complexes between the two Jumbophages suggesting conserved mechanisms of phage predation. While here we have a first draft of the KZ-like Jumbophage interactome, it is important to acknowledge the trade-off between specificity and throughput in interaction identification in SEC-MS, which we mitigated by utilizing only high-confidence interactions for analysis. This step, while increasing the confidence in our PPIs identification, still does not allow for the complete removal of false-positive results due to the lack of a real ground-truth dataset, which is to be expected in large-scale fractionation experiments. Advances in deep learning models for prediction of interactions from co-fractionation mass spectrometry data and integration of orthogonal features (besides the coelution itself), such as predicted structure or function, are expected to improve prediction accuracy and reduce the false discovery rate for uncharacterized proteomes. Overall, the characterization of host-pathogen molecular networks remains challenging, but we provided the first interactome-wide study of infection progression using two models of ϕKZ-like phages in P. aeruginosa.

Wider application of SEC-MS is expected to significantly accelerate the characterization of pathogenic mechanisms by providing proteome-wide insights into the physical association between host and pathogen complexes, thus enabling the identification of novel druggable targets, host vulnerabilities, or guidance in the development of new biologicals.

Methods

Cloning

C-terminal 3xFLAG fusions of ϕKZ IB proteins pHERD30T plasmids encoding C-terminal 3xFLAG fusions of ϕKZ IB proteins (PHIKZ089, PHIKZ090, PHIKZ093, PHIKZ095, PHIKZ097, and PHIKZ162) were cloned as follows: the plasmids pHERD30T-IB-mNeonGreen (PHIKZ089, PHIKZ090, PHIKZ093, PHIKZ097, PHIKZ162) were digested with restriction enzymes XhoI and KpnI (upstream and downstream of the mNeonGreen sequence) to generate a pHERD30T-IB____ backbone (IB: PHIKZ089, PHIKZ090, PHIKZ093, PHIKZ097, PHIKZ162). An insert sequence corresponding to XhoI-GGGGS-3xFLAG-KpnI was digested with XhoI and KpnI to generate an insert fragment that was ligated individually with each corresponding backbone using T4 DNA ligase to generate the plasmids ((see: Supplementary Table 1 for a list of plasmid and Supplementary Table 2 for list of primers utilized). The pHERD30T plasmid encoding a C-terminal 3xFLAG fusion of PHIKZ095 was generated as follows—the plasmid pHERD30T -PHIKZ090-3xFLAG was digested with restriction enzymes SacI and XhoI to generate a pHERD30T____ -3xFLAG backbone. The plasmid pHERD30T-PHIKZ095-mNeonGreen (sequence: Supplementary Table 1) was digested with SacI and XhoI, and the smaller fragment corresponding to SacI-PHIKZ095-XhoI was extracted by DNA Gel extraction to obtain an insert fragment. The insert fragment was ligated with the pHERD30T____-3xFLAG backbone using T4 DNA ligase to create the plasmid pHERD30T-PHIKZ095-3xFLAG. All plasmid sequences were checked for correct assembly upstream and downstream of the insert sequence with Sanger Sequencing (Quintara Biosciences) using sequencing primers QB0068 (5′-ATGCCATAGCATTTTTATCC-3′) and QB0049 (5′-CCCAGTCACGACGTTGTAAAACG-3′). C-terminal 3xFLAG fusions of ϕKZ virion proteins: pHERD30T plasmids encoding C-terminal 3xFLAG fusions of ϕKZ proteins (PHIKZ030, PHIKZ_p29, PHIKZ092, PHIKZ094, PHIKZ129, PHIKZ153, PHIKZ157, PHIKZ163, PHIKZ177, PHIKZ184, PHIKZ203, PHIKZ244, PHIKZ303) were generated as follows: the plasmid pHERD30T-mNeonGreen -3xFLAG was linearized via PCR using primers pHERD30T-mNG-3xF_F, pHERD30T-mNg-3xF_R to generate a pHERD30T____-3xFLAG backbone (removing the mNeonGreen sequence). The genes encoding ϕKZ proteins were amplified from purified ϕKZ particles (removing the stop codon) via PCR using the corresponding primers to generate insert fragments. These insert fragments were individually assembled with the pHERD30T____-3xFLAG backbone to generate the plasmids. All plasmid sequences were checked for correct assembly upstream and downstream of the insert sequence with Sanger Sequencing (Quintara Biosciences) using sequencing primers QB0068 (5′-ATGCCATA GCATTTTTATCC-3′) and QB0046 (5′-TGTAAAACGACGGCCAGT-3′). Benchling files containing sequences of all constructs, attached primers, and sequencing files are reported in Supplementary Data Files.

Plaque assay

Plaque assays were conducted at 30C with solid LB agar plates. Totally, 150 μL of overnight bacterial culture was mixed with 3 mL top agar (0.35% LB-Agar, 10 mM MgSO₄) and plated on bottom Agar (20 mL LB-Agar, 10 mM MgSO₄). Phage lysates were diluted 10-fold, and 2 μL spots were applied to the top agar after it had been poured and solidified.

Bacterial culture

P. aeruginosa strains PAO1 were grown overnight in 3 mL LB at 37 °C with aeration at 175 rpm. Cells were diluted 1:100 from a saturated overnight culture into 100 mL LB with 10 mM MgSO₄ and grown for ≈2.5 h at 37 °C with aeration at 175 rpm. At OD600 nm = 0.5–0.6 (≈3e⁸ CFU/mL), the cell cultures were infected with bacteriophage (ϕKZ or ϕPA3; MOI ≈ 1) on ice for 10 min (to allow complete adsorption of virions onto cells) and then incubated at 30 °C for 50 min (total time of infection 60 min). Thereafter, the cell cultures were transferred to pre-chilled 50 mL falcon tubes and centrifuged at 6000xg, 0 °C for 5 min. The supernatant was discarded, and cell pellets were washed twice with 5 mL ice-cold LB and combined.

After the final wash, the bacterial pellets were resuspended in a 5 mL ice-cold LB. The concentrated cell culture was flash-frozen in liquid nitrogen and subsequently mechanically lysed using a SPEX-freezer mill.

Shell isolation via density centrifugation

The shell isolation was performed as we previously reported³⁹. Briefly, we infected P. aeruginosa PA01 with ϕPA3 or ϕKZ for 60 min. The bacteria were mechanically lysed via Dounce homogenization in NP40 Lysis Buffer (50 mM Bis–Tris, 150 mM NaCl, 0.5% NP40, 5% glycerol, 5 mM DTT, 20 ng/μl Lysozyme, 1 mM EDTA, 1 mM EGTA—pH 6.5). The lysate was clarified at 16,000×g for 5 min, and the insoluble fraction was resuspended in wash buffer (20 mM Bis–Tris, 150 mM NaCl, 1 mM DTT, 1 mM EDTA, 2 mM MgCl₂—pH 6.5). This was subject to further 500×g (5 min) and 15,000×g (10 min) centrifugation with the insoluble fraction isolated and resuspended in wash buffer each time. The insoluble fraction of the last 15,000×g spin was retained as the final product. The shell-enriched sample was acetone precipitated using eight volumes of ice-cold acetone and incubated overnight. Following incubation, the protein pellet was washed thrice with ice-cold acetone and dried under a vacuum.

Cesium gradient purification of phage virions

Bacteriophages (ϕKZ or ϕPA3) were propagated in LB at 37 °C with PAO1 as a host. Liquid growth curve experiments were used to ascertain the MOI of bacteriophage stock needed to ensure complete lysis of the bacteria following a substantial growth as ascertained by OD600 measurement. Growth curve experiments were carried out in a Synergy H1 micro-plate reader (BioTek, with Gen5 software). Cells were diluted 1:100 from a saturated overnight culture with 10 mM MgSO₄. Diluted culture (140 μl) was added together with 10 μl of 10× serial dilutions of bacteriophage stocks to wells in a 96-well plate. This plate was cultured with maximum double orbital rotation at 37 °C for 24 h with OD600 nm measurements every 5 min. Thereafter, the bacteriophage stock was added at the appropriate MOI to a 1:100 back-dilution of a saturated PAO1 overnight culture in 100 mL LB with 10 mM MgSO₄, and the bacterial culture incubated for 24 h (37 °C with aeration, 175 rpm). Totally, 5 mL of chloroform was added to the cultures in a fume-hood, and the cultures were incubated with chloroform for 15 min (37 °C, 175 rpm) to ensure maximum lysis of bacterial cells. The cell cultures were transferred to 50 mL falcon tubes and centrifuged at 6000×g for 15 min to pellet bacterial debris. The supernatant (containing bacteriophages in high titer) was carefully transferred to a fresh set of 50 mL falcon tubes and centrifuged and 6000×g for 15 min to pellet any residual bacterial debris. The supernatant was transferred to fresh 50 mL falcon tubes with 2 mL chloroform. To obtain high-purity virion particles, a previously described protocol was followed⁵⁷. The virions from the bacterial cell lysate were concentrated by slow stirring overnight at 4 °C in 1 M NaCl and 10% PEG (final concentration) and then pelleted (11’300×g, 4 °C, 30 min). Pellets were resuspended in 20 ml of SM buffer (50 mM Tris-HCl (pH 7.5), 100 mM NaCl, 8 mM MgSO₄, 0.002% gelatin) containing Complete Protease Inhibitor (Roche). The phage suspension (5.8 mL/tube) was layered onto CsCl step gradients composed of the following concentrations of CsCl: 1.59 g/ml (0.75 ml), 1.52 g/ml (0.75 ml), 1.41 g/ml (1.2 ml), 1.30 g/ml (1.5 ml) and 1.21 g/ml (1.8 ml). The buffer used throughout the gradient was 10 mM Tris-HCl (pH 7.5) and 1 mM MgCl₂. Tubes were spun at 31,000 rpm for 3 h at 10 °C in an SW41 rotor (Beckman Coulter ultracentrifuge), and the resulting phage band had a buoyant density of 1.36 g/ml. This fraction was collected and dialyzed against three changes of 50 mM Tris-HCl and 10 mM MgCl₂ at 4 °C. This ultra-purified phage stock was diluted in SM buffer, and its titer was assessed using plaque assays. Finally, the phage virion stock was acetone precipitated using eight volumes of ice-cold acetone.

Bacterial infection and SEC sample preparation

Cryomilled samples were resuspended in ≈4 ml of SEC running buffer (50 mM ammonium bicarbonate and 150 mM NaCl pH 7.4) supplemented with protease inhibitors (Roche) and ultracentrifuged at 60,000×g for 30 min at 4 °C. The supernatant was concentrated to 100 μL using a 100 kDa molecular weight cutoff filter to simultaneously enrich for high-molecular-weight assemblies and deplete monomeric proteins. The concentrated sample was centrifuged once more at 10,000g at 4 °C to remove particles.

Size-exclusion chromatography

Approx 1000 μg per sample (≈80–90 μL as estimated by Bradford’s assay) were separated on an Agilent Infinity 1260 HPLC operating at 0.5 mL/min in SEC running buffer with a Phenomenenex SRT-C1000 column connected and cooled at 4 °C. Seventy-two fractions of 125 μl were collected after 3.75 ml until 13 ml and the column was then washed with 2 column volumes (18 mL) of SEC buffer. The MW was estimated using a protein mixture (Phenomenex AL0-3042), while an E. coli 70s ribosome (NEB, cat nr P0763S) was used to estimate which fractions to use for ribosome XL-MS.

SEC-MS proteomics sample preparation

The SEC samples were prepared as we previously reported⁵⁸ using a 96-well filter-aided sample preparation (FASP). The FASP filters were conditioned by washing twice with 100 μL of ddH₂O. SEC buffer was removed by centrifugation (1800×g 1 h), and proteins were resuspended in 50 μL of TUA buffer (TCEP 5 mM, Urea 8 M, 20 mM ammonium bicarbonate) and incubated on a thermos shaker (37 °C, 400 rpm) for 30 min. Cysteine residues were then alkylated by the addition of 20 μL CAA buffer (Chloroacetamide 35 mM, 20 mM ammonium bicarbonate) for 1 h at 25 °C in the dark. TCEP and CAA were removed by centrifugation (1800×g, 30 min), and filters were washed 3 times with 100 μL of 20 mM ammonium bicarbonate. Proteins were digested in 50 μL of 20 mM ammonium bicarbonate with 1 μg of tryspin per fraction. A 96-well receiver plate (Nucleon, Thermo-Fisher) was used to collect the peptides by centrifugation for 30 min at 1800×g. The filter plates were washed once with 100 μL of ddH₂O and centrifuged to dryness (1800×g, 60 min). The peptides from the receiver plate were transferred to protein LoBind tubes (Eppendorf), and the corresponding well was washed with 50 μL of 50% acetonitrile (ACN) in ddH₂0 to increase the recovery of hydrophobic peptides. The combined resulting peptides per each fraction were vacuum dried and stored at −80 °C until MS acquisition. For each phage, 5 μL from each fraction were pooled together to generate a phage-specific library. Each sample-specific library was prepared on a C18 spin column (Nest). Following activation of the column with 1 column volume (CV) 100% ACN and wash with 2 CV of 0.1% formic acid, the peptides were bound to the column and eluted using a step-wise gradient of ACN from 5 to 25 (5% increases) in 0.1% triethylamine to account for the increased hydrophobicity of the XL peptides compared to not modified ones. A final fraction at 80% ACN was added to recover hydrophobic peptides.

Proteomics sample preparation for virion-enriched protein pellets

Dried proteins were resuspended in 100 μL of 8M urea, 100 mM ammonium bicarbonate (ABC) pH 8.1. TCEP (Thermo Fisher) was added to 5 mM final concentration, and the samples were incubated at room temperature for 30 min. Reduced cysteines were alkylated with 10 mM chloroacetamide (CAA) for 30 min in the dark. Following alkylation, the urea was diluted to 1 M with 100 mM ABC and the proteins were digested with 2 μg of trypsin per sample for 14 h at 37 °C in a thermo-shaker (600 rpm). Digestion was stopped by acidification using 10% formic acid (FA), and the samples were desalted using a C18 spin column (Nest group). Briefly, columns were activated using 1 column volume (CV) of ACN and then equilibrated with 2 CV of 0.1% FA. Peptides were loaded twice and then washed with 3 CV of 0.1% FA. Elution was done using 0.5 CV of 50% ACN 0.1% FA and repeated twice. Samples were dried under vacuum and stored at −80 °C until acquisition.

Crosslinking MS sample preparation

ϕKZ infection and SEC separation were performed as described above. Following separation, the SEC fractions corresponding to the 70S ribosome peak (F33–F38) were pooled. The was crosslinked for 1 h at RT using 5 mM DSSO from a freshly prepared 30 mM stock in water-free DMF. The reaction was quenched by the addition of ABC to 50 mM for 30 min at RT, and the proteins were precipitated using 8 volumes of ice-cold acetone. Following overnight incubation, pellets were washed 5 times with 8× volumes of ice-cold acetone and briefly dried under a vacuum. The pools were reconstituted in 8 M urea, 100 mM ABC and 5 mM TCEP and incubated for 30 min at RT. CAA was added to 10 mM final concentration, and the samples were incubated in the dark for 1 h. Urea was diluted to 1 M by the addition of 100 mM ABC, and the proteins were digested overnight with 2 µg of trypsin in a thermo shaker at 30 °C. Samples were acidified with 10% TFA, and high-ph tip fractionation was performed as we previously described⁵⁸. Briefly, following activation, equilibration, and washing of the C18 resin, the elution was done using a step-wise gradient of ACN from 10 to 40 (5% increases) in 0.1% triethylamine to account for the increased hydrophobicity of the XL peptides compared to not modified ones. The resulting fractions were dried under a vacuum.

SEC-MS and spectral library acquisition

Samples were resuspended in buffer A (0.1% FA), and approximately 200 ng were analyzed by DIA-PASEF on a Bruker TimsTOFpro interfaced with a Ultimate3000 UHPLC. For the SEC-MS experiment, the peptides were separated on a PepSep column (15 cm, 150 µm IID) using a 38-min gradient at 0.6 μl/min. Following loading, the peptides were eluted for 20 min with a 5% to 30% B (0.1% FA in ACN) in 20 min. The column was then washed for 5 min at 90% and high flow (1 μl/min) and re-equilibrated at 5% ACN for the next run. The peptides were sprayed through a 20 mm ZDV emitter kept at 1700 V and 200 µC. The mass spectrometer was operated in positive mode using DIA-PASEF acquisition⁵⁹. Briefly, 4 PASEF scans (0.85 1/K0 to 1.30 1/K0) were acquired and divided each precursor range into 24 windows of 32 Da (500.7502–966.67502 m/z) overlapping 1 Da. Each of the fractionated samples (phage-specific libraries) was acquired in DDA-PASEF using a similar gradient composition except for the elution, which was performed in 90 min leading to a 120 min gradient. For DDA-PASEF, the ion mobility window and precursor range were matched to the DIA boundaries to allow for seamless library building and search.

XL-MS data acquisition

The XL-MS samples were acquired on a Bruker TimsTOFpro interfaced with a Ultimate3000 UHPLC. The peptides were separated using a 118 minutes linear gradient. Following loading, the percentage of B (80% ACN in 0.1% FA) was increased from 2% to 8% in 5 min and then to 43% in 90 min. Residual peptides were eluted at 50% B for 10 min, and then the column was washed at 88% B for the remaining 13 min. The peptides were separated on a PepSep column (15 cm, 150 mm iid, 1.9 μm beads size). The mass spectrometer was operated in positive mode and data-dependent acquisition with the same source parameters as the SEC fractionated samples. To enrich for crosslinked peptides, a custom IM polygon was employed⁶⁰, and charge inclusion was enabled (3 + to8 + precursors). Precursors having nominal intensity above 20,000 were selected for fragmentation using an inverted collision energy of 23 eV at 0.73 1/k0 and 95 eV at 1.6 1/k0.

SEC-MS data analysis

The DDA files were searched within the Fragpipe toolkit using MSfragger⁶¹ v3.7 and the ‘DIA-speclib-quant’ workflow using the Pseudomonas aeruginosa pan proteome FASTA (5564 entries, proteome ID UP000002438, downloaded on the 05/22, https://www.uniprot.org/proteomes/UP000002438). For each phage, the correspondent FASTA nucleotide file was downloaded from GenBank (NC_004629.1 https://www.ncbi.nlm.nih.gov/nuccore/NC_004629 for ϕKZ and NC_028999.1 https://www.ncbi.nlm.nih.gov/nuccore/NC_028999 for ϕPA3), and EMBOSS was used for novel ORFs prediction (see ‘Prediction of novel ORFs’ section for details). The GenBank files were translated to protein level using BioPython and supplemented to the Pseudomonas FASTA. Carbamylation of cysteines was set as a fixed modification, while oxidation of methionine, N-term acetylation (peptide level), and pyro-glu formation were set as variable modifications. EasyPQP (https://github.com/grosenberger/easypqp, v 0.1.37) was used to generate a spectral library. Following phage-specific library generation, PAO1 precursors from all libraries were transferred to ensure the presence of the same PAO1 proteins with the same peptides across all DIA experiments using lowess for RT realignment. The DIA-PASEF data was searched with DIA-NN⁶² v.1.7.1 using a library-centric approach. Identified spectrum with MS1 precursors within 10 ppm and MS2 precursors within 15 ppm were selected, and a second library was generated (double-pass mode). Quantification was set to robust (high accuracy), and cross-run normalization was disabled.

XL-MS data analysis

XL-MS timsTOF files were converted to mgf using MSconvert v3.0.21072-998eff1c0 (developer build). MS1 peak picking was enabled, and the spectrum was denoised (top30 peaks in 100 m/z bins). Ion mobility scans were combined. Following the conversion, the peak files were searched in XiSearch⁶³ v1.7.6.7 using a fraction-specific FASTA containing only the protein ids identified by SEC-MS in the corresponding MW range. MS1 and MS2 tolerances were fixed to 10 and 15 ppm with 10 ppm of peptide tolerance. DSSO was selected as crosslinker (158.0037648 Da), and the correspondent oxidized, and amidated crosslinker was added as modifications. Link-FDR was fixed at 5% (boosted), and the resulting file were imported into XiView (https://xiview.org) for manual inspection of crosslinked spectrums.

Data analysis for DDA-purified virion samples

TimsTOF DDA files were searched in MSfragger using the LFQ-MBR workflow. Cysteine carbamylation was selected as a fixed modification, while N-term acetylation and deamidation were enabled as a variable modification with a max of 3 variable modifications per peptide. Peptides of lengths 7–50 were searched again by a database of phage, Pseudomonas aeruginosa, plus contaminants. Decoys were generated by pseudo-inversion. Percolator was used for FDR-control at 1% PSM.

Protein–protein interaction prediction from SEC-MS data

DIA-NN reports were filtered at 1% library Q-value, and to infer protein quantities, the top2 peptides yielding the highest intra-protein correlation were averaged (sibling peptide correlation strategy) across replicates for each condition. This step was performed across all samples to ensure the same peptides were used for every replicate and condition. The raw MS2 profiles were smoothed using a Savitzky–Golay filter and rescaled in a 0–1 range. A dot product matrix between all proteins was calculated, and proteins showing r² ≥ 0.3 were selected as putative interactors for prediction. For every pair, we calculated 5 features: (i) sliding window (q = 6) correlation, (ii) fraction-wide intensity difference, (iii) peak shift, (iv) Euclidean distance, and (v) contrast angle dot-product.

For prediction, we utilized a fully-connected neural network implemented in Tensorflow v2.12.0 (https://www.tensorflow.org). Briefly, we set the input layer as a number of features (147), followed by a fully connected layer with 100 neurons and a dropout layer (0.2%), and a fully connected layer with 72 neurons. A final output layer using sigmoid as an activation function was used for classifying co-eluting and not-coeluting proteins. For training, a previously reported dataset was used³². To select for positive, we utilized protein pairs in STRING using a combined score of 0.9 and experimental evidence, while negative were randomly selected. The DNN model was trained for 100 epochs using ADAM (learning rate = 0.001) and binary cross-entropy as a loss function. Early stopping (patience = 20) was utilized to avoid overfitting. To further removed spuriously co-eluting PPIs after the prediction step, we calculated an equal number of decoy PPIs by randomly sampling the remaining proteins and utilizing the DNN model to predict their coelution probability. We then utilized these two distributions to perform target-decoy competition using posterior probabilities as we previously described²¹ at 5% FDR, and the final interaction table was further filtered using a combined interaction probability of ≥0.75.

ORFs prediction from nucleotide FASTA

EMBOSS v6.6.0.0 subroutine getorf was used to predict open reading frames (ORFs) with a minimum size of 50 AA. Existing annotated genes were removed from the predicted ORFs using bedtools subroutine subtract, allowing us to differentiate between existing and novel ORFs.

Structural prediction and alignment for ϕPA3 vRNAp

Protein complex prediction was performed using AlphaFold 2 (https://github.com/deepmind/alphafold). AF2 was run with full database size and the multimer preset. OpenMM energy minimization was performed to generate relaxed models, and 5 models per complex were generated. Models were ranked by ipTM + TM, and the PAE and LDDT were extracted for visualization. Each complex was submitted as a FASTA file, with proteins ordered from the longest to the shortest sequence. The alignment was performed using US-Align⁵³ (https://zhanggroup.org/US-align/), and the oligomer option was selected. Alignments of predicted complex structures (ϕKZ vRNAp and 4 proteins ϕPA3 vRNAp) were performed by multiple structure alignment (MSTA) using US-align with default parameters, and a TM-cutoff of 0.45 was used to estimate topological similarities between the two structures. For visualization purposes, the structure of vRNAp (70GR) without PHIKZ123, which lacked homolog identification in ϕPA3, was used as a template in MatchMaker.

Fluorescent microscopy of putative shell components

0.8% agar pads were supplemented with 0.5 μg/mL DAPI for phage DNA staining P. aeruginosa strain PAO1 expressing each of the fluorescent shell candidate constructs was grown in liquid culture supplemented with up to 0.05% arabinose (depending on optimal conditions for each construct) to induce construct expression until an OD of 0.5, and subsequently infected with ϕKZ lysate for 50 min at 30 °C before imaging. Microscopy was performed on an inverted epifluorescence (Ti2-E, Nikon, Tokyo, Japan) equipped with a Photometrics Prime 95B 25-mm camera and the Perfect Focus System (PFS). Images were acquired using Nikon Elements AR software (version 5.02.00). Cells were imaged through channels of phase contrast (200 ms exposure, for cell recognition), blue (DAPI, 50 ms exposure, for phage DNA), and green (GFP, 200 ms exposure, for mNeonGreen constructs) at 100× objective magnification. Final figure images were prepared in Fiji (version 2.1.0/1.53c)⁶⁴.

Generation of ϕKZ particles packaged with 3xFLAG fusions of ϕKZ virion proteins

ϕKZ particles packaged with virion proteins bearing a C-terminal 3xFLAG-tag were generated by adapting a protocol used to generate ϕKZ particles packaged with mNeonGreen-tagged inner body proteins^19,65. PAO1 cells transformed with the appropriate pHERD30T − (PHIKZxxx) − 3xFLAG construct were grown overnight in 3 mL LB supplemented with gentamicin (50 μg/ml) at 37 °C with aeration at 175 rpm. Cells were diluted 1:100 from a saturated overnight culture into 5 mL LB supplemented with MgSO₄ (10 mM) and Gentamicin (50 μg/ml) and grown for ≈2.5 h at 37 °C with aeration at 175 rpm. At OD600 nm = 0.5–0.6 (3E8 CFU/mL), the bacterial cultures were infected with ϕKZ (WT, MOI ≈ 1) for 2.5 h. Thereafter 1 mL of chloroform was added to the cultures in a fume-hood, and the cultures were incubated with chloroform for 15 min (37 °C, 175 rpm). The cell cultures were transferred to 15 mL falcon tubes and centrifuged at 6000×g for 15 min to pellet bacterial debris. The supernatant (containing bacteriophages in high titer) was carefully transferred to a fresh set of 50 mL falcon tubes and centrifuged and 6000×g for 15 min to pellet any residual bacterial debris. Thereafter, 4 mL of the supernatant was filtered and concentrated (≈10×) using Amicon-100 centrifugal filters to remove excess 3xFLAG-tagged proteins. The concentrated supernatant was used for western blot experiments.

Western blot and blot analysis

PA01 cells were grown overnight in 3 mL LB at 37 °C with aeration at 175 rpm. Cells were diluted 1:100 from a saturated overnight culture into 5 mL LB with 10 mM MgSO₄ and grown for 2.5 h at 37 °C with aeration at 175 rpm. Upon reaching 0.5 OD (600 nm), gentamicin was added (50 μg/ml), and the cells were chilled on ice for 5 min to stall translation. described, and upon reaching 0.5 OD (600 nm), gentamicin was added (50 μg/ml), and the cells were chilled on ice for 5 min to stall translation. Thereafter PAO1 cells (≈1 OD equivalent) were infected with ϕKZ particles packaged with virion proteins bearing a C-terminal 3xFLAG-tag (MOI ≈ 1) on ice for 10 min (to allow complete adsorption of virions onto cells) and then incubated at 30 °C for 15 min. Thereafter, the cell cultures were transferred to pre-chilled 15 mL falcon tubes and centrifuged at 6000×g, 0 °C for 5 min. The supernatant was discarded, and the cell pellet was washed twice with 2 mL of pre-chilled (0 °C) LB to remove excess unbound virions. The cell pellet was lysed in 100 μL of lysis buffer (20 mM Tris, pH 7.5, 150 mM NaCl, 2% glycerol, 1% TTX-100, CompleteMini EDTA-free protease inhibitor cocktail). The lysed suspension was further sonicated on ice using a Q125 sonicator (10 pulses, 1 s ON, 1 s OFF, 30% amplitude). The cell lysate was centrifuged at 15,000×g (15 min, 0 °C) to remove cellular debris. The clarified cellular lysate (100 μL) was boiled with 33 μLL of 4× Laemmli Buffer (with Beta-mercaptoethanol) for 10 min. 14 μLL of lysate samples were loaded. For virion control samples, 10 μLL of purified virions were boiled with 3.3 μLL of 4× Laemmli Buffer (with Beta-mercaptoethanol) for 10 min, and 2 μLL of samples were loaded. SDS-PAGE gels were run with running buffer (100 mL 10× Tris-Glycine SDS Buffer, 900 mL Milli-Q water) at 130 V for 1 h (constant voltage setting). The SDS-PAGE gels were transferred onto 0.2 μM PVDF membranes using a wet transfer (Transfer Buffer: 100 mL 10× Tris-Glycine Buffer, 200 mL methanol, 700 mL Milli-Q water; 100 V, 1 hour, 4 °C). The membranes were incubated with blocking buffer (5% Omniblock milk, non-fat-dry in 1× TBST (200 mL Tris Buffer Saline, 0.20 mL Tween-20)) for 1 h at room temperature. Thereafter the blocking buffer was discarded, and the membranes were incubated with 1:1000 dilutions of mouse anti-FLAG M2 antibody (Sigma-Aldrich) in 1× TBST (overnight, 4 °C, with constant shaking). Thereafter the membranes were washed thrice for 10 min with TBST and incubated with 1:3000 dilution of Goat anti-mouse HRP (Ref: 62-6520; Lot: XD347166 Invitrogen) in blocking buffer for 1 h at room temperature with constant shaking. Finally, the membranes were washed thrice for 10 min with TBST and incubated with Clarity Western ECL substrate. Membranes were imaged on an Azure 500 imager.

Statistics and reproducibility

No statistical method was used to predetermine sample size, no single fraction or replicate was excluded from the final analysis, and each SEC-MS was not randomized to avoid MS signal carry-over and increase reproducibility.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

The supporting MS data is available via Massive with the identifier MSV000091715. The PhageMap database is freely accessible at https://phagemap.ucsf.edu/. The Alphafold2 predicted structures are available on GitHub at https://github.com/anfoss/Phage_data. The source data is included in this publication. Source data are provided with this paper.

Code availability

The utilized software for prediction of PPIs from SEC data is freely available at https://github.com/anfoss/PPIprophet/ and at Zenodo at https://doi.org/10.5281/zenodo.8161692.

References

Shah, P. S. et al. Comparative flavivirus-host protein interaction mapping reveals mechanisms of dengue and zika virus pathogenesis. Cell 175, 1931–1945.e18 (2018).
PubMed PubMed Central Google Scholar
Hiatt, J. et al. A functional map of HIV-host interactions in primary human T cells. Nat. Commun. 13, 1752 (2022).
ADS CAS PubMed PubMed Central Google Scholar
Eckhardt, M., Hultquist, J. F., Kaake, R. M., Hüttenhain, R. & Krogan, N. J. A systems approach to infectious disease. Nat. Rev. Genet. 21, 339–354 (2020).
CAS PubMed PubMed Central Google Scholar
Batra, J. et al. Protein interaction mapping identifies RBBP6 as a negative regulator of Ebola virus replication. Cell 175, 1917–1930.e13 (2018).
PubMed PubMed Central Google Scholar
Hashimoto, Y., Sheng, X., Murray-Nerger, L. A. & Cristea, I. M. Temporal dynamics of protein complex formation and dissociation during human cytomegalovirus infection. Nat. Commun. 11, 806 (2020).
ADS CAS PubMed PubMed Central Google Scholar
Gordon, D. E. et al. A SARS-CoV-2 protein interaction map reveals targets for drug repurposing. Nature. https://doi.org/10.1038/s41586-020-2286-9 (2020).
Stukalov, A. et al. Multilevel proteomics reveals host perturbations by SARS-CoV-2 and SARS-CoV. Nature 594, 246–252 (2021).
ADS CAS PubMed Google Scholar
Meyers, J. M. et al. The proximal proteome of 17 SARS-CoV-2 proteins links to disrupted antiviral signaling and host translation. PLoS Pathog. 17, 1–30 (2021).
MathSciNet Google Scholar
Luo, Y. et al. HIV-host interactome revealed directly from infected cells. Nat. Microbiol. https://doi.org/10.1038/nmicrobiol.2016.68 (2016).
Jäger, S. et al. Global landscape of HIV-human protein complexes. Nature 481, 365–370 (2012).
ADS Google Scholar
Dadgostar, P. Antimicrobial resistance: implications and costs. Infect. Drug Resist. 12, 3903–3910 (2019).
CAS PubMed PubMed Central Google Scholar
Fossati, A. et al. PCprophet: a framework for protein complex prediction and differential analysis using proteomic data. Nat. Methods https://doi.org/10.1038/s41592-021-01107-5 (2021).
Krylov, V. et al. Phage phikz—the first of giants. Viruses 13, 1–18 (2021).
Google Scholar
Monson, R., Foulds, I., Foweraker, J., Welch, M. & Salmond, G. P. The Pseudomonas aeruginosa generalized transducing phage ϕPA3 is a new member of the ϕKZ-like group of ‘jumbo’ phages, and infects model laboratory strains and clinical isolates from cystic fibrosis patients. Microbiology 157, 859–867 (2011).
CAS PubMed Google Scholar
Mendoza, S. D. et al. A bacteriophage nucleus-like compartment shields DNA from CRISPR nucleases. Nature 577, 244–248 (2020).
ADS CAS PubMed Google Scholar
Malone, L. M. et al. A jumbo phage that forms a nucleus-like structure evades CRISPR-Cas DNA targeting but is vulnerable to type III RNA-based immunity. Nat. Microbiol. 5, 48–55 (2020).
CAS PubMed Google Scholar
Chaikeeratisak, V., Birkholz, E. A. & Pogliano, J. The phage nucleus and PhuZ Spindle: defining features of the subcellular organization and speciation of nucleus-forming jumbo phages. Front. Microbiol. 12, 1–8 (2021).
Google Scholar
Chaikeeratisak, V. et al. Subcellular organization of viral particles during maturation of nucleus-forming jumbo phage. Sci. Adv. 8, 8–9 (2022).
Google Scholar
Li, Y. et al. A family of novel immune systems targets early infection of nucleus-forming jumbo phages. Preprint at bioRxiv https://doi.org/10.1101/2022.09.17.508391 (2022).
Fossati, A. et al. System-wide profiling of protein complexes via size exclusion chromatography-mass spectrometry (SEC-MS). Methods Mol. Biol. (Clifton, N. J.) 2259, 269–294 (2021).
CAS Google Scholar
Frommelt, F. et al. DIP-MS: A novel ultra-deep interaction proteomics for the deconvolution of protein complexes. Preprint at bioRxiv https://doi.org/10.1101/2023.03.22.533843 (2023).
Skinnider, M. A. & Foster, L. J. Meta-analysis defines principles for the design and analysis of co-fractionation mass spectrometry experiments. Nat. Methods 18, 806–815 (2021).
CAS PubMed Google Scholar
Caufield, J. H., Abreu, M., Wimble, C. & Uetz, P. Protein complexes in bacteria. PLOS Comput. Biol. 11, 1–23 (2015).
Google Scholar
Lawrence, J. G. Shared strategies in gene organization among prokaryotes and eukaryotes. Cell 110, 407–413 (2002).
CAS PubMed Google Scholar
Qiao, Z. et al. Cryo-EM structure of the entire FtsH-HflKC AAA protease complex. Cell Rep. 39, 110890 (2022).
CAS PubMed Google Scholar
Jeruzalmi, D., O’Donnell, M. & Kuriyan, J. Crystal structure of the processivity clamp loader gamma (γ) complex of E. coli DNA polymerase III. Cell 106, 429–441 (2001).
CAS PubMed Google Scholar
Sutherland, I. W., Hughes, K. A., Skillman, L. C. & Tait, K. The interaction of phage and biofilms. FEMS Microbiol. Lett. 232, 1–6 (2004).
CAS PubMed Google Scholar
Silpe, J. E. & Bassler, B. L. A host-produced quorum-sensing autoinducer controls a phage lysis-lysogeny decision. Cell 176, 268–280.e13 (2019).
PubMed Google Scholar
Wu, H., Wang, D., Tang, M. & Ma, L. Z. The advance of assembly of exopolysaccharide Psl biosynthesis machinery in Pseudomonas aeruginosa. MicrobiologyOpen 8, e857 (2019).
CAS PubMed PubMed Central Google Scholar
Andrésen, C. et al. Critical biophysical properties in the Pseudomonas aeruginosa efflux gene regulator MexR are targeted by mutations conferring multidrug resistance. Protein Sci. 19, 680–692 (2010).
PubMed PubMed Central Google Scholar
Chan, B. K. et al. Phage selection restores antibiotic sensitivity in MDR Pseudomonas aeruginosa. Sci. Rep. 6, 26717 (2016).
ADS CAS PubMed PubMed Central Google Scholar
Salas, D., Stacey, R. G., Akinlaja, M. & Foster, L. J. Next-generation interactomics: considerations for the use of co-elution to measure protein interaction networks. Mol. Cell. Proteom. 19, 1–10 (2020).
CAS Google Scholar
Havugimana, P. C. et al. A census of human soluble protein complexes. Cell 150, 1068–1081 (2012).
CAS PubMed PubMed Central Google Scholar
Rajagopala, S. V., Casjens, S. & Uetz, P. The protein interaction map of bacteriophage lambda. BMC Microbiol. 11, 213 (2011).
CAS PubMed PubMed Central Google Scholar
Yakunina, M. et al. A non-canonical multisubunit RNA polymerase encoded by a giant bacteriophage. Nucleic Acids Res. 43, 10411–10420 (2015).
CAS PubMed PubMed Central Google Scholar
Ceyssens, P.-J. et al. Development of giant bacteriophage ϕKZ is independent of the host transcription apparatus. J. Virol. 88, 10501–10510 (2014).
PubMed PubMed Central Google Scholar
Van den Bossche, A. et al. Structural elucidation of a novel mechanism for the bacteriophage-based inhibition of the RNA degradosome. eLife 5, 1–20 (2016).
Google Scholar
Laughlin, T. G. et al. Architecture and self-assembly of the jumbo bacteriophage nuclear shell. Nature 608, 429–435 (2022).
ADS CAS PubMed PubMed Central Google Scholar
Nieweglowska, E. S. et al. The ϕPA3 phage nucleus is enclosed by a self-assembling 2D crystalline lattice. Nat. Commun. 14, 927 (2023).
ADS CAS PubMed PubMed Central Google Scholar
Chaikeeratisak, V. et al. Assembly of a nucleus-like structure during viral replication in bacteria. Science 355, 194–197 (2017).
ADS CAS PubMed PubMed Central Google Scholar
Chaikeeratisak, V. et al. The phage nucleus and tubulin spindle are conserved among large pseudomonas phages. Cell Rep. 20, 1563–1571 (2017).
CAS PubMed PubMed Central Google Scholar
Gerovac, M. et al. Immediate targeting of host ribosomes by jumbo phage encoded proteins. Preprint at bioRxiv http://biorxiv.org/content/early/2023/02/26/2023.02.26.530069.abstract. https://doi.org/10.1101/2023.02.26.530069 (2023).
Lenz, S. et al. Reliable identification of protein-protein interactions by crosslinking mass spectrometry. Nat. Commun. 12, 1–11 (2021).
ADS Google Scholar
Réblová, K., Sponer, J. & Lankas, F. Structure and mechanical properties of the ribosomal L1 stalk three-way junction. Nucleic Acids Res. 40, 6290–6303 (2012).
PubMed PubMed Central Google Scholar
Maruyama, K. et al. Switch of the interactions between the ribosomal stalk and EF1A in the GTP- and GDP-bound conformations. Sci. Rep. 9, 14761 (2019).
ADS PubMed PubMed Central Google Scholar
Häuser, R. et al. RsfA (YbeB) proteins are conserved ribosomal silencing factors. PLoS Genet. 8, e1002815 (2012).
PubMed PubMed Central Google Scholar
Kramer, G. et al. L23 protein functions as a chaperone docking site on the ribosome. Nature 419, 171–174 (2002).
ADS CAS PubMed Google Scholar
Gillet, L. C. et al. Targeted data extraction of the MS/MS spectra generated by data-independent acquisition: A new concept for consistent and accurate proteome analysis. Mol. Cell. Proteom. 11, O111.016717 (2012).
Google Scholar
Orekhova, M., Koreshova, A., Artamonova, T., Khodorkovskii, M. & Yakunina, M. The study of the phiKZ phage non-canonical non-virion RNA polymerase. Biochem. Biophys. Res. Commun. 511, 759–764 (2019).
CAS PubMed Google Scholar
de Martín Garrido, N. et al. Structure of the bacteriophage PhiKZ non-virion RNA polymerase. Nucleic Acids Res. 49, 7732–7739 (2021).
PubMed Central Google Scholar
Enustun, E. et al. Identification of the bacteriophage nucleus protein interaction network. bioRxiv, 2023.05.18.541317. https://doi.org/10.1101/2023.05.18.541317 (2023).
Evans, R. et al. Protein complex prediction with AlphaFold-Multimer. Preprint at bioRxiv https://www.biorxiv.org/content/early/2021/10/04/2021.10.04.463034. https://doi.org/10.1101/2021.10.04.463034 (2021).
Zhang, C., Shine, M., Pyle, A. M. & Zhang, Y. US-align: universal structure alignments of proteins, nucleic acids, and macromolecular complexes. Nat. Methods https://doi.org/10.1038/s41592-022-01585-1 (2022).
Meng, E. C., Pettersen, E. F., Couch, G. S., Huang, C. C. & Ferrin, T. E. Tools for integrated sequence-structure analysis with UCSF Chimera. BMC Bioinforma. 7, 339 (2006).
Google Scholar
Thomas, J. A. et al. Extensive proteolysis of head and inner body proteins by a morphogenetic protease in the giant Pseudomonas aeruginosa phage ϕKZ. Mol. Microbiol. 84, 324–339 (2012).
CAS PubMed PubMed Central Google Scholar
Bailey, T. L. STREME: accurate and versatile sequence motif discovery. Bioinformatics 37, 2834–2840 (2021).
CAS PubMed PubMed Central Google Scholar
Wu, W., Thomas, J. A., Cheng, N., Black, L. W. & Steven, A. C. Bubblegrams reveal the inner body of bacteriophage ϕKZ. Science 335, 182 (2012).
ADS CAS PubMed PubMed Central Google Scholar
Fossati, A. et al. Toward comprehensive plasma proteomics by orthogonal protease digestion. J. Proteome Res. https://doi.org/10.1021/acs.jproteome.1c00357 (2021).
Meier, F. et al. diaPASEF: parallel accumulation-serial fragmentation combined with data-independent acquisition. Nat. Methods 17, 1229–1236 (2020).
CAS PubMed Google Scholar
Steigenberger, B. et al. Benefits of collisional cross section assisted precursor selection (caps-PASEF) for cross-linking mass spectrometry. Mol. Cell. Proteom. 19, 1677–1687 (2020).
CAS Google Scholar
Kong, A. T., Leprevost, F. V., Avtonomov, D. M., Mellacheruvu, D. & Nesvizhskii, A. I. MSFragger: ultrafast and comprehensive peptide identification in mass spectrometry-based proteomics. Nat. Methods 14, 513–520 (2017).
CAS PubMed PubMed Central Google Scholar
Demichev, V., Messner, C. B., Vernardis, S. I., Lilley, K. S. & Ralser, M. DIA-NN: neural networks and interference correction enable deep proteome coverage in high throughput. Nat. Methods 17, 41–44 (2020).
CAS PubMed Google Scholar
Mendes, M. L. et al. An integrated workflow for crosslinking mass spectrometry. Mol. Syst. Biol. 15, e8994 (2019).
CAS PubMed PubMed Central Google Scholar
Schindelin, J. et al. Fiji: an open-source platform for biological-image analysis. Nat. Methods 9, 676–682 (2012).
CAS PubMed Google Scholar
Guan, J. et al. Bacteriophage genome engineering with CRISPR-Cas13a. Nat. Microbiol. 7, 1956–1966 (2022).
CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This work was supported by an NIH grant 1R01AI167412 (J.B.D. and D.L.S.) and 1R01AI171041 (J.B.D. and D.A.A.). D.M. is supported by the NIH Ruth L. Kirschstein National Research Service Award 1F32GM149125-01. We thank Prof. James Wells at UCSF for the usage of the HPLC used to perform the size-exclusion experiments. We thank Natalie Whitis for assistance with operating the SPEX-freezer cryo-mill. Molecular graphics were performed with UCSF ChimeraX, developed by the Resource for Biocomputing, Visualization, and Informatics at the University of California, San Francisco, with support from the National Institutes of Health R01-GM129325 and the Office of Cyber Infrastructure and Computational Biology, National Institute of Allergy and Infectious Diseases. Figure 1A, B was prepared with Biorender.

Author information

These authors contributed equally: Andrea Fossati, Deepto Mozumdar.

Authors and Affiliations

J. David Gladstone Institutes, San Francisco, 94158, CA, USA
Andrea Fossati, Adrian Pelin, Nevan J. Krogan & Danielle L. Swaney
Quantitative Biosciences Institute (QBI), University of California San Francisco, San Francisco, 94158, CA, USA
Andrea Fossati, Adrian Pelin, Nevan J. Krogan & Danielle L. Swaney
Department of Cellular and Molecular Pharmacology, University of California San Francisco, San Francisco, 94158, CA, USA
Andrea Fossati, Adrian Pelin, Nevan J. Krogan & Danielle L. Swaney
Department of Immunology and Microbiology, University of California San Francisco, San Francisco, 94158, CA, USA
Deepto Mozumdar, Claire Kokontis, Yuping Li, Baron Guo & Joseph Bondy-Denomy
Department of Biochemistry, University of California San Francisco, San Francisco, 94143, CA, USA
Melissa Mèndez-Moran, Eliza Nieweglowska & David A. Agard

Authors

Andrea Fossati
View author publications
You can also search for this author in PubMed Google Scholar
Deepto Mozumdar
View author publications
You can also search for this author in PubMed Google Scholar
Claire Kokontis
View author publications
You can also search for this author in PubMed Google Scholar
Melissa Mèndez-Moran
View author publications
You can also search for this author in PubMed Google Scholar
Eliza Nieweglowska
View author publications
You can also search for this author in PubMed Google Scholar
Adrian Pelin
View author publications
You can also search for this author in PubMed Google Scholar
Yuping Li
View author publications
You can also search for this author in PubMed Google Scholar
Baron Guo
View author publications
You can also search for this author in PubMed Google Scholar
Nevan J. Krogan
View author publications
You can also search for this author in PubMed Google Scholar
David A. Agard
View author publications
You can also search for this author in PubMed Google Scholar
Joseph Bondy-Denomy
View author publications
You can also search for this author in PubMed Google Scholar
Danielle L. Swaney
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.F.: Performed proteomics sample preparation and analysis of all MS data, developed PhageMAP, and wrote the paper. D.L.S., J.B.D., D.A., N.K.: Conceptualization, supervision, writing, and funding acquisition. A.P.: Novel ORF prediction. D.M., C.K., B.G.: Phage infection experiments, virion enrichment, microscopy, and W.B. for injected proteins. E.N. and M.M.: Performed shell enrichment. Y.L.: Critical input in revising the paper. All co-authors contributed to reviewing and editing the paper.

Corresponding authors

Correspondence to Joseph Bondy-Denomy or Danielle L. Swaney.

Ethics declarations

Competing interests

The Krogan Laboratory has received research support from Vir Biotechnology, F. Hoffmann-La Roche, and Rezo Therapeutics. Nevan Krogan has previously held financially compensated consulting agreements with the Icahn School of Medicine at Mount Sinai, New York, and Twist Bioscience Corp. He currently has financially compensated consulting agreements with Maze Therapeutics, Interline Therapeutics, Rezo Therapeutics, and GEn1E Lifesciences, Inc. He is on the Board of Directors of Rezo Therapeutics and is a shareholder in Tenaya Therapeutics, Maze Therapeutics, Rezo Therapeutics, and Interline Therapeutics. D.L.S. has financially compensated consulting agreements with Maze Therapeutics and Rezo Therapeutics. The other authors declare no competing interests

Peer review

Peer review information

Nature Communications thanks Ben Collins, Nachimuthu Ramesh, and the other anonymous reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Description of Additional Supplementary Files

Supplementary Movie 1

Supplementary Movie 2

Reporting Summary

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Fossati, A., Mozumdar, D., Kokontis, C. et al. Next-generation proteomics for quantitative Jumbophage-bacteria interaction mapping. Nat Commun 14, 5156 (2023). https://doi.org/10.1038/s41467-023-40724-w

Download citation

Received: 13 March 2023
Accepted: 07 August 2023
Published: 24 August 2023
DOI: https://doi.org/10.1038/s41467-023-40724-w

This article is cited by

Phage proteins target and co-opt host ribosomes immediately upon infection
- Milan Gerovac
- Kotaro Chihara
- Jörg Vogel
Nature Microbiology (2024)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.