Mating pair stabilization mediates bacterial conjugation species specificity

Wen Wen Low, Joshua L. C. Wong et al.

Bacterial conjugation mediates contact-dependent transfer of DNA from donor to recipient bacteria, thus facilitating the spread of virulence and resistance plasmids. Here we describe how variants of the plasmid-encoded donor outer membrane (OM) protein TraN cooperate with distinct OM receptors in recipients to mediate mating pair stabilization and efficient DNA transfer. We show that TraN from the plasmid pKpQIL (Klebsiella pneumoniae) interacts with OmpK36, TraN from R100-1 (Shigella flexneri) and pSLT (Salmonella Typhimurium) interacts with OmpW, and TraN from the prototypical F plasmid (Escherichia coli) interacts with OmpA. Cryo-EM analysis revealed that TraN pKpQIL interacts with OmpK36 through the insertion of a β-hairpin in the tip of TraN into a monomer of the OmpK36 porin trimer. Combining bioinformatic analysis with AlphaFold structural predictions, we identified a fourth TraN structural variant that mediates mating pair stabilization by binding OmpF. Accordingly, we devised a classification scheme for TraN homologues on the basis of structural similarity and their associated receptors: TraNα (OmpW), TraNβ (OmpK36), TraNγ (OmpA), TraNδ (OmpF). These TraN-OM receptor pairings have real-world implications as they reflect the distribution of resistance plasmids within clinical Enterobacteriaceae isolates, demonstrating the importance of mating pair stabilization in mediating conjugation species specificity. These findings will allow us to predict the distribution of emerging resistance plasmids in high-risk bacterial pathogens.

Conjugative transfer of DNA involves a type IV secretion system (T4SS), the relaxosome and a conjugative pilus 1 . In the prevailing model of conjugation, the pilus extends off the surface of the donor and establishes contact with a recipient 2 . It then retracts 3 , drawing the recipient towards the donor and leading to the formation of a tight mating junction through a process termed 'mating pair stabilization' (MPS) 4 . Recently, important advances have been made in elucidating the structure of the T4SS and conjugative pilus from several incompatibility group F (IncF) plasmids 5-7 . However, the mechanism underpinning MPS remains relatively unclear.
OmpA dependency was associated with the plasmid-encoded donor subunit TraN as substitution of traN from the F plasmid with traN from R100-1 abrogated the conjugation deficiency 16 . TraN is an OM protein containing 22 conserved cysteine residues and is involved in the formation of mating aggregates during MPS 2,17,18 .
The N-terminal domain, specifically, mediates specificity for recipient OmpA. Despite compelling genetic evidence that TraN cooperates with OmpA to mediate MPS, attempts to isolate a TraN-OmpA complex were unsuccessful 17 .
While IncF plasmid conjugation has been extensively studied in E. coli K12, few studies have investigated conjugation of contemporary antibiotic resistance plasmids found in clinically relevant pathogens. Klebsiella pneumoniae is a Gram-negative pathogen frequently associated with antimicrobial resistance 19 . Carbapenem-resistant K. pneumoniae (CRKP) isolates often carry carbapenemase-encoding plasmids, including the K. pneumoniae carbapenemase (KPC)-encoding plasmid pKpQIL, which expresses conjugation machinery similar to that found on F and R100-1 20,21 . pKpQIL is particularly closely associated with clinical isolates of the globally pervasive K. pneumoniae sequence type ST258 22,23 . While temperature and mating substrate moderately affect conjugative transfer 24 , the role of recipient cells during pKpQIL conjugation is unclear.
In addition to resistance plasmids, mutations affecting the K. pneumoniae OM trimeric channel-forming porins OmpK35 and OmpK36 (homologues of OmpF and OmpC in E. coli, respectively) also contribute to reduced carbapenem susceptibility 25 . The selectivity of the porins is conferred by loop 3 (L3) which extends into the barrel of the porin, forming a constriction point within the channel 26 . In ST258 CRKP isolates, the introduction of a premature stop codon in ompK35 is ubiquitous, leading to loss of functional OmpK35 27 . Moreover, many ST258 isolates express OmpK36 variants containing L3 insertions that are not present in OmpK36 from the reference laboratory strain ATCC 43816, a derivative of which serves as the wild type (WT) strain in this work 28 . We showed that a glycine-aspartic acid (GD) ST258-associated L3 insertion constricts the channel by 26%, reducing antibiotic diffusion across the OM 29 .
pKpQIL conjugation is dependent on recipient OmpK36. The aim of this study was to determine whether ST258-associated porins affect pKpQIL conjugation. As ompK35 is a pseudogene in these isolates, we assessed the role of OmpK36 in the absence of OmpK35 expression (Supplementary Table 1). We generated a reporter pKpQIL called pKpGFP (Fig. 1a), which expresses superfolder green fluorescent protein (sfGFP) under control of a lac promoter (Plac), allowing for quantification of conjugation frequency by selective plating or fluorescence assays. We then engineered a K. pneumoniae donor that constitutively expresses LacI for repression of sfGFP expression. pKpGFP was transferred into this strain, which we named GFP donor (GFP-D) (Supplementary Table 1). Upon conjugative transfer of pKpGFP from GFP-D into a K. pneumoniae recipient that does not constitutively express LacI, sfGFP is expressed (Fig. 1b).
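The logic of the reporter described above can be written out as a small truth table. This is only an idealised sketch: it assumes that LacI repression of Plac is complete and that sfGFP is made whenever the plasmid is present without LacI. The function name `sfgfp_expressed` and the cell labels are hypothetical and chosen for illustration.

```python
def sfgfp_expressed(carries_pkpgfp: bool, expresses_laci: bool) -> bool:
    """Idealised reporter logic: sfGFP needs the plasmid and no LacI repressor."""
    return carries_pkpgfp and not expresses_laci


# (carries pKpGFP, constitutively expresses LacI) for each cell type in the assay
cell_types = {
    "GFP-D donor": (True, True),
    "recipient before transfer": (False, False),
    "transconjugant": (True, False),
}

for name, (plasmid, laci) in cell_types.items():
    print(f"{name}: fluorescent = {sfgfp_expressed(plasmid, laci)}")
```

Under these assumptions only transconjugants fluoresce, which is what allows fluorescence to serve as a readout of plasmid transfer.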
We compared the conjugation efficiency of pKpGFP from GFP-D into recipients expressing WT OmpK36 (36 WT ), ST258 OmpK36 containing the L3 GD insertion (36 ST258 ) or missing ompK36 (ΔompK36). A similar reduction in pKpGFP conjugation frequency into 36 ST258 and ΔompK36 recipients was observed compared with a recipient expressing 36 WT (Fig. 1c). Next we determined whether the impact of 36 ST258 on pKpGFP conjugation is restricted to the recipient by comparing plasmid transfer between GFP-D expressing either 36 WT or 36 ST258 . Both donors exhibited similar conjugation frequencies (Extended Data Fig. 1a), suggesting that 36 ST258 impairs conjugation when expressed specifically in the recipient.
We next determined the mechanism by which 36 ST258 reduces conjugative uptake of pKpGFP. K. pneumoniae expressing 36 ST258 are less susceptible to carbapenems than those expressing 36 WT 25,29 , which could confound quantification by selective plating on carbapenem-containing medium. To overcome this, we used pKpGFP to develop a high-throughput, real-time conjugation system (RTCS) assay to quantify conjugation frequency in the absence of selective pressure by measuring temporal fluorescence emission from conjugation mixtures. However, using the GFP-D reporter in RTCS, we did not detect an increase in fluorescence even with K. pneumoniae recipients expressing OmpK36 WT . This suggested that RTCS was not sensitive enough to detect transfer of pKpGFP (Fig. 1c). To increase conjugation efficiency, we deleted the conjugation inhibitor gene finO from pKpGFP to generate a derepressed variant of pKpGFP, pKpGFP-D (Fig. 1d) 30 . pKpGFP-D was transferred into the strain overexpressing LacI to generate the GFP-derepressed donor (GFP-DD).
We stained GFP-D and GFP-DD with rat polyclonal antiserum raised against the intact conjugation pilus. Only GFP-DD bacteria were piliated, which confirmed that deleting finO results in derepression of transfer genes (Fig. 1e). Transfer of pKpGFP-D into recipients was also observed by live microscopy (Supplementary Video). Comparing the conjugation frequency of pKpGFP and pKpGFP-D revealed an overall 2-log-fold increase in transfer of pKpGFP-D (Extended Data Fig. 1b). Importantly, the relative difference in conjugation frequency into recipients expressing 36 WT and 36 ST258 seen using GFP-D was maintained using GFP-DD. When total fluorescence outputs from conjugation mixtures containing GFP-DD were measured in RTCS, an increase in fluorescence was detected as early as 150 min for recipients expressing 36 WT but not 36 ST258 (Fig. 1f).
We used RTCS to determine whether the L3 GD insertion present in 36 ST258 affects conjugative uptake of pKpGFP. Deleting the GD insertion (36 ST258ΔGD ) led to a significant increase in a.f.u. compared with those expressing 36 ST258 (Fig. 1g). This effect could be reconstituted by introducing the GD insertion into OmpK36 WT (36 WT+GD ) and was validated by selective plating (Extended Data Fig. 1c). These results suggest that the GD insertion reduces conjugative uptake of pKpGFP-D, potentially due to pore constriction 29 .
TraN homologues cooperate with distinct recipient OM proteins. As E. coli OmpA dependency in F plasmid conjugation could be circumvented by substituting its traN with traN from R100-1 16 , we hypothesized that OmpK36 dependency seen in pKpQIL is similarly mediated by TraN. We first investigated whether R100-1 conjugation is OmpK36 dependent and observed no significant difference in transfer into K. pneumoniae recipients expressing either 36 WT or 36 WT+GD (Extended Data Fig. 2a). We then substituted traN from both pKpGFP and pKpGFP-D with traN of R100-1 (traN R100-1 ); immunofluorescence staining showed that donor cells carrying pKpGFP-DtraN R100-1 (GFP-DDtraN R100-1 ) are piliated (Fig. 2a). Using GFP-DDtraN R100-1 in RTCS revealed that conjugation was no longer affected by the GD insertion in OmpK36 (Fig. 2b and Extended Data Fig. 3a). This was validated by selective plating (Extended Data Fig. 2b). These results show that OmpK36 dependency during conjugation is plasmid specific and mediated by TraN.
We hypothesized that TraN R100-1 cooperates with another OM protein in K. pneumoniae and identified PhoE 31 and OmpW 32 , which are conserved in Enterobacteriaceae species, as candidates and generated ΔphoE and ΔompW mutants. Using RTCS to assess plasmid transfer into ΔphoE and ΔompW alongside ΔompK36 and ΔompA recipients revealed significantly lower conjugation frequency specifically into the ΔompW recipient (Fig. 2c and Extended Data Fig. 3b). To determine which domain of TraN mediates specificity, we generated a chimaeric traN (traN Ch1 ) by substituting amino acid residues from the variable N-terminal domain of TraN pKpQIL with corresponding residues from TraN R100-1 (Extended Data Fig. 4a). Donor cells expressing traN Ch1 (GFP-DDtraN Ch1 ) were piliated (Fig. 2d) and facilitated efficient conjugation into the ΔompK36 but not ΔompW recipients (Fig. 2e and Extended Data Figs. 3c and 4b). These results show that TraN mediates efficient conjugation via specific pairings of TraN R100-1 with OmpW and TraN pKpQIL with OmpK36, with specificity conferred by the variable N-terminal domain.
We next sought to validate that TraN from F mediates dependency on OmpA. We substituted traN from pKpGFP and pKpGFP-D with traN from F (traN F ) to generate pKpGFPtraN F and pKpGFP-DtraN F , respectively. Donors carrying pKpGFP-DtraN F (GFP-DDtraN F ) were piliated (Fig. 2f); however, we observed no increase in fluorescence with the WT recipient compared with the GFP-DD donor (Extended Data Fig. 3d). Nevertheless, using donors carrying pKpGFPtraN F (GFP-DtraN F ) in selection-based assays (which are more sensitive than RTCS) with WT K. pneumoniae recipients, we detected transconjugants, confirming the expression of functional conjugation machinery (Fig. 2g). However, compared with donor strains expressing either TraN pKpQIL or TraN R100-1 , the conjugation frequency associated with the TraN F -expressing donor was significantly lower. As OmpA was first proposed as a receptor for TraN F in E. coli, we hypothesized that the K. pneumoniae homologue of OmpA is not recognized by TraN F . Therefore, we repeated the experiment using E. coli MG1655 recipients. We observed that conjugation of pKpGFP expressing either TraN F or TraN R100-1 was significantly higher than that of the TraN pKpQIL -expressing plasmid (Fig. 2h). This suggests that TraN F , but not TraN pKpQIL , cooperates with an E. coli OM protein. In contrast, TraN R100-1 facilitates efficient conjugation into both K. pneumoniae and E. coli, which suggests that it recognizes OmpW in both bacterial species 32 . TraN F was confirmed to cooperate with E. coli OmpA as we observed a significant decrease in conjugation frequency of pKpGFPtraN F into an MG1655ΔompA compared with the WT recipient (Fig. 2i and Extended Data Fig. 3e).
To understand how the different TraN variants mediate OM receptor specificity, we used AlphaFold 33 to generate predicted structures for TraN pKpQIL , TraN R100-1 and TraN F (Extended Data Fig. 5a). The predicted models show a high overall predicted local-distance difference test (pLDDT) score, providing confidence in the structural variation seen between the different alleles. Each variant appears to contain an amphipathic alpha-helix that can potentially anchor it to the outer leaflet of the OM. The overall structure contains an extended N-terminal domain consisting mostly of β-sheets linked to a β-sandwich domain that we refer to as the 'tip', while the C-terminal domain is a mix of α-helices and β-sheets that fold back and form intradomain contacts with the N-terminal domain. From the predicted structures, all cysteine residues in each TraN variant could be engaged in intramolecular disulfide bonds (Extended Data Fig. 5b and Supplementary Table 7). Interestingly, structural differences are mainly seen in the 'tip' region of the protein, which corresponds to the variable region of the TraN sequences, consistent with their OM specificity observed during functional conjugation assays (Fig. 2j).

TraN pKpQIL forms a complex with OmpK36.
To determine the molecular basis of TraN pKpQIL -mediated conjugation dependency on OmpK36, we purified TraN pKpQIL (68 kDa) and OmpK36 WT (~120 kDa (trimer)) and tested their ability to form a complex by size exclusion chromatography (SEC). A clear shift in the retention volume of the TraN-OmpK36 sample (12.8 ml) was observed compared with the individual retention volumes of TraN pKpQIL (14.5 ml) and OmpK36 (14.9 ml), which indicates the formation of a stable TraN-OmpK36 complex (Extended Data Fig. 6a). Complex formation was confirmed by SDS-PAGE analysis (Extended Data Fig. 6b). In contrast, no shift was observed when TraN was mixed with OmpK36 WT+GD (Extended Data Fig. 6c), suggesting that the GD insertion impairs complex formation.
The structural basis of the TraN-OmpK36 interaction was assessed by cryo-electron microscopy (cryo-EM), which showed discrete TraN-OmpK36 complexes. However, TraN was mainly disordered and exhibited an occupancy of less than one molecule for each OmpK36 trimer. A three-dimensional (3D) reconstruction for the complex with an overall resolution of 2.6 Å using the map:map approach was generated with density for both the OmpK36 trimer and TraN (Fig. 3a and Extended Data Fig. 7). The crystal structure of the trimeric OmpK36 (PDB 6RD3) 29 was placed inside the density with minimal rebuilding, mostly amino acid rotamers and correcting for Ramachandran outliers on the basis of electron density and bond strains; the OmpK36 crystal and cryo-EM structures display a root-mean-square deviation (rmsd) of 0.6 Å over 480 Cα atoms. The additional density below OmpK36 was assigned to TraN (Fig. 3a and Extended Data Fig. 7) and it extends into the channel of one subunit of the trimeric porin. The TraN density is weak, present at a low threshold and featureless. Therefore, we decided not to build the TraN model in this density. The density inside the pore is better defined, showing a loop-shaped appearance with side chains; using the AlphaFold model for TraN pKpQIL , that density corresponds to the predicted β-hairpin of the TraN pKpQIL 'tip' (Figs. 2j and 3a). Further evidence that the density inside the pore corresponds to the TraN 'tip' was provided by generating an ab initio complex, which predicted a nearly identical OmpK36-TraN complex as the cryo-EM structure, with an rmsd of 0.45 Å over 480 Cα OmpK36 atoms and an rmsd of 1.2 Å for the 9 Cα β-hairpin atoms (Fig. 3b).
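For readers unfamiliar with the rmsd values quoted above, the quantity can be sketched as follows. This is a minimal illustration, not the procedure used in the study: it assumes the two sets of Cα coordinates are already superimposed and listed in the same atom order, and the toy coordinates are invented for the example.

```python
import math


def rmsd(coords_a, coords_b):
    """Root-mean-square deviation (same units as input) between paired 3D points.

    Assumes both structures are already superimposed and atoms are in matching order.
    """
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and of equal length")
    squared = sum(
        (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
        for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b)
    )
    return math.sqrt(squared / len(coords_a))


# Toy example (invented coordinates, in Å): every atom is displaced by 0.6 Å
# along x, so the rmsd equals 0.6 Å up to floating-point rounding.
reference = [(0.0, 0.0, 0.0), (1.5, 0.0, 0.0), (0.0, 3.8, 0.0)]
shifted = [(x + 0.6, y, z) for x, y, z in reference]
print(rmsd(reference, shifted))
```

In practice an optimal superposition step precedes this calculation; it is omitted here to keep the sketch short.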
TraN inserts into the OmpK36 pore from its extracellular side and reaches halfway through the channel to L3 (Fig. 3c). The binding/recognition of the TraN β-hairpin is mostly mediated by interactions with L3 of OmpK36. Structural comparison was performed to investigate the disruption in complex formation caused by the L3 GD insertion (Fig. 3d). Using the TraN-OmpK36 WT model as a reference, the structure for OmpK36 WT+GD (PDB 6RCK) 29 was superimposed onto the complex (Fig. 3d). This showed a clash between Gly115 and Asp114 from L3 of OmpK36, and Ser243' and Gly244' of the TraN β-hairpin (Fig. 3d). These clashes are anticipated to destabilize the OmpK36 and TraN interaction, lowering the affinity of TraN for OmpK36 WT+GD .

TraN-OMP interactions influence plasmid host distribution.
We next investigated the real-world implications of TraN-mediated species-specific transfer of plasmids by analysing TraN sequences from a dataset of plasmids retrieved from GenBank in 2018 34 . Using the Plascad tool for plasmid classification 34 and the NCBI Taxonomy database to determine the bacterial host family, we identified 824 predicted conjugative IncF plasmids from Enterobacteriaceae isolates (Supplementary Table 2). Using tBLASTn 35 , we found that of these plasmids, 265 (32.2%), 166 (20.1%) and 178 (21.6%) contained traN genes encoding proteins with ≥90% amino acid similarity to those found in pKpQIL, R100-1 and F, respectively. In total, these variants account for 74% of the 824 plasmids examined. Importantly, plasmids carrying a similar traN were found in a small number of species, including one single dominant species. Notably, 89.1% of traN pKpQIL plasmids are from K. pneumoniae, while 92.1% of traN F plasmids were found in E. coli (Extended Data Fig. 8a). These findings align with our in vitro data and suggest that TraN-mediated species specificity during conjugation may influence plasmid host range. Finally, while 72.9% of traN R100-1 plasmids were found in E. coli, a substantial proportion (16.9%) were recovered from K. pneumoniae isolates, supporting our findings that this TraN variant facilitates MPS in both bacterial species.
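The proportions quoted above follow directly from the reported counts; the short check below reproduces them. The counts and the total are taken from the text, while the variable names are illustrative only.

```python
# Counts reported in the text for predicted conjugative IncF plasmids
TOTAL_PLASMIDS = 824
TRAN_COUNTS = {"pKpQIL": 265, "R100-1": 166, "F": 178}


def percentage(count, total=TOTAL_PLASMIDS):
    """Share of the dataset, as a percentage rounded to one decimal place."""
    return round(100 * count / total, 1)


shares = {name: percentage(n) for name, n in TRAN_COUNTS.items()}
classified = sum(TRAN_COUNTS.values())      # plasmids with one of the three traN types
unclassified = TOTAL_PLASMIDS - classified  # plasmids left for further analysis

for name, share in shares.items():
    print(f"traN {name}: {share}%")
print(f"classified: {classified} ({round(100 * classified / TOTAL_PLASMIDS)}%)")
print(f"remaining: {unclassified}")
```

The three variants together cover 609 plasmids, which leaves the 215 plasmids examined in the next step of the analysis.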
Phylogenetic trees were constructed using the different traN nucleotide sequences and are available to view on Microreact: TraN pKpQIL , TraN R100-1 , TraN F . Among those with the same gene type, there was no correlation between the phylogeny and the plasmid host, suggesting that the plasmids were largely acquired via horizontal gene transfer as opposed to clonal expansion within a species following an initial conjugative event.
Fig. 2 | […] from R100-1 does not affect piliation (magenta) as seen by immunofluorescence microscopy of donor cells carrying pKpGFP-DtraN R100-1 (GFP-DDtraN R100-1 ). Scale bar, 2 μm. b, RTCS was used to assess the effect of the L3 GD insertion on pKpGFP-DtraN R100-1 conjugation. c, pKpGFP-DtraN R100-1 conjugation into a panel of isogenic recipients was assessed using RTCS. A.f.u. data are presented as mean ± s.d. of three biological repeats, analysed by repeated measures one-way ANOVA with Dunnett's multiple comparison test comparing to the OmpK36 ST258 -expressing recipient in (b) and to the ΔompK36 recipient in (c). d, Donor cells carrying pKpGFP-DtraN Ch1 are piliated. Scale bar, 2 μm. e, Recipient OM protein dependency during pKpGFP-DtraN Ch1 conjugation was determined with RTCS. A.f.u. data are presented as mean ± s.d. of three biological repeats, analysed by a two-sided paired t-test. f, The GFP-DtraN F donor is piliated. Scale bar, 2 μm. g,h, The log conjugation frequency of pKpGFP into WT K. pneumoniae (g) and E. coli MG1655 (h) recipients as a function of different TraN homologues was measured. i, log conjugation frequency of GFP-DtraN F into WT and ΔompA recipients was compared. log conjugation frequency data are presented as mean ± s.d. of three biological repeats, analysed by repeated measures ANOVA and Tukey's multiple comparison test in g and h, and by a two-sided paired t-test in i. j, Zoomed in cartoon representation of the tip region from each TraN variant showing a predicted β-hairpin structure that is unique to TraN pKpQIL (circled).

We analysed the remaining 215 plasmids for annotated traN sequences and identified 4 other variants found in at least 10 sequenced plasmids (Extended Data Fig. 8b). One variant, which aligned with traN from the S. Typhimurium virulence plasmid, pSLT (accession ID: AE006471.2), was found exclusively in Salmonella enterica, with varied distribution within serovars of this species. The 3 remaining variants were not associated with well-known plasmids. Thus, we assigned these plasmids to 1 of 3 'minor variant' groups (MV1-3) represented by traN sequences from NZ_CP016763.1 (MV1), AP014954.1 (MV2) and NZ_CP023348.1 (MV3). MV1 and MV2 plasmids did not appear to associate with a single dominant […] Fig. 9b). On the basis of these observations, we hypothesized that structurally similar tips recognize the same recipient OM proteins. Thus, we generated a chimaeric TraN containing the TraN pSLT tip (TraN Ch2 ), with functional studies revealing that it mediates OmpW dependency (Extended Data Fig. 10a,b). We next generated a third TraN chimaera (TraN Ch3 ) expressing the tip region from TraN MV1 . Donors carrying pKpGFP-DtraN Ch3 were piliated (Extended Data Fig. 10c).
Testing conjugation of pKpGFP-DtraN Ch3 into a panel of E. coli MG1655 recipients lacking OmpA, OmpC or OmpF revealed that TraN MV1 cooperates with OmpF (Extended Data Fig. 10d,e). These findings suggest that tip structure, rather than sequence similarity, determines TraN specificity for recipient OM proteins.
Accordingly, we classified the 7 identified variants on the basis of structural similarity into 4 groups denoted TraNα, TraNβ, TraNγ and TraNδ (Fig. 4a). Phylogenetic analysis of the TraN sequences showed clustering, which aligns with our classification system (Fig. 4b).
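The classification can be summarised as a simple lookup from TraN variant to structural group to recipient receptor. The sketch below includes only pairings stated in the text; the group membership of the MV2 and MV3 variants is not given here, so they are deliberately left out. The dictionary and function names are hypothetical.

```python
# Receptor associated with each structural group, as stated in the text
GROUP_RECEPTOR = {
    "TraNα": "OmpW",
    "TraNβ": "OmpK36",
    "TraNγ": "OmpA",
    "TraNδ": "OmpF",
}

# Variants whose receptor was established in the functional assays described above
VARIANT_GROUP = {
    "R100-1": "TraNα",
    "pSLT": "TraNα",
    "pKpQIL": "TraNβ",
    "F": "TraNγ",
    "MV1": "TraNδ",
}


def predicted_receptor(variant):
    """Return the recipient OM receptor for a TraN variant, or None if unassigned here."""
    group = VARIANT_GROUP.get(variant)
    return GROUP_RECEPTOR.get(group) if group else None


for variant in ("pKpQIL", "pSLT", "MV1", "MV2"):
    print(variant, "->", predicted_receptor(variant))
```

A lookup of this kind is only as good as the structural assignment behind it; a new variant would first need a predicted tip structure before it could be placed in a group.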
Conjugation facilitated by representative TraNs from each group into WT K. pneumoniae, E. coli, S. Typhimurium and E. cloacae recipients was assessed using RTCS (Fig. 5a) […] into all recipients tested, while TraNβ and TraNγ were associated with species-specific conjugation. In contrast, while plasmids with TraNδ showed a broad host range distribution, species-specific transfer was observed in vitro. This suggests that MPS-mediated conjugative transfer influences host distribution, although this effect may not be as apparent where the sample size of plasmids is small (that is, TraNδ). Phylogenetic analysis of the OM protein receptors from each species revealed clustering of homologues which appear to interact with TraN on the basis of the functional assays, except for homologues of OmpK36 which interact with TraNβ (Fig. 5b).

Discussion
Although TraN has previously been implicated in MPS, we have now elucidated the mechanism by which it mediates intimate contacts between conjugating bacteria. To the best of our knowledge, this is the first example of OM proteins in opposing cells cooperating to mediate bacterial-bacterial interactions. While MPS facilitates efficient conjugation, low-frequency transfer occurs in the absence of a suitable receptor and can be increased following transfer gene derepression. On the basis of this, we propose a model in which the pilus mediates a baseline level of conjugation in the absence of MPS (Fig. 6a). This form of transfer is dependent on the expression level of donor transfer genes and is probably less efficient as cells are not intimately attached. Several observations of conjugation occurring in the absence of intimate cell-cell attachment indeed support the idea that the pilus can serve as a conduit for relaxase-bound DNA entry into the recipient 36,37 . Where recipients expressing the appropriate OMPs are present, low-efficiency transfer is supplemented by high-efficiency transfer mediated by MPS following pilus retraction. In addition to ensuring that mating pairs stay intimately attached throughout DNA transfer, the interaction of TraN with a suitable receptor may also provide a signal to the core machinery that increases transfer efficiency (Fig. 6b). Our model builds upon the 'shoot and pump' model first proposed by Llosa et al. 38 , where recipients cannot avoid conjugation. Ultimately, conjugation will always occur where there is the expression of functional transfer machinery to impart the sheer push force required to deliver DNA into the recipient cell, while MPS serves to improve the efficiency of this process.
Analysis of TraN sequences found on fully sequenced plasmids revealed a clear correlation of several homologues with a single dominant species, the molecular basis of which is reflected in our in vitro findings of TraN pairing with specific OM proteins in the recipient. One feature of the dataset that should be acknowledged is the bias towards bacterial species that are highly associated with antimicrobial resistance and nosocomial infections in developed countries. Therefore, while our analysis may be useful in highlighting TraN variants that facilitate conjugation of high-risk plasmids in pathogens of interest, it may not reflect the entire range of receptive host species. Some epidemic plasmids associated with resistance or virulence may also be overrepresented. Of note, pKpQIL-like plasmids are found extensively within clinical ST258 isolates 39 . However, these plasmids have also been detected in other sequence types of K. pneumoniae and other bacterial species, suggesting that conjugation still plays a prominent role in their dissemination 20,39 . In summary, our findings highlight the importance of TraN-mediated MPS in driving high-efficiency transfer of IncF plasmids and its role in shaping plasmid host range. Our findings suggest that targeted strategies could be developed to interrupt MPS for high-risk IncF plasmids. Notably, despite great diversity in TraN sequence variants, many are structurally conserved and, therefore, mediate conjugation via the same receptors. Accordingly, we have now classified approximately 90% of the plasmids in our curated dataset into one of four groups (TraNα-δ) on the basis of the TraN structural variant expressed and suggest that conjugation-blocking strategies be prioritized for these four known structures.
We envision that a combination of traN sequencing with rapid structure prediction could form the basis of future surveillance strategies for emergent high-risk plasmids as a means of predicting dissemination into important pathogens.

Generation of mutants.
All genomic mutations were made in ICC8001, a rifampicin-resistant derivative of K. pneumoniae ATCC 43816 using a two-step recombination methodology. Mutagenesis vectors were mobilized from E. coli CC118λpir into pACBSR-carrying strains through a tri-parental conjugation using the E. coli 1047 pRK2013 helper strain. Merodiploid colonies were selected on LB agar containing gentamicin and streptomycin. Selected colonies were grown for at least 4 h in LB supplemented with streptomycin and 0.4% l-arabinose to induce expression of the I-SceI endonuclease from the pACBSR plasmid. Cultures were streaked onto LB agar containing streptomycin and screened for the intended mutations. Mutations in pKpQIL were introduced using the same methodology.
Mutagenesis vectors were generated by Gibson Assembly (New England Biolabs, E2611L) on the pSEVA612S backbone and were maintained in CC118λpir cells. Site-directed mutagenesis on previously generated vectors was performed according to the Q5 Site-Directed Mutagenesis Kit protocol (New England Biolabs, M0554S). Primers used to generate the mutagenesis vectors and for screening are listed in Supplementary Table 5. All mutations were confirmed by sequencing (Eurofins). The R100-1 plasmid was provided by Fernando de la Cruz and transformed into chemically competent DH5α, which served as the donor strain in conjugation assays. GeneArt Gene Synthesis (ThermoFisher) was used to synthesize a nucleotide string encoding the tip region of TraN MV1 .

Selection-based conjugation assays.
For experiments using donors carrying pKpGFP and its derivatives, recipients were transformed with pSEVA471, a low-copy-number plasmid encoding streptomycin resistance. For quantification of R100-1 conjugation, DH5α carrying R100-1 was used as the donor and recipients were transformed with pUltra-sfGFP, which confers gentamicin resistance. For all experiments, overnight cultures of donor and recipient bacteria were washed in phosphate-buffered saline (PBS). Donor and recipient cells were mixed at a ratio of 8:1, which was previously determined to result in the highest conjugation frequency for pKpQIL 40 , and diluted in PBS (1 in 25 v/v). A volume of 40 μl of the final conjugation mixture was spotted onto LB agar and incubated for 6 h at 37 °C. The spots were collected and resuspended in 1 ml of sterile PBS for serial dilution. Recipient colonies were selected on streptomycin- or gentamicin-containing LB agar plates. Transconjugants were selected on plates supplemented with streptomycin and ertapenem for pKpGFP experiments, and streptomycin and gentamicin for R100-1 conjugation experiments. Plates were visualized on a Safe Imager 2.0 Blue Light Transilluminator (ThermoFisher) to confirm plasmid uptake in transconjugant colonies by GFP fluorescence. Conjugation frequency was calculated as the ratio of the colony forming units (c.f.u.) per ml of transconjugants to the c.f.u. per ml of recipients and the data were log10-transformed before statistical analysis.

RTCS assays. Conjugation mixtures were prepared by mixing PBS-washed overnight cultures of donors carrying derepressed reporter plasmids and recipient bacteria. It was determined that maximal fluorescence emission was obtained when donor and recipient bacteria were mixed at a 1:1 ratio without dilution. The conjugation mixture (8 µl) was spotted onto 270 µl LB agar in a 96-well black microtitre plate in technical triplicate. The plates were incubated for 6 h at 37 °C, with fluorescence readings taken at 10 min intervals on a FLUOstar Omega (BMG Labtech). Fluorescence data at each timepoint were calculated by normalizing the raw GFP emission at that timepoint to the minimum GFP emission recorded for each sample over the 6 h time course. Arbitrary fluorescence units (a.f.u.) were determined by calculating the log fold change of fluorescence at t = 300 min for each mutant recipient strain (X) against the WT recipient, that is, $\text{a.f.u.} = \log_{10}(\text{fluorescence}_{X} / \text{fluorescence}_{\mathrm{WT}})$.

Purification of conjugative pili and generation of anti-pili antibodies. GFP-DD overnight cultures (2 l) were collected by centrifugation at 7,000 × g for 20 min and resuspended in 40 ml of cold 1× PBS. The resuspended cells were passed through a 25 G needle 30 times. 'Shaved' bacteria were centrifuged at 50,000 × g for 1 h. The supernatant was mixed with 5% PEG 6000, with constant stirring for 1 h at 4 °C. Conjugative pili were precipitated by centrifugation at 50,000 × g for 30 min. The pellet was resuspended in a buffer containing 50 mM Tris pH 8, 1 M NaCl and dialysed overnight against the same buffer. The purified pili were visualized by negative stain electron microscopy to assess for pilus integrity and purity. Rat polyclonal antibodies were raised against the purified pili (ThermoFisher). Polyclonal antibodies were adsorbed against paraformaldehyde-fixed wild type K. pneumoniae to isolate antibodies specific to the conjugative pilus.
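The two readouts defined in the conjugation assay methods above (log conjugation frequency from selective plating, and a.f.u. from RTCS) can be sketched in a few lines. This is an illustration under stated assumptions rather than the analysis code used in the study: "normalizing to the minimum" is interpreted here as division by the minimum reading, all function names are hypothetical, and the numbers are invented.

```python
import math


def log_conjugation_frequency(transconjugant_cfu_per_ml, recipient_cfu_per_ml):
    """log10 of (transconjugant c.f.u. per ml / recipient c.f.u. per ml)."""
    if transconjugant_cfu_per_ml <= 0 or recipient_cfu_per_ml <= 0:
        raise ValueError("c.f.u. counts must be positive to take a logarithm")
    return math.log10(transconjugant_cfu_per_ml / recipient_cfu_per_ml)


def normalise_to_minimum(raw_gfp):
    """Assumption: each reading is divided by the sample's minimum reading."""
    minimum = min(raw_gfp)
    return [value / minimum for value in raw_gfp]


def afu(mutant_trace, wt_trace, timepoints_min, t_min=300):
    """log10 fold change of normalised fluorescence (mutant vs WT) at t_min minutes."""
    index = timepoints_min.index(t_min)
    mutant = normalise_to_minimum(mutant_trace)[index]
    wild_type = normalise_to_minimum(wt_trace)[index]
    return math.log10(mutant / wild_type)


# Invented example data: three timepoints instead of the full 10 min series
timepoints = [0, 150, 300]
wt_recipient = [100.0, 400.0, 1000.0]     # fluorescence rises tenfold over its minimum
mutant_recipient = [200.0, 200.0, 200.0]  # no increase in fluorescence

print(log_conjugation_frequency(2e4, 2e8))
print(afu(mutant_recipient, wt_recipient, timepoints))
```

With this definition, a recipient that behaves like WT gives an a.f.u. of 0, and negative values indicate reduced plasmid uptake relative to WT.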
Immunofluorescence microscopy. Overnight cultures were diluted 1 in 20 (v/v) in fresh LB and 300 μl was added to glass coverslips placed in a 24-well plate before incubation at 37 °C for 1.5 h to allow bacteria to adhere to the surface of the coverslips. Excess medium was removed and the coverslips were washed with PBS before fixation in 4% paraformaldehyde for 20 min at room temperature. Fixed samples were washed in PBS and blocked in 2% bovine serum albumin (BSA) in PBS (w/v). Samples were washed three times before incubation with anti-pili antibodies (1:100 in 2% BSA/PBS) for 1 h at room temperature. Samples were washed three times in PBS and incubated with Alexa Fluor 488-conjugated donkey anti-rat IgG antibodies (Jackson Immunoresearch, 712-546-150; 1:1,000 in 2% BSA/PBS) for 1 h at room temperature. Coverslips were washed three times in PBS and incubated with FM4-64 (Invitrogen, T13320; 1:100 in water) for 5 min at room temperature. Following this, coverslips were dried and mounted onto glass slides using VECTASHIELD Hardset Antifade mounting medium with DAPI (Vector Laboratories, H-1500-10) according to the manufacturer's instructions. Slides were analysed using a ×100 objective lens on a Zeiss Axio Observer 7 microscope and images were processed on Zen 2.3 (Blue Version, Zeiss). Images shown are representative of at least three biological repeats.
Agarose pad live microscopy. Bacterial conjugation was visualized over time on a Celldiscoverer 7 live cell imaging microscope (Zeiss). For these experiments, the GFP-DD donor strain was mixed with 36WT-expressing recipients that constitutively express dTomato. Overnight cultures of donor and recipient bacteria were washed in PBS, mixed in a 1:1 ratio and 8 μl was spotted onto a 1 cm² 2% agarose (w/v) pad supplemented with M9 salts and 0.4% glucose (w/v). The pad was inverted into a μ-Slide 2-well chambered coverslip (Ibidi, 80286). The sample was maintained at 37 °C throughout live imaging. Images were acquired every 10 min for 3.5 h and processed using Zen 2.3 (Blue Version, Zeiss).

Generation of TraN AlphaFold models. In the absence of homologous TraN structures, ab initio models were generated by AlphaFold v2.0 33 . TraN sequences were submitted to the AlphaFold Colab server with the default settings; the signal peptide was removed from all sequences before modelling. Each structural model was validated by analysing its per-residue confidence score (pLDDT). Molecular graphics and superimposition analysis were performed in UCSF ChimeraX-1.2.5 41 .

Overexpression and purification of TraN and OmpK36. Recombinant OmpK36 was extracted from E. coli outer membranes using 1% n-dodecyl-N,N-dimethylamine-N-oxide (LDAO; Anatrace) and was further purified by affinity chromatography and anion exchange 29 . The final OmpK36 buffer was 50 mM NaCl, 10 mM HEPES pH 7.0 and 0.03% n-dodecyl-β-d-maltoside (DDM; Anatrace). The mature TraN gene (D28 to Q651) from pKpQIL was subcloned into the pTAMAHISTEV vector with an N-terminal His7-tag and a tobacco etch virus (TEV) cleavage site using the NcoI and XhoI restriction enzyme sites. The construct was transformed into E. coli C43 (DE3) competent cells 42 and expressed in Terrific Broth (TB) medium (Formedium) supplemented with 100 µg ml⁻¹ ampicillin. Cultures were grown to an OD600 of 0.6-0.8 at 37 °C, then induced with 0.5 mM isopropyl β-d-1-thiogalactopyranoside (IPTG) and maintained for 16 h at 25 °C. Outer membranes were produced as previously described 43 , then solubilized overnight in 1% DDM in 1X PBS. Insoluble material was pelleted at 131,000 × g for 1 h, and the supernatant was supplemented with 30 mM imidazole and loaded onto an Econo-Column (Biorad) containing 5 ml Ni-NTA resin (Qiagen). The column was washed with 5 column volumes of wash buffer (1X PBS, 30 mM imidazole and 0.1% DDM). TraN-His7 eluted from the resin in wash buffer containing 100-250 mM imidazole. TraN-His7 was dialysed against 50 mM NaCl, 10 mM HEPES pH 7.0 and 0.03% DDM (buffer A) while undergoing incubation with His6-tagged TEV protease for 16-18 h at 4 °C. The dialysed sample was passed over a 5 ml His-Trap column (Cytiva) and the cleaved protein was collected in the flowthrough. Fractions containing TraN were combined and further purified using anion-exchange chromatography (Mono Q 5/10 GL column; Cytiva) using an ÄKTA pure system (Cytiva). The column was equilibrated with buffer A and eluted using a linear gradient with 500 mM NaCl, 10 mM HEPES pH 7.0 and 0.03% DDM (buffer B) over 20 column volumes. TraN eluted in 18% buffer B and was concentrated to 1 mg ml⁻¹ for SEC analysis.
SEC analysis of TraN-OmpK36. TraN and OmpK36 were dialysed against buffer A (16-18 h at 4 °C), combined at a 1:2 molar ratio, respectively, at 1 mg ml⁻¹ and incubated for 16 h at room temperature. The sample was injected onto a Superose 6 10/300 GL column (Cytiva), equilibrated in buffer A and eluted at a flow rate of 0.3 ml min⁻¹ while monitoring the absorbance at 280 nm. This was followed by separate injections of TraN and OmpK36 onto the column at the same molar concentrations as previously described for the comparison of retention volumes. Data were collected on UNICORN 7.5. Fractions were collected and analysed by SDS-PAGE.
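As an aid to interpreting the anion-exchange step above, the nominal NaCl concentration at which TraN eluted (18% buffer B) can be estimated from the stated buffer compositions. The sketch below assumes ideal linear mixing of buffer A (50 mM NaCl) and buffer B (500 mM NaCl) and ignores any system delay volume; the function name is hypothetical.

```python
def nacl_at_percent_b(percent_b, nacl_a_mm=50.0, nacl_b_mm=500.0):
    """Nominal NaCl concentration (mM) for a given percentage of buffer B.

    Assumes ideal mixing of buffer A and buffer B with no delay volume.
    """
    if not 0.0 <= percent_b <= 100.0:
        raise ValueError("percent_b must lie between 0 and 100")
    fraction_b = percent_b / 100.0
    return nacl_a_mm * (1.0 - fraction_b) + nacl_b_mm * fraction_b


if __name__ == "__main__":
    # 18% buffer B corresponds to roughly 131 mM NaCl under these assumptions.
    print(nacl_at_percent_b(18.0))
```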
Cryo-EM sample preparation and data collection. Sample containing OmpK36-TraN at a concentration of 0.33 mg ml⁻¹ was diluted 1:6 in buffer A. In brief, a 4 μl aliquot of sample was applied to a plasma-cleaned (Gatan Solarus) graphene oxide-coated Cu 300 mesh 1.2/1.3 holey carbon grid (Quantifoil), blotted with force 6 for 4.5 s at 90% humidity and flash frozen in liquid ethane using a Vitrobot Mark IV (FEI). The dataset used for structure determination was collected at the Molecular Electron Microscopy Core at the University of Virginia on a Titan Krios EM operated at 300 keV, equipped with an energy filter and K3 direct electron detector (Gatan). An energy filter slit width of 10 eV was used during data collection and was aligned automatically every hour. All 13,668 movies were collected in counting mode at a magnification of ×81,000, pixel size of 1.08 Å, and a defocus range from −2.2 to −1.2 μm. Data collection was performed using a total dose of 50 e⁻ Å⁻² across 40 frames at a rate of 4.78 s per movie.
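The exposure parameters stated above imply a per-frame dose and a dose rate that are often useful when re-processing the movies. The following is a small sketch of that arithmetic, assuming the total dose is spread evenly over the 40 frames and the 4.78 s exposure; the function and key names are illustrative only.

```python
def dose_summary(total_dose=50.0, n_frames=40, exposure_s=4.78, pixel_size_a=1.08):
    """Derive per-frame dose and dose rates from the stated collection settings.

    total_dose is in electrons per square angstrom; pixel_size_a is in angstroms.
    Assumes the dose is delivered uniformly over the exposure.
    """
    dose_rate_per_a2 = total_dose / exposure_s
    return {
        "dose_per_frame_e_per_A2": total_dose / n_frames,
        "frame_time_s": exposure_s / n_frames,
        "dose_rate_e_per_A2_per_s": dose_rate_per_a2,
        "dose_rate_e_per_pixel_per_s": dose_rate_per_a2 * pixel_size_a ** 2,
    }


if __name__ == "__main__":
    for name, value in dose_summary().items():
        print(f"{name}: {value:.4f}")
```

With the stated settings, the per-frame dose works out to 1.25 e⁻ Å⁻².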
Data processing. Unless otherwise stated, all data processing was completed using cryoSPARC v3.2.0 44 . Movies were corrected for full-frame motion using Patch Motion Correction followed by Gctf contrast transfer function estimation. After contrast transfer function estimation, micrographs were sorted and selected on the basis of estimated resolution (better than 4 Å), defocus (−1 to −2.5 μm), ice thickness and total full-frame motion. Initial particles were automatically picked using 'Blob picker' with minimum and maximum particle diameters of 200 and 256 Å, respectively. Particles were extracted at a box size of 256 pixels, followed by two-dimensional (2D) classification. Class averages of trimeric OmpK36 alone and OmpK36 with TraN were selected for template-based particle picking. A total of 13,780,567 particles were extracted using a box diameter of 256 Å. These particles were sorted using 3 iterative rounds of 2D classification with 50 classes each, the number of online-EM iterations being set to 100 and the batch size to 1,000 per class. The final iteration of 2D classification yielded a subset of 3,412,946 particles.
To differentiate particles containing only OmpK36 from those containing OmpK36 + TraN, multiple 3D maps were generated using 'Ab initio reconstruction', with class size set to 4. Output 3D maps were inspected for the presence of TraN. Particles were further refined using two iterations of heterogeneous refinement with input volumes created by multi-class ab initio. The highest resolution class from the second iteration of heterogeneous refinement contained 359,314 particles, which allowed for a ~2.6 Å map to be reconstructed using 'non-uniform refinement' (Extended Data Fig. 7 and Supplementary Table 6).
Model building and refinement. The density for the trimeric OmpK36 allowed us to trace the entire backbone and build most side chains throughout the structure. The OmpK36 crystal structure (PDB ID: 6RD3) 29 was used for building the cryo-EM model, which only had small differences relative to the starting model. The predicted TraN AlphaFold model was used for initial interpretation of the loop-shaped density found within the lumen of one porin channel. The loop and the two β-strands on either side of the hairpin of the AlphaFold model could be fit into the density. Two cysteines at either side of the hairpin fit into the TraN density and were used as a starting point for matching larger side chains within the density. Model building, including adjusting side chains, was performed in Coot 45 . The model was refined in Phenix v1.15.2-3472, using real-space refinement with 'ignoring symmetry conflicts' turned on 46 . Refinement included global minimization, B-factor optimization, and applied secondary structure and Ramachandran restraints. The final model had a MolProbity score of 1.39, with 96% and 0.1% in the Ramachandran favoured and outlier regions, respectively (Supplementary Table 6). The OmpK36-TraN complex coordinates have been deposited to the Protein Data Bank (https://www.rcsb.org/) with PDB ID 7SZI. The EM map has been submitted to the Electron Microscopy Data Bank (https://www.ebi.ac.uk/pdbe/emdb/) with ID EMD-25567.
Bioinformatic analysis of TraN variants. We analysed a previously described dataset of 14,029 sequenced plasmids deposited in GenBank for predicted conjugative plasmids using the Plascad tool for plasmid characterization 34 . Briefly, Plascad predicts and distinguishes conjugative plasmids from mobilizable and non-mobilizable plasmids on the basis of the presence of relaxase, T4CP and T4SS genes. It further classifies conjugative plasmids into the four archetypal mating pair formation (MPF) groups. Putative MPF F conjugative plasmids were extracted from the dataset. Next, the bacterial host family associated with each plasmid was determined by querying the NCBI Taxonomy database via the 'ncbi_taxonomy' module in the Python toolkit 'ETE' (v3.0) and manually curating the results. Finally, PlasmidFinder was used to define the plasmid replicons for the purpose of selecting plasmids carrying an IncF replicon. A curated dataset of 824 putative conjugative IncF plasmids found in an Enterobacteriaceae host was screened for traN variants (Supplementary Table 1). A tBLASTn 35 search was performed; sequences were assigned to the same traN type as a reference if they shared ≥90% amino acid similarity with it and covered ≥75% of the reference length. Phylogenetic trees were constructed with RAxML v8.2.8 47 using the traN nucleotide sequences from plasmids carrying variants of the pKpQIL, R100-1 and F traN genes, midpoint-rooted and visualized with metadata using Microreact 48 (v157). Further verification of annotated traN sequences was performed by analysing open reading frames in the reference plasmids for cysteine residue abundance on the basis that TraN is known to contain at least 20 cysteine residues (Supplementary Table 8).
Statistics and reproducibility. All data are representative of at least three biological repeats. All attempts at replication were successful. Statistical analyses were performed on Prism 9 (GraphPad software). Conjugation data were analysed by repeated measures one-way analysis of variance (ANOVA) with Tukey's or Dunnett's multiple comparison test, as appropriate. Where only two recipient strains were being compared, a two-sided paired t-test was used. A multiple t-test with Holm-Šídák correction was used to assess the effect of OmpK36 mutations in the donor. P values less than 0.05 were considered significant.
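The traN typing thresholds and the cysteine-abundance check described in the bioinformatic analysis above can be expressed as two simple filters. The sketch below is an illustration under stated assumptions rather than the authors' script: it takes "≥75% of the reference length" to mean hit length divided by reference length, and all function names, parameters and example values are hypothetical.

```python
def same_tran_type(percent_similarity, hit_length, reference_length,
                   min_similarity=90.0, min_length_fraction=0.75):
    """Apply the stated thresholds: >=90% similarity and >=75% of reference length."""
    if reference_length <= 0:
        raise ValueError("reference_length must be positive")
    length_fraction = hit_length / reference_length
    return (percent_similarity >= min_similarity
            and length_fraction >= min_length_fraction)


def count_cysteines(protein_sequence):
    """Count cysteine residues (one-letter code C) in a protein sequence."""
    return protein_sequence.upper().count("C")


def has_tran_like_cysteine_content(protein_sequence, min_cysteines=20):
    """Check the 'at least 20 cysteine residues' criterion used for verification."""
    return count_cysteines(protein_sequence) >= min_cysteines


if __name__ == "__main__":
    # Illustrative hit: 95% similarity over 600 residues of a 651-residue reference.
    print(same_tran_type(95.0, 600, 651))
    # Toy sequence, not a real TraN sequence.
    print(has_tran_like_cysteine_content("MKC" * 25))
```

Keeping the thresholds as keyword arguments makes it easy to explore how sensitive the type assignments are to the chosen cut-offs.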

Multiple sequence alignments of
Reporting summary. Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability
The data supporting the findings in this study are provided within the Article and its Supplementary Information.

Reporting Summary
Last updated by author(s): Apr 22, 2022

Software and code
Data collection
RTCS data were collected using BMG Labtech OMEGA 5.70 and exported for further analysis via BMG Labtech MARS Data Analysis 3.42 R5. Immunofluorescence microscopy data were acquired using Zeiss Zen Pro 2.3. SEC data were collected using UNICORN 7.5. Ab initio models of TraN were generated in AlphaFold v2.0.

Data analysis
Statistical analyses were performed in GraphPad Prism 9. IF microscopy images were processed in Zen 3.1 (blue edition). Cryo-EM data processing was performed in cryoSPARC v3.2.0 and the model was refined in Phenix v1.15.2-3472. Bioinformatics analysis was performed using Plascad v1.17, BLAST v2.7.1, Python Toolkit "ETE" v3.0, RAxML v8.2.8 and Microreact v157 (www.microreact.org). Molecular graphics and analyses were performed in ChimeraX-1.2.5. Multiple sequence alignments were performed in Clustal Omega. Alignments and phylogenetic trees were generated in Jalview 2.11.2.2.

Data
The data supporting the findings in this study are provided within the manuscript and Supporting Information. Accession IDs of published sequences for reference plasmids and genomes are listed in the Methods. The coordinates and structure factors of the TraN-OmpK36 complex have been deposited to the Protein Data Bank and Electron Microscopy Data Bank with ID codes 7SZI and 25567, respectively. Source data are provided with this paper.

Life sciences study design

Sample size
No statistical methods were used to determine sample size. All assays were performed in biological triplicate as is standard in the field and the data obtained had passed normality tests prior to further statistical analyses.
Data exclusions
No data were excluded in the final analyses.

Replication
All selection-based conjugation experiments were performed in biological triplicate. Real-time conjugation system (RTCS) assays were performed in technical and biological triplicate. All attempts at replication were successful.