A multiplexed bacterial two-hybrid for rapid characterization of protein–protein interactions and iterative protein design

Boldridge, W. Clifford; Ljubetič, Ajasja; Kim, Hwangbeom; Lubock, Nathan; Szilágyi, Dániel; Lee, Jonathan; Brodnik, Andrej; Jerala, Roman; Kosuri, Sriram

doi:10.1038/s41467-023-38697-x


Article
Open access
Published: 02 August 2023

Nature Communications volume 14, Article number: 4636 (2023)

Abstract

Protein-protein interactions (PPIs) are crucial for biological functions and have applications ranging from drug design to synthetic cell circuits. Coiled-coils have been used as a model to study the sequence determinants of specificity. However, building well-behaved sets of orthogonal pairs of coiled-coils remains challenging due to inaccurate predictions of orthogonality and difficulties in testing at scale. To address this, we develop the next-generation bacterial two-hybrid (NGB2H) method, which allows for the rapid exploration of interactions of programmed protein libraries in a quantitative and scalable way using next-generation sequencing readout. We design, build, and test large sets of orthogonal synthetic coiled-coils, assay over 8,000 PPIs, and use the dataset to train a more accurate coiled-coil scoring algorithm (iCipa). After characterizing nearly 18,000 new PPIs, we identify, to the best of our knowledge, the largest set of orthogonal coiled-coils to date, with fifteen on-target interactions. Our approach provides a powerful tool for the design of orthogonal PPIs.

Introduction

Protein–protein interactions (PPIs) are integral to most biological functions and are required for such diverse processes as cell division, signalling, metabolism, transcription and translation¹. Our ability to design and create functions and structures as complex as those found in nature, though still in its infancy, is progressing with advances in both protein design algorithms and gene synthesis.

For example, designed orthogonal sets of interacting proteins, in which each pair of proteins interacts only with its intended on-target pair and none of the other, off-target proteins present in the set, can be used to build nanoscale superstructures for applications in biology, biological engineering and materials science². Supramolecular protein designs can be created using simple, natural protein families such as coiled-coils, which have been used to build numerous designed protein assemblies^3,4,5. However, identifying orthogonal natural proteins is difficult because evolutionarily related proteins often display significant cross-interactions. Another method is to computationally design de novo proteins; in particular, Rosetta-based designs have produced homodimers^6,7 and heterodimers⁸. In a state-of-the-art example, Chen et al.⁸ performed a single-pot experiment mixing fifteen designed heterodimer pairs, which resulted in a set of twelve orthogonal heterodimers. However, the Rosetta energy function did not successfully predict orthogonality, and predicting orthogonal binding and designing large orthogonal sets remain beyond current de novo design methods³.

Coiled-coils, in particular, have many useful characteristics in terms of creating atomically precise designs for macromolecular structures. They are small and precisely oriented, and numerous sequence-based and parametric models exist with which to describe their properties. First identified at the dawn of molecular biology by both Pauling⁹ and Crick¹⁰, coiled-coils are defined by their heptad repeat HPPHPPP (H = hydrophobic residue, P = polar residue), represented as positions abcdefg. This relatively simple structure has given rise to many computational models describing coiled-coil interactions, from the parametric Crick equations of 1953¹¹ to contemporary linear models^12,13,14,15. However, because of their shared similar structure, building large sets of orthogonally interacting coiled-coils, in which all on-target interactions are favoured over all off-target interactions, remains difficult. Though numerous groups have attempted to create orthogonal sets of coiled-coils, these sets have been limited in size and displayed significant off-target interactions^15,16. Increasing our ability to build and characterise large sets of interacting proteins could help solve this problem by providing empirical data with which to improve computational models of PPIs. Simultaneously, this would vastly increase the number of available orthogonal building blocks for nanoscale structural design, allowing for the creation of previously unbuildable structures.

Here, we combine gene synthesis, an assay that allows for multiplexed bimolecular interaction screening, and a computational pipeline to design large libraries of orthogonally interacting coiled-coils. We first built and validated the next-generation bacterial two-hybrid (NGB2H) system, which has several unique advantages over other methodologies in terms of characterising protein libraries. In particular, the NGB2H system allows for the screening of bimolecular interactions without having to test all-against-all libraries; direct large-scale synthesis using oligonucleotide arrays to explore the design space; quantitative readouts on an entire library, including negative interactions; and the characterisation of low-affinity interactions inside the crowded cellular context. We did this iteratively, with synthetically designed libraries increasing in size from 256 interactions to more than 18,000 interactions. From this, we identified, to the best of our knowledge, the largest sets of orthogonal proteins to date and developed an improved coiled-coil scoring algorithm (iCipa) for use in future investigations of this versatile protein domain.

Results

NGB2H system design

Despite a wealth of techniques via which to analyse PPIs, there is not currently a method that facilitates high-throughput characterisation when analysing PPIs in formats other than all-against-all or is able to distinguish between closely related constructs. However, such a system would allow investigations of PPIs within protein families, polymorphic PPIs, and de novo designed PPIs that are currently intractable. Thus, we built a generalisable, scalable bacterial two-hybrid system using a significantly modified version of the B. pertussis adenylate cyclase two-hybrid¹⁷ (Fig. 1A, Supplementary Information Section 1). Briefly, the two-hybrid functions much as in Karimova et al.¹⁷, in which interacting hybrid proteins reconstitute adenylate cyclase to produce cAMP, which drives reporter gene expression. We measured the relative transcription of a uniquely identifying DNA barcode residing in the reporter gene, which serves as a measure of interaction strength. The barcode is mapped to the two fully sequenced hybrid proteins at an early cloning step using high-throughput sequencing when the barcode and proteins are physically adjacent. This unambiguously identifies even highly homologous proteins and separates synthetic errors from programmed designs. Thus, measuring the relative barcode transcription provides a quantitative, massively multiplexed characterization of PPIs with short-read sequencing. Because the NGB2H system uses a mapping step, it can use gene synthesis, rather than preconstructed libraries, to create diversity, which further frees it from the one-against-all or all-against-all testing common in two-hybrids. 
We made a number of other improvements, including (1) titratable and inducible control of hybrid protein expression and optimised reporter response on a single plasmid, (2) a background strain with linear cAMP accumulation, (3) a green fluorescent protein (GFP) reporter instead of beta-galactosidase for more rapid individual characterisation, (4) the use of multiple barcodes per construct to achieve statistically robust results and (5) a scarless cloning scheme that allows for library creation with any designed sequence (more information in Supplementary Information Section 1).

**Fig. 1: Design and validation of the NGB2H assay.**

Validation of the NGB2H system

After optimising the system with single-construct GFP measurements (Supplementary Fig. 1), we validated the NGB2H system with 256 previously characterised interactions¹⁵, which we call the CC0 Library. The CC0 Library is a set of sixteen de novo designed, orthogonal, heterodimeric coiled-coils that are tested in an all-against-all configuration. The proteins are highly similar, being four heptad coiled-coils that vary only at the a-position (Ile/Asn), e-position and g-position (Lys/Glu) (Fig. 1B). We designed the CC0 Library to be compatible with our system (Supplementary Fig. 2A) and then barcoded and cloned it (Supplementary Figs. 3A, 4). After inducing the two-hybrid for six hours, we took samples for RNA and DNA extraction to measure the interaction strength and normalize for plasmid abundance, respectively. We obtained high-quality measurements for all 256 protein pairs and calculated an interaction score, defined as the natural logarithm of the median of the ratio of the RNA to DNA reads: \(\text{Interaction score} = \ln\left(\operatorname{median}\left(\frac{\text{RNA reads}}{\text{DNA reads}}\right)\right)\).

Only barcodes for which ten or more reads were obtained in every DNA replicate and that perfectly mapped to designed protein pairs were used in further analysis. The NGB2H assay was highly replicable, with biological replicates having similar interaction scores (Pearson’s r > 0.98, p < 10⁻¹⁵), with a dynamic range of more than 100-fold (Fig. 1C).
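The score definition and the barcode filter described above can be illustrated with a minimal Python sketch. The input layout (one RNA count and a list of per-replicate DNA counts per barcode) and the averaging of DNA replicates are assumptions made for illustration; the text does not specify how replicates are combined.

```python
import math
from statistics import mean, median

MIN_DNA_READS = 10  # read threshold stated in the text


def interaction_score(barcodes):
    """Return ln(median(RNA/DNA)) over the barcodes of one protein pair.

    `barcodes` is a list of dicts with hypothetical keys:
      "rna": RNA read count for the barcode
      "dna": list of DNA read counts, one per replicate
    """
    ratios = []
    for bc in barcodes:
        # Keep a barcode only if every DNA replicate has at least 10 reads.
        if min(bc["dna"]) < MIN_DNA_READS:
            continue
        # Assumption: DNA replicates are averaged before taking the ratio.
        ratios.append(bc["rna"] / mean(bc["dna"]))
    if not ratios:
        return None  # no usable barcode for this pair
    med = median(ratios)
    return math.log(med) if med > 0 else None


example = [
    {"rna": 200, "dna": [20, 20]},
    {"rna": 50, "dna": [50, 50]},
    {"rna": 1000, "dna": [5, 40]},  # dropped: one replicate has fewer than 10 reads
    {"rna": 400, "dna": [100, 100]},
]
score = interaction_score(example)  # ln(median([10, 1, 4])) = ln(4)
```

Taking the median over several barcodes per construct makes the score insensitive to a single outlying barcode, which is consistent with the use of multiple barcodes per construct described earlier.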

We checked several internal controls to validate the measurements of the NGB2H assay. First, because the protein code is degenerate, we screened nine codon usages for each pair of proteins. Different codon usages showed consistent interaction scores (representative pair Fig. 1D), with all usages correlating with Pearson’s r > 0.92 and p < 10⁻¹⁵ (Supplementary Fig. 5), demonstrating minimal effects on the part of DNA sequence variation and low levels of noise in the interaction scores. We also compared the interaction scores of protein pairs when the two constituent proteins were attached to the other half of the two-hybrid, which we call the reciprocal orientation. We found that the CC0 Library exhibits a strong correlation between the primary and reciprocal orientations (Pearson’s r = 0.92, p < 10⁻¹⁵, Fig. 1E), indicating that the biological machinery of the NGB2H system faithfully recapitulates the biochemical interaction. In addition, a portion of our library contained frameshift mutations, which should not create functional PPIs. As expected, the interaction scores of constructs with indels are clustered at the bottom of the range of correct constructs (Supplementary Fig. 6). Last, to show that the NGB2H system does not suffer from barcode effects or selection pressure from the repeated cloning steps, we replicated the assay with an independent re-barcoding and re-cloning of the CC0 Library, which showed strong correlation with the first iteration’s interaction scores (Pearson’s r > 0.98, p < 10⁻¹⁵, Fig. 1F).

Having confirmed the internal consistency of the CC0 Library, we compared it to the previously published results. Compared to the circular dichroism data published in Crooks et al.¹⁵, we found that the NGB2H system’s dynamic range correlated well with melting temperatures greater than 40 °C (Fig. 1G, H). Given the differences in technique (in vivo versus in vitro, interaction strength versus helicity), the correlation between the interaction score and melting temperatures (Pearson’s r > 0.75, p < 10⁻¹⁵, Supplementary Fig. 7) largely validates the NGB2H system. Finally, the NGB2H system must be highly scalable. To test its scalability, we computationally reduced the number of reads used in the analysis between 10 and 150-fold and found strong agreement with our full dataset, even when the raw data were reduced 100-fold (Pearson’s r > 0.85, p < 10⁻¹⁵, Fig. 1I), which implies the ability to accurately screen ~25,000 interactions at a similar read depth.
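One way to reduce read counts computationally is binomial thinning, in which each read is kept independently with probability 1/fold. The sketch below is an illustration under that assumption; the text does not state which subsampling procedure was used, and the function name and input layout are hypothetical.

```python
import random


def downsample(counts, fold, seed=0):
    """Thin per-barcode read counts by keeping each read with probability 1/fold.

    `counts` maps a barcode (hypothetical string key) to its read count.
    A fixed seed makes the sketch reproducible.
    """
    rng = random.Random(seed)
    keep = 1.0 / fold
    return {
        barcode: sum(rng.random() < keep for _ in range(n))
        for barcode, n in counts.items()
    }


counts = {"bc1": 1000, "bc2": 10}
unchanged = downsample(counts, fold=1)   # probability 1: every read is kept
thinned = downsample(counts, fold=10)    # roughly a tenth of the reads remain
```

Interaction scores recomputed from the thinned counts can then be correlated against the full-depth scores. Under this reading, a 100-fold reduction on 256 interactions corresponds to about 256 × 100 ≈ 25,600 interactions at the original depth, in line with the ~25,000 figure quoted above.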

Design of large sets of orthogonal coiled-coils

All dimeric coiled-coils have a similar structure, which is why sequence-based scoring functions can fruitfully predict melting temperatures or binding affinities. The scoring functions accept two sequences as input, usually beginning with a specific register, and return a score. One of the widely used algorithms is bCipa¹⁴, which is based on summing weights for residue-residue interaction pairs, as well as electrostatic interactions and helical propensity, and predicts melting temperatures. The state-of-the-art scoring function was developed by Potapov et al.¹³, which uses triplet weights, in addition to the pair weights, and a much larger training set to predict the free energy of binding. The paper also benchmarks the most common CC scoring functions, such as Fong/SVM¹⁸ and Vinson/CE¹².

To computationally predict large, orthogonal sets of coiled-coils for empirical verification, we built a two-step computational pipeline (Fig. 2A). In brief, we calculated 16.7 million scores for all dimeric interactions between four-heptad coiled-coils with Ile or Asn at the a-position and Glu or Lys at the e- and g- positions using the scoring model of Potapov et al.¹³. The surface b-, c- and f-positions were set to Ala. We then identified orthogonal sets, which can be divided into on-target and off-target interactions such that each constituent protein participates in exactly one on-target interaction, which is stronger than every off-target interaction. This allows us to define an orthogonality gap for an orthogonal set, where the orthogonality gap is calculated as the weakest on-target interaction minus the strongest off-target interaction. For example, in Fig. 2B, on-target interactions are on the diagonal (homodimers) or just above the diagonal (heterodimers). All other interactions are considered off-target. Though computationally challenging, identifying sets with an orthogonality gap is tractable as a variant of the maximum independent set problem¹⁹. Using the bCipa and Potapov scoring functions, we identified the fifteen largest sets and included each of them with three different sets of residues at the b-, c- and f-positions because surface positions can modulate dimer stability and solubility²⁰. We refer to a set of residues used at the b-, c- and f- positions as backgrounds because these do not affect orthogonality. We combined these with two sets of controls spanning eleven backgrounds, resulting in a total of 56 sets containing between 64 and 961 interactions (8169 interactions overall), which we named the CCNG1 Library. After testing a subset of the CCNG1 Library to validate our in-house designs, which we call the CC1 Library (see Supplementary Figs. 8, 9; Supplementary Information Section 8.3), we designed (Supplementary Fig. 2C), cloned (Supplementary Figs. 3C, 4), and performed the NGB2H assay, from which we collected quality data (Supplementary Fig. 10) on 8073 interactions. The CC0 Library was added to the CCNG1 library as an internal control (Supplementary Fig. 10C).
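The orthogonality gap defined above (weakest on-target score minus strongest off-target score) can be written as a short Python sketch. The data layout is an assumption for illustration: scores are treated as symmetric and keyed by a `frozenset` of protein names, so a homodimer is a one-element set, and higher scores are taken to mean stronger interactions.

```python
def pair(a, b):
    """Unordered protein pair; pair("C", "C") is the homodimer of C."""
    return frozenset((a, b))


def orthogonality_gap(scores, on_target):
    """Weakest on-target score minus strongest off-target score.

    `scores` maps every pair among the set's proteins to a score;
    `on_target` is the set of designed on-target pairs.
    """
    proteins = set().union(*on_target)
    on = [scores[p] for p in on_target]
    # Off-target: every scored pair among the same proteins that was not designed.
    off = [s for p, s in scores.items() if p <= proteins and p not in on_target]
    if not off:
        return float("inf")  # nothing to cross-react with
    return min(on) - max(off)


# Toy example with hypothetical scores: heterodimer A:B and homodimer C:C.
scores = {
    pair("A", "B"): 5.0, pair("C", "C"): 4.0,   # on-target
    pair("A", "A"): 1.0, pair("B", "B"): 0.5,   # off-target
    pair("A", "C"): 2.0, pair("B", "C"): 1.5,   # off-target
}
gap = orthogonality_gap(scores, {pair("A", "B"), pair("C", "C")})  # 4.0 - 2.0
```

A positive gap means every on-target interaction outscores every off-target one; a negative gap means at least one off-target interaction is stronger than the weakest designed pair.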

**Fig. 2: Large orthogonal subsets of coiled-coils from the CCNG1 library.**

The space of all possible pairs, assuming only our limited set of amino acid residues (~16 M), is several orders of magnitude larger than what could be screened experimentally (~25 k), so the design process is crucial in identifying feasible orthogonal sets that can be experimentally tested.

Large orthogonal sets in the CCNG1 library

Although we designed our coiled-coils to form orthogonal sets, the current state-of-the-art coiled-coil scoring functions are not sufficiently accurate to do so reliably, and nearly all sets contained off-target interactions stronger than some of the on-target pairs. The proteins involved in strong off-target interactions can be removed from the set, leaving only those interactions that are experimentally verified to be orthogonal. Thus, we refer to an orthogonal subset as the largest experimentally characterised group of orthogonal interactions among what was computationally predicted to be an orthogonal set. To identify the orthogonal subset of each designed orthogonal set, we used a similar approach to that described above and reduced the problem to the maximum independent set problem using interaction scores from the NGB2H assay.

To make our results robust to experimental noise from the NGB2H assay, we needed to find an appropriate orthogonality gap that is larger than the uncertainty of the interaction score. We performed a thorough analysis of the CC0 internal control (technical repeats), external controls (comparison to measured melting points, Supplementary Fig. 10C) and, in particular, the reciprocal enzyme orientations available for the same peptide pair (pairs of identical peptides where the split adenylate cyclase fragments are reversed). We found an uncertainty of less than 0.8 Interaction Score units in all experiments in this paper (Supplementary Data 9). Thus, to be conservative, we enforced an orthogonality gap of at least 1.0 Interaction Score. Using this framework, we were able to identify an orthogonal subset of coiled-coils that contains six pairs, which includes one heterodimer and five homodimers (Fig. 2B). The orthogonality gap we enforce is very strict: for example, the CC0 control set has a gap of only 0.4, and at an orthogonality gap of 1.0 it contains only four pairs instead of seven.
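To make the idea of an orthogonal subset with an enforced gap concrete, the sketch below searches exhaustively for the largest subset of designed pairs whose measured gap meets a threshold. This brute-force search is a stand-in chosen for clarity and is only practical for small sets; it is not the maximum independent set reduction used in the paper. The data layout and all scores are hypothetical.

```python
from itertools import combinations


def gap(scores, pairs):
    """Weakest on-target minus strongest off-target score for `pairs`."""
    proteins = set().union(*pairs)
    on = [scores[p] for p in pairs]
    off = [s for p, s in scores.items() if p <= proteins and p not in pairs]
    return min(on) - max(off) if off else float("inf")


def largest_orthogonal_subset(scores, candidates, min_gap=1.0):
    """Largest subset of candidate on-target pairs with gap >= min_gap."""
    for size in range(len(candidates), 0, -1):
        for subset in combinations(candidates, size):
            # Each protein may take part in exactly one on-target pair.
            members = [m for p in subset for m in p]
            if len(members) != len(set(members)):
                continue
            if gap(scores, set(subset)) >= min_gap:
                return set(subset)
    return set()


P = lambda a, b: frozenset((a, b))  # unordered pair; P("C", "C") is a homodimer
scores = {
    P("A", "B"): 5.0, P("C", "C"): 4.0, P("D", "D"): 3.0,  # designed pairs
    P("A", "A"): 1.0, P("B", "B"): 1.0,
    P("A", "C"): 0.5, P("B", "C"): 0.5,
    P("A", "D"): 0.5, P("B", "D"): 0.5,
    P("C", "D"): 3.5,  # strong off-target interaction between C and D
}
candidates = [P("A", "B"), P("C", "C"), P("D", "D")]
best = largest_orthogonal_subset(scores, candidates, min_gap=1.0)
```

In this toy example the strong C:D cross-interaction rules out the full set, and the search keeps the A:B heterodimer and the C:C homodimer, whose gap is 4.0 − 1.0 = 3.0.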

There are also applications where the requirements for orthogonality can be reduced, for example in building protein origami as demonstrated by Aupič et al.²¹, in which two identical pairs were used in the same structure. Pairwise orthogonality is the most stringent criterion. In a single-pot experiment, in which all pairs would be present, we speculate that orthogonality would only improve because the off-target states would be competing with the on-target states.

Therefore, we also calculated orthogonal sets with orthogonality gaps of 0.0 and 0.5. At an orthogonality gap of zero, 20 of our 51 experimentally identified orthogonal subsets in the CCNG1 Library had more than the seven on-target orthogonal interactions (Supplementary Fig. 13). Orthogonal sets at different orthogonality gaps are presented in Supplementary Data 6.

The CCNG1 Library represents the first large-scale systematic investigation of the effects of variation at the b-, c-, and f-positions; therefore, we sought to understand how these positions influenced interactions. As expected, we found that different backgrounds did not significantly affect orthogonality (Fig. 2C and Supplementary Figs. 11, 12). We tested six backgrounds containing the same interfacial residues as the CC0 Library (Supplementary Fig. 14 and Supplementary Information Section 8.4) and found that charged but less helical backgrounds led to weaker, less specific interaction profiles. The findings agree with the model presented by Drobnak et al.²², in which the b-, c- and f- positions were used to modulate affinity.

Improvement of coiled-coil interaction-prediction algorithms

The CCNG1 Library dataset represents the largest dataset of coiled-coil interactions to date. We reasoned that our data could serve as a training set to improve on currently available models. To benchmark current models, we computed scores using the algorithms bCipa¹⁴, Potapov/SVR¹³, Fong/SVM¹⁸ and Vinson/CE¹², which are all linear models with features for amino acid pairings. Each algorithm is only weakly predictive of our measured interactions with the bA background (Fig. 3A), with all models having an R² < 0.2. Notably, each algorithm predicted the strongest interactions well but also predicted many weak interactions that, when measured, had high interaction scores.

**Fig. 3: Comparison, development and validation of the iCipa model.**

We built several linear models similar to bCipa, which included numerous innovations (Supplementary Information Section 3). First, we trained a model on our data that only included weights for the a-, d-, e- and g- position combinations. We also created versions of this simple model with terms for either consecutive residues in the a- position of the same protein or separate terms for weights at the N-terminal a- position, where fraying may occur (Supplementary Fig. 15A).

We then expanded these models with a scoring technique, which we call heptad shifts (Supplementary Fig. 15B). In short, we expect the predominant form of coiled-coil interaction to be the alignment of heptads that have the strongest interaction. In terms of the large number of off-target interactions, this does not necessarily indicate that all four heptads are aligned with the N-terminus but, rather, could indicate an interface of three or fewer heptads. We have trained the models iteratively by changing the alignment of off-target pairs, retraining the models and rescoring the off-target alignments until convergence was achieved (in less than five repetitions in all cases). All of our heptad-shifting scoring algorithms were significantly better than the corresponding non-shifting versions. Our N-terminal a- position weights algorithm was significantly better than both the basic algorithm and the consecutive a- position algorithm (Fig. 3B). Thus, our final model, which we call iCipa, uses heptad shifting and terms for the N-terminal a- positions, and it is more predictive of CCNG1 Interaction scores than previous models, with an R² = 0.27 (Fig. 3C). The effect of heptad shifting on iCipa, as well as bCipa and the Potapov scoring function, is shown in Supplementary Fig. 16.
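The scoring side of heptad shifting can be sketched as follows: score every allowed heptad offset between two coils over their overlapping heptads and keep the best one. This is an illustration only. The per-heptad weights below are invented toy values, not iCipa's trained weights; the sequences are assumed to be written in abcdefg register; and the iterative retraining loop described above is not shown.

```python
def split_heptads(seq):
    """Split a sequence written in abcdefg register into 7-residue heptads."""
    return [seq[i:i + 7] for i in range(0, len(seq), 7)]


def toy_heptad_score(hx, hy):
    """Hypothetical weights on the a-position only (index 0); higher is stronger."""
    a_x, a_y = hx[0], hy[0]
    if a_x == a_y == "I":
        return 1.0
    if a_x == a_y == "N":
        return 0.5
    return -1.0  # Ile/Asn mismatch


def best_shift_score(x, y, heptad_score=toy_heptad_score, max_shift=1):
    """Return (score, shift) for the best-scoring heptad alignment of x and y."""
    hx, hy = split_heptads(x), split_heptads(y)
    results = []
    for shift in range(-max_shift, max_shift + 1):
        # Positive shift drops leading heptads of x; negative drops those of y.
        left = hx[shift:] if shift >= 0 else hx
        right = hy if shift >= 0 else hy[-shift:]
        # zip() truncates to the overlapping heptads.
        total = sum(heptad_score(a, b) for a, b in zip(left, right))
        results.append((total, shift))
    return max(results, key=lambda r: r[0])


def coil(a_residues):
    """Build a toy coil from its a-position residues (other positions arbitrary)."""
    return "".join(a + "AALEAK" for a in a_residues)


x, y = coil("NIII"), coil("IIIN")
best = best_shift_score(x, y)  # the one-heptad shift aligns three Ile/Ile pairs
```

In this toy case the fully aligned register scores 0.0 because of two Ile/Asn mismatches, while a one-heptad shift gives a three-heptad interface scoring 3.0. Whether and how shorter overlaps should be penalised is a modelling choice that this sketch leaves out.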

iCipa is a linear model, which facilitates interpretation. The weights of iCipa have expected and unexpected characteristics (Fig. 3D). a- position residues prefer Ile/Ile pairings, tolerate Asn/Asn pairings between proteins and disfavour Ile/Asn pairings, as expected. As expected, the e- and g- positions favour salt bridges between Glu/Lys and disfavour Glu/Glu pairings. Perhaps counterintuitively, Lys/Lys pairings are acceptable, and previous biochemical work has identified mildly favourable binding contributions on the part of Lys/Lys pairings²³.

To test the iCipa model, we excluded all the data from the original CC0 Library while we trained the weights. When the scoring functions are normalised and compared (Fig. 3E), both the Potapov/SVR and bCipa algorithms performed worse in terms of predicting the measured melting temperatures, with R² < 0.32, as compared to iCipa, with R² = 0.48, representing a 50% increase in predictive ability. Importantly, the increase in predictive power for iCipa on the CC0 Library demonstrates that iCipa has not been trained on an artifact of the NGB2H system but, rather, that the NGB2H system provides high-quality data on PPIs, which can provide general insights into coiled-coil function.

CCmax library design and verification

To evaluate iCipa’s prediction capabilities, demonstrate the scalability of the NGB2H system, and identify larger orthogonal sets of coiled-coils, we built another library, the CCmax Library. The CCmax Library contains 18,491 interactions and contains 931 different coiled-coils in fifteen predicted orthogonal sets and seven control sets (Fig. 4A). The orthogonal sets were designed using our computational framework and scored with one of fifteen variants of iCipa. After designing (Supplementary Fig. 2D) and cloning, we collected high-quality data on 17,983 interactions (Supplementary Fig. 17). The CC0 Library was an internal control added to the CCmax Library, and it broadly agreed with its performance in our previous libraries (Supplementary Fig. 18).

**Fig. 4: The largest orthogonal subsets of the CCmax library.**

Orthogonal sets of the CCmax library

As with the CCNG1 library, we identified the largest experimentally identified orthogonal subsets of each designed set with an orthogonality gap of 1.0 Interaction Score. These orthogonal subsets have as many as fifteen on-target pairs (Fig. 4B) and 318 total interactions from 18 different proteins (Supplementary Fig. 19). Five of the orthogonal subsets contained more on-target interactions than the largest published coiled-coil set¹⁵. Our largest orthogonal subset (Fig. 4C) contained fifteen coiled-coil dimers, twelve homodimers and three heterodimers, which is nine more on-target interactions than the set from CCNG1, showing the improvement of iCipa over bCipa and the Potapov scoring functions.

Similar to the CCNG1 Library, we also identified sets with lower orthogonality gaps of at least 0.0 Interaction Score, 0.5 Interaction Score, and one RMSD between the reported melting temperatures of the CC0 subset of the CCmax library mapped to Interaction Scores (Supplementary Fig. 17C). Lowering the orthogonality gap identified more interactions, with a maximum of twenty-two on-target interactions from twenty-eight different proteins when the gap is zero (Supplementary Fig. 20). All the orthogonal sets are listed in Supplementary Data 6.

Different applications require different levels of orthogonality; while gene circuits likely require extreme orthogonality, protein origami, which benefits from avidity, is not under such strict constraints. Thus, we identified the largest orthogonality gap for different numbers of on-target interactions (Fig. 4D; Supplementary Data 7). As expected, smaller sets had larger gaps, but orthogonality gaps of at least 0.5 Interaction Score were identified for sets as large as seventeen on-target interactions. Finally, we compared the CCmax Library’s interaction scores with the iCipa predictions, which show substantial improvement over the CCNG1 Library. iCipa was able to predict interaction scores, with R² = 0.43 (Fig. 4E). We attribute the increase in iCipa’s power to the use of a coiled-coil background that consists of only alanine residues at the b-, c- and f- positions. The improvement in predictive power appeared in other algorithms to a lesser extent, all of which maintained an R² < 0.28 (Supplementary Fig. 21).

Discussion

We have developed and validated a system for the high-throughput identification of PPIs. We built a framework to predict orthogonal coiled-coil interactions and used it to design over 26,000 interactions, which we then assayed with the NGB2H system in a design-build-test cycle, summarized in Supplementary Data 8. Using the data collected, we improved state-of-the-art coiled-coil interaction prediction algorithms, which allowed us to design the largest set of any orthogonal proteins to date, with fifteen on-target interactions. Thus, by using iterative design, we demonstrate how high-throughput PPI characterisation can facilitate the identification of a desired protein function and improve design.

Our work builds on previous high-throughput two-hybrids to create a generalisable system for studying PPIs, which can include both soluble and membrane proteins. By uniting gene synthesis with a mapping step and a barcode readout, our system allows for the high-throughput characterisation of any binary PPI. Previous high-throughput studies used highly constrained libraries — either the ORFome^24,25,26,27 of one of a handful of reference genomes; targeted single residue mutations, which only explore a sliver of sequence space around a primary sequence^28,29, or several randomly sheared coding sequences³⁰. Using the capabilities of DNA synthesis broadens the testable sequence space, which facilitates investigations of a variety of areas, such as families of protein domains, extant genetic variation, evolutionary trajectories and epistatic effects. Furthermore, for the investigator who is not interested in an all-against-all approach, synthesis allows for the explicit pairings of only certain proteins. While we benefited from the short length of our proteins of interest, recent pooled gene synthesis techniques^31,32 can be used to interrogate much larger proteins. Deconvoluting library diversity has also been a challenge for other multiplexed assays. Other multiplexed methods involved picking colonies and Sanger sequencing them²⁴, mapping the beginning of reading frames to reference genomes^25,26,27 or manually BLASTing obtained reads³⁰. Our explicit mapping step allows for the high-throughput creation of a library to map arbitrary proteins to DNA barcodes, and because it is a separate step, it could use long-read sequencing to overcome the length limitations of Illumina sequencing.

We do note that our system has several limitations. Notably, our system is limited to measuring dimeric interactions and unable to sense orientation and whether higher-order structures are formed. In our studies, the assemblies follow the consensus design rules and are predicted to be parallel dimeric coiled-coils^33,34, but other structures may have formed. It is also challenging to compare results between libraries. Using a next-generation sequencing readout means that each data point is relative to all other datapoints assayed at the same time, and this may change significantly between libraries that are composed of different proteins, making the comparison of interaction scores between libraries difficult. Lastly, we note that we have only tested interactions pairwise and cannot predict what might occur if more than one pair is present in solution.

Our improvements to coiled-coil design algorithms represent an important advance for de novo protein design. Though coiled-coil interactions have been modelled with diverse approaches, our iCipa algorithm shows clear advantages over existing models. In particular, heptad-shifting provides an intuitive and biologically rational addition that can be applied to any future improvements in coiled-coil design. The combination of heptad shifting with improved and novel weights for sequence features made iCipa substantially more accurate than other tested algorithms, at least for the limited set of residues tested. To increase ease of use, the iCipa scoring function is also available as a webservice at https://ajasja.github.io/icipa.

Here, we simultaneously performed a massive characterisation of PPIs within a protein family and identified the largest set of orthogonal proteins found to date. The CCmax Library characterised twice as many total interactions as Potapov et al.¹³. From the total of 26,049 interactions we characterised, we found many orthogonal proteins: up to 15 on-target pairs at a strict orthogonality gap of 1.0, which is twice the size of the largest coiled-coil set designed by Crooks et al.¹⁵ and contains three more interactions than the four-helix bundle orthogonal set found by Chen et al.⁸. Relaxing the orthogonality constraints produces up to 12 heterodimers or 22 heterodimers and homodimers at an orthogonality gap of 0.0.

Though orthogonal coiled-coils are particularly needed as the building blocks for protein origami^4,5, they could be substituted for histidine kinases in orthogonal signalling pathways or synthetic orthogonal transcriptional logic gates^8,35,36 or for the sake of orthogonal cellular localization³⁷.

Thus, the ability to characterize constructs across a highly diverse sequence space and identify networked properties, such as orthogonality, highlights the NGB2H’s scalability and generality. Because it can be adapted to any sequence the experimenter desires, the NGB2H facilitates the interrogation of PPIs beyond endogenous interactomes. It can be used to characterise entire protein families, empirically inform protein design, or investigate complex phenomena such as epistasis.

Methods

Oligonucleotide designs

Libraries were designed as shown in Supplementary Fig. 2. Though the CC0 and CC1 Libraries were assembled from two oligonucleotides and the CCNG1 and CCmax Libraries from one oligonucleotide, they followed the same overall assembly logic. In brief, each library was flanked with two orthogonal 15 bp primers³⁸ for amplification from the OLS pool. Interior to the flanking primers were type IIS restriction enzyme sites to facilitate scarless cloning, and the complete coiled-coil sequence. The CC0 and CC1 Libraries contain extra type IIS sites and flanking 15 bp primers to allow linking and amplification of the X and Y halves of the two-hybrid. A complete description of each design is listed in Supplementary Information Section 2 and all oligonucleotides used are listed in Supplementary Data 1 while all proteins used are listed in Supplementary Data 2.

Orthogonal coiled-coil interaction prediction

To predict orthogonal coiled-coils, we generated all 4,096 possible four-heptad coiled-coils with asparagine or isoleucine at the a-position and glutamic acid or lysine at the e- and g-positions and scored 16.7 million interactions in an all-on-all design using the Potapov algorithm (CCNG1 Library) or our iCipa candidate algorithms (CCmax Library). Calculating orthogonality is a challenging problem that scales in exponential time with the number of possible binding partners. We used a maximal clique algorithm to identify sets of orthogonal coiled-coils in which all on-target interactions have a higher score than all off-target interactions; it runs in under a minute on a standard laptop. Python v3.7, Pandas 0.25.1, numpy 1.15.4 and jupyter lab 4.4.0 were used for development.
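The size of this design space follows directly from the stated choices: two options at each of three positions (a, e, g) in each of four heptads gives 2¹² = 4,096 designs, and 4,096² ≈ 16.7 million ordered pairs. The following minimal sketch enumerates only those variable positions. The residues at the remaining heptad positions are not given in this section and are left out, and all variable names are illustrative rather than taken from the authors' code.

```python
from itertools import product

# Variable positions described in the text (one-letter amino acid codes).
A_CHOICES = ("N", "I")    # asparagine or isoleucine at the a-position
EG_CHOICES = ("E", "K")   # glutamic acid or lysine at the e- and g-positions
N_HEPTADS = 4

# Every (a, e, g) combination for a single heptad: 2 * 2 * 2 = 8 options.
heptad_options = list(product(A_CHOICES, EG_CHOICES, EG_CHOICES))

# Every four-heptad design: 8 ** 4 = 4096 combinations.
designs = list(product(heptad_options, repeat=N_HEPTADS))

# All-on-all scoring considers every ordered pair of designs.
n_pairs = len(designs) ** 2

print(len(designs), n_pairs)
```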

Construct and library cloning

Each library was cloned in a similar manner, with slight differences in the methods used to attach a random DNA barcode to the OLS pools. After 20 bp of random DNA was attached by PCR to the 3’ end of the X and Y construct (Supplementary Fig. 3), constructs were sequenced in bulk on a MiSeq to associate each barcode with a specific X and Y pair (see below). After barcode mapping, the T25 and T18 + GFP halves were cloned in sequentially with type IIS restriction enzymes for scarless cloning (Supplementary Fig. 4). All enzymes and polymerases came from NEB. A complete description of how each library was cloned can be found in the Supplementary Information Section 4, and oligonucleotides used for cloning are listed in Supplementary Data 3.

Mapping random barcodes

Once random barcodes were attached and cloned, constructs were sequenced on an Illumina MiSeq to identify the X and Y proteins to which each barcode was connected. DNA containing the X and Y proteins and the barcode was amplified as a linear fragment, and Illumina’s P5 and P7 adapters were attached. Constructs were sequenced with a v3 300 cycle paired-end kit (Illumina TG-142-3003), with custom primers spiked into the Illumina primers. Sequences were demultiplexed and mapped with a BBtools pipeline and a custom consensus-building script. Full descriptions of how each library was mapped can be found in the Supplementary Information Section 5.

Strains used

All NGB2H experiments were run in TK310³⁹ carrying pSK34. TK310 is a previously published MG1655 derivative with deletions in cpdA, lacY and cyaA, which give it a large linear response range to cAMP. pSK34 contains repressors for both the phlF and Tet promoters to maintain repression of the two-hybrid proteins. CB216 is a NEB5α derivative with pSK34 integrated genomically and was used only for cloning. All plasmids used for basic cloning are listed in Supplementary Data 4 and are available at Addgene. Supplementary File 2 in the Source Data contains all the plasmids used in GenBank format.

NGB2H assay execution

Glycerol stocks of each library were thawed, and 100 µL was grown overnight in 100 mL MOPS EZ Rich Defined Media (Teknova M2105) with kanamycin (Teknova K2125) and carbenicillin (Teknova C2130). For time course studies, a glycerol stock containing a library of constitutive GFP constructs was also thawed, and 100 µL was inoculated into 10 mL of MOPS EZ Rich Defined Media with kanamycin and carbenicillin and grown overnight. The next morning, 1 mL of the GFP library was added to the 100 mL of library culture. After mixing the GFP and experimental libraries, 1 mL of overnight culture was added to a fresh culture of 100 mL MOPS EZ Rich Defined Media with carbenicillin, kanamycin and the inducers for two-hybrid expression: 5 ng/mL anhydrotetracycline, 1.5 µM 2,4-diacetylphloroglucinol and 100 µM IPTG. This was done twice for biological replicates, except where indicated (Supplementary Information Section 6). Flasks were placed in a 37 °C shaker for six hours. Samples were pulled after 6 h and placed in an ice slurry for 15 min to cool quickly, after which cells were spun down for RNA and DNA extraction.

RNA and DNA preparation for barcode sequencing

Samples of RNA were prepared with Qiagen RNeasy kits (Qiagen 74106 or 75144) according to the manufacturer’s instructions, with on-column DNase digestion (Qiagen 79254), and concentrated with the RNeasy MinElute Cleanup kit (Qiagen 74204). RNA was reverse transcribed with Superscript IV (ThermoFisher 18090050) using a modified protocol in which 25 µg of input RNA was used, the extension step ran for 1 h at 55 °C, and 1 µL of RNase A was added in the RNA removal step. Each sample was transcribed with a specific primer, often oSK193 or oSK194, that attached the i7 index and P7 sequencing primer. Samples of DNA were prepared with Qiagen Plasmid Plus Maxi kits (Qiagen 12963) according to the manufacturer’s instructions. RNA samples were verified to contain very low levels of DNA (<1:1000) by qPCR (Kapa Biotechnology KK4601) with oSK199 and oSK200; the amplification was then repeated with a high-fidelity PCR for a low number of cycles to keep samples in the exponential amplification phase. DNA samples were similarly quantified with qPCR and amplified for a low number of cycles to attach P5 and P7 adapters and multiplexing indices. Amplified samples were then quantified on an Agilent Tapestation 2200 with D1000 screentape (Agilent 5067-5582), verified to be monodisperse and mixed in equimolar quantities. Complete details for RNA and DNA preparation can be found in the Supplementary Information Section 7.

Barcode sequencing

Pooled RNA and DNA barcodes from each experiment were sequenced at various cores and startups at UCLA. The CC1 and CCNG1 Libraries were sequenced on a HiSeq 2500, while the CCmax and CC0 Libraries were sequenced on a NextSeq 550. Samples were diluted and mixed with 5–20% PhiX control v3 (Illumina FC-110-3001) and sequenced with oSK326 for read 1 and oSK324 for the index read.

Barcode counting

We used a custom bash script to count barcodes from the barcode sequencing reads. After demultiplexing into reads from RNA or DNA samples, reads were truncated to the 20 bp containing the barcode and unique sequences were counted. Barcode counts were then processed with Starcode (v1.3) to condense barcodes within a Levenshtein distance of one, removing sequencing errors, and tallied again.
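The counting and error-collapsing steps can be illustrated with a small sketch. This is not the authors' bash script and does not reproduce Starcode's clustering algorithm; it is a simplified greedy stand-in that merges each low-count barcode into a more abundant barcode within one edit. It assumes, for illustration only, that the barcode occupies the first 20 bases of each read, and the toy reads are invented.

```python
from collections import Counter

BARCODE_LEN = 20  # barcode length from the text; its position in the read is assumed


def within_one_edit(a, b):
    """Return True if a and b differ by at most one substitution, insertion or deletion."""
    if a == b:
        return True
    if abs(len(a) - len(b)) > 1:
        return False
    if len(a) > len(b):
        a, b = b, a  # make a the shorter (or equal-length) string
    i = 0
    while i < len(a) and a[i] == b[i]:
        i += 1
    if len(a) == len(b):
        return a[i + 1:] == b[i + 1:]  # single substitution at position i
    return a[i:] == b[i + 1:]          # single insertion in b at position i


def collapse_barcodes(counts):
    """Greedily merge each barcode into a more abundant one within one edit."""
    merged = {}
    for barcode, n in sorted(counts.items(), key=lambda kv: (-kv[1], kv[0])):
        parent = next((c for c in merged if within_one_edit(barcode, c)), None)
        if parent is None:
            merged[barcode] = n
        else:
            merged[parent] += n
    return merged


# Invented toy reads: two true barcodes plus one read carrying a sequencing error.
bc_a = "ACGTACGTACGTACGTACGT"
bc_a_err = "ACGTACGTACGTACGTACGA"
bc_b = "TTTTGGGGCCCCAAAATTTT"
reads = [bc_a + "GG"] * 5 + [bc_a_err + "GG"] + [bc_b + "GG"] * 3

raw_counts = Counter(read[:BARCODE_LEN] for read in reads)
collapsed = collapse_barcodes(raw_counts)
print(collapsed)
```

For real data, Starcode itself is the appropriate tool; the sketch only shows why a single erroneous read ends up counted towards its parent barcode.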

Interaction quantification

Barcode count files were imported into R, where they were merged with the mapping file to provide the protein pair identified with each barcode. Barcodes corresponding to the same construct were summarized (dplyr 0.7.4) and total counts of RNA barcodes and DNA barcodes per protein pair were obtained. For our analysis we used interaction scores calculated as \(\text{Interaction score} = \ln\left(\operatorname{median}\left(\frac{\text{RNA reads}}{\text{DNA reads}}\right)\right)\) for barcodes that had >10 reads in all DNA samples. Interactions for all libraries are reported in Supplementary Data 5.
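As a worked illustration of the interaction score, the sketch below applies the formula to invented counts. The authors performed this step in R with dplyr; the Python version here is only a stand-in. It assumes that the median is taken over the per-barcode RNA/DNA ratios of each protein pair, that there is a single DNA sample, and that pairs whose median ratio is zero are skipped because the logarithm is undefined. None of these details is confirmed by the text.

```python
import math
from collections import defaultdict
from statistics import median

MIN_DNA_READS = 10  # barcodes must have more than 10 DNA reads, as stated in the text


def interaction_scores(barcode_rows):
    """Compute ln(median(RNA/DNA)) per protein pair from (pair, rna, dna) rows."""
    ratios = defaultdict(list)
    for pair, rna_reads, dna_reads in barcode_rows:
        if dna_reads > MIN_DNA_READS:
            ratios[pair].append(rna_reads / dna_reads)
    scores = {}
    for pair, values in ratios.items():
        med = median(values)
        if med > 0:  # assumption: skip pairs with a zero median ratio
            scores[pair] = math.log(med)
    return scores


# Invented toy data: one row per barcode.
rows = [
    (("X1", "Y1"), 200, 100),  # ratio 2.0
    (("X1", "Y1"), 90, 30),    # ratio 3.0
    (("X1", "Y1"), 50, 50),    # ratio 1.0
    (("X1", "Y1"), 5, 4),      # dropped: too few DNA reads
    (("X1", "Y2"), 10, 40),    # ratio 0.25
    (("X1", "Y2"), 30, 60),    # ratio 0.5
]

scores = interaction_scores(rows)
for pair, score in scores.items():
    print(pair, round(score, 3))
```

In this toy example the pair (X1, Y1) has a median ratio of 2.0 and therefore a score of ln 2, while (X1, Y2) has a median ratio below 1 and a negative score.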

Orthogonal set identification

Orthogonal sets were identified for the CCNG1 and CCmax Libraries. Briefly, we wrote a script, find_orthogonal_sets_w_MIS.py, that took the interaction scores for each set and built a graph with interactions forming the edges between proteins. Finding the maximum independent set of the line graph of this graph gave us the largest orthogonal set of interactions, listed in Supplementary Data 6. The largest sets for different numbers of on-target interactions are listed in Supplementary Data 7. Supplementary File 1 in the Source Data contains Excel files for all designed sets, the pairing of peptides and a heat map.
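The graph construction can be sketched on a toy example. This is not find_orthogonal_sets_w_MIS.py; it follows only the literal description above. It assumes that an edge is drawn when a score exceeds a hypothetical cutoff (how the authors' script chooses edges and handles the orthogonality gap is not described in this section), and it uses an exhaustive search that is practical only for very small graphs. Protein names and scores are invented.

```python
from itertools import combinations


def line_graph(edges):
    """Adjacency of the line graph: two interactions are adjacent if they share a protein."""
    adjacency = {edge: set() for edge in edges}
    for e1, e2 in combinations(edges, 2):
        if set(e1) & set(e2):
            adjacency[e1].add(e2)
            adjacency[e2].add(e1)
    return adjacency


def maximum_independent_set(adjacency):
    """Exhaustive search from the largest subsets down; toy-sized graphs only."""
    nodes = sorted(adjacency)
    for size in range(len(nodes), 0, -1):
        for subset in combinations(nodes, size):
            if all(b not in adjacency[a] for a, b in combinations(subset, 2)):
                return list(subset)
    return []


# Invented interaction scores; the cutoff is a hypothetical parameter.
toy_scores = {
    ("P1", "P2"): 2.1,
    ("P2", "P3"): 1.4,
    ("P3", "P4"): 1.9,
    ("P1", "P4"): -0.8,
    ("P5", "P5"): 1.2,  # a homodimer is an edge from a protein to itself
}
CUTOFF = 0.0

edges = [pair for pair, score in toy_scores.items() if score > CUTOFF]
orthogonal_pairs = maximum_independent_set(line_graph(edges))
print(orthogonal_pairs)
```

An independent set in the line graph is a set of interactions in which no protein appears twice, which is why the search selects P1:P2, P3:P4 and P5:P5 and leaves out P2:P3 here. For libraries of realistic size, a dedicated maximum independent set or maximum clique solver would be needed in place of the exhaustive search.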

Statistics and reproducibility

We have verified that technical repeats are very well correlated (Fig. 1F), and so sequencing experiments were only done once unless explicitly stated otherwise in the main text. No data were excluded. No statistical method was used to predetermine sample size. Biological replicates were performed for the CC0 Library, and the entire CC0 Library was included in all sequencing experiments as an internal control. The experiments were not randomized. The developed scoring function was tested on novel experimental data. The investigators were not blinded to allocation during experiments and outcome assessment.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

The raw sequencing data generated in this study have been deposited in the Sequence Read Archive under accession code PRJNA737455. The processed data (computational scores, set predictions and processed interaction scores) are available at https://github.com/ajasja/NGB2H and on Zenodo at https://doi.org/10.5281/zenodo.7774717. Plasmids pSK33, 34, 59, 168 and 179 are available in the Addgene repository (https://www.addgene.org/) under accession codes 193731, 193732, 193736, 193737, 193738 and 196340. Source data are provided with this paper.

Code availability

Code needed to design orthogonal sets, score CC pairs, process data and reproduce the figures in the main text is available at https://github.com/ajasja/NGB2H. The exact version of the code used is also available on Zenodo at https://doi.org/10.5281/zenodo.7774717. The iCipa scoring function is also available as a webservice at https://ajasja.github.io/icipa.

References

Vidal, M., Cusick, M. E. & Barabási, A.-L. Interactome networks and human disease. Cell 144, 986–998 (2011).
Huang, P.-S., Boyken, S. E. & Baker, D. The coming of age of de novo protein design. Nature 537, 320–327 (2016).
Ljubetič, A., Gradišar, H. & Jerala, R. Advances in design of protein folds and assemblies. Curr. Opin. Chem. Biol. 40, 65–71 (2017).
Ljubetič, A. et al. Design of coiled-coil protein-origami cages that self-assemble in vitro and in vivo. Nat. Biotechnol. 35, 1094–1101 (2017).
Gradišar, H. et al. Design of a single-chain polypeptide tetrahedron assembled from coiled-coil segments. Nat. Chem. Biol. 9, 362–366 (2013).
Boyken, S. E. et al. De novo design of protein homo-oligomers with modular hydrogen-bond network–mediated specificity. Science 352, 680–687 (2016).
Fallas, J. A. et al. Computational design of self-assembling cyclic protein homo-oligomers. Nat. Chem. 9, 353–360 (2017).
Chen, Z. et al. Programmable design of orthogonal protein heterodimers. Nature 565, 106–111 (2019).
Pauling, L. & Corey, R. B. Compound helical configurations of polypeptide chains: structure of proteins of the α-keratin type. Nature 171, 59–61 (1953).
Crick, F. H. C. The packing of α-helices: simple coiled-coils. Acta Crystallogr 6, 689–697 (1953).
Crick, F. H. C. The fourier transform of a coiled-coil. Acta Crystallogr 6, 685–689 (1953).
Acharya, A., Rishi, V. & Vinson, C. Stability of 100 homo and heterotypic coiled-coil a-a′ pairs for ten amino acids (A, L, I, V, N, K, S, T, E, and R). Biochemistry 45, 11324–11332 (2006).
Potapov, V., Kaplan, J. B. & Keating, A. E. Data-driven prediction and design of bZIP coiled-coil interactions. PLoS Comput Biol. 11, 1–28 (2015).
Mason, J. M., Schmitz, M. A., Müller, K. M. & Arndt, K. M. Semirational design of Jun-Fos coiled coils with increased affinity: universal implications for leucine zipper prediction and design. Proc. Natl Acad. Sci. USA 103, 8989–8994 (2006).
Crooks, R. O., Lathbridge, A., Panek, A. S. & Mason, J. M. Computational prediction and design for creating iteratively larger heterospecific coiled coil sets. Biochemistry 56, 1573–1584 (2017).
Thompson, K. E., Bashor, C. J., Lim, W. A. & Keating, A. E. SYNZIP protein interaction toolbox: in vitro and in vivo specifications of heterospecific coiled-coil interaction domains. ACS Synth. Biol. 1, 118–129 (2012).
Karimova, G., Pidoux, J., Ullmann, A. & Ladant, D. A bacterial two-hybrid system based on a reconstituted signal transduction pathway. Proc. Natl Acad. Sci. USA 95, 5752–5756 (1998).
Fong, J., Keating, A. & Singh, M. Predicting specificity in bZIP coiled-coil protein interactions. Genome Biol. 5, R11 (2004).
Brodnik, A., Palangetić, M., Siladi, D. & Jovičić, V. Construction of orthogonal CC-sets. Inform. (Slovenia) 43, 19–22 (2019).
Drobnak, I., Gradišar, H., Ljubetič, A., Merljak, E. & Jerala, R. Modulation of coiled-coil dimer stability through surface residues while preserving pairing specificity. J. Am. Chem. Soc. 139, 8229–8236 (2017).
Aupič, J. et al. Designed folding pathway of modular coiled-coil-based proteins. Nat. Commun. 12, 940 (2021).
Drobnak, I., Gradišar, H., Ljubetič, A., Merljak, E. & Jerala, R. Modulation of coiled-coil dimer stability through surface residues while preserving pairing specificity. J. Am. Chem. Soc. 139, 8229–8236 (2017).
Krylov, D., Barchi, J. & Vinson, C. Inter-helical interactions in the leucine zipper coiled coil dimer: pH and salt dependence of coupling energy between charged amino acids. J. Mol. Biol. 279, 959–972 (1998).
Yachie, N. et al. Pooled-matrix protein interaction screens using barcode fusion genetics. Mol. Syst. Biol. 12, 863–863 (2016).
Trigg, S. A. et al. CrY2H-seq: a massively multiplexed assay for deep-coverage interactome mapping. Nat. Methods 14, 819–825 (2017).
Yang, J. S. et al. rec-YnH enables simultaneous many-by-many detection of direct protein–protein and protein–RNA interactions. Nat. Commun. 9, 3747 (2018).
Yang, F. et al. Development and application of a recombination-based library versus library highthroughput yeast two-hybrid (RLL-Y2H) screening system. Nucleic Acids Res. 46, 1–12 (2018).
Younger, D., Berger, S., Baker, D. & Klavins, E. High-throughput characterization of protein–protein interactions by reprogramming yeast mating. Proc. Natl Acad. Sci. USA 114, 12166–12171 (2017).
Diss, G. & Lehner, B. The genetic landscape of a physical interaction. Elife 7, 1–31 (2018).
Andrews, S. S. et al. High-resolution protein–protein interaction mapping using all- versus -all sequencing (AVA-Seq). J. Biol. Chem. 294, 11549–11558 (2019).
Plesa, C., Sidore, A. M., Lubock, N. B., Zhang, D. & Kosuri, S. Multiplexed gene synthesis in emulsions for exploring protein functional landscapes. Science 359, 343–347 (2018).
Sidore, A. M., Plesa, C., Samson, J. A., Lubock, N. B. & Kosuri, S. DropSynth 2.0: high-fidelity multiplexed gene synthesis in emulsions. Nucleic Acids Res. 48, e95 (2020).
Harbury, P. B., Zhang, T., Kim, P. S. & Alber, T. A switch between two-, three-, and four-stranded coiled coils in GCN4 leucine zipper mutants. Science 262, 1401–1407 (1993).
Woolfson, D. N. & Alber, T. Predicting oligomerization states of coiled coils. Protein Sci. 4, 1596–1607 (1995).
Chen, Z. et al. De novo design of protein logic gates. Science 368, 78–84 (2020).
Fink, T. et al. Design of fast proteolysis-based signaling and logic circuits in mammalian cells. Nat. Chem. Biol. 15, 115–122 (2019).
Lebar, T., Lainšček, D., Merljak, E., Aupič, J. & Jerala, R. A tunable orthogonal coiled-coil interaction toolbox for engineering mammalian cells. Nat. Chem. Biol. 16, 513–519 (2020).
Kosuri, S. et al. Scalable gene synthesis by selective amplification of DNA pools from high-fidelity microchips. Nat. Biotechnol. 28, 1295–1299 (2010).
Kuhlman, T., Zhang, Z., Saier, M. H. & Hwa, T. Combinatorial transcriptional control of the lactose operon of Escherichia coli. Proc. Natl Acad. Sci. USA 104, 6043–6048 (2007).

Acknowledgements

We thank the members of the Kosuri and Plesa labs for their feedback on the manuscript and figures. We thank Suhua Feng of the UCLA Broad Stem Cell Research Center and the team of the Technology Center for Genomics and Bioinformatics for performing the next-generation sequencing. We thank Octant Inc., the Kruglyak Lab at UCLA and the Black Lab at UCLA for the use of their next-generation sequencers. We thank Mathew Graf and Will Silkworth for their assistance at the UCLA-DOE Biochemistry Shared Instrumentation Facility. We thank Thomas Kuhlman for kindly providing strain TK310. We thank Amy Keating for sharing the Potapov CC scoring scripts. Finally, we thank Chris Voigt for sharing repressor/promoter sequences with us. This work was supported by the following funding sources: The National Institutes of Health (DP2GM114829) to S.K.; Searle Scholars Program to S.K.; ERASynBio (1445112) to S.K. and R.J.; European Union’s Horizon 2020: CC-LEGO 792305 to A.L., ERC project MaCChines (787115) to R.J. and FET Open project Virofight (899619) to R.J.; and Slovenian Research Agency projects: CC-TRIGGER J1-4406 to A.L., P4-0176 to R.J. and J1-9173 to R.J.

Author information

Hwangbeom Kim
Present address: Samsung Biologics, Incheon, Republic of Korea
Nathan Lubock & Sriram Kosuri
Present address: Octant Inc, Emeryville, CA, 94608, USA
Jonathan Lee
Present address: Keck School of Medicine, University of Southern California, Los Angeles, CA, 90033, USA
These authors contributed equally: W. Clifford Boldridge, Ajasja Ljubetič.

Authors and Affiliations

Department of Chemistry and Biochemistry, University of California, Los Angeles, CA, 90095, USA
W. Clifford Boldridge, Hwangbeom Kim, Nathan Lubock & Sriram Kosuri
Department of Synthetic Biology and Immunology, National Institute of Chemistry, 1000, Ljubljana, Slovenia
Ajasja Ljubetič & Roman Jerala
EN-FIST Centre of Excellence, 1000, Ljubljana, Slovenia
Ajasja Ljubetič & Roman Jerala
University of Primorska, 6000, Koper, Slovenia
Dániel Szilágyi & Andrej Brodnik
Department of Chemical and Biomolecular Engineering, University of California, Los Angeles, CA, 90095, USA
Jonathan Lee
UCLA-DOE Institute for Genomics and Proteomics, University of California, Los Angeles, Los Angeles, CA, 90095, USA
Sriram Kosuri
Molecular Biology Institute, University of California, Los Angeles, Los Angeles, CA, 90095, USA
Sriram Kosuri
Institute for Quantitative and Computational Biosciences, University of California, Los Angeles, Los Angeles, CA, 90095, USA
Sriram Kosuri
Eli and Edythe Broad Center of Regenerative Medicine and Stem Cell Research, University of California, Los Angeles, Los Angeles, CA, 90095, USA
Sriram Kosuri
Jonsson Comprehensive Cancer Center, University of California, Los Angeles, Los Angeles, CA, 90095, USA
Sriram Kosuri

Authors

W. Clifford Boldridge
Ajasja Ljubetič
Hwangbeom Kim
Nathan Lubock
Dániel Szilágyi
Jonathan Lee
Andrej Brodnik
Roman Jerala
Sriram Kosuri

Contributions

H.K., W.C.B., N.L. and S.K. designed the NGB2H system. A.L., D.S. and R.J. designed the large sets of coiled-coils. H.K., N.L. and W.C.B. designed the oligonucleotide libraries. H.K., W.C.B. and J.L. performed the experiments. A.L. designed the improved interaction algorithms. W.C.B. and N.L. performed the computational analysis. W.C.B., A.L., R.J. and S.K. analysed the results and iteratively planned the next steps. W.C.B. created the figures. S.K., W.C.B. and A.L. wrote the manuscript, with input from all authors.

Corresponding authors

Correspondence to Ajasja Ljubetič, Roman Jerala or Sriram Kosuri.

Ethics declarations

Competing interests

S.K. is cofounder and CEO and holds equity, N.L. is an employee and holds equity and J.L. was an employee and holds equity in Octant Inc. All other authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Jody Mason and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Description of Additional Supplementary Files

Reporting Summary

Peer Review File

Supplementary Dataset 1 OLS oligonucleotides for gene synthesis of libraries

Supplementary Dataset 2 Sequences of proteins used in libraries

Supplementary Dataset 3 Oligonucleotides for PCR

Supplementary Dataset 4 Plasmids referred to by name

Supplementary Dataset 5 Interaction Scores from all libraries

Supplementary Dataset 6 Largest orthogonal subset for each set in the CCNG1 and CCMax library

Supplementary Dataset 7 Largest orthogonality gaps by number of on-target interactions

Supplementary Dataset 8 Comparisons between libraries

Supplementary Dataset 9 Uncertainty analysis

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Boldridge, W.C., Ljubetič, A., Kim, H. et al. A multiplexed bacterial two-hybrid for rapid characterization of protein–protein interactions and iterative protein design. Nat Commun 14, 4636 (2023). https://doi.org/10.1038/s41467-023-38697-x

Received: 04 February 2021
Accepted: 11 May 2023
Published: 02 August 2023
DOI: https://doi.org/10.1038/s41467-023-38697-x
