Phage-assisted continuous evolution of proteases with altered substrate specificity

Packer, Michael S.; Rees, Holly A.; Liu, David R.

doi:10.1038/s41467-017-01055-9

Download PDF

Article
Open access
Published: 16 October 2017

Phage-assisted continuous evolution of proteases with altered substrate specificity

Michael S. Packer^1,2,3,
Holly A. Rees^1,3 &
David R. Liu^1,3,4

Nature Communications volume 8, Article number: 956 (2017) Cite this article

24k Accesses
77 Citations
27 Altmetric
Metrics details

Subjects

Abstract

Here we perform phage-assisted continuous evolution (PACE) of TEV protease, which canonically cleaves ENLYFQS, to cleave a very different target sequence, HPLVGHM, that is present in human IL-23. A protease emerging from ∼2500 generations of PACE contains 20 non-silent mutations, cleaves human IL-23 at the target peptide bond, and when pre-mixed with IL-23 in primary cultures of murine splenocytes inhibits IL-23-mediated immune signaling. We characterize the substrate specificity of this evolved enzyme, revealing shifted and broadened specificity changes at the six positions in which the target amino acid sequence differed. Mutational dissection and additional protease specificity profiling reveal the molecular basis of some of these changes. This work establishes the capability of changing the substrate specificity of a protease at many positions in a practical time scale and provides a foundation for the development of custom proteases that catalytically alter or destroy target proteins for biotechnological and therapeutic applications.

An amber obligate active site-directed ligand evolution technique for phage display

Article Open access 13 March 2020

Ongoing shuffling of protein fragments diversifies core viral functions linked to interactions with bacterial hosts

Article Open access 28 November 2023

Evolution of protease activation and specificity via alpha-2-macroglobulin-mediated covalent capture

Article Open access 11 February 2023

Introduction

Proteases are ubiquitous regulators of protein function in all domains of life and represent approximately one percent of known protein sequences¹. Substrate-specific proteases have proven useful as research tools and as therapeutics that supplement a natural protease deficiency to treat diseases such as hemophilia, or that simply perform their native functions such as the case of botulinum toxin, which cleaves SNARE proteins and mediates beneficial muscle paralysis in numerous therapeutic and cosmetic applications².

Researchers have engineered or evolved industrial proteases with enhanced thermostability and solvent tolerance^{3, 4}. Similarly, a handful of therapeutic proteases have been engineered with improved kinetics and prolonged activity^5,6,7. The potential of proteases to serve as a broadly useful platform for modulating the activity of a protein of interest, however, is greatly limited by the native substrate scope of known proteases. In contrast to the highly successful generation of monoclonal antibodies with tailor-made binding specificities⁸, the generation of proteases with novel protein cleavage specificities has proven to be a major challenge. For example, efforts to engineer trypsin variants that exhibit the substrate specificity of the closely related protease chymotrypsin were unsuccessful until researchers grafted the entire substrate-binding pocket, multiple surface loops, and additional residues from chymotrypsin^9,10,11,12. This approach of replacing protease residues with amino acids from related proteases to impart specificity features from the latter^{13, 14} has not provided proteases with specificities not already known among natural proteases, prompting researchers to instead turn to laboratory evolution to generate proteases with novel specificities^{15,16,17,18,19}.

Although most reports describe evolution of proteases that cleave single mutant substrates, the evolution of a protease that can degrade a target protein of interest will almost always require changing substrate sequence specificity at more than one position within a substrate motif. Simultaneous specificity changes at two protease sub-sites has previously been accomplished in a stepwise fashion requiring distinct rounds of evolution²⁰. Continuous evolution strategies, which require little or no researcher intervention between generations²¹, therefore may be well-suited to evolve over many generations proteases capable of cleaving a target protein that differs substantially in sequence from the preferred substrate of a wild-type protease. In phage-assisted continuous evolution (PACE), a population of evolving selection phage (SP) is continuously diluted in a fixed-volume vessel (the “lagoon”) by an incoming culture of host Escherichia coli ²². The SP is a modified phage genome in which the evolving gene of interest has replaced gene III, a gene essential for phage infectivity. If the evolving gene of interest possesses the desired activity it will trigger expression of gene III from an accessory plasmid (AP) in the host cell, thus producing infectious progeny encoding active variants of the evolving gene. The mutation rate of the SP is controlled using an arabinose-inducible mutagenesis plasmid (MP) such as MP6, which encodes six mutagenic proteins that disrupt DNA replication proofreading, base excision repair, mismatch repair, and export of damaged nucleobases. Upon full induction, this plasmid increases the mutation rate of the SP by ∼300,000 fold²³. Because the rate of continuous dilution is slower than phage replication but faster than E. coli replication, mutations only accumulate in the SP.

Here we describe the use of PACE to continuously evolve TEV protease, which natively cleaves the consensus substrate sequence ENLYFQS, to cleave a target sequence, HPLVGHM, that differs at six of seven positions from the consensus substrate and is present in an exposed loop of the pro-inflammatory cytokine IL-23. After constructing a pathway of evolutionary stepping stones and performing ~2500 generations of evolution using PACE, a resulting protease contains 20 amino acid substitutions, cleaves human IL-23 at the intended target peptide bond, and blocks the ability of IL-23 to stimulate IL-17 production in a murine splenocyte assay. Together, these results establish a strategy for generating proteases with specificities that have been altered at several substrate positions in order to cleave a target protein of interest.

Results

Choice of target substrate and protease

The biochemistry, substrate specificity²⁴, and structure²⁵ of TEV protease have been extensively studied. Previous directed evolution efforts yielded TEV protease variants with altered specificity at any one of the following positions: P6, P1, or P1ʹ^{15, 16, 18, 19}. Based on these studies and our own experiments in which we evolved TEV protease variants that accept substitutions at P6 or P1, we created an “evolvability” scoring matrix (Supplementary Table 1) that assigns a difficulty score to each of the 20 possible amino acids at each of the seven positions recognized by TEV protease. We used this matrix to rank all possible heptapeptides within all human extracellular proteins. From this ranking, we manually curated a handful of disease-associated proteins in which the target peptide sequences were predicted to be solvent exposed based on their crystal structures (Supplementary Table 2).

The resulting candidate target sequences included HPLVGHM, a peptide found in human IL-23. IL-23 is a pro-inflammatory cytokine secreted by macrophages and dendritic cells in response to pathogens and tissue damage, ultimately promoting an innate immune response at the site of injury or infection. This immune response is mediated by IL-23-dependent stabilization of Th17 cells, a class of T helper cells that produce pro-inflammatory cytokines IL-17, IL-6, and TNFα²⁶. Hyperactivity of this pathway can lead to a variety of autoimmune disorders including psoriasis and rheumatoid arthritis²⁷. Monoclonal antibodies that neutralize IL-23 are FDA-approved drugs for the treatment of psoriasis and show promise in late-stage clinical trials for other autoimmune disorders²⁸. These studies suggest that a protease that catalyzes IL-23 degradation may have anti-inflammatory activity.

The target peptide HPLVGHM differs from the TEV consensus substrate sequence, ENLYFQS, at six of seven positions. Two of these substitutions are predicted to not substantially impact TEV protease activity due to its low specificity at positions P5 and P1ʹ, while the other four substitutions occur at positions that are known to be crucial specificity determinants of wild-type TEV protease (P6 Glu, P3 Tyr, P2 Phe, and P1 Gln). Indeed, substitution of TEV substrate P2 Phe or P1 Gln with the corresponding IL-23 substrate residue (P2 Gly or P1 His) has been shown to reduce TEV protease activity by more than an order of magnitude in each case²⁴. Encouragingly, researchers previously used site-saturation mutagenesis and an elegant yeast display screen to identify TEV mutants that accept P1 His instead of the P1 Gln, demonstrating the evolvability of P1 recognition¹⁹.

Evolution of TEV variants that cleave the IL-23 peptide

PACE requires linking the activity of interest to expression of an essential phage gene (such as gene III) and thus phage survival. We previously established such a linkage for a range of activities including RNA polymerase activity, DNA binding, protein binding, and protein cleavage^{22, 29,30,31,32,33}. To link proteolysis to gene expression we used a protease-activated RNA polymerase (PA-RNAP) consisting of T7 RNA polymerase (T7 RNAP) fused through a protease-cleavable linker to T7 lysozyme, a natural inhibitor of T7 RNAP (Fig. 1)³². In this study the PA-RNAP is expressed from the same host-cell AP that places gene III expression under control of the T7 promoter, while the evolving protease is expressed from the SP (Supplementary Figs. 1 and 2).

We verified that SP expressing TEV protease S219V, a form that does not self-cleave (hereafter referred to as “wild-type”), propagate robustly on host cells expressing a PA-RNAP containing the TEV consensus substrate, ENLYFQS, in the linker connecting T7 RNAP and T7 lysozyme. In contrast, replacing the PA-RNAP linker with the target peptide HPLVGHM results in a failure of phage to propagate and rapid clearance of phage, consistent with the inability of wild-type TEV protease, or TEV variants containing a handful of immediately accessible mutations, to cleave the target IL-23 peptide.

We anticipated that successful evolution of TEV protease variants that cleave the IL-23 target would require multiple evolutionary stepping stones^{20, 29,30,31, 33} to guide evolving gene populations through points in the fitness landscape that bring them successively closer to activity on the final target substrate. We designed three evolutionary trajectories such that substrate changes known to strongly disrupt the activity of wild-type TEV protease, including P6 His, P2 Gly, and P1 His,²⁴ were introduced in the earliest stepping stones (Fig. 2). We confronted these challenging substitutions first while the evolving protease populations had access to variants with wild-type-like levels of activity, reasoning that the likelihood of success was higher while proteases had sufficient activity to exchange for altered specificity without falling below a minimum activity threshold needed to survive selection. We introduced these challenging substrate changes one stepping stone at a time to avoid collapse of the evolving phage population and to illuminate at each stage how mutations within TEV protease altered substrate specificity.

We began all three evolutionary trajectories (Fig. 2) by introducing the P6 His substitution (HNLYFQS) into the PA-RNAP and expressing a site-saturation mutagenesis library of TEV protease from the SP. Using NNK codons, we simultaneously randomized three TEV protease residues N171, N176, and Y178, all of which are proximal to the P6 substrate residue. This first PACE yielded variants with enhanced apparent activity on the HNLYFQS substrate (Supplementary Fig. 3) and genotypes were highly enriched for D127A+S135F+N176I, or I138T+N171D+N176T (Supplementary Table 3). Mutations N171D and N176T have been previously characterized as allowing P6 tolerance for uncharged residues such as threonine and proline¹⁵.

Next we pursued two parallel lines of PACE using either ENLYGQS (trajectories 1 and 2) or HNLYFHS (trajectory 3) as the second stepping-stone substrate. For trajectories 1 and 2, we diversified the population that emerged from PACE on the first stepping stone (HNLYFQS) with NNK codons at all four TEV protease residues (209, 211, 216, and 218) that line the hydrophobic pocket that is occupied by the P2 Phe and performed PACE using host cells expressing ENLYGQS. The resulting population of TEV mutants is typified by the mutations N176I, V209M, W211I, M218F (Supplementary Table 4), which confer apparent cleavage activity on both HNLYFQS and ENLYGQS substrates (Supplementary Fig. 4).

In trajectory 3, we used a mixing strategy to access TEV proteases that could cleave the HNLYFHS stepping-stone double mutant substrate. Unlike PACE experiments initiated from a site-saturation mutagenesis library, a mixing strategy relies on a transitional period of phage propagation on a mixture of two different host cell populations, one expressing an accepted substrate (HNLYFQS) and the other expressing the next stepping-stone substrate (HNLYFHS). Following this transitional period, the SP is propagated exclusively on hosts expressing the next stepping-stone substrate (HNLYFHS). The variants that emerged from this stage of trajectory 3 showed weak apparent activity on the double mutant substrate HNLYFHS (Supplementary Fig. 5), and only a single additional enriched mutation D148A (Supplementary Table 5). Encouragingly, mutation at residue D148 has previously been reported to enable activity on ENLYFHS¹⁹.

Due to the low apparent activity of proteases emerging from this mixing experiment, we relied on the site-saturation mutagenesis strategy to evolve activity on the third stepping stone, HNLYGHS. The TEV protease populations from trajectory 3 were randomized at sites implicated in P2 recognition (209, 211, 216, and 218), while for trajectories 1 and 2 the TEV protease population was simultaneously randomized at four sites (146, 148, 167, and 177) as previously described for the reprogramming of TEV specificity at the P1 position¹⁹ (Fig. 2). The primers used to randomize TEV protease residues 167 and 177 must also encode the identity of intervening amino acids N171 and N176. Although the population appeared to converge on N176I (Supplementary Table 4), we reasoned that it was best to preserve genetic diversity at N176 by constructing one library with primers encoding N176I (trajectory 1) and another with N171D+N176T (trajectory 2). Libraries constructed for all three trajectories were then subjected to PACE on host cells expressing the triple mutant substrate HNLYGHS. The variants emerging at this stage of trajectory 1 and 2 were enriched for mutations at residues 146, 148, and 177, consistent with acceptance of the newly introduced P1 substitution¹⁹. Similarly, clones from trajectory 3 exhibit mutations at residues 209, 211, and 218 that may promote acceptance of the newly added P2 Gly substitution. Regardless of trajectory, all clones emerging at this stage exhibit at least one mutation from each of three targeted mutagenesis libraries (Supplementary Table 6), suggesting that they have evolved activity on the triple mutant substrate.

Given the known tolerance of TEV protease for amino acids at positions P5 and P1ʹ, we speculated that proteases evolved to recognize the triple mutant substrate HNLYGHS might already exhibit activity on the final target substrate (HPLVGHM). Indeed, the populations arising from evolution on the triple mutant substrate successfully propagate in PACE on host cells producing the HPLVGHM substrate, and the resulting variants display weak apparent activity on the final target substrate (Supplementary Fig. 6). In order to evolve high levels of activity on the final target substrate from these weakly active mutants, we applied three strategies to increase selection stringency on all three trajectories: (1) express a lower concentration of the PA-RNAP substrate by using a weaker constitutive promoter (proA instead of proB)³⁴; (2) replace the flexible GGS linker that flanks our substrate with the native and potentially structured amino sequence from IL-23 (human IL-23 residues 38–66); and (3) introduce a mutation in the T7 RNAP portion of the PA-RNAP that decreases transcriptional activity (Q649S)³⁵. We confirmed that all three strategies indeed increased selection stringency (Supplementary Fig. 7).

We first applied the lowered substrate concentration strategy using a mixing experiment to transition from proB to proA expression of the PA-RNAP; this experiment yielded modest changes in genotypes. Exploiting the ease of performing PACE on multiple lagoons in parallel, we implemented the other two strategies simultaneously on all three trajectories. The resulting six populations (trajectories 1a, 1b, 2a, 2b, 3a, and 3b; see Fig. 2) were carried forward into PACE on hosts expressing a PA-RNAP with both the IL-23 (38–66) linker and the attenuated T7 RNAP mutant Q649S. In the final stage of PACE for all six populations, we used a proA promoter to generate less of the PA-RNAP containing the IL-23 (38–66) linker and the Q649S mutation. This series of stringency modulation experiments produced variants with higher levels of apparent activity on the final HPLVGHM substrate (Supplementary Fig. 8).

Characterization of evolved TEV protease variants

Mutations that arise early in long evolutionary trajectories can create a cascade of contingencies because subsequent mutations must be compatible with the preexisting genetic context, a phenomenon known as epistasis. Genotypes suggest that epistasis strongly shaped the outcomes of trajectories 1 and 2, which were dominated by N176I vs. N171D+N176T, respectively, prior to the third stage of PACE. During subsequent evolution, the amino acid identity at amino acid 176 appears to have dictated the optimal identity of residue 177, such that the combinations N176I+N177S, or N176T+N177M, predominate trajectories 1 and 2 respectively. Swapping the identity of N177 between clones from trajectories 1 and 2 results in a substantial loss of activity (Supplementary Fig. 9), further consistent with epistasis at this position. It is likely that these genetic differences between trajectory 1 and 2 also later led to the enrichment of distinct mutations outside of the substrate-binding site (Supplementary Tables 3–11). For example, mutations at position 203 persisted only in trajectory 1, while mutations at positions 28, 30, 68, 132, and 162 were only abundant in trajectory 2.

Unsurprisingly, we observed a dramatically different outcome in the third trajectory, which not only experienced a different schedule of stepping-stone substrates but was also subjected to a mixing experiment instead of NNK mutagenesis at residues 146, 148, 167, and 177. Our data are consistent with a model in which a lack of diversification at these critical residues traps trajectory 3 in a local fitness maximum, evidenced by weak apparent activity on the final target substrate (Supplementary Figs. 6 and 8) and few genotypic changes after the fifth stage of trajectory 3 (Table 1). Consequently, while all six populations yielded TEV variants with apparent activity on the final target, the TEV protease variants that exhibited the highest apparent activity on the final target were all derived from trajectories 1 and 2.

Table 1 Representative evolved TEV protease genotypes

Full size table

Three representative proteases from the end of trajectories 1 and 2 were purified and assayed in vitro for their ability to cleave a model protein substrate in which MBP and GST were fused through a linker containing the final HPLVGHM substrate sequence (Supplementary Fig. 10). All three evolved proteases cleaved the model substrate. We selected the most active clone (TEV L2F from trajectory 2, containing 20 non-silent mutations, Supplementary Table 11) for detailed characterization. We assayed the kinetic parameters of this mutant enzyme on wild-type (ENLYFQS) and target (HPLVGHM) substrate peptides using a previously described HPLC method¹⁹. Unlike the wild-type enzyme, which exhibits no detectable activity on the HPLVGHM peptide, the L2F variant processes this substrate with ∼15% of the catalytic efficiency (k _cat/K _M) with which TEV protease cleaves its native substrate (Table 2 and Supplementary Fig. 11). Compared to wild-type TEV, evolved TEV L2F appears to have slightly improved kinetics on the canonical ENLYFQS substrate, possibly due to broadly activating mutations that increase k _cat, while experiencing only a modest fivefold increased K _m on the target substrate HPLVGHM. These results collectively indicate that PACE generated a mutant protease that cleaves a target substrate containing mutations at six positions with only modestly lower efficiency than wild-type TEV protease cleaves its consensus substrate.

Table 2 Kinetic parameters of wild-type and evolved TEV proteases

Full size table

Substrate specificity profiling of an evolved TEV protease

Proteolysis assays on individual substrates reveal that evolved TEV protease L2F maintains the ability to detectably cleave starting and intermediate substrates while acquiring activity on the final IL-23 target (Supplementary Fig. 12). A more comprehensive understanding of the substrate specificity of this evolved enzyme requires an unbiased protease specificity profile generated from a large number of substrate variants. To obtain such a profile, we applied a previously reported phage substrate display method (Fig. 3a)^36,37,38. M13 bacteriophage encoding pIII fused to a FLAG-tag through a library of substrate linkers were immobilized on anti-FLAG magnetic beads. When incubated with a protease of interest, phage encoding cleaved substrates are liberated from the solid support, while phage encoding the intact substrates remain immobilized and are eluted with excess FLAG peptide. The abundance of each substrate in the cleaved and eluted populations was measured by high-throughput DNA sequencing, yielding enrichment values (Supplementary Table 12) and sequence logos (Fig. 3b–e) that convey protease substrate specificity across all possible amino acids³⁹.

We applied this specificity profiling technique to seven separate libraries, each containing a single randomized position within the canonical ENLYFQS substrate. These libraries are inherently biased by the identities of the residues that are held constant, but because of their small theoretical diversity, they are easy to construct and a single round of selection yielded robust enrichment values. We validated this method by enrichment of the consensus motif EXLYFQS (where X = any amino acid) for wild-type TEV protease (Fig. 3b). We also applied this substrate specificity profiling method to more complex libraries containing sets of three consecutive randomized amino acids within either the ENLYFQS or HPLVGHM substrate. The resulting specificity profiles from these larger libraries (Supplementary Fig. 13) did not substantially differ from the results of the single-site libraries. However, the identity of the constant residues (ENLYFQS vs. HPLVGHM) did alter the resulting specificity profiles of TEV L2F; the specificity of TEV L2F is more pronounced for P1 His, the target residue, and P6 Glu, the wild-type residue, in the context of HPLVGHM libraries (Supplementary Fig. 13 and corresponding enrichment values in Supplementary Table 13).

When we compare the specificity profile of evolved TEV L2F (Fig. 3c) to that of wild-type TEV, a number of differences are apparent: TEV L2F shows a shifting of P3 specificity towards aliphatic residues Ile and Val, a shifting of P1 specificity towards His, a shifting of P1’ specificity towards aliphatic amino acids Ala, Ile, and Met, and a broadening of specificity at P6. These changes are consistent with evolutionary pressure to cleave the target substrate HPLVGHM. Although the evolved L2F protease recognizes a shortened motif due to loss of P6 specificity, it retains the ability to reject the substantial majority of amino acids at each of the five others positions used by TEV. The overall specificity of the evolved L2F protease lies well within the range of specificities exhibited by natural proteases such as caspases, granzymes, clotting factors, and MMPs, which typically specify strongly only one or two positions and accept mixtures of several to many amino acids at other positions, yet retain sufficient overall specificity to mediate physiological signaling roles^40,41,42,43. Nonetheless, the use of evolved proteases in complex biological environments may require a proteome-wide evaluation of off-target substrates to identify potential undesired effects.

Functionally independent groups of TEV mutations

To illuminate the molecular basis of the evolved changes in substrate specificity, we generated TEV mutants containing small subsets of mutations and profiled their substrate specificities using substrate libraries in which a single residue of the ENLYFQS substrate was randomized. A number of mutations were predicted to influence solubility and stability based on previous reports^44,45,46 or their distance from the substrate in the crystal structure²⁵. We constructed various combinations of the predicted solubility mutations (T17S, N68D, E107D, D127A, F132L, S135F, F162S, R203Q, K215E, and K229E) as well as mutants that putatively influence specificity at P1 (T146S, D148P, S153N, S170A, and N177M), P6 (I138T, N171D, and N176T), and P2 (V209M, W211I, and M218F) based on the emergence of these mutations during PACE.

All of the tested combinations of mutations resulted in proteases that retained activity to varying degrees (Supplementary Fig. 14), despite being taken out of their PACE-evolved contexts. As expected, the solubility-enhancing mutants exhibited no significant change in specificity (Supplementary Fig. 15). The P2 variant also did not display any substantial specificity changes, consistent with the lack of a strong change in P2 specificity in the TEV L2F specificity profile (Supplementary Fig. 15).

Mutations in the P6 variant (I138T, N171D, N176T) are sufficient to confer loss of glutamate specificity at P6 with no other obvious changes to substrate preferences (Fig. 3d), suggesting some degree of modularity in protease–substrate interactions. Conversely, the P1 variant (T146S, D148P, S153N, S170A, and N177M) not only exhibits broadened specificity at the P1 site, but also shows a concurrent increased affinity for P3 aliphatic side chains (Fig. 3e). The mutations within these two variants appear to be responsible for the three largest differences in substrate specificity between wild-type TEV and TEV L2F.

Evolved TEV L2F cleaves human IL-23

Next we tested the ability of the evolved TEV L2F protease to cleave full-length human IL-23 protein. In its active form, IL-23 is a heterodimer between the IL-12p40 subunit and the IL-23p19 subunit. We incubated TEV L2F with IL-23 in either its heterodimeric or monomeric p19 state, and observed by Western blot the formation of a single cleavage product for the IL-23 heterodimer, and in the presence of excess protease, two cleavage products for the monomeric IL-23p19 substrate (Supplementary Fig. 16).

IL-23 digestion reactions were subjected to LC–MS to identify the cleavage products. The heterodimer cleavage reaction generated a single protein product of mass 3598 Da less than the starting material, matching the fragment liberated by cleavage of the target peptide bond at the HPLVGH//M sequence (Supplementary Fig. 17). Data from the monomer cleavage reaction in the presence of a 1.5-fold excess of TEV L2F revealed two new product masses corresponding to a single cleavage at the target site (HPLVGH//M), as well as proteolysis at both the target site (HPLVGH//M) and an additional secondary site (ARVFAH//G) that is also consistent with the L2F specificity profile shown in Fig. 3c (Supplementary Fig. 18). The absence of an ion corresponding to IL-23 cleaved at only the secondary site suggests that the target site is kinetically favored by TEV L2F. The secondary site was only cleaved in the monomeric substrate and not the heterodimer presumably because it is occluded by the IL-12p40 subunit in the heterodimer structure (Supplementary Fig. 19)⁴⁷.

Evolved TEV deactivates IL-23 and prevents IL-17 secretion

Finally we tested the ability of the evolved TEV L2F protease to abrogate the biological activity of IL-23. We used a previously described IL-23 activity assay with primary isolates of mouse mononuclear splenocytes⁴⁸. When cultured in the presence of IL-2 and IL-23, Th17 cells are stabilized and secrete IL-17 into the media supernatant, which is quantified by ELISA. We observed a dose-dependent attenuation of IL-17 production when IL-23 was pre-incubated with TEV L2F (Fig. 4). These samples were also visualized by western blot demonstrating that the p40 subunit is unaffected by incubation with protease, and that inhibition of IL-17 production is causally linked to IL-23p19 cleavage (Supplementary Fig. 20). A sub-stoichiometric dose of L2F protease (0.40 equivalents) resulted in the loss of nearly all IL-23-induced IL-17 secretion, consistent with TEV L2F catalyzing cleavage of IL-23 following multiple turnover Michaelis–Menten kinetic behavior that we observed in previous experiments using synthetic peptide substrates (Table 2 and Supplementary Figs. 21 and 22). Direct addition of TEV L2F to splenocyte cultures in serum-containing media supplemented with IL-23 did not attenuate IL-17 secretion (Supplementary Fig. 23). The presence of an equivalent concentration of serum did not inhibit cleavage in vitro (Supplementary Fig. 24), suggesting that TEV protease activity could be diminished in the oxidizing cell culture supernatant. Alternatively, it is possible the protease is sequestered by other secreted factors or cell-surface proteins within the complex culture media, or that IL-23 binding to IL-23R may occur faster than IL-23 proteolysis.

Discussion

The generation of on-demand biochemical catalysts has been a longstanding interest of the scientific community⁴⁹. Previous efforts to evolve protease specificity have been successful at altering the substrate specificity of model proteases by one or two amino acids^{15,16,17,18,19,20}. By using PACE to conduct ∼2500 total generations of evolution in three diverging evolutionary trajectories, evolutionary stepping stones to guide populations through long evolutionary trajectories, and both targeted and elevated random mutagenesis, we evolved TEV protease variants that cleave a substrate dramatically different from the wild-type substrate with only a modest decrease in kinetic parameters compared with wild-type TEV on its consensus substrate. This work demonstrates for the first time that a protease can be reprogrammed through laboratory evolution to cleave and deactivate a target protein containing a very different substrate sequence than is recognized by the wild-type protease. Our approach also exemplifies a target selection strategy that integrates knowledge from known positional tolerances of a wild-type enzyme with results from previous evolution efforts. The above data also demonstrate that evolved L2F protease has sufficient specificity to avoid degrading itself, the numerous proteins required for the PACE selection, MBP, GST, or essential protein components of the IL-23 assay beyond IL-23. The ability of the L2F protease to cleave IL-23 is also not inhibited by the presence of 10% fetal bovine serum, which contains high concentrations of other proteins. The specificity of the evolved L2F protease overall resembles that of many natural proteases, which like L2F reject the majority of possible amino acids but accept mixtures of others at each recognized substrate position^40,41,42,43.

While these advances suggest the utility of evolved proteases for many research and industrial applications, they face additional challenges for therapeutic applications. The evolution of proteases under many generations of positive selection alone can result in proteases that accept the target substrate but do not reject the wild-type or intermediate substrates. In cases in which it is desirable to reject cleavage of certain non-target substrates, the application of a PACE negative selection strategy to apply selection pressure against non-target cleavage may be useful³⁰. Moreover, any circulating foreign protein therapeutic if administered repeatedly poses a substantial immunogenicity risk. Consequently, ideal starting points for therapeutic protease evolution may be circulating human proteases, such as the use of human kallikrein 7 in a pioneering study that sculpts its specificity to more selectively proteolyze human amyloid beta⁵⁰. The PACE of a human protease will present a significant but tractable technical hurdle, producing enzymes containing disulfide bonds in active forms in the E. coli cytosol^51,52,53,54.

These challenges notwithstanding, this study represents a foundation for the directed evolution of proteases with highly altered specificities. We demonstrate the feasibility of catalytic inactivation of a target protein with an evolved protease, which in some cases may offer substantial benefits over stoichiometric binding of a neutralizing antibody. In addition to potency advantages, we anticipate that evolved proteases may also enable research and therapeutic applications that are unavailable to antibodies such as proteolysis-induced gain-of-function and proteolysis-mediated alteration of a protein’s import, export, subcellular localization, half-life, or post-translational modification state.

Methods

Ranking of target sites within extracellular proteins

A list of human extracellular and transmembrane proteins with their corresponding amino acid sequences were tabulated using the ProteinData functionality in Mathematica 10. This data was transferred into MATLAB for further processing by a customizable script that performed the following operations (Supplementary Note 1). A rating matrix that is seven positions wide (for the seven sites within the TEV protease recognition motif) by 20 long (for each possible amino acid) was manually populated with subjective “evolvability” integer ratings. Each protein was converted into a binary sparse matrix with as many rows as the length of the protein sequence and 20 columns one for each amino acid. For each protein matrix, seven rows at a time were multiplied by the “evolvability” matrix, with the trace of the resulting 7 × 7 product matrix providing a score for the heptapeptide. For each extracellular protein the best score and the corresponding peptide and starting-residue index were saved. Once all protein sequences had been processed, we sorted the protein names along with their best-match candidate substrate sequences by score.

Expression vectors and phage libraries

All primers were designed to perform USER cloning⁵⁵ and ordered from Integrated DNA Technologies (IDT). For the cloning of phage libraries, NNK codons were generating using hand-mixed phosphoramidite ratios to provide uniform incorporation rates (Supplementary Table 15). All PCR reactions were performed using Phusion U Hot Start polymerase (Thermo-Fisher).

For the assembly of APs and expression vectors, PCR products were purified using EconoSpin columns (Epoch Life Sciences) and assembled with DpnI and USER enzyme in CutSmart Buffer (New England BioLabs). Following assembly, plasmids were transformed into NEB Turbo Competent E. coli cells (New England BioLabs).

For the assembly of phage libraries, PCR products were purified by gel electrophoresis and extracted using the MinElute kit by Qiagen. Following an assembly reaction identical to that of the AP, the USER reaction was desalted using the MinElute PCR purification kit (Qiagen) prior to electroporation into competent E. coli S1059²² (for SP libraries) or NEB Turbo Electrocompetent E. coli cells²² (for substrate display phage libraries). Phage libraries were grown overnight in 2×YT and filtered through sterile 0.22 μm membranes to eliminate host cells. The titers of phage libraries were evaluated by plaque assay using strain S1059 as hosts. Briefly, phage were prepared in four 50-fold serial dilutions of 50 µl. To each dilution was added 100 µl of fresh host cell culture at approximately OD₆₀₀ = 1.0 followed by addition of 900 µl top agar (2×YT, 6 g L⁻¹ agar). The mixture was mixed by pipet, then transferred to a quarter-plate prepared with a thin layer of bottom agar (2×YT, 16 g L⁻¹ agar). Plaque assays were incubated overnight at 37 °C.

In order to assess library quality, 12 clones were sequenced to confirm diversity at the targeted amino acid positions. Briefly, individual plaques were picked with a pipet tip in order to provide template material (SP-infected E. coli) for rolling circle amplification (TempliPhi, GE Healthcare). Sanger DNA sequencing was performed using primer BCD1136 (Supplementary Table 15), and results were aligned and tabulated using SeqMan (DNAStar).

PACE experiments

PACE experiments were performed as follows: ^{22, 29,30,31,32,33}. Briefly, E. coli strain S1030 was co-transformed by electroporation with a mutagenesis plasmid (MP6)²³ and an AP (plasmids are described in Supplementary Table 14 and detailed in Supplementary Figs. 1 and 2). Chemostats containing 80 ml of Davis Rich Media with 22.5 μg ml⁻¹ carbenicillin and 15 μg ml⁻¹ chloramphenicol were inoculated with overnight starter cultures and grown at 37 °C while mixing at 250 rpm via a magnetic stir bar. Once the chemostat grew to ∼OD₆₀₀ = 1.0, we began dilution with fresh media at a rate of 80–100 ml h⁻¹, with the waste needle set at a height of 80 ml. At the same time, we began the flow of chemostat culture at ∼10–20 ml h⁻¹ into a lagoon with a waste needle set at a height of 15 ml. The total flow rate through each lagoon was set based upon the difficulty of a given experiment, with slower dilution being used for more challenging evolutions. For the full duration of the experiment 10% w/v arabinose solution was syringe-pumped into the lagoons at a rate of 0.5–1.0 ml h⁻¹.

Experiments starting with an NNK mutagenized SP library initiated with a lagoon inoculum of 1–2 ml of phage library containing 10⁸–10¹⁰ pfu ml⁻¹. For all other experiments, lagoons were inoculated with 50–100 μl of filtered phage population from the last time point of the previous PACE experiment. In PACE experiments using mixtures of host cell cultures (see the main text), lagoons received an influx of cell culture from two separate chemostats containing hosts bearing two different APs (combined rate of 10–20 ml h⁻¹) for a period of 24–48 h.

Phage samples were collected from lagoon waste outflow lines at 24 h intervals and passed through a 0.22 µm sterile filter to remove host cells. The titers of phage samples were evaluated by plaque assay using strain S1059 as hosts. At the end of each PACE experiment, eight individual plaques were picked with a pipet tip in order to provide template material (SP-infected E. coli) for rolling circle amplification (TempliPhi, GE Healthcare). The same pipet tip was subsequently transferred to a 96-deep well culture plate containing 2×YT media for growth overnight at 37 °C. After a PACE experiment, enriched mutations should be present within multiple clones of this small sample of eight population members. Sanger DNA sequencing was performed using primer BCD1136 (Supplementary Table 15), and results were aligned and tabulated using SeqMan (DNAStar).

Luminescence assays of evolved clones

Clones chosen for characterization were sterile-filtered from the corresponding position within the 96-well culture plate. Saturated overnight cultures of S1030 cells containing a substrate AP were used to initiate luciferase assays in 96-well culture plates. Approximate volumes were 500 µl 2×YT, 50 µl overnight starter culture, and 10 µl filtered phage samples. All assays included a negative control (no phage), a positive control (SP encoding T7 RNAP), and wild-type TEV SP as a reference. Experimental and control conditions were performed in triplicate. After 3–5 h of growth in a 37 °C shaker, 100 µl was transferred to a clear-bottom assay plate to measure OD₆₀₀ and luminescence on a Tecan Infinite Pro Plate Reader. Measurements were analyzed as OD-normalized values and as luminescence fold-change over the negative control.

Purification of TEV proteases and fusion protein substrates

TEV protease was purified as previously described^{19, 44,45,46}, but with minor modifications. Briefly, OneShot BL21 Star (DE3) chemically competent cells (Invitrogen) were transformed with expression vectors encoding MBP fused through a TEV cleavage site to a 6×His-tagged TEV protease. A total of 5 ml of saturated overnight starter culture was added to 1–2 l of LB+kanamycin (40 µg ml⁻¹), and grown at 37 °C until OD₆₀₀ = ~0.7. Expression was induced with 1 mM IPTG for 4 h at 30 °C, and cells were harvested by centrifugation at 6000 × g for 5 min. The pellet was resuspended in 15–25 ml binding buffer (10% glycerol, 50 mM Tris pH 8.0, 1.0 M NaCl, 1 mM DTT, and 20 mM imidazole) with a Roche Complete EDTA-free protease inhibitor tablet (note that TEV protease is unaffected by conventional protease inhibitors). Cells were lysed by sonication for 4 min with a 1-s on, 1-s off cycle at medium power. Lysate was clarified by centrifugation at 18,000 × g for 20 min. Clarified lysate was incubated with 1–2 ml TALON metal affinity resin (Clontech) for 1 h mixing end-over-end at 4 °C. Resin was pelleted at 700 × g for 5 min, and resuspended with 10 ml of binding buffer to load onto a gravity flow column. Resin was washed with 10 column volumes of binding buffer, followed by 2 column volumes of bind buffer with imidazole supplemented to 50 mM. TEV protease was eluted with 4 column volumes of elution buffer (10% glycerol, 50 mM Tris pH 8.0, 0.1 M NaCl, 1 mM DTT, and 250 mM imidazole). The purity of fractions was assessed by SDS-PAGE using precast Bolt 4–12% Bis–Tris gels (ThermoFisher), and TEV containing fractions were pooled and concentrated to <250 µl using an Amicon Ultra Centrifugal filter with a 10 kDa molecular weight cut-off (EMD-Millipore). The concentrated sample was further purified to >95% using a SuperDex 200 Increase 10/300 column (GE Healthcare) running with storage buffer (20% glycerol, 50 mM Tris pH 8.0, 0.1 M NaCl, 1 mM DTT). Proteases used in mammalian cell culture were further subjected to endotoxin removal resin (Pierce), followed by assaying with an LAL endotoxin quantification kit (Pierce). Protein concentrations were determined by Bradford Assay (ThermoFisher) and aliquots were frozen in liquid nitrogen for storage at −80 °C.

MBP–GST test substrates were expressed, purified, and stored exactly as described above for TEV, except for the following changes. Expression was induced with 1 mM IPTG for 16 h at 20 °C, and binding buffer was 50 mM Tris pH 8.0, 0.5 M NaCl, supplemented with a Roche Complete EDTA-free protease inhibitor tablet. After sonication and centrifugation, clarified lysate was incubated with 1 ml glutathione-linked sepharose (Clontech) for 1 h mixing end-over-end at 4 °C. Loaded resin was washed with 40 column volumes of binding buffer, followed by 4 column volumes of elution buffer (50 mM Tris pH 8.0, 100 mM NaCl, 10 mM glutathione). Samples were >95% pure as assessed by SDS-PAGE and were dialyzed against storage buffer (20% glycerol, 50 mM Tris pH 8.0, 0.1 M NaCl, 1 mM DTT) using Slide-A-Lyzer Cassettes with a 10-kDa molecular weight cut-off (ThermoFisher).

Assaying proteolysis of fusion protein substrates

Protease assays consisted of 5 µg of MBP–GST substrate and 1 µg of wild-type or evolved TEV protease incubated for 3 h at 30 °C in storage buffer (20% glycerol, 50 mM Tris pH 8.0, 0.1 M NaCl, 1 mM DTT) supplemented with freshly prepared DTT to a final concentration of 2 mM. Reactions were analyzed by SDS-PAGE and visualized with Coomassie stain.

HPLC kinetics assay

Protease kinetics were determined as previously described¹⁹, but with minor adjustments. Briefly, synthetic peptide substrates (THPLVGHMGTRRW-dinitrophenol-lysine and TENLYFQSGTRRW-dinitrophenol-lysine) and synthetic standards for cleaved products (MGTRRW-dinitrophenol-lysine and SGTRRW-dinitrophenol-lysine) were ordered from Genscript. Dinitrophenol moieities provided strong absorbance at 355 nm for more accurate quantification. Reactions and standards were analyzed by HPLC on a C18 reverse-phase column (Kinetex 5μ C18 100A, Phenomenex) using an acetonitrile gradient from 5 to 50%. Standard curves were constructed for both products (MGTRRW-dinitrophenol-lysine and SGTRRW-dinitrophenol-lysine) to enable quantification of reaction progress.

Reactions were carried out with 0.05–0.1 μM protease and 50 μM to 2 mM substrate. Proteases (in storage buffer plus 1 mM freshly prepared DTT) and substrates (in sterile water) were prepared as solutions at 2× concentration (50 µl each) then combined to yield a total reaction volume of 100 µl. Reactions were incubated at 30 °C for 10 min and quenched with 25 µl of 5% TFA. After quenching, protease was eliminated from samples using an Amicon Ultra Centrifugal filter with a 10-kDa cut-off (EMD-Millipore). Prior to conducting reactions in triplicate, all conditions were tested and monitored at 5, 10, 30 min to ensure that 10 min was within the linear range of the reaction (<25% substrate consumption). Peak integrations were tabulated, converted into product concentrations using the standard curves, and fit to the Michaelis–Menten kinetics model using Prism GraphPad.

Protease specificity profiling

For each combination of library and protease, 60 µl of a 50% suspension of Anti-FLAG M2 Magnetic beads (Sigma) was transferred into a 1.5-ml eppendorf tube. For all subsequent manipulations, a magnetic plate was used to separate beads and allow aspiration of the supernatant. After washing with 1 ml of TBS (20 mM Tris pH 7.0, 150 mM NaCl), beads were incubated with 30–100 µl of substrate phage libraries (titers ranged from 10⁸ to 10¹⁰ pfu ml⁻¹) in 1 ml of TBS at room temperature for 2 h rotating end-over-end. After initial binding, the supernatant was discarded and beads were washed with 1 ml of TBS. Beads with bound substrate phage were incubated in 0.5 ml TBS containing 0.5 µM protease for 2 h. Supernatant containing cleaved substrate phage was recovered, and the beads were again washed with 1 ml of TBS. The remaining bound uncleaved substrate phage was eluted in 0.1 ml of TBS containing 0.1 mg ml⁻¹ FLAG peptide (Sigma).

For substrate libraries containing a single randomized amino acid position, a single round of selection was sufficient. For substrate libraries containing windows of three randomized amino acids, a second round of selection was necessary to detect enrichment. In these cases, the round 1 cleaved substrate phage were expanded in overnight cultures consisting of 100 µl cleaved substrate phage and 900 µl S1030 culture (diluted to OD₆₀₀ = ~0.1). Following outgrowth, cultures were centrifuged to pellet E. coli prior to aspirating the supernatant phage to be used in a second round of phage substrate display as previously described. Due to expansion biases during outgrowth, these specificity profiles were only interpreted after normalizing to the second round elution of the no-protease control experiments.

High-throughput sequencing and data analysis

Samples were amplified by PCR using Q5 Hot Start 2× Master Mix (NEB) with 1 µl of template phage sample and primers (MSP819 or MSP820, and MSP824, see Supplementary Table 15). Illumina barcodes were added in a second PCR reaction using 1 µl of the first round PCR material as template. Samples were pooled and purified by gel electrophoresis using a MinElute Gel Extraction kit (Qiagen). The concentration of the pooled library was measured by Quant-iT PicoGreen dsDNA Assay (ThermoFisher) and diluted to ∼4 nM. This concentration was further adjusted based on qPCR quantification (Kapa Biosystems). Samples were loaded onto an Illumina MiSeq using a v2 50-cycle kit set up to run a single-direction read of 50 nucleotides.

Data was automatically demultiplexed by MiSeq Reporter software and the resulting fastq files were processed by a custom Python script (Supplementary Note 2). This script searches each sequencing read for a perfect match to sequences flanking both sides of the proteolysis site. If the proteolysis site in between these matching flanks is exactly 21 nucleotides, then the proteolysis site is translated to a seven amino acid sequence. Sequences were disregarded in subsequent analysis if they contained a stop codon, were template material used in library cloning (HNLYGHS), or were the FLAG tag sequence (YKDDDDK) due to spontaneous genetic deletion of the proteolysis site. The list of proteolysis sites was tabulated into a position-specific amino acid frequency table. For each library-protease combination, enrichment values were calculated as freq_cleaved/freq_elution-1. For each protease, specificity data from the randomized position within single-site libraries was combined into a single table and converted into a sequence logo using a the Seq2Logo webserver³⁹. For libraries containing windows of three randomized positions, sequence logos for each protease-library combination were separately generated.

Western blot visualization of recombinant IL-23 cleavage

IL-23 was purchased as a Myc-tagged IL-23p19 monomer (TP309680 Origene) and as a heterodimer of IL-23p19 and IL12p40 (PHC9321 ThermoFisher). A total of 5 µg of heterodimer or 0.44 µg of monomer was incubated with 5 µg of TEV L2F for 3 h at 30 °C in storage buffer (20% glycerol, 50 mM Tris pH 8.0, 0.1 M NaCl, 1 mM DTT) supplemented with freshly prepared DTT to a final concentration of 2 mM. Samples were electrophoresed on precast Bolt 4–12% Bis–Tris gels (ThermoFisher), then transferred to a PVDF membrane using the iBlot 2 Dry Blotting System (ThermoFisher). Membranes were incubated at room temperature for 30 min in Odyssey Blocking Buffer (LiCor). Primary antibody (IL-23 Antibody (26H20L23), ABfinity Rabbit Monoclonal, ThermoFisher) was added to blocking buffer in 1:1000 dilution, and the membrane was incubated on a rocker at 4 °C overnight. After three washes with TBST (20 mM Tris pH 7.0, 150 mM NaCl, 0.1% Tween 20), the membrane was incubated 1 h at room temperature in blocking buffer containing a 1:1000 dilution secondary antibody (IRDye 800CW Donkey anti-Rabbit IgG, LiCor). After three more washes with TBST, the membrane was scanned using an Odyssey Imaging System.

LC–MS identification of cleavage sites

IL-23p19 and the IL-23 heterodimer (ThermoFisher) were reduced with 10 mM DTT to identify intact masses of unreacted IL-23p19 subunits. IL-23 substrates were incubated in a manner similar to western blots for 3 h at 30 °C using 2–10 µg of substrate and 4 µg of TEV L2F. All samples were analyzed using an Agilent LC–MS 6220 (ESI-TOF) equipped with an Agilent PLRP-S column. A standard protein LC method was used containing a 15-min reverse-phase gradient (0.1% formic acid in water, MeCN 0.1% formic acid).

IL-23-induced IL-17 production in mouse splenocytes

The following protocol was adapted from previous work⁴⁸ as follows: two male mice (C57BL/6J) were euthanized and dissected to isolate spleens, under a protocol approved by the Broad Institute Institutional Animal Care and Use Committee. Spleens were pulverized into 10 ml of cell culture media (DMEM, Glutamax, high-glucose, penicillin, streptomycin, 10% Fetal Bovine Serum, FBS, ThermoFisher) through a 100 µm nylon mesh Falcon Cell Strainer (Corning). Cell suspensions were centrifuged for 3 min at 700 g, and the supernatant was discarded. The pellet was resuspended in 1 ml ACK lysis buffer (Gibco/ThermoFisher). After 5 min, lysis was stopped with the addition of 9 ml of DMEM, and cells were pelleted by centrifugation at 700 × g for 3 min. If the pellet was red due to remaining red blood cells, ACK lysis was repeated. Otherwise if the pellet was white, lysis was complete and cells were resuspended in 4 ml of DMEM. Cell density was quantified using a Scepter 2.0 Handheld Automated Cell Counter (Millipore). Cultures were diluted to 2 × 10⁶ cells per ml in cell culture media supplemented with 100 units per ml recombinant human IL-2 (Roche). The outer perimeter wells of a 96-well round bottom culture plate were filled with 100 µl of cell-free media to prevent evaporative loss in central wells containing cell cultures. The central wells were prepared in triplicate filled first with 125 µl of culture followed by 25 µl of additives (see below). Cell culture supernatant was sampled after two days of growth, and 10 µl was used to perform a mouse IL-17 ELISA (R&D Systems).

Additives containing IL-23 and varying doses of protease or neutralizing antibody (MAB1510, R&D Systems) were prepared in cell culture media immediately prior to mixing with splenocytes. Additives containing doses of proteases were also prepared as pre-incubated samples at 300× final concentration. Incubation was performed at 4 °C for 16 h in storage buffer (20% glycerol, 50 mM Tris pH 8.0, 0.1 M NaCl, 1 mM DTT) supplemented with 2.5 mg ml⁻¹ of BSA carrier-protein to enhance stability during incubation. These pre-incubated samples were prepared at high concentration to confirm cleavage efficiency by western blot as described above. However, this western blot was conducted using two primary antibodies Anti-IL-12p40 (ab62822 Abcam)/Anti-IL-23p19 (sc271279 Santa Cruz) (dilutions were 1:500 and 1:100 respectively) and two secondary antibodies IRDye800CW Donkey anti-Mouse/IRDye 680RD Donkey anti-Goat (LiCor) (dilutions were 1:2000 each).

Data availability

High-throughput sequencing data will be made available via the Sequence Read Archive under BioProject number PRJNA397152. DNA sequences and constructs are available from Addgene. Complete mutational tables are provided in Supplementary Information.

References

Rawlings, N. D., Barrett, A. J. & Finn, R. Twenty years of the MEROPS database of proteolytic enzymes, their substrates and inhibitors. Nucleic Acids Res. 44, D343–D350 (2016).
Article CAS PubMed Google Scholar
Craik, C. S., Page, M. J. & Madison, E. L. Proteases as therapeutics. Biochem. J. 435, 1–16 (2011).
Article CAS PubMed PubMed Central Google Scholar
Zhao, H. M. & Arnold, F. H. Directed evolution converts subtilisin E into a functional equivalent of thermitase. Protein Eng. 12, 47–53 (1999).
Article CAS PubMed Google Scholar
You, L. & Arnold, F. H. Directed evolution of subtilisin E in Bacillus subtilis to enhance total activity in aqueous dimethylformamide. Protein Eng. 9, 77–83 (1996).
Article CAS PubMed Google Scholar
Persson, E., Kjalke, M. & Olsen, O. H. Rational design of coagulation factor VIIa variants with substantially increased intrinsic activity. Proc. Natl Acad. Sci. USA 98, 13583–13588 (2001).
Article ADS CAS PubMed PubMed Central Google Scholar
Madison, E. L., Goldsmith, E. J., Gerard, R. D., Gething, M. J. & Sambrook, J. F. Serpin-resistant mutants of human tissue-type plasminogen activator. Nature 339, 721–724 (1989).
Article ADS CAS PubMed Google Scholar
Allen, G. A. et al. A variant of recombinant factor VIIa with enhanced procoagulant and antifibrinolytic activities in an in vitro model of hemophilia. Arterioscl. Thromb. Vasc. 27, 683–689 (2007).
Article CAS Google Scholar
Beck, A., Wurch, T., Bailly, C. & Corvaia, N. Strategies and challenges for the next generation of therapeutic antibodies. Nat. Rev. Immunol. 10, 345–352 (2010).
Article CAS PubMed Google Scholar
Hedstrom, L., Farr-Jones, S., Kettner, C. A. & Rutter, W. J. Converting trypsin to chymotrypsin: ground-state binding does not determine substrate specificity. Biochemistry 33, 8764–8769 (1994).
Article CAS PubMed Google Scholar
Kurth, T., Ullmann, D., Jakubke, H. D. & Hedstrom, L. Converting trypsin to chymotrypsin: structural determinants of S1’ specificity. Biochemistry 36, 10098–10104 (1997).
Article CAS PubMed Google Scholar
Hedstrom, L., Szilagyi, L. & Rutter, W. J. Converting trypsin to chymotrypsin: the role of surface loops. Science 255, 1249–1253 (1992).
Article ADS CAS PubMed Google Scholar
Hedstrom, L., Perona, J. J. & Rutter, W. J. Converting trypsin to chymotrypsin: residue 172 is a substrate specificity determinant. Biochemistry 33, 8757–8763 (1994).
Article CAS PubMed Google Scholar
Ridky, T. W. et al. Programming the Rous sarcoma virus protease to cleave new substrate sequences. J. Biol. Chem. 271, 10538–10544 (1996).
Article CAS PubMed Google Scholar
Lin, Y. C. et al. Alteration of substrate and inhibitor specificity of feline immunodeficiency virus protease. J. Virol. 74, 4710–4720 (2000).
Article CAS PubMed PubMed Central Google Scholar
Carrico, Z. M., Strobel, K. L., Atreya, M. E., Clark, D. S. & Francis, M. B. Simultaneous selection and counter-selection for the directed evolution of proteases in E. coli using a cytoplasmic anchoring strategy. Biotechnol. Bioeng. 113, 1187–1193 (2016).
Article CAS PubMed Google Scholar
Renicke, C., Spadaccini, R. & Taxis, C. A tobacco etch virus protease with increased substrate tolerance at the P1’ position. PLoS ONE 8, e67915 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Varadarajan, N., Gam, J., Olsen, M. J., Georgiou, G. & Iverson, B. L. Engineering of protease variants exhibiting high catalytic activity and exquisite substrate selectivity. Proc. Natl Acad. Sci. USA 102, 6855–6860 (2005).
Article ADS CAS PubMed PubMed Central Google Scholar
Verhoeven, K. D., Altstadt, O. C. & Savinov, S. N. Intracellular detection and evolution of site-specific proteases using a genetic selection system. Appl. Biochem. Biotechnol. 166, 1340–1354 (2012).
Article CAS PubMed Google Scholar
Yi, L. et al. Engineering of TEV protease variants by yeast ER sequestration screening (YESS) of combinatorial libraries. Proc. Natl Acad. Sci. USA 110, 7229–7234 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Varadarajan, N., Rodriguez, S., Hwang, B. Y., Georgiou, G. & Iverson, B. L. Highly active and selective endopeptidases with programmed substrate specificities. Nat. Chem. Biol. 4, 290–294 (2008).
Article CAS PubMed PubMed Central Google Scholar
Badran, A. H. & Liu, D. R. In vivo continuous directed evolution. Curr. Opin. Chem. Biol. 24C, 1–10 (2015).
Article Google Scholar
Esvelt, K. M., Carlson, J. C. & Liu, D. R. A system for the continuous directed evolution of biomolecules. Nature 472, 499–503 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
Badran, A. H. & Liu, D. R. Development of potent in vivo mutagenesis plasmids with broad mutational spectra. Nat. Commun. 6, 8425 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
Dougherty, W. G., Cary, S. M. & Parks, T. D. Molecular genetic analysis of a plant virus polyprotein cleavage site: a model. Virology 171, 356–364 (1989).
Article CAS PubMed Google Scholar
Phan, J. et al. Structural basis for the substrate specificity of tobacco etch virus protease. J. Biol. Chem. 277, 50564–50572 (2002).
Article CAS PubMed Google Scholar
Gaffen, S. L., Jain, R., Garg, A. V. & Cua, D. J. The IL-23-IL-17 immune axis: from mechanisms to therapeutic testing. Nat. Rev. Immunol. 14, 585–600 (2014).
Article CAS PubMed PubMed Central Google Scholar
Langrish, C. L. et al. IL-23 drives a pathogenic T cell population that induces autoimmune inflammation. J. Exp. Med. 201, 233–240 (2005).
Article CAS PubMed PubMed Central Google Scholar
Teng, M. W. et al. IL-12 and IL-23 cytokines: from discovery to targeted therapies for immune-mediated inflammatory diseases. Nat. Med. 21, 719–729 (2015).
Article CAS PubMed Google Scholar
Badran, A. H. et al. Continuous evolution of Bacillus thuringiensis toxins overcomes insect resistance. Nature 533, 58–63 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Carlson, J. C., Badran, A. H., Guggiana-Nilo, D. A. & Liu, D. R. Negative selection and stringency modulation in phage-assisted continuous evolution. Nat. Chem. Biol. 10, 216–222 (2014).
Article CAS PubMed PubMed Central Google Scholar
Dickinson, B. C., Leconte, A. M., Allen, B., Esvelt, K. M. & Liu, D. R. Experimental interrogation of the path dependence and stochasticity of protein evolution using phage-assisted continuous evolution. Proc. Natl Acad. Sci. USA 110, 9007–9012 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Dickinson, B. C., Packer, M. S., Badran, A. H. & Liu, D. R. A system for the continuous directed evolution of proteases rapidly reveals drug-resistance mutations. Nat. Commun. 5, 5352 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Hubbard, B. P. et al. Continuous directed evolution of DNA-binding proteins to improve TALEN specificity. Nat. Methods 12, 939–942 (2015).
Article CAS PubMed PubMed Central Google Scholar
Davis, J. H., Rubin, A. J. & Sauer, R. T. Design, construction and characterization of a set of insulated bacterial promoters. Nucleic Acids Res. 39, 1131–1141 (2011).
Article CAS PubMed Google Scholar
Makarova, O. V., Makarov, E. M., Sousa, R. & Dreyfus, M. Transcribing of Escherichia coli genes with mutant T7 RNA polymerases: stability of lacZ mRNA inversely correlates with polymerase speed. Proc. Natl Acad. Sci. USA 92, 12250–12254 (1995).
Article ADS CAS PubMed PubMed Central Google Scholar
Ratnikov, B., Cieplak, P. & Smith, J. W. High throughput substrate phage display for protease profiling. Methods Mol. Biol. 539, 93–114 (2009).
Article CAS PubMed PubMed Central Google Scholar
Scholle, M. D. et al. Mapping protease substrates by using a biotinylated phage substrate library. ChemBioChem. 7, 834–838 (2006).
Article CAS PubMed Google Scholar
Matthews, D. J. & Wells, J. A. Substrate phage: selection of protease substrates by monovalent phage display. Science 260, 1113–1117 (1993).
Article ADS CAS PubMed Google Scholar
Thomsen, M. C. & Nielsen, M. Seq2Logo: a method for construction and visualization of amino acid binding motifs and sequence profiles including sequence weighting, pseudo counts and two-sided representation of amino acid enrichment and depletion. Nucleic Acids Res. 40, W281–W287 (2012).
Google Scholar
Fuchs, J. E. et al. Cleavage entropy as quantitative measure of protease specificity. PLoS Comput. Biol. 9, e1003007 (2013).
Article CAS PubMed PubMed Central Google Scholar
Pop, C. & Salvesen, G. S. Human caspases: activation, specificity, and regulation. J. Biol. Chem. 284, 21777–21781 (2009).
Article CAS PubMed PubMed Central Google Scholar
Song, J. et al. PROSPER: an integrated feature-based tool for predicting protease substrate cleavage sites. PLoS ONE 7, e50300 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Lien, S., Pastor, R., Sutherlin, D. & Lowman, H. B. A substrate-phage approach for investigating caspase specificity. Protein J. 23, 413–425 (2004).
Article CAS PubMed Google Scholar
Cabrita, L. D. et al. Enhancing the stability and solubility of TEV protease using in silico design. Protein Sci. 16, 2360–2367 (2007).
Article CAS PubMed PubMed Central Google Scholar
van den Berg, S., Lofdahl, P. A., Hard, T. & Berglund, H. Improved solubility of TEV protease by directed evolution. J. Biotechnol. 121, 291–298 (2006).
Article PubMed Google Scholar
Wei, L. et al. In vivo and in vitro characterization of TEV protease mutants. Protein Expr. Purif. 83, 157–163 (2012).
Article CAS PubMed Google Scholar
Desmet, J. et al. Structural basis of IL-23 antagonism by an Alphabody protein scaffold. Nat. Commun. 5, 5237 (2014).
Article CAS PubMed PubMed Central Google Scholar
Aggarwal, S., Ghilardi, N., Xie, M. H., de Sauvage, F. J. & Gurney, A. L. Interleukin-23 promotes a distinct CD4 T cell activation state characterized by the production of interleukin-17. J. Biol. Chem. 278, 1910–1914 (2003).
Article CAS PubMed Google Scholar
Lerner, R. A., Benkovic, S. J. & Schultz, P. G. At the crossroads of chemistry and immunology - catalytic antibodies. Science 252, 659–667 (1991).
Article ADS CAS PubMed Google Scholar
Guerrero, J. L., O’Malley, M. A. & Daugherty, P. S. Intracellular FRET-based screen for redesigning the specificity of secreted proteases. ACS. Chem. Biol. 11, 961–970 (2016).
Article CAS PubMed Google Scholar
Bessette, P. H., Aslund, F., Beckwith, J. & Georgiou, G. Efficient folding of proteins with multiple disulfide bonds in the Escherichia coli cytoplasm. Proc. Natl Acad. Sci. USA 96, 13703–13708 (1999).
Article ADS CAS PubMed PubMed Central Google Scholar
Hatahet, F., Nguyen, V. D., Salo, K. E. & Ruddock, L. W. Disruption of reducing pathways is not essential for efficient disulfide bond formation in the cytoplasm of E. coli. Microb. Cell Fact. 9, 67 (2010).
PubMed PubMed Central Google Scholar
Nguyen, V. D. et al. Pre-expression of a sulfhydryl oxidase significantly increases the yields of eukaryotic disulfide bond containing proteins expressed in the cytoplasm of E. coli. Microb. Cell Fact. 10, 1 (2011).
Article CAS PubMed PubMed Central Google Scholar
Stewart, E. J., Aslund, F. & Beckwith, J. Disulfide bond formation in the Escherichia coli cytoplasm: an in vivo role reversal for the thioredoxins. EMBO J. 17, 5543–5550 (1998).
Article CAS PubMed PubMed Central Google Scholar
Nour-Eldin, H. H., Geu-Flores, F. & Halkier, B. A. USER cloning and USER fusion: the ideal cloning techniques for small and big laboratories. Plant Second. Metab. Eng. Methods Appl. 643, 185–200 (2010).
Article CAS Google Scholar

Download references

Acknowledgements

We are grateful to Sunia Trauger, Harvard Small Molecule Mass Spectrometry, for assistance with LC–MS experiments. This work was supported by U.S. National Institutes of Health (NIH) R01 EB022376 (formerly R01 GM065400), NIH R35 GM118062, DARPA HR0011-11-2-0003, DARPA N66001-12-C-4207 and the Howard Hughes Medical Institute. M.S.P. is an NSF Graduate Research Fellow and was supported by the Harvard Biophysics NIH training grant NIH NIGMS T32 GM008313. H.A.R. is supported by the Kilpatrick Educational Fund and the Harvard Chemistry and Chemical Biology Graduate Program.

Author information

Authors and Affiliations

Department of Chemistry and Chemical Biology, Harvard University, 12 Oxford Street, Cambridge, MA, 02138, USA
Michael S. Packer, Holly A. Rees & David R. Liu
Graduate Program in Biophysics Program, Harvard University, 240 Longwood Avenue, Boston, MA, 02115, USA
Michael S. Packer
Broad Institute of MIT and Harvard, 75 Ames Street, Cambridge, MA, 02142, USA
Michael S. Packer, Holly A. Rees & David R. Liu
Howard Hughes Medical Institute, Harvard University, 12 Oxford Street, Cambridge, MA, 02138, USA
David R. Liu

Authors

Michael S. Packer
View author publications
You can also search for this author in PubMed Google Scholar
Holly A. Rees
View author publications
You can also search for this author in PubMed Google Scholar
David R. Liu
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.S.P. designed the research, prepared materials and performed experiments. H.A.R. prepared materials and performed experiments. D.R.L. designed and supervised the research. All of the authors contributed to writing the manuscript.

Corresponding author

Correspondence to David R. Liu.

Ethics declarations

Competing interests

The authors have filed provisional patent applications on PACE technologies and on the evolved gene products that are disclosed in this manuscript..

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Packer, M.S., Rees, H.A. & Liu, D.R. Phage-assisted continuous evolution of proteases with altered substrate specificity. Nat Commun 8, 956 (2017). https://doi.org/10.1038/s41467-017-01055-9

Download citation

Received: 22 May 2017
Accepted: 14 August 2017
Published: 16 October 2017
DOI: https://doi.org/10.1038/s41467-017-01055-9

This article is cited by

Ultrahigh-throughput screening-assisted in vivo directed evolution for enzyme engineering
- Shuaili Chen
- Zhanhao Yang
- Guoqiang Zhang
Biotechnology for Biofuels and Bioproducts (2024)
Optimizing ABA-based chemically induced proximity for enhanced intracellular transcriptional activation and modification response to ABA
- Zeng Zhou
- Yue-Qi Wang
- Li Zhang
Science China Life Sciences (2024)
Evolution of protease activation and specificity via alpha-2-macroglobulin-mediated covalent capture
- Philipp Knyphausen
- Mariana Rangel Pereira
- Florian Hollfelder
Nature Communications (2023)
Steering and controlling evolution — from bioengineering to fighting pathogens
- Michael Lässig
- Ville Mustonen
- Armita Nourmohammad
Nature Reviews Genetics (2023)
In vivo hypermutation and continuous evolution
- Rosana S. Molina
- Gordon Rix
- Chang C. Liu
Nature Reviews Methods Primers (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.