Rapid and efficient protein synthesis through expansion of the native chemical ligation concept

Kulkarni, Sameer S.; Sayers, Jessica; Premdjee, Bhavesh; Payne, Richard J.

doi:10.1038/s41570-018-0122

Download PDF

Review Article
Published: 29 March 2018

Rapid and efficient protein synthesis through expansion of the native chemical ligation concept

Sameer S. Kulkarni¹,
Jessica Sayers¹,
Bhavesh Premdjee² &
…
Richard J. Payne¹

Nature Reviews Chemistry volume 2, Article number: 0122 (2018) Cite this article

24k Accesses
226 Citations
22 Altmetric
Metrics details

Subjects

Abstract

The growing interest in proteins, both in fundamental research and in drug discovery, has fuelled demand for efficient synthetic methods to access these biomolecules. Although solid-phase synthesis serves as the workhorse for accessing peptides up to 50 amino acids in length, ligation technologies have underpinned protein synthesis. Native chemical ligation (NCL) represents the most widely used method and relies on the reaction of a peptide bearing an N-terminal cysteine residue with a peptide thioester. While the seminal methodology was limited to reaction at N-terminal cysteine residues, the NCL concept has recently been extended with a view to improving reaction efficiency and scope. Specifically, the discovery that cysteine residues can be desulfurized to alanine has led to the development of a range of thiol-derived variants of the proteinogenic amino acids that can be employed in protein synthesis under a ligation–desulfurization manifold. Furthermore, a number of important technologies have been developed to access larger targets via multi-fragment assembly, including methods for latent thioester activation and orthogonal protecting group strategies. Very recently, the amino acid selenocysteine, together with selenylated proteinogenic amino acid variants, has been shown to facilitate rapid ligation with peptide selenoesters. The large rate accelerations of these ligations have enabled access to proteins on unprecedented timescales, while chemoselective deselenization chemistry renders hitherto unobtainable targets accessible. This Review highlights innovative developments that have greatly expanded the NCL concept, allowing it to serve as a rapid and efficient means of conquering more challenging synthetic protein targets in the near future.

A cysteine selenosulfide redox switch for protein chemical synthesis

Article Open access 22 May 2020

Diselenide–selenoester ligation for chemical protein synthesis

Article 21 June 2019

Selenium chemistry for spatio-selective peptide and protein functionalization

Article 22 February 2024

Introduction

Peptides and proteins are the most ubiquitous biomolecules in living systems and are responsible for orchestrating a plethora of functional and structural roles in the cell. The final structure and function of a given polypeptide are dictated by the specific sequence of 21 proteinogenic amino acids that is encoded within the genome of the organism. The flow of genetic information from DNA to peptides and proteins — the central dogma of molecular biology — involves the transcription of genetic information in DNA into mRNA, followed by the chemical synthesis of polypeptides at ribosomes based on the genetic information encoded in mRNA (translation). However, the total ensemble of proteins within a cell (the proteome) is far more complex than the genome of an organism alone would allow. For example, in humans, 25,000 genes are thought to lead to a proteome in excess of a million proteins. This enormous diversity arises from the chemical modification of proteins after ribosomal synthesis, leading to further diversification of an otherwise concise proteome. These so-called post-translational modifications (PTMs) encompass a wide range of chemical alterations to the protein structure, such as functionalization of amino acid side chains, and have been shown to modulate the structure and function of several proteins in profound ways^1–5. It is widely accepted that nature would not expend energy modifying polypeptides unless the products fulfil a highly important biological role, yet the effect of a given modification on the structure, stability, function and activity of the majority of peptides and proteins across all taxa remains unknown.

The exquisite specificity and potency of peptides and proteins at their targets have led to a renaissance in their use as therapeutics over the past decade^6–8. This is reflected in the United States Food and Drug Administration (FDA) approval rates, which are now more than double those of small molecules. Currently, there are more than 100 approved peptide drugs and over 200 protein therapeutics approved for clinical use⁹, accounting for more than 10% of the pharmaceutical market (ca. US$40 billion). This number is set to greatly increase in the coming years with hundreds of peptides and proteins currently in clinical trials or undergoing preclinical assessment. Peptide and protein drugs have typically been associated with two key drawbacks that can limit their therapeutic applicability: first, large molecular weight, which hampers bioavailability; and second, the presence of native peptide bonds, which are susceptible to proteolytic degradation, leading to short biological half-lives. In some cases, these shortcomings have been alleviated through the incorporation of native PTMs, for example, glycosylation, or through the installation of ‘bespoke modifications’ such as PEGylation¹⁰, lipidation or N-methylation.

While access to large polypeptides and proteins is typically achieved through biological expression in prokaryotic and eukaryotic systems, tailored modifications such as biotinylation, installation of d-amino acids¹¹ or fluorescent tags remain challenging to access using the cellular machinery, despite several advances in unnatural amino acid incorporation¹². As such, chemical synthesis has emerged as an attractive avenue to introduce specific modifications site-selectively and homogeneously on a protein of interest. This is in stark contrast to recombinant methods, in which the enzymatic nature of the PTM process results in inseparable heterogeneous mixtures of the target proteins.

Solid-phase peptide synthesis (SPPS) remains the most efficient platform for the chemical preparation of polypeptides up to 40–50 amino acid residues in length¹³. However, the linear nature of SPPS means that longer syntheses are usually plagued by peptide chain aggregation and steric crowding en bloc, which leads to truncated (uncoupled) sequences, unwanted side products and epimerization. The accumulation of these by-products over iterative steps results in low yields and purities of the final products. The size limit of polypeptide targets that can be generated by SPPS has inspired the development of chemical ligation methods to convergently assemble smaller peptide fragments to generate larger polypeptides and proteins. Early work in this area relied on the condensation of side-chain protected fragments. While this approach has been successfully exploited for the synthesis of peptide therapeutics, including on a production scale¹⁴, the poor solubility of crude protected fragments, as well as the susceptibility of the C-terminal amino acid residue to epimerize during activation, has made this approach largely unattractive. A key solution to these problems was provided through the development of peptide ligation technologies that facilitate the formation of native peptide bonds between completely unprotected peptide fragments^15–21. These technologies have underpinned the chemical synthesis of numerous peptide and protein targets that were previously inaccessible through recombinant methods or SPPS and have therefore played a key role in addressing a number of important questions in biology and medicine^3,22,23. Of the ligation technologies developed to date, the convergent assembly of peptide fragments by native chemical ligation (NCL) represents the most widely employed method¹⁵. The power of this methodology is highlighted by its use in the synthesis of hundreds of protein targets to date. This review focuses on the development of a number of new ligation technologies that have been inspired by the concept of NCL, together with their utility in the total chemical synthesis of large polypeptides and proteins, with and without modifications.

Native chemical ligation

The principle of chemical ligation for peptides was initially explored by Kent and co-workers in the early 1990s (see Box 1 for the origin of the concept and intellectual framework leading to the development of NCL). The reaction first involves a reversible trans-thioesterification via nucleophilic attack on the thioester by the Cys thiolate moiety, leading to the formation of a thioester linkage between the two peptide segments (Fig. 1a). The thioester intermediate then undergoes a rapid S→N acyl shift to afford a native amide bond. One of the key features of the reaction is that it operates in purely aqueous media at neutral pH, conditions that aid in the solubility and stability of the unprotected peptide fragments and the protein targets. NCL technology has revolutionized the field of chemical protein synthesis and has been used for the construction of numerous protein targets since the seminal report of the method in 1994 (Ref. 24). Examples that highlight the utility of the method include the synthesis of a biologically active variant of the 166-residue erythropoiesis protein²⁵, the 203-residue covalent dimer of HIV1 protease²⁶ and, very recently, the 358-residue D-Dpo4 enzyme²⁷. The NCL concept has also been successfully applied in a semisynthetic regime using expressed protein ligation as well as for the preparation of head-to-tail cyclic peptides^16,28. There are several excellent reviews that highlight applications of the traditional NCL method and, as such, they will not be discussed in any further detail in this article^29–31. Instead, the remainder of this Review will highlight the development of new methods inspired by the NCL concept that have greatly expanded the scope of ligation chemistry to provide efficient access to a larger number of synthetic protein targets.

**Figure 1: Ligation technologies inspired by the NCL concept.**

Expanding the scope of NCL beyond cysteine. Despite the empowering nature of NCL in protein synthesis, the need for a Cys residue on the N terminus of one of the peptide fragments limits the possible retrosynthetic disconnections that can be considered when using the method, especially given the paucity of Cys in naturally occurring proteins (1.8% abundance). In recent years, several innovative strategies involving the use of N-terminal auxiliaries^32,33 have been devised to enable protein disconnections at alternative ligation junctions and to abrogate reliance on N-terminal Cys residues (see Box 1 for details). Although auxiliary-mediated ligations have greatly increased the flexibility of ligation chemistry, such methods generally suffer from prolonged reaction times, whereby hydrolysis and epimerization become the dominant competing pathways and often require tedious auxiliary removal steps, leading to lower overall yields. Thus, auxiliary-based approaches have been used to access only a small number of protein targets to date.

In 2001, the scope of the NCL methodology was greatly expanded through the introduction of a post-ligation desulfurization concept. In the seminal report by Yan and Dawson³⁴, the product of the NCL reaction was treated with either Raney Ni or Pd on Al₂O₃ to effect reductive cleavage of the sulfhydryl moiety in the Cys side chain, generating a native Ala residue (Fig. 1a). This method therefore permits the use of Cys as a surrogate for ligation sites containing Ala, a substantially more abundant amino acid (8.9% of residues) in proteins. Ultimately this powerful advance expanded the number of ligation-based disconnections for a given protein target and has been successfully implemented in the synthesis of a number of Cys-free polypeptides and proteins^20,35. A limitation of the desulfurization protocol was the requirement for a large excess of nickel or palladium that in some cases led to undesirable side reactions, such as tryptophan hydrogenation or methionine demethylthiolization, affording α-amino butyric acid. A milder, metal-free desulfurization strategy was later developed by Danishefsky and co-workers³⁶. This method relies on the use of a water-soluble radical initiator 2,2′-azobis[2-(2-imidazolin-2-yl)propane]dihydrochloride (VA-044) together with tris(2-carboxyethyl)phosphine hydrochloride (TCEP) and a hydrogen atom source (in this case, ^tBuSH) in aqueous media to effect desulfurization. This radical-promoted desulfurization approach, based on an early report by Hoffmann et al.³⁷, is thought to be initiated by the formation of a thiyl radical at the Cys side chain, which adds reversibly to the phosphine (Fig. 1b). The resulting phosphoranyl radical can then undergo β-scission to produce an alanyl radical and a phosphine sulfide. Rapid hydrogen abstraction by the alanyl radical from an exogenous thiol then generates the native Ala residue. Importantly, these conditions are completely chemoselective in the presence of a range of potentially susceptible functionalities, including thioesters and methionine residues. Very recently, Li and co-workers have shown that rapid and clean desulfurization can be effected without the use of a radical initiator through treatment with sodium borohydride and TCEP³⁸.

Since the seminal report by Yan and Dawson³⁴, the post-ligation desulfurization concept has found enormous application in the total chemical synthesis of proteins via disconnection at Ala residues³⁵. However, the methodology has also served as a catalyst for the concept that thiol-derived variants of other canonical amino acids could be used as Cys surrogates in NCL, followed by desulfurization to native residues. This has fuelled the development of synthetic routes towards suitably protected β-thiol, γ-thiol and δ-thiol derivatives of the proteinogenic amino acids that can be directly incorporated into fragments by SPPS and employed in protein synthesis using a ligation–desulfurization manifold. A decade on from the synthesis of the first thiol-derived amino acid, intensive research efforts by a number of research groups have culminated in a comprehensive toolbox of 9-fluorenylmethyloxycarbonyl (Fmoc)-SPPS-compatible thiolated variants of 13 of the proteinogenic amino acids, which have greatly expanded the repertoire of peptide ligation chemistry^35,39,40 (Fig. 1c). The first contribution to the amino acid toolbox was β-thiol Phe reported independently by Crich⁴¹ and Botti⁴². The β-thiol Phe moiety was shown to successfully mediate ligation reactions with peptide thioesters in good yields when incorporated on the N terminus of peptides. Subsequent removal of the β-thiol auxiliary could be performed using nickel boride to generate the native Phe residue at the ligation junction; however, this desulfurization can also be performed using a radical initiator⁴³. This proof-of-concept study, which demonstrated that ligation–desulfurization chemistry was possible at amino acids other than Cys, sparked interest in other thiol-derived amino acids by the community, some key examples of which are highlighted below.

Val and Leu represent some of the most abundant amino acids found in proteins (6.8% and 9.8%, respectively), and it was therefore not surprising that these were early targets for synthesis. Seitz and co-workers made use of a suitably protected variant of penicillamine (β,β-dimethylcysteine) as a valine surrogate⁴⁴, while Danishefsky and co-workers developed a γ-thiol valine reagent that led to faster reaction kinetics when reacted with peptide thioesters compared with the homologous reactions at β,β-dimethylcysteine, owing to the improved accessibility of the thiol auxiliary⁴⁵. However, ligation products from both Val surrogates could be cleanly desulfurized using the metal-free method (TCEP and VA-044) to afford native polypeptide products. Following this, syntheses of β-mercapto Leu derivatives were independently reported by the Brik⁴⁶ and Danishefsky⁴⁷ groups, the former report showcasing the methodology through the ligation-based assembly of the HIV-1 Tat protein. Another key reagent is the γ-thiol Lys derivative⁴⁸, which owing to its ability to mediate ligation at both α-amino and ε-amino groups through six-membered S→N acyl shift transition states, has found numerous important applications in protein synthesis. More specifically, this dual ligation capability offers synthetic access to peptides and proteins bearing natural PTMs at the Lys side chain, including acetylation, ubiquitylation and methylation.

A powerful example of the benefits offered by the thiol-derived amino acid toolbox is highlighted by the consecutive use of β-thiol Leu, γ-thiol Val and Cys for the assembly of human parathyroid hormone (hPTH, 1), reported by Danishefsky and co-workers⁴⁹ (Fig. 1c). More specifically, bifunctional fragment 2 bearing an N-terminal β-thiol Leu and a C-terminal alkyl thioester was initially ligated with thiophenyl thioester 3 to yield hPTH (1–37, 4). Separately, N-terminal Thz-protected fragment 5, functionalized as a thiophenyl thioester on the C terminus, was reacted with γ-thiol Val fragment 6. Subsequent Thz deprotection of the resulting ligation product afforded the Cys-bearing fragment hPTH (39–84, 7). A final NCL between hPTH (1–38, 4), possessing a C-terminal thioester, and hPTH (39–84, 7), with an N-terminal Cys residue, yielded the full-length hPTH sequence, which following global desulfurization, effected the removal of all three thiol auxiliaries to generate the native protein hPTH (1–84, 1) after a folding step.

A final noteworthy addition to the toolbox is β-thiolated Asp, which can be prepared in three steps from a commercially available Asp starting material [Boc-Asp(O^tBu)–OH]⁵⁰. This reagent has proved particularly useful owing to the development of an initiator-free chemoselective desulfurization reaction using TCEP and dithiothreitol (DTT) at pH 3, which enables removal of the β-thiol auxiliary in the presence of free sulfhydryl side chains of native Cys residues. This selective desulfurization technique therefore obviates the need for protecting group manipulation in protein targets containing functionally important Cys residues, as highlighted in the synthesis of the extracellular N-terminal domain of the chemokine receptor CXCR4 bearing two PTMs⁵⁰. A further application of this chemoselective ligation–desulfurization methodology was recently reported by Becker and co-workers, who prepared a number of differentially PEGylated prion proteins by ligation at β-thiol Asp followed by chemoselective desulfurization in the presence of a native unprotected Cys residue⁵¹.

Directional flexibility for iterative assembly of proteins. Protein assembly via iterative ligation reactions in the N→C direction was originally achieved by harnessing the differing reactivity of thioesters in a kinetically controlled ligation. Kent and co-workers first demonstrated this concept by using the reactivity of a thiophenyl thioester on the C terminus of one fragment with a bifunctional fragment possessing Cys on the N terminus and a less reactive alkyl thioester on the C terminus that does not partake in the ligation reaction. This methodology was initially showcased in the six-segment assembly of the small protein crambin⁵². The kinetically controlled ligation concept has been further modified for one-pot protein synthesis by the use of a thiol additive — 2,2,2-trifluoroethanethiol (TFET) — which serves to increase the rate of ligation reactions through in situ generation of thioesters with increased reactivity⁵³. Aryl thiol additives, such as mercaptophenylacetic acid (MPAA) (pK_a = 6.6) and thiophenol (pK_a = 6.6), are more commonly used to accelerate NCL reactions owing to their demonstrated proficiency in thioester exchange reactions with alkyl thioesters as well as their excellent leaving group ability upon reaction with the cysteinyl peptide fragment. Unfortunately, the radical quenching activity of these aryl thiol additives prohibits in situ radical desulfurization of the ligation products, and intermediate purification and lyophilization steps must therefore be carried out before a subsequent ligation reaction can be performed. Alternative methods to extract the aryl thiol species following ligation have been reported, including liquid–liquid extraction⁵⁴ and solid-phase capture procedures⁵⁵. However, TFET alleviates the need for these additional steps; the pK_a (7.3) leads to highly competent thioester exchange and efficient acylation by the Cys thiol moiety. Furthermore, the volatility of TFET (boiling point = 35−37 °C) permits facile post-ligation removal through simple sparging with an inert gas; however, the alkyl thiol TFET is a poor radical quencher and therefore can remain in the reaction for in situ desulfurization of Cys or thiol amino acids at the ligation junction. It should be noted that commercially available TFET often requires distillation before use (depending on the source and purity) and, as a malodorous and volatile thiol, should be handled inside a fumehood.

The power of the one-pot kinetically controlled ligation–desulfurization strategy was showcased in the efficient assembly of four differentially sulfated variants of madanin-1, a 60-amino acid Cys-free thrombin inhibitor produced by the hard tick Haemaphysalis longicornis⁵⁶ (Fig. 2a). The family of proteins (8–11) was assembled through the use of three suitably reactive peptide fragments, with the middle bifunctional fragments (12–15) possessing all possible sulfated variants at two Tyr sulfation sites (Tyr32 and Tyr35). The preformed TFET thioester 16 was initially ligated to bifunctional (sulfo)peptide fragments 12, 13, 14 or 15 bearing an N-terminal β-thiol Asp residue and a C-terminal alkyl thioester. The ligation proceeded regioselectively at the TFET thioester owing to increased reactivity compared with the alkyl thioester on 12–15. Upon complete reaction, the C-terminal Thr alkyl thioester was activated with 2 vol.% TFET and subjected to a second ligation with cysteinyl peptide 17. The resulting full-length product was finally subjected to in situ global desulfurization using VA-044, TCEP and reduced glutathione to convert Cys and β-SH Asp residues into Ala and Asp, respectively, affording native madanin-1 8 and madanin-1 sulfoproteins 9–11 in excellent yields over the multistep sequence. These synthetic proteins enabled the importance of Tyr sulfation for anticoagulant and thrombin inhibitory activity to be determined. Specifically, Tyr sulfation was shown to provide a 2–3 orders of magnitude improvement in thrombin inhibitory activity over the unmodified madanin-1 homologue⁵⁶.

**Figure 2: Chemical protein synthesis via iterative ligations in the N→C direction.**

Several other strategies have also been developed to enable iterative ligation reactions in the N→C direction. The most useful of these fall broadly into the category of thioester precursors and include C-terminal Cys activation⁵⁷, the bis(2-sulfanylethyl)amido (SEA) auxiliary⁵⁸, N-sulfanylethylanilide auxiliary⁵⁹, N-alkyl Cys^60,61, 3,4-diaminobenzoic acid (Dbz)⁶² and o-amino(methyl)aniline (MeDbz)⁶³ linkers, o-aminoanilides⁶⁴ and peptide acyl hydrazides^65–68. The SEA auxiliary, first reported by Melnyk and co-workers, possesses a 1,7-dithiol structure, which allows rapid interconversion between the inactive N-acyl perhydro-1,2,5-dithiazepine moiety (SEA^off) and the N→S acyl shift-active SEA dithiol form (SEA^on) through simple redox manipulations. In the reduced form, the SEA auxiliary is competent in ligation chemistry through conversion into a thioester either by exchange with an exogenous additive, such as 3-mercaptopropionic acid, or through trapping of the N→S acyl-shifted SEA thioester with glyoxylic acid. Importantly, the SEA^off cyclic disulfide is compatible with mild reducing agents (for example, MPAA) commonly employed as thiol catalysts in NCL reactions, allowing the use of NCL and SEA ligations in concert. The orthogonality of the SEA auxiliary with NCL was recently highlighted in the synthesis of functional SUMO-1 peptide conjugate 18 (Ref. 69) (Fig. 2b). Initially, SUMO fragment thioester 19 was prepared via activation of the peptide bearing a C-terminal SEA auxiliary (not shown) through exchange with 3-mercaptopropionic acid. The resulting thioester 19 was then subjected to MPAA-catalysed NCL with fragment 20 bearing an N-terminal Cys and a latent C-terminal SEA^off moiety. Subsequent addition of TCEP facilitated the switching of SEA^off→SEA^on and the SEA ligation could then be conducted with fragment 21 in a one-pot manner to afford 18.

Peptide acyl hydrazides have also proved to be highly useful thioester surrogates for the N→C assembly of protein targets through ligation chemistry⁶⁵. Conversion of a given peptide with a C-terminal acyl hydrazide functionality into a thioester is performed through an operationally simple activation of the hydrazide with NaNO₂ followed by thiolysis of the resulting acyl azide with an external thiol additive. Crucially, the hydrazide moiety remains inactive under NCL conditions, therefore acting as a masked thioester that can be unleashed for iterative ligation reactions in the N→C direction. An elegant example of the acyl hydrazide-based NCL approach is the preparation⁶⁸ of α-synuclein 22, a protein that has been implicated in the formation of neuronal Lewy bodies and in the progression of several neurodegenerative disorders, including Parkinson’s disease. Liu et al.⁶⁸ devised a four-segment N→C sequential ligation strategy starting with the activation of acyl hydrazide fragment 23 with NaNO₂, followed by thiolysis (Fig. 2c). The resulting peptide thioester could then be ligated to fragment 24 in an MPAA-catalysed NCL reaction. This procedure was then repeated with fragments 25 and 26 to afford the full-length protein. Global radical desulfurization to effect Cys into Ala conversions at each of the three ligation junctions then afforded synthetic α-synuclein 22 in excellent overall yield.

In a manner similar to N→C protein assembly, several effective methods have also been developed for assembling proteins in the C→N direction. The crux of this concept is to precisely control sequential ligation steps through the use of orthogonal protecting groups for N-terminal Cys residues or Cys surrogates. The design and utility of appropriate Cys protecting groups remains a contemporary research focus⁷⁰; however, several viable strategies have been reported in successful protein syntheses, including Thz^71,72 derivatives and acetamidomethyl (Acm)⁷³ protection of Cys. An elegant method from Brik and co-workers employed a Thz-protected δ-thiol Lys residue to facilitate three iterative ligations in the C→N direction at the ε-amino moiety of Lys with ubiquitin chains functionalized as C-terminal thioesters to generate tetraubiquitin 27 (Ref. 74) (Fig. 3a). Protein assembly was accomplished using three ubiquitin fragments, 28 containing an N-terminal δ-thiol Lys, 29 bearing an N-terminal Thz-protected δ-thiol Lys and a C-terminal thioester and peptide thioester 30. Using iterative cycles of benzylmercaptan-catalysed and thiophenol-catalysed ligation reactions and acidic methoxyamine-mediated Thz deprotection steps, four ubiquitin units were assembled to afford the 304-amino acid protein tetramer. Global radical desulfurization of the three δ-thiol Lys residues to native lysines then provided tetraubiquitin 27. Notably, Liu and co-workers have very recently reported the synthesis of hexaubiquitin through iterative acyl hydrazide chemistry, which represents one of the largest proteins to ever be prepared by chemical synthesis⁷⁵.

**Figure 3: Chemical protein synthesis via iterative ligations in C→N direction.**

Kajihara and co-workers also demonstrated the power of an iterative C→N ligation strategy for the total synthesis of a homogeneously glycosylated variant of interferon-β (IFNβ, 31)⁷⁶. The approach involved disconnection of the 166-amino acid target into three fragments, whereby two Cys residues were introduced for NCL reactions, while the three native Cys residues were protected with Acm groups throughout the protein assembly (Fig. 3b). The synthesis began with NCL between N-terminal cysteinyl fragment 32 and the thioester of glycosylated fragment 33, which possessed a Thz residue on the N terminus. Subsequent methoxyamine-mediated Thz deprotection unmasked the N-terminal Cys residue to afford 34, which could then participate in ligation with N-terminal fragment 35. With the full-length protein assembled, desulfurization of the non-native Cys residues was effected under metal-free conditions. Silver acetate-promoted removal of the Acm protection on Cys and saponification of the benzyl ester protection on sialic acid followed by folding furnished homogeneously glycosylated IFNβ (31).

The ability to perform ligation–desulfurization reactions in an iterative manner in both the N→C and C→N directions has greatly improved the efficiency of chemical protein synthesis. With these technologies, the community has redefined the targets that can be produced, with substantially larger targets (>120 amino acids) now becoming more routinely accessible.

Box 1: Development of NCL and related N-terminal thiol auxiliary approaches

The pursuit of a chemoselective amide-bond-forming transformation between unprotected peptides initially centred on the conjugation of a peptide with a C-terminal thioacid functionality and a peptide with an N-terminal bromoacetamide motif. Importantly, unprotected peptide segments could be solubilized at mM concentrations in 6 M guanidine hydrochloride (GdnHCl) buffer, which enabled efficient conversion into the thioester-linked product (see scheme part a showing the intellectual framework leading to the development of native chemical ligation (NCL))¹²³. Following this study, it was envisioned that the reaction of the C-terminal thioacid with a peptide bearing an N-terminal β-bromoalanine residue would enable the generation of a thioester, which would then undergo an S→N acyl transfer to yield a native peptide bond (scheme part a). However, preliminary experiments found aziridine formation to be a notable competing side reaction. Nonetheless, these experiments helped set the intellectual framework for the NCL reaction, which uses a peptide bearing a C-terminal thioester and a peptide containing an N-terminal Cys residue as the coupling partners (scheme part a)¹⁵.

The requirement for a cysteine residue makes traditional NCL unsuitable for numerous protein targets, which either do not contain Cys or do not possess this residue at a synthetically useful junction. Early approaches to circumvent this problem involved the use of N-terminal auxiliaries (such as derivatives of ethanethiol or 2-mercaptobenzyl) to mimic the role of the Cys thiol group in facilitating a trans-thioesterification event, therefore enabling proximity-induced acylation of the auxiliary-bound secondary amine (scheme part b shows a general scheme for N-terminal auxiliary-mediated ligation)¹²⁴. A final cleavage of the ligation auxiliary then affords the native peptide product. Unsurprisingly, additional steric bulk at the N-terminal amine results in a decreased reaction rate and poor sequence tolerance at the ligation junction. Attention later turned to a mechanistically similar method using auxiliaries appended to the side chain of the N-terminal amino acid. An early side-chain auxiliary strategy, developed by Wong and co-workers, employed a thiol-derived β-O-linked carbohydrate moiety and was termed sugar-assisted ligation^125–127. Alternative side-chain, backbone and N-terminal auxiliaries have since emerged from the groups of Brik¹²⁸, Hojo¹²⁹ and Seitz¹³⁰.

Extending NCL to selenocysteine

In parallel with the revolutionary advances of the NCL reaction manifold, there has also been considerable research attention focused on tackling some of the inherent limitations of the technology, specifically the lack of chemoselectivity of the desulfurization reaction in the presence of native Cys residues and the prohibitively slow ligation rates at sterically demanding amino acid junctions. In 2001, three independent groups demonstrated that the 21st amino acid (Sec) was competent in NCL-like transformations with peptide thioesters, thus providing access to large selenopeptides and selenoproteins for the first time^77–79. Sec was first acknowledged to be biologically vital based on the selenoenzyme glutathione peroxidase, which displayed selenium-based catalytic activity⁸⁰. Since then, several selenoproteins have been identified with functions ranging from phospholipid biosynthesis, muscle development and calcium mobilization, to modulators of redox-regulated signalling^81–89. Despite being the chalcogenic analogue of Cys, Sec exhibits some strikingly different physicochemical properties. First, Sec exhibits a considerably lower reduction potential (−381 mV) than Cys (−180 mV)^90,91. As a result, Sec readily undergoes air oxidation and exists exclusively as a dimeric species (diselenide)⁹². A reducing agent is therefore required for NCL reactions to proceed efficiently through the generation of the monomeric selenolate⁷⁸. Second, the pKa of Sec (5.2–5.6) is lower than that of Cys (8.2), meaning that when monomeric, it exists predominantly as selenolate at physiological pH, thus enabling NCL at Sec to be performed at a lower pH and offering higher yields by minimizing thioester hydrolysis.

Since its inception, Sec-mediated NCL has been exploited for the synthesis of a wide range of peptides^{67,79,93–95} and proteins^{77,78,96–100}. Some of the early examples include a 17-mer fragment of ribonucleotide reductase⁷⁹ and a bovine pancreatic trypsin inhibitor (BPTI) analogue⁷⁸, both possessing an intramolecular selenosulfide linkage. In another example, Hilvert and co-workers synthesized a cyclic peptide by macrolactamization of a linear precursor functionalized with a C-terminal thioester and an N-terminal Sec residue by NCL⁹⁴. In addition, the internal Sec in the ligated cyclic product was shown to be amenable to various synthetic transformations, such as alkylation, oxidative elimination and reductive deselenization⁹⁴. Sec was also found to be compatible with expressed protein ligation (EPL), demonstrated through the synthesis of RNase A⁷⁷ and azurin¹⁰¹. In both cases, a synthetic peptide bearing an N-terminal selenocystine, instead of native Cys, was ligated with a large protein thioester derived from recombinant techniques. Very recently, Rozovsky and co-workers have developed a method for the incorporation of Sec into expressed protein fragments by enriching the growth medium for Escherichia coli with Sec, such that it could be subsequently incorporated using the Cys codon¹⁰². Importantly, the ENLYFQ motif was N-terminally fused to Sec, and the resulting Q–Sec junction in the mature construct could be cleaved using tobacco etch virus (TEV) protease to afford proteins with an N-terminal Sec residue. This enabled the development of expressed protein ligation at Sec with large expressed fragments (Sec-EPL). This work provides the impetus for the site-specific modification of Sec residues within proteins in the future; however, the method currently cannot accommodate Cys residues in the Sec-containing fragment owing to non-selective incorporation.

In 2010, a landmark contribution to Sec-based ligation chemistry came from Dawson and co-workers, who discovered that deselenization of a Sec residue could be achieved under mild conditions using the reducing agent TCEP and a hydrogen donor, such as DTT (Fig. 4a). Importantly this method does not require a radical initiator and was completely chemoselective in the presence of unprotected Cys residues⁹⁵. This pivotal finding highlighted the enormous potential of Sec-based NCL as a method for the construction of proteins retaining native Cys residues that may be crucial to the structure and/or function of a given protein target. It is important to note that before the development of this methodology, the synthesis of Cys-containing targets under a NCL–desulfurization regime necessitated the use of protecting groups on the side chains of Cys residues during the desulfurization step. The observed selectivity is proposed to arise from the weaker C–Se bond, favouring formation of the alanyl radical at Sec over Cys. Mechanistically, the deselenization is proposed to proceed through reversible addition of a selenium-centred radical to the phosphine, leading to a phosphorus-centred radical species. The highly thermodynamically favourable production of a phosphine selenide is then proposed to drive the homolysis of the C–Se bond. The resulting β-carbon-centred radical is then capable of hydrogen atom abstraction to generate the native alanine residue (Fig. 4b). Initially, the ligation–chemoselective deselenization approach was applied on small peptidic systems, including a 38-residue fragment of the redox enzyme glutaredoxin 3 (Grx3, 1–38)⁹⁵. However, the power of this methodology was further exemplified by Metanis et al. in the synthesis of the 125-amino acid human enzyme phosphohistidine phosphatase (PHPT1, 40)⁹⁷. With three Cys residues located in the C-terminal region of the sequence, a strategy that employed both traditional NCL and Sec-mediated ligation reactions was devised, with three segments undergoing sequential ligations in the C→N direction (Fig. 4c). Bifunctional segment 37 was prepared bearing an N-terminal Sec residue protected as a Sez and functionalized with a C-terminal thioester for a standard NCL reaction with the N-terminal Cys residue of fragment 38. The ligated product was treated with MeONH₂ to effect conversion of the Sez moiety into Sec and subjected to Sec-mediated NCL with C-terminal thioester segment 39. Interestingly, while Sez was demonstrated to be stable under Fmoc-SPPS and NCL conditions (similar to Thz), the authors reported that MeONH₂-promoted ring opening was faster in the case of Thz (based on a model system). The purified ligation product could then be successfully deselenized to afford PHPT1 (40). Importantly, the deselenization of Sec proceeded smoothly without modifying the three unprotected Cys residues present in the sequence, thus highlighting the selectivity of the protocol. The same group later accomplished the total synthesis of the 122-residue human selenoprotein M (SELM) through the iterative Sec-mediated ligation–deselenization assembly of four fragments in the C→N direction⁹⁸. Notably, SELM comprises a CXXU motif that is crucial for its biological activity; this would otherwise be inaccessible with traditional Cys-based ligation methods.

**Figure 4: Applications of peptide ligation chemistry at the 21st amino acid (Sec)**

A further exploitation of the Sec reactivity was demonstrated by the Payne⁹⁶ and Metanis¹⁰³ groups, who independently discovered that treatment of Sec with TCEP in the presence of an exogenous oxidant leads to clean conversion into serine at the ligation junction. The discovery of this oxidative deselenization transformation has further broadened Sec ligation chemistry beyond Ala disconnections^96,103,104. While the Metanis group performed deselenization in the presence of oxygen¹⁰³, Payne and co-workers employed oxone as the oxidant^96,104. Notably, the latter approach has been successfully employed for the synthesis of MUC4 and MUC5AC-based glycopeptides^96,104. Furthermore, the methodology was used to assemble the Cys-free protein eglin C (41) via a single ligation between C-terminal thioester 42 and selenocystine-bearing fragment 43, followed by oxidative deselenization⁹⁶ (Fig. 4d).

In a manner similar to the improvement in the scope of NCL through the development of thiol-derived amino acids, expansion of the Sec ligation was first attempted by Danishefsky and co-workers, who developed the synthesis of a trans-γ-selenoproline building block in three steps from orthogonally protected hydroxyproline⁹³. This amino acid was subsequently incorporated into model peptides using Fmoc-based SPPS and was successfully used in ligation–deselenization chemistry with various peptide thioesters. Malins et al. also developed an efficient synthesis of a suitably protected β-selenophenylalanine from Garner’s aldehyde, which could also be successfully employed in ligation chemistry followed by chemoselective deselenization in the presence of unprotected Cys⁶⁷. Taken together, the Sec-based ligation methods, coupled with chemoselective deselenization chemistry, represent powerful new approaches for accessing protein targets without strategically placed Cys residues or where chemoselective removal of the ligation auxiliary in the presence of other sensitive residues (for example, structurally or functionally important Cys residues) is necessary.

Selenoester acyl donors for acceleration of ligation-based protein assembly. The rate of NCL is known to be strongly influenced by the steric and electronic environment of the C-terminal amino acid residue of the thioester component. For instance, peptide thioesters bearing sterically hindered β-branched amino acids at the C terminus (for example, Ile, Thr and Val) suffer from sluggish reaction rates, affording lower ligation yields owing to competing thioester hydrolysis. In the case of C-terminal proline thioesters, an n→π* electronic donation into the carbonyl carbon leads to reduced electrophilicity of the prolyl thioesters, making Pro-Cys junctions synthetically intractable¹⁰⁵. A solution to this problem was reported by Durek and Alewood, who rationalized that replacement of the thioester moiety by an alkyl selenoester would lead to productive ligation chemistry, owing to the superior leaving group ability of the selenolate over the thiolate. Indeed, ligation at model peptides bearing a C-terminal prolyl selenoester were complete in 2 hours, nearly 350 times faster than traditional NCL¹⁰⁶ (Fig. 5a). This initial study laid the foundation for the use of selenoesters as acyl donors in the ligation-based assembly of proteins, including several reports describing efficient methods for accessing peptide selenoesters of various lengths both in solution¹⁰⁷ and on the solid phase¹⁰⁸.

**Figure 5: Peptide ligation chemistry using selenoesters as the acyl donor.**

More recently, Mitchell et al. postulated that substantial rate accelerations in peptide ligation chemistry could be achieved by harnessing the superior reactivity of C-terminal selenoesters (in this case, aryl selenoesters) in combination with the improved nucleophilicity of Sec at the N termini of the other reacting peptide fragments. To test this hypothesis, two model peptides were chosen for the initial experiments, one functionalized as a C-terminal Ala phenyl selenoester and the other was a diselenide dimer possessing an N-terminal selenocystine residue. The authors initially explored the possibility of implementing electrochemistry for reduction of the diselenide to the ligation-active selenolate in order to circumvent the concomitant deselenization of Sec that occurs in the presence of phosphine reductants. However, in a serendipitous finding, the control experiment (without application of a current) led to the generation of the desired ligation product¹⁰⁷. Strikingly, the ligation proceeded cleanly by simply dissolving the peptide fragments in denaturing buffer without the addition of any additives. The additive-free reaction, which was subsequently dubbed diselenide–selenoester ligation (DSL), was also complete within 60 seconds at room temperature, representing a large rate acceleration over the analogous reaction under an NCL manifold. This rate acceleration was also maintained at sterically hindered selenoesters, which were complete within 10 minutes, comparing favourably with the analogous NCL reactions, which require up to 48 hours. Because selenoesters are considerably more prone to hydrolysis than thioesters (at pH > 7), careful pH adjustment during ligation is crucial. Fortunately, the ability to perform these ligations at acidic pH (5–7) enables competing selenoester hydrolysis to be circumvented. It is important to note that the final ligation product is typically obtained as a mixture of symmetrical diselenide, asymmetrical diselenide and product bearing a selenoester linkage on the Sec used for ligation. However, all of these products coalesce into a single product following in situ deselenization (via treatment with TCEP and DTT) (Fig. 5b). Intrigued by this unique transformation, a series of experimental and computational investigations were undertaken to gain mechanistic insight into this rapid ligation methodology. Given the absence of any reductants or additives, it has been proposed that there is a unique initiation step for the DSL transformation to provide a competent intermediate that can enter a native chemical ligation-like pathway. In addition, based on the data compiled from theoretical and experimental observations, precipitation of diphenyl diselenide (DPDS) — a by-product generated during ligation in aqueous buffer — was proposed to be a potential driving force for the reaction. It is worth mentioning that the DPDS produced during ligation acts as a radical quencher and thus needs to be removed through hexane extraction before performing an in situ deselenization reaction. Recently, ligation has also been used in conjunction with oxidative deselenization technology to afford serine at the ligation junction, as showcased in the synthesis of fragments of some human mucin glycoproteins¹⁰⁴.

To illustrate the synthetic utility, the additive-free DSL–deselenization methodology was also applied in the construction of two proteins¹⁰⁷. First, intracellular chorismate mutase from Mycobacterium tuberculosis was assembled through a one-pot ligation–deselenization approach with 57% yield over two steps. After folding, the full-length enzyme was found to possess structure and catalytic activity similar to those of the wild-type enzyme. The orthogonality of the DSL chemistry with NCL was also exemplified through the synthesis of another M. tuberculosis protein — early secretory antigenic protein 6 (ESAT-6). The synthesis of the target was accomplished in high yield via a one-pot kinetically controlled ligation of three fragments in the N→C direction (Fig. 5c). Specifically, the inherent difference in the reactivity of thioesters and selenoesters was exploited through chemoselective DSL between bifunctional middle diselenide dimer segment 44, with an N-terminal selenocystine and a C-terminal thioester, and peptide phenylselenoester 45. The ligated product 46 was generated exclusively in minutes and could be subsequently subjected to NCL with C-terminal segment 47 bearing an N-terminal Cys using TFET as an additive. The presence of TCEP in the NCL step also led to the concomitant deselenization of Sec40 used for the initial DSL reaction. Upon desulfurization using VA-044, TCEP and glutathione, ESAT-6 48 was obtained in 43% yield over four steps following a single high-pressure liquid chromatography (HPLC) purification. The speed and efficiency of the additive-free DSL technology makes it a valuable addition to the ligation chemistry toolbox for the chemical synthesis of proteins. With salient features such as operational simplicity, unprecedented reaction rates (even at sterically encumbered junctions), broad pH tolerance (pH 3–7) and compatibility with unprotected Cys residues and NCL, it is likely that this methodology will find wide application in the chemical synthesis or semi-synthesis of numerous other important protein targets with or without PTMs and other modifications in the future.

Rapid protein assembly via DSL chemistry at selenol-derived amino acids. Since the first report of DSL in 2015, a number of suitably protected selenylated amino acids, including β-selenoLeu¹⁰⁹, β-selenoAsp¹¹⁰ and γ-selenoGlu¹¹⁰, have been developed with a view to broaden the scope of the methodology (Fig. 6). Each of these building blocks is compatible with Fmoc-SPPS and have been successfully employed in DSL–deselenization transformations, including protein synthesis, as highlighted below.

**Figure 6: Protein synthesis via diselenide–selenoester ligation (DSL)–deselenization at diselenide-derived amino acids.**

The utility of DSL–deselenization chemistry at β-selenoLeu has been powerfully demonstrated in the synthesis of a library of differentially modified variants of the CCL5 (also known as RANTES) chemokine-binding protein UL22A from human cytomegalovirus, which were predicted to be sulfated at Tyr65 and Tyr69 (Ref. 109). As there are no Cys or suitably placed alanine residues in the native sequence of UL22A, the protein could not be assembled through traditional ligation methods. Wang and co-workers therefore chose to disconnect UL22A at a challenging Val-Leu junction, leading to two target fragments, diselenide 49 bearing an N-terminal β-selenoLeu and N-terminal fragments 50–53 with variation in the sulfation state at Tyr65 and Tyr69 (Fig. 6a). The synthesis of 49 was achieved by Fmoc-SPPS, incorporating the suitably protected β-selenoLeu, which was in turn accessed from Garner’s aldehyde in eight steps¹⁰⁹. Additive-free DSL reactions between 49 and 50–53 were initially unsuccessful, presumably owing to the sterically demanding nature of the Val-Leu junction. Nonetheless, all ligations with (sulfo)peptide phenylselenoesters 50–53 proceeded smoothly in the presence of TCEP and DPDS as additives, reaching completion in just 1 hour. Following in situ deselenization and HPLC purification, the desired (sulfo)proteins were obtained in excellent yields. The doubly sulfated UL22A was shown to possess a 2.5 orders of magnitude improvement in binding to RANTES over the unmodified protein, thus validating the importance of sulfation for biological activity¹⁰⁹.

Although DSL reactions generally reach completion in minutes, the chemistry is followed by a comparably sluggish deselenization step, normally requiring 6–16 hours. As such, improving the rate of deselenization would provide access to polypeptides and proteins on extraordinarily short timescales. Such an innovation was recently described by Mitchell et al., who demonstrated that the presence of a weak C–Se bond in β-selenoAsp and γ-selenoGlu enables rapid and clean deselenization in less than a minute, orders of magnitude faster than deselenization at Sec¹¹⁰. The exceptional rate increase is thought to be a result of stabilization of the carbon-centred radical, generated during deselenization, by the neighbouring carboxylate functionality in the Asp and Glu derivatives. These selenoamino acids were synthesized in three steps starting from commercially available Boc-Asp(O^tBu)–OH and Boc-Glu(O^tBu)–OH, respectively, with the key selenol functionality installed through an electrophilic selenylation reaction. These could be readily incorporated into model peptides using the Fmoc-SPPS strategy and were demonstrated to undergo ligation followed by rapid deselenization, furnishing desired ligated products in excellent yields. Based on these promising results, it was reasoned that this rapid deselenization combined with the expedient ligation reaction could provide a means to accelerate chemical protein synthesis. To explore this possibility, a library of tick-derived thrombin inhibitors (hyalomin-2, hyalomin-3 and hyalomin-4)¹¹¹ was prepared using one-pot ligation–deselenization technology at β-selenoAsp. As exemplified for hyalomin-2 (58) in Fig. 6b, the hyalomins were assembled from two fragments, one functionalized as a C-terminal phenylselenoester (59) and the other as a peptide dimer bearing an N-terminal β-selenoAsp moiety (60)¹¹⁰. A single one-pot DSL–deselenization transformation facilitated the production of each of the hyalomin proteins within minutes. The entire synthetic procedure, HPLC purification, solvent evaporation (via centrifugal concentration), quantification and thrombin inhibition bioassay could be performed within just 3 hours, opening the exciting possibility of generating rapid SAR on small proteins using this technology in the future.

Due to the rapid kinetics of deselenization of β-selenoAsp and γ-selenoGlu, it was hypothesized that this step could be performed chemoselectively in the presence of an unprotected Sec residue, enabling access to native selenoproteins. Accordingly, selenoprotein K (SelK) was selected as a synthetic target to validate this concept. SelK contains a Sec residue close to the C terminus (Sec92) that is responsible for the formation of an intermolecular diselenide in the native homodimeric protein^112–114. Biologically, SelK is an endoplasmic reticulum membrane protein, believed to be involved in regulating cellular redox balance in cardiomyocytes⁸⁸ and stimulating Ca²⁺ flux to control immunity⁸⁶. The protein was disconnected between Tyr60 and Asp61, necessitating the synthesis of 59-residue peptide phenyl selenoester 61, as well as 62, which possesses an intramolecular diselenide between the native Sec and the N-terminal β-selenoAsp moiety (Fig. 7). Both fragments were made by standard Fmoc-SPPS methods. For 62, the Sec and β-selenoAsp residues were introduced in suitably protected form, and upon cleavage, deprotection and purification afforded exclusively the intramolecular diselenide product. The purified segments were next reacted under additive-free DSL conditions, providing the ligated product as the intramolecular diselenide in 62% yield. Treatment of this intermediate with TCEP (in the absence of an external hydrogen atom source) for just 2 minutes effected the chemoselective deselenization of β-selenoAsp over Sec to afford SelK. In an attempt to streamline the process, a one-pot ligation–chemoselective deselenization strategy was also employed and provided direct access to SelK (63) in 40% yield¹¹⁰.

**Figure 7: One-pot synthesis of 21 kDa homodimeric SelK.**

Based on early applications of DSL technology, the speed and unrivalled chemoselectivity is expected to find widespread applications in the rapid access to therapeutic peptides and protein libraries in the near future, including those possessing the 21st amino acid (Sec). Moreover, access to selenylated derivatives of other proteinogenic amino acids will further expand the scope of this methodology.

Summary and outlook

With the recent advances in peptide ligation technology, it is clear that chemical synthesis can now be used to produce large polypeptide and small protein therapeutics in a highly robust and efficient manner. As such, it is possible that these methods can provide a viable alternative to traditionally used recombinant expression technologies while providing the additional benefit and flexibility of incorporating PTMs or bespoke modifications in a site-specific manner to attenuate structure, function and stability. Inspired by the transformational NCL concept, recently developed methodologies have overcome many of the limitations of the seminal approach and have expanded the number of accessible protein targets as well as the efficiency of chemical protein synthesis. For example, with access to a range of synthetic thiolated amino acids, the repertoire of NCL has been greatly broadened and provides an enormous amount of retrosynthetic flexibility for accessing a protein target by total chemical synthesis. The potential to purchase these reagents from commercial vendors in the future should allow further uptake of these technologies by the community. Furthermore, the rapid kinetics of the recently reported DSL technology provides a viable avenue to overcome one of the remaining key challenges of standard peptide ligation chemistry, the sluggish rates of reaction at sterically hindered junctions. The development of multi-component protein assembly in the N→C or C→N direction, coupled with the orthogonality of NCL-based and DSL-based methods, raises the exciting possibility of generating proteins with minimal handling and intermediary purification steps and on unprecedented timescales. While the median size of a human protein is 375 residues, most proteins that have been generated by chemical synthesis to date are half this size. However, the development of orthogonal N-terminal protection strategies and masked C-terminal acyl donors, coupled with NCL and DSL chemistry, now provides a means to target proteins of increasing size and complexity. Indeed, the groups of Kay¹¹⁵, Liu^27,116 and Klussmann¹¹⁷ have recently reported the preparation of larger targets by total chemical synthesis. An alternative means of generating larger proteins bearing homogeneous modifications is through EPL techniques¹¹⁸. This methodology is well established for NCL, and recently developed recombinant methods for Sec incorporation open the possibility of EPL under a DSL manifold.

The plethora of enabling methods available for chemical protein synthesis also has the potential to open up new fields of research. For example, the speed and efficiency of the latest ligation techniques offer the exciting possibility of generating protein libraries, thus enabling synthetic protein medicinal chemistry. While such a platform cannot compete with the large number of targets that can be generated through phage display¹¹⁹ or expanded codon methodologies^120,121, it has the potential to fuel peptide and protein drug discovery efforts by enabling focused library generation and the establishment of SARs in a manner similar to that of medicinal chemistry programmes with small molecules, where modified, unnatural and/or d-amino acids can be installed site-selectively. The bottleneck of chemical protein synthesis is no longer ligation-based assembly but rather the time-consuming synthesis of the suitably functionalized peptide segments by SPPS, along with laborious HPLC purification and freeze-drying steps. While it is still very difficult to predict the efficiency of the synthesis of a given peptide target by SPPS, a number of recent modifications to the standard method, namely, microwave heating and flow chemistry¹²², have the potential to accelerate this time-consuming process. Automating the SPPS process with purification would reveal the tantalizing possibility of performing semi-automated protein synthesis in the future.

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

How to cite this article

Kulkarni, S. S., Sayers, J., Premdjee, B. & Payne, R. J. Rapid and efficient protein synthesis through expansion of the native chemical ligation concept. Nat. Rev. Chem. 2, 0122 (2018).

References

Muir, T. W. & Kent, S. B. H. The chemical synthesis of proteins. Curr. Opin. Biotechnol. 4, 420–427 (1993).
Article CAS PubMed Google Scholar
Gamblin, D. P., Scanlan, E. M. & Davis, B. G. Glycoprotein synthesis: an update. Chem. Rev. 109, 131–163 (2009).
Article CAS PubMed Google Scholar
Stone, M. J. & Payne, R. J. Homogeneous sulfopeptides and sulfoproteins: synthetic approaches and applications to characterize the effects of tyrosine sulfation on biochemical function. Acc. Chem. Res. 48, 2251–2261 (2015).
Article CAS PubMed Google Scholar
Wright, T. H., Vallee, M. R. J. & Davis, B. G. From chemical mutagenesis to post-expression mutagenesis: a 50 year Odyssey. Angew. Chem. Int. Ed. 55, 5896–5903 (2016).
Article CAS Google Scholar
Huttner, W. B. Sulphation of tyrosine residues — a widespread modification of proteins. Nature 299, 273–276 (1982).
Article CAS PubMed Google Scholar
Uhlig, T. et al. The emergence of peptides in the pharmaceutical business: from exploration to exploitation. EuPA Open Proteom. 4, 58–69 (2014).
Article CAS Google Scholar
Fosgerau, K. & Hoffmann, T. Peptide therapeutics: current status and future directions. Drug Discov. Today 20, 122–128 (2015).
Article CAS PubMed Google Scholar
Lagassé, H. A. D. et al. Recent advances in (therapeutic protein) drug development [version 1; referees: 2 approved]. F1000Research 6, 113 (2017).
Article PubMed PubMed Central CAS Google Scholar
Usmani, S. S. et al. THPdb: Database of FDA-approved peptide and protein therapeutics. PloS ONE 12, e0181748 (2017).
Article CAS Google Scholar
Harris, J. M. & Chess, R. B. Effect of pegylation on pharmaceuticals. Nat. Rev. Drug Discov. 2, 214–221 (2003).
Article CAS PubMed Google Scholar
Kent, S. B. H. et al. Through the looking glass — a new world of proteins enabled by chemical synthesis. J. Pept. Sci. 18, 428–436 (2012).
Article CAS PubMed Google Scholar
Xie, J. & Schultz, P. G. A chemical toolkit for proteins — an expanded genetic code. Nat. Rev. Mol. Cell Biol. 7, 775–782 (2006).
Article CAS PubMed Google Scholar
Merrifield, R. B. Solid Phase Peptide Synthesis. I. The Synthesis of a Tetrapeptide. J. Am. Chem. Soc. 85, 2149–2154 (1963).
Article CAS Google Scholar
Bray, B. L. Large-scale manufacture of peptide therapeutics by chemical synthesis. Nat. Rev. Drug Discov. 2, 587–593 (2003).
Article CAS PubMed Google Scholar
Dawson, P. E., Muir, T. W., Clark-Lewis, I. & Kent, S. B. H. Synthesis of proteins by native chemical ligation. Science 266, 776–779 (1994). This seminal report revolutionized the field of chemical protein synthesis. The method has been employed in the assembly of hundreds of important protein targets.
Article CAS PubMed Google Scholar
Muir, T. W., Sondhi, D. & Cole, P. A. Expressed protein ligation: a general method for protein engineering. Proc. Natl Acad. Sci. USA 95, 6705–6710 (1998).
Article CAS PubMed PubMed Central Google Scholar
Bode, J. W., Fox, R. M. & Baucom, K. D. Chemoselective amide ligations by decarboxylative condensations of N -alkylhydroxylamines and α-ketoacids. Angew. Chem. Int. Ed. 45, 1248–1252 (2006).
Article CAS Google Scholar
Nilsson, B. L., Kiessling, L. L. & Raines, R. T. Staudinger ligation: a peptide from a thioester and azide. Org. Lett. 2, 1939–1941 (2000).
Article CAS PubMed Google Scholar
Saxon, E., Armstrong, J. I. & Bertozzi, C. R. A “traceless” Staudinger ligation for the chemoselective synthesis of amide bonds. Org. Lett. 2, 2141–2143 (2000).
Article CAS PubMed Google Scholar
Hackenberger, C. P. R. & Schwarzer, D. Chemoselective ligation and modification strategies for peptides and proteins. Angew. Chem. Int. Ed. 47, 10030–10074 (2008).
Article CAS Google Scholar
Zhang, Y., Xu, C., Lam, H. Y., Lee, C. L. & Li, X. Protein chemical synthesis by serine and threonine ligation. Proc. Natl Acad. Sci. USA 110, 6657–6662 (2013).
Article CAS PubMed PubMed Central Google Scholar
Unverzagt, C. & Kajihara, Y. Chemical assembly of N-glycoproteins: a refined toolbox to address a ubiquitous posttranslational modification. Chem. Soc. Rev. 42, 4408–4420 (2013).
Article CAS PubMed Google Scholar
Masania, J., Li, J., Smerdon, S. J. & Macmillan, D. Access to phosphoproteins and glycoproteins through semi-synthesis, native chemical ligation and N to S acyl transfer. Org. Biomol. Chem. 8, 5113–5119 (2010).
Article CAS PubMed Google Scholar
Agouridas, V., El Mahdi, O., Cargoët, M. & Melnyk, O. A statistical view of protein chemical synthesis using NCL and extended methodologies. Bioorg. Med. Chem. 25, 4938–4945 (2017).
Article CAS PubMed Google Scholar
Kochendoerfer, G. G. et al. Design and chemical synthesis of a homogeneous polymer-modified erythropoiesis protein. Science 299, 884–887 (2003).
Article CAS PubMed Google Scholar
Torbeev, V. Y. & Kent, S. B. H. Convergent chemical synthesis and crystal structure of a 203 amino acid “covalent dimer” HIV-1 protease enzyme molecule. Angew. Chem. Int. Ed. 46, 1667–1670 (2007).
Article CAS Google Scholar
Jiang, W. et al. Mirror-image polymerase chain reaction. Cell Discov. 3, 17037 (2017).
Article CAS PubMed PubMed Central Google Scholar
Clark, R. J. & Craik, D. J. Native chemical ligation applied to the synthesis and bioengineering of circular peptides and proteins. Pept. Sci. 94, 414–422 (2010).
Article CAS Google Scholar
Dawson, P. E. & Kent, S. B. H. Synthesis of native proteins by chemical ligation. Annu. Rev. Biochem. 69, 923–960 (2000).
Article CAS PubMed Google Scholar
Kent, S. B. H. Total chemical synthesis of proteins. Chem. Soc. Rev. 38, 338–351 (2009).
Article CAS PubMed Google Scholar
Kent, S. B. H. Chemical protein synthesis: Inventing synthetic methods to decipher how proteins work. Bioorg. Med. Chem. 25, 4926–4937 (2017).
Article CAS PubMed Google Scholar
Macmillan, D. Evolving strategies for protein synthesis converge on native chemical ligation. Angew. Chem. Int. Ed. 45, 7668–7672 (2006).
Article CAS Google Scholar
Offer, J. Native chemical ligation with N^α acyl transfer auxiliaries. Pept. Sci. 94, 530–541 (2010).
Article CAS Google Scholar
Yan, L. Z. & Dawson, P. E. Synthesis of peptides and proteins without cysteine residues by native chemical ligation combined with desulfurization. J. Am. Chem. Soc. 123, 526–533 (2001). This important work established the conceptual framework for ligation–desulfurization chemistry.
Article CAS PubMed Google Scholar
Malins, L. R. & Payne, R. J. Recent extensions to native chemical ligation for the chemical synthesis of peptides and proteins. Curr. Opin. Chem. Biol. 22, 70–78 (2014).
Article CAS PubMed Google Scholar
Wan, Q. & Danishefsky, S. J. Free-radical-based, specific desulfurization of cysteine: a powerful advance in the synthesis of polypeptides and glycopolypeptides. Angew. Chem. Int. Ed. 46, 9248–9252 (2007). This milder, metal-free desulfurization using TCEP, a water-soluble radical initiator (VA-044) and a hydrogen atom source (such as ^tBuSH) overcomes some limitations of the original metal-based approach and is demonstrated to be compatible with thioesters, methionine and protected Cys residues.
Article CAS Google Scholar
Hoffmann, F. W., Ess, R. J., Simmons, T. C. & Hanzel, R. S. The desulfurization of meraptans with trialkyl phosphites. J. Am. Chem. Soc. 78, 6414 (1956).
Article CAS Google Scholar
Jin, K., Li, T., Chow, H. Y., Liu, H. & Li, X. P.B. Desulfurization: an enabling method for protein chemical synthesis and site-specific deuteration. Angew. Chem. Int. Ed. 56, 14607–14611 (2017).
Article CAS Google Scholar
Malins, L. R. & Payne, R. J. Synthetic amino acids for applications in peptide ligation-desulfurization chemistry. Aust. J. Chem. 68, 521–537 (2015).
Article CAS Google Scholar
Premdjee, B. & Payne, R. J. in Chemical Ligation: Tools for Biomolecule Synthesis and Modification (eds D’Andrea, L. D. & Romanelli, A. ) 161–218 (John Wiley & Sons, USA, 2017).
Book Google Scholar
Crich, D. & Banerjee, A. Native chemical ligation at phenylalanine. J. Am. Chem. Soc. 129, 10064–10065 (2007).
Article CAS PubMed Google Scholar
Botti, P. & Tchertchian, S. Side-chain extended ligation. Patent WO2006133962 (2006).
Malins, L. R., Giltrap, A. M., Dowman, L. J. & Payne, R. J. Synthesis of β-thiol phenylalanine for applications in one-pot ligation–desulfurization chemistry. Org. Lett. 17, 2070–2073 (2015).
Article CAS PubMed Google Scholar
Haase, C., Rohde, H. & Seitz, O. Native chemical ligation at valine. Angew. Chem. Int. Ed. 47, 6807–6810 (2008).
Article CAS Google Scholar
Chen, J., Wan, Q., Yuan, Y., Zhu, J. & Danishefsky, S. J. Native chemical ligation at valine: a contribution to peptide and glycopeptide synthesis. Angew. Chem. Int. Ed. 120, 8649–8652 (2008).
Article Google Scholar
Harpaz, Z., Siman, P., Kumar, K. S. A. & Brik, A. Protein synthesis assisted by native chemical ligation at leucine. ChemBioChem 11, 1232–1235 (2010).
Article CAS PubMed Google Scholar
Tan, Z. P., Shang, S. Y. & Danishefsky, S. J. Insights into the finer issues of native chemical ligation: an approach to cascade ligations. Angew. Chem. Int. Ed. 49, 9500–9503 (2010).
Article CAS Google Scholar
Yang, R. L., Pasunooti, K. K., Li, F. P., Liu, X. W. & Liu, C. F. Dual native chemical ligation at lysine. J. Am. Chem. Soc. 131, 13592–13593 (2009).
Article CAS PubMed Google Scholar
Shang, S. Y., Tan, Z. P. & Danishefsky, S. J. Application of the logic of cysteine-free native chemical ligation to the synthesis of Human Parathyroid Hormone (hPTH). Proc. Natl Acad. Sci. USA 108, 5986–5989 (2011). This noteworthy article underlines the utility of the toolbox of thiol-derived amino acids to assemble large proteins through a convergent kinetically controlled ligation strategy.
Article CAS PubMed PubMed Central Google Scholar
Thompson, R. E., Chan, B., Radom, L., Jolliffe, K. A. & Payne, R. J. Chemoselective ligation-desulfurization at aspartate. Angew. Chem. Int. Ed. 52, 9723–9727 (2013).
Article CAS Google Scholar
Araman, C. et al. Semisynthetic Prion Protein (PrP) variants carrying glycan mimics at position 181 and 197 do not form fibrils. Chem. Sci. 8, 6626–6632 (2017).
Article CAS PubMed PubMed Central Google Scholar
Bang, D., Pentelute, B. L. & Kent, S. B. H. Kinetically controlled ligation for the convergent chemical synthesis of proteins. Angew. Chem. Int. Ed. 45, 3985–3988 (2006). This pivotal work showcases the power of kinetically controlled ligation chemistry by providing directional flexibility to construct proteins.
Article CAS Google Scholar
Thompson, R. E. et al. Trifluoroethanethiol: an efficient additive for one-pot ligation-desulfurization chemistry. J. Am. Chem. Soc. 136, 8161–8164 (2014). This work demonstrates one-pot protein synthesis from three fragments using iterative ligation–desulfurization chemistry. Key to this methodology is the use of TFET, which does not interfere with radical desulfurization.
Article CAS PubMed Google Scholar
Cergol, K. M., Thompson, R. E., Malins, L. R., Turner, P. & Payne, R. J. One-pot peptide ligation–desulfurization at glutamate. Org. Lett. 16, 290–293 (2013).
Article PubMed CAS Google Scholar
Moyal, T., Hemantha, H. P., Siman, P., Refua, M. & Brik, A. Highly efficient one-pot ligation and desulfurization. Chem. Sci. 4, 2496–2501 (2013).
Article CAS Google Scholar
Thompson, R. E. et al. Tyrosine sulfation modulates activity of tick-derived thrombin inhibitors. Nat. Chem. 9, 909–917 (2017).
Article CAS PubMed Google Scholar
Macmillan, D., Adams, A. & Premdjee, B. Shifting native chemical ligation into reverse through N to S acyl transfer. Isr. J. Chem. 51, 885–899 (2011).
Article CAS PubMed PubMed Central Google Scholar
Ollivier, N., Dheur, J., Mhidia, R., Blanpain, A. & Melnyk, O. Bis(2-sulfanylethyl)amino native peptide ligation. Org. Lett. 12, 5238–5241 (2010).
Article CAS PubMed Google Scholar
Sato, K. et al. N-sulfanylethylanilide peptide as a crypto-thioester peptide. ChemBioChem 12, 1840–1844 (2011).
Article CAS PubMed Google Scholar
Hojo, H., Onuma, Y., Akimoto, Y., Nakahara, Y. & Nakahara, Y. N-alkyl cysteine-assisted thioesterification of peptides. Tetrahedron Lett. 48, 25–28 (2007).
Article CAS Google Scholar
Elrich, L. A., Kumar, K. S. A., Haj-Yahya, M., Dawson, P. E. & Brik, A. N-methylcysteine-mediated total chemical synthesis of ubiquitin thioester. Org. Biomol. Chem. 8, 2392–2396 (2010).
Article CAS Google Scholar
Blanco-Canosa, J. B. & Dawson, P. E. An efficient Fmoc-SPPS approach for the generation of thioester peptide precursors for use in native chemical ligation. Angew. Chem. Int. Ed. 47, 6851–6855 (2008).
Article CAS Google Scholar
Blanco-Canosa, J. B., Nardone, B., Albericio, F. & Dawson, P. E. Chemical protein synthesis using a second-generation N-acylurea linker for the preparation of peptide-thioester precursors. J. Am. Chem. Soc. 137, 7197–7209 (2015).
Article CAS PubMed Google Scholar
Wang, J. X. et al. Peptide o-aminoanilides as crypto-thioesters for protein chemical synthesis. Angew. Chem. Int. Ed. 54, 2194–2198 (2015).
Article CAS Google Scholar
Fang, G. M. et al. Protein chemical synthesis by ligation of peptide hydrazides. Angew. Chem. Int. Ed. 50, 7645–7649 (2011). This seminal paper outlines the use of C-terminal acyl hydrazides as thioester surrogates to facilitate iterative ligation-based assembly of proteins in the N→C direction.
Article CAS Google Scholar
Fang, G. M., Wang, J. X. & Liu, L. Convergent chemical synthesis of proteins by ligation of peptide hydrazides. Angew. Chem. Int. Ed. 51, 10347–10350 (2012).
Article CAS Google Scholar
Malins, L. R. & Payne, R. J. Synthesis and utility of β-selenol-phenylalanine for native chemical ligation–deselenization chemistry. Org. Lett. 14, 3142–3145 (2012).
Article CAS PubMed Google Scholar
Zheng, J. S., Tang, S., Qi, Y. K., Wang, Z. P. & Liu, L. Chemical synthesis of proteins using peptide hydrazides as thioester surrogates. Nat. Protoc. 8, 2483–2495 (2013).
Article CAS PubMed Google Scholar
Boll, E. et al. One-pot chemical synthesis of small ubiquitin-like modifier protein–peptide conjugates using bis(2-sulfanylethyl)amido peptide latent thioester surrogates. Nat. Protoc. 10, 269–292 (2015).
Article CAS PubMed Google Scholar
Jbara, M., Maity, S. K. & Brik, A. Palladium in the chemical synthesis and modification of proteins. Angew. Chem. Int. Ed. 56, 10644–10655 (2017).
Article CAS Google Scholar
Huang, Y.C. et al. Synthesis of l- and d-ubiquitin by one-pot ligation and metal-free desulfurization. Chem. Eur. J. 22, 7623–7628 (2016).
Article CAS PubMed Google Scholar
Bang, D. & Kent, S. B. H. A. One-pot total synthesis of crambin. Angew. Chem. Int. Ed. 43, 2534–2538 (2004).
Article CAS Google Scholar
Veber, D. F., Milkowski, J. D., Denkewalter, R. G. & Hirschmann, R. The synthesis of peptides in aqueous medium, IV. A novel protecting group for cysteine. Tetrahedron Lett. 26, 3057–3058 (1968).
Article Google Scholar
Kumar, K. S. A. et al. Total chemical synthesis of a 304 amino acid K48-linked tetraubiquitin protein. Angew. Chem. Int. Ed. 50, 6137–6141 (2011).
Article CAS Google Scholar
Tang, S. et al. Practical chemical synthesis of atypical ubiquitin chains by using an isopeptide-linked Ub isomer. Angew. Chem. Int. Ed. 56, 13333–13337 (2017).
Article CAS Google Scholar
Sakamoto, I. et al. Chemical synthesis of homogeneous human glycosyl-interferon-β that exhibits potent antitumor activity in vivo. J. Am. Chem. Soc. 134, 5428–5431 (2012).
Article CAS PubMed Google Scholar
Hondal, R. J., Nilsson, B. L. & Raines, R. T. Selenocysteine in native chemical ligation and expressed protein ligation. J. Am. Chem. Soc. 123, 5140–5141 (2001).
Article CAS PubMed Google Scholar
Quaderer, R., Sewing, A. & Hilvert, D. Selenocysteine-mediated native chemical ligation. Helv. Chim. Acta 84, 1197–1206 (2001).
Article CAS Google Scholar
Gieselman, M. D., Xie, L. & Van Der Donk, W. A. Synthesis of a selenocysteine-containing peptide by native chemical ligation. Org. Lett. 3, 1331–1334 (2001). These three independent works (references 77–79) demonstrate that the 21st amino acid (Sec) is competent in NCL-like transformations with peptide thioesters, providing access to large selenopeptides and selenoproteins. This work lays the foundation for the use of selenoamino acids in ligation chemistry.
Article CAS PubMed Google Scholar
Flohe, L., Guenzler, W. A. & Schock, H. H. Glutathione peroxidase: a selenoenzyme. FEBS Lett. 32, 132–134 (1973).
Article CAS PubMed Google Scholar
Kryukov, G. V. et al. Characterization of mammalian selenoproteomes. Science 300, 1439–1443 (2003).
Article CAS PubMed Google Scholar
Reeves, M. A. & Hoffmann, P. R. The human selenoproteome: recent insights into functions and regulation. Cell. Mol. Life Sci. 66, 2457–2478 (2009).
Article CAS PubMed PubMed Central Google Scholar
Muttenthaler, M. & Alewood, P. F. Selenopeptide chemistry. J. Pept. Sci. 14, 1223–1239 (2008).
Article CAS PubMed Google Scholar
Lu, J. & Holmgren, A. Selenoproteins. J. Biol. Chem. 284, 723–727 (2009).
Article CAS PubMed Google Scholar
Johansson, L., Gafvelin, G. & Arner, E. S. J. Selenocysteine in proteins—properties and biotechnological use. Biochim. Biophys. Acta, Gen. Subj. 1726, 1–13 (2005).
Article CAS Google Scholar
Verma, S. et al. Selenoprotein K knockout mice exhibit deficient calcium flux in immune cells and impaired immune responses. J. Immunol. 186, 2127–2137 (2011).
Article CAS PubMed Google Scholar
Shchedrina, V. A. et al. Selenoprotein K binds multiprotein complexes and is involved in the regulation of endoplasmic reticulum homeostasis. J. Biol. Chem. 286, 42937–42948 (2011).
Article CAS PubMed PubMed Central Google Scholar
Lu, C. et al. Identification and characterization of selenoprotein K: an antioxidant in cardiomyocytes. FEBS Lett. 580, 5189–5197 (2006).
Article CAS PubMed Google Scholar
Du, S., Zhou, J., Jia, Y. & Huang, K. SelK is a novel ER stress-regulated protein and protects HepG2 cells from ER stress agent-induced apoptosis. Arch. Biochem. Biophys. 502, 137–143 (2010).
Article CAS PubMed Google Scholar
Besse, D., Siedler, F., Diercks, T., Kessler, H. & Moroder, L. The redox potential of selenocystine in unconstrained cyclic peptides. Angew. Chem. Int. Ed. 36, 883–885 (1997).
Article CAS Google Scholar
Nauser, T., Dockheer, S., Kissner, R. & Koppenol, W. H. Catalysis of electron transfer by selenocysteine. Biochemistry 45, 6038–6043 (2006).
Article CAS PubMed Google Scholar
Guenther, W. H. H. Methods in selenium chemistry. III. Reduction of diselenides with dithiothreitol. J. Org. Chem. 32, 3931–3934 (1967).
Article CAS Google Scholar
Townsend, S. D. et al. Advances in proline ligation. J. Am. Chem. Soc. 134, 3912–3916 (2012).
Article CAS PubMed PubMed Central Google Scholar
Quaderer, R. & Hilvert, D. Selenocysteine-mediated backbone cyclization of unprotected peptides followed by alkylation, oxidative elimination or reduction of the selenol. Chem. Commun. 2620–2621 (2002).
Metanis, N., Keinan, E. & Dawson, P. E. Traceless ligation of cysteine peptides using selective deselenization. Angew. Chem. Int. Ed. 49, 7049–7053 (2010). This landmark paper revealed that deselenization of a Sec residue can be achieved in the absence of a traditional radical initiator using a reducing agent (TCEP) and a hydrogen donor, and is completely chemoselective in the presence of unprotected Cys residues.
Article CAS Google Scholar
Malins, L. R., Mitchell, N. J., McGowan, S. & Payne, R. J. Oxidative deselenization of selenocysteine: applications for programmed ligation at serine. Angew. Chem. Int. Ed. 54, 12716–12721 (2015).
Article CAS Google Scholar
Sai Reddy, P., Dery, S. & Metanis, N. Chemical synthesis of proteins with non-strategically placed cysteines using selenazolidine and selective deselenization. Angew. Chem. Int. Ed. 55, 992–995 (2016).
Article CAS Google Scholar
Dery, L. et al. Accessing human selenoproteins through chemical protein synthesis. Chem. Sci. 8, 1922–1926 (2017).
Article CAS PubMed Google Scholar
Mousa, R., Dardashti, R. N. & Metanis, N. Selenium and selenocysteine in protein chemistry. Angew. Chem. Int. Ed. 56, 15818–15827 (2017).
Article CAS Google Scholar
Mousa, R., Reddy, P. S. & Metanis, N. Chemical protein synthesis through selenocysteine chemistry. Synlett 28, 1389–1393 (2017).
Article CAS Google Scholar
Berry, S. M., Gieselman, M. D., Nilges, M. J., van der Donk, W. A. & Lu, Y. An engineered azurin variant containing a selenocysteine copper ligand. J. Am. Chem. Soc. 124, 2084–2085 (2002).
Article CAS PubMed Google Scholar
Liu, J., Chen, Q. & Rozovsky, S. Utilizing selenocysteine for expressed protein ligation and bioconjugations. J. Am. Chem. Soc. 139, 3430–3437 (2017).
Article CAS PubMed PubMed Central Google Scholar
Dery, S. et al. Insights into the deselenization of selenocysteine into alanine and serine. Chem. Sci. 6, 6207–6212 (2015).
Article CAS PubMed PubMed Central Google Scholar
Mitchell, N. J., Kulkarni, S. S., Wang, S., Malins, L. R. & Payne, R. J. One-pot ligation–oxidative deselenization at selenocysteine and selenocystine. Chem. Eur. J. 23, 946–952 (2017).
Article CAS PubMed Google Scholar
Pollock, S. B. & Kent, S. B. H. An investigation into the origin of the dramatically reduced reactivity of peptide-prolyl-thioesters in native chemical ligation. Chem. Commun. 47, 2342–2344 (2011).
Article CAS Google Scholar
Durek, T. & Alewood, P. F. Preformed selenoesters enable rapid native chemical ligation at intractable sites. Angew. Chem. Int. Ed. 50, 12042–12045 (2011).
Article CAS Google Scholar
Mitchell, N. J. et al. Rapid additive-free selenocystine–selenoester peptide ligation. J. Am. Chem. Soc. 137, 14011–14014 (2015). This article discloses a rapid and highly efficient peptide ligation reaction called diselenide–selenoester ligation (DSL). DSL provides unprecedented reaction rates, does not require any additives and can be used in conjunction with NCL, greatly expanding the number of protein targets that can be accessed by chemical synthesis.
Article CAS PubMed Google Scholar
Hanna, C. C., Kulkarni, S. S., Watson, E. E., Premdjee, B. & Payne, R. J. Solid-phase synthesis of peptide selenoesters via a side-chain anchoring strategy. Chem. Commun. 53, 5424–5427 (2017).
Article CAS Google Scholar
Wang, X., Sanchez, J., Stone, M. & Payne, R. J. Sulfation of the human cytomegalovirus protein UL22A enhances binding to the chemokine RANTES. Angew. Chem. Int. Ed. 56, 8490–8494 (2017).
Article CAS Google Scholar
Mitchell, N. J. et al. Accelerated protein synthesis via one-pot ligation-deselenization chemistry. Chem 2, 703–715 (2017). This work demonstrates that the presence of a weaker C–Se bond in β-selenoAsp and γ-selenoGlu enables rapid and clean deselenization in less than a minute and, in combination with DSL, provides a means to expedite access to synthetic proteins. Deselenization at β-selenoAsp and γ-selenoGlu can be performed chemoselectively in the presence of native Sec, providing access to native selenoproteins.
Article CAS Google Scholar
Jablonka, W. et al. Identification and mechanistic analysis of a novel tick-derived inhibitor of thrombin. PLoS ONE 10, e0133991 (2015).
Article PubMed PubMed Central CAS Google Scholar
Liu, J., Srinivasan, P., Pham, D. N. & Rozovsky, S. Expression and purification of the membrane enzyme selenoprotein K. Protein Expr. Purif. 86, 27–34 (2012).
Article CAS PubMed PubMed Central Google Scholar
Liu, J., Zhang, Z. & Rozovsky, S. Selenoprotein K form an intermolecular diselenide bond with unusually high redox potential. FEBS Lett. 588, 3311–3321 (2014).
Article CAS PubMed PubMed Central Google Scholar
Liu, J. & Rozovsky, S. Membrane-bound selenoproteins. Antioxid. Redox Signal 23, 795–813 (2015).
Article CAS PubMed Google Scholar
Weinstock, M. T., Jacobsen, M. T. & Kay, M. S. Synthesis and folding of a mirror-image enzyme reveals ambidextrous chaperone activity. Proc. Natl Acad. Sci. USA 111, 11679–11684 (2014).
Article CAS PubMed PubMed Central Google Scholar
Xu, W. et al. Total chemical synthesis of a thermostable enzyme capable of polymerase chain reaction. Cell Discov. 3, 17008 (2017).
Article PubMed PubMed Central Google Scholar
Pech, A. et al. A thermostable D-polymerase for mirror-image PCR. Nucleic Acids Res. 45, 3997–4005 (2017).
Article CAS PubMed PubMed Central Google Scholar
Muralidharan, V. & Muir, T. W. Protein ligation: an enabling technology for the biophysical analysis of proteins. Nat. Methods 3, 429–438 (2006).
Article CAS PubMed Google Scholar
Salmond, G. P. C. & Fineran, P. C. A century of the phage: past, present and future. Nat. Rev. Microbiol. 13, 777–786 (2015).
Article CAS PubMed Google Scholar
Goto, Y., Katoh, T. & Suga, H. Flexizymes for genetic code reprogramming. Nat. Protoc. 6, 779–790 (2011).
Article CAS PubMed Google Scholar
Murakami, H., Ohta, A., Ashigai, H. & Suga, H. A highly flexible tRNA acylation method for non-natural polypeptide synthesis. Nat. Methods 3, 357–359 (2006).
Article CAS PubMed Google Scholar
Mijalis, A. J. et al. A fully automated flow-based approach for accelerated peptide synthesis. Nat. Chem. Biol. 13, 464–468 (2017).
Article CAS PubMed Google Scholar
Schnolzer, M. & Kent, S. B. H. Constructing proteins by dovetailing unprotected synthetic peptides: backbone-engineered HIV protease. Science 256, 221–225 (1992).
Article CAS PubMed Google Scholar
Canne, L. E., Bark, S. J. & Kent, S. B. H. Extending the applicability of native chemical ligation. J. Am. Chem. Soc. 118, 5891–5896 (1996).
Article CAS Google Scholar
Brik, A., Yang, Y. Y., Ficht, S. & Wong, C. H. Sugar-assisted glycopeptide ligation. J. Am. Chem. Soc. 128, 5626–5627 (2006).
Article CAS PubMed Google Scholar
Ficht, S., Payne, R. J., Brik, A. & Wong, C. H. Second-generation sugar-assisted ligation: a method for the synthesis of cysteine-containing glycopeptides. Angew. Chem. Int. Ed. 46, 5975–5979 (2007).
Article CAS Google Scholar
Payne, R. J. et al. Extended sugar-assisted glycopeptide ligations: development, scope, and applications. J. Am. Chem. Soc. 129, 13527–13536 (2007).
Article CAS PubMed Google Scholar
Lutsky, M. Y., Nepomniaschiy, N. & Brik, A. Peptide ligation via side-chain auxiliary. Chem. Commun. 10, 1229–1231 (2008).
Article CAS Google Scholar
Hojo, H. et al. The mercaptomethyl group facilitates an efficient one-pot ligation at Xaa-Ser/Thr for (glyco)peptide synthesis. Angew. Chem. Int. Ed. 49, 5318–5321 (2010).
Article CAS Google Scholar
Loibl, S. F., Harpaz, Z. & Seitz, O. A type of auxiliary for native chemical peptide ligation beyond cysteine and glycine junctions. Angew. Chem. Int. Ed. 54, 15055–15059 (2015).
Article CAS Google Scholar

Download references

Acknowledgements

We acknowledge financial support from an ARC Linkage grant (S.K., R.J.P.), and the Northcote Scholarship and John A. Lamberton Research Scholarship for PhD funding (J.S.).

Author information

Authors and Affiliations

School of Chemistry, The University of Sydney, Sydney, NSW, Australia
Sameer S. Kulkarni, Jessica Sayers & Richard J. Payne
Department of Protein and Peptide Chemistry, Novo Nordisk A/S, Måløv, Denmark
Bhavesh Premdjee

Authors

Sameer S. Kulkarni
View author publications
You can also search for this author in PubMed Google Scholar
Jessica Sayers
View author publications
You can also search for this author in PubMed Google Scholar
Bhavesh Premdjee
View author publications
You can also search for this author in PubMed Google Scholar
Richard J. Payne
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

All authors contributed to all aspects of article preparation. S.S.K and J.S. contributed equally to this manuscript.

Corresponding author

Correspondence to Richard J. Payne.

Ethics declarations

Competing interests

The authors declare no competing interests.

PowerPoint slides

PowerPoint slide for Fig. 1

PowerPoint slide for Fig. 2

PowerPoint slide for Fig. 3

PowerPoint slide for Fig. 4

PowerPoint slide for Fig. 5

PowerPoint slide for Fig. 6

PowerPoint slide for Fig. 7

Rights and permissions

Reprints and permissions

About this article

Cite this article

Kulkarni, S., Sayers, J., Premdjee, B. et al. Rapid and efficient protein synthesis through expansion of the native chemical ligation concept. Nat Rev Chem 2, 0122 (2018). https://doi.org/10.1038/s41570-018-0122

Download citation

Published: 29 March 2018
DOI: https://doi.org/10.1038/s41570-018-0122

This article is cited by

Synthetic peptide branched polymers for antibacterial and biomedical applications
- Sadegh Shabani
- Sara Hadjigol
- Greg G. Qiao
Nature Reviews Bioengineering (2024)
Selenium chemistry for spatio-selective peptide and protein functionalization
- Zhenguang Zhao
- Shay Laps
- Norman Metanis
Nature Reviews Chemistry (2024)
Recent advances in chemical protein synthesis: method developments and biological applications
- Suwei Dong
- Ji-Shen Zheng
- Lei Liu
Science China Chemistry (2024)
Synthesis and applications of mirror-image proteins
- Katriona Harrison
- Angus S. Mackay
- Richard J. Payne
Nature Reviews Chemistry (2023)
Insights into the ribosome function from the structures of non-arrested ribosome–nascent chain complexes
- Egor A. Syroegin
- Elena V. Aleksandrova
- Yury S. Polikanov
Nature Chemistry (2023)