S-to-N acyl transfer is a high-yielding chemoselective process for amide bond formation. It is widely utilized by chemists for synthetic applications, including peptide and protein synthesis, chemical modification of proteins, protein-protein ligation and the development of probes and molecular machines. Recent advances in our understanding of S-to-N acyl transfer processes in biology and innovations in methodology for thioester formation and desulfurization, together with an extension of the size of cyclic transition states, have expanded the boundaries of this process well beyond peptide ligation. As the field develops, this chemistry will play a central role in our molecular understanding of Biology.
Chemoselectivity refers to the preferential reaction of a single chemical reagent with one of two or more different functional groups1. One of the most formidable challenges presented to synthetic chemists involves achieving high levels of chemoselectivity and regioselectivity amongst the myriad of reactive functionalities present in biological systems. Despite the onerous challenges, chemists continue to discover and develop robust and efficient chemical ligation methodologies that enable exquisitely high levels of selectivity2,3,4. The chemoselective formation of amide bonds is a critical reaction in biology for protein synthesis and post-translational modification of proteins (PTMs), it is also widely utilized in organic synthesis and medicinal chemistry5,6,7. Amide bond formation requires chemoselective ligations that can furnish specific amide products in the presence of functional groups such as unprotected amines, carboxylic acids and alcohols6,7. The chemoselectivity of the S-to-N acyl transfer process arises from the unique reactivity of thioesters that renders them ideal intermediates for acyl transfer processes. Thioesters have been found to be more reactive then the corresponding oxoesters towards most nucleophiles (with the notable exception of hydroxide and alkoxides). The enhanced reactivity of thioesters may be due to a poor C-S π overlap that lowers the overall thermodynamic stability of the thioester relative to the oxoester8,9,10. Thioesters have been exploited as reactive intermediates by chemists to perform sophisticated ligation reactions under neutral conditions in an aqueous environment and have inspired a recent surge in the development of synthetic and biosynthetic ligation methodologies11,12,13,14,15. Efficient cascade processes involving thioester formation and subsequent acyl transfer to N-, O- or Se- groups are at the core of several critical biological processes14. For example, Acetoacetyl-CoA functions as an efficient acyl transfer reagent in the Krebs cycle16. Thioesters are also key intermediates in protein ubiquitination17,18, intein splicing19,20 and the covalent modification of bacterial cell-surface proteins21,22. Thioesters are formed in transglutamination and in glutathione (GSH) biosynthesis. The abundance of thioester mediated processes in nature has led to speculation regarding the role of these derivatives in the origin of life23. In this review we will discuss the key role of S-to-N acyl transfer in biological systems and recent developments and applications of S-to-N acyl transfer in chemical synthesis (semi-synthesis), chemical biology, supramolecular chemistry and chemical probes. Both inter- and intramolecular acyl transfer reactions will be discussed.
S-to-N acyl transfer reactions in biology
The S-to-N acyl transfer presents an efficient chemoselective ligation process utilized by nature to synthesize amide bonds. Among the most heavily investigated biological processes involving S-to-N acyl transfer are protein ubiquitination, intein-mediated protein splicing, sortase mediated protein modification and transglutamination.
Ubiquitination is an important post-translational modification implicated in several cellular pathways such as protein degradation, inflammatory responses and DNA repair24,25,26. The process of ubiquitination involves formation of an isopeptide bond, a non-canonical amide linkage, between the C-terminal Gly residue of Ubiquitin (Ub) and the ɛ-NH2 of the substrate Lys. Isopeptide bonds formed through lysine residues mediate several critical biological processes through intermolecular cross-linking of proteins27. Ubiquitination is a multi-step process orchestrated by three key enzymes: activating enzyme (E1), conjugating enzyme (E2), and ligase (E3), with the overall process depicted in Fig. 1a (ref. 17). The first step is an ATP-dependent process which involves E1 forming a thioester linkage between a catalytic Cys residue and the Ub C-terminal Gly residue. In the second step, the activated Ub molecule is then transferred to an E2 enzyme via a trans-thiolation process with an E2 Cys residue. In the third step, the E3 ligase catalyses the formation of the isopeptide bond between the Ub C-terminal Gly and the ɛ-amino of the substrate Lys residue through a chemoselective S-to-N acyl transfer28,29,30. There are three classes of E3 ligases (i) RING E3s, which bind to the E2-Ub thioester and substrate and catalyse attack of the substrate lysine on the thioester (ii) HECT and (iii) RBR E3s, which both have active site cysteines and catalyse substrate ubiquitination in a two-step process involving formation of a thioester with the E3 followed by attack of the substrate lysine to form the isopeptide bond. Although the molecular detail of ubiquitination is unknown, it is postulated that RING E3 functions in positioning the reactive thioester of E2 and the ubiquitin C-terminus in the correct orientation to react with the attacking lysine of the substrate. Conserved residues in the active site function to catalyse the S-to-N acyl transfer step17. The conjugation of Ub to a protein may involve the addition of a monomer (monoubiquitination) or as a chain of Ubs of various types (polyubiquitination) with these different modifications producing a variety of molecular signals depending on the nature of the ubiquitination31. Recently it has been reported that the ubiquitin-conjugating enzyme E2(Ube2w) employs a novel mechanism to enable specific ubiquitination of the α-amino group of its substrates. This process involves recognition of backbone atoms of intrinsically disordered N-termini32. Post-translational modification of proteins by the small ubiquitin-related modifier (SUMO) protein proceeds through a ubiquitin-like pathway also involving a critical S-to-N acyl transfer step33.
Protein splicing is an intramolecular reaction of a protein whereby an internal protein segment (intein) is excised from a protein with concomitant ligation of the adjacent C-terminal and N-terminal external proteins (exteins). An intein mediates its own excision from a peptide sequence through a series of acyl transfer reactions20,34. The first step of protein splicing involves an N-to-S acyl transfer between the intein Cys and N-extein producing a thioester intermediate (the N-to-O acyl transfer process involving a Ser residue is also well known for inteins but for the scope of this review we will only consider the Cys mediated process). Although this rearrangement appears to be thermodynamically unfavourable, changes in the molecular architecture of the intein push the equilibrium towards thioester formation35. (The reversibility of the S-to-N /N-to-S acyl transfer is discussed in detail in Box 1). A subsequent trans-thioesterification step transfers the N-extein to the C-extein via a Cys residue. Cyclisation of a conserved asparagine (Asn) residue forming a succinimide intermediate and subsequent S-to-N acyl transfer forms the new peptide bond, completing the protein splicing process. Inteins have been widely exploited for the chemical synthesis of proteins through the formation of reactive thioesters on expressed proteins20.
Sortase transpeptidase enzymes
Sortases are transpeptidase enzymes that covalently modify cell surface proteins of bacteria. They are responsible for the surface display and cell-wall anchoring of proteins encoding the sortase recognition motif LPXTG22. They are classified into four groups (sortase A–D) based on sequence homology, the substrate for sortase cleavage and the nucleophile that reacts with sortase thioester. Sortase class A enzymes, found in all Gram-positive bacteria are commonly referred to as housekeeping sortases. They recognize the amino acid (AA) sequence LPXTG at the carboxyl terminus of surface protein precursors. The products of sortase class A reactions are surface proteins that are covalently linked to lipids and subsequently incorporated into the cell-wall envelope36. Sortase class B enzymes recognize a unique NP(Q/K)TN sorting signal in proteins and are involved in haem-iron scavenging, and in cross-linking anchored haem-containing products near membrane transporters. The class C sortases polymerize pili by catalysing transpeptidation reactions forming covalent bonds between individual pili subunits. Sortase class D enzymes are expressed in a variety of spore-forming microorganisms; they function by recognizing and cleaving the sorting signals of selected substrates to immobilize anchored products in the cell wall envelope during spore formation. All classes of sortases recognize a specific sorting signal at the C terminus of their target protein. The specificity of cleavage is determined by recognition of sortase-specific sorting signals; each sortase cleaves its specific sorting signal and subsequently forms a thioester bond between the sortase active site and a residue in the sorting signal. This thioester intermediate then undergoes a chemoselective S-to-N acyl transfer onto a specific nucleophilic amino acid, thus specifying the cell wall component to which the protein becomes ligated22.
Transglutaminase enzymes (TGs) are transferases that catalyse the calcium dependant formation of inter- or intramolecular isopeptide bonds between proteins by crosslinking protein bound glutamine and lysine amino acids37. TGs are homologous to the papain family of proteases with which they share significant active site sequence and structural similarities38. During transglutamination, a glutamine side chain on a substrate protein is attacked by the active site Cys of a transglutaminase enzyme with formation of a thioester intermediate. The TG then mediates S-to-N acyl transfer to a Lys residue on a second substrate molecule with formation of an isopeptide bond. TGs exhibit remarkable specificities not only for the Gln site of hydrolysis, but also for the Lys residue of proteins with which they react. Recent studies suggest that the specificity of Lys residues TGs is not encoded in the primary amino acid sequence surrounding the target Lys but is primarily dependant on its positioning in an accessible segment of the protein structure38.
The first step in glutathione (GSH) biosynthesis is formation of γ-glutamyl cysteine by the enzyme glutamate-cysteine ligase (GshA), however, in the case of bacteria lacking GshA it has been proposed that formation of γ-glutamyl cysteine occurs via an S-to-N acyl transfer pathway involving the selective interception of ProB-bound γ-glutamyl phosphate by cysteine residues39.
These biological examples of chemoselective amide bond formation clearly demonstrate the synthetic potential of the S-to-N acyl transfer process. These processes are multistep and involve the formation of a suitably labile thioester, followed by correct positioning of both the thioester and amine nucleophile in the active site. Ligation reactions are typically catalysed by AA residues located at the active site40. Not only do these biological processes highlight the synthetic potential of the S-to-N acyl transfer, they also serve as inspiration for synthetic molecular processes utilizing S-to-N acyl transfer as a key step. A key requirement for efficient S-to-N acyl transfer is access to a suitable thioester and a number of elegant methods for their synthesis via synthetic and biological approaches have been reported, including pushing the S-to-N acyl transfer process into reverse (Box 1)13,41,42,43. A common synthetic method employed for the preparation of thioesters involves coupling of the C-terminus of a protected peptide/glycopeptide with benzylthiol using PyBOP/DIPEA as coupling reagents (to avoid racemization)44. Wong and co-workers45 have reported solid-phase synthesis of peptide and glycopeptide thioesters using a side chain anchoring strategy. In this approach, the C terminus is coupled with either a thiol or a thioester AA residue to furnish the peptide thioester which is subsequently released from the solid phase45. Dawson and co-workers46 have reported the synthesis of thioesters through formation of a C-terminal N-acylurea moiety. Following SPPS an aminoanilide undergoes specific acylation and cyclization to furnish the resin-bound acylurea peptide. Following cleavage from the resin and global deprotection, the peptide N-acylurea can undergo thiolysis to yield the peptide thioester46. Liu and co-workers47 have reported the use of peptide hydrazides as thioester surrogates in NCL. The peptide hydrazides are prepared through SPPS and subsequent NaNO2 activation and thiolysis on treatment with 4-mercaptophenylacetic acid (MPAA), furnishes a peptide thioester47. Recently Chatterjee and co-workers have reported the application of an N-mercaptoethoxyglycinamide (MEGA) solid-phase linker for the facile synthesis of peptide α-thioesters. Thioester formation occurs from the N-oxyamide via an N-to-S acyl shift through a 6-membered cyclic intermediate48. Several other efficient methods for thioester synthesis have been developed.
Synthetic applications of S-to-N acyl transfer
A broad range of synthetic applications have been developed that utilize the S-to-N acyl transfer as a key step to introduce an amide bond. Particular focus has been placed on the synthesis of peptide bonds through Native Chemical Ligation (NCL) and related methodologies.
The most heavily investigated and widely utilized synthetic application of S-to-N acyl transfer is undoubtedly native chemical ligation. NCL was discovered by Kent and co-workers as an efficient methodology for peptide ligation49. The process gained widespread attention with the reported synthesis of human interleukin-8 (IL-8) by Kent and co-workers50, a major synthetic achievement which served to highlight the synthetic potential of NCL for chemical protein synthesis. The reaction involves an N-terminal cysteine and a C-terminal thioester reacting in a reversible trans-thioesterification step to form a thioester intermediate. This intermediate can subsequently rearrange via an intramolecular S-to-N acyl transfer through a 5-membered transition state, to furnish a native peptide bond at the ligation site (Fig. 2). NCL has revolutionized the total chemical synthesis of peptides and proteins as it enables the chemoselective ligation of two mutually reactive peptide entities in aqueous solution at neutral pH to form a single product in near-quantitative yields. Accordingly, NCL has become one of the most commonly employed methods for peptide ligation. Following a number of substrate studies, the steric nature of the C-terminal thioester was found to be the most influential factor in NCL. Ligations occur faster at less hindered amino acids, such as Gly or Ala. Dawson and co-workers explored the scope of the NCL process and demonstrated that ligations were possible using all 20 amino acids, however, ligations sites involving valine (Val), isoleucine (Ile), and proline (Pro) were less favourable due to slower ligation rates51. Alkyl thioesters can easily be prepared by SPPS but are relatively unreactive in NCL, with time periods of 24–48 h required for the trans-thioesterification step to reach completion49. The initial rate-determining trans-thioesterification step can be accelerated through the addition of an exogenous aryl thiol. Generally, aryl thioesters are more reactive than alkyl thioesters due to their lower pKa values which make them better leaving groups, therefore, addition of an aryl thiol can promote the in situ formation of a more reactive thioester thus accelerating the rate of ligation52,53. Payne and co-workers54 have demonstrated that trifluoroethanethiol TFET can be efficiently employed as an additive in NCL to enable ligations with rates comparable to those obtained with aryl thioesters. This reagent was demonstrated to be efficient in ligation−desulfurization chemistry for protein synthesis without the requirement for intermediate purification or removal/capture from the reaction mixture.
The broad application of NCL has elevated it to a privileged position among chemical methods for peptide, protein and glycopeptide synthesis and it serves to highlight the synthetic potential of S-to-N acyl transfer in chemoselective ligations. Some spectacular synthetic achievements have been reported, in particular for the chemical synthesis of proteins (see applications of NCL for the synthesis of proteins). Using SPPS to assemble the two peptide entities, NCL was used in a convergent manner to allow access to peptides and proteins that were previously inaccessible by chemical synthesis approaches, with the synthesis of a HIV-1 protease covalent dimer being one of the most impressive examples reported to date55. This example, reported by Kent and co-workers55, is a 203-residue polypeptide and is one of the largest linear polypeptides prepared by chemical synthesis.
Despite its utility, certain limitations of NCL have persisted. An ongoing criticism of NCL is the requirement for an N-terminal Cys at the ligation site. This limitation is coupled with the fact that Cys is not a very abundant amino acid in nature (∼1.7% in human proteins) and is seldom suitably distributed throughout a peptide sequence so as to facilitate NCL11,13. Furthermore, NCL in its original form is not fully convergent with multiple peptide fragments being joined in a linear fashion from the C-terminus to the N-terminus. However, numerous synthetic advancements have been reported to address these limitations, including desulfurization and Auxiliary Mediated Ligation (AML), Fig. 2.
One of the most important synthetic advancements in overcoming the requirement for Cys at the ligation site was first reported by Dawson and co-workers and involves the use of NCL coupled with desulfurization56. This concept was initially proved using Ala residues where a Cys residue was mutated to Ala after NCL. Desulfurization from Cys to Ala was promoted using Raney nickel as a reagent. This allowed for ligation at Xaa–Ala sites which are naturally more abundant than Cys. Inspired by this initial work, several groups investigated and extended this strategy to other Cys-surrogates, in the form of mercaptoamino acids whereby a thiol group is incorporated into amino acids, making ligation at Phe, Lys, and Val possible, for example, refs 57, 58, 59. Indeed, it is now possible to perform ligations at 15 out of the 20 natural amino acids60,61. Figure 3 shows the structures of a range of thiol containing amino acids that have been prepared and investigated for NCL. Generally, mercaptoamino acids are built off the side-chains to preserve stereochemistry at the alpha carbon. The synthesis of mercaptoamino acids typically involves installation of the key thiol auxiliary unit into a natural amino acid and usually requires a multistep synthesis61. Only the β-thiol valine (penicillimine) and γ-thiol proline derivatives are commercially available. Several impressive total-syntheses of proteins were achieved through the ligation–desulfurization approach, however, certain limitations of this methodology also exist. The method requires access to specific thiol-modified amino acids for every ligation junction. The limited commercial availability of thiolated amino acid building blocks presents a challenge for their general application in protein synthesis. Although the desulfurization strategy reported by Dawson represented a major milestone in the use of NCL at non-Cys ligation sites, problems such as low product recoveries and non-selective reduction of thioesters and thioethers led to the development of free-radical desulfurization methods. Hoffman and co-workers62 were the first to disclose a desulfurization reaction between thiol and trialkylphosphite derivatives under thermal and photochemical conditions. Shortly thereafter, Walling and Rabinowitz63 proposed the mechanism of the desulfurization. It was proposed that a reactive alkylthiyl radical adds to the phosphite generating a phosphoranyl radical intermediate which can subsequently eliminate to an alkyl radical. Hydrogen abstraction from the parent thiol can furnish the alkane product while also propagating the chain reaction. This approach was later modified by Danishefsky and co-workers resulting in a more environmentally friendly dethiylation method for use in NCL64. They used radical initiator 2,2′-azobis[2-(2-imidazolin-2-yl)propane]dihydrochloride (VA-044) and tris(2-carboxyethyl)phosphine) (TCEP), a widely used disulfide reducing agent in NCL, as the phosphine source. On heating in aqueous solution, the initiator decomposes to give 2-isopropyl-4,5-dihydro-1H-imidazole radical initiator which on reacting with Cys generates a thiol radical. The thiol radical can then react to form a phosphine radical intermediate which then undergoes C-S bond scission to generate an alkyl radical which can subsequently be quenched by addition of a thiol additive, such as t-butyl mercaptan (t-BuSH), through hydrogen abstraction11. This mild process furnished the desired products with yields in excess of 80% and the method was compatible with a variety of functionalities such as methionine, thioesters, and thiazolidine protected Cys residues. Recently, Guo and co-workers have reported a photocatalytic approach to desulfurization employing a ruthenium catalyst in the presence of a phosphine reagent. The methodology was successfully applied to peptide, cyclic peptide and glycopeptide synthesis65. The use of visible light offers advantages over UV initiated methods since it is known that exposure to UV can damage protein structures66.
Auxiliary mediated ligation
An alternative approach to overcome the requirement of Cys for peptide ligation is auxiliary mediated ligation (AML), where an auxiliary-modified peptide is used directly in NCL (Fig. 2c). The auxiliary should facilitate S-to-N acyl transfer to form the native peptide bond and then be removed quantitatively under mild conditions. A wide range of auxiliaries with varying cleavage conditions have been developed. In early examples, the α-amino group of the peptide was used to anchor the thiol auxiliary but this approach often results in low conversion due to steric hindrance and has been mainly limited to ligations containing at least one Gly residue. A more general approach involves anchoring the auxiliary onto a side-chain functional group which is subsequently converted back into the native AA following ligation. Using this approach, the ligation has been reported to proceed efficiently via S-to-N acyl transfer through five-, six- or even larger-membered cyclic transition states. Examples of commonly utilized auxiliaries developed are shown in Fig. 4. Auxiliary approaches are usually built off Gly because of the lack of stereochemistry and simplicity of the submonomer approach for synthesis of the relevant peptoids.
The earliest examples of auxiliary mediated ligation were reported by Kent and co-workers67,68 with the oxyalkyl auxiliary on the amide bond removed by treatment with Zn under acidic conditions to give a native peptide bond. Danishefsky and co-workers developed a cysteine-free NCL approach suitable for the synthesis of N- and O-linked glycopeptides, inspired by the auxiliary methodology reported by Dawson69,70. Aimoto and co-workers reported a mild, photocleavable auxiliary compatible with polypeptide synthesis71. Sugar assisted ligation (SAL), is a form of AML introduced and developed by the Wong group72,73. The thiosugar auxiliary is prepared as a glycosyl amino acid building block and installed during SPPS near the ligation site, where it mediates amide bond formation through an initial thiol-exchange with a peptidyl thioester. The thiol auxiliary can be subsequently removed using established desulfurization methods56. A key difference between SAL and conventional NCL is the size of the cyclic transition state with SAL proceeding via a 14-membered transition state (a detailed discussion on the size of cyclic transition states in S-to-N acyl transfer is provided in Box 2). Consequently, SAL reactions tend to be slower when compared with NCL which proceeds through a 5- or 6-membered cyclic transition state74,75. However, the large transition state in SAL is suggested to be facilitated by the sugar moiety which acts as a rigid scaffold for the reacting groups. As with NCL, the steric nature of the thioester component is the most influential factor in SAL efficiency with residues containing Pro, Val, Iso, Leu, and Thr known to be particularly difficult51,74. To achieve ligation efficiency, it was found that the ligation site must include at least one Gly which may limit the general applicability of SAL in glycopeptide synthesis72. In a related approach, Brik and co-workers reported the application of a modified cyclohexane auxiliary as a scaffold to mimic the role of the glycan in SAL and promote efficient ligation76. Muir and co-workers reported a photocleavable auxiliary suitable for protein ubiquitination. The auxiliary handle was used to direct site-specific ubiquitylation at one of three Lys residues in the target protein77. Seitz and co-workers have recently reported an auxiliary that enables ligation at sterically demanding junctions. The auxiliary can be cleaved under mild basic conditions and is compatible with SPPS78. It is clear from these and other studies that the auxiliary mediated approach offers a useful extension of NCL and can be applied to the total synthesis of complex functionalized proteins.
Kinetically controlled ligation and sequential NCL
The issue of the lack of convergence in NCL was tackled by Kent and co-workers79 who reported Kinetically Controlled Ligation (KCL). KCL exploits the difference in reactivity between alkyl and aryl thioesters, allowing the pre-organisation of the order of NCL reactions. By conducting a one-pot experiment, they demonstrated that the more reactive aryl peptide thioester is ligated with a Cys peptide bearing a less reactive alkyl thioester with the alkyl thioesters remaining intact until further activation80. This approach allows for selective peptide elongation in both the N- and C-terminal direction. The utility of KCL was demonstrated through the synthesis of crambin, a 46-amino acid protein, by a fully convergent synthesis involving six peptide segments and only two protecting groups81. KCL was explored in greater detail by Lee and co-workers82. They demonstrated that optimal KCL can be achieved through combination of a reactive AA such as Gly or Ala with a less reactive AA such as Val. As discussed previously, fluorinated alkyl thiol additives have been used to enable one-pot ligation-desulfurization chemistry for efficient protein synthesis54. The reactivity of the fluorinated alkyl thioester was found to be comparable to that of the aromatic derivative, rendering it extremely useful for KCL. In a recent innovation, Payne and co-workers have dramatically reduced the time required for NCL through the reaction between peptide selenoesters and peptide dimers bearing N-terminal selenocystine. The process proceeds in aqueous buffer to afford native amide bonds without the requirement of additives. The resulting selenocysteine can be deselenized to afford alanine residue at the ligation site83.
The desire to combine recombinant proteins with established NCL methods led to the development of expressed protein ligation (EPL) by Muir and co-workers in 1998 which is based on the natural occurring protein splicing mediated by inteins84. EPL vastly expands the size limitation and scope of protein synthesis by combining NCL with recombinant protein production. This can be accomplished through ligation of a recombinant protein thioester with a synthetic peptide containing an N-terminal Cys (Fig. 5). The protein of interest is expressed N-terminally to an intein domain which is bound through its C-terminus to a solid support. The intein is modified through mutation of the conserved Asn residue halting C-terminal cleavage and splicing. An initial N-to-S acyl transfer generates a thioester intermediate which undergoes trans-thioesterification with an exogenous thiol. Subsequent filtration releases the desired recombinant protein thioester which can then participate in NCL with synthetic cysteinyl peptides34,85. The accessibility of recombinant products of hybrid biological and chemical origin has enabled EPL to emerge as a powerful tool in protein engineering through the introduction of unnatural amino acid sequences, various PTMs, fluorophores, or reactive synthetic handles allowing for further modification or surface immobilization86,87,88,89. The applicability of EPL is well illustrated in the production of fluorescent enzyme biosensors. An example reported by Muir and co-workers demonstrated the incorporation of two fluorophores within Crk-II, a cAbl protein tyrosine kinase substrate involved in several signalling pathways, using EPL. This enabled real-time monitoring of protein phosphorylation and kinase activity, using fluorescence resonance energy transfer (FRET) measurements90. Recently butelase mediated ligations have been reported by Tam and co-workers as an alternative to EPL91. Butelase 1 is a peptide ligase utilized for high-efficiency cyclization and ligation of peptides and proteins ranging in size from 10 to >200 residues. Both inter- and intramolecular ligation pathways proceed via an S-to-N acyl transfer mediated by a butelase thioester intermediate. Sortase mediated ligations have been employed for the site-specific modification of proteins with fluorophores, biotin, proteins or lipids. This approach involves engineering of target protein with a sortase-recognition motif (LPLPXTG) at the site where modification is desired. Subsequent treatment with sortase cleaves the protein between the threonine and glycine residues, facilitating the attachment of an exogenously added oligoglycine peptide modified with the functional group of choice21.
NCL, in combination with related cysteine-free ligation methods and EPL, represents one of the most convenient and popular methods for the synthesis of peptides and proteins of biological and therapeutic interest. The synthetic potential of the methodology is abundantly clear from the numerous reported total-syntheses of proteins that utilize this approach. An impressive array of complex peptides and proteins have been successfully synthesized by NCL including the sialic acid-binding lectin siglec-7 (127 AA)92, human interleukin-6 glyoprotein (183 AA)93, and the pore-forming antimicrobial protein caenopore-5 (82 AA)94. In a ground-breaking study, Valiyaveetil and co-workers employed a semisynthetic approach combining EPL and SPPS to prepare a K+ channel containing the D-enantiomer of alanine in place of a conserved glycine residue. X-ray crystallography structures of the semisynthetic protein demonstrated that the ability of the channel to adapt its structure differently for K+ and Na+ ions is a fundamental aspect of ion selectivity95. A detailed discussion of these synthetic endeavours is beyond the scope of this review but readers are referred to a number of excellent recent reviews11,96,97,98,99. Of particular interest for applications in chemical and molecular biology are proteins displaying defined PTMs such as glycosylation that can be prepared using this approach. The first major example of a glycoprotein prepared through NCL was diptericin, an 82-residue O-linked antimicrobial, reported by Bertozzi and co-workers100. Since then, the construction of glycopeptides and glycoproteins using NCL has continued with the synthesis of EPO being one of the most impressive examples to date (Fig. 6). EPO is a glycoprotein hormone that stimulates the production of red blood cells and is administered as a therapeutic for anaemia. It represents a substantial synthetic challenge with it comprising of 166 amino acids and bearing four glycosylation sites: three N-linked to Asn residues and one O-linked to a Ser residue. To further complicate the synthesis, EPO only contains four Cys residues, all of which are at unfortunate positions (7, 29, 33, 161), rendering them unsuitable for NCL. Nevertheless, this glycoprotein was chemically synthesized by Danishefsky and co-workers101 using NCL and the desulfurization of Cys to Ala as the key strategy. Kajihara and co-workers have recently reported the synthesis of five homogeneous glycoforms of EPO to probe the biological function of the glycan units102. Before this work, Kent and co-workers had reported the chemical synthesis of a homogeneous, polymer-modified erythropoiesis protein (SEP)103. This protein construct displayed potent biological activity in cell and animal assays for erythropoiesis and had significantly prolonged in vivo half-life. The polymer conjugated EPO derivative reported by Kent is arguably more readily synthetically accessible then the glycosylated versions of Danishefsky/Kajihara and serves to illustrate the power of chemical synthesis for the preparation of improved protein therapeutics.
Due to the critical importance of ubiquitination in a range of human diseases, suitable methodologies to prepare practical quantities of homogeneous ubiquitinated proteins are essential to enable detailed biochemical studies and to develop new therapeutics. Methodologies that forge the native isopeptide bond are of particular interest for further biological studies. Both synthetic and enzymatic strategies have been investigated for the preparation of ubiquitinated proteins31,104,105. Although enzymatic approaches are attractive, they require the identification and isolation of the corresponding E2 and E3 ligases associated with ubiquitination of the target proteins, which can be extremely challenging106,107. Chemical ubiquitination of proteins enables the preparation of homogeneous protein constructs with defined ubiquitination patterns24,108. Current synthetic methods to introduce native Ub linkages are driven primarily by NCL and EPL approaches105,109. However, limitations for both of these approaches exist. Cysteine is never found near ubiquitination sites in proteins due to the potential for interference with the enzymatic thioester transfer step, therefore, it must be introduced synthetically or enzymatically for NCL/EPL approaches110. Also, formation of the required thioester linkage can be slow and low yielding, in particular for the sterically constrained isopeptide bond of ubiquitination31,111,112. The use of cysteine surrogates such as mercaptolysine has been advanced to overcome the limitations of NCL but these ligations require a chemical desulfurization step to furnish the native bond59 (Fig. 7a). Surrogate mediated coupling reactions are also typically slow and are limited to relatively unhindered ligation sites, limiting the scope of the ligation methodology. Recently, Chatterjee and co-workers have reported a highly innovative approach towards native chemical ubiquitination involving the ligation auxiliary 2-aminooxyethanethiol. Following protein ligation through transthioesterification and S-to-N acyl transfer, the auxiliary is cleaved via homolysis of chemically labile N–O bond to furnish the native Ub linkage. This novel approach was applied to the chemical synthesis of full-length sumoylated histone H4 and histone H2B (ref. 113).
In a seminal study, Muir and co-workers demonstrated that chemical ubiquitination of histone H2B induces direct stimulation of K79-specific methyltransferase hDot1L. Using the photolytic ligation auxiliary developed by the group (Fig. 4), two traceless orthogonal expressed protein ligation (EPL) reactions were used to ubiquitinate H2B site-specifically. Reconstitution of the synthetic uH2B into chemically defined nucleosomes revealed that uH2B directly activates methylation of histone H3 at lysine K79 by methyltransferase hDot1L (ref. 114).
Disulfide-directed ubiquitination has been employed for the site-specific ubiquitination of histone H2B and proliferating cell nuclear antigen (PCNA)115,116. In this semisynthesis approach, the recombinant Ub-intein conjugate was incubated with cysteamine which undergoes intein-mediated transthioesterifcation followed by S-to-N acyl transfer to furnish the Ub bearing a C-terminal aminoethanethiol linker. For H2B, the thiol was subsequently activated by reaction with 2,2′-dithiobis(5-nitropyridine), (DTNP) to generate the mixed disulfide reagent. The Ub-thiol derivative and H2B disulfide were allowed to react at pH 6.9 to furnish the disulfide-linked ubiquitylated histone uH2B. In the PCNA system, Ellman's reagent was employed to form the mixed disulfide reagent. Pratt and co-workers117 employed disulfide-directed ubiquitination to prepare site-specifically ubiquitin modified α-synuclein derivatives. They demonstrated that different ubiquitination sites have differential effects on α-synuclein aggregation.
In addition to chemical monoubiquitination of proteins, methodologies have also been investigated for the preparation of various poly-Ub substrates24,105. These strategies involve the chemical ligation of Ub proteins to form polymers before ligation of the poly-Ub onto the protein of interest. One approach that has been investigated for the preparation of Ub polymers is GOPAL (genetically encoded orthogonal protection and activated ligation), by Chin and co-workers118. In this approach, the acceptor ubiquitin features a chemically protected Lys residue that is encoded by genetic code expansion. Subsequent orthogonal protection of all other amine groups and chemoselective deprotection of the encoded Lys residue generates an ubiquitin with a single acceptor Lys that is ligated to the thioester of the donor ubiquitin via an S-to-N acyl transfer, thus forming an isopeptide bond. Deprotection of all remaining amine groups and refolding of the diubiquitin results in native diubiquitin chains. This process is somewhat limited in scope due to the requirement for significant protecting group manipulations. In a seminal study, highlighting the potential of chemical synthesis towards polyubiquitination, Brik and co-workers119 prepared α-Syn modified at K12 with poly-Ub (K48 linked chains). This study employed an EPL approach and was synthetically very challenging due to the requirement for desulfurization of the residual thiol at each step. Indeed, this study highlighted some of the potential problems associated with the EPL approach to protein ligation. For example, the attempted ligation reaction between the synthetic Ub-tetramer and α-Syn failed to furnish any of the desired product due to the complete absence of any trans-thioesterification. In addition, an attempt to carry out sequential EPL ligation steps followed by a global desulfurization failed to give any of the desired product. The target tet-Ub-α-Syn protein construct was eventually prepared through two sequential ligations of di-Ub with α-Syn. Despite the challenges, this approach highlights the potential of S-to-N acyl transfer as a key ligation step in protein oligomer synthesis.
Recently, Guo and co-workers have reported an N-glycosylation reaction between glycosylamines and p-nitrophenyl thioester peptides involving an intermolecular S-to-N acyl transfer as the key step120. The methodology was found to be compatible with fully unprotected glycans. One disadvantage of the methodology is the high reactivity of the thioester, required to promote the intermolecular S-to-N acyl transfer, renders the process incompatible with aqueous conditions where hydrolysis of the thioester competes with the desired N-glycosylation reaction. The intermolecular approach is also limited in terms of chemoselectivity with participating side-reactions from lysine residues. Payne and et al.121 have reported an intermolecular approach to peptide ligation and glycopeptide synthesis. Hydrolysis of the thioester was minimized through the use of a mixed-solvent system containing N-methylpyrrolidinone (NMP) and guanidine hydrochloride (Gn.HCl)/HEPES buffer. The intermolecular process was found to be compatible with all amino acid residues with the exception of lysine that required protection as the lysine(ivDde) group. Kinetic studies comparing the rates of NCL versus the direct intermolecular ligation showed that although NCL is faster than direct aminolysis, intermolecular ligation reactions still proceeded at synthetically useful rates. This finding suggests that some ligation reactions proposed to occur via large-ring transition states, such as SAL, may actually proceed through direct intermolecular aminolysis121 (Fig. 7b).
Recently, Devaraj and co-workers122 have employed an NCL approach to prepare phospholipids spontaneously from thioesters. The chemoselectivity of the intramolecular S-to-N acyl transfer enables the preparation of amidophospholipids, which self-assemble in situ to form membrane bound vesicles. In this system, the NCL process is employed as a non-enzymatic method to drive the self assembly of phospholipid membranes in a manner analogous to the natural lipid synthesis process. The approach was further developed to demonstrate in situ formation of phospholipid membranes and concomitant spontaneous reconstitution of functional membrane proteins123 (Fig. 7c).
Applications in supramolecular chemistry and chemical probes
The chemoselective nature of the S-to-N acyl transfer process and the requirement for mild or neutral reactions conditions renders it an ideal ligation reaction for applications in supramolecular chemistry and for the design of chemical probes.
The ability to control chemoselectivity over large ring sizes lends itself to applications in supramolecular chemistry and in particular to the development of molecular machines. Indeed, with the award of the 2016 Nobel prize in chemistry to Sauvage, Stoddard and Feringa ‘for the design and synthesis of molecular machines’ it is anticipated that there will be a surge of interest in reversible chemoselective ligation reactions. The S-to-N acyl transfer has been employed in a small molecule machine, developed by Leigh and co-workers that mimic ribosomal protein synthesis124. A rotaxane ring tethered onto a molecular axel carries a thiolate group that iteratively ligates amino acid residues through an NCL process. The critical peptide bond forming reactions occur through an intramolecular S-to-N acyl transfer over a gradually increasing cyclic transition state of 11-, 14- and 17-membered rings. The sequential O-to-S/ S-to-N acyl transfer/catalyst regeneration/ring movement process highlights the power of sequential chemoselective ligation reactions for the development of sophisticated molecular machines that can mimic complex biological processes (Fig. 8a). Stulz and co-workers125 have demonstrated that DNA can be employed to template acyl transfer reactions between thioester modified oligonucleotide and a series of amine and thiol based nucleophiles.
The intramolecular S-to-N acyl transfer offers a chemoselective process that can be applied to the development of sophisticated biological probes suitable for proteomics and imaging. Of particular interest for biological target discovery is the development of Ubiquitin protease probes pioneered by Ploegh and co-workers126. Probes for deubiquitinating enzymes (DUBs) involving a native chemical ligation handle have been developed by Ovaa and co-workers127. In this approach, native chemical ligation between a Ub thioester and a γ- or δ-thiolysine residue was employed to forge the native isopeptide bond. The subsequent desulfurization step was used to introduce an electrophilic group that can capture the catalytic Cys residue of a DUB. A diubiquitin-based FRET probe that involves an S-to-N acyl transfer to ligate the FRET pair has also been reported128. Strongin and co-workers129 have utilized an intramolecular S-to-N acyl transfer as a key step in the development of a dual emission fluorescent probe that can distinguish between glutathione and cysteine/homocysteine. The probe design includes a reactive thioester moiety that undergoes thioester exchange in the presence of cysteine/homocysteine or glutathione. For cysteine/homocysteine this was followed by an intramolecular S-to-N acyl transfer over a 5- or 6-membered transition state, triggering a spirocyclization that led to the ‘quinone–phenol’ transduction of rhodol dyes, and an excited-state intramolecular proton transfer (ESIPT) process. In the case of glutathione, only the initial transthioesterification took place. This removed the intramolecular photo-induced electron transfer (PET) process caused by the electron deficient 4-nitrobenzene moiety and resulted in a large fluorescence enhancement at the rhodol emission band. No S-to-N acyl transfer was observed since the transfer would involve a ten-membered cyclic transition state which is disfavoured (Fig. 8b) (a detailed discussion on the size of cyclic transition states in S-to-N acyl transfer is provided in Box 2). A related strategy was reported by Guo and co-workers using a coumarin hemicyanine fluorescent dye130. In this system, only cysteine and no homocysteine or glutathione could participate in the S-to-N acyl transfer to activate the probe. The authors proposed that for homocysteine, following trans-thioesterification, the carboxylate anion could bind more tightly to the benzothiazolium N atom, thereby inhibiting the subsequent intramolecular S-to-N acyl transfer. These examples highlight the application of S-to-N acyl transfer in the development of selective chemical probes.
Outlook and future directions
The unique reactivity of thioester moieties renders them versatile substrates for a wide range of chemoselective processes. Tremendous advancements have been made in exploiting the synthetic potential of both inter- and intramolecular S-to-N acyl transfer process for a diverse range of applications. The application of small ring-size transition states (<6) remains the most general and robust approach for effecting efficient intramolecular acyl transfer reactions, but applications of larger ring size TS are rapidly becoming more widely utilized. The elucidation of precise, chemoselective acyl transfer reactions in nature serves as inspiration for the development of more sophisticated synthetic acyl transfer processes and in combination with chemoenzymatic/semi-synthesis approaches there appears to be few limitations on the synthetic scope of these processes. For chemical protein synthesis, the ‘cysteine problem’ associated with NCL has largely been ameliorated through the development of methods for cysteine-free ligation and desulfurization, however a general approach for traceless protein-protein ligation via NCL remains to be developed. Despite all of the powerful advances in NCL and the wide range of ligation auxiliaries now available, significant challenges still remain in incorporating these handles into full-length proteins, applying them under native folded conditions and subsequently removing them without requiring refolding as the final step. This remains an enormous challenge, which chemical synthesis alone cannot overcome. Therefore, a combination of amber suppression and bioorthogonal chemistry will be required to truly harness the power of chemistry in the semisynthesis of functional proteins. The synthesis of thioesters has been given a considerable boost through advanced synthetic methods and in particular through the introduction of intein-mediated ligation. Nevertheless, the development of novel chemical and enzymatic methods for the formation of thioesters in the presence of fully unprotected complex peptide and protein systems will continue to be an important area of research for the development of the field. The execution of chemoselective S-to-N acyl transfer reactions across large ring-systems remains a considerable challenge but supramolecular templating approaches and the use of natural/modified ligase enzymes offer intriguing possibilities for site-specificity in these reactions. No doubt, the continued discovery of novel S-to-N acyl transfer processes in nature and the harnessing of biological machinery will continue to stimulate research in this field. It is clear that the applications of S-to-N acyl transfer have expanded well beyond the traditional applications of peptide ligation and based on the high chemoselectivity of the process it is anticipated that this trend will continue. The ability to chemically introduce site-specific isopeptide bonds is likely to revolutionize the molecular analysis of biological systems. There is little doubt that the S-to-N acyl transfer will remain a focus of both chemists and biologists for the foreseeable future.
How to cite this article: Burke, H. M. et al. Exploring chemoselective S-to-N acyl transfer reactions in synthesis and chemical biology. Nat. Commun. 8, 15655 doi: 10.1038/ncomms15655 (2017).
Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Trost, B. M. Selectivity: a key to synthetic efficiency. Science 219, 245–250 (1983).
Sletten Ellen, M. & Bertozzi Carolyn, R. Bioorthogonal chemistry: fishing for selectivity in a sea of functionality. Angew. Chem. Int. Ed. 48, 6974–6998 (2009).
Patterson, D. M., Nazarova, L. A. & Prescher, J. A. Finding the right (bioorthogonal) chemistry. ACS Chem. Biol. 9, 592–605 (2014).
Saito, F., Noda, H. & Bode, J. W. Critical evaluation and rate constants of chemoselective ligation reactions for stoichiometric conjugations in water. ACS Chem. Biol. 10, 1026–1033 (2015).
Goswami, A. & Van Lanen, S. G. Enzymatic strategies and biocatalysts for amide bond formation: tricks of the trade outside of the ribosome. Mol. BioSyst. 11, 338–353 (2015).
Pattabiraman, V. R. & Bode, J. W. Rethinking amide bond synthesis. Nature 480, 471–479 (2011).
deFigueiredo, R. M., Suppo, J.-S. & Campagne, J.-M. Nonclassical routes for amide bond formation. Chem. Rev. 116, 12029–12122 (2016).
Yang, W. & Drueckhammer, D. G. Understanding the relative acyl-transfer reactivity of oxoesters and thioesters: computational analysis of transition state delocalization effects. J. Am. Chem. Soc. 123, 11004–11009 (2001).
Erben, M. F., Boese, R., Della Vedova, C. O., Oberhammer, H. & Willner, H. Toward an intimate understanding of the structural properties and conformational preference of oxoesters and thioesters: gas and crystal structure and conformational analysis of dimethyl monothiocarbonate, CH3OC(O)SCH3. J. Org. Chem. 71, 616–622 (2006).
Deakyne, C. A. et al. Energetics of the lighter chalcogen analogues of carboxylic acid esters. J. Phys. Chem. B 114, 16253–16262 (2010).
Bondalapati, S., Jbara, M. & Brik, A. Expanding the chemical toolbox for the synthesis of large and uniquely modified proteins. Nat. Chem. 8, 407–418 (2016).
Chen, M., Heimer, P. & Imhof, D. Synthetic strategies for polypeptides and proteins by chemical ligation. Amino Acids 47, 1283–1299 (2015).
Macmillan, D. Evolving strategies for protein synthesis converge on native chemical ligation. Angew. Chem. Int. Ed. 45, 7668–7672 (2006).
McGrath, N. A. & Raines, R. T. Chemoselectivity in chemical biology: acyl transfer reactions with sulfur and selenium. Acc. Chem. Res. 44, 752–761 (2011).
Bode, J. W. Reinventing amide bond formation. Top. Organomet. Chem. 44, 13–34 (2013).
Kay, J. & Weitzman, P. D. J. in Krebs citric acid cycle-half a century and still turning Vol. 54 (Biochemical Society Symposia, 1987).
Berndsen, C. E. & Wolberger, C. New insights into ubiquitin E3 ligase mechanism. Nat. Struc. Mol. Biol. 21, 301–307 (2014).
Plechanovova, A. et al. Mechanism of ubiquitylation by dimeric RING ligase RNF4. Nat. Struc. Mol. Biol. 18, 1052–1059 (2011).
Noren, C. J., Wang, J. & Perler, F. B. Dissecting the chemistry of protein splicing and its applications. Angew. Chem. Int. Ed. 39, 450–466 (2000).
Eryilmaz, E., Shah, N. H., Muir, T. W. & Cowburn, D. Structural and dynamical features of inteins and implications on protein splicing. J. Biol. Chem. 289, 14506–14511 (2014).
Guimaraes, C. P. et al. Site-specific C-terminal and internal loop labeling of proteins using sortase-mediated reactions. Nat. Protoc. 8, 1787–1799 (2013).
Hendrickx, A. P. A., Budzik, J. M., Oh, S.-Y. & Schneewind, O. Architects at the bacterial surface: sortases and the assembly of pili with isopeptide bonds. Nat. Rev. Microbiol. 9, 166–176 (2011).
Chandru, K. et al. The abiotic chemistry of thiolated acetate derivatives and the origin of life. Sci. Rep. 6, 29883 (2016).
Hemantha, H. P. & Brik, A. Non-enzymatic synthesis of ubiquitin chains: where chemistry makes a difference. Bioorg. Med. Chem. 21, 3411–3420 (2013).
Goldstein, G. et al. Isolation of a polypeptide that has lymphocyte-differentiating properties and is probably represented universally in living cells. Proc. Natl Acad. Sci. USA 72, 11–15 (1975).
Pickart, C. M. Mechanisms underlying ubiquitination. Annu. Rev. Biochem. 70, 503–533 (2001).
Kang, H. J. & Baker, E. N. Intramolecular isopeptide bonds: protein crosslinks built for stress? Trends Biochem. Sci. 36, 229–237 (2011).
Kerscher, O., Felberbaum, R. & Hochstrasser, M. Modification of proteins by ubiquitin and ubiquitin-like proteins. Annu. Rev. Cell Dev. Biol. 22, 159–180 (2006).
Reyes-Turcu, F. E., Ventii, K. H. & Wilkinson, K. D. Regulation and cellular roles of ubiquitin-specific deubiquitinating enzymes. Annu. Rev. Biochem. 78, 363–397 (2009).
Bergink, S. & Jentsch, S. Principles of ubiquitin and SUMO modifications in DNA repair. Nature 458, 461–467 (2009).
Spasser, L. & Brik, A. Chemistry and biology of the ubiquitin signal. Angew. Chem. Int. Ed. 51, 6840–6862 (2012).
Vittal, V. et al. Intrinsic disorder drives N-terminal ubiquitination by Ube2w. Nat. Chem. Biol. 11, 83–89 (2015).
Gareau, J. R. & Lima, C. D. The SUMO pathway: emerging mechanisms that shape specificity, conjugation and recognition. Nat. Rev. Mol. Cell Biol. 11, 861–871 (2010).
Berrade, L. & Camarero, J. A. Expressed protein ligation: a resourceful tool to study protein structure and function. Cell Mol. Life Sci. 66, 3909–3922 (2009).
Klabunde, T., Sharma, S., Telenti, A., Jacobs, W. R. Jr & Sacchettini, J. C. Crystal structure of GyrA intein from Mycobacterium xenopi reveals structural basis of protein splicing. Nat. Struct. Biol. 5, 31–36 (1998).
Mazmanian, S. K., Liu, G., Ton-That, H. & Schneewind, O. Staphylococcus aureus sortase, an enzyme that anchors surface proteins to the cell wall. Science 285, 760–763 (1999).
Ahvazi, B. & Steinert, P. M. A model for the reaction mechanism of the transglutaminase 3 enzyme. Exp. Mol. Med. 35, 228–242 (2003).
Murthy, S. N. P., Lukas, T. J., Jardetzky, T. S. & Lorand, L. Selectivity in the post-translational, transglutaminase-dependent acylation of lysine residues. Biochemistry 48, 2654–2660 (2009).
Veeravalli, K., Boyd, D., Iverson, B. L., Beckwith, J. & Georgiou, G. Laboratory evolution of glutathione biosynthesis reveals natural compensatory pathways. Nat. Chem. Biol. 7, 101–105 (2011).
Du, Z. et al. Highly conserved histidine plays a dual catalytic role in protein splicing: a pKa shift mechanism. J. Am. Chem. Soc. 131, 11581–11589 (2009).
Melnyk, O. & Agouridas, V. From protein total synthesis to peptide transamidation and metathesis: playing with the reversibility of N,S-acyl or N,Se-acyl migration reactions. Curr. Op. Chem. Biol. 22, 137–145 (2014).
MacMillan, D., Adams, A. & Premdjee, B. Shifting native chemical ligation into reverse through N→S acyl transfer. Isr. J. Chem. 51, 885–899 (2011).
Nagaike, F. et al. Efficient microwave-assisted tandem N- to S-acyl transfer and thioester exchange for the preparation of a glycosylated peptide thioester. Org. Lett. 8, 4465–4468 (2006).
Kajihara, Y., Yoshihara, A., Hirano, K. & Yamamoto, N. Convenient synthesis of a sialylglycopeptide-thioester having an intact and homogeneous complex-type disialyl-oligosaccharide. Carbohydr. Res. 341, 1333–1340 (2006).
Ficht, S., Payne, R. J., Guy, R. T. & Wong, C.-H. Solid-phase synthesis of peptide and glycopeptide thioesters through side-chain-anchoring strategies. Chem. Eur. J. 14, 3620–3629 (2008).
Blanco-Canosa, J. B. & Dawson, P. E. An efficient Fmoc-SPPS approach for the generation of thioester peptide precursors for use in native chemical ligation. Angew. Chem. Int. Ed. 47, 6851–6855 (2008).
Zheng, J.-S., Tang, S., Qi, Y.-K., Wang, Z.-P. & Liu, L. Chemical synthesis of proteins using peptide hydrazides as thioester surrogates. Nat. Protoc. 8, 2483–2495 (2013).
Shelton, P. M. M., Weller, C. E. & Chatterjee, C. A. Facile N-mercaptoethoxyglycinamide (MEGA) linker approach to peptide thioesterification and cyclization. J. Am. Chem. Soc. 139, 3946–3949 (2017).
Kent, S. Origin of the chemical ligation concept for the total synthesis of enzymes (proteins). Biopolymers 94, iv–ix (2010).
Dawson, P. E., Muir, T. W., Clark-Lewis, I. & Kent, S. B. H. Synthesis of proteins by native chemical ligation. Science 266, 776–779 (1994). This seminal paper introduced the concept of native chemical ligation (NCL) as a strategy to join synthetic peptides and produce a full length protein; a key example of protein synthesis through linear assembly.
Hackeng, T. M., Griffin, J. H. & Dawson, P. E. Protein synthesis by native chemical ligation: expanded scope by using straightforward methodology. Proc. Natl Acad. Sci. USA 96, 10068–10073 (1999).
Dawson, P. E., Churchill, M., Ghadiri, M. R. & Kent, S. B. H. Modulation of reactivity in native chemical ligation through the use of thiol additives. J. Am. Chem. Soc. 119, 4325–4329 (1997).
Johnson, E. C. B. & Kent, S. B. H. Insights into the mechanism and catalysis of the native chemical ligation reaction. J. Am. Chem. Soc. 128, 6640–6646 (2006).
Thompson, R. E. et al. Trifluoroethanethiol: an additive for efficient one-pot peptide ligation- desulfurization chemistry. J. Am. Chem. Soc. 136, 8161–8164 (2014).
Torbeev, V. Y. & Kent, S. B. H. Convergent chemical synthesis and crystal structure of a 203 amino acid "covalent dimer" HIV-1 protease enzyme molecule. Angew. Chem. Int. Ed. 46, 1667–1670 (2007).
Yan, L. Z. & Dawson, P. E. Synthesis of peptides and proteins without cysteine residues by native chemical ligation combined with desulfurization. J. Am. Chem. Soc. 123, 526–533 (2001). This important paper introduced the concept of desulfurisation as an approach to enable native chemical ligation at sites other then cysteine.
Crich, D. & Banerjee, A. Native chemical ligation at phenylalanine. J. Am. Chem. Soc. 129, 10064–10065 (2007).
Chen, J., Wan, Q., Yuan, Y., Zhu, J. & Danishefsky, S. J. Native chemical ligation at valine: a contribution to peptide and glycopeptide synthesis. Angew. Chem. Int. Ed. 47, 8521–8524 (2008).
Ajish Kumar, K. S., Haj-Yahya, M., Olschewski, D., Lashuel, H. A. & Brik, A. Highly efficient and chemoselective peptide ubiquitylation. Angew. Chem. Int. Ed. 48, 8090–8094 (2009).
Dawson, P. E. Native chemical ligation combined with desulfurization and deselenization: a general strategy for chemical protein synthesis. Isr. J. Chem. 51, 862–867 (2011).
Malins, L. R. & Payne, R. J. Synthetic amino acids for applications in peptide ligation-desulfurization chemistry. Aus. J. Chem. 68, 521–537 (2015).
Hoffmann, F. W., Ess, R. J., Simmons, T. C. & Hanzel, R. S. The desulfuration of mercaptans with trialkyl phosphites. J. Am. Chem. Soc. 78, 6414 (1956).
Walling, C. T. & Rabinowitz, R. Reaction of thiyl radicals with trialkyl phosphates. J. Am. Chem. Soc. 79, 5326 (1957).
Wan, Q. & Danishefsky, S. J. Free-radical-based, specific desulfurization of cysteine: a powerful advance in the synthesis of polypeptides and glycopolypeptides. Angew. Chem. Int. Ed. 46, 9248–9252 (2007).
Gao, X.-F., Du, J.-J., Liu, Z. & Guo, J. Visible-light-induced specific desulfurization of cysteinyl peptide and glycopeptide in aqueous solution. Org. Lett. 18, 1166–1169 (2016).
Floyd, N. et al. Photoinduced, family-specific, site-selective cleavage of TIM-barrel proteins. J. Am. Chem. Soc. 131, 12518–12519 (2009).
Canne, L. E., Bark, S. J. & Kent, S. B. H. Extending the applicability of native chemical ligation. J. Am. Chem. Soc. 118, 5891–5896 (1996).
Low, D. W., Hill, M. G., Carrasco, M. R., Kent, S. B. H. & Botti, P. Total synthesis of cytochrome b562 by native chemical ligation using a removable auxiliary. Proc. Natl Acad. Sci. USA 98, 6554–6559 (2001).
Wu, B. et al. Building complex glycopeptides: development of a cysteine-free native chemical ligation protocol. Angew. Chem. Int. Ed. 45, 4116–4125 (2006).
Offer, J., Boddy, C. N. C. & Dawson, P. E. Extending synthetic access to proteins with a removable acyl transfer auxiliary. J. Am. Chem. Soc. 124, 4642–4646 (2002).
Kawakami, T. & Aimoto, S. A photoremovable ligation auxiliary for use in polypeptide synthesis. Tetrahedron Lett. 44, 6059–6061 (2003).
Brik, A., Yang, Y.-Y., Ficht, S. & Wong, C.-H. Sugar-assisted glycopeptide ligation. J. Am. Chem. Soc. 128, 5626–5627 (2006).
Payne, R. J. et al. Extended sugar-assisted glycopeptide ligations: development, scope, and applications. J. Am. Chem. Soc. 129, 13527–13536 (2007).
Brik, A., Ficht, S., Yang, Y.-Y., Bennett, C. S. & Wong, C.-H. Sugar-assisted ligation of N-linked glycopeptides with broad sequence tolerance at the ligation junction. J. Am. Chem. Soc. 128, 15026–15033 (2006).
Ficht, S., Payne, R. J., Brik, A. & Wong, C.-H. Second-generation sugar-assisted ligation: a method for the synthesis of cysteine-containing glycopeptides. Angew. Chem. Int. Ed. 46, 5975–5979 (2007).
Lutsky, M.-Y., Nepomniaschiy, N. & Brik, A. Peptide ligation via side-chain auxiliary. Chem. Commun. (Camb) 14, 1229–1231 (2008).
Chatterjee, C., McGinty, R. K., Pellois, J.-P. & Muir, T. W. Auxiliary-mediated site-specific peptide ubiquitylation. Angew. Chemie. Int. Ed. 46, 2814–2818 (2007).
Loibl, S. F., Harpaz, Z. & Seitz, O. A type of auxiliary for native chemical peptide ligation beyond cysteine and glycine junctions. Angew. Chem. Int. Ed. 54, 15055–15059 (2015).
Bang, D., Pentelute, B. L. & Kent, S. B. H. Kinetically controlled ligation for the convergent chemical synthesis of proteins. Angew. Chem., Int. Ed. 45, 3985–3988 (2006).
Bang, D., Pentelute, B. L., Gates, Z. P. & Kent, S. B. Direct on-resin synthesis of peptide-αthiophenylesters for use in native chemical ligation. Org. Lett. 8, 1049–1052 (2006).
Bang, D. & Kent, S. B. H. Protein synthesis: a one-pot total synthesis of crambin. Angew. Chem. Int. Ed. 43, 2534–2538 (2004).
Lee, J.-G., Kwon, Y.-J., Pentelute, B. L. & Bang, D.-H. Use of model peptide reactions for the characterization of kinetically controlled ligation. Bioconjug. Chem. 22, 1645–1649 (2011).
Mitchell, N. J. et al. Rapid additive-free selenocystine-selenoester peptide ligation. J. Am. Chem. Soc. 137, 14011–14014 (2015).
Muir, T. W., Sondhi, D. & Cole, P. A. Expressed protein ligation: A general method for protein engineering. Proc. Natl Acad. Sci. USA 95, 6705–6710 (1998). A groundbreaking paper that introduced the concept of expressed protein ligation (EPL) through the use of inteins. It vastly expanded the size limitation and scope of protein synthesis by combining NCL with recombinant protein production.
Flavell, R. R. & Muir, T. W. Expressed protein ligation (EPL) in the study of signal transduction, ion conduction, and chromatin biology. Acc. Chem. Res. 42, 107–116 (2009).
Camarero, J. A. & Kwon, Y. Traceless and site-specific attachment of proteins onto solid supports. Int. J. Pept. Res. Ther. 14, 351–357 (2008).
Hackenberger, C. P. R. & Schwarzer, D. Chemoselective ligation and modification strategies for peptides and proteins. Angew. Chem., Int. Ed. 47, 10030–10074 (2008).
Hofmann, R. M. & Muir, T. W. Recent advances in the application of expressed protein ligation to protein engineering. Curr. Opin. Biotechnol. 13, 297–303 (2002).
Nilsson, B. L., Soellner, M. B. & Raines, R. T. Chemical synthesis of proteins. Annu. Rev. Biophys. Biomol. Struct. 34, 91–118 (2005).
Hofmann, R. M. et al. Fluorescent monitoring of kinase activity in real time: development of a robust fluorescence-based assay for Abl tyrosine kinase activity. Bioorg. Med. Chem. Lett. 11, 3091–3094 (2001).
Nguyen, G. K. T. et al. Butelase-mediated cyclization and ligation of peptides and proteins. Nat. Protoc. 11, 1977–1988 (2016).
Izumi, M., Otsuki, A., Nishihara, M., Okamoto, R. & Kajihara, Y. Chemical Synthesis of a Synthetic Analogue of the Sialic Acid-Binding Lectin Siglec-7. Chem. Biochem. 15, 2503–2507 (2014).
Reif, A. et al. Semisynthesis of Biologically Active Glycoforms of the Human Cytokine Interleukin 6. Angew. Chem. Int. Ed. 53, 12125–12131 (2014).
Medini, K. et al. Chemical synthesis of a pore-forming antimicrobial protein, Caenopore-5, by using native chemical ligation at a Glu-Cys site. ChemBioChem 16, 328–336 (2015).
Valiyaveetil, F. I., Leonetti, M., Muir, T. W. & MacKinnon, R. Ion selectivity in a semisynthetic K+ channel locked in the conductive conformation. Science 314, 1004–1007 (2006).
Chalker, J. M. Prospects in the total synthesis of protein therapeutics. Chem. Biol. Drug Des. 81, 122–135 (2013).
Fernandez-Tejada, A. et al. Total synthesis of glycosylated proteins. Top. Curr. Chem. 362, 1–26 (2015).
Malins, L. R. & Payne, R. J. Recent extensions to native chemical ligation for the chemical synthesis of peptides and proteins. Curr. Opin. Chem. Biol. 22, 70–78 (2014).
Raibaut, L., El Mahdi, O. & Melnyk, O. Solid phase protein chemical synthesis. Top. Curr. Chem. 363, 103–154 (2015).
Shin, Y. et al. Fmoc-based synthesis of peptide-αthioesters: application to the total chemical synthesis of a glycoprotein by native chemical ligation. J. Am. Chem. Soc. 121, 11684–11689 (1999).
Wang, P. et al. Erythropoietin derived by chemical synthesis. Science 342, 1357–1360 (2013). The chemical synthesis of a single glycoform of EPO represented a major milestone in glycoprotein synthesis through NCL.
Murakami, M. et al. Chemical synthesis of erythropoietin glycoforms for insights into the relationship between glycosylation pattern and bioactivity. Sci. Adv. 2, e1500678 (2016).
Kochendoerfer, G. G. et al. Design and chemical synthesis of a homogeneous polymer-modified erythropoiesis protein. Science 299, 884–887 (2003).
Fekner, T., Li, X. & Chan, M. K. Nonenzymatic ubiquitylation. ChemBioChem 12, 21–33 (2011).
Faggiano, S. & Pastore, A. The challenge of producing ubiquitinated proteins for structural studies. Cells 3, 639–656 (2014).
Rott, R. et al. Monoubiquitylation of α-synuclein by seven in absentia homolog (SIAH) promotes its aggregation in dopaminergic cells. J. Biol. Chem. 283, 3316–3328 (2008).
Teixeira, L. K. & Reed, S. I. Ubiquitin ligases and cell cycle control. Annu. Rev. Biochem. 82, 387–414 (2013).
Cabri, W., Borghi, D., Arlandini, E., Sbraletta, P. & Bedeschi, A. Metal-catalyzed and -promoted cyclization for the synthesis of cephalosporins: a possible DAOC/DAC synthetase biomimetic process. Tetrahedron 49, 6837–6848 (1993).
Pham, G. H. & Strieter, E. R. Peeling away the layers of ubiquitin signaling complexities with synthetic ubiquitin-protein conjugates. Curr. Opin. Chem. Biol. 28, 57–65 (2015).
Catic, A., Collins, C., Church, G. M. & Ploegh, H. L. Preferred in vivo ubiquitination sites. Bioinformatics 20, 3302–3307 (2004).
Bavikar, S. N. et al. Chemical synthesis of ubiquitinated peptides with varying lengths and types of ubiquitin chains to explore the activity of deubiquitinases. Angew. Chem. Int. Ed. 51, 758–763 (2012).
Spicer, C. D. & Davis, B. G. Selective chemical protein modification. Nat. Commun. 5, 4740 (2014).
Weller, C. E. et al. Aromatic thiol-mediated cleavage of N-O bonds enables chemical ubiquitylation of folded proteins. Nat. Commun. 7, 12979 (2016).
McGinty, R. K., Kim, J., Chatterjee, C., Roeder, R. G. & Muir, T. W. Chemically ubiquitylated histone H2B stimulates hDot1L-mediated intranucleosomal methylation. Nature 453, 812–816 (2008).
Chatterjee, C., McGinty, R. K., Fierz, B. & Muir, T. W. Disulfide-directed histone ubiquitylation reveals plasticity in hDot1L activation. Nat. Chem. Biol. 6, 267–269 (2010).
Chen, J., Ai, Y., Wang, J., Haracska, L. & Zhuang, Z. Chemically ubiquitylated PCNA as a probe for eukaryotic translesion DNA synthesis. Nat. Chem. Biol. 6, 270–272 (2010).
Meier, F. et al. Semisynthetic, site-specific ubiquitin modification of α-synuclein reveals differential effects on aggregation. J. Am. Chem. Soc. 134, 5468–5471 (2012).
Virdee, S. et al. Traceless and site-specific ubiquitination of recombinant proteins. J. Am. Chem. Soc. 133, 10708–10711 (2011).
Haj-Yahya, M. et al. Synthetic polyubiquitinated α-synuclein reveals important insights into the roles of the ubiquitin chain in regulating its pathophysiology. Proc. Natl Acad. Sci. USA 110, 17726–17731 (2013).
Du, J.-J. et al. Convergent synthesis of N-linked glycopeptides via aminolysis of ω-Asp p-nitrophenyl thioesters in solution. Org. Lett. 18, 4828–4831 (2016).
Payne, R. J., Ficht, S., Greenberg, W. A. & Wong, C.-H. Cysteine-free peptide and glycopeptide ligation by direct aminolysis. Angew. Chem. Int. Ed. 47, 4411–4415 (2008).
Cole, C. M. et al. Spontaneous reconstitution of functional transmembrane proteins during bioorthogonal phospholipid membrane synthesis. Angew. Chem. Int. Ed. 54, 12738–12742 (2015).
Brea, R. J., Rudd, A. K. & Devaraj, N. K. Nonenzymatic biomimetic remodeling of phospholipids in synthetic liposomes. Proc. Natl Acad. Sci. USA 113, 8589–8594 (2016).
Lewandowski, B. et al. Sequence-specific peptide synthesis by an artificial small-molecule machine. Science 339, 189–193 (2013). A synthetic tour-de-force that illustrates the potential of S-to-N acyl transfer and related processes as synthetic tools to enable functional molecular machines.
McKee, M. L. et al. Peptidomimetic bond formation by DNA-templated acyl transfer. Org. Biomol. Chem. 9, 1661–1666 (2011).
Galardy, P., Ploegh, H. L. & Ovaa, H. Mechanism-based proteomics tools based on ubiquitin and ubiquitin-like proteins: crystallography, activity profiling, and protease identification. Methods Enzymol. 399, 120–131 (2005).
Mulder, M. P. C., El Oualid, F., ter Beek, J. & Ovaa, H. A native chemical ligation handle that enables the synthesis of advanced activity-based probes: Diubiquitin as a Case Study. ChemBioChem 15, 946–949 (2014).
Geurink, P. P. et al. Development of diubiquitin-based FRET probes to quantify ubiquitin linkage specificity of deubiquitinating enzymes. ChemBioChem 17, 816–820 (2016).
Yang, X.-F. et al. A dual emission fluorescent probe enables simultaneous detection of glutathione and cysteine/homocysteine. Chem. Sci. 5, 2177–2183 (2014).
Liu, J. et al. A carboxylic acid-functionalized coumarin-hemicyanine fluorescent dye and its application to construct a fluorescent probe for selective detection of cysteine over homocysteine and glutathione. RSC Adv. 4, 64542–64550 (2014).
Cowper, B. et al. Expanding the scope of N→S acyl transfer in native peptide sequences. Org. Biomol. Chem. 13, 7469–7476 (2015).
Ollivier, N., Behr, J.-B., El-Mahdi, O., Blanpain, A. & Melnyk, O. Fmoc solid-phase synthesis of peptide thioesters using an intramolecular N,S-acyl shift. Org. Lett. 7, 2647–2650 (2005).
Ollivier, N., Dheur, J., Mhidia, R., Blanpain, A. & Melnyk, O. Bis(2-sulfanylethyl)amino native peptide ligation. Org. Lett. 12, 5238–5241 (2010).
Vizzavona, J., Dick, F. & Vorherr, T. Synthesis and application of an auxiliary group for chemical ligation at the X-gly site. Bioorg. Med. Chem. Lett. 12, 1963–1965 (2002).
Nakamura, K. i., Sumida, M., Kawakami, T., Vorherr, T. & Aimoto, S. Generation of an S-peptide via an N-S acyl shift reaction in a TFA solution. Bull. Chem. Soc. Jpn 79, 1773–1780 (2006).
Hojo, H., Onuma, Y., Akimoto, Y., Nakahara, Y. & Nakahara, Y. N-alkyl cysteine-assisted thioesterification of peptides. Tetrahedron Lett. 48, 25–28 (2007).
Panda, S. S., Hall, C. D., Oliferenko, A. A. & Katritzky, A. R. Traceless chemical ligation from S-, O-, and N-acyl isopeptides. Acc. Chem. Res. 47, 1076–1087 (2014).
Monbaliu, J.-C. M., Dive, G., Stevens, C. V. & Katritzky, A. R. Governing parameters of long-range intramolecular S-to-N acyl transfers within (S)-acyl isopeptides. J. Chem. Theory Comput. 9, 927–934 (2013).
Biswas, S. et al. Long-range chemical ligation from N→N acyl migrations in tryptophan peptides via cyclic transition states of 10- to 18-members. Chem. Eur. J. 20, 8189–8198 (2014).
Panda, S. S. et al. Study of chemical ligation via 17-, 18- and 19-membered cyclic transition states. Chem. Biol. Drug Des. 80, 821–827 (2012).
Ha, K. et al. Long-range intramolecular S→N acyl migration: a study of the formation of native peptide analogues via 13-, 15-, and 16-membered cyclic transition states. J. Org. Chem. 77, 2637–2648 (2012).
Haase, C. & Seitz, O. Internal cysteine accelerates thioester-based peptide ligation. Eur. J. Org. Chem. 2096–2101 (2009).
Dawson, P. E. & Kent, S. B. Synthesis of native proteins by chemical ligation. Ann. Rev. Biochem. 69, 923–960 (2000).
We would like to thank Science Foundation Ireland (SFI) for funding (15/CDA/3310).
The authors declare no competing financial interests.
About this article
Organic Letters (2019)
Macromolecular Rapid Communications (2019)
Journal of the American Chemical Society (2019)
Organic Letters (2019)
α-Nitro-α,β-Unsaturated Ketones: An Electrophilic Acyl Transfer Reagent in Catalytic Asymmetric Friedel–Crafts and Michael Reactions
Organic Letters (2019)