Brief Communication


Nature Chemical Biology 3, 321 - 322 (2007)
Published online: 22 April 2007 | doi:10.1038/nchembio878

Introducing genetically encoded aldehydes into proteins

Isaac S Carrico1,3,4, Brian L Carlson2,4 & Carolyn R Bertozzi1,2


Methods for introducing bioorthogonal functionalities into proteins have become central to protein engineering efforts. Here we describe a method for the site-specific introduction of aldehyde groups into recombinant proteins using the 6-amino-acid consensus sequence recognized by the formylglycine-generating enzyme. This genetically encoded 'aldehyde tag' is no larger than a His6 tag and can be exploited for numerous protein labeling applications.


Over a decade ago investigators discovered that most sulfatases contain an aldehyde-bearing formylglycine residue within their active site that is required for catalytic activity1. The formylglycine residue is produced by co- or post-translational modification of a conserved cysteine residue found within the "sulfatase motif" (Fig. 1a). Definedas a contiguous 13-amino-acid sequence, this motif contains a highly conserved CXPXR submotif, where X is usually serine, threonine, alanine or glycine. Formylglycine-generating enzyme (FGE) oxidizes cysteine to formylglycine and has recently been identified in eukaryotes2, 3 and prokaryotes (data not shown). Notably, FGE recognizes its substrates based solely on primary sequence (refs. 3,4 and data not shown). This observation suggests that a small peptide tag based on the sulfatase motif, which we have termed the "aldehyde tag," might direct formation of formylglycine independent of its context (Fig. 1b). Furthermore, many organisms have endogenous FGE activity5, which suggests that this post-translational modification system could serve as a general means for engineering proteins for site-specific labeling6, 7, 8, 9, 10, 11, 12, 13.

Figure 1: The sulfatase motif can serve as an aldehyde tag for site-specific protein modification.

Figure 1 : The sulfatase motif can serve as an aldehyde tag for site-specific protein modification.

(a) Sequence alignment of the sulfatase motifs from a variety of sulfatases found in diverse organisms. (b) Nucleotides encoding the 6-amino-acid sequence can be appended to or inserted within a gene of interest. Upon expression, the encoded cysteine is modified to an aldehyde, which can be used as a chemical handle for a variety of applications such as fluorophore labeling. fGly, formylglycine.

Full size image (77 KB)

To evaluate this strategy, we expressed recombinant proteins with either N- or C-terminal aldehyde tags in Escherichia coli. We explored three protein targets: a C-terminally tagged maltose-binding protein (MBP), an N-terminally tagged human growth hormone (hGH) and an N-terminally tagged mycobacterial sulfotransferase (Stf0). Additionally, we tested two variants of the aldehyde tag: a 13-amino-acid tag that represents a full-length sulfatase motif (LCTPSRGSLFTGR) and a 6-amino-acid tag (LCTPSR) that includes only the core conserved residues (Table 1). Although E. coli has endogenous FGE activity14, we coexpressed the tagged proteins with an additional prokaryotic FGE from Mycobacterium tuberculosis (GenBank accession NP_215226) to maximize formylglycine formation efficiency (see Supplementary Methods and Supplementary Table 1 online).


Tryptic digestion of the 13-mer aldehyde-tagged Stf0 (ald13-Stf0) allowed direct identification of formylglycine by mass spectrometry (Supplementary Fig. 1 online). The presence of formylglycine was further confirmed by treating the tryptic digest with methoxyamine (1), a reagent that reacts selectively with aldehydes and ketones to form the corresponding oximes (Supplementary Fig. 1).

To quantify the extent of conversion of cysteine to formylglycine for all three proteins, we used an MS-based standard addition assay (see Supplementary Methods and Supplementary Fig. 2 online). The conversion efficiency was >90% for all but one of the constructs tested in this assay (Table 1). Considering that the sulfatase motif is natively found at an interior region of the sulfatase primary sequence, these results suggest that efficient aldehyde formation is independent of the tag's location within the protein. When ald6-MBP was expressed in E. coli without exogenous FGE, conversion efficiency dropped to 32 plusminus 5% (s.d.; Supplementary Fig. 3 online).

We next used the aldehydes for site-specific protein modification with aminooxy- or hydrazide-functionalized probes. A panel of aldehyde-tagged proteins was robustly labeled with Alexa Fluor 647 C5-aminooxyacetamide (Invitrogen; Fig. 2a). By contrast, control proteins in which the critical cysteine had been mutated to alanine demonstrated no significant labeling. Additionally, we showed that aldehyde-tagged proteins can be reversibly modified with multiple epitopes. We reacted ald6-MBP with biotin hydrazide (2), then displaced the N-acylhydrazone with a more thermodynamically stable oxime by subsequent reaction with an aminooxy-conjugated Flag peptide (Fig. 2b). This process enables sequential modification of proteins with independent tags for purification, detection or other applications.

Figure 2: Selective labeling of aldehyde-tagged proteins.

Figure 2 : Selective labeling of aldehyde-tagged proteins.

(a) Fluorophore labeling of aldehyde-tagged proteins. Top, labeling with Alexa Fluor 647 C5-aminooxyacetamide; bottom, protein loading was assessed by Sypro Ruby staining. (b) Tandem labeling of ald6-MBP with two different probes. Lane 1, protein incubated with biotin hydrazide; lane 2, protein incubated with biotin hydrazide and subsequently modified with methoxyamine; lane 3, protein incubated with biotin hydrazide and subsequently modified with aminooxy-Flag. Top, western blot probed with alpha-biotin antibody; middle, same western blot probed with alpha-Flag antibody; bottom, same western blot stained with Ponceau S. (c) PEGylation of ald6-Stf0 with 5 kDa (5K), 10 kDa (10K) or 20 kDa (20K) aminooxy-PEG. Black arrowhead, protein-PEG conjugate; white arrowhead, unreacted protein.

Full size image (33 KB)

Modification of pharmaceutical proteins with polyethylene glycol (PEG) chains can increase proteolytic stability while decreasing renal clearance and immunogenicity15. As a final application, we demonstrated the aldehyde tag's utility for site-specific PEGylation by reaction of ald6-Stf0 and ald6-MBP with aminooxy-PEG polymers of various molecular weights (Fig. 2c and Supplementary Fig. 4 online).

For a protein modification technique to find widespread use among biologists, it must be practical, general and available. The aldehyde tag fulfills all of these requirements. The motif is installed using basic molecular biology protocols and can function at either terminus or at an interior position of the protein. Furthermore, many aldehyde-specific reagents are commercially available. We envision many applications of this technology, including the introduction of biophysical probes and epitope tags onto proteins, as well as surface attachment of proteins in the context of microarrays. Additionally, combinatorial use of the aldehyde tag with other bioorthogonal labeling methods might offer new avenues for protein engineering.

Note: Supplementary information and chemical compound information is available on the Nature Chemical Biology website.

Top

Author Contributions

I.S.C. and B.L.C. carried out cloning, expression, purification and fluorescent tagging of the constructs. I.S.C. quantified conversion to fGly and performed multiple epitope assays. B.L.C. performed PEGylation assays. C.R.B. directed the project. All authors worked together to compose the manuscript.



Top

Acknowledgments

We thank M. Francis and J. Rush for helpful discussions and D. King and A. Falick for MS expertise. I.S.C. was supported by a postdoctoral fellowship from the US National Institutes of Health. B.L.C. was supported by a predoctoral fellowship from the US National Science Foundation. This work was supported by a grant from the US National Institutes of Health to C.R.B. (GM59907).

Competing interests statement:

The authors declare no competing financial interests.

Received 19 January 2007; Accepted 28 March 2007; Published online 22 April 2007.

Top

References

  1. Schmidt, B., Selmer, T., Ingendoh, A. & von Figura, K. Cell 82, 271–278 (1995). | Article | PubMed | ISI | ChemPort |
  2. Cosma, M.P. et al. Cell 113, 445–456 (2003). | Article | PubMed | ISI | ChemPort |
  3. Dierks, T. et al. Cell 113, 435–444 (2003). | Article | PubMed | ISI | ChemPort |
  4. Dierks, T., Lecca, M.R., Schlotterhose, P., Schmidt, B. & von Figura, K. EMBO J. 18, 2084–2091 (1999). | Article | PubMed | ISI | ChemPort |
  5. Landgrebe, J., Dierks, T., Schmidt, B. & von Figura, K. Gene 316, 47–56 (2003). | Article | PubMed | ISI | ChemPort |
  6. Gilmore, J.M., Scheck, R.A., Esser-Kahn, A.P., Joshi, N.S. & Francis, M.B. Angew. Chem. Int. Edn Engl. 45, 5307–5311 (2006). | Article | ChemPort |
  7. Link, A.J., Mock, M.L. & Tirrell, D.A. Curr. Opin. Biotechnol. 14, 603–609 (2003). | Article | PubMed | ISI | ChemPort |
  8. Chen, I., Howarth, M., Lin, W.Y. & Ting, A.Y. Nat. Methods 2, 99–104 (2005). | Article | PubMed | ISI | ChemPort |
  9. Griffin, B.A., Adams, S.R. & Tsien, R.Y. Science 281, 269–272 (1998). | Article | PubMed | ISI | ChemPort |
  10. Wang, L., Xie, J. & Schultz, P.G. Annu. Rev. Biophys. Biomol. Struct. 35, 225–249 (2006). | Article | PubMed | ISI | ChemPort |
  11. Clarke, K.M., Mercer, A.C., La Clair, J.J. & Burkart, M.D. J. Am. Chem. Soc. 127, 11234–11235 (2005). | Article | PubMed | ISI | ChemPort |
  12. George, N., Pick, H., Vogel, H., Johnsson, N. & Johnsson, K. J. Am. Chem. Soc. 126, 8896–8897 (2004). | Article | PubMed | ISI | ChemPort |
  13. Yin, J., Liu, F., Li, X. & Walsh, C.T. J. Am. Chem. Soc. 126, 7754–7755 (2004). | Article | PubMed | ISI | ChemPort |
  14. Dierks, T. et al. J. Biol. Chem. 273, 25560–25564 (1998). | Article | PubMed | ISI | ChemPort |
  15. Harris, J.M. & Chess, R.B. Nat. Rev. Drug Discov. 2, 214–221 (2003). | Article | PubMed | ISI | ChemPort |
  1. Department of Chemistry, Howard Hughes Medical Institute, B84 Hildebrand Hall, University of California, Berkeley, California 94720, USA
  2. Department of Molecular and Cell Biology, Howard Hughes Medical Institute, B84 Hildebrand Hall, University of California, Berkeley, California 94720, USA.
  3. Present address: Department of Chemistry, 533 Graduate Chemistry, Stony Brook University, Stony Brook, New York 11794, USA.
  4. These authors contributed equally to this work.

Correspondence to: Carolyn R Bertozzi1,2 e-mail: crb@berkeley.edu

MORE ARTICLES LIKE THIS

These links to content published by NPG are automatically generated.

RESEARCH

The mechanisms of acute ischemic injury in the cell processes of developing white matter astrocytes

Journal of Cerebral Blood Flow & Metabolism Original Article


Extra navigation

Subscribe to Nature Chemical Biology

Subscribe

Open Innovation Challenges

  • Corrosion Inhibitor

    • Deadline: Aug 19 2009
    • Reward: $10,000 USD

    The Seeker is looking for inhibitors of corrosion. This Challenge requires only a written descripti...

  • Mitigating Zinc Corrosion

    • Deadline: Aug 23 2009
    • Reward: $20,000 USD

    The Seeker is looking for novel methods to mitigate zinc corrosion/gassing in alkaline media. This ...

naturejobs

natureproducts