Letter | Published:

Encoding human serine phosphopeptides in bacteria for proteome-wide identification of phosphorylation-dependent interactions

Nature Biotechnology volume 36, pages 638644 (2018) | Download Citation


Post-translational phosphorylation is essential to human cellular processes, but the transient, heterogeneous nature of this modification complicates its study in native systems1,2,3. We developed an approach to interrogate phosphorylation and its role in protein-protein interactions on a proteome-wide scale. We genetically encoded phosphoserine in recoded E. coli4,5,6 and generated a peptide-based heterologous representation of the human serine phosphoproteome. We designed a single-plasmid library encoding >100,000 human phosphopeptides and confirmed the site-specific incorporation of phosphoserine in >36,000 of these peptides. We then integrated our phosphopeptide library into an approach known as Hi-P to enable proteome-level screens for serine-phosphorylation-dependent human protein interactions. Using Hi-P, we found hundreds of known and potentially new phosphoserine-dependent interactors with 14-3-3 proteins and WW domains. These phosphosites retained important binding characteristics of the native human phosphoproteome, as determined by motif analysis and pull-downs using full-length phosphoproteins. This technology can be used to interrogate user-defined phosphoproteomes in any organism, tissue, or disease of interest.

Access optionsAccess options

Rent or Buy article

Get time limited or full article access on ReadCube.


All prices are NET prices.


Primary accessions



  1. 1.

    & Mechanisms of specificity in protein phosphorylation. Nat. Rev. Mol. Cell Biol. 8, 530–541 (2007).

  2. 2.

    et al. The structural basis for 14-3-3:phosphopeptide binding specificity. Cell 91, 961–971 (1997).

  3. 3.

    & Mitogen-activated protein kinase pathways mediated by ERK, JNK, and p38 protein kinases. Science 298, 1911–1912 (2002).

  4. 4.

    et al. Expanding the genetic code of Escherichia coli with phosphoserine. Science 333, 1151–1154 (2011).

  5. 5.

    et al. A flexible codon in genomically recoded Escherichia coli permits programmable protein phosphorylation. Nat. Commun. 6, 8130 (2015).

  6. 6.

    et al. Genomically recoded organisms expand biological functions. Science 342, 357–360 (2013).

  7. 7.

    et al. Architecture of the human interactome defines protein communities and disease networks. Nature 545, 505–509 (2017).

  8. 8.

    et al. A human interactome in three quantitative dimensions organized by stoichiometries and abundances. Cell 163, 712–723 (2015).

  9. 9.

    , , , & The PINK1-PARKIN mitochondrial ubiquitylation pathway drives a program of OPTN/NDP52 recruitment and TBK1 activation to promote mitophagy. Mol. Cell 60, 7–20 (2015).

  10. 10.

    et al. Defining roles of PARKIN and ubiquitin phosphorylation by PINK1 in mitochondrial quality control using a ubiquitin replacement strategy. Proc. Natl. Acad. Sci. USA 112, 6637–6642 (2015).

  11. 11.

    et al. PhosphoSitePlus, 2014: mutations, PTMs and recalibrations. Nucleic Acids Res. 43, D512–D520 (2015).

  12. 12.

    et al. Synthesis of high-quality libraries of long (150mer) oligonucleotides by a novel depurination controlled process. Nucleic Acids Res. 38, 2522–2540 (2010).

  13. 13.

    et al. Evolution of translation machinery in recoded bacteria enables multi-site incorporation of nonstandard amino acids. Nat. Biotechnol. 33, 1272–1279 (2015).

  14. 14.

    et al. Precise manipulation of chromosomes in vivo enables genome-wide codon replacement. Science 333, 348–353 (2011).

  15. 15.

    et al. SH2 domains recognize specific phosphopeptide sequences. Cell 72, 767–778 (1993).

  16. 16.

    et al. Designed phosphoprotein recognition in Escherichia coli. ACS Chem. Biol. 9, 2502–2507 (2014).

  17. 17.

    et al. ANIA: annotation and integrated analysis of the 14-3-3 interactome. Database (Oxford) 2014, bat085 (2014).

  18. 18.

    et al. TAZ: a novel transcriptional co-activator regulated by interactions with 14-3-3 and PDZ domain proteins. EMBO J. 19, 6778–6791 (2000).

  19. 19.

    et al. pLogo: a probabilistic approach to visualizing sequence motifs. Nat. Methods 10, 1211–1212 (2013).

  20. 20.

    et al. Bioinformatic and experimental survey of 14-3-3-binding sites. Biochem. J. 427, 69–78 (2010).

  21. 21.

    , & FoxO transcription factors: regulation by AKT and 14-3-3 proteins. Biochim. Biophys. Acta 1813, 1938–1945 (2011).

  22. 22.

    , , & Function of WW domains as phosphoserine- or phosphothreonine-binding modules. Science 283, 1325–1328 (1999).

  23. 23.

    , & HECT domain-containing E3 ubiquitin ligase Nedd4 interacts with and ubiquitinates Sprouty2. J. Biol. Chem. 285, 255–264 (2010).

  24. 24.

    et al. Structural studies of the Nedd4 WW domains and their selectivity for the Connexin43 (Cx43) carboxyl terminus. J. Biol. Chem. 291, 7637–7650 (2016).

  25. 25.

    et al. Ubiquitin ligase Nedd4L targets activated Smad2/3 to limit TGF-β signaling. Mol. Cell 36, 457–468 (2009).

  26. 26.

    & Nedd4 and Nedd4-2: closely related ubiquitin-protein ligases with distinct physiological functions. Cell Death Differ. 17, 68–77 (2010).

  27. 27.

    & YAP1 recruits c-Abl to protect angiomotin-like 1 from Nedd4-mediated degradation. PLoS One 7, e35735 (2012).

  28. 28.

    et al. Autoantigen discovery with a synthetic human peptidome. Nat. Biotechnol. 29, 535–541 (2011).

  29. 29.

    et al. A large-scale targeted proteomics assay resource based on an in vitro human proteome. Nat. Methods 14, 251–258 (2017).

  30. 30.

    et al. Quantifying protein interaction dynamics by SWATH mass spectrometry: application to the 14-3-3 system. Nat. Methods 10, 1246–1253 (2013).

  31. 31.

    et al. A large synthetic peptide and phosphopeptide reference library for mass spectrometry-based proteomics. Nat. Biotechnol. 31, 557–564 (2013).

  32. 32.

    et al. Scalable gene synthesis by selective amplification of DNA pools from high-fidelity microchips. Nat. Biotechnol. 28, 1295–1299 (2010).

  33. 33.

    et al. A facile strategy for selective incorporation of phosphoserine into histones. Angew. Chem. Int. Ed. Engl. 52, 5771–5775 (2013).

  34. 34.

    , & Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120 (2014).

  35. 35.

    et al. SPAK and OSR1 play essential roles in potassium homeostasis through actions on the distal convoluted tubule. J. Physiol. (Lond.) 594, 4945–4966 (2016).

  36. 36.

    & Rapid and reproducible single-stage phosphopeptide enrichment of complex peptide mixtures: application to general and phosphotyrosine-specific phosphoproteomics experiments. Anal. Chem. 83, 7635–7644 (2011).

  37. 37.

    & MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification. Nat. Biotechnol. 26, 1367–1372 (2008).

  38. 38.

    et al. ProteomeXchange provides globally coordinated proteomics data submission and dissemination. Nat. Biotechnol. 32, 223–226 (2014).

Download references


We thank J. Niemann for his assistance with selecting phospho-specific antibodies to probe phosphosite synthesis, P. Anderson for assistance with oligonucleotide library synthesis, the Yale Center for Genome Analysis for high-throughput sequencing and K. Mohler for helpful comments on the manuscript. K.W.B. is supported by the National Science Foundation Graduate Research Fellowship under grant no. DGE-1122492. R.S. received support from the Raymond and Beverly Sackler Institute for Biological, Physical and Engineering Sciences at Yale University and from Southern Connecticut State University School of Graduate Studies, Research and Innovation. J.R. is supported by the US National Institutes of Health (NIH; GM117230, GM125951, DK0174334, CA209992). F.J.I. acknowledges support from the NIH (GM117230, GM125951, CA209992), DARPA (HR0011-15-C-0091), NSF (MCB-1714860, CHE-1740549), DuPont, and the Arnold and Mabel Beckman Foundation. M.G. is supported by the NIH (HG008126).

Author information


  1. Department of Cellular & Molecular Physiology, Yale University, New Haven, Connecticut, USA.

    • Karl W Barber
    • , Svetlana Rogulina
    •  & Jesse Rinehart
  2. Systems Biology Institute, Yale University, West Haven, Connecticut, USA.

    • Karl W Barber
    • , Paul Muir
    • , Richard V Szeligowski
    • , Svetlana Rogulina
    • , Mark Gerstein
    • , Farren J Isaacs
    •  & Jesse Rinehart
  3. Department of Molecular, Cellular and Developmental Biology, Yale University, New Haven, Connecticut, USA.

    • Paul Muir
    •  & Farren J Isaacs
  4. Biology Department, Southern Connecticut State University, New Haven, Connecticut, USA.

    • Richard V Szeligowski
  5. Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut, USA.

    • Mark Gerstein
  6. Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut, USA.

    • Mark Gerstein
  7. Department of Computer Science, Yale University, New Haven, Connecticut, USA.

    • Mark Gerstein
  8. Agilent Laboratories, Agilent Technologies, Santa Clara, California, USA.

    • Jeffrey R Sampson


  1. Search for Karl W Barber in:

  2. Search for Paul Muir in:

  3. Search for Richard V Szeligowski in:

  4. Search for Svetlana Rogulina in:

  5. Search for Mark Gerstein in:

  6. Search for Jeffrey R Sampson in:

  7. Search for Farren J Isaacs in:

  8. Search for Jesse Rinehart in:


J.R., F.J.I. and K.W.B. conceived the project, designed the experiments and analyzed the data. K.W.B, R.V.S. and S.R. performed molecular cloning, protein expression/purification and western blotting. K.W.B. performed mass spectrometry and Hi-P sample preparation, data collection, and analysis. K.W.B., P.M., M.G. and F.J.I. performed high-throughput sequencing and network analysis. K.W.B and F.J.I. developed oligonucleotide design and library preparation strategies. J.R.S. provided guidance on DNA construct design strategies using Agilent's Oligo Library Synthesis. J.R. and K.W.B. wrote the manuscript with input from all of the other authors.

Competing interests

J.R., F.J.I., K.W.B. and J.R.S. have filed a provisional patent application with the US Patent and Trademark Office (US Patent Application No. 62/639,279) related to this work.

Corresponding author

Correspondence to Jesse Rinehart.

Integrated supplementary information

Supplementary information

PDF files

  1. 1.

    Supplementary Text and Figures

    Supplementary Figures 1–12

  2. 2.

    Life Sciences Reporting Summary

  3. 3.

    Supplementary Notes

    Supplementary Notes 1–11

Excel files

  1. 1.

    Supplementary Data 1

    Oligonucleotide sequences encoding human phosphoserine phosphosite library

  2. 2.

    Supplementary Data 2

    Mass spectrometry analysis of mode #1 phosphosite library preparations

  3. 3.

    Supplementary Data 3

    Phospho-specific antibodies for evaluation of recombinant phosphosites

  4. 4.

    Supplementary Data 4

    Phosphosites observed by Hi-P using 14-3-3β and SepOTSλ

  5. 5.

    Supplementary Data 5

    Phosphosites observed by Hi-P using 14-3-3σ and SepOTSλ

  6. 6.

    Supplementary Data 6

    Positional amino acid frequencies in pSer-encoding phosphosite populations interacting with 14-3-3β by Hi-P

  7. 7.

    Supplementary Data 7

    Positional amino acid frequencies in pSer-encoding phosphosite populations interacting with 14-3-3σ by Hi-P

  8. 8.

    Supplementary Data 8

    Full-length recombinant human phosphoprotein genes identified as candidate 14-3-3β candidate interactors, synthesized for pull-down studies

  9. 9.

    Supplementary Data 9

    Phosphosites observed by Hi-P using NEDD4 WW2 and SepOTSλ

  10. 10.

    Supplementary Data 10

    Phosphosites observed by Hi-P using NEDD4 WW2 and supD tRNA

  11. 11.

    Supplementary Data 11

    Phosphosites observed by Hi-P using NEDD4-2 WW2 and SepOTSλ

  12. 12.

    Supplementary Data 12

    Phosphosites observed by Hi-P using NEDD4-2 WW2 and supD tRNA

  13. 13.

    Supplementary Data 13

    Co-immunoprecipitation mass spectrometry data analysis

  14. 14.

    Supplementary Data 14

    HTS sequencing from biological triplicate Hi-P experiments

  15. 15.

    Supplementary Data 15

    Orthogonal primer sequences for phosphosite DNA library amplification

  16. 16.

    Supplementary Data 16

    Synthesized DNA sequences

  17. 17.

    Supplementary Data 17

    Primer sequences

About this article

Publication history