Article | Published:

Autoantigen discovery with a synthetic human peptidome

Nature Biotechnology volume 29, pages 535541 (2011) | Download Citation


Immune responses targeting self-proteins (autoantigens) can lead to a variety of autoimmune diseases. Identification of these antigens is important for both diagnostic and therapeutic reasons. However, current approaches to characterize autoantigens have, in most cases, met only with limited success. Here we present a synthetic representation of the complete human proteome, the T7 peptidome phage display library (T7-Pep), and demonstrate its application to autoantigen discovery. T7-Pep is composed of >413,000 36-residue, overlapping peptides that cover all open reading frames in the human genome, and can be analyzed using high-throughput DNA sequencing. We developed a phage immunoprecipitation sequencing (PhIP-Seq) methodology to identify known and previously unreported autoantibodies contained in the spinal fluid of three individuals with paraneoplastic neurological syndromes. We also show how T7-Pep can be used more generally to identify peptide-protein interactions, suggesting the broader utility of our approach for proteomic research.

  • Subscribe to Nature Biotechnology for full access:



Additional access options:

Already a subscriber?  Log in  now or  Register  for online access.


  1. 1.

    et al. Fitness correlates of heritable variation in antibody responsiveness in a wild mammal. Science 330, 662–665 (2010).

  2. 2.

    et al. Phage display of cDNA libraries: enrichment of cDNA expression using open reading frame selection. Biotechniques 36, 1018–1022 (2004).

  3. 3.

    & Paraneoplastic neurological degenerations: keys to tumour immunity. Nat. Rev. Cancer 4, 36–44 (2004).

  4. 4.

    et al. Autoantibody signatures in prostate cancer. N. Engl. J. Med. 353, 1224–1235 (2005).

  5. 5.

    et al. A protein microarray signature of autoantibody biomarkers for the early detection of breast cancer. J. Proteome Res. 10, 85–96 (2011).

  6. 6.

    , , , & Selecting open reading frames from DNA. Genome Res. 13, 980–990 (2003).

  7. 7.

    et al. Identification of Hnrph3 as an autoantigen for acute anterior uveitis. Clin. Immunol. 138, 60–66 (2011).

  8. 8.

    , , & Counting the uncountable: statistical approaches to estimating microbial diversity. Appl. Environ. Microbiol. 67, 4399–4406 (2001).

  9. 9.

    & Immune surveillance of tumors. J. Clin. Invest. 117, 1137–1146 (2007).

  10. 10.

    & Paraneoplastic syndromes involving the nervous system. N. Engl. J. Med. 349, 1543–1554 (2003).

  11. 11.

    & Paraneoplastic opsoclonus-myoclonus ataxia associated with non-small-cell lung carcinoma. J. Neurooncol. 90, 213–216 (2008).

  12. 12.

    & A two-parameter generalized Poisson model to improve the analysis of RNA-seq data. Nucleic Acids Res. 38, e170 (2010).

  13. 13.

    & Maximum likelihood estimation for the generalized poisson distribution. Comm. Statist. Theory Methods 13, 1533–1547 (1984).

  14. 14.

    & Fitting a mixture model by expectation maximization to discover motifs in biopolymers. Proc. Int. Conf. Intell. Syst. Mol. Biol. 2, 28–36 (1994).

  15. 15.

    et al. CTdatabase: a knowledge-base of high-throughput and curated data on cancer-testis antigens. Nucleic Acids Res. 37, D816–D819 (2009).

  16. 16.

    et al. Efficient simultaneous presentation of NY-ESO-1/LAGE-1 primary and nonprimary open reading frame-derived CTL epitopes in melanoma. J. Immunol. 165, 7253–7261 (2000).

  17. 17.

    et al. Identification of multiple cancer/testis antigens by allogeneic antibody screening of a melanoma cell line library. Proc. Natl. Acad. Sci. USA 95, 6919–6923 (1998).

  18. 18.

    , & The human-specific Yp11.2/Xq21.3 homology block encodes a potentially functional testis-specific TGIF-like retroposon. Mamm. Genome 13, 463–468 (2002).

  19. 19.

    et al. A genecentric human protein atlas for expression profiles based on antibodies. Mol. Cell. Proteomics 7, 2019–2027 (2008).

  20. 20.

    , , , & Identification of autoantibody epitopes of glutamic acid decarboxylase in stiff-man syndrome patients. J. Immunol. 152, 930–934 (1994).

  21. 21.

    et al. High-resolution autoreactive epitope mapping and structural modeling of the 65 kDa form of human glutamic acid decarboxylase. J. Mol. Biol. 287, 983–999 (1999).

  22. 22.

    et al. TRIM9, a novel brain-specific E3 ubiquitin ligase, is repressed in the brain of Parkinson's disease and dementia with Lewy bodies. Neurobiol. Dis. 38, 210–218 (2010).

  23. 23.

    et al. Towards a proteome-scale map of the human protein-protein interaction network. Nature 437, 1173–1178 (2005).

  24. 24.

    et al. The SIOD disorder protein SMARCAL1 is an RPA-interacting protein involved in replication fork restart. Genes Dev. 23, 2415–2425 (2009).

  25. 25.

    et al. Structural basis for the recognition of DNA repair proteins UNG2, XPA, and RAD52 by replication factor RPA. Cell 103, 449–456 (2000).

  26. 26.

    , & Continuous and discontinuous protein antigenic determinants. Nature 322, 747–748 (1986).

  27. 27.

    , & High resolution functional analysis of antibody-antigen interactions. J. Mol. Biol. 226, 851–865 (1992).

  28. 28.

    et al. Analysis of in vivo role of alpha-fodrin autoantigen in primary Sjogren's syndrome. Am. J. Pathol. 167, 1051–1059 (2005).

  29. 29.

    et al. Detection of apoptosis-specific autoantibodies directed against granzyme B-induced cleavage fragments of the SS-B (La) autoantigen in sera from patients with primary Sjogren's syndrome. Clin. Exp. Immunol. 142, 148–154 (2005).

  30. 30.

    , , & Antibodies to covalent aggregates of insulin in blood of insulin-using diabetic patients. Diabetes 36, 838–841 (1987).

  31. 31.

    et al. Autoantibodies to alpha-synuclein in inherited Parkinson's disease. J. Neurochem. 101, 749–756 (2007).

  32. 32.

    , , & The clinical spectrum of anti-GAD antibody-positive patients with stiff-person syndrome. Neurology 55, 1531–1535 (2000).

  33. 33.

    et al. Diversity of phage-displayed libraries of peptides during panning and amplification. Molecules 16, 1776–1803 (2011).

  34. 34.

    et al. hORFeome v3.1: a resource of human open reading frames representing over 10,000 human genes. Genomics 89, 307–315 (2007).

Download references


This work was supported in part by grants from the Department of Defense (W81XWH-10-1-0994 and W81XWH-04-1-0197) to S.J.E., and in part by the US National Institutes of Health (K08CA124804), The American Recovery and Reinvestment Act (3P30CA023100-25S8), Sontag Foundation Distinguished Scientist Award and a James S. McDonnell Foundation award to S.K. N.L.S. is a fellow of the Susan G. Komen for the Cure Foundation. S.J.E. is an investigator with the Howard Hughes Medical Institute. We would like to thank S. Gowrisankar, O. Iartchouk and L. Merrill for assistance with Illumina sequencing, and D. Šćepanović for statistical support.

Author information

Author notes

    • Zhenming Zhao

    Present address: Biogen Idec, Cambridge, Massachusetts, USA.


  1. Harvard-MIT Division of Health Sciences and Technology, Cambridge, Massachusetts, USA.

    • H Benjamin Larman
    •  & Uri Laserson
  2. Department of Materials Science and Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA.

    • H Benjamin Larman
  3. Department of Genetics, Harvard University Medical School, and Division of Genetics, Howard Hughes Medical Institute, Brigham and Women's Hospital, Boston, Massachusetts, USA.

    • H Benjamin Larman
    • , Zhenming Zhao
    • , Mamie Z Li
    • , Alberto Ciccia
    • , M Angelica Martinez Gakidis
    • , Nicole L Solimini
    •  & Stephen J Elledge
  4. Department of Mathematics, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA.

    • Uri Laserson
  5. Department of Genetics, Harvard University Medical School, Boston, Massachusetts, USA.

    • Uri Laserson
    •  & George M Church
  6. Division of Neuro-Oncology, Department of Neurosciences, University of California, San Diego, Moores Cancer Center, La Jolla, California, USA.

    • Santosh Kesari
  7. Agilent Technologies, Genomics, Santa Clara, California, USA.

    • Emily M LeProust


  1. Search for H Benjamin Larman in:

  2. Search for Zhenming Zhao in:

  3. Search for Uri Laserson in:

  4. Search for Mamie Z Li in:

  5. Search for Alberto Ciccia in:

  6. Search for M Angelica Martinez Gakidis in:

  7. Search for George M Church in:

  8. Search for Santosh Kesari in:

  9. Search for Emily M LeProust in:

  10. Search for Nicole L Solimini in:

  11. Search for Stephen J Elledge in:


S.J.E. conceived the project, which was supervised by N.L.S. and S.J.E. Z.Z. designed the DNA sequences for synthesis. Oligo libraries were constructed by E.M.L. Cloning was performed by M.Z.L., M.A.M.G. and N.L.S. The T7-Pep, T7-NPep, and T7-CPep phage libraries were constructed by N.L.S. and characterized by N.L.S. and H.B.L. The PhIP-Seq protocol was developed and implemented by H.B.L. Clinical evaluations and patient sample acquisitions were performed by S.K. Statistical analysis of PhIP-Seq data was conceived by U.L. under the supervision of G.M.C. and implemented by H.B.L. PhIP-Seq candidates were confirmed by H.B.L. The RPA2 experiment was performed by A.C. The manuscript was prepared by H.B.L. and edited by N.L.S. and S.J.E.

Competing interests

The authors declare no competing financial interests.

Corresponding authors

Correspondence to Nicole L Solimini or Stephen J Elledge.

Supplementary information

PDF files

  1. 1.

    Supplementary Text and Figures

    Supplementary Tables 1–3 and Supplementary Figs. 1–9

About this article

Publication history





Rights and permissions

To obtain permission to re-use content from this article visit RightsLink.