Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • Protocol
  • Published:

CellPhoneDB: inferring cell–cell communication from combined expression of multi-subunit ligand–receptor complexes

Abstract

Cell–cell communication mediated by ligand–receptor complexes is critical to coordinating diverse biological processes, such as development, differentiation and inflammation. To investigate how the context-dependent crosstalk of different cell types enables physiological processes to proceed, we developed CellPhoneDB, a novel repository of ligands, receptors and their interactions. In contrast to other repositories, our database takes into account the subunit architecture of both ligands and receptors, representing heteromeric complexes accurately. We integrated our resource with a statistical framework that predicts enriched cellular interactions between two cell types from single-cell transcriptomics data. Here, we outline the structure and content of our repository, provide procedures for inferring cell–cell communication networks from single-cell RNA sequencing data and present a practical step-by-step guide to help implement the protocol. CellPhoneDB v.2.0 is an updated version of our resource that incorporates additional functionalities to enable users to introduce new interacting molecules and reduces the time and resources needed to interrogate large datasets. CellPhoneDB v.2.0 is publicly available, both as code and as a user-friendly web interface; it can be used by both experts and researchers with little experience in computational genomics. In our protocol, we demonstrate how to evaluate meaningful biological interactions with CellPhoneDB v.2.0 using published datasets. This protocol typically takes ~2 h to complete, from installation to statistical analysis and visualization, for a dataset of ~10 GB, 10,000 cells and 19 cell types, and using five threads.

This is a preview of subscription content, access via your institution

Access options

Buy this article

Prices may be subject to local taxes which are calculated during checkout

Fig. 1: Overview of the database.
Fig. 2: Overview of the statistical method framework used to infer ligand–receptor complexes specific to two cell types from single-cell transcriptomics data.
Fig. 3: Example dataset run with CellPhoneDB and CellPhoneDB v.2.0.
Fig. 4: Diagram showing how lists are generated.
Fig. 5: Screenshot of the web portal.

Similar content being viewed by others

Data availability

The decidua and placenta datasets can be downloaded from ArrayExpress, with experiment code E-MTAB-6701.

Code availability

The CellPhoneDB code is available at https://github.com/Teichlab/cellphonedb. It can also be downloaded from https://cellphonedb.org/downloads. The code in this paper has been peer-reviewed.

References

  1. Svensson, V., Vento-Tormo, R. & Teichmann, S. A. Exponential scaling of single-cell RNA-seq in the past decade. Nat. Protoc. 13, 599–604 (2018).

    Article  CAS  Google Scholar 

  2. Ramilowski, J. A. et al. A draft network of ligand-receptor-mediated multicellular signalling in human. Nat. Commun. 6, 7866 (2015).

  3. Hie, B., Cho, H., DeMeo, B., Bryson, B. & Berger, B. Geometric sketching compactly summarizes the single-cell transcriptomic landscape. Cell Syst. 8, 483–493.e7 (2018).

  4. Shannon, P. et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 13, 2498–2504 (2003).

    Article  CAS  Google Scholar 

  5. Vento-Tormo, R. et al. Single-cell reconstruction of the early maternal-fetal interface in humans. Nature 563, 347–353 (2018).

    Article  CAS  Google Scholar 

  6. Braga, F. A. V. et al. A cellular census of human lungs identifies novel cell states in health and in asthma. Nat. Med. 25, 1153–1163 (2019).

  7. Stewart, B. J. et al. Spatiotemporal immune zonation of the human kidney. Science 365, 1461–1466 (2019).

    Article  CAS  Google Scholar 

  8. Popescu, D.-M. et al. Decoding the development of the blood and immune systems during human fetal liver haematopoiesis. Nature 574, 365–371 (2019).

    Article  CAS  Google Scholar 

  9. Davidson, S. et al. Single-cell RNA sequencing reveals a dynamic stromal niche within the evolving tumour microenvironment. Preprint at bioRxiv: https://doi.org/10.1101/467225 (2018)

  10. Skelly, D. A. et al. Single-cell transcriptional profiling reveals cellular diversity and intercommunication in the mouse heart. Cell Rep. 22, 600–610 (2018).

    Article  CAS  Google Scholar 

  11. Camp, J. G. et al. Multilineage communication regulates human liver bud development from pluripotency. Nature 546, 533–538 (2017).

    Article  CAS  Google Scholar 

  12. Pavličev, M. et al. Single-cell transcriptomics of the human placenta: inferring the cell communication network of the maternal-fetal interface. Genome Res. 27, 349–361 (2017).

    Article  Google Scholar 

  13. Puram, S. V. et al. Single-cell transcriptomic analysis of primary and metastatic tumor ecosystems in head and neck cancer. Cell 171, 1611–1624.e24 (2017).

    Article  CAS  Google Scholar 

  14. Suryawanshi, H. et al. A single-cell survey of the human first-trimester placenta and decidua. Sci. Adv. 4, eaau4788 (2018).

    Article  CAS  Google Scholar 

  15. Zhou, J. X., Taramelli, R., Pedrini, E., Knijnenburg, T. & Huang, S. Author correction: Extracting intercellular signaling network of cancer tissues using ligand-receptor expression patterns from whole-tumor and single-cell transcriptomes. Sci. Rep. 8, 17903 (2018).

    Article  CAS  Google Scholar 

  16. Cohen, M. et al. Lung single-cell signaling interaction map reveals basophil role in macrophage imprinting. Cell 175, 1031–1044.e18 (2018).

    Article  CAS  Google Scholar 

  17. Kumar, M. P. et al. Analysis of single-cell rna-seq identifies cell-cell communication associated with tumor characteristics. Cell Rep. 25, 1458–1468.e4 (2018).

    Article  CAS  Google Scholar 

  18. Halpern, K. B. et al. Paired-cell sequencing enables spatial gene expression mapping of liver endothelial cells. Nat. Biotechnol. 36, 962–970 (2018).

    Article  CAS  Google Scholar 

  19. Joost, S. et al. Single-cell transcriptomics of traced epidermal and hair follicle stem cells reveals rapid adaptations during wound healing. Cell Rep. 25, 585–597.e7 (2018).

    Article  CAS  Google Scholar 

  20. Boisset, J.-C. et al. Mapping the physical network of cellular interactions. Nat. Methods 15, 547–553 (2018).

    Article  CAS  Google Scholar 

  21. Lubeck, E., Coskun, A. F., Zhiyentayev, T., Ahmad, M. & Cai, L. Single-cell in situ RNA profiling by sequential hybridization. Nat. Methods 11, 360–361 (2014).

    Article  CAS  Google Scholar 

  22. Ståhl, P. L. et al. Visualization and analysis of gene expression in tissue sections by spatial transcriptomics. Science 353, 78–82 (2016).

    Article  Google Scholar 

  23. Rodriques, S. G. et al. Slide-seq: a scalable technology for measuring genome-wide expression at high spatial resolution. Science 363, 1463–1467 (2019).

    Article  CAS  Google Scholar 

  24. Svensson, V. A method for transcriptome-wide gene expression quantification in intact tissues. Immunol. Cell Biol. 97, 439–441 (2019).

    Article  Google Scholar 

  25. Finn, R. D. et al. Pfam: the protein families database. Nucleic Acids Res. 42, D222–D230 (2014).

    Article  CAS  Google Scholar 

  26. Proteomics Standards Initiative. Proteomics Standards Initiative common query interface in Encyclopedia of Systems Biology (eds Dubitzky, W., Wolkenhauer, O., Cho, K.H. & Yokota, H.) 1798–1798 (Springer, 2013): https://doi.org/10.1007/978-1-4419-9863-7_101243

  27. Protein interaction data curation: the International Molecular Exchange (IMEx) consortium. Nat. Methods 9, 345–350 (2012).

  28. Orchard, S. et al. The MIntAct project-IntAct as a common curation platform for 11 molecular interaction databases. Nucleic Acids Res. 42, D358–D363 (2014).

    Article  CAS  Google Scholar 

  29. Breuer, K. et al. InnateDB: systems biology of innate immunity and beyond-recent updates and continuing curation. Nucleic Acids Res. 41, D1228–D1233 (2013).

    Article  CAS  Google Scholar 

  30. Clerc, O. et al. MatrixDB: integration of new data with a focus on glycosaminoglycan interactions. Nucleic Acids Res. 47, D376–D381 (2019).

    Article  CAS  Google Scholar 

  31. Licata, L. et al. MINT, the molecular interaction database: 2012 update. Nucleic Acids Res. 40, D857–D861 (2012).

    Article  CAS  Google Scholar 

  32. Brown, K. R. & Jurisica, I. Unequal evolutionary conservation of human protein interactions in interologous networks. Genome Biol. 8, R95 (2007).

    Article  Google Scholar 

  33. Bachelerie, F. et al. International Union of Pharmacology. LXXXIX. Update on the extended family of chemokine receptors and introducing a new nomenclature for atypical chemokine receptors. Pharmacol. Rev. 66, 1–79 (2014). erratum 66, 467 (2014).

    Article  Google Scholar 

  34. Satija, R. et al. Spatial reconstruction of the single-cell gene expression data. Nat. Biotechnol. 33, 495–502 (2015).

    Article  CAS  Google Scholar 

  35. Wolf, F. A. et al. SCANPY: large-scale single-cell gene expression data analysis. Genome Biol. 19, 15 (2018).

    Article  Google Scholar 

Download references

Acknowledgements

We thank K. Meyer and M. Stubbington for scientific discussions; P. Porras for advice on querying the IMEx database; L. Garcia-Alonso and K. Polanski for carefully reading the manuscript; G.J. Wright, L. Wood and G. Graham for advice on protein–protein interactions; and J. Eliasova and A. Hupalowska for help with the illustrations. We are grateful to A. Lopez and YDEVS members for their help with the webserver and the implementation of the code in GitHub, as well as to all the Teichmann lab and Vento-Tormo lab members for their fruitful advice. The project was supported by Wellcome Sanger core funding (WT206194) and a Wellcome Strategic Support Science award (211276/Z/18/Z).

Author information

Authors and Affiliations

Authors

Contributions

M.E., S.A.T. and R.V.-T. conceived and developed the protocol and wrote the manuscript. M.V.-T. developed the database, implemented the code in the webserver and GitHub and contributed to writing the manuscript.

Corresponding author

Correspondence to Roser Vento-Tormo.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Protocols thanks Evangelia Petsalaki and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Related links

Key references using this protocol

Vento-Tormo, R. et al. Nature 563, 347–353 (2018): https://doi.org/10.1038/s41586-018-0698-6

Stewart, B. et al. Science 365, 1461–1466 (2019): https://doi.org/10.1126/science.aat5031

Popescu, D. et al. Nature 574, 365–371 (2019): https://doi.org/10.1038/s41586-019-1652-y

Integrated supplementary information

Supplementary Figure 1 Diagram of the database structure.

a) database schema, b) protein_input/complex_input storage in the CellPhoneDB database tables. The multidata entity stores fields common to complex_input and protein_input. This makes it easier and faster for the user to perform interaction queries because interaction_table is only related to multidata_table. All non-common fields are stored in either protein_table or complex_table. Complex fields are stored in complex_composition_table. The is_complex and total_protein field are created for optimization purposes.

Supplementary Figure 2 Example of complex_input components stored in CellPhoneDB.

An example of two complex_input rows with two and four components.

Supplementary information

Supplementary Information

Supplementary Figs. 1 and 2.

Reporting Summary

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Efremova, M., Vento-Tormo, M., Teichmann, S.A. et al. CellPhoneDB: inferring cell–cell communication from combined expression of multi-subunit ligand–receptor complexes. Nat Protoc 15, 1484–1506 (2020). https://doi.org/10.1038/s41596-020-0292-x

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1038/s41596-020-0292-x

This article is cited by

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Search

Quick links

Nature Briefing AI and Robotics

Sign up for the Nature Briefing: AI and Robotics newsletter — what matters in AI and robotics research, free to your inbox weekly.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing: AI and Robotics