Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • Research Briefing
  • Published:

Imbalanced single-cell data integration leads to loss of biological information

The Iniquitate pipeline assessed the impacts of cell-type imbalance on single-cell RNA sequencing integration through perturbations to dataset balance. The results indicated that cell-type imbalance not only leads to loss of biological signal in the integrated space, but also can change the interpretation of downstream analyses after integration.

This is a preview of subscription content, access via your institution

Access options

Buy this article

Prices may be subject to local taxes which are calculated during checkout

Fig. 1: The Iniquitate pipeline, experiments and analyses.

References

  1. Svensson, V., da Veiga Beltrame, E. & Pachter, L. A curated database reveals trends in single-cell transcriptomics. Database 2020, baaa073 (2020). A database that curates papers that use single-cell RNA sequencing technology and tracks key factors such as tissue type, techniques and number of cells sequenced.

    Article  PubMed  PubMed Central  Google Scholar 

  2. Argelaguet, R. et al. Computational principles and challenges in single-cell data integration. Nat. Biotechnol. 39, 1202–1215 (2021). This review paper presents current paradigms in single-cell data integration, outstanding challenges and future directions.

    Article  CAS  PubMed  Google Scholar 

  3. Tran, H. T. N. et al. A benchmark of batch-effect correction methods for single-cell RNA sequencing data. Genome Biol. 21, 12 (2020). This paper is a comprehensive benchmarking analysis of single-cell RNA sequencing integration methods across various data scenarios.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  4. Luecken, M. D. et al. Benchmarking atlas-level data integration in single-cell genomics. Nat. Methods 19, 41–50 (2022). A comprehensive benchmark that expands on the work done by Tran et al. by incorporating more methods, modalities and preprocessing parameters, and larger atlas-level datasets.

    Article  CAS  PubMed  Google Scholar 

Download references

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This is a summary of: Maan, H. et al. Characterizing the impacts of dataset imbalance on single-cell data integration. Nat. Biotechnol. https://doi.org/10.1038/s41587-023-02097-9 (2024).

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Imbalanced single-cell data integration leads to loss of biological information. Nat Biotechnol (2024). https://doi.org/10.1038/s41587-023-02114-x

Download citation

  • Published:

  • DOI: https://doi.org/10.1038/s41587-023-02114-x

Search

Quick links

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing