Abstract
Spatially resolved transcriptomics (SRT) studies are becoming increasingly common and large, offering unprecedented opportunities in mapping complex tissue structures and functions. Here we present integrative and reference-informed tissue segmentation (IRIS), a computational method designed to characterize tissue spatial organization in SRT studies through accurately and efficiently detecting spatial domains. IRIS uniquely leverages single-cell RNA sequencing data for reference-informed detection of biologically interpretable spatial domains, integrating multiple SRT slices while explicitly considering correlations both within and across slices. We demonstrate the advantages of IRIS through in-depth analysis of six SRT datasets encompassing diverse technologies, tissues, species and resolutions. In these applications, IRIS achieves substantial accuracy gains (39–1,083%) and speed improvements (4.6–666.0) in moderate-sized datasets, while representing the only method applicable for large datasets including Stereo-seq and 10x Xenium. As a result, IRIS reveals intricate brain structures, uncovers tumor microenvironment heterogeneity and detects structural changes in diabetes-affected testis, all with exceptional speed and accuracy.
This is a preview of subscription content, access via your institution
Access options
Access Nature and 54 other Nature Portfolio journals
Get Nature+, our best-value online-access subscription
24,99 € / 30 days
cancel any time
Subscribe to this journal
Receive 12 print issues and online access
274,03 € per year
only 22,84 € per issue
Buy this article
- Purchase on SpringerLink
- Instant access to full article PDF
Prices may be subject to local taxes which are calculated during checkout






Similar content being viewed by others
Data availability
The original public data used in this work can be accessed through the following links: human DLPFC data by 10x Visium available at http://spatial.libd.org/spatialLIBD/, with human post-mortem brain single-nucleus RNA-seq reference data available at Synapse (https://www.synapse.org/#!Synapse:syn18485175); human SCC data by ST are available at GEO accession GSE144240, with the human SCC scRNA-seq reference data available at GEO accession GSE144236; mouse spermatogenesis data by Slide-seq are available at https://www.dropbox.com/s/ygzpj0d0oh67br0/Testis_Slideseq_Data.zip?dl=0, with the mouse testis scRNA-seq reference data available at GEO accession GSE112393; mouse brain (coronal section) Vizgen MERFISH data are available at https://info.vizgen.com/mouse-brain-data, with the mouse brain scRNA-seq reference data available at http://mousebrain.org/adolescent/; mouse olfactory bulb by Stereo-seq data are available at https://db.cngb.org/stomics/mosta/download/, with the mouse olfactory bulb scRNA-seq data available at GEO accession GSE121891; human BC by 10x Xenium data are available at https://www.10xgenomics.com/products/xenium-in-situ/preview-dataset-human-breast, with the human BC scRNA-seq reference data available at GSE accession GSE176078; details about the data we used in this study are provided in Supplementary Tables 1 and 2.
Code availability
The IRIS software package and source code have been deposited at https://xiangzhou.github.io/software/ and https://github.com/YingMa0107/IRIS. All scripts used to reproduce all the analysis are also available at the same website.
References
Chen, K. H., Boettiger, A. N., Moffitt, J. R., Wang, S. & Zhuang, X. Spatially resolved, highly multiplexed RNA profiling in single cells. Science 348, aaa6090 (2015).
Janesick, A. et al. High resolution mapping of the tumor microenvironment using integrated single-cell, spatial and in situ analysis. Nat. Commun. 14, 8353 (2023).
Wang, X. et al. Three-dimensional intact-tissue sequencing of single-cell transcriptional states. Science 361, eaat5691 (2018).
Ståhl, P. L. et al. Visualization and analysis of gene expression in tissue sections by spatial transcriptomics. Science 353, 78–82 (2016).
10x Genomics: Visium Spatial Gene Expression. 10x Genomics https://www.10xgenomics.com/ (2024).
Rodriques, S. G. et al. Slide-seq: a scalable technology for measuring genome-wide expression at high spatial resolution. Science 363, 1463–1467 (2019).
Stickels, R. R. et al. Highly sensitive spatial transcriptomics at near-cellular resolution with Slide-seqV2. Nat. Biotechnol. 39, 313–319 (2021).
Chen, A. et al. Spatiotemporal transcriptomic atlas of mouse organogenesis using DNA nanoball-patterned arrays. Cell 185, 1777–1792. e1721 (2022).
Moses, L. & Pachter, L. Museum of spatial transcriptomics. Nat. Methods https://doi.org/10.1038/s41592-022-01409-2 (2022).
Rao, A., Barkley, D., França, G. S. & Yanai, I. Exploring tissue architecture using spatial transcriptomics. Nature 596, 211–220 (2021).
Hu, J. et al. SpaGCN: integrating gene expression, spatial location and histology to identify spatial domains and spatially variable genes by graph convolutional network. Nat. Methods 18, 1342–1351 (2021).
Zhao, E. et al. Spatial transcriptomics at subspot resolution with BayesSpace. Nat. Biotechnol. 39, 1375–1384 (2021).
Li, Z. & Zhou, X. BASS: multi-scale and multi-sample analysis enables accurate cell type clustering and spatial domain detection in spatial transcriptomic studies. Genome Biol. 23, 1–35 (2022).
Xu, H. et al. Unsupervised spatially embedded deep representation of spatial transcriptomics. Genome Med. 16, 12 (2024).
Long, Y. et al. Spatially informed clustering, integration, and deconvolution of spatial transcriptomics with GraphST. Nat. Commun. 14, 1155 (2023).
Dong, K. & Zhang, S. Deciphering spatial domains from spatially resolved transcriptomics with an adaptive graph attention auto-encoder. Nat. Commun. 13, 1739 (2022).
Allen, C., Chang, Y., Ma, Q. & Chung, D. MAPLE: a hybrid framework for multi-sample spatial transcriptomics data. Preprint at bioRxiv https://doi.org/10.1101/2022.02.28.482296 (2022).
Ma, Y. & Zhou, X. Spatially informed cell-type deconvolution for spatial transcriptomics. Nat. Biotechnol. https://doi.org/10.1038/s41587-022-01273-7 (2022).
Li, H. et al. A comprehensive benchmarking with practical guidelines for cellular deconvolution of spatial transcriptomics. Nat. Commun. 14, 1548 (2023).
Bove, A. et al. Local cellular neighborhood controls proliferation in cell competition. Mol. Biol. Cell 28, 3215–3228 (2017).
Maynard, K. R. et al. Transcriptome-scale spatial gene expression in the human dorsolateral prefrontal cortex. Nat. Neurosci. 24, 425–436 (2021).
Mathys, H. et al. Single-cell transcriptomic analysis of Alzheimer’s disease. Nature 570, 332–337 (2019).
Huuki-Myers, L. et al. Integrated single cell and unsupervised spatial transcriptomic analysis defines molecular anatomy of the human dorsolateral prefrontal cortex. Preprint at bioRxiv. https://doi.org/10.1101/2023.02.15.528722 (2023).
Tang, Q. et al. Anatomical organization and spatiotemporal firing patterns of layer 3 neurons in the rat medial entorhinal cortex. J. Neurosci. 35, 12346–12354 (2015).
Chung, W.-S. et al. Novel allele-dependent role for APOE in controlling the rate of synapse pruning by astrocytes. Proc. Natl Acad. Sci. USA 113, 10186–10191 (2016).
Ocklenburg, S. et al. Myelin water fraction imaging reveals hemispheric asymmetries in human white matter that are associated with genetic variation in PLP1. Mol. Neurobiol. 56, 3999–4012 (2019).
Swanson, O. K. & Maffei, A. From hiring to firing: activation of inhibitory neurons and their recruitment in behavior. Front. Mol. Neurosci. 12, 168 (2019).
Ji, A. L. et al. Multimodal analysis of composition and spatial architecture in human squamous cell carcinoma. Cell 182, 497–514. e422 (2020).
Yano, S., Tazawa, H., Kagawa, S., Fujiwara, T. & Hoffman, R. M. FUCCI real-time cell-cycle imaging as a guide for designing improved cancer therapy: a review of innovative strategies to target quiescent chemo-resistant cancer cells. Cancers 12, 2655 (2020).
Pastushenko, I. & Blanpain, C. EMT transition states during tumor progression and metastasis. Trends Cell Biol. 29, 212–226 (2019).
Kim, J. H. & Kim, N. Regulation of NFATc1 in osteoclast differentiation. J. bone Metab. 21, 233–241 (2014).
Olson, E., Geng, J. & Raghavan, M. Polymorphisms of HLA-B: influences on assembly and immunity. Curr. Opin. Immunol. 64, 137–145 (2020).
Chen, H. et al. Dissecting mammalian spermatogenesis using spatial transcriptomics. Cell Rep. 37, 109915 (2021).
Green, C. D. et al. A comprehensive roadmap of murine spermatogenesis defined by single-cell RNA-seq. Dev. Cell 46, 651–667. e610 (2018).
Linn, E., Ghanem, L., Bhakta, H., Greer, C. & Avella, M. Genes regulating spermatogenesis and sperm function associated with rare disorders. Front. Cell Dev. Biol. 9, 634536 (2021).
Ren, S. et al. The expression, function, and utilization of Protamine1: a literature review. Transl. Cancer Res. 10, 4947 (2021).
Yan, W. et al. Zmynd15 encodes a histone deacetylase-dependent transcriptional repressor essential for spermiogenesis and male fertility. J. Biol. Chem. 285, 31418–31426 (2010).
Lehtiniemi, T. & Kotaja, N. in Genetics of Human Infertility Vol. 21 (eds Vogt, P. H. & Schmid, K.) 101–115 (2017).
Wang, X. et al. BET bromodomain inhibitor JQ1 regulates spermatid development by changing chromatin conformation in mouse spermatogenesis. Genes Dis. 9, 1062–1073 (2022).
Nguyen, A. L. & Schindler, K. Specialize and divide (twice): functions of three aurora kinase homologs in mammalian oocyte meiotic maturation. Trends Genet. 33, 349–363 (2017).
Zhang, L. et al. Reprogramming of Sertoli cells to fetal-like Leydig cells by Wt1 ablation. Proc. Natl Acad. Sci. USA 112, 4003–4008 (2015).
Ding, G.-L. et al. The effects of diabetes on male fertility and epigenetic regulation during spermatogenesis. Asian J. Androl. 17, 948 (2015).
Antonetti, D. A., Reynet, C. & Kahn, C. R. Increased expression of mitochondrial-encoded genes in skeletal muscle of humans with diabetes mellitus. J. Clin. Investig. 95, 1383–1388 (1995).
Al‑Kafaji, G., Sabry, M. A. & Bakhiet, M. Increased expression of mitochondrial DNA‑encoded genes in human renal mesangial cells in response to high glucose‑induced reactive oxygen species. Mol. Med. Rep. 13, 1774–1780 (2016).
Tepe, B. et al. Single-cell RNA-seq of mouse olfactory bulb reveals cellular heterogeneity and activity-dependent molecular census of adult-born neurons. Cell Rep. 25, 2689–2703. e2683 (2018).
Nagayama, S., Homma, R. & Imamura, F. Neuronal organization of olfactory bulb circuits. Front. Neural Circuits 8, 98 (2014).
Gudjohnsen, S. A. et al. Meningeal melanocytes in the mouse: distribution and dependence on Mitf. Front. Neuroanat. 9, 149 (2015).
DeSisto, J. et al. Single-cell transcriptomic analyses of the developing meninges reveal meningeal fibroblast diversity and function. Dev. Cell 54, 43–59. e44 (2020).
Chen, Y. et al. A single-cell atlas of mouse olfactory bulb chromatin accessibility. J. Genet. Genomics 48, 147–162 (2021).
Li, C. et al. Identifying potential diagnostic genes for diabetic nephropathy based on hypoxia and immune status. J. Inflamm. Res. 14, 6871 (2021).
Zeppilli, S. et al. Molecular characterization of projection neuron subtypes in the mouse olfactory bulb. eLife 10, e65445 (2021).
Wang, D. et al. VIP interneurons regulate olfactory bulb output and contribute to odor detection and discrimination. Cell Rep. 38, 110383 (2022).
Hawrylycz, M. et al. in Springer Handbook of Bio-/Neuroinformatics 1111–1126 (Springer, 2014).
Haslinger, A., Schwarz, T. J., Covic, M. & Chichung Lie, D. Expression of Sox11 in adult neurogenic niches suggests a stage-specific role in adult neurogenesis. Eur. J. Neurosci. 29, 2103–2114 (2009).
Young, J. K., Heinbockel, T. & Gondré-Lewis, M. C. Astrocyte fatty acid binding protein-7 is a marker for neurogenic niches in the rat hippocampus. Hippocampus 23, 1476–1483 (2013).
Fujiwara, N. & Cave, J. W. Partial conservation between mice and humans in olfactory bulb interneuron transcription factor codes. Front. Neurosci. 10, 337 (2016).
Sun, X., Liu, X., Starr, E. R. & Liu, S. CCKergic tufted cells differentially drive two anatomically segregated inhibitory circuits in the mouse olfactory bulb. J. Neurosci. 40, 6189–6206 (2020).
Wu, S. Z. et al. A single-cell and spatially resolved atlas of human breast cancers. Nat. Genet. 53, 1334–1347 (2021).
Su, K. et al. TC-1 (c8orf4) enhances aggressive biologic behavior in lung cancer through the Wnt/β-catenin pathway. J. Surg. Res. 185, 255–263 (2013).
Han, S. et al. CEACAM5 and CEACAM6 are major target genes for Smad3-mediated TGF-β signaling. Oncogene 27, 675–683 (2008).
Von Ahrens, D., Bhagat, T. D., Nagrath, D., Maitra, A. & Verma, A. The role of stromal cancer-associated fibroblasts in pancreatic cancer. J. Hematol. Oncol. 10, 1–8 (2017).
Yu, D. & Hung, M.-C. Overexpression of ErbB2 in cancer and ErbB2-targeting strategies. Oncogene 19, 6115–6121 (2000).
Miligy, I. M. et al. The clinical and biological significance of HER2 over-expression in breast ductal carcinoma in situ: a large study from a single institution. Br. J. Cancer 120, 1075–1082 (2019).
Thorat, M. A. et al. Prognostic and predictive value of HER2 expression in ductal carcinoma in situ: results from the UK/ANZ DCIS randomized trial. Clin. Cancer Res. 27, 5317–5324 (2021).
Bergholtz, H. et al. Contrasting DCIS and invasive breast cancer by subtype suggests basal-like DCIS as distinct lesions. NPJ Breast Cancer 6, 26 (2020).
Jian, L. et al. AGR3 promotes estrogen receptor‑positive breast cancer cell proliferation in an estrogen‑dependent manner. Oncol. Lett. 20, 1441–1451 (2020).
O’Brien, S. L. et al. CENP‐F expression is associated with poor prognosis and chromosomal instability in patients with primary breast cancer. Int. J. Cancer 120, 1434–1443 (2007).
Li, B. et al. Benchmarking spatial and single-cell transcriptomics integration methods for transcript distribution prediction and cell type deconvolution. Nat. Methods 19, 662–670 (2022).
Sun, D., Liu, Z., Li, T., Wu, Q. & Wang, C. STRIDE: accurately decomposing and integrating spatial transcriptomics using single-cell RNA sequencing. Nucleic Acids Res. 50, e42 (2022).
Nitzan, M., Karaiskos, N., Friedman, N. & Rajewsky, N. Gene expression cartography. Nature 576, 132–137 (2019).
Cang, Z. & Nie, Q. Inferring spatial and signaling relationships between cells from single cell transcriptomic data. Nat. Commun. 11, 1–13 (2020).
Abdelaal, T., Mourragui, S., Mahfouz, A. & Reinders, M. J. SpaGE: spatial gene enhancement using scRNA-seq. Nucleic Acids Res. 48, e107–e107 (2020).
Lopez, R. et al. A joint model of unpaired data from scRNA-seq and spatial transcriptomics for imputing missing gene expression measurements. Preprint at arXiv https://doi.org/10.48550/arXiv.1905.02269 (2019).
Biancalani, T. et al. Deep learning and alignment of spatially resolved single-cell transcriptomes with Tangram. Nat. Methods 18, 1352–1362 (2021).
Cable, D. M. et al. Robust decomposition of cell type mixtures in spatial transcriptomics. Nat. Biotechnol. 40, 517–526 (2022).
Kleshchevnikov, V. et al. Cell2location maps fine-grained cell types in spatial transcriptomics. Nat. Biotechnol. 40, 661–671 (2022).
Korotkevich, G. et al. Fast gene set enrichment analysis. Preprint at bioRxiv https://doi.org/10.1101/060012 (2021).
Allen Reference Atlas – Mouse Brain [brain atlas]. Allen Institute for Brain Science https://atlas.brain-map.org/ (2024).
Imamura, F., Ito, A. & LaFever, B. J. Subpopulations of projection neurons in the olfactory bulb. Front. Neural Circuits 14, 561822 (2020).
Wang, X., Park, J., Susztak, K., Zhang, N. R. & Li, M. Bulk tissue cell type deconvolution with multi-subject single-cell expression reference. Nat. Commun. 10, 1–9 (2019).
Chung, F. R. & Graham, F. C. Spectral Graph Theory Vol. 92 (American Mathematical Society, 1997).
Grone, R., Merris, R. & Sunder, V. The Laplacian spectrum of a graph. SIAM J. Matrix Anal. Appl. 11, 218–238 (1990).
Argelaguet, R., Cuomo, A. S., Stegle, O. & Marioni, J. C. Computational principles and challenges in single-cell data integration. Nat. Biotechnol. 39, 1202–1215 (2021).
Korsunsky, I., Nathan, A., Millard, N. & Raychaudhuri, S. Presto scales Wilcoxon and auROC analyses to millions of observations. Preprint at bioRxiv https://doi.org/10.1101/653253 (2019).
Dolgalev, I. msigdbr: MSigDB gene sets for multiple organisms in a tidy data format. R package version 7. R Project https://cran.r-project.org/web/packages/msigdbr/msigdbr.pdf (2020).
Peres, R. M. R. et al. Comparative evaluation of the erbB2 and hormone receptor status of neighboring invasive and in situ components of ductal carcinomas of the breast. Int. J. Biol. Markers 24, 238–244 (2009).
Villanueva, H. et al. The emerging roles of steroid hormone receptors in ductal carcinoma in situ (DCIS) of the breast. J. Mammary Gland Biol. Neoplasia 23, 237–248 (2018).
Hussein, M. R., Abd-Elwahed, S. R. & Abdulwahed, A. R. Alterations of estrogen receptors, progesterone receptors and c-erbB2 oncogene protein expression in ductal carcinomas of the breast. Cell Biol. Int. 32, 698–707 (2008).
Goh, C. W. et al. Invasive ductal carcinoma with coexisting ductal carcinoma in situ (IDC/DCIS) versus pure invasive ductal carcinoma (IDC): a comparison of clinicopathological characteristics, molecular subtypes, and clinical outcomes. J. Cancer Res. Clin. Oncol. 145, 1877–1886 (2019).
Harada, S. et al. The significance of HER‐2/neu receptor positivity and immunophenotype in ductal carcinoma in situ with early invasive disease. J. Surg. Oncol. 104, 458–465 (2011).
Miligy, I. M. et al. The clinical significance of oestrogen receptor expression in breast ductal carcinoma in situ. Br. J. Cancer 123, 1513–1520 (2020).
Voorwerk, L. et al. Immune landscape of breast tumors with low and intermediate estrogen receptor expression. NPJ Breast Cancer 9, 39 (2023).
Acknowledgements
This study was supported by the National Institutes of Health (NIH) grants R01GM126553, R01HG011883 and R01GM144960, all to X.Z.
Author information
Authors and Affiliations
Contributions
Y.M. and X.Z. conceived the idea and designed the study. Y.M. developed the method, implemented the software and analyzed real data. Y.M. and X.Z. wrote the manuscript, and all authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare that they have no competing interests.
Peer review
Peer review information
Nature Methods thanks Chenfei Wang and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Rita Strack, in collaboration with the Nature Methods team.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Extended data
Extended Data Fig. 1 Analyzing the mouse brain Vizgen MERFISH single-cell data.
(a) The structure of the mouse brain with the main tissue regions annotated from the Allen Reference Atlas – Mouse Brain. (b) Spatial domains detected by IRIS, spaGCN, BayesSpace, BayesSpaceJoint, and SEDR in the ‘Sample 2’ analysis. Details of the tissue slices used in this setting are provided in Supplementary Table 2. (c) Boxplots display CHAOS values for different methods, which measure the spatial continuity and compactness of the detected spatial domains from different methods, on the tissue replicates of slice 2 (n = 2, left panel). Each boxplot ranges from the first and third quartiles with the median as the horizontal line while whiskers represent 1.5 times the interquartile range from the lower and upper bounds of the box. Compared spatial domain detection methods (x-axis) include spaGCN (yellow), BayesSpace (purple), BayesSpaceJoint (green), SEDR (blue), and IRIS (red). Line plots display CHAOS values when varying the pre-specified number of spatial domains. The median CHAOSs across all 3 tissue replicates of regional sample 2 was calculated. (d) Scatter plots display the spatial distribution of important brain tissue related marker genes. (e) Heatmap plot displays the estimated mean cell type proportion for representative cell types in each spatial domain detected by IRIS. Color scale was normalized to 0-1 range. (f) Spatial scatter plot displays the spatial distribution of IRIS estimated cell type proportion for representative cell types across spatial locations. For D – F, the results are shown for the example S2R3 slice in the main analysis.
Supplementary information
Supplementary Information
The file contains Supplementary Figs. 1–74, Tables 1–12 and Notes 1–17. The table of contents is displayed on page 1, and all the other contents are displayed on page 2 of the Supplementary Information file.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Ma, Y., Zhou, X. Accurate and efficient integrative reference-informed spatial domain detection for spatial transcriptomics. Nat Methods 21, 1231–1244 (2024). https://doi.org/10.1038/s41592-024-02284-9
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1038/s41592-024-02284-9


