NeST: nested hierarchical structure identification in spatial transcriptomic data

Walker, Benjamin L.; Nie, Qing

doi:10.1038/s41467-023-42343-x

Download PDF

Article
Open access
Published: 17 October 2023

NeST: nested hierarchical structure identification in spatial transcriptomic data

Nature Communications volume 14, Article number: 6554 (2023) Cite this article

6334 Accesses
6 Altmetric
Metrics details

Subjects

Abstract

Spatial gene expression in tissue is characterized by regions in which particular genes are enriched or depleted. Frequently, these regions contain nested inside them subregions with distinct expression patterns. Segmentation methods in spatial transcriptomic (ST) data extract disjoint regions maximizing similarity over the greatest number of genes, typically on a particular spatial scale, thus lacking the ability to find region-within-region structure. We present NeST, which extracts spatial structure through coexpression hotspots—regions exhibiting localized spatial coexpression of some set of genes. Coexpression hotspots identify structure on any spatial scale, over any possible subset of genes, and are highly explainable. NeST also performs spatial analysis of cell-cell interactions via ligand-receptor, identifying active areas de novo without restriction of cell type or other groupings, in both two and three dimensions. Through application on ST datasets of varying type and resolution, we demonstrate the ability of NeST to reveal a new level of biological structure.

Statistical analysis of spatial expression patterns for spatially resolved transcriptomic studies

Article 27 January 2020

Spatial transcriptomics at subspot resolution with BayesSpace

Article 03 June 2021

ClusterMap for multi-scale clustering analysis of spatial gene expression

Article Open access 08 October 2021

Introduction

Spatial transcriptomic (ST) data provides the ability to measure gene expression from cells in tissue while preserving spatial information, allowing insight into the spatial structure of tissue. A variety of ST data collection methods exist, varying in genome coverage, spatial resolution, and capture efficiency, including Visium¹, Slide-seq^2,3, and other alternatives^4,5 which mark locations using spatially-identified barcodes; and multiplexed in-situ hybridization (ISH) imaging based methods^{6,7,8,9,10,11}, which through single-molecule imaging measure gene expression levels at single-cell or subcellular resolution.

ST data can be used to understand how groups of cells work together in tissue to perform various biological functions. These groups can exist on drastically different scales: they may contain only a handful of cells, or thousands or millions; they may be groups of cells of the same cell type, or a mixture of multiple different cell types; they may be characterized by the shared expression of only one or a few genes, or thousands. Additionally, the organization of cells in tissue may exhibit a nested hierarchy, where a large structure or region of tissue contains subregions that themselves also have distinct biological meaning and characteristic gene expression patterns, such as structures in the brain built from a collection of internal layers^12,13. This region-within-region organization can also be viewed as the spatial analog of identifying subpopulations within a cell type in single-cell RNA-seq analysis, such as studied in immune cells^14,15, stem cells¹⁶, cancer cells^17,18,19, fibroblasts²⁰, and for cell activation²¹.

Current segmentation methods for ST data analysis divide cells or spots in the dataset into regions such as to maximize some measure of within-region similarity²². While this bears similarity to clustering in scRNA-seq data, the desire to obtain spatially coherent regions motivates inclusion of spatial information into the segmentation process. Approaches for this task include expectation-maximization with a Hidden Markov Random Field prior^23,24, fully Bayesian clustering with hyperresolution enhancement²⁵, and empirical Bayesian clustering²⁶. Alternatively, many methods utilize graph neural network (GNN) approaches. SpaGCN²⁷ integrates segmentation with identification of spatially variables genes. SEDR²⁸ jointly optimizes two autoencoders, one for expression and one for spatial information. SCAN-IT²⁹ uses a Deep Graph Infomax³⁰ (DGI) framework for embeddings. STAGATE³¹ applies graph attention (GAT) learning on a cell-type aware network. stMVC³² uses a multi-view semi-supervised GAT framework combining expression and histological information. SpaceFlow³³ combines a DGI framework with spatial regularization of the latent space. An alternative approach is considered in Multilayer³⁴, in which areas of enriched activity are computed independently for each gene before being combined into a segmentation. However, these methods share a common output data modality: a partition of cells into disjoint spatial regions, limiting the output to representing only one mode of spatial variation that covers the greatest possible number of genes. As a result, those methods are unable to capture multiscale region-within-region structure. In contrast, the hierarchical structure produced with standard agglomerative hierarchical clustering methods, created by repeatedly merging smaller clusters into larger, contains clusters that do not represent spatially localized regions, or fail to combine spatially adjacent and transcriptionally similar cells.

Additionally, most methods require tuning of the spatial scale at which regions are detected, such as by choosing a number of regions, creating challenges when the scale of structure is not known or varies in space. Non-segmentation approaches to analyzing spatial structure in expression include Node-centric Expression Modeling³⁵, which applies GNNs and variance attribution to identify spatial relationships in gene expression and cell communication, and DIALOGUE³⁶, which identifies multi-cellular programs, coordinated functional expression patterns dependent on cell type, but do not produce representations of structure in terms of contiguous spatial domains.

We introduce NeST, a method which identifies nested hierarchical structure in ST data through finding coexpression hotspots – representations of spatially localized areas that coexpress a collection of genes. Our method efficiently performs simultaneous searches for coexpression over every possible subset of genes and every spatially contiguous subset of spots, allowing it to operate at multiple scales in both space and number of genes while also capturing nested or overlapping structure in space and identify structures in tissue with no prior knowledge of relevant genes or spatial scale. By applying a spatial diffusion model, NeST is able to identify regions of tissue active in cell-cell interactions (CCI) without being constrained by fixed groupings of cells, such as by cell type. Through application to six ST datasets varying in modality and spatial resolution, we demonstrate the ability of NeST to uncover nested and multiscale biological structure, and to identify spatially localized CCI activity in both two and three dimensions. We further apply downstream analysis and visualization tools to show the localized areas or genes of particular interest, and to capture biological relationships and differences between spatial expression patterns of genes.

Results

Identifying nested, hierarchical structure with NeST

NeST is designed to work with ST datasets on any spatial resolution, covering anywhere from 100–20,000 genes, especially those with nested structure, which is illustrated with a Slideseq dataset of the hippocampus^2,3 (Fig. 1a). The hippocampal formation involves four main sub-regions: the CA1, CA2, CA3 regions, and the dentate gyrus (DG). Given an ST dataset, NeST identifies coexpression hotspots (CH), contiguous regions in which some subset of genes are highly expressed (Fig. 1b). Coexpression hotspots are scale-free and may contain arbitrarily few or many spots, and arbitrarily few or many genes, without requiring choice of a preferred spatial scale. Furthermore, as they can overlap, coexpression hotspots have the power to represent the nested structure or other overlapping structures that commonly occur in ST data.

**Fig. 1: NeST identifies nested, hierarchical structure in ST data through coexpression hotspot framework.**

NeST works by first computing single-gene hotspots, localized areas where a particular gene is enriched in expression, for each gene up to full transcriptome coverage. Hotspots are identified by binarizing gene expression using Otsu’s algorithm and applying DBscan clustering, producing a separate hotspot for each spatially-dense group of high-expression cells (Fig. 1c1). Then, a hotspot network is constructed, where each hotspot is a node and edges connect hotspots with a similar shape and location, computed using Jaccard similarity (Fig. 1c2). Finally, communities are extracted from this network, representing groups of highly similar single-gene hotspots, and each group is combined into a single coexpression hotspot (Fig. 1c3, see Methods for details). The largest coexpression hotspot in terms of number of spots/cells is labeled coexpression hotspot 0 (CH0) and further coexpression hotspots are numbered in decreasing order of size. In contrast to a segmentation, any single cell in NeST may be contained in one coexpression hotspot, multiple, or none at all.

By applying a spatial diffusion to expression of ligand genes and then combining with receptor gene expression, NeST computes spatial cell-cell interaction (CCI) hotspots, localized areas in space in which many cells are receiving the effect of a ligand-receptor interaction. Then, the same pipeline is applied to produce coexpression hotspots that relate functional CCI activity to gene activity localized in the same area, as well as perform differential expression and other downstream analysis of the identified functional regions. When three-dimensional spatial data across multiple layers is available, NeST applies a full 3D diffusion model and thereby can compute 3D CCI, including between different layers, producing fully 3D hotspots (Fig. 1d). NeST also contains a variety of downstream analysis tools designed to work with the hotspot framework, such as differential expression analysis and identification of marker genes and decomposition of single-cell expression in terms of coexpression patterns (Fig. 1d).

Nested hierarchical structure in the hippocampus

The coexpression hotspots identified by NeST accurately capture both the full hippocampal structure, as well as the four subregions within (Fig. 2a). The CA2 region is particularly difficult to identify due to its small size and similarity to CA3, but the coexpression framework allows it to be detected by NeST.

**Fig. 2: Coexpression hotspots capture nested hierarchical organization of hippocampus.**

NeST represents the hierarchical relationship among hotspots as a tree, with the smallest hotspot that contains over 75% of another hotspot as the parent of that hotspot. We identify hierarchical marker genes, differentially expressed genes in a region that are enriched relative to parent, sibling, and child coexpression hotspots in the hierarchical structure, but not necessarily relative to coexpression hotspots elsewhere in the tissue (see Methods for details). We find that many NeST marker genes agree with marker genes from a hippocampal marker gene database³⁷ (Fig. 2b), and all of the NeST marker genes clearly agree with the hierarchical structure (Supplementary Fig. 1a–e, genes for remaining hotspots in Supplementary Fig. 2a–n). We note that such marker genes may be restricted to a single region or expressed in multiple – for example, Neurod6 is expressed in all regions except the DG, and Ncald is expressed in all regions except CA1. Visualizing all marker genes in a heatmap, we see that while almost all CA3 genes have some expression in the CA2 region, the CA2 region has a number of exclusive marker genes (Fig. 2c), and the DG has the most distinct expression profile. Ultimately, the spatial localization inherent to coexpression hotspots filters out cells with similar expression but in a different location (c.f. original annotation of CA1/CA2/CA3 cells in Supplementary Fig. 1fg), allowing for downstream analysis free from spurious inclusion of such cells.

To better understand the presence of nested structure in this dataset, we introduce the nested structure plot, which shows all coexpression hotspots arranged in layers, representing successively finer-scale structure, showing the presence of two layers of structure in the hippocampus, and one layer elsewhere (Fig. 2d). Intuitively, this indicates the presence of up to two layers of overlapping coexpression hotspots within the hippocampal structure, and no overlap elsewhere (see Fig. 1b and all coexpression hotspots in Supplementary Fig. 3). Because both layers are biologically meaningful and clearly substantiated by marker genes, this underscores the value of the coexpression hotspot framework that is able to capture all layers of structure, over a simple segmentation.

Four-layer nested structure in human breast cancer dataset

We next consider a Visium dataset of human breast cancer tissue and compute coexpression hotspots, observing both a high degree of overlap between coexpression hotspots and histology, as well as many overlapping hotspots indicating nested structure (Fig. 3a). Indeed, it is found that there are up to four layers of nested hotspots representing successively finer structure (Fig. 3b). Each successive layer expresses additional cancer marker genes – considering top-1 markers, coexpression hotspot 0 (CH0) expresses DEGS2; CH2 expresses BRINP3; CH10 expresses SUSD3; and CH19 expresses LOXL2 (Supplementary Fig. 4a). This nested structure, however, is not captured by a segmentation (Fig. 3c). This dataset contains three nested structures: one with four layers, one with three, and one with two (Fig. 2d). Each of these represents areas of tumor tissue containing subregions expressing additional genes, which cannot be represented under a segmentation framework. We also remark that the tree of nested groups produced by a hierarchical agglomerative clustering contains many clusters that are not spatially localized, as well as divisions of homogeneous regions into further subclusters (Supplementary Fig. 6).

**Fig. 3: NeST simultaneously identifies four layers of hierarchical organization in breast cancer tissue.**

Many regions of the tissue are not in any coexpression hotspot (see Fig. 3a) – this is because some regions do not contain groups of genes exhibiting similar spatial expression. To quantify this, we construct the spatial coherence score statistic, a normalized metric representing how many genes have spatially coherent expression patterns at each point in space (see Methods for details), showing a clear alignment with the known histology (Fig. 3e, see histology Supplementary Fig. 4b).

This tissue contains a tertiary lymphoid structure (TLS) which is of biological interest, and previous studies have used domain-specific knowledge to identify the presence of TLS^38,39,40,41. However, by using the unique expression metric (Fig. 3f, see Methods), which identifies coexpression hotspots whose expressed genes are very different than the rest of the tissue, clearly highlights a single location – that of the TLS. This is because the TLS has a distinctive expression profile that is not present elsewhere in the sample, unlike the tumors which share similar expression across much of the tissue. Visualizations of marker genes for the corresponding coexpression hotspot confirm the presence of highly individual expression in this area including known TLS marker genes (Fig. 3g). Because NeST analyzes spatial coexpression across the entire genome, no prior knowledge of which genes are likely to be relevant is required.

Benchmarking and validation

We compare NeST to two segmentation methods, HMRF²³ and SpaGCN²⁷, over a range of numbers of regions, on their ability to identify the top two layers of structure in the upper left tumor region, consisting of one outer region and three inner regions (Supplementary Fig. 5a, marker genes in Supplementary Fig. 5b). The structures can also be seen in the histology plot (Supplementary Fig. 4b). NeST produces a higher Jaccard score on the outer structure, indicating a better match with this structure, along with superior to equivalent performance on the inner structure depending on number of regions (Fig. 4a). When comparing the ratio between the size (number of spots contained) between the largest coexpression hotspot and the smallest, we see it is an order of magnitude higher than the ratio between largest and smallest regions for the HMRF and SpaGCN segmentation methods, over a wide range of numbers of regions (Fig. 4b). We also perform a similar comparison on the Slideseq dataset from Fig. 2, excluding SpaGCN due to performance, and similarly see a transition between detecting outer and inner structure at a setting of 25 regions (Supplementary Fig. 7). However, the HMRF method does not identify the CA2 region that NeST finds.

We perform comparison between the coexpression hotspots and those computed with random subsets of the total set of genes, and then for each coexpression hotspot in the full dataset, we compute the average Jaccard similarity with the best-match hotspot over 10 realizations of random subsets, both for subsets of 50% of the total genes (Fig. 4c) and 80% (Fig. 4d). We observe that almost all of the hotspots are not significantly affected by the removal of 20% of the genes, with a high average score in Fig. 4d. In the case that 50% of the genes are removed, all hippocampus hotspots except the very small CA2 are effectively preserved, but a number of hotspots towards the edge of the domain are largely lost. This provides a quantification of the sensitivity of each individual hotspot to the data.

In order to measure the specific effect that NeST tuning parameters have on the result, we constructed a synthetic five-layer hierarchical dataset by recursively dividing half of the domain into a new region (see correct coexpression hotspots in Fig. 4e), with 2048 total genes of which a certain fraction contained spatial information, divided evenly over the regions. We varied each of the four main tuning parameters over a range, computed coexpression hotspots, and then compared those to the ground truth (Fig. 4f, see Methods for details). We observe that with as few as 64 out of 2048 spatial genes, NeST was able to effectively identify the regions in this dataset, whereas segmentation methods required a much higher number of spatial genes to identify the structure (Supplementary Fig 8). Additionally, the critical values for the parameters at which the output stopped agreeing with the ground truth was largely insensitive to the signal-to-noise ratio, validating our approach of setting standard default values for these parameters.

We now consider a Visium dataset of the anterior mouse cortex, computing all coexpression hotspots with NeST as well as segmentations using several methods: HMRF²³, BayesSpace²⁵, and SpaGCN²⁷, all of which are configured to identify a particular number of regions. By computing the overlap between segmentations and coexpression hotspots 0, 2, 6, 8, 16, and 20 (all of which are meaningful, substantiated by marker genes seen in Supplementary Fig. 9ab, Supplementary Fig. 10), the segmentation methods best identify large coexpression hotspots CH0 and CH2 when set to a coarse spatial scale (low number of regions), and small structures such as CH16 and CH20 when set to a fine spatial scale (large number of regions) (Fig. 4g), as expected. However, NeST is able to identify these structures on widely different spatial scales at the same time. Additionally, through hotspot decomposition, NeST is able to represent the overall expression pattern of genes in terms of coexpression hotspots, showing what parts of a gene’s expression can be explained by spatial coexpression shared with other genes (Supplementary Fig. 9c).

We also illustrate an example of the HMRF segmentation where CH0 and CH2, have been subdivided (Fig. 4h). By visualizing the spot expression non-spatially as a UMAP plot, the divisions are also separated in expression space (Fig. 4i). Furthermore, in the case of CH2, which the HMRF segmentation divides into two regions, visualization of differentially expressed genes between the two regions shows that the expression patterns of the DE genes do not agree with the segmentation boundary (Fig. 4j). Thus searching for structure on a particular spatial scale may lead to large structures being unnecessarily subdivided. Conversely, at the scales at which the large structures are found, spatially small structures like CH16 and CH20 (coexpressing 120 and 994 genes respectively) are missed.

NeST similarity maps identify related but spatially distinct structures

Given a coexpression hotspot representing a particular structure, NeST similarity maps show which areas of tissue also express a large fraction of those genes. Computing the similarity map for coexpression hotspot 8 (CH8), we see that the area of CH6 also lights up (Fig. 5a), indicating that the two coexpression hotspots have similar expression patterns and represent related structure. Inspection of the shared genes between CH6 and CH8 reveals marker genes such as Olig1 as well as genes known to be involved in the myelination process⁴² (Fig. 5b), suggesting that the CH6/CH8 structure is enriched in oligodendrocytes and may be involved in myelination processes.

**Fig. 5: NeST similarity maps identify repeated and unique structures.**

Taking advantage of the fact that NeST represents these structures distinctly, we search for any possible differences between them. Performing DE analysis, we identify Ccn2 as highly enriched in CH6 relative to CH8 (Fig. 5c). Visualizing the spatial gene expression, we observe that indeed Ccn2 is expressed only in CH6, but not in CH8 (Fig. 5d). Ccn2 has been tied to regulation of myelination^43,44, and so this suggests that Ccn2 may modulate a difference in myelination behavior between the CH6 and CH8 regions. This observation also holds in a second dataset taken from another slice of the same anterior cortex (Supplementary Fig. 11a), validating its consistency. However, because the CH6 and CH8 regions are very similar, they are typically identified as the same region by segmentation methods, if they are identified at all (see Supplementary Fig. 11b). This highlights the importance of distinguishing between spatially disjoint regions with extremely similar expression, as they may still have notable differences.

We also demonstrate the ability of NeST similarity maps to compare expression patterns across different samples. Taking a Visium dataset containing both a control and a disease (dextran sodium sulfate-induced colitis) sample of intestinal tissue, we test whether NeST can identify structures present in the disease condition but not the control condition. By taking the average similarity from the reference (disease) structure across the control-sample similarity map, we quantify how similar a particular coexpression hotspot is to a different sample. In the case that this average similarity is high, we identify shared structures across both datasets (Fig. 5e). In the case that it is low, we identify structures unique to the disease dataset (Fig. 5f). Above we introduced the uniqueness score, which identifies gene expression patterns localized to only one single area in a dataset. In contrast to this, the inter-sample comparison shown here identified patterns present, in any amount, in one dataset but not the other. This allows us to find differential patterns whether they are present only in one subsection of the sample or repeatedly across it.

We also show NeST analysis on a time sequence of developing mouse embryos⁴⁵, ranging from E9.5 to E16.5 in one-day intervals with one embryo per day (see Methods for details on datapoint selection). For coexpression hotspot 0 in the final datapoint (E16.5), representing the brain, we compute similarity maps over all seven previous datapoints, showing where in the earlier embryos similar genes were expressed (Fig. 5g). We show examples for CH1 through CH9 in Supplementary Fig. 12a–i, including other organs such as the liver (Supplementary Fig. 12a), heart (Supplementary Fig. 12b), and lung (Supplementary Fig. 12f), as well as examples of specific genes (Supplementary Fig. 13ab, 14, 15).

Finally, we show similarity maps for the Visium breast cancer dataset from Fig. 3. For a coexpression hotspot such as CH0, a top-level hotspot in the upper left hierarchical tumor structure, we see similarity across much of the tumor tissue in the dataset (Fig. 5h). In contrast, for the TLS, there is very low similarity anywhere else in the tissue (Fig. 5i), consistent with its identification as highly unique in Fig. 3h. We can also compute a dendrogram from the pairwise similarity values. Compared to the nested structure shown in Fig. 3 which shows the spatial relationships of hotspots, this shows the transcriptional similarity, and identifies that the tumor tissue can be split into two large groups (Fig. 5j, k). By combining spatially localized coexpression hotspots with similarity analysis, NeST simultaneously captures both spatial and transcriptional relationships between distinct structures in tissue.

Spatial localization of cell–cell signaling within cell types

We next use NeST to show that spatial localization regions enriched in CCI differs significantly from cell type boundaries in a developing mouse embryo⁹. We remark that this dataset also contains nested structure visible in the brain region, with coexpression hotspots identifying both the full brain and the forebrain, midbrain, and hindbrain regions (Supplementary Fig. 16a, b). This structure is also not effectively identified by HMRF segmentation (Supplementary Fig. 16c). Just as coexpression hotspots freely identify gene coexpression where it occurs, not constrained to a single layer segmentation, CCI hotspots identify CCI activity where it occurs and are not constrained to any preset partition, such as by cell type.

Taking as example the Dll1-Notch1 ligand-receptor interaction, known to play a critical role in development⁴⁶, NeST identifies CCI hotspots based on a spatial diffusion model (Fig. 6a) to determine the level of ligand each cell is exposed to, and combining this with the receptor distribution (Fig. 6b) to determine an overall activity score (Fig. 6c). Then, similarly to the single-gene case, density-based clustering extracts hotspots in which many active cells are clustered together (see Methods for details). Note that in spatial areas where either the ligand or receptor is absent (circled regions in Fig. 6a–f), NeST shows the lack of CCI activity, whereas computing CCI only between or within cell types cannot make this sub-cell-type scale distinction.

**Fig. 6: NeST CCI hotspots localize regions of tissue active in CCI at single-cell, sub-cell type resolution.**

The robustness of CCI hotspots is confirmed by the very high similarity in active target regions for the Notch1-Dll1 and Notch1-Dll3 interactions, as well as interactions from other pathways such as Fgf15-Fgfr1 (all interactions in Supplementary Fig. 17a). This is further reinforced by considering related functional genes such as Lfng^47,48, which is also observed to expressed in a similar spatial pattern (Supplementary Fig. 17b). This suggests biological significance of the four enriched regions, which appear as coexpression hotspots CH6, CH7, CH10, and CH18 (see Supplementary Fig. 9d for all coexpression hotspots). We refer to these four locations as Notch-enriched coexpression hotspots. Notch-enriched hotspots are found to have a higher coherence score than surrounding areas, indicating the presence of genes expressed specifically in these areas (Supplementary Fig. 17c). However, only CH18 lines up cleanly with cell type boundaries (Supplementary Fig. 17d) expressed in the presomitic mesoderm. CH7 and CH10 are subregions within the brain, corresponding to the hindbrain and part of the forebrain, and CH6 is a subregion within the spinal cord. In order to identify possible downstream effects or spatial correlations with Notch signaling, we compare cells that are targets of Notch signaling with other cells of the same cell type that are not targets of Notch signaling. Specifically, we first perform differential expression analysis between spinal cord cells that are contained within the spinal cord Notch-active coexpression hotspot CH6 and spinal cord cells not within CH6. We see enrichment of a number of genes in CH6 cells, including FGF pathway genes such as Sfrp1 and Fgfr3, and a number of Hox genes, which do show expression specific to the CH6 area (Fig. 6g, Supplementary Fig. 17e). Similar analysis within the brain, comparing cells within brain Notch-active coexpression hotspots CH7 and CH10 to cells outside, shows significant DE between active and nonactive areas, with several CCI-related genes such as Fgfr3 and Lfng highly enriched (Fig. 6h). Comparing CH7 cells with CH10 cells, the two Notch-active subregions within the brain, we see most notable differences in Otx2, known to be expressed in the forebrain^49,50, and Sfrp1, known to be expressed towards the hindbrain⁵¹ (Fig. 6h). Visualizing DE across all four Notch-active coexpression hotspots simultaneously, CH18 is observed to have the most distinct expression pattern (Fig. 6i), consistent with its identity as the one Notch-active hotspot specific a unique cell type.

Under the hypothesis that CCI activity lines up with cell types, we expect the fraction of overlap between CCI hotspots and cells of a given type to be close to either 0 (not active) or 1 (active). However, only presomitic mesoderm, neural-mesodermal progenitors (NMP), and lateral plate mesoderm exhibit high CCI coverage, defined as the fraction of cells of that cell type that are contained within a CCI hotspot (Fig. 6j). The Dll1-Notch1 CCI hotspots which exhibit highest overlap with NMP do not appear to form an NMP-specific structure (Supplementary Fig. 17f), but the Wnt5a-Fzd2 interaction is widely expressed in a pattern specific to lateral plate mesoderm cells (Fig. 6k), which is consistent with prior study⁵². Overall, in most cases, CCI activity is heterogeneous within cell types, challenging the standard approach of computing CCI on a cell-type by cell-type basis.

3D NeST identifies Cck communication between layers and Tac signaling in behavior-associated regions in merFISH dataset

Finally, we use NeST to analyze three-dimensional spatial data using a merFISH dataset of the mouse cortex⁶ containing approximately 74,000 cells over 12 distinct z-slices, each separated by a distance of 50 μm. The CCI inference proceeds similarly to above, however with a 3D diffusion model that allows for ligands to diffuse between different layers in the z-axis (Fig. 7a, see Methods for details). In this model, cells expressing the receptor may be activated by ligand expression by cells in other slices. We highlight a group of Cck expressing cells on a single layer surrounded by other cells expressing the Cckbr receptor, in which case a number of target cells on adjacent layers are identified through the 3D diffusion model (Fig. 7b) – inter-slice communication that could not have been detected through 2D analysis (Fig. 7c). Furthermore, the source cells are annotated as Ambiguous, and the target cells as primarily Excitatory and Inhibitory (Fig. 7d, Supplementary Fig. 18a). All of these labels are spatially distributed through the entire region, so the spatial nature of this communication link would not be detectable through the typical cell-type-based analysis.

**Fig. 7: NeST computes CCI hotspots fully in three dimensions revealing inter-layer communications and functional regions associated with social behavior in 3D mouse cortex.**

To better understand the three-dimensional structure of this dataset we compute three-dimensional regions (Fig. 7e, Supplementary Fig. 19, see Methods for details) as well as three-dimensional hotspots for cell–cell interactions (examples in Supplementary Fig. 18b–e). We illustrate the ability of NeST to find biologically meaningful functional regions by highlighting the case of the Tac1-Tacr1 interaction in the top four slices (Fig. 7f). CCI hotspots allow us to distinguish between CCI active cells in different areas of the tissue, and so we zoom into the topmost slice for further comparison (Fig. 7g). When we perform non-spatial CCI analysis using CellChat⁵³, considering only the cell types, we observe that interaction is predicted even in areas of the tissue without ligand or receptor expression (Fig. 7h, c.f. ligand and receptor expression in Fig. 7i, j), underscoring the importance of correctly using spatial information when computing cell–cell interactions. We observe some genes, such as Avpr1a and Chat (Fig. 7k, l), appear to be enriched in the upper bilateral hotspots, and so we call these Chat+ hotspots. Comparing the two Chat+ hotspots to all other Tac1-Tacr1 hotspots (in 3D), we can clearly see the difference in expression. In order to understand the role of these particular hotspots, we perform GO term analysis, finding enrichment of terms related to behavior, such as GO:0002118, aggressive behavior, and GO:0035176, social behavior, as well as many terms related to blood pressure due to the presence of Avpr1a (Fig. 7n). As Avpr1a, Chat, and Oxtr have been linked to behavior, we hypothesize that these cells represent a functional region in which interactions of Tac signaling and several other genes modulate behavior. When viewing the prevalence of different ligand-receptor interactions across z-slices, we see that there is significant heterogeneity between z-slices, with some ligand-receptor interactions enriched in lower, middle, or upper slices, further reinforcing the importance of capturing the full 3D behavior of cell-cell interactions (Fig. 7o).

Discussion

Through its ability to identify nested, hierarchical, and multiscale structure in ST data, NeST represents an important next step in the method development of ST data analyses. NeST is released as a Python package and interfaces with the standard Anndata format⁵⁴ to allow easy application to new datasets. NeST is highly scalable; for example, computing coexpression hotspots on the Slideseq dataset with full transcriptome coverage and over 40,000 beads can be done in minutes on a standard laptop.

NeST allows for the identification of hierarchical structure as well as other spatially organized gene coexpression, and it contains a wide range of associated visualization tools in order to reveal the hierarchical structures in ST data, as well as compare spatial expression patterns within and between data samples. Beyond this, NeST leverages the unique nature of coexpression hotspots compared to traditional segmentations in order to allow for analyses such as spatial hierarchical marker genes, differential expression analysis between similar but spatially distinct structures, and functional analysis of single-cell resolution cell-cell interactions. NeST thus fulfills a previously unmet need in the analysis of spatial structure from ST data.

NeST is not a replacement for segmentation methods, but rather a new analysis offering additional tools to explore spatial gene expression patterns that have nested structure. The coexpression hotspot framework represents multiple layers of structure simultaneously, allowing analysis impossible with a segmentation; conversely, the segmentation approach captures the most dominant mode of spatial variation which can improve performance in some cases where structure is not clearly visible in any single gene.

One limitation of NeST in identifying coexpression hotspots is the initial step of computing single-gene hotspots. NeST relies on binarization for computation of single-gene hotspots, which could hide certain types of structures such as boundary regions with gradients in gene expression. This task bears a strong resemblance to the heavily-studied task of image segmentation, and more sophisticated processing such as incorporating convolutional neural network models could improve identification of single-gene hotspots. This could address a limitation in the DBscan-based clustering, which is not effectively able to find very thin layers whose width is less than the parameter ϵ. Additionally, computing single-gene hotspots while preserving continuous nature of gene expression could allow for our method for identifying coexpression to be extended to gradients, such as those found in developing embryos⁵⁵ or the brain⁵⁶. An iterative method in which single-gene hotspots are refined based on information from tentatively computed coexpression hotspots may also increase performance through improved sharing of information across genes. Another avenue for improvement would be developing notions of significance, such as through statistical testing, that assess how unlikely a particular coexpression hotspot would arise through chance, which would further increase the ability of NeST to identify and highlight the most important spatial structures in a dataset. Furthermore, future work could seek to expand our notion of hierarchical marker genes and more extensively investigate the process of computing differential expression between overlapping groups of cells.

As spatial transcriptomic technologies continue to evolve, capturing more genes with greater efficiency and resolution, there is an ever-greater need for computational methods able to identify structure at multiple scales, filter out areas of increased interest, and substantiate the biological significance of identified spatial structure. The ability of NeST to identify multiscale, multilayer, explainable structure will open many new doors in the development of ST methodology and analysis of ST datasets.

Methods

Preprocessing

NeST is designed to be applied directly to full-transcriptome data and therefore no filtering of highly-variable genes, etc. is performed. Expression data is normalized and logarithmized before further analysis. For the Slideseq dataset, an additional spatial smoothing step is performed due to the high degree of spatial noise, in which the expression level at each bead is replaced by the average of the 20^th and 80^th quantiles of beads within a smoothing radius of 30 μm.

Single-gene hotspots

We define a single-gene hotspot as a set of cells in a connected, localized subregion of space in which a particular gene is highly expressed. The first step is to binarize the data, which we perform using Otsu’s algorithm⁵⁷, which divides the cells/spots into two groups such as to minimize the variance in expression within each group. Once the binarization has been performed, the locations of cells above the threshold are extracted, producing a set of two-dimensional points. We then apply DBscan density-based clustering⁵⁸ to this set, which first identifies core points, those for which at least min_samples other points exist with a radius ${{{{{\rm{\epsilon }}}}}}$. Then, all core points within radius ${{{{{\rm{\epsilon }}}}}}$ are connected, and this produces the single-gene hotspots. The DBscan clustering is applied separately for each gene, but this process is not computationally intensive even without parallelization, being able to compute hotspots for all genes in a typical Visium dataset in under a minute. Optionally, after computing the hotspots, an $\alpha$-shape⁵⁹ boundary can be drawn enclosing the spots in the hotspot, and then the hotspot can be replaced with the set of all spots within the boundary. This means that the Jaccard similarity between hotspots (referenced below) corresponds to exactly the overlap in area between the hotspots. $\alpha$-shapes⁵⁹ are a generalization of convex hulls such that $\alpha=0$ corresponds to the convex hull, and progressively larger values of $\alpha$ tighten the boundary, such that it becomes concave and more closely surrounds the points. The shape becomes undefined for sufficiently large values of $\alpha$, and NeST uses a bisection algorithm to automatically select a large but valid value for $\alpha$. However, this comes at the cost of sparsity, and is not computationally tractable on single-cell resolution datasets.

Note that the computation of single-gene hotspots also serves as a filter for spatially-variable genes, as many genes whose expression does not follow a spatial pattern do not have sufficiently localized expression to have any single-gene hotspots identified, and therefore are filtered out from subsequent analysis.

Coexpression hotspots

After computing all single-gene hotspots, we compute a similarity score between all possible pairs of hotspots. Here, we use the Jaccard similarity, which for two hotspots ${{{{{{\rm{H}}}}}}}_{{{{{{\rm{i}}}}}}}$ and ${{{{{{\rm{H}}}}}}}_{{{{{{\rm{j}}}}}}}$ is computed as

$${S}_{{ij}}=\frac{\left|{H}_{i}\cap {H}_{j}\right|}{\left|{H}_{i}\cup {H}_{j}\right|}$$

(1)

The Jaccard similarity, being uniformly 0 for non-overlapping hotspots, leads to a sparse similarity matrix and can be efficiently computed even for very large numbers of hotspots, such as those arising from full-transcriptome gene hotspot computation. It can be rewritten as

$${S}_{{ij}}=\frac{\left|{H}_{i}\cap {H}_{j}\right|}{\left|{H}_{i}\right|+\left|{H}_{j}\right|-\left|{H}_{i}\cap {H}_{j}\right|}$$

(2)

meaning the only pairwise relationship required is the (generally sparse) overlap matrix

$${O}_{{ij}}=\left|{H}_{i}\cap {H}_{j}\right|$$

(3)

which represents the number of elements present in both hotspots. Instead of directly computing the pairwise overlap by iterating over every possible pair of hotspots, we iterate over every element (cell or spot in the ST dataset), identify the set of all hotspots containing that element, and then tally one overlap between every pair of hotspots in that set. As a result, non-overlapping hotspots do not consume any computation time. We combine this with a parallelized spatial chunking algorithm in order to maintain tractability even over full-transcriptome coverage, in which the dataset is divided into a rectangular grid (size does not affect results but we generally use 10 × 10) and the overlap counts are tallied separately in parallel for each grid square and then combined together.

The entries of the Jaccard similarity matrix over a user-defined threshold value are used as the weighted adjacency matrix defining the hotspot similarity network. Here, we use a threshold of 0.6 for Visium datasets and 0.3 for single-cell datasets (the lower threshold value due to the increased sparsity in single-cell datasets). Finally, we identify communities in the network using the Leiden algorithm⁶⁰. Any community with more than min_genes single-gene hotspots is used to create a coexpression hotspot. Given a group of single-gene hotspots, the corresponding coexpression hotspot is defined as the set of all spots/cells contained in over a certain fraction of constituent single-gene hotspots. Here, we use a value of 30%. The representation of the coexpression hotspot preserves reference to its constituent single-gene hotspots, so the set of genes being coexpressed can be utilized in downstream analysis.

Hierarchical marker genes

In order to identify marker genes for nested hierarchical structures, NeST performs differential expression (DE) analysis using the two-sided Wilcoxon rank-sum test (also known as the Mann-Whitney U test) along with Benjamini-Hochberg FDR correction. For a given coexpression hotspot, those genes that are positively expressed at a significant level (here we use p < 0.001) over all parent, sibling, and child coexpression hotspots in the hierarchical structure plot are labeled as hierarchical marker genes for that coexpression hotspot. This can be further illustrated by using the NeST similarity map feature. Given a coexpression hotspot representing a particular structure, similarity maps show which areas of tissue also express a large fraction of those genes. encoding which hotspots are children of (i.e. contained within) other hotspots is computed by labeling any hotspot for which over 75% is contained within another hotspot as a child of that hotspot. In the case that there are multiple levels of nested structure, the parent of a hotspot is the smallest hotspot (i.e. the next level up) that contains at least 75% of its spots. Additionally, In the case that structure is not hierarchical, NeST can also compute marker genes over any user-provided set of coexpression hotspots using the same procedure.

Coexpression hotspot decomposition

Once coexpression hotspots are computed, the expression of individual genes can be decomposed in terms of coexpression hotspots. NeST includes a procedure for identifying a subset of the identified coexpression hotspots that best matches a set of single-gene hotspots. For a particular gene, we define the match score as the number of spots in the single-gene hotspot that are also in the coexpression hotspot, minus the number of spots in the coexpression hotspot that are not in the single-gene hotspot. Letting ${\left\{{H}_{i}\right\}}_{i=1}^{N}$ be the set of $N$ single-gene hotspots for one particular gene and $H={\cup }_{i=1}^{N}{H}_{i}$, then we the match score for coexpression hotspot j is given as:

$$M{S}_{j}=\left|H\cap C{H}_{j}\right|-\left|\bar{H}\cap C{H}_{j}\right|$$

(4)

We take the hotspot $C{H}_{k}$ that maximizes this, $k={argma}{x}_{j}{M}{S}_{j}$, add it to the decomposition, and then update H to reflect only the spots that are not covered by the decomposition:

$$H\leftarrow H-C{H}_{k}$$

(5)

where ${-}$ denotes set subtraction. This process is repeated, adding more coexpression hotspots to the decomposition, until no coexpression hotspot has a positive match score.

Spatial coherence and unique expression score

We consider gene expression to be spatially coherent when many genes are expressed in the same subregion of tissue, with similar boundaries to the region of expression. We define a spatial coherence score to identify which areas of tissue exhibit the highest coherence, calculated by taking the subset of all single-gene hotspots that are a member of a coexpression hotspot and counting the number of such hotspots the spot or cell is contained in. Then, the score is normalized to range from 0 to 1 over all spots. In this way, areas in tissue with a large number of cells that are contained by many very similar hotspots have a higher spatial coherence score, but since the score is computed in terms of not the coexpression hotspot but rather its constituent single gene hotspots, the spatial coherence score varies more smoothly than simply looking at coexpression hotspots. The unique expression score is computed by the same procedure, except the subset of single-gene hotspots is restricted to those hotspots that are a member of a coexpression hotspot, and no other hotspot of the same gene is a member of a different coexpression hotspot. This means we are identifying those genes that only have spatially coherent expression in one specific area of tissue, as opposed to genes that exhibit a repeated structure and are expressed around the tissue (see Fig. 3e, f).

Cell–cell interaction hotspots

In this manuscript we perform CCI using the curated set of ligand-receptor interactions from Cellchat⁵³. Specifically, we make use of the ligand gene symbol, receptor gene symbol, pathway name, and type annotation. We consider ligand-receptor pairs with annotations of “Secreted signaling” and “Cell-cell contact” in the database. Our method identifies all ligand/receptor pairs in the database for which both genes are present in the dataset and proceeds to perform analysis on those interactions.

Our method applies two ligand transport models: a diffusion-based model for secreted signaling interactions, and a neighbor-based model for cell-cell contact interactions. For each model, we construct a matrix ${A}_{{ij}}$ that represents the fraction of expressed ligand transported from cell i to cell j. In the diffusion model, the ligand expression of a cell is distributed to all neighbors within a certain cutoff distance $\epsilon$, which we select to be 100 μm, expressed in the same spatial units as the data. The diffusion kernel is chosen to have a standard deviation of half the cutoff. The cell-cell contact matrix is constructed by taking the Delauney triangulation (where an edge between a pair of cells indicates that no other cell lies between them) and removing all edges over a certain threshold, which we take to be 20 μm. As a first-order correction for cell size, the transport matrices are normalized to have a row sum of 1. For each cell, we apply the ligand transfer model to spread its ligand expression over nearby cells, and then follow the procedure of CellChat⁵³, log-normalizing expression and then further normalizing to a maximum value of 1, and computing for each cell the product of the receptor expression and the transported ligand expression combined with a Hill function to determine the cell-level CCI activity. Letting ${L}_{i}^{c}$ and ${R}_{j}^{c}$ be the expression of ligand i and receptor j respectively for cell c, the activity is computed as:

$${{{{{\rm{activity}}}}}}=\frac{{L}_{i}^{c}{R}_{j}^{c}}{{K}_{h}+{L}_{i}^{c}{R}_{j}^{c}}$$

(6)

We generally take ${K}_{h}$ as $0.5$, but as the Hill function is monotonic the output of the permutation tests described below are invariant to the choice of ${K}_{h}$.

In order to identify cells which exhibit a high level of activity, we perform permutation tests, computing ${N}_{{perm}}$ random permutations of the activity values. In each permutation, the gene expression vectors of each cell are shuffled across cells (applying the same permutation to each gene), while keeping spatial position the same. Then, the ligand transport model is applied to the shuffled expression and activation scores are computed for each permutation. We construct a distribution of null-hypothesis values by combining activation scores across all cells and all permutations. For a significance level $\alpha$, the significance cutoff for that interaction will be chosen as the $1-\alpha$ quantile of the set of permuted activity scores. We then compute the binarized activation by testing the expression level of each cell against the cutoff for that interaction, and computation of hotspots then proceeds identically to gene expression hotspots as described above.

Three-dimensional CCI analysis

For 3D datasets such as the merFISH dataset, the CCI can be run in 3D by providing an additional input representing the z coordinate value of each cell. The ligand diffusion model then uses 3D Euclidean distance, combined over all layers, instead of 2D Euclidean distance. Additionally, when performing permutation tests for significance, cells are only permuted within the same layer (cells with the same z-value). CCI hotspots are first computed individually for each layer, as described above, and then are matched across layers. To do this, we identify a nearest-cell-matching across each pair of adjacent layers using linear sum assignment with the cost set to the squared Euclidean distance. We then create a network, where each active cell (over all layers) is a node, and edges are drawn between each cell and its k-nearest neighbors (k = 20) in the same layer, as well as its matched neighbor in each adjacent layer. Intra-layer edges are weighted by the distance between cells i and j (in the same layer) as

$${w}_{{ij}}^{{{{{{\rm{intra}}}}}}}={e}^{-0.04{d}_{{ij}}}$$

(7)

where 0.04 is a constant chosen based on the distance units of this dataset. Inter-layer edges between cells i and j (in adjacent layers) are weighted by a factor $0.001{\alpha }^{2}$, where $\alpha$ is the number of k-nearest neighbors of cell i whose inter-layer matched neighbor is one of the k-nearest neighbors of j. This places greater weight on cells for which the inter-layer matching is spatially consistent. Then, communities are identified in this combined multi-layer network using the Leiden algorithm, using a RB vertex partition with resolution parameter $\gamma=0.02$ for intra-layer edges and a CPM vertex with resolution parameter $0$ for inter-layer edges, as is a standard for modularity optimization on multi-layer networks. The clusters output by the Leiden algorithm are taken as the combined multi-layer CCI hotspots. This matching process is performed separately for each interaction.

Three-dimensional regions

To assist in visualization of three-dimensional structure in the merFISH dataset⁶ along with three-dimensional CCI analysis, we identify regions using the Leiden multilayer network communication detection algorithm⁶⁰. The network is constructed with both intra- and inter-layer edges, and intra-layer edges are derived from both spatial proximity and transcriptional similarity. For spatial proximity, the 20-nearest neighbor graph is used as the adjacency matrix, weighted by a factor of

$${A}_{{ij}}^{{{{{{\rm{space}}}}}}}={e}^{-\frac{{d}_{{ij}}}{0.04}}$$

(8)

where ${d}_{{ij}}$ is the distance between cells i and j and 0.04 is a weighting factor based on the scale of distances in the dataset, over all pairs i and j in the 20-nearest-neighbors graph. The expressional similarity matrix is given by

$${A}_{{ij}}^{{{{{{\rm{expr}}}}}}}={e}^{-\frac{\left|{z}_{i}-{z}_{j}\right|}{2}}$$

(9)

where ${z}_{i}$ is the 8-dimensional PCA embedding of the expression of cell i. Finally, the full intra-layer adjacency matrix is given by

$${{{{{{\bf{A}}}}}}}^{{{{{{\rm{inter}}}}}}}=\alpha {{{{{{\bf{A}}}}}}}^{{{{{{\rm{space}}}}}}}+\left(1-\alpha \right){{{{{{\bf{A}}}}}}}^{{{{{{\rm{expr}}}}}}}$$

(10)

where $\alpha$ reflects the relative weighting between spatial and transcriptional similarity, here taken as $\alpha=0.2$. The inter-layer edges are computing by performing a linear sum assignment between cells in every pair of adjacent layers under square-Euclidean cost. The communities labels are computed using the Leiden algorithm⁶⁰ using an RBConfigurationVertexPartition over intra-layer edges and a CPMVertexPartition with node_size = 0 over inter-layer edges.

Parameter choice

Here we describe the parameters of NeST that may vary by dataset or analysis, as well as recommendations on how to set them when performing analysis.

Alternative segmentation methods

The HMRF method was performed using a Python re-implementation of the HMRF algorithm included in NeST as nest.hmrf.HMRFSegmentationModel. For analysis, we used values $\beta=1$ and a k-nearest-neighbors graph with $k=6$. SpaGCN was run according to the tutorial values, running in the mode without histology image information, using k-means initialization and a maximum of 200 epochs although it reached convergence before the limit on tested datasets. A wrapper for SpaGCN replicating the analysis is available in nest as nest.methods.SpaGCN. BayesSpace was run using recommended values, with 2000 highly-variable genes and 16 principal components. A wrapper for BayesSpace using rpy2 replicating the analysis is available in nest as nest.methods.BayesSpace.

CellChat

The database of ligand-receptor interactions from CellChat was used in determining what interactions to form CCI hotspots from. The CellChatDB.mouse database was loaded, and was filtered to use only secreted signaling and cell-cell contact interactions. This list was used as the database for NeST CCI hotspots.

The CellChat analysis of CCI was run following the procedure described in the original publication⁵³, with the communication probabilities computed using a truncated mean of 0.05. The exact procedure can be accessed through the nest package through the nest.methods.CellChat class. The CellChat score referenced in Fig. 7h was computed by taking the CellChat output, which is a communication weight for each pair of cell type, and then setting each individual cell to the sum of incoming weight for its cell type.

Benchmarking

Due to the lack of ground truth annotation on the breast cancer dataset, the reference shown in Supplementary Fig. 3e was a manually curated combination of the models tested. Specifically, the HMRF and SpaGCN methods were each run twice, set for 4 and 15 regions, and then the output was separated into connected components to address the lack of localization in the segmentation methods, and the best-match label was manually identified for each of the four regions (outer and three inner). Then, combined with the best-match NeST coexpression hotspots, the reference was constructed as the set of spots assigned to that region by at least two of the three methods. We note that this is not meant to be taken directly as a ground truth, i.e. 1 being a perfect score, but rather a relative measure of how closely the output of the segmentation methods capture these structures compared to NeST.

Subsampling validation was performed by taking ten random subsets of the total set of genes in the datasets and independently computing NeST coexpression hotspots for each, and then these subsampled hotspots were compared to the original hotspots from the full dataset. For each of the original hotspots, and for each of the ten realizations, the Jaccard similarity to the best-match subsampled hotspot was computed. This was averaged over the ten realizations to produce the ultimate score. A score near one means that even with the subsampled set of genes, an identical coexpression hotspot is essentially always found, and a score near zero means that particular coexpression hotspot is no longer found with the reduced number of genes.

When benchmarking on the synthetical hierarchy data (see below for details on how the data is constructed), we take the default values for the four parameters from Table 1 and vary one of them away from the default at a time, and then compute coexpression hotspots. We perform a one-to-one matching via linear sum assignment between the computed coexpression hotspot and the ground truth (overlapping) regions, attempting to maximize the Jaccard similarity of matched regions. In the case that NeST identifies too many or too few coexpression hotspots, those unmatched hotspots are given a score of zero. The scores are averaged over all matched pairs and unmatched hotspots to produce the Jaccard score shown on the y-axis of Fig. 4f.

Table 1 List of NeST parameters

Full size table

Comparison between NeST and three segmentation methods: BayesSpace, HMRF, and SpaGCN, was performed on the mouse anterior cortex dataset by performing a series of segmentations with each method, varying the number of regions from 2 to 32, and then measuring the similarity between selected NeST coexpression hotspots that were observed to represent meaningful structure and the best-match region from each segmentation, as we vary the selected number of total regions in the segmentation. Here, we again remark that we are not assuming the NeST hotspot to be exactly the ground truth, but we have verified the significance of these structures by checking the expression of individual genes. A Jaccard similarity above approximately 0.75 should be considered a good match and therefore a success by the segmentation method in finding that structure.

Synthetic data

In order to capture both hierarchical structure and multiple spatial scales, we consider a spatial structure consisting of a series of layers, each of which contains a new region covering half of the old. In other words, layer 1 covers the whole region, layer 2 covers half of the region, layer 3 covers half of layer 2, etc. Here we used a 5-layer dataset with a total of 2048 genes. Of these 2048 genes, a selected fraction were marked as spatial genes, assigned to one of the five layers, and then were expressed more heavily in that region than outside. Non-spatial genes have a spatially uniform expression distribution. Expression was modeled using a zero-inflated Poisson distribution, and the dataset was log-normalized.

Stereo-seq dataset

The analysis shown on the Stereo-seq MOSTA dataset⁴⁵ consisted of one sample for each time point from E9.5 to E16.5, using the sample labeled E1S1 for each timepoint. $\epsilon$ values were scaled with the dataset as 0.02 times the length of the sample in the vertical direction. All samples used a density of 0.5, threshold of 0.3, and resolution of 1.0.

Statistics and reproducibility

No statistical method was used to predetermine sample size. When selecting subsets of samples from collections of datasets, samples were always selected by numerically lowest label. Otherwise, no data were excluded from the analyses. NeST requires no filtering of highly variable genes and can be directly applied to full-transcriptome data. The experiments were not randomized. The Investigators were not blinded to allocation during experiments and outcome assessment.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

All relevant data supporting the key findings of this study are available within the article and its Supplementary Information files. Visium 10x were accessed via SCANPY⁵⁴. Other datasets were used through the Squidpy package⁶¹. Necessary code to load all datasets and use them with NeST is available as part of the NeST package. Raw forms of transcriptomic datasets are also available from the original authors. The Visium 10x datasets used in this study are available in 10x Genomics database at https://support.10xgenomics.com/spatial-gene-expression/datasets. The Slide-seqV2 dataset used in this study is available in the Single Cell Portal database at https://singlecell.broadinstitute.org/single_cell/study/SCP815/highly-sensitive-spatial-transcriptomics-at-near-cellular-resolution-with-slide-seqv2. The SeqFISH dataset used in this study is available in the Spatial Mouse Atlas database at https://marionilab.cruk.cam.ac.uk/SpatialMouseAtlas/. The MERFISH dataset used in this study is available in Dryad at https://doi.org/10.5061/dryad.8t8s248. The intestine colitis Visium dataset used in this study is available in the GEO database under accession code GSE169749. The Stereo-seq data of mouse embryo development is available in the CNGB database under accession code CNP0001543 or at https://db.cngb.org/stomics/mosta/download/. The CellChat database of ligand-receptor interactions used in this study is part of the CellChat R library available at https://github.com/sqjin/CellChat. Source data are provided with this paper.

Code availability

NeST is available as a Python package and can be accessed at https://github.com/bwalker1/NeST⁶².

References

Ståhl, P. L. et al. Visualization and analysis of gene expression in tissue sections by spatial transcriptomics. Science 353, 78–82 (2016).
ADS PubMed Google Scholar
Rodriques, S. G. et al. Slide-seq: a scalable technology for measuring genome-wide expression at high spatial resolution. Science 363, 1463–1467 (2019).
CAS PubMed Central ADS PubMed Google Scholar
Stickels, R. R. et al. Highly sensitive spatial transcriptomics at near-cellular resolution with Slide-seqV2. Nat. Biotechnol. 39, 313–319 (2021).
CAS PubMed Google Scholar
Vickovic, S. et al. High-definition spatial transcriptomics for in situ tissue profiling. Nat. Methods 16, 987–990 (2019).
CAS PubMed Central PubMed Google Scholar
Cho, C.-S. et al. Seq-Scope: submicrometer-resolution spatial transcriptomics for single cell and subcellular studies. Biorxiv 2021.01.25.427807 https://doi.org/10.1101/2021.01.25.427807 (2021).
Moffitt, J. R. et al. Molecular, spatial and functional single-cell profiling of the hypothalamic preoptic region. Science 362, eaau5324 (2018).
PubMed Central ADS PubMed Google Scholar
Shah, S., Lubeck, E., Zhou, W. & Cai, L. In situ transcription profiling of single cells reveals spatial organization of cells in the mouse hippocampus. Neuron 92, 342–357 (2016).
CAS PubMed Central PubMed Google Scholar
Eng, C.-H. L. et al. Transcriptome-scale super-resolved imaging in tissues by RNA seqFISH+. Nature 568, 235–239 (2019).
CAS PubMed Central ADS PubMed Google Scholar
Lohoff, T. et al. Integration of spatial and single-cell transcriptomic data elucidates mouse organogenesis. Nat. Biotechnol. 1–12 https://doi.org/10.1038/s41587-021-01006-2 (2021).
Chen, K. H., Boettiger, A. N., Moffitt, J. R., Wang, S. & Zhuang, X. Spatially resolved, highly multiplexed RNA profiling in single cells. Science 348, aaa6090 (2015).
PubMed Central PubMed Google Scholar
Lubeck, E., Coskun, A. F., Zhiyentayev, T., Ahmad, M. & Cai, L. Single-cell in situ RNA profiling by sequential hybridization. Nat. Methods 11, 360–361 (2014).
CAS PubMed Central PubMed Google Scholar
French, L. & Pavlidis, P. Relationships between gene expression and brain wiring in the adult rodent brain. Plos Comput. Biol. 7, e1001049 (2011).
CAS PubMed Central ADS PubMed Google Scholar
Nowakowski, T. J. et al. Spatiotemporal gene expression trajectories reveal developmental hierarchies of the human cortex. Science 358, 1318–1323 (2017).
CAS PubMed Central ADS PubMed Google Scholar
Papalexi, E. & Satija, R. Single-cell RNA sequencing to explore immune cell heterogeneity. Nat. Rev. Immunol. 18, 35–45 (2018).
CAS PubMed Google Scholar
Li, W. V. & Li, J. J. An accurate and robust imputation method scImpute for single-cell RNA-seq data. Nat. Commun. 9, 997 (2018).
PubMed Central ADS PubMed Google Scholar
Nguyen, Q. H. et al. Single-cell RNA-seq of human induced pluripotent stem cells reveals cellular heterogeneity and cell state transitions between subpopulations. Genome Res. 28, 1053–1066 (2018).
CAS PubMed Central PubMed Google Scholar
Nguyen, Q. H. et al. Profiling human breast epithelial cells using single cell RNA sequencing identifies cell diversity. Nat. Commun. 9, 2028 (2018).
PubMed Central ADS PubMed Google Scholar
Peng, J. et al. Single-cell RNA-seq highlights intra-tumoral heterogeneity and malignant progression in pancreatic ductal adenocarcinoma. Cell Res. 29, 725–738 (2019).
CAS PubMed Central PubMed Google Scholar
Moncada, R. et al. Integrating microarray-based spatial transcriptomics and single-cell RNA-seq reveals tissue architecture in pancreatic ductal adenocarcinomas. Nat. Biotechnol. 38, 333–342 (2020).
CAS PubMed Google Scholar
Deng, C.-C. et al. Single-cell RNA-seq reveals fibroblast heterogeneity and increased mesenchymal fibroblasts in human fibrotic skin diseases. Nat. Commun. 12, 3709 (2021).
CAS PubMed Central ADS PubMed Google Scholar
Wu, Y. E., Pan, L., Zuo, Y., Li, X. & Hong, W. Detecting activated cell populations using single-cell RNA-seq. Neuron 96, 313–329.e6 (2017).
CAS PubMed Google Scholar
Walker, B. L., Cang, Z., Ren, H., Bourgain-Chang, E. & Nie, Q. Deciphering tissue structure and function using spatial transcriptomics. Commun. Biol. 5, 220 (2022).
PubMed Central PubMed Google Scholar
Zhu, Q., Shah, S., Dries, R., Cai, L. & Yuan, G.-C. Identification of spatially associated subpopulations by combining scRNAseq and sequential fluorescence in situ hybridization data. Nat. Biotechnol. 36, 1183–1190 (2018).
CAS Google Scholar
Dries, R. et al. Giotto: a toolbox for integrative analysis and visualization of spatial expression data. Genome Biol. 22, 78 (2021).
CAS PubMed Central PubMed Google Scholar
Zhao, E. et al. Spatial transcriptomics at subspot resolution with BayesSpace. Nat. Biotechnol. 1–10 https://doi.org/10.1038/s41587-021-00935-2 (2021).
Yang, Y. et al. SC-MEB: spatial clustering with hidden Markov random field using empirical Bayes. Brief. Bioinform. 23, bbab466 (2022).
PubMed Google Scholar
Hu, J. et al. SpaGCN: Integrating gene expression, spatial location and histology to identify spatial domains and spatially variable genes by graph convolutional network. Nat. Methods 18, 1342–1351 (2021).
PubMed Google Scholar
Fu, H. et al. Unsupervised Spatially Embedded Deep Representation of Spatial Transcriptomics. Biorxiv 2021.06.15.448542 https://doi.org/10.1101/2021.06.15.448542 (2021).
Cang, Z., Ning, X., Nie, A., Xu, M. & Zhang, J. SCAN-IT: Domain segmentation of spatial transcriptomics images by graph neural network. in British Machine Vision Conference 1–10 (2021).
Veličković, P. et al. Deep Graph Infomax. In International Conference on Learning Representations (2019).
Dong, K. & Zhang, S. Deciphering spatial domains from spatially resolved transcriptomics with an adaptive graph attention auto-encoder. Nat. Commun. 13, 1739 (2022).
CAS PubMed Central ADS PubMed Google Scholar
Zuo, C. et al. Elucidating tumor heterogeneity from spatially resolved transcriptomics data by multi-view graph collaborative learning. https://doi.org/10.21203/rs.3.rs-1287670/v1 (2022).
Ren, H., Walker, B. L., Cang, Z. & Nie, Q. Identifying multicellular spatiotemporal organization of cells with SpaceFlow. Nat. Commun. 13, 4076 (2022).
CAS PubMed Central ADS PubMed Google Scholar
Moehlin, J., Mollet, B., Colombo, B. M. & Mendoza-Parra, M. A. Inferring biologically relevant molecular tissue substructures by agglomerative clustering of digitized spatial transcriptomes with multilayer. Cell Syst. 12, 694–705.e3 (2021).
CAS PubMed Google Scholar
Fischer, D. S., Schaar, A. C. & Theis, F. J. Learning cell communication from spatial graphs of cells. Biorxiv 2021.07.11.451750 https://doi.org/10.1101/2021.07.11.451750 (2021).
Jerby-Arnon, L. & Regev, A. Dialogue maps multicellular programs in tissue from single-cell or spatial transcriptomics data. Nat. Biotechnol. 1–11 https://doi.org/10.1038/s41587-022-01288-0 (2022).
Cembrowski, M. S., Wang, L., Sugino, K., Shields, B. C. & Spruston, N. Hipposeq: a comprehensive RNA-seq database of gene expression in hippocampal principal neurons. Elife 5, e14997 (2016).
PubMed Central PubMed Google Scholar
Horeweg, N. et al. Tertiary lymphoid structures critical for prognosis in endometrial cancer patients. Nat. Commun. 13, 1373 (2022).
CAS PubMed Central ADS PubMed Google Scholar
Schumacher, T. N. & Thommen, D. S. Tertiary lymphoid structures in cancer. Science 375, eabf9419 (2022).
CAS PubMed Google Scholar
Sautès-Fridman, C., Petitprez, F., Calderaro, J. & Fridman, W. H. Tertiary lymphoid structures in the era of cancer immunotherapy. Nat. Rev. Cancer 19, 307–325 (2019).
PubMed Google Scholar
Andersson, A. et al. Spatial deconvolution of HER2-positive breast cancer delineates tumor-associated cell type interactions. Nat. Commun. 12, 6012 (2021).
CAS PubMed Central ADS PubMed Google Scholar
Thakurela, S. et al. The transcriptome of mouse central nervous system myelin. Sci. Rep.-uk 6, 25828 (2016).
CAS ADS Google Scholar
Gonzalez, D. & Brandan, E. CTGF/CCN2 from skeletal muscle to nervous system: impact on neurodegenerative diseases. Mol. Neurobiol. 56, 5911–5916 (2019).
CAS PubMed Google Scholar
Ercan, E. et al. Neuronal CTGF/CCN2 negatively regulates myelination in a mouse model of tuberous sclerosis complex. J. Exp. Med. 214, 681–697 (2017).
CAS PubMed Central PubMed Google Scholar
Chen, A. et al. Spatiotemporal transcriptomic atlas of mouse organogenesis using DNA nanoball-patterned arrays. Cell 185, 1777–1792.e21 (2022).
CAS PubMed Google Scholar
Bolós, V., Grego-Bessa, J. & de la Pompa, J. L. Notch signaling in development and cancer. Endocr. Rev. 28, 339–363 (2007).
PubMed Google Scholar
Okubo, Y. et al. Lfng regulates the synchronized oscillation of the mouse segmentation clock via trans-repression of Notch signalling. Nat. Commun. 3, 1141 (2012).
ADS PubMed Google Scholar
Feller, J., Schneider, A., Schuster-Gossler, K. & Gossler, A. Noncyclic Notch activity in the presomitic mesoderm demonstrates uncoupling of somite compartmentalization and boundary formation. Gene Dev. 22, 2166–2171 (2008).
CAS PubMed Central PubMed Google Scholar
Kurokawa, D. et al. Regulation of Otx2 expression and its functions in mouse forebrain and midbrain. Development 131, 3319–3331 (2004).
CAS PubMed Google Scholar
Rhinn, M. et al. Sequential roles for Otx2 in visceral endoderm and neuroectoderm for forebrain and midbrain induction and specification. Development 125, 845–856 (1998).
CAS PubMed Google Scholar
Ellies, D. L., Church, V., Francis-West, P. & Lumsden, A. The WNT antagonist cSFRP2 modulates programmed cell death in the developing hindbrain. Development 127, 5285–5295 (2000).
CAS PubMed Google Scholar
Sweetman, D., Wagstaff, L., Cooper, O., Weijer, C. & Münsterberg, A. The migration of paraxial and lateral plate mesoderm cells emerging from the late primitive streak is controlled by different Wnt signals. Bmc Dev. Biol. 8, 63–63 (2008).
PubMed Central PubMed Google Scholar
Jin, S. et al. Inference and analysis of cell–cell communication using CellChat. Nat. Commun. 12, 1088 (2021).
CAS PubMed Central ADS PubMed Google Scholar
Wolf, F. A., Angerer, P. & Theis, F. J. SCANPY: large-scale single-cell gene expression data analysis. Genome Biol. 19, 15 (2018).
PubMed Central PubMed Google Scholar
Gurdon, J. B. & Bourillot, P.-Y. Morphogen gradient interpretation. Nature 413, 797–803 (2001).
CAS ADS PubMed Google Scholar
Lein, E. S. et al. Genome-wide atlas of gene expression in the adult mouse brain. Nature 445, 168–176 (2007).
CAS ADS PubMed Google Scholar
Otsu, N. A threshold selection method from gray-level histograms. IEEE Trans. Syst. Man Cyber. 9, 62–66 (1979).
Google Scholar
Xu, X., Ester, M., Kriegel, H.-P. & Sander, J. A distribution-based clustering algorithm for mining in large spatial databases. Proc 14th Int Conf Data Eng 324–331 https://doi.org/10.1109/icde.1998.655795 (1998).
Edelsbrunner, H., Kirkpatrick, D. & Seidel, R. On the shape of a set of points in the plane. IEEE Trans. Inf. Theory 29, 551–559 (1983).
MathSciNet MATH Google Scholar
Traag, V. A., Waltman, L. & van Eck, N. J. From Louvain to Leiden: guaranteeing well-connected communities. Sci. Rep.-uk 9, 5233 (2019).
CAS ADS Google Scholar
Palla, G. et al. Squidpy: a scalable framework for spatial omics analysis. Nat. Methods 19, 171–178 (2022).
CAS PubMed Central PubMed Google Scholar
Walker, B. L. & Nie, Q. Nested hierarchical structure in spatial transcriptomic data with NeST. NeST, https://doi.org/10.5281/zenodo.8339642 (2023).

Download references

Acknowledgements

The project was partly supported by the National Science Foundation grants DMS176372 and CBET2134916, the National Institutes of Health grants U01AR073159, R01DE030565, and R01AR079150, and a Simons Foundation Grant (594598), awarded to Q.N.

Author information

Authors and Affiliations

The NSF-Simons Center for Multiscale Cell Fate Research, University of California Irvine, Irvine, CA, 92627, USA
Benjamin L. Walker & Qing Nie
Department of Mathematics, University of California Irvine, Irvine, CA, 92627, USA
Benjamin L. Walker & Qing Nie
Department of Developmental and Cell Biology, University of California Irvine, Irvine, CA, 92627, USA
Qing Nie

Authors

Benjamin L. Walker
View author publications
You can also search for this author in PubMed Google Scholar
Qing Nie
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

B.W. and Q.N. conceived the project; B.W. implemented the algorithm and code and conducted data analysis. All the authors wrote and approved the manuscript; Q.N. supervised the research.

Corresponding author

Correspondence to Qing Nie.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks F. William Townes and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Reporting Summary

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Walker, B.L., Nie, Q. NeST: nested hierarchical structure identification in spatial transcriptomic data. Nat Commun 14, 6554 (2023). https://doi.org/10.1038/s41467-023-42343-x

Download citation

Received: 13 June 2023
Accepted: 06 October 2023
Published: 17 October 2023
DOI: https://doi.org/10.1038/s41467-023-42343-x

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.