Introduction

Recently available detailed maps of physical networks provide an opportunity to systematically investigate the relationship between their shape and their network properties. For example, open-data large-scale experimental projects provide a neuron-level mapping of biological neural networks1,2, high-throughput MRI measurements map out detailed vascular networks3, or mycelia mapping projects explore local morphological and mechanical properties of fungi by representing them as networks of filaments4. The nodes and links of these physical networks have two distinguishing features: (1) they are characterized by a complex three-dimensional shape and (2) they physically interact with each other, for example, nodes and links obey volume exclusion, i.e., they cannot overlap. To understand how physicality affects network evolution and function, we must extend the toolset of network science to take these features into account.

Recent work investigated artificial spatial embeddings of complex networks that obey volume exclusion5, the entanglement of physical links6, models of physical network growth7,8, and the effect of physical shape on the dynamics on networks8. However, systematic exploration of the three-dimensional shape, the network properties, and the relationship between them in real networks is still lacking. Such exploration is hampered by the lack of standardized representation. First, there is the technical difficulty that experimental maps of physical networks, like neural or molecular networks, are collected, processed, and analyzed with domain-specific methodology. Therefore, any investigation of physical networks must be preceded by the time-consuming and computationally burdensome task of data pre-processing. Second, even seemingly simple questions like what is a node and a link in a physical network carry a level of ambiguity: a physical network is a continuous object in space; to represent it as a network, we must discretize it into nodes and links. The definition of nodes and links in turn affects, for example, what properties of the network we can study or the right choice of null models.

Here, we compile and standardize 15 data sets from various domains. Each of these physical networks is composed of tube-like objects bound together at junction points; motivating us to treat the junction points as nodes and the tubes connecting them as physical links. We characterize both the physical shape and the abstract network structure and the correlations between them. For this, we calculate standard descriptors such as the degree distribution of the abstract network or the fractal dimension of the layout. We also introduce a measure of link confinement to understand the role of volume exclusion, which compares the physical links to a null model that randomizes link trajectories. The remainder of the paper is organized as follows: In the next section, we describe the data sets we collected and their standardization. In sections “Abstract network properties”–“Link confinement correlation profiles”, we analyze the data sets’ abstract network properties, their physical shape, and the emergent correlations between network and shape. Finally, section “Discussion” provides a brief discussion.

Figure 1
figure 1

Physical networks. (a) A physical network composed of three tube-like links bound together at a single junction point. (b) The skeleton representation of (a) approximates the original structure as a collection of vertices (black points) and edges (colored segments). (c) The combinatorial or abstract network of (a) captures the connectivity of the system without the physical structure: nodes represent junction points and terminal points, with a link between two nodes that are directly connected by a physical link. (d) The skeleton representation allows us to approximate the original volume of real physical networks—e.g., vascular network (top) and mitochondrial network (bottom). (e) We compiled a set of 15 physical networks from various domains. The size of the networks varies greatly: the number of skeleton segments \(N_{\text {seg}}\) capturing the shape of each network spans 4 orders of magnitude.

Data

Our goal is to systematically study the three-dimensional layout of physical networks and to understand the relationship between their physical properties and their network structure. For this, we collected 15 data sets from various domains, including individual neurons9,10,11,12, biological neural networks13, plant roots14, vascular networks3, a mitochondrial network15, and the imprint of an anthill (see SI S1.). Before any analysis, however, we must uniformly represent and standardize these data sets. In the following sections, we propose the use of a labeled skeleton representation, which efficiently captures both the physical shape and abstract network of physical networks.

Skeleton representation

Experimental imaging techniques that are used to capture the shape of physical networks, such as scanning electron microscopy or magnetic resonance imaging, typically output a three-dimensional image composed of voxels. Hence, this voxel representation is the most accurate description available of network layouts. A three-dimensional voxel image, however, is difficult to handle both computationally and analytically; therefore a more compact representation of the data is needed.

Physical networks are typically composed of tube-like objects bound together at junction points (Fig. 1a), making them suitable to be approximated by a series of straight segments and radii of the network at the endpoints of the segments (Fig. 1b). The process of creating these segments from a raw data format is called skeletonization16, often employed in the fields of computer graphics17 and neuroscience18,19. Formally, a skeleton representation \(\mathcal {S}\) of a physical network is a graph whose set of vertices \(\mathcal {V}\) correspond to points in space and set of edges \(\mathcal {E}\) correspond to segments connecting point pairs. Therefore, each vertex \(i\in \mathcal {V}\) has a set of coordinates \(\textbf{r}_i = (x_i,y_i,z_i)\) and a radius \(\rho _i\) associated to it. Figure 1b shows the skeleton of Fig. 1a.

The radius \(\rho _i\) captures the thickness of the physical network at each vertex i. Therefore, we can approximate the original occupied volume of a segment connecting vertex i with radius ρi and vertex j with ρj as a truncated cone, and the full volume of the network is approximated by the sum of these truncated cones. More specifically, for each edge (ij) in \(\mathcal {S}\) we add a truncated cone with axis \((\textbf{r}_i,\textbf{r}_j)\) and parallel faces with radii \(\rho _i\) and \(\rho _j\) which has volume:

$$\begin{aligned} V_{\text {seg}}(i,j) = \frac{1}{3} \cdot \pi \cdot (\rho _i^{2} + \rho _j^{2} + \rho _i\cdot \rho _j) \cdot |\textbf{r}_i -\textbf{r}_j|. \end{aligned}$$
(1)

Three-dimensional physical network data obtained from experiments is routinely skeletonized, and the skeleton of the network is published together with the raw data. When their original volume is approximated, they can be visualized as shown in Fig. 1d. In fact, all but one of the 15 data sets that we study here was skeletonized by the original authors, the only exception is the anthill imprint provided to us as a surface mesh which we skeletonized using the Tangent-ball20 algorithm from the Skeletor Python module19. The experimental setup and the choices made during the skeletonization may affect our analysis which is performed on the skeleton. For example, increasing the number of skeleton segments \(N_{\text {seg}}\) increases how well the skeleton approximates the original shape of the network. However, increasing \(N_{\text {seg}}\) also increases the size of the data set, hence increasing the computational burden of the analysis. To improve the uniformity of the data sets, we perform two pre-processing steps:

  1. 1.

    Merging segments: If two consecutive segments \((\textbf{r}_i,\textbf{r}_j)\) and \((\textbf{r}_j,\textbf{r}_k)\) are appear parallel to each other in the data set, we merge them into a single segment \((\textbf{r}_i,\textbf{r}_k)\) (see SI S2.2).

  2. 2.

    Skeleton healing: Due to noisy data, a skeleton may be disconnected even when it represents a single continuous object in reality. For example, a skeleton of a neuron may appear to have multiple components. To remedy this, we add a segment to connect the two nearest skeleton vertices from two disconnected components. We repeat this step until the skeleton becomes connected.

Following these pre-processing steps, the number of segments \(N_{\text {seg}}\) in a skeleton, as shown in Fig. 1e, spans three orders of magnitude from mitochondrial networks, which have approximately \(10^3\) segments, to fruit fly neural networks which have up to \(10^6\) segments.

Network structure

A physical network is a continuous object embedded in Euclidean space, to characterize this object as a network we must separate it into discrete nodes and links. For this, first note that all 15 data sets that we collected can be seen as a collection of tube-like objects bound together at junction points. Hence, we define the junction and terminal points of the tubes as physical nodes and the non-branching tubes pairwise connecting these terminal and junction points as physical links. A motivation for this definition is that cutting a physical link (i.e., a tube) at any point along its length causes the same disruption to the connectivity of the network.

More formally, for a skeleton representation \(\mathcal {S}\) of a physical system, we define each physical node to correspond to a skeleton vertex i with degree \(k(i)\ne 2\), and each physical link to corresponds to a path in the skeleton given by the ordered set \(\mathcal {T}(i_0,i_l)=[(i_0,i_1),(i_1,i_2),\ldots ,(i_{l-1},i_l)]\), such that \(k(i_0),k(i_l)\ne 2\) and \(k(i_j)=2\) for \(j=1,2,\ldots ,l-1\).

With the above definition of nodes and links, we can talk about the abstract or combinatorial network \(\mathcal {G}\) of the system which captures its connectivity without the physical structure. The skeleton \(\mathcal {S}\) is one possible physical realization of the abstract network \(\mathcal {G}\); however, there are many possible physical realizations of the same \(\mathcal {G}\). In general, we are interested in understanding the relationship between the physical layout captured by \(\mathcal {S}\) and the network structure captured by \(\mathcal {G}\).

As an example consider the physical network shown in Fig. 1a which consists of three tubes bound together at a single junction point. Its skeleton representation (Fig. 1b), therefore, has three vertices with degree 1 corresponding to the terminal points, one vertex with degree \(k=3\) corresponding to the junction point, and several vertices with degree \(k=2\) tracing the trajectory of the tubes. This means that the network consists of four physical nodes and three physical links, and its abstract network is a star (Fig. 1c).

Finally, note that for a given skeleton \(\mathcal {S}\), the above definition of physical nodes and links is not the only viable definition. For example, in a neural network, it is natural to treat a neuron as a physical node and synapses between them as links, as individual neurons can have complex three-dimensional shapes that can be represented by a skeleton itself. More generally, subgraphs of a large \(\mathcal {S}\) may represent functional units in a physical network and it can be useful to treat these functional units as physical nodes8. Note, however, that our definition of the abstract network \(\mathcal {G}\) provides the most detailed picture of the system’s connectivity, and other definitions can be thought of as coarse-grained versions of \(\mathcal {G}\).

With the skeleton representation and the definition of the abstract network at hand, we are in the position to start our analysis. In the following sections, we first explore the structure of the abstract networks of the 15 data sets, then we continue by characterizing their physical properties, and finally, we investigate the relation between the two.

Abstract network properties

The above definition of physical nodes and links allows us to explore the properties of the abstract networks capturing the connectivity of physical networks without their three-dimensional structure; we focus on the degree distribution and motif frequencies.

The nodes in our physical networks are terminal and junction points, meaning that by construction nodes cannot have degree \(k=2\), only degree \(k=1\) or \(k>2\). Figure 2a shows the degree distribution p(k) of all 15 networks, and our main observation is that most nodes have degree \(k=1\) or \(k=3\), and nodes with a larger degree are exceedingly rare, i.e., \(p(1)+p(3)\approx 1\). This means that junctions tend to be bifurcation points along the tube-like physical links making up the network. This is in line with previous empirical observations and theoretical predictions for neurons21,22 and transport networks23,24. The observed narrow degree distribution is in contrast with degree-heterogeneous networks typically in the focus of network science and should be accounted for by mathematical models of physical networks7.

In graph topological terms, 7 out of 15 collected physical networks are trees: the individual neurons, the anthill imprint, and the plant roots. Each bifurcation point with degree \(k=3\) in a tree creates one new leaf node with degree \(k=1\); therefore the fact that these networks are trees together with the observation that most junctions are bifurcation points, completely determines their degree distribution as \(p(1)\approx p(3)\approx 1/2\). The remainder of the networks contain cycles: The vascular networks have no terminal points, apart from a few appearing due to finite sample size; therefore are almost completely composed of \(k=3\) nodes. The mitochondrial network representing a network of molecular strands has the highest fraction of \(k>3\) nodes. Finally, the fruit fly neural networks represent a collection of individual neurons which are trees, bound together by synapses, and their degree distribution closely resembles that of trees.

To explore the local loop structure of the networks, we calculate the abundance of observed 4-node motifs. Namely, we focus on two motifs: the star motif and the 4-cycle. To quantify their abundance, we calculate their Z-score compared to their degree-preserving randomized counterparts:

$$\begin{aligned} z_\text {s/c} = \frac{n_\text {s/c}-\left\langle n_\text {s/c}\right\rangle }{\sigma _\text {s/c}}, \end{aligned}$$
(2)

where \(n_\text {s/c}\) is the number of occurrences of the star and 4-cycle motifs in the original networks, and the expected value \(\left\langle n_\text {s/c}\right\rangle\) and standard deviation \(\sigma _\text {s/c}\) is estimated by creating 200 independent randomizations25, while keeping the degree sequence fixed. Figure 2b shows the scatter plot of \(z_\text {s}\) and \(z_\text {c}\) for the 15 networks. As expected, in tree networks containing no cycles the star motif is slightly over-represented \(z_\text {s}>0\), and the cycle motif is slightly under-represented \(z_\text {s}<0\), while for networks containing cycles, we find the opposite. The highest abundance of 4-cycles is observed for the networks representing different brain regions of the fruit fly brain.

Based on the degree distributions and motif profile of the abstract networks, the physical networks fall into three broad categories: (1) topological trees (the individual neurons, the root systems, and the anthill tunnel imprint), (2) lattice-like networks that are characterized by a loopy structure and few terminal points (the vascular networks and the mitochondrial network), and (3) linked trees which are a collection of trees bound together by additional links (the fruit fly brain regions).

Figure 2
figure 2

Abstract network properties. (a) The abstract networks are composed of terminal and bifurcation points; therefore, their degree distribution is mostly concentrated on \(k=1\) and \(k=3\). More specifically, lattice-like networks, such as vascular and mitochondrial networks, are mostly made up of branching nodes (degree \(k=3\)). For the rest of the network, nodes with \(k=1\) and \(k=3\) are approximately evenly split, as expected for tree networks. (b) We calculate the z-scores of four-node star and cycle motifs of the original networks compared to random networks with the same degree distribution. Markers are the average of \(n_{\text {t}} = 200\) independent randomizations, and error bars representing the standard error of the mean are smaller than the marker size. Both axes are symlog axes, i.e., linear from − 1 to 1, log otherwise for both positive and negative values. The star motif is slightly over- and the cycle motif is slightly under-represented in tree networks (green markers), as expected. In contrast, stars are under and cycles are overrepresented in both lattices (blue markers) and linked trees (red markers). The high z-scores observed for the 4-cycle indicate a lattice-like structure.

Physical properties

In the previous section, we measured properties of the abstract network capturing the connectivity of the 15 physical networks. We continue our investigation focusing on the physical properties of the system: we characterize the three-dimensional shape of the system without considering the abstract network.

Space filling and fractal dimension

A fundamental property of a physical network is its space-filling, i.e., the amount of volume it occupies from the available space. We expect that networks that are tightly packed in space are strongly affected by physicality5,8,26, although the study of random physical networks built from straight links suggests that volume exclusion can play a significant role even for diminishing small space filling26. Many real physical networks have irregular shapes, hence much of their bounding box is unoccupied. Therefore, instead of measuring space-filling globally, we divide the axis-aligned bounding box of each physical network into rectangular boxes. We then measure the local space-filling in each box i as

$$\begin{aligned} \phi (i) =\frac{V_\text {occ}(i)}{V_\text {box}}, \end{aligned}$$
(3)

where \(V_\text {occ}(i)\) is the volume of the intersection of the network and box i and \(V_\text {box}\) is the volume of the rectangular box. The distribution of \(\phi (i)\) depends on the choice of \(V_\text {box}\); therefore, to ensure consistency, we set them separately for each data set such that every bounding box is split into a \(10\times 10\times 10\) grid of boxes (SI S4.1.). Figure 3a shows the distribution of \(\phi\) for all 15 physical networks, revealing that the physical networks fill out the space with mostly sparse regions and fewer denser regions. Therefore we expect that physicality will also affect the network structure unevenly: volume exclusion can limit the number and shape of links in dense regions.

To further characterize the shape of the networks, we calculate their box-counting fractal dimension \(D_{\text {f}}\), which compactly describes the space-filling of a physical object on multiple scales27,28,29 and is widely used to characterize the shape of complex biological systems29,30. Possible values of \(D_{\text {f}}\) for connected networks range between 1 and the embedding dimension \(D = 3\), Fig. 3b shows \(D_{\text {f}}\) for the 15 networks, we find that both lattices and linked trees have \(D_{\text {f}}\gtrapprox 2\), while trees are typically characterized by \(D_{\text {f}}\lessapprox 2\), except for the anthill imprint. The fractal scaling spans at least two orders of magnitude of length scales (see SI S4.2.), again pointing towards regions of high and low physical density at different resolution levels.

Figure 3
figure 3

Space-filling and fractal dimension. (a) The boxplots of local space-filling \(\phi\) show the coexistence of mostly sparse and fewer dense regions, where the vertical lines indicate the median, the boxes span the 25th-75th percentile range, and the whiskers extend to 1.5 times the interquartile range. (b) The fractal dimension of physical networks \(D_\text {f}\) ranges between 1 and the embedding dimension \(D=3\). Linked trees (red) and lattices (blue) have fractal dimension values between \(D_\text {f} \approx 2.0\) and \(D_\text {f} \approx 2.3\), while trees (green) have more variation, ranging from \(D_\text {f} \approx 1.5\) to \(D_\text {f} \approx 2.0\) (except for from the anthill imprint). We estimate \(D_\text {f}\) using the box-counting method, the error bars indicate the standard deviation of the local scaling estimates (see SI S4.2).

Link volume and shape

The distribution of space-filling and the fractal dimension characterize the shape of physical networks as a whole. In this section, we continue by quantifying the shape of individual links; we focus on their volume and their straightness.

The skeleton describing the three-dimensional shape of a physical network is composed of straight segments connecting pairs of skeleton vertices and a radius associated to each skeleton vertex, allowing us to approximate the volume belonging to a segment in Eq. (1) as a truncated cone. The total volume of a link (ij) is then:

$$\begin{aligned} V_\text {link}(i,j) =\frac{1}{V_\text {total}}\sum _{(v,w) \in (i,j)}{V_{\text {seg}}(v,w)}, \end{aligned}$$
(4)

where \(V_{\text {seg}}(v,w)\) is the volume of each segment tracing the link (ij). We normalize the link volume by the total volume of the network \(V_\text {total}\), setting the unit of measurement. We find that linked trees, or the fruit fly neural networks, have consistently high link volume heterogeneity, as their distributions span 6 to 8 orders of magnitude, which is higher compared to most lattices and trees (Fig.4a).

We also measure the aspect ratio of physical links \(a(i,j)=\rho _\text {link}(i,j)/l_\text {link}(i,j)\), where \(\rho _\text {link}\) is the average radius and \(l_\text {link}(i,j)\) is the length of link (ij). We find the largest average aspect ratios for one of the fruit fly networks and the anthill (\(\text {med}(a) \approx 0.3\)), while \(\text {med}(a)\) is substantially lower for other data sets (see SI S1.1). Overall, this confirms that physical links are elongated tube-like objects.

Since physical links are tube-like objects, we can capture most of their shape by characterizing their one-dimensional trajectory. Here, we calculate the deviation of the link trajectories from a straight line, quantifying how curved a link is. For this, we rely on a measure of straightness introduced originally in the context of geographical networks31, namely we calculate the complimentary straightness for each link

$$\begin{aligned} \bar{S}(i,j) = 1 - \frac{|\textbf{r}_i - \textbf{r}_j|}{l_{\text {link}}(i,j)}, \end{aligned}$$
(5)

where \(|\textbf{r}_i - \textbf{r}_j|\) is the Euclidean distance between nodes i and j and \(l_{\text {link}}(i,j)=\sum _{(v,w) \in (i,j)}|\textbf{r}_v - \textbf{r}_w|\) is the length of the physical link (ij). The complimentary straightness \(\bar{S}(i,j)\) is 0 if the physical link is straight and close to 1 if it follows a winding trajectory much longer than the straight path between the two points.

Calculating the median of complimentary link straightness distribution, \(\text {med}(\bar{S})\), reveals that links in all of the 15 physical networks tend to follow a trajectory close to a straight line: most networks have \(\text {med}(\bar{S})\approx 0.1\), meaning that the length of links is most often less than 10% longer than the optimal straight trajectory. Similarly to link volume heterogeneity, linked trees tend to cluster together and are among the networks with the straightest links with \(\text {med}(\bar{S}) = 0.05\). Reference26 introduced random linear physical networks, a minimal model that constructs a physical network from straight cylinders. The fact that we observed an abundance of straight or close-to-straight links lends support for using such linear physical network models to understand the role of physicality in real networks. Note, however, that although most links are close to straight, the distribution of \(\bar{S}\) is right-skewed as seen in Fig. 4, which points to a smaller fraction of links that significantly deviate from a straight trajectory. (see SI S3.).

Finally, we computed the correlations between link straightness S(ij) and the total link length \(l_\text {link}(i,j)\) and volume \(V_\text {link}(i,j)\) for each data set using Kendall’s rank correlation coefficient \(\tau\) 32. Figure 4b shows that for all networks, there is a positive rank correlation \(\tau >0\) between \(\bar{S}(i,j)\) and \(l_\text {link}(i,j)\), indicating that longer links tend to follow a more winding path. We also observe a positive correlation \(\tau >0\) between \(\bar{S}(i,j)\) and \(V_\text {link}(i,j)\), since longer links tend to have larger volume. The only exception to this is the fruit fly neural networks, for which \(\bar{S}(i,j)\) and \(V_\text {link}(i,j)\) are negatively correlated \(\tau <0\).

One possible cause contributing to the negative correlation is that fruit fly neural networks are composed of neurons that have large somata, which are represented in the data set as short, yet high-volume segments (see SI S1.7.).

Figure 4
figure 4

Link shape, length and volume. (a) Box plots of \(\bar{S}\) and \(V_{\text {link}}\), where the vertical lines indicate the median, the boxes span the 25th-75th percentile range, and the whiskers extend from the 0.1th to the 99.9th percentile. We find that the bulk of the distributions fall in the range between \(\text {med}(\bar{S}) =0\) and \(\text {med}(\bar{S})= 0.1\), indicating that the networks are mostly composed of close-to-straight links. Link volume \(V_{\text {link}}\) distributions span a wider range for linked trees and are the most narrow for lattices. (b) By computing Kendall rank correlation \(\tau\) between link complementary straightness \(\bar{S}\), link volume \(V_{\text {link}}\) and link trajectory length \(l_{\text {link}}\), we observe a consistent trend of \(\tau > 0\), meaning that longer and more voluminous links tend to have more winding paths. This trend is only reversed for the fruit fly networks, which have \(\tau < 0\) between \(\bar{S}\) and \(V_{\text {link}}\). This could be explained by neuron somatas, which are represented as high-volume, straight links composed of a small number of skeleton segments.

Link confinement

In the previous sections, we found that space-filling and link properties are heterogeneously distributed: most regions of space are sparse and most links are close-to-straight, yet there exists dense regions of the network and a small fraction of links follow paths that deviate from a straight line significantly. This suggests that volume exclusion or other repulsive physical interactions may also play an uneven role in shaping the network. To further investigate this hypothesis we devise a quantity that captures the confinement of a link by other components of a network.

A link (ij) following a trajectory \(\mathcal {T}(i,j)\) in a real physical network obeys volume exclusion: it does not overlap with other links. Our strategy to quantify the role of repulsive forces that may shape \(\mathcal {T}(i,j)\) is to calculate the number of overlaps with other links for a random ensemble of synthetic links that follow similar trajectories to \(\mathcal {T}(i,j)\). If the synthetic links typically overlap with many other links, the trajectory \(\mathcal {T}(i,j)\) is an outlier and must be shaped by forces not captured by the random ensemble.

The trajectory of a physical link (ij) in our data sets is given by the ordered set \(\mathcal {T}(i,j)\) of oriented three-dimensional segments. To generate the random trajectory \(\mathcal {T}_\text {r}(i,j)\), we shuffle the order of the segments while maintaining their orientation and length, creating a uniform random permutation of \(\mathcal {T}(i,j)\). The randomization preserves the endpoints and the total length of the link, but otherwise removes any correlation between subsequent segments (see SI S4.4.); therefore, the possible link trajectories \(\mathcal {T}_\text {r}(i,j)\) have the same complimentary straightness \(\bar{S}\) as the original link. Next, we estimate I(ijlk), the expected number of intersections between the randomized link \(\mathcal {T}_\text {r}(i,j)\) and a non-randomized link \(\mathcal {T}(k,l)\). To quantify the confinement of the link (ij), we sum up the expected number of intersections with other links:

$$\begin{aligned} C(i,j) = \sum _{k,l\ne i,j}I(i,j;l,k)+I(l,k;i,j), \end{aligned}$$
(6)

where the first term corresponds to intersections when link (ij) is randomized, and the second term corresponds to intersections when link (lk) is randomized. Note that the summation in Eq. (6) excludes links that share an endpoint with (ij). We do this to exclude trivial intersections from the count, since adjacent links (ij) and (jk) necessarily overlap at the junction point j even for non-randomized link trajectories.

The procedure of calculating C(ij) is illustrated by Fig. 5a: we start with a link (green) surrounded by two neighboring links (red and blue). The figure shows \(n_{\text {t}} = 2\) randomization trials of the green link: In trial one, the randomized link intersects the red link, but not the blue. In trial two, the randomization creates an intersection with the blue link; the two links, however, are adjacent (they share a junction point), hence the intersection is not counted. In this particular example, the contribution to the link confinement measure from the randomization of the green link will be:

$$\begin{aligned} I(\text {green},\text {red})&= \frac{0+1}{2} = 0.5 \end{aligned}$$
(7)
$$\begin{aligned} I(\text {green},\text {blue})&= \frac{0+0}{2} = 0 \end{aligned}$$
(8)

To complete the calculation of \(C(\text {green})\), we also need to randomize the red link to estimate \(I(\text {red},\text {green})\) in the same manner, while \(I(\text {blue},\text {green})=0\) by definition. Finally, the confinement of the green link is obtained by summing up the contributions, i.e., \(C(\text {green})=I(\text {green},\text {red})+I(\text {red},\text {green})\).

Collision detection between link trajectories is a computationally expensive task, in practice we randomize each link \(n_\text {t}=20\) times (and \(n_\text {t}= 5\) for the fruit_fly_1 network) and we rely on an efficient collision detection algorithm leveraging kd-trees33 (see SI S4.4. and SI S4.5. for details).

Figure 5b shows a large variation in link confinement C. Across all networks, physical links are characterized by \(C\approx 0\), indicating that these links are not affected by the physical proximity of other components of the network. However, we also find highly confined links with \(C > 10\) and even \(C > 100\) expected intersections, suggesting again that physicality tends to play a heterogeneous role in forming networks. In particular, the linked tree networks typically have heavy-tailed link confinement distributions (see SI S5.). In terms of absolute counts, the linked trees or the fruit fly neural networks show the highest values of link confinement. This can be explained by the fact that these networks are composed of multiple neurons, hence in these networks, we have more complete information about the physical environment of the links, compared to networks that describe single neurons.

Figure 5
figure 5

Link confinement. (a) To quantify volume exclusion at the link level, we introduce link confinement C, the expected number of intersections after randomizing a link’s trajectory. To estimate C of the green link, we randomize its trajectory twice. In trial #1, the green link intersects the red link, while in trial #2, it intersects the blue link. The blue link, however, shares an endpoint with the green link; therefore their intersection is not counted. (b) The distribution of C for each physical network. There are many links with link confinement values close or equal to \(C=0\), and typical values of link confinement are around \(C\sim 10\), and only for the fruit fly networks, there are links with link confinement values \(C > 100\), indicating highly confined links for the linked tree networks.

Link confinement correlation profiles

In the previous section, we defined the link confinement C as the expected number of intersections if a link would follow a random trajectory, allowing us to identify links whose trajectory is most affected by repulsive forces in the network. Here, we characterize the properties of such confined links by calculating the Kendall rank correlation \(\tau\) between the link confinement C and other link properties. Specifically, we focus on the (1) physical properties, complementary straightness \(\bar{S}\) and link volume \(V_\text {link}\) (2) abstract network properties link betweenness \(B_\text {link}\) and link degree \(k_{\text {link}}(i,j)\), where the latter is defined as the sum of the degrees of the endpoints of link (ij).

Figure 6 shows the correlation profiles of all 15 networks. A persistent pattern we observe is the positive correlation between link confinement C and link volume \(V_\text {link}\). This is expected since larger links have more opportunities to intersect or be intersected by neighboring links. On the other hand, correlations between link confinement and straightness show a more curious pattern: we observe that most networks tend to have positive and significant correlations \(\tau\) between the link confinement C and link complementary straightness \(\bar{S}\), as expected, indicating that more winding links are also more confined. However, for linked trees (fruit fly neural networks) we find a negative \(\tau\) between C and \(\bar{S}\). To explain this recall that in the fruit fly neural networks we found a negative correlation between link volume and link length and a positive correlation between straightness \(\bar{S}\) (Sec. 4.2). This means that short links tend to be more confined due to their large volume, while also follow a straighter path.

For correlations between link confinement C and the abstract network properties, such as link betweenness \(B_\text {link}\) and degree \(k_\text {link}\), we find consistent and significant positive correlations for fruit fly neural networks. This indicates emergent correlations between the three-dimensional layout and abstract network properties of physical networks: more central links in the abstract network tend to be more confined in physical space. For lattice-like networks and trees, we find less consistent and weaker positive correlations. Overall, we are able to show that the abstract network structure and physical layout are intertwined for networks where we have sufficient information about the surrounding environment of the physical links, such as the fruit fly neural networks, which contain multiple neurons in close proximity with each other.

Figure 6
figure 6

Link confinement correlations. We compute the Kendall rank correlation \(\tau\) between the link confinement C and the physical and the abstract network properties of links, obtaining a correlation profile for each physical network. Across all datasets, except some trees, link volume \(V_{\text {link}}\) and link confinement C have a statistically significant (\(p<0.01\)) moderate positive relationship. We also find a significant association with complementary straightness \(\bar{S}\), which is positive for lattices and trees, and negative for linked trees, consistent with the correlations between \(V_\text {link}\) and \(\bar{S}\) (Fig. 4). For the linked trees, which are the data sets with the most complete information about the environment of the physical links, we find a significant positive association between the centrality of the links in the abstract network (link degree \(k_{\text {link}}\) and link betweenness \(B_{\text {link}}\)) and their link confinement C.

Discussion

Experimental data describing the three-dimensional shape of physical networks is increasingly becoming available, and the growth in the number and size of these data sets is expected to continue: connectome of the human brain consists of \(\approx 10^9\) neurons and fungal mycorrhizal networks are estimated to span \(\approx 10^{17}\) km in Earth’s soil34. The new data calls for extending the toolset of network science to analyze, model, and understand how the three-dimensional layout and physical interactions shape the structure and function of physical networks. Here, we contributed to this effort in three distinct ways: (1) We collected and standardized 15 data sets describing the three-dimensional layout of physical networks from diverse domains. (2) We characterized the structure of both the abstract network and physical layout of the 15 systems using descriptors such as the degree distribution and fractal dimension. (3) We introduced link confinement as a method to quantify how physical interactions shape link trajectories in physical networks, allowing us to investigate emergent correlations between physical and abstract network properties.

Our work may support future research on physical networks in several ways. First, we promote the use of labeled skeleton graphs to represent both the layout and the connectivity of physical networks. The skeleton captures the shape of the network, while the labeling identifies the physical objects corresponding the the nodes and links of the abstract network. Here, we focused on treating junction points in the skeleton as nodes and sequences of segments connecting them as links; however, the labeled skeletons are not limited to such interpretation. For example, sub-graphs representing larger functional units, such as neurons in the brain, can be identified as physical nodes.

Second, our results also inform theoretical models of physical network growth. Recent work that modeled physical nodes as spheres and links between them as tubes5,7. We found that most physical links follow close to straight trajectories, suggesting that linear physical network models where links are straight cylinders are indeed a useful class of models to understand physicality in networks. On the other hand, these physical network models generalize the classic Erdős-Rényi and Barabási-Albert models to physical space and thus do not restrict the node degree. We, however, found that junction points in real physical networks almost exclusively have degree three, a fact that must be accounted for by future models. Note that to obtain real physical networks with non-trivial degree distributions one must abandon identifying junction points as nodes, instead we must identify larger sub-graphs of the skeleton as physical nodes. In more formal terms, these networks can be modeled as a network-of-networks: we represent each physical node as a skeleton that has junction points with degree 3, and these physical sub-networks are bound together to form a network-of-networks with no restriction on the number of connections a sub-network can make with other sub-networks8. Future work may explore the relationship between the network-of-networks representation and the more fine-grained junction network representation.

Finally, we quantified the physical confinement of individual links by comparing the path that links follow to randomized trajectories, allowing us to identify correlations between physical and abstract network properties. In general, understanding the relationship between physical shape and abstract network structure is one of the key challenges of physical network research7,8,35. Future work may rely on other spatially randomized null models and abstract network measures to probe the relation between the two.

Our work is limited by the scope of the available data sets and computational constraints. First, our data sets do not contain information about the environment the networks are embedded in; therefore, we can only investigate interactions between the components of the network and not interactions between the networks and their surroundings. For example, we found the strongest relationship between link confinement and abstract network structure for the fruit fly neural network data sets and we found a weaker or no relationship for individual neurons. This is likely due to the fact that the fruit fly data sets contain multiple neurons, thus capturing more of the environment of individual physical links. Future work may consider more complete data sets as they become available or theoretical models of network growth could incorporate non-trivial environments.