Introduction

Investigating optimal network topologies is a relevant problem in several contexts, with applications ranging from transportation networks1,2,3,4 and communication systems5,6,7 to biology8,9 and ecology10,11,12. Depending on the objective function and the set of constraints of a routing optimization problem13, optimal network topologies can be determined by different processes, ranging from energy-minimizing tree-like structures that ensure steep descent through a landscape, as in river basins10, to the opposite scenario of loopy structures that favor robustness to fluctuations and damage, as in leaf venation12,14, the retinal vascular system15,16 or noise-cancelling networks7.

In many applications, optimal networks can arise from an underlying process defined on a continuous space rather than on a discrete network as in standard combinatorial routing optimization problems17,18,19,20. Optimal routing networks move resources while minimizing the transportation cost. This cost may be taken to be a function of the traveled distance, as in Steiner trees, or proportional to the dissipated energy, as in optimal channel networks or resistance networks. The common denominator of these configurations is that they have a tree-like shape, i.e., optimal routing networks are loopless1,21. Recent developments in the mathematical theory of optimal transport11,13 have proved that this is indeed the case and that complex fractal-like networks arise from branched optimal transport problems22. While the theory is starting to consolidate, efficient numerical methods are still at an early stage, in particular for branched transport, where only a few results exist23,24, reflecting the fact that these problems are NP-hard. Recent promising results25,26 map a computationally hard optimization problem into finding the long-time behavior of a system of dynamic partial differential equations, the so-called Dynamic Monge-Kantorovich (DMK) approach, which is instead numerically accessible, computationally efficient, and leads to network shapes that resemble optimal structures27. Working in discretized continuous space, and in many network-based discretizations such as lattice-like networks as well, requires threshold values for the identification of active network edges. The main consequence is that there might be no obvious final resulting network, an output that would be trivial when starting from a search space formed by predefined network structures.
For example, the output of a numerically discretized (by, e.g., the Finite Element method) routing optimization problem in a 2D space is a real-valued function on a set of (x, y) points defined on a grid or triangulation, which already has a graph structure. Despite the underlying graph, this grid function contains numerous side features, such as small loops and dangling vertices, that prevent the recognition of a clear optimal network structure. Obtaining one requires a suitable identification of vertices and edges that should capture the optimal network properties embedded in the underlying continuous space. In other words, the output of a routing optimization problem in continuous space carries unstructured information about optimality that is hard to interpret in terms of network properties. Extracting a network topology from this unstructured information would, on one hand, allow better interpretability of the solution and enable the comparison with networks resulting from discrete-space problems; on the other hand, it would enable the use of tools from network theory to investigate optimality properties, for instance to perform clustering or classification tasks based on a set of network features.

One can frame this problem as that of properly compressing the information contained in the “raw” solution of a routing problem in continuous space into an interpretable network structure while preserving the important properties connected to optimality. This is a challenging task, as compression might result in losing important information. The problem is made even more complex because one may not know in advance what the relevant properties for the problem at hand are, knowledge that could help drive the network extraction procedure. This is the case for any real network, where the intrinsic optimality principle is elusive and can only be speculated about by observing trajectories, an approach adopted, for instance, when processing images of biological networks28,29,30,31.

Several methods have been proposed to tackle domain-specific network extraction. These include segmentation techniques applied to a set of image pixels to extract a skeleton28,29,32 that is then converted into a network; a pipeline combining different segmentation algorithms built on OpenCV33, made available with an intuitive graphical interface34; graph-based techniques35 that sample junction points from input images; and methods that use deep convolutional neural networks36 or minimum-cost path computations37 to extract road networks from images. These approaches mainly rely on image processing techniques, as the input is an image or photograph, which is not necessarily related to a routing optimization problem. In this work, we propose a new approach for the extraction of network topologies and build a protocol to address this problem. It takes as input the numerical solution of a routing optimization problem in continuous space as described in25,26,27 and processes it to output the corresponding network topology as a weighted adjacency matrix. However, it can also be applied to more general inputs, such as images, which need not come from the solution of an explicit routing problem. Specifically, our work features a collection of numerical routines and graph algorithms designed to extract network structures that can then be properly analyzed in terms of their topological properties. The extraction pipeline consists of three main algorithmic steps: (i) compute the steady-state solutions of the DMK equations (DMK-Solver); (ii) extract the optimal network solution of the routing optimization problem (graph pre-extraction); (iii) filter the network by removing redundant structures (graph filtering).
While in this work we test and demonstrate our algorithm on routing scenarios coming from DMK, which constitute our main motivation, we remark that only the first step is specific to these; the last two steps are applicable beyond these settings. The graph pre-extraction step consists of a set of rules aiming at building a network from an input that is not explicitly a topological structure made of nodes and edges. The filtering step is based on a principled mathematical model inspired by that of the first step, which leads to an efficient algorithmic implementation. Our network filter has a nice interpretation in terms of a cost function that interpolates between an operating cost and an infrastructure cost, in contrast to common approaches used in image processing for filtering, which often rely on heuristics. Our numerical approach is based on finite element-like solvers that transform the problem into a finite sequence of linear systems whose dimension equals the number of nodes in the network. Thanks to a careful combination of efficient numerical solvers, the high computational efficiency of our implementation allows addressing large-scale problems that are out of reach for standard methods of combinatorial optimization. In addition, the algorithmic complexity of our approach is independent of the number of sources and sinks, unlike more standard methods based on Steiner tree solvers38,39.

A successful execution returns a representation of the result as an edge-weighted undirected network. The resulting weights are related to the optimal flow, the solution of the routing problem. Once the network is obtained, practitioners can deploy any available network analysis software40,41,42,43 or custom-written scripts to investigate properties of the optimal topologies. For instance, given that our model easily adapts to receive images as input, a promising application is that of extracting optimal network topologies from biological networks, in particular in systems that display a dynamic behavior of self-optimization, as recently found to be the case for neuronal networks44. Note that our optimal transport-based approach naturally calculates Wasserstein-type distances between discrete measures on the network. This can be used, like other geometric approaches in network analysis, to address different network-related applications, for example in geometry-based community detection algorithms45,46,47. While our primary goal is to provide a framework and tool to solve the research question of how to extract network topologies resulting from routing optimization problems in continuous space, or from any other image containing a network structure, we also aim at encouraging non-expert practitioners to automatically extract networks from such problems or from more general settings beyond them. Thus we make available an open-source algorithmic implementation and executables of this work at https://github.com/Danielaleite/Nextrout.

The routing optimization problem

In this section, we describe the main ideas and establish notation. We start by introducing the dynamical system of equations corresponding to the DMK routing optimization problem as proposed by Facca et al.25,26,27. In these works, the authors first generalize the discrete dynamics of the slime mold Physarum polycephalum (PP) to a continuous domain; they then conjecture that, like its discrete counterpart, its solution tends, as time goes to infinity, to an equilibrium point that is the solution of the Monge-Kantorovich optimal mass transport problem48.

We denote the space where the routing optimization problem is set as \({\varOmega }\in {\mathbb {R}}^{n}\), an open bounded domain that compactly contains \(f(x)=f^+(x)-f^-(x)\in {\mathbb {R}}\), the forcing function, describing the flow-generating sources \(f^+(x)\) and sinks \(f^-(x)\). It is assumed that the system is isolated, i.e., no fluxes enter or exit the domain through the boundary. This imposes the constraint \(\int _{\varOmega }f(x)dx = 0\) to ensure mass balance. It is assumed that the flow is governed by a transient Fick-Poiseuille type flux \(q=- \mu {\nabla }u\), where \(\mu (t,x)\) and \(u(t,x)\) are called the conductivity (or transport density) and the transport potential, respectively.

The continuous set dynamical Monge-Kantorovich (DMK) equations are given by:

$$\begin{aligned} -\nabla \cdot (\mu (t,x)\nabla u(t,x))&= {} f^+(x)-f^-(x) \,, \end{aligned}$$
(1)
$$\begin{aligned} \frac{\partial \mu (t,x)}{\partial t}&= {} \left[ \mu (t,x)|\nabla u(t,x)|\right] ^{\beta } - \mu (t,x) \,, \end{aligned}$$
(2)
$$\begin{aligned} \mu (0,x)&= {} \mu _0(x) > 0 \,, \end{aligned}$$
(3)

where \(\nabla =\nabla _{x}\). Equation (1) states the spatial balance of the Fick-Poiseuille flux and is complemented by no-flow Neumann boundary conditions; Eq. (2) enforces the system dynamics in analogy with the discrete PP model; and Eq. (3) provides the initial configuration of the system. The parameter \(\beta\) captures different routing transportation mechanisms. A value of \(\beta <1\) enforces optimal solutions that avoid traffic congestion; \(\beta = 1\) is shortest path-like; while \(\beta > 1\) encourages consolidating the flow so as to use a smaller amount of network-like infrastructure, and is related to branched transport11,49. Within a network-like interpretation, qualitatively, \(\mu (t,x)\) describes the capacity of the network edges. In a hydraulic interpretation, we can think of the edges as pipes, small cylindrical channels through which the mass passes, with capacity proportional to the pipe diameter. Thus, the initial distribution \(\mu _{0}(x)\) describes how the initial capacities are distributed.

In this work, solving the routing optimization problem consists of finding the steady-state solution \((\mu ^*, u^*):{\varOmega }\rightarrow {\mathbb {R}}_{\ge 0}\times {\mathbb {R}}\) of Eqs. (1)-(3), i.e. \((\mu ^*(x),u^*(x))=\lim _{t\rightarrow +\infty }(\mu (t,x),u(t,x))\). A numerical solution of the above model can be obtained by means of a double discretization in time and space25,26,27. The resulting solver (from now on called DMK-Solver) has been shown to be efficient, robust and capable of identifying the typically singular structures that arise from the original problem. Figure 1 shows visual examples of the numerical \(\mu ^*\) obtained for different values of \(\beta\). The same authors showed that the DMK-Solver is able to emulate the results of the discrete formulation of the PP model proposed by Tero et al.50

Figure 1

Different values of \(\beta\) in Eq. (2) lead to different settings of a routing optimization problem. Colors denote different intensities of conductivity \(\mu\) as described by the color bar on the left. (a) \(\beta < 1\) enforces avoiding mass congestion (\(\beta =0.75\)); (b) \(\beta = 1\) is shortest path-like, the mass goes straight from source to sink; (c) \(\beta > 1\) encourages traffic consolidation (\(\beta =1.2\)). The red rectangle denotes the sink, the green ones the four sources.

Under appropriate regularity assumptions, it can be shown26,27 that the equilibrium solution of the above problem \((\mu ^*(x),u^*(x))\) is a minimizer of the following functional:

$$\begin{aligned} {{\mathscr {L}}}(\mu ,u)=\frac{1}{2}\int _{{\varOmega }}\mu |\nabla u|^2 dx + \int _{{\varOmega }}\frac{\mu ^{P(\beta )}}{P(\beta )}dx \,, \end{aligned}$$
(4)

where \(P(\beta )=(2-\beta )/\beta\). In words, this functional is the sum of the total energy dissipated during transport (the first term is the Dirichlet energy corresponding to the solution of the first PDE) plus a nonlinear (sub-additive) function of the total capacity of the system at equilibrium. In terms of costs, this functional can be interpreted as the sum of the cost of transport, assumed proportional to the total dissipated energy, and the cost of building the transport infrastructure, assumed to be a nonlinear function (with exponent \(P(\beta )=(2-\beta )/\beta\)) of the transport capacities of the system.

We exploit the robustness of this numerical solver to extract the solutions of the DMK equations corresponding to various routing optimization problems. We focus here on the case \(\beta \ge 1\), where the approximate support of \(\mu ^*\) displays a network-like structure. This is the first step of our extraction pipeline, which we denote as DMK-Solver. The numerical solution of these equations does not allow for a straightforward network representation. Indeed, depending on various numerical details related to the spatial discretization and other parameters, one usually obtains a visually well-defined network structure (see Fig. 1) whose rendering as a graph object is, however, uncertain and non-unique. This in turn can hinder a proper investigation of the topological properties associated with routing optimization problems, motivating the main contribution of our work: a graph extraction pipeline to automatically and robustly extract network topologies from the output solutions of the DMK-Solver. We stress that our contribution is not limited to this application, but can also extract network-like shapes from any kind of image where a color or greyscale threshold can be used to identify the sought structure.

Our extraction pipeline then proceeds with two main steps: pre-extraction and graph filtering. The first tackles the problem of translating a solution from the continuous scenario into a graph structure, while the second addresses the problem of removing redundant graph structure left over from the previous step. A pseudo-code of the overall pipeline is provided in Algorithm 1, where mesh-related parameters specify how the mesh discretizing the space is built. Specifically, we specify ndiv, the number of divisions along the x axis, and nref, the number of refinements, i.e. the number of times each triangle of the grid generated for a given ndiv is subdivided into four triangles.
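To make the mesh-related parameters concrete, the following self-contained Python sketch shows one way ndiv and nref could generate such a triangulation, assuming a structured grid on the unit square; the function name and construction are illustrative and are not taken from the actual Nextrout code, whose mesh generator may differ.

```python
import numpy as np

def build_mesh(ndiv, nref=0):
    """Structured triangulation of the unit square (illustrative stand-in
    for the solver's mesh generator). ndiv: number of divisions along each
    axis; nref: number of uniform refinements, each splitting every triangle
    into four via edge midpoints, as described for Algorithm 1."""
    xs = np.linspace(0.0, 1.0, ndiv + 1)
    pts = [[x, y] for y in xs for x in xs]
    tri = []
    for j in range(ndiv):
        for i in range(ndiv):
            a = j * (ndiv + 1) + i            # lower-left corner of cell (i, j)
            b, c, d = a + 1, a + ndiv + 1, a + ndiv + 2
            tri += [(a, b, d), (a, d, c)]     # two triangles per grid cell

    for _ in range(nref):                     # uniform refinement passes
        midpoint = {}
        def mid(u, v):
            key = (min(u, v), max(u, v))
            if key not in midpoint:           # create each edge midpoint once
                midpoint[key] = len(pts)
                pts.append([(pts[u][0] + pts[v][0]) / 2,
                            (pts[u][1] + pts[v][1]) / 2])
            return midpoint[key]
        new_tri = []
        for (u, v, w) in tri:
            uv, vw, wu = mid(u, v), mid(v, w), mid(w, u)
            new_tri += [(u, uv, wu), (uv, v, vw), (wu, vw, w), (uv, vw, wu)]
        tri = new_tri
    return np.array(pts), np.array(tri)

points, triangles = build_mesh(ndiv=4, nref=1)
print(len(triangles))   # 2 * ndiv**2 * 4**nref = 128
```

Each refinement multiplies the triangle count by four, so ndiv fixes the coarse resolution and nref the local detail available to the solver.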

[Algorithm 1 (pseudo-code): the overall network extraction pipeline]

Our final goal is to translate the solution pair \((\mu ^*, u^*)\) into a proper network structure using several techniques from graph theory. With these networks at hand, a practitioner is then able to investigate topologies associated with this novel representation of routing optimization solutions.

Graph preliminary extraction

In this section, we expand on the graph pre-extraction step: extracting a network representation from the numerical solution output of the DMK-Solver. This involves a combination of numerical methods for discretizing the space and translating the values of \(\mu ^{*}\) and \(u^{*}\) into edge weights of an auxiliary network, which we denote as \(G=(V,E,W)\), where \(V\) is the set of nodes, \(E\) the set of edges and \(W\) the set of weights.

The DMK-Solver outputs the solution on a triangulation of the domain \({\varOmega }\) (here also called grid), denoted as \({\varDelta }_{\varOmega }=\{T_i\}_i\), with \(\cup _i T_{i}={\varOmega }\). The numerical solution, piecewise constant on each triangle \(T_{i}\), is assigned to the triangle barycenter (center of gravity) at position \(\mathbf{b }_{i}=(x_{i},y_{i})\in {\varOmega }\). Note that in this work we focus on a 2D space, but the procedure can be generalized to 3D. The result is thus a set of pairs \(\left\{ \left( \mu ^{*}(\mathbf{b }_{i}),u^{*}(\mathbf{b }_{i}) \right) \right\} _{i}\). We can track any function of these two quantities. For simplicity, we use \(\mu ^{*}\) (see Fig. 1 for various examples), but one could use \(u^{*}\) or a function of the two. This choice does not affect the procedure, although the resulting network might differ.

We neglect the information on triangles where the solution is smaller than a user-specified threshold \(\delta \in {\mathbb {R}}_{\ge 0}\), in order to work only with the most relevant information. Formally, we only keep the information on \(T_{i}\) such that \(\mu ^{*}(\mathbf{b }_{i})\ge \delta\). We observed empirically that in many cases several triangles contain a value of \(\mu ^{*}\) that is orders of magnitude smaller than the others; see, for instance, the scale of Fig. 1. Since we want to build a network connecting these barycenters, we remark that this procedure depends on the choice of the threshold \(\delta\): if \(\delta _1<\delta _2\), then \(G({\delta _2}) \subset G({\delta _1})\). On the one hand, the smaller \(\delta\), the more likely \(G\) is to be connected, but at the cost of containing many possibly loop-forming edges and nodes (the extreme case \(\delta =0\) uses the whole grid to build the final network); on the other hand, the larger \(\delta\), the smaller the final network (both in terms of the number of nodes and edges). Thus, one needs to tune \(\delta\) so that the resulting paths from sources to sinks are connected while avoiding the inclusion of redundant information.
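The thresholding rule and the nesting property \(G(\delta _2) \subset G(\delta _1)\) can be illustrated with a short sketch; the conductivity values below are synthetic (a lognormal sample spanning many orders of magnitude, mimicking the scale separation visible in Fig. 1), not actual DMK-Solver output.

```python
import numpy as np

# Synthetic conductivities spanning several orders of magnitude, mimicking
# the numerical values of mu* on triangle barycenters (illustrative data,
# not actual DMK-Solver output).
rng = np.random.default_rng(42)
mu_star = rng.lognormal(mean=-4.0, sigma=4.0, size=1000)

def relevant(mu, delta):
    """Indices i of the triangles kept by the rule mu*(b_i) >= delta."""
    return set(np.flatnonzero(mu >= delta))

d1, d2 = 1e-4, 1e-2
small, large = relevant(mu_star, d1), relevant(mu_star, d2)
# Nesting property: delta_1 < delta_2 implies the kept set for delta_2
# is a subset of the kept set for delta_1.
assert large <= small
print(len(small) - len(large), "triangles dropped by raising the threshold")
```

Raising \(\delta\) can only shrink the kept set, which is exactly the monotonicity that makes the threshold a one-dimensional tuning knob.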

The set of relevant triangles does not directly correspond to a meaningful network structure, i.e. a set of nodes and edges connecting neighboring nodes. In fact, we want to remove as much as possible the biases introduced by the underlying triangulation, and thus we start by connecting the triangle barycenters. For this, we need rules for defining nodes, edges and weights on the edges. Here, we propose three methods for defining the graph nodes and edges and two functions to assign the weights. The overall graph pre-extraction routine is given by choosing one of the former and one of the latter, and it can also be applied to more general inputs beyond solutions of the DMK-Solver.

Rules for selecting nodes and edges

Selecting \(V\) and \(E\) requires defining the neighborhood \(\sigma (T_{i})\) of a triangle in the original triangulation \({\varDelta }_{{\varOmega }}\) (for i such that \(\mu ^{*}(\mathbf{b }_{i})\ge \delta\)). We consider three different procedures:

  1. (I)

    Edge-or-node sharing: \(\sigma (T_{i})\) is the set of triangles that either share a grid edge or a grid node with \(T_i\).

  2. (II)

    Edge-only sharing: \(\sigma (T_{i})\) is the set of triangles that share a grid edge with \(T_i\). Note that \(|\sigma (T_i)|\le 3, \ \ \forall i\).

  3. (III)

    Original triangulation: let \(v,\,w,\,s\) be the grid nodes of \(T_i\); then add \(v,\,w,\,s\) to \(V\) and \((v,\,w),\,(w,\,s),\,(s,\,v)\) to \(E\). Note that in this case we make direct use of the graph associated to the triangulation and consider \(\sigma (T_{i})\) as in rule (II).

It is worth mentioning that since the grid \({\varDelta }_{\varOmega }\) is non-uniform and \(\mu ^{*}\) is not constant, we cannot control a priori the degree \(d_{i}\) of a node i in the graph \(G\) generated for a particular threshold \(\delta\). We give examples of networks resulting from these three definitions in Fig. 2 and a pseudo-code for them in Algorithm 2.
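Rules (I) and (II) can be sketched as follows, assuming triangles are given as vertex-index triples; the quadratic all-pairs loop is for clarity only and is not meant to reproduce the implementation of Algorithm 2, which may index triangles by shared vertices instead.

```python
from itertools import combinations

def neighbors(triangles, rule):
    """Triangle adjacency sigma(T_i). Rule "I": share a grid edge OR a grid
    node (at least one common vertex); rule "II": share a grid edge (exactly
    two common vertices), so each triangle has at most three neighbors."""
    adj = {i: set() for i in range(len(triangles))}
    for i, j in combinations(range(len(triangles)), 2):
        shared = len(set(triangles[i]) & set(triangles[j]))
        if (rule == "I" and shared >= 1) or (rule == "II" and shared == 2):
            adj[i].add(j)
            adj[j].add(i)
    return adj

# Toy triangulation given as vertex-index triples: triangles 0 and 1 share
# the edge (1, 2); triangle 2 touches triangle 0 only at vertex 0.
tris = [(0, 1, 2), (1, 2, 3), (0, 4, 5)]
print(neighbors(tris, "II"))   # {0: {1}, 1: {0}, 2: set()}
print(neighbors(tris, "I"))    # {0: {1, 2}, 1: {0}, 2: {0}}
```

The toy output shows why rule (I) yields denser graphs than rule (II): node sharing adds the contact at vertex 0 that edge sharing ignores.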

[Algorithm 2 (pseudo-code): rules for selecting nodes and edges]

Rules for selecting weights

The weights \(w_{ij}\) are assigned to edges \(e_{ij}:=(i,j) \in E\) by the function \(w(\mu (\mathbf{b }_{i}),\mu (\mathbf{b }_{j}))\), considering the density defined on the original triangles. We consider two possibilities for this function:

  1. (i)

    Average (AVG): \(w_{ij}= \frac{\mu (\mathbf{b }_{i})+\mu (\mathbf{b }_{j})}{2}\) .

  2. (ii)

    Effective reweighing (ER): \(w_{ij}= \frac{\mu (\mathbf{b }_{i})}{d_{i}}+\frac{\mu (\mathbf{b }_{j})}{d_{j}}\) .

While using the average as in (i) captures the intuition, it may overestimate the contribution of a triangle when it has more than one neighbor in \(G\), with the risk of producing a total density larger than the original output of the DMK-Solver. To avoid this issue, we consider the effective reweighing in (ii), where the contribution of each triangle is reweighted by the degree \(d_{i}=|\sigma _{i}|\) of node \(i\in V\), with \(\sigma _{i}\) the set of neighbors of i. This guarantees the recovery of the density obtained from the DMK-Solver, since \(\frac{1}{2} \sum _{i,j } w_{ij}= \frac{1}{2}\sum _{i }\left[ \mu (\mathbf{b }_{i}) + \sum _{j \in \sigma _{i}}\frac{\mu (\mathbf{b }_{j})}{d_{j}}\right] =\sum _{i} \mu (\mathbf{b }_{i})\), where the sum neglects isolated nodes, i.e. i s.t. \(d_{i}=0\). Note that when the original triangulation is used for node and edge selection (case (III) above), the ER rule does not apply; in that case, we use AVG, i.e. given an edge e, its weight is the average between its two neighboring triangles.
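Both weighting rules and the conservation identity above can be checked on a toy pre-extracted graph; the adjacency and conductivity values below are illustrative data, not solver output.

```python
import numpy as np

def edge_weights(mu, adj, rule="ER"):
    """Assign w_ij to edges (i, j). AVG: w_ij = (mu_i + mu_j) / 2.
    ER:  w_ij = mu_i / d_i + mu_j / d_j, with d_i the node degree."""
    deg = {i: len(adj[i]) for i in adj}
    w = {}
    for i in adj:
        for j in adj[i]:
            if i < j:   # undirected graph: store each edge once
                if rule == "AVG":
                    w[(i, j)] = (mu[i] + mu[j]) / 2
                else:
                    w[(i, j)] = mu[i] / deg[i] + mu[j] / deg[j]
    return w

# Toy pre-extracted graph: a path 0-1-2-3 with synthetic conductivities.
adj = {0: {1}, 1: {0, 2}, 2: {1, 3}, 3: {2}}
mu = np.array([1.0, 4.0, 2.0, 0.5])
w_er = edge_weights(mu, adj, rule="ER")
# ER recovers the total DMK density: with each undirected edge counted once,
# sum_e w_e equals sum_i mu_i (no isolated nodes here).
print(sum(w_er.values()), mu.sum())   # 7.5 7.5
```

On the same toy graph, AVG would assign \(w_{01}=2.5\) and sum to a total that need not match \(\sum_i \mu_i\), which is precisely the overestimation the ER rule corrects.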

Figure 2

Graph pre-extraction rules. Left: edge-or-node sharing (I); center: edge-only sharing (II); right: original triangulation (III). We monitor the conductivity \(\mu\) and use parameters \(\mu _0=1,\, \beta = 1.02, \, \delta =0.0001\). Weights \(w_{ij}\) are chosen with \(\text {AVG}\) (i); f is chosen such that sources and sinks lie inside the green and red rectangles, respectively.

Graph filtering

The output of the graph pre-extraction step is a network closer to our expectation of an optimal network topology resulting from a routing optimization problem. However, this network may contain redundant structures such as dangling nodes or small irrelevant loops (see Fig. 2). These are not related to any intrinsic property of optimality; rather, they are a byproduct of the discretization procedure underlying the graph pre-extraction step. It is thus important to filter the network by removing these redundant parts. However, performing this removal in an automated and principled way is not an obvious task: one has to be careful to remove enough structure while not compromising the core optimality properties of the network. This removal is therefore a problem in and of itself, which we name the graph filtering step. We now explain how we tackle it in a principled way and discuss its quantitative interpretation in terms of minimizing a cost function that interpolates between an operating cost and an infrastructure cost.

The DiscreteDMK-Solver

Going beyond heuristics, and inspired by the problem presented in “The routing optimization problem” section, we address the graph filtering step by applying a second routing optimization algorithm to the network \(G\) output by the pre-extraction step, i.e. in discrete space. Several choices could be drawn, for instance, from the routing optimization literature51, but we need to make sure that this second optimization step does not modify any of the intrinsic optimality properties resulting from the DMK-Solver. We thus propose to use a discrete version of the DMK-Solver (discrete-DMK-Solver). This was proven to be related to the Basis Pursuit (BP) optimization problem52. In fact, BP is related53 to the PP dynamical problem in discrete space, and the discrete-DMK-Solver gives a solution to the PP problem in discrete space52. The discretization reduces the computational cost of solving BP problems compared to standard combinatorial optimization approaches52. Being an adaptation of our original optimization problem to discrete settings, it is a natural candidate for the graph filtering step, as it preserves the solution's properties.

The problem is stated as follows. Consider the signed incidence matrix \({\mathbf {B}} \in {\mathbb {M}}_{N\times M}\) of a weighted graph \(G=(V,E,W)\), with entries \(B_{ie}=\pm 1\) if the edge e has node i as start/end point, 0 otherwise; \(N=|V|\) and \(M=|E|\). Denote by \(\mathbf {\ell } = \left\{ \ell _{e}\right\} _{e}\) the vector of edge lengths and by \({\mathbf {f}}\) an N-dimensional vector of source-sink values with entries satisfying \(\sum _{i\in V}f_{i}=0\); this is the discrete analogue of the source-sink function \(f(x)\) introduced in the section “The routing optimization problem”. The functions \(\mu (t)\in {\mathbb {R}}^{M}\) and \(u(t)\in {\mathbb {R}}^{N}\) correspond to the conductivity and the potential, respectively, similarly to the continuous case, but they are now vectors with entries \(\mu _{e}(t)\) and \(u_{i}(t)\) defined on edges and nodes, respectively. The PP discrete dynamics corresponding to the original routing optimization problem can be written as:

$$\begin{aligned} f_{i}&= {} \sum _{e} B_{ie} \frac{\mu _{e}(t)}{\ell _{e}} \sum _{j}B_{je} \, u_{j}(t) \,, \end{aligned}$$
(5)
$$\begin{aligned} \mu _{e}'(t)&= {} \left[ \frac{\mu _{e}(t)}{\ell _{e}}|\sum _{j}B_{je}\, u_{j}(t)|\right] ^{\beta _{d}}-\mu _{e}(t) \,, \end{aligned}$$
(6)
$$\begin{aligned} \mu _{e} (0)&> {} 0 \,, \end{aligned}$$
(7)

where \(|\cdot |\) is the element-wise absolute value. Equation (5) corresponds to Kirchhoff's law; Eq. (6) is the discrete dynamics, with \(\beta _{d}\) a parameter controlling different routing optimization mechanisms (analogously to \(\beta\) in Eq. 2); Eq. (7) is the initial condition. The importance of this system stems from an interesting theoretical correspondence: its equilibrium point corresponds to the minimizer of a cost function analogous to Eq. (4) that, similarly to the continuous case, can be interpreted as a global energy functional. This is:

$$\begin{aligned} {{\mathscr {L}}}_{\beta }(\mu (t))= \frac{1}{2} \sum _{e}\mu _{e}(t)\left( \frac{1}{\ell _{e}}\sum _{j}B_{je}\, u_j(\mu (t))\right) ^{2}\ell _{e} +\frac{1}{2}\sum _{e}\frac{\mu _{e}(t)^{P(\beta )}}{P(\beta )}\,\ell _{e} \,, \end{aligned}$$
(8)

where \(P(\beta )={(2-\beta )}/{\beta }\) and \(u(\mu (t))\) is implicitly defined as the solution of Eq. (5). The first term corresponds to the energy dissipated during transport and can be interpreted as the operating cost, whereas the second is the infrastructure cost. The equilibrium point of \(\mu (t)\) is a stationary point of this energy functional, and for \(\beta _{d}=1\) it is also the global minimizer, due to convexity. For \(\beta _{d}>1\) the energy is not convex, so in general the functional presents several local minima towards which the dynamics are attracted. The case \(\beta _{d}<1\) does not act as a filter, because it encourages trajectories to spread through the network instead of removing edges, and is thus not of interest for our purposes. Discretization in time of Eq. (6) by the implicit Euler scheme, combined with the Newton method, leads to an efficient numerical solver; see Facca et al.52 for more details. The above scheme gives the solution to the BP problem and constitutes the discrete-DMK-Solver. As for the graph pre-extraction step, the filtering is also valid beyond networks related to solutions of the DMK-Solver: it applies to more general inputs defined on a discrete space, for instance, images. Finally, notice that the filter generates a graph with new sets of nodes and edges, both subsets of the corresponding sets in \(G\), the result of the pre-extraction. The weights of the final graph can then be assigned with the same rules as before; in addition, one can use as weights the values of \(\mu ^*_{e}\) resulting from the BP problem (we name this weighting method “BPW”). Alternatively, one can ignore the weights of BP and keep, for the edges remaining after the filter, the weights of the previous pre-extraction step (labeled “IBP”). Analogously to what is done on the original triangulation, we discard the edges e for which \(\mu _e<\delta _d\).
In our experiments we use as initial density distribution \(\mu _e(0) = w(e), \forall e \in E\), where \(w(e)\) corresponds to the weight of edge \(e\) in the pre-extracted graph. Figure 3 shows an example of three filtering settings on the same input.
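To illustrate how the dynamics of Eqs. (5)-(7) act as a filter, the following self-contained sketch integrates them with an explicit forward-Euler scheme on a toy graph. Note the hedges: the actual discrete-DMK-Solver uses implicit Euler with Newton steps52, and the graph, data and function name here are purely illustrative.

```python
import numpy as np

def discrete_dmk(edges, lengths, f, beta_d=1.0, dt=0.1, steps=2000):
    """Forward-Euler integration of the discrete dynamics, Eqs. (5)-(7).
    The paper's solver uses implicit Euler plus Newton; this explicit
    scheme is only an illustrative sketch."""
    N, M = len(f), len(edges)
    B = np.zeros((N, M))                      # signed incidence matrix, N x M
    for e, (i, j) in enumerate(edges):
        B[i, e], B[j, e] = 1.0, -1.0
    mu = np.ones(M)                           # uniform initial condition, Eq. (7)
    for _ in range(steps):
        L = B @ np.diag(mu / lengths) @ B.T   # weighted graph Laplacian
        u = np.linalg.pinv(L) @ f             # Kirchhoff's law, Eq. (5)
        grad = np.abs(B.T @ u) / lengths      # |potential drop| per unit length
        mu += dt * ((mu * grad) ** beta_d - mu)   # dynamics, Eq. (6)
    return mu

# Toy network: unit source at node 0, unit sink at node 1. Edge 0 is the
# direct route; edges 1 and 2 form a redundant two-hop detour via node 2;
# edge 3 dangles off to node 3 and carries no flux.
edges = [(0, 1), (1, 2), (0, 2), (2, 3)]
f = np.array([1.0, -1.0, 0.0, 0.0])
mu_star = discrete_dmk(edges, np.ones(4), f, beta_d=1.4)
survivors = np.flatnonzero(mu_star >= 1e-3)   # threshold with delta_d = 1e-3
print(survivors)                              # [0]: only the direct edge survives
```

With \(\beta _d>1\) the redundant loop and the dangling edge decay exponentially and are removed by the \(\delta _d\) threshold, which is exactly the filtering behavior exploited in Fig. 3.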

Figure 3

Graph filtering rules. Left: \(\beta _{d}=1.0\); center: \(\beta _{d}=1.4\); right: \(\beta _{d}=1.8\). The numbers on top denote the percentage contributions of the operating and infrastructure costs to the energy, as in Eq. (8). Green and red dots represent sources and sinks, respectively (\(\tau _{BC}=10^{-1}\)); blue edges are those e with \(\mu _e^*\ge \delta _d= 10^{-3}\). The filtering input is generated from the DMK-Solver with \(\beta =1.05\). The apparent lack of symmetry of the network's branches is due to the numerical discretization of the domain, the solver, and the threshold \(\delta\). As the relative size of the terminal set decreases compared to that of the remaining part of the domain, this lack of symmetry becomes negligible.

Selecting sources and sinks

The discrete-DMK-Solver requires as input a set of source and sink nodes (\(S^{+}\) and \(S^{-}\)) identifying the support of the forcing vector \(\mathbf{f }\) introduced in “The DiscreteDMK-Solver”. However, the graph pre-extraction output \(G\) might contain redundant nodes (or edges), as mentioned before. In principle, among the nodes \(i \in V\), all those contained in the support of \(f(\mathbf{b}_{i})\), i.e. in the supports of the sources and sinks of the original routing optimization problem in Eq. (1), are eligible to be treated as sources or sinks in the resulting network. However, several paths connecting source and sink nodes may be redundant and clearly not compatible with an optimal routing network (see Supplementary Fig. S2 for such an example). Therefore, it is important to select “representatives” for sources and sinks, such that the final network is heuristically closer to optimality. Here we propose a criterion to select source and sink nodes from the eligible ones in each of the connected components \(\{C_m\}_m\) of \(G\), using a combination of two network properties. Starting from the complete graph formed by all the nodes with a significant (above-threshold) density, source and sink nodes and rates are defined as follows. A node \(i\) belongs to \(S^{+}\), i.e. is a source with \(f_{i}>0\), if either (i) it is in the convex hull of the set of eligible sources or (ii) its betweenness centrality is smaller than a given threshold \(\tau _{BC}\); similarly for sink nodes in \(S^{-}\). This is because, on one side, nodes in the convex hull capture the outer shape of the source and sink sets defined in the continuous problem; on the other side, nodes with small betweenness centrality capture the end-points of \(G\) inside the source and sink sets, analogously to leaves (i.e., degree-one nodes). Note that, due to the high graph connectivity, degree centrality is not appropriate for selecting these ending parts.
We present these ideas in more detail in Supplementary Fig. S2. Once we have identified the sets of source and sink vertices, we need to assign a proper value \(f_{i}\) such that Kirchhoff's law is satisfied in each of the connected components \(C_{m}\). It is reasonable to assume that each connected component is “closed”, i.e. \(\sum _{i \in C_{m}} f_{i}=0 \,, \forall C_{m}\). Denoting by \(|S|\) the number of elements in a set \(S\) and by \(V(C_{m})\) the set of nodes in \(C_{m}\), we then distribute the mass fluxes uniformly by setting \(f_{i}=\frac{1}{|S^{+}\cap V(C_m)|}\) for \(i \in S^{+}\) and \(f_{i}=-\frac{1}{|S^{-}\cap V(C_m)|}\) for \(i \in S^{-}\) (\(f_{i}=0\) otherwise), so that the total original source and sink flux is assigned to the overall source/sink nodes of all the \(C_{m}\). Note that this procedure keeps the overall system and each connected component “closed”, as stated above.
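A minimal sketch of this flux assignment, with illustrative names and components, sources and sinks given as sets of node ids (a component containing sources but no sinks, or vice versa, would break the closure assumption and needs separate handling):

```python
def assign_fluxes(components, sources, sinks):
    """Distribute a unit of mass uniformly over the sources and sinks of
    each connected component, so that sum_{i in C_m} f_i = 0 holds
    component-wise (Kirchhoff's law)."""
    f = {}
    for comp in components:
        s_plus = sources & comp   # S^+ restricted to V(C_m)
        s_minus = sinks & comp    # S^- restricted to V(C_m)
        for i in comp:
            if i in s_plus:
                f[i] = 1.0 / len(s_plus)
            elif i in s_minus:
                f[i] = -1.0 / len(s_minus)
            else:
                f[i] = 0.0
    return f
```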

Computational complexity

The numerical implementation of our graph extraction algorithm is based on finite element-like solvers that transform the problem into a finite sequence of linear systems. This implies that we need to run a variable number \(N_{T}\) of iterations in time, each requiring \(N_{N}\) Newton steps. Every Newton step requires the approximate solution of a linear system of dimension \(N\) by a preconditioned conjugate gradient solver, which has complexity \(O(N\log N)\)54. The time complexity of our graph extraction algorithm is then \(O(N_{T}\, \times N_{N}\, \times N\log N)\). In practice, because the time discretization converges exponentially towards equilibrium52, \(N_{T}\) is typically roughly constant and \(<10^{2}\), while \(N_{N}\sim 5\). In the worst cases \(N_{T}\, N_{N}\sim N^{0.3}\). Thus the total complexity is \(O(N\log N)\).

The time complexity of other related approaches, such as ORC-based algorithms, is dominated by the computation of the Wasserstein distance, which typically takes \(O(M\,k^{3}\log k)\), where \(k=2M/N\) is the average network degree, when using linear programming; this can be further improved using wavelet earth-mover-distance approximation approaches55. While \(M>N\) in general, in sparse networks such as those used in our experiments \(M\sim N\).

Other approaches that solve similar problems are based on Steiner tree solvers39 and have a complexity that depends on the number of sources and sinks, in addition to the system size. The complexity of our method instead does not depend on these, but only on the network size.

Model validation

Our extraction pipeline proceeds by compressing the routing information in the raw output of the DMK-Solver (although what follows is not restricted to this case) onto a lean network structure. This might lead us to lose relevant information in the process. Hence, we need to devise a posteriori estimates that provide quantitative guidance on the “leanness” and information loss of the final network. Here we propose metrics to evaluate the compression performance of the various graph pre-extraction and filtering protocols. The raw information consists of a set of weights \(w(T_{i})\) representing the values \((\mu ^{*},u^{*})\) on each of the triangles \(T_{i}\in {\varDelta }_{\varOmega }\). We consider as the ground-truth benchmark the distribution of \(w\), or any other quantity of interest, supported on the subgrid \({\varDelta }_{\varOmega }^{\delta } \subset {\varDelta }_{\varOmega }\) formed by all triangles where \(w\) is larger than the threshold value \(\delta\), i.e., \({\varDelta }_{\varOmega }^{\delta }:=\{T_i \in {\varDelta }_{\varOmega }: w(T_i)\ge \delta \}\). We expect a good compression scheme to preserve both the total amount of the weights from the original solution in \({\varDelta }^{\delta }_{{\varOmega }}\) and the information of where these weights are located inside the domain \({\varOmega }\). We also want this compression to be parsimonious, i.e. to store as little information as possible. We test these two requirements by proposing two metrics that measure: (i) an information difference between the raw output of the DMK-Solver and the network extracted using our procedure, capturing where the weights are located in space; (ii) the amount of information needed to store the network.

Our first proposed metric relies on partitioning \({\varOmega }\) into several subsets and then calculating, locally within each subset, the difference between the extracted network weights and the uncompressed output. More precisely, we partition \({\varOmega }\) into \(P\) non-intersecting subsets \(C_{\alpha } \subset {\varOmega }\), with \(\alpha =1,\dots ,P\) and \(\cup _{\alpha =1}^{P}C_{\alpha }={\varOmega }\). For example, we define \(C_\alpha = [x_i,x_{i+1}]\times [y_j,y_{j+1}]\), with \(x_i,x_{i+1},y_j,y_{j+1}\) consecutive elements of \(N\)-regular partitions of \([0, 1]\), and \(P=(N-1)^2\). Denote with \(w_{\delta }(T_{i})\) the weight on the triangle \(T_{i}\in {\varDelta }^{\delta }_{{\varOmega }}\) resulting from the DMK-Solver (usually a function of \(\mu ^*\) and \(u^*\)). If we denote the local weight of \({\varDelta }_{\varOmega }^{\delta }\) inside \(C_{\alpha }\) as \(w_{\alpha }=\sum _{i: \mathbf{b }_{i}\in C_{\alpha }}w_{\delta }(T_{i})\), then we propose the following evaluation metric:

$$\begin{aligned} {\hat{w}}_q(G):&= {} \dfrac{1}{P}\left[ \sum _{\alpha =1}^{P}\left| \sum _{e\in E}{\mathbb {I}}_\alpha (e) \, w_{e} - w_{\alpha }\right| ^{q} \right] ^\frac{1}{q} \,, \end{aligned}$$
(9)

where \({\mathbb {I}}_\alpha (e)\) indicates whether an edge \(e=(i,j) \in E\) is inside an element \(C_{\alpha }\) of the partition, i.e. \({\mathbb {I}}_\alpha (e)=1,0,1/2\) if both \(\mathbf{b }_{i},\mathbf{b }_{j}\) are in \(C_{\alpha }\), none of them is, or only one of them is, respectively. In words, \({\hat{w}}_{q}(G)\) is a distance, over each of the local subsets \(C_{\alpha }\), between the weights of the network extracted by our procedure and the original weights output by the DMK-Solver. This metric penalizes networks that either place large-weight edges where they were not present in the original triangulation, or low-weight ones where they were instead present originally. In this work we consider the Euclidean distance, i.e. \(q=2\), but other choices are also possible. Note that \({\hat{w}}_{q}(G)\) says nothing about how much information was required to store the processed network. If we want to encourage parsimonious networks, i.e. networks with few redundant structures, then we should also monitor \(L(G)=\sum _{e\in E} \ell _{e}\), the total path length of the compressed network, where the edge length \(\ell _{e}\) can be specified based on the application. Standard choices are uniform lengths \(\ell _{e}=1,\, \forall e\), or the Euclidean distance between \(\mathbf{b }_{i}\) and \(\mathbf{b }_{j}\). Intuitively, networks with small values of both \({\hat{w}}_{q}(G)\) and \(L(G)\) are both accurate and parsimonious representations of the original DMK solutions defined on the triangulation.
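Eq. (9) can be evaluated directly from the edge list and the partition. The sketch below assumes axis-aligned rectangular cells \(C_\alpha\) and uses illustrative names; it is not our production code:

```python
def indicator(edge, cell, coords):
    """I_alpha(e) = 1, 1/2 or 0 if both, one or none of the edge
    endpoints fall inside the rectangular cell ((x0, x1), (y0, y1))."""
    (x0, x1), (y0, y1) = cell
    inside = sum(1 for v in edge
                 if x0 <= coords[v][0] <= x1 and y0 <= coords[v][1] <= y1)
    return inside / 2.0

def w_hat(edges, edge_w, coords, cells, w_alpha, q=2):
    """hat{w}_q(G) of Eq. (9): (1/P) * [sum_alpha |net weight in C_alpha
    - original local weight w_alpha|^q]^(1/q)."""
    total = 0.0
    for cell, wa in zip(cells, w_alpha):
        net = sum(indicator(e, cell, coords) * edge_w[e] for e in edges)
        total += abs(net - wa) ** q
    return total ** (1.0 / q) / len(cells)
```

A perfectly compressed network reproduces every local weight \(w_\alpha\) and therefore scores \({\hat{w}}_q(G)=0\); any local surplus or deficit increases the metric.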

We evaluate numerous graph extraction pipelines in terms of these two metrics on various routing optimization problem settings and parameters. In Fig. 4 we show the main results for a distribution of 170 networks obtained with \(\beta \in \left\{ 1.1,\, 1.2,\, 1.3\right\}\) and \(\beta _{d}=1.1\). Similar results were obtained for other parameter settings. Networks are generated as follows: first, we choose a set of 5 different initial transport densities \(\mu _0\), grouped into parabola-like, delta-like and uniform distributions, and a set of 12 different configurations for sources/sinks (mainly rectangles placed at different positions in the domain; see Supplementary Information for more details). Then, for each of these setups, we run our procedure: (i) first the DMK-Solver calculates the solution of the continuous problem; (ii) then we apply the graph pre-extraction procedure according to the rules of “Rules for selecting nodes and edges” with weights as in “Rules for selecting weights”; (iii) finally, we run the graph filtering step and consider various weight functions, as described in Fig. 4.

Figure 4
figure 4

Graph extraction performance evaluation. We plot results for the different combinations of the graph extraction rules in terms of: (left) the metric \({\hat{w}}_{q}(G)\) of Eq. (9); (right) the total network length \(L(G)\) normalized by \(L_{max}\), the maximum length over the 170 networks. Each bar denotes a possible combination as follows: Roman numerals denote one of the three rules I–III (“Rules for selecting nodes and edges”); the first label after the numeral denotes one of the weighting rules i–ii (“Rules for selecting weights”), applied to the output of the first step; the second (and last) label denotes the same rule but applied after the filtering step, where “None” means no filter is applied and “IBP” means the filter is applied but with no reweighing, i.e. when an edge is removed by the filter we simply lose information without relocating its weight. Bars are color-coded so that rules I–III have three different primary colors and their corresponding routines have different shades of that main color. Here we keep track of the conductivity \(\mu\) and show medians and quartiles of a distribution over 170 networks generated with \(\beta \in \left\{ 1.1,\, 1.2,\, 1.3\right\}\), \(\beta _{d}=1.1\) and \(\delta =0.01\).

We observe that when the final filtering step is not applied and rule I with ER is used to build the graph (I-ER-None), the values of \({\hat{w}}_2(G)\) are smaller than in the other cases. This is expected, as filtering removes information, so the unfiltered network performs better on this metric. However, we pay a price in terms of total relative length, as \(L(G)/L_{max}\) is larger in this case. When working with rule II, we notice the appearance of many non-optimal small disconnected components, and this effect worsens if filtering is activated. The corresponding statistics show low values for both \({\hat{w}}_2(G)\) and \(L(G)/L_{max}\). We argue that this is because rule II produces, by construction, fewer redundant objects than rule I in the initial phase. This might have an effect similar to a filter, but it occurs a priori during the pre-extraction, because rule II produces a limited number of effective neighbors in this phase. However, this comes at the price of higher variability across the sampled networks, as the variance of \({\hat{w}}_2(G)\) is higher than for the other combinations. Among the possibilities with filtering applied, we observe that rule I performs better than rule III, while all the weighting rules give a similar performance in terms of both metrics. Any combination involving rule I plus filtering has a performance similar to that of rule II in terms of both metrics, but with smaller variability. Finally, these combinations perform differently in terms of the number of disconnected components (not shown here), with rule II producing more spurious splittings, as already mentioned. Depending on the application at hand, a practitioner should select one of these combinations based on the properties discussed in this section. We give an example of a network generated with I-ER-ER in Fig. 5.

Figure 5
figure 5

Network extraction example. We show a network generated from a routing optimization problem with parameters \(\beta _c=1.4\), \(\beta _d=1.3\) and \(\delta =0.001\); (left) raw output of the DMK-Solver; (right) final network extracted using the routine I-ER-ER.

Application: network analysis of a vein network

We demonstrate our protocol on a biological network of fungi foraging for resources in space. The network structure corresponds to the fungal response to food cues while foraging56. Edges are veins or venules and connect adjacent nodes. This network and those of other types of fungi are well-known networks, typically studied using image segmentation methods28,29,30,31. It is thus interesting to compare the results found by these techniques and by our approach, under the conjecture that the underlying dynamics driving the network structure could be the same as the optimality principles guiding our extraction pipeline. In particular, we are interested in analyzing the distribution \(P(\ell )\) of the vein lengths, i.e. of the network edges. The benchmark \(P(\ell )\) distribution obtained by Baumgarten and Hauser28 using image processing techniques is an exponential of the type \(P(\ell ) = P_{0}\,e^{-\gamma \ell }\). Accordingly, as shown in Fig. 6, we find that an exponential fit (with values \(P_0=234.00\), \(\gamma = 36.32\)) captures well the left part of the distribution, i.e. short edges. Differences between fit and observed data can be seen in the right-most tail of the graph, corresponding to longer path lengths, where the data decay faster than the fit. However, we find that the exponential fit is nevertheless better than other distributions, such as the gamma and log-normal proposed by Dirnberger and Mehlhorn57 for P. polycephalum. Drawing definite quantitative conclusions is beyond the scope of our work, as this example aims at a qualitative illustration of possible applications that can be addressed with our model. In general, however, it does not seem possible to choose a single distribution that fits well both the center and the tails of the distribution for various datasets of this type57.
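For reference, a simple alternative to the fit reported in Fig. 6 is a maximum-likelihood estimate of the exponential rate, \(\hat{\gamma }=1/{\bar{\ell }}\), whose log-likelihood can then be compared against competing families such as the gamma or log-normal. The functions below are an illustrative sketch, not the fitting code used for the figure:

```python
import math

def exp_mle(lengths):
    """Maximum-likelihood rate for P(l) = gamma * exp(-gamma * l):
    gamma_hat = 1 / mean(lengths)."""
    return len(lengths) / sum(lengths)

def exp_loglik(lengths, gamma):
    """Log-likelihood of the exponential fit; the higher value wins when
    comparing against gamma or log-normal fits on the same sample."""
    return sum(math.log(gamma) - gamma * l for l in lengths)
```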

To conclude, we demonstrate the flexibility of our graph extraction method on a more general input than the one extracted from the DMK-Solver. Specifically, we consider as an example an image of P. polycephalum taken from data publicly available in the Slime Mold Graph Repository (SMGR)58. We first downsample an image from the SMGR's KIST Europe data set using OpenCV, and use a color scale defined on the pixels as an artificial \(\mu ^*\) function. We build a graph using the graph pre-extraction and graph filtering steps, as shown in Fig. 7. Notice that our protocol in its standard settings with filtering can only generate tree-like structures. Therefore, if we want to obtain a network with loops, as we did in Fig. 7, we should consider a modification of our routine, which can be done in a fully automatized way, as explained in more detail in the Supplementary S4. In short, after the graph pre-extraction step, where loops are still present, we extract a tree-like structure close to the original loopy graph and feed it as input to the filtering. We can then add a posteriori the edges that connect terminals that were close in the graph obtained from the pre-extraction step but removed by the filter, thus recovering loops. In case loops are not required, our routine can be used without modifications. Adapting our filtering model to allow for loopy structures in a principled way, analogously to what is done in “Graph preliminary extraction”, will be the subject of future work.
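The pre-processing used here (downsampling plus an artificial \(\mu ^*\) on the pixel grid) can be sketched without OpenCV as plain block averaging. The function names and the thresholding rule below are illustrative assumptions, not our exact pipeline:

```python
def image_to_mu(pixels, block):
    """Downsample a grayscale image (list of rows of intensities) by
    block averaging; the result plays the role of mu* on a coarse grid."""
    h, w = len(pixels), len(pixels[0])
    mu = {}
    for i in range(0, h - block + 1, block):
        for j in range(0, w - block + 1, block):
            vals = [pixels[i + a][j + b]
                    for a in range(block) for b in range(block)]
            mu[(i // block, j // block)] = sum(vals) / len(vals)
    return mu

def grid_edges(mu, delta):
    """Pre-extraction on the pixel lattice: keep edges whose endpoints
    both carry an artificial mu* above the threshold delta."""
    active = {p for p, v in mu.items() if v > delta}
    return [((i, j), nb) for (i, j) in active
            for nb in ((i + 1, j), (i, j + 1)) if nb in active]
```

The resulting edge list can then be passed to the filtering step, with sources and sinks selected as in “Selecting sources and sinks”.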

Figure 6
figure 6

Application to fungi network. We generate a synthetic network similar to the image of Fig. 1a reported in Boddy et al.31 and Fig. 4a in Obara et al.29 for the Phanerochaete velutina fungus56, and to Fig. 1 in their supplementary material for Coprinus picaceus. Fitted parameters are: \(P_0=234.00\), \(\gamma = 36.32\). Here \(f^+(x,y)=1\) if \((x-0.5)^2+(y-0.5)^2\le 0.01\), \(f^+(x,y)=0\) otherwise; \(f^-(x,y)=-1\) if \(0.01<(x-0.5)^2+(y-0.5)^2\le 0.45\), \(f^-(x,y)=0\) otherwise. The network on the left corresponds to the filtered graph. Yellow nodes are degree-2 nodes that we omitted when computing the length distribution. Green and red outlines denote nodes in \(S^+\) and \(S^-\), respectively.

Figure 7
figure 7

Application to images. We take an image of P. polycephalum from the SMGR repository58. The picture used is a 1200x1200-pixel section of an original image of size 5184x3456 pixels (see Supplementary S4 for details), from which we extract a network with steps 2 and 3 of our protocol. As a pre-processing step, we downsample the image using OpenCV (left) and use the color scale defined on the pixels as an artificial \(\mu ^*\) function. Using this information and the grid structure associated with the image's pixels, we first build a graph \(G\) with the graph pre-extraction step described in “Graph preliminary extraction”; then, we obtain a graph \(G_{f}\) (right) using the graph filtering step of “Graph filtering”, for an appropriate selection of sources and sinks, and adding a correction to retrieve loops. Notice that our protocol in its standard settings with filtering can only generate tree-like structures. Therefore, if we want to obtain a network with loops, we should consider a minor modification of our routine, which can be done in a fully automatized way, as explained in more detail in the Supplementary S4.

Discussion

We propose a graph extraction method for processing raw solutions of routing optimization problems in continuous space into interpretable network topologies. The goal is to provide a valuable tool that helps practitioners bridge the gap between the abstract mathematical principles behind optimal transport theory and the more interpretable and concrete principles of network theory. While the underlying routing optimization scheme behind the first step of our routine uses recent advances in optimal transport theory, our tool enables automatic graph extraction without requiring expert knowledge. We purposely provide a flexible routine for graph extraction so that it can be easily adapted to serve the specific needs of practitioners from a wider interdisciplinary audience. We thus encourage users to choose the parameters and details of the subroutines to suitably customize the protocol based on the application of interest. To help guide this choice, we provide several examples here and in the Supplementary Information. We anticipate that this work will find applications beyond automating graph extraction from routing optimization problems. We remark that two of the three steps of our protocol apply to inputs that do not necessarily come from solutions of routing optimization. Indeed, the pipeline can be applied to any image setting where an underlying network needs to be extracted. This can have a relevant impact in applications involving biological systems like neuronal networks, for which we observe an increasing amount of data from imaging experiments. The advantage of our setting with respect to more conventional machine learning methods is that the final structure extracted with our approach minimizes a clearly defined energy functional, which can be interpreted as the combination of the total energy dissipated during transport and the cost of building the transport infrastructure.
We foresee that this minimization interpretation, together with the simplification of the pipeline from abstract modeling to final concrete network outputs, will foster cross-breeding between fields, as our tool will inform network science with optimal transport principles and vice versa. In addition, we expect to advance the field of network science by promoting the creation of new network databases related to routing optimization problems. For instance, an interesting direction for future work is to extend our optimal transport-based method to address other network-related applications such as geometry-based community detection.