Introduction

The universe synthesizes lighter particles into heavier ones and the mechanism which remains a heated area of exploration, includes element nucleosynthesis1 and nuclear processes2,3,4,5,6,7,8. Much attention has been drawn on topics like the limits of nuclear binding and the island of stability9,10,11. Those studies help us to understand nuclear reactions taking place both in the distant galaxies and in the reactors on the earth. Detailed studies of nuclide consumption or production through reactions are required with the support of massive nuclear data. The JINA REACLIB database, maintained by the Joint Institute for Nuclear Astrophysics, is a well-known source of thermonuclear reaction rates. It aims to compile a complete set of nuclear reaction rates that is continuously updated and regularly snapshotted12. The dataset contains various kinds of nuclear reactions, parameters for calculating reaction rates, Q values, etc., providing rich information for nuclear-related research directions.

Besides acquiring exact nuclide abundance by solving sets of time-dependent differential equations in the database, it is challenging to treat the nuclear reaction system as a complex network to explore its statistical characteristics. The basic idea is introduced from graph theory, which considers the interacting units in a system as nodes and the relationship between two units as an edge, thus the system can be studied as a graph (network). The topic of complex networks has achieved significant advances since the ‘small world’13 and ‘scale-free’14 characteristics were found prevalent in many real world systems such as social connections, the Internet and distributed infrastructures15. The structure and dynamics of networks mapped from those systems turn out to be distinct from that of regular or random networks16,17,18 and complex networks outperform them in modeling real world systems. For example, the ‘scale-free’ structure of the Internet, which is hierarchical with many hubs, explains how easy it is for viruses to propagate. These findings help us to understand the systems at more profound levels and have benefited researches in various areas19, hopefully including nuclear reactions.

An important branch of the studies on complex networks covered metabolic networks or chemical reaction networks. The topological scaling properties were first analyzed of 43 organisms from different kinds of life in a directed network approach and similarity was found within and outside biological systems20. It is believed that ‘scale-free’ was crucial for the robustness. Moreover, the chemistry of planetary atmospheres and the interstellar medium were compared and it was demonstrated that only the network of the Earth, coupling with its biosphere, was scale-free21. Efforts were made for obtaining more results from different systems like interstellar chemistry22,23,24, trying to explain the origins of complexity in large systems or to guide experiments25. A majority of the studies map their reaction systems into networks using substrate-product method, which treats reactants and products as nodes and connects them with edges if they are in the same reaction equation. In reference26, several mapping methods were evaluated. It was concluded that substrate-product and substance networks could best preserve modularity information. Although effective to a certain degree, as further discussion was made on precision of the methods used27,28, the methods for mapping reactions into networks still need improvement to better fit a specific system in order to avoid inaccuracy. Worth mentioning, the bipartite graph is also an efficient way to characterize reactions29. It maps the substance and reaction into two kinds of nodes in one network, but the analysis of bipartite graph is complicated. We believe the best way is to extract the most information with a simpler method.

In the present work, we study a directed weighted nuclear reaction network based on substrate-product method, including all nuclides and reactions from the JINA REACLIB database. Additionally, we label each edge with 4 types: neutron, proton, 4He and the remainder, hence a 4-layered network. The multilayer network provides more precise description of a system, without which one might draw misinforming conclusions30. To efficiently analyze the multilayer network, frameworks and models are studied31,32,33,34,35,36 and metrics for multilayer structure are proposed such as edge overlap37,38, various centralities39,40 and correlations41,42,43, adding a new dimension to complex networks research. Detailed reviews on multilayer network are given in reference44,45,46. According to the dependence between nodes in different layers47,48,49, there are generally 3 types of multilayer network: one-to-one, partially dependent and one-to-many. We focus on the one-to-one dependent situation and the multilayer substrate-product method falls into the multidimensional network category, which is originated from colored edge network. There are by far some attempts on biomedicine in reaction/metabolic systems using multilayer method. Here, the advantage of multilayer network analysis helps us to find some structural characteristics of nuclides relating to their physical property, which is a possible path to address the nuclide stability problem that nuclear physics and astrophysics pay close attention to.

Results

Multilayer reaction network

The dataset used in this paper is the latest version of REACLIBV2.0 containing 8048 nuclides treated as nodes and 82851 reactions mapped into directed edges (pairs from each reactant to each product). Over 3000 nuclides, either exist naturally or are synthesized artificially, whose half-life (th) data are complemented from NuDat (http://www.nndc.bnl.gov/nudat2/), while the rest are from theoretical calculation. Take 40Ca as an example, the recorded reactions involving this nuclide are as follows:

According to the substrate-product method, for the first reaction of 40Ca, we have 4 edges: one from 40Ca to n and one from 40Ca to 39Ca and vice versa because of the reverse reaction. However, as neutron, proton and 4He are the most needed particles in nuclear reactions, more than half of the reactions turn a nuclide into other nuclides by capturing or releasing one or more of these three particles, making their degrees k (number of edges) close to 104. If they are considered as normal nodes like other nuclides, whose degrees are smaller than 102, their role of intermediary material will be lost in the topology and the degree distribution will be highly biased (see Supplementary Figure 1). The nodes with extremely high degrees in reaction systems, often referred to as currency nodes, can make the network difficult to analyze and they are usually deleted in reaction networks. Here, instead of completely deleting these currency nodes, we label an edge with one of the three particles (‘n’ for neutron, ‘p’ for proton or ‘h’ for 4He) when a nuclide needs this very particle to produce another nuclide. If none of the three particles are needed as reactant in a reaction, the edges from reactants to products will be labeled as ‘the remainder’ or ‘r’, which is not as simple as the situation for the other 3 layers and some information can be lost. Hence, the first reaction of 40Ca has only one r-edge from 40Ca to 39Ca and one n-edge from 39Ca to 40Ca. The corresponding topologies of 40Ca’s 4 layers are shown in Fig. 1.

Figure 1
figure 1

The topologies of 40Ca’s 4 layers.

(a) n-layer, (b) p-layer, (c) h-layer, (d) r-layer. The network is constructed based on substrate-product method with edges labeled with currency particles in the reactants.

Note that there are mainly 4 ways to map a reaction into a network: substrate-product network (connecting reactants to products), substrate-substrate network (reactants to reactants and products to products), substance network (the combination of the first and second methods) and reaction-reaction network (taking reactions as nodes and connecting 2 reactions if they share the same substance). Many reactions have multiple substrates while an edge in the network analysis usually connects only one pair of nodes, thus some topological structure could be lost when mapping a system to a network. However, in the case of nuclear reactions, the majority only have one or two reactants (see Supplementary Note 1), so that the reactions are basically about one nuclide to another, making the information lost very limited. Also, if there are multiple products, some products can be decided following the conservation of mass (e.g. the 4th reaction of 40Ca, also written as 40Ca(n,p)40K, the information about p is redundant). In this sense, the complete reaction information can be preserved. For other methods, the connection between reactants or products, as in substrate-substrate and substance network, could bring in misleading information, while reaction-reaction network loses too much. So the substrate-product network is our best choice. The topologies of the whole system can be found in Supplementary Figure 2.

Topological characteristics and stable nuclides

The network is usually studied via its adjacency matrix A. The matrix element aij = 1 if there is an edge from node i to j, otherwise aij = 0. In a directed network, in-degree and out-degree are counted separately by summing up A in different directions. As for multilayer network, we have A[α] for layer α, where, in our nuclear reaction network, M = 4 and α  {n, p, h, r}. A is denoted as the aggregated network of those 4 layers, aij = 1 if . The degree distributions of aggregated network and r-layer on a Z-N plane are shown in Fig. 2. The degrees of neutron-rich nuclides are mostly greater than those of proton-rich ones. While for n-, p- and h-layers, all the degree distributions have a peak at k = 3, the networks are very homogeneous. We also notice that k and km (averaging degrees of nearest neighbors) are highly correlated, which can be recognized as assortativity and has been found prevalent in many other networks. k[r] and have a certain correlation, but there is little in the rest of 3 layers (see Supplementary Figure 3–6 and Supplementary Note 2).

Figure 2
figure 2

Degree distributions of aggregated network and r-layer on a Z-N plane.

The in-degree and out-degree are calculated for each nuclide and the values are indicated by color, with x-axis being the number of neutrons (N) in that nuclide and y-axis being protons (Z). (a) In-degree, (b) Out-degree and (c) Degree difference are for the aggregated network. (d) In-degree, (e) Out-degree, (f) Degree difference are for the r-layer.

The abundant topological characteristics indicate there is rich information stored in the structure of the aggregated network and r-layer, so we explore them further by subtracting in-degree with out-degree as shown in the lower panels of Fig. 2. We can see that the degree difference of r-layer looks similar to the aggregated network, but there is a clear path-like positioned nodes whose in-degree is larger than out-degree (red dots), which are surrounded by scattered nodes that have smaller in-degree than out-degree (blue dots) along the path. On the other hand, the aggregated network does not possess this kind of pattern. As Fig. 3 shows, which is an enlarged version of panel (f) in Fig. 2, this path fits the beta-stability line smoothly and we would infer that this pattern describes most the positions of stable nuclides. Also, the remainder of the red part may be related to the r-process of nucleosynthesis.

Figure 3
figure 3

β-stability line fitting.

The valley of stability is fitted on the degree difference distribution of r-layer. The red solid line and the purple dash-dotted line are both empirical predictions of beta-stability lines where and . The black solid lines with arrows could be able to outline the r-process. The black dotted line is auxiliary following Z = N. The inset demonstrates the fitting result of condition . Green dots are 192 stable nuclides that match for the condition. Yellow dots are 31 unstable nuclides mis-included by the condition. Red dots are 40 stable nuclides missed by the condition. The ratio of correct prediction is relatively high.

To determine the characteristics of stable nuclides quantitatively, we compared the degree differences of the aggregated network and r-layer and found that over 80% of the stable nuclides follow the condition (see the inset of Fig. 3). As in-degree and out-degree correspond to the general ability of production and consumption of a nuclide, the aggregated network describes the total balance, so it is reasonable for ki,in = ki,out to be one of the stability conditions. While in r-layer, where the n, p, 4He reactions are excluded, the condition of could result that the stable nuclides can be accumulated. As Fig. 4 shows, this condition does not apply to the stable nuclides with number of protons smaller than 15, which is probably because the area has more longer edges according to our measure. Thus we can conclude that for the Z ≥ 15 part, the topology is more regular, while the rest part has more complexity. Furthermore, the stable condition indicates that the degree differences in other 3 layers are not the same for in and out situations, so that the difference in r-layer can be compensated, eliminating the difference in the aggregated network. We checked this inference via node overlapping coefficient as depicted in Fig. 5. The stable nuclides do have much higher edge overlap in the in-direction than out-direction. In fact, are mostly the highest among all the nuclides and are the lowest down to zero.

Figure 4
figure 4

Degree difference over number of protons.

In the region of approximately Z ≥ 15, the in-degree and out-degree in the aggregated network are equal, while the difference is mostly 2 in the r-layer. For Z ≤ 15, the stable nuclides show topological complexity.

Figure 5
figure 5

Node overlapping coefficient.

(a) oi,in and (b) oi,out of all nuclides are plotted over number of protons. The red triangles mark the stable nuclides which have high oi,in and low oi,out.

Weighted analysis and nuclide half-life

Networks without weights are usually called binary, which, in our case, only indicates the reactions that have the capability to take place. While the weights on the edges make the structure even more complicated and draw a clearer picture of the system. For each reaction, there are 7 parameters (ai) for calculating reaction rates λ using Equation 1, where T9 is the temperature in Gigakelvin ref. 12. A larger λ indicates higher probability for a reaction to occur, which can be mapped to the notion of edge weight in network theory, describing how easy it is to travel between nodes or how important the edge is.

Although kin and kout are highly correlated in binary network (upper and middle panels of Fig. 2), sin and sout have various correlation under different temperature values as the reaction rate is a function of T9 (see Supplementary Figure 7). This could be the reason why a 4-layered network is able to lay foundation for all complex systems at atomic level with 3 layers being rather regular. The complexity comes from the diversity of edge weights. Of the temperature range we explored, it is observed that for T9 ≤ 3, a reciprocal relationship

exists between th and as in the upper panel of Fig. 6. Approximately 2500 out of 3000 nuclides with half-life values fit for Equation 2 if the temperature is low, where th is in second. The proportion of fitted nuclides drops as T9 increases. Those unfitted points could be caused by insufficient data to form the network. This result can help to predict the half-lives of unknown nuclides when their reaction rates with neighbors are calculated, as shown in the lower panel of Fig. 6. A comparison with half-life distribution plotted using Nudat data can be found in Supplementary Figure 8. Although correlated to to some extent, does not have such relationship with th, nor strengths of other layers. The reason for this phenomenon can be explained that half-life is determined by how easy it is for a nuclide to convert into others and r-layer contains mainly reactions that are classified as one nuclide to others without any currency particles. Also, it indicates that the life-time of a nuclide does not depend on its input channel by which a nuclide is formed, i.e. the life-time of a certain nucleus has no memory for the formation history of itself.

Figure 6
figure 6

The relationship between half-life and out-strength of r-layer and a prediction.

(a) At T9 = 0.01, the half-life of approximately 2500 out of 3000 nuclides scales reciprocally with node out-strength of r-layer. (b) A prediction of half-life based on Eq. 2. ln(th) of a nuclide is indicated by color, with x-axis being the number of neutrons (N) in that nuclide and y-axis being protons (Z).

Discussion

The whole set of nuclear reactions is one of the most fundamental systems in the world. The network constructed is quite different from previously studied ones. The average shortest distance is a bit longer than the network diameter, which indicates that it is not such a small world. The degree distribution is narrow with peaks, far from the power-law often observed. Three layers of the 4-layered network are quite regular. The nodes are spatially constrained, lacking enough long range edges to make the communication or propagation more efficient, yet they can generate much more complicated items like chemical molecules and biological tissues. The temperature-dependent strength may take the credit for the complexity. Despite the special structure, some interesting node characteristics are observed from the complex network point of view.

The multilayer network construction method is effective, even some of the products are deleted, we can still extract important information from the network. As stability is about the balance of in and out corresponding to production and consumption in reactions, the degree difference and node overlapping coefficient are able to identify the majority of stable nuclides and the relationship between half-life and out-strength in r-layer are discovered. The results are obtained with the help of multidimensional method, linking nuclide characteristics with topological structures.

When we focus on a complicated reaction process (e.g. r-process is about continually capturing neutrons to produce heavier nuclides), it usually occurs only in a certain layer, even if the network structure is simple. Therefore, as we study nucleosynthesis and nuclear processes, based on the primitive results of nuclear reaction network analysis, our future work will focus on a better designed mapping method, the dynamics on the network and more detailed analysis of each layer.

Methods

Mapping reactions into a network

Figure 7 is a demonstration of our mapping method. Note that the number of edge types between two nodes can range from 0 to the number of layers M.

Figure 7
figure 7

Mapping reactions into a network.

(a) Pseudo equation sets including 4 kinds of reactions. (b) Mapping all the reactants and products into a network, similar to a bipartite graph, where a blue square stands for a reaction and a red circle for a nuclide. (c) The aggregated network. Edges are labeled according to the currency particles in the reactants.

Node degree and overlapping coefficient

The in-degree and out-degree for node i in layer α are defined as in Equation 3. is the number of different types of edges between node i and j, called edge overlap, which is proposed in ref. 37. An average edge overlap is defined in ref. 34. To evaluate the overall overlap of a node in different layers, we define a normalized node overlapping coefficient oi, as in Equation 4, based on the edge overlap, where ki is the degree of node i in the aggregated network.

Node strength

We calculate the rates of each reaction and assign the values to the edges respectively as edge weight. The rate value ranges from approximately 10−300 to 1050 at T9 = 0.1 and varies as T9 grows from less than 0.01 up to 10. The weights of the two edges with opposite directions between a pair of nodes are usually not the same as the rates of a reaction and its reverse reaction often differ. For T9 ≥ 0.01, the calculation of reverse rates should be corrected by including partition functions, as they are calculated via detailed balance without partition functions in the dataset. To understand the characteristics of a node, we derive node strength si by summing up edge weights from or to all the neighboring nodes as shown in Equation 5.

Additional Information

How to cite this article: Zhu, L. et al. Multilayer network analysis of nuclear reactions. Sci. Rep. 6, 31882; doi: 10.1038/srep31882 (2016).