Abstract
Capability to compare biological models is a crucial step needed in an analysis of complex organisms. Petri nets as a popular modelling technique, needs a possibility to determine the degree of structural similarities (e.g., comparison of metabolic or signaling pathways). However, existing comparison methods use matching invariants approach for establishing a degree of similarity, and because of that are vulnerable to the state explosion problem which may appear during calculation of a minimal invariants set. Its occurrence will block usage of existing methods. To find an alternative for this situation, we decided to adapt and tests in a Petri net environment a method based on finding a distribution of graphlets. First, we focused on adapting the original graphlets for notation of bipartite, directed graphs. As a result, 151 new graphlets with 592 orbits were created. The next step focused on evaluating a performance of the popular Graphlet Degree Distribution Agreement (GDDA) metric in the new environment. To do that, we decided to use randomly generated networks that share typical characteristics of biological models represented in Petri nets. Our results confirmed the usefulness of graphlets and GDDA in Petri net comparison and discovered its limitations.
Introduction
During the last two decades it was more and more evident that living organisms as well as their functional blocks, like organs, tissues, cells, etc., are complex biological systems. It means that for a deep understanding of their nature and functionality it is not sufficient to analyze their building blocks separately (which was a dominating approach in biological sciences). More global view on the relations among elementary biological components is necessary. In other words, living organisms should be seen as complex systems because dense networks of interconnections among their components determine their structure and functionality. So, a systems approach is needed for studying their properties. Such an approach has been for decades applied in technical sciences, so some already developed methods can be adapted to biological research. This led to an emergence of systems biology, i.e., a branch of science, where biological phenomena are analyzed from systems point of view^{1}.
A basis for a formal analysis of any system, not only a biological one, is a model expressed in a language of some branch of mathematics. Like in other areas of science and technology, differential equations play an important role also in modelling biological systems. However, in this area they have some serious limitations following, among others, from the fact that usually it is very difficult, or even impossible, to collect all quantitative data characterizing the analyzed system. It follows from the fact that technically it is very hard to measure in a laboratory all parameters describing dozens or hundreds of elementary reactions/processes taken into account in a model. These parameters are necessary in models based on differential equations but are not used in graphbased models since they describe only a structure of the system (which in most cases is crucial for the functionality of biological systems). This is one of the reasons that models based on graph theory are often built and analyzed.
One of the more and more popular approaches to modelling biological systems is based on Petri nets (metabolic pathways, organelles, cells, and drug development^{2,3}). Nets of this type are not graphs but they have a structure of a weighted directed bipartite graph^{4,5,6}.
In the analysis of biological systems there happens that there is a need to compare two or more of them. Such a comparison may lead to, e.g., discovering some important, well conserved subprocesses. There is a lot of methods of graph comparison (e.g. graph distance^{7}, graph edit distance^{8}, maximal common subgraph^{9}), but since Petri nets are not graphs, such methods usually cannot be directly applied to comparisons of models expressed in the language of Petri nets theory. Moreover, there is only few methods for comparison of Petri netbased models of biological systems known in the literature^{10,11}. Since there is a growing popularity of Petri nets in systems biology, there is also a growing need for such methods.
In this paper we propose a method for Petri nets comparison based on graphlets. They are sets of connected subgraphs and almost two decades ago were proposed for an analysis of protein–protein interaction (PPI) networks^{12}. Such networks are undirected graphs describing interactions in a set of analyzed proteins.
The organization of this paper is as follows. In Section Methods basic concepts of Petri nets and graphlets are briefly presented. In “Results and discussion” section graphlets which can be used for Petri nets comparison are proposed. Moreover, in this section results of computational experiments in which Graphlet Degree Distribution Agreement (GDDA) measure was tested in the context of the proposed graphlets, are presented. The last section contains conclusions.
Methods
Graphs
Let V be a nonempty set and \(E \subseteq \{\{v_i,v_j\} : v_i \in V \wedge v_i \in V\}\). The pair (V, E) is then called a undirected graph where V is the set of vertices, or nodes, while E is its set of edges. Let V be a nonempty set and \(A \subseteq V \times V\). The pair (V, A) is then called a directed graph or digraph, where V is the set of vertices, or nodes, while A is its set of arcs. Graph \(G = (V,E)\) is called a bipartite graph, if \(V = V_1 \cup V_2, V_1 \cap V_2 = \emptyset\) and for every edge \(\{x,y\} \in E\), there is \(x\in V_1 \wedge y \in V_2\)^{13}.
Petri nets
As has been mentioned in the previous section, Petri nets are one of mathematical formalisms which can be used to model and analyze biological systems. They have a structure of a biparted, directed graph with weighted arcs. Formally Petri net can be defined as a 5tuple \(Q = (T,P,F,W,M_0)\), where:

\(P=\{p_1,p_2,...,p_n\}\) and \(T=\{t_1,t_2,...,t_r\}\) are finite, disjoint sets of places and transitions, respectively;

\(F \subseteq (P \times T) \cup (T \times P)\) is a finite set of arcs;

\(W : F \rightarrow {\mathbb {Z}}^{+}\) is a weight function;

\(M_0 : P \rightarrow {\mathbb {N}}\) is a function assigning number of tokens to each place —called net state function^{4}.
They are composed of vertices of two kinds, i.e., places and transitions, which can be connected by arcs. Places correspond to passive components of a modelled system, while transitions are counterparts of its active components (i.e., some elementary processes). Arcs describe casual relationships between components of these two types, can join only vertices of different types. Places, transitions and arcs constitute a structure of a Petri net, which is a directed bipartite graph. In fact, this graph is weighted, because arcs are labeled by positive integers called weights. Tokens reside in places and flow through a net via transitions. This flow is governed by the transition firing rule. According to it a transition is active if in every of its preplaces, i.e., these ones which directly precede it, a number of tokens is equal to at least a weight of an arc connecting the place with the transition. An active transition can be fired, what means that tokens flow from its preplaces to its postplace, i.e., those ones which directly succeed the transition. The numbers of flowing tokens are equal to the weights of the respective arcs. The flow of tokens correspond to a flow of substances, signals, information, etc. in the modelled system and it brings a kind of dynamics to Petri nets, which is a great advantage of these nets in comparison to graphs in the context of systems modelling. Unfortunately because of computational limitations those flows are not always available to calculate in large networks.
Petri nets have an intuitive graphical representation, where places are depicted as circles, transitions as rectangles or bars, arcs as arrows, and tokens as dots or numbers inside places^{4,5,6}—example on Fig. 1.
The possible flows of tokens are determined by the structure of a Petri net. So, a basis of Petri nets comparison is a comparison of their structures, i.e., the weighted directed bipartite graphs.
Graphlets
Few of methods based on graphlets, has been used for comparison of graph models of biological phenomena, however, it has not been used for models expressed in the language of Petri nets theory. Undirected graphlets (for short graphlets) were proposed in Ref.^{12} in the context of analysis of PPI networks. The general idea of such an analysis was based on a comparison of frequencies of appearance of some subgraphs in an analyzed PPI network with corresponding frequencies in random graphs (some of related research focus on finding induced subgraphs using graphlets, which is not the topic of this paper). The subgraphs whose frequencies were compared in this approach are connected nonisomorphic graphs on n vertices, where n is usually in the range [2, 5]. These graphs are called graphlets^{12}. For \(n\in [2,5]\) there are 30 graphlets (see Fig. 2). The basic measure, called relative graphlet frequency distance, used for comparison of two graphs G and H based on graphlet frequencies is defined in the following way^{12}:
where \(F_i(G)=\log \frac{N_i(G)}{T(G)}\), \(N_i(G)\) is the number of graphlets of ith type (cf. Fig. 2) in graph G and \(T(G)=\sum _{i=0}^{{l}}N_i(G)\) is the total number of graphlets in graph G and l is the maximal index of a graphlet. the maximal index of an orbit
As this measure is not precise enough for a significant number of comparison cases (it is easy to construct networks with exactly the same degree distribution whose structure and function differ substantially^{14}), another one based on the concept of orbit, was proposed^{14}. The idea of orbit follows from the notion of degree distribution. A degree of vertex v is the number of edges incident to this vertex. But an edge is the smallest graphlet, i.e., the only graphlet containing two vertices. This simple observation leads to an extension of the notion of degree distribution since it may be considered a number of graphlets built on greater number of vertices which are incident with vertex v. However, it is important to distinguish various cases of such an incidency, i.e., which vertex of a given graphlet is incident with vertex v. The idea can be easily illustrated on the example of graphlet \(G_1\) (see Fig. 3). From topological point of view there is no difference whether vertex \(u_1\) is adjacent to vertex v or vertex \(u_3\) is adjacent to v, but these two cases differ from the case, where vertex \(u_2\) is adjacent to vertex v. So, a set of vertices of a given graphlet can be divided into subsets containing vertices which are equivalent in the already described sense. These subsets are orbits. More formally, an orbit can be defined using the notion of an automorphism—a special case of an isomorphism. Given two graphs \(H_1=(V_1,E_1)\) and \(H_2=(V_2,E_2)\) an isomorphism is function \(f:V_1\rightarrow V_2\), which is a bijection such that for every two vertices \(v_1,v_2\in V_1\) edge \(\{v_1,v_2\}\in E_1\) if and only if edge \(\{f(v_1),f(v_2)\}\in E_2\). An isomorphism of a graph to itself is an automorphism. A set of all automorphisms of given graph \(G=(V,E)\) (with an operation of superposition) forms a group called an automorphism group of graph G, which is usually denoted by Aut(G). For vertex \(v\in V\) an automorphism orbit or simply orbit of this vertex is set \(Orb(v)=\{u\in V: u=f(v), f\in Aut(G)\}\)^{14}.
Vertices \(u_1\) and \(u_3\) in graph \(G_1\) belong to one orbit, while vertex \(u_2\) belongs to another orbit (and these are the only orbits of this graph).
So, now we can measure how many vertices in a given graph are incident with graphlets \(G_i\), \(i=0,1,\ldots ,l\), where l is the greatest index of a graphlet in the considered set of graphlets. We should distinguish the cases of incidency with various orbits in graphlet \(G_i\). In other words, we can measure how many vertices are incident with orbits \(O_j\), \(j=0,1,\ldots ,m\), where m is the greatest index of an orbit. For a given automorphism orbit j, \(d^j_G(k)\) is the sample distribution of the number of nodes in G touching the appropriate graphlet ktimes^{14}. However, in this way we obtain a large collection of numbers (degree distributions) characterizing the analyzed graph. They can be arranged in the following matrix^{15}:
It should be noticed that in practice, where finite graphs are considered, the matrix is finite, since it is upper bounded by the number of vertices of the analyzed graph (α is the maximal possible number of orbit occurrences in one vertex).
It would be better to have one number, instead of such a matrix, which could be used in comparisons of graphs. A measure of this type has been proposed and called a GDD agreement (GDDA)^{14}.
First, \(d_G^j(k)\) is scaled:
and then a normalized jth distribution is calculated:
Next, for graphs G and H a jth distance based on their normalized distributions is defined:
A value of this distance is in the range [0, 1]. When the value is equal to 0, it means that the two graphs have identical distributions, and the greater the value is, the greater differences between the graphs are.
It is also convenient to consider a jth agreement between the two graphs:
And on this basis an agreement between graphs G and H can be defined in the following way:
The idea of graphlets can be extended to the case of directed graphs. They are called digraphlets. It is worth considering since Petri nets have a structure of a directed bipartite graph. (When graphlets up to size 5 are used, then \(l=29\) and \(m=72\).)
There are 39 directed graphlets for \(n\in \{3,4\}\) and one directed graphlet on two nodes (see Fig. 5). In these graphlets there are 129 orbits^{16}.
In this case the directed GDDA (DGDDA) can be defined analogously to the undirected case^{16}:
where d is the maximal index of an orbit in a directed graphlet and agreements \(A_d^j(G,H), j=0,1,\ldots ,128\) are calculated for such orbits.
Results and discussion
Graphlets for Petri nets
A problem of finding all graphlets in compared models might be the largest obstacle for application of graphlet based methods, especially in large networks. To resolve this problem a numerous heuristics and exact algorithms has been proposed. This paper focuses on determining if graphlet based measures are useful for comparison of Petri netbased models of biological systems. Since we concentrate on properties of graphlets and not on algorithms, a simple greedy method is used for this research.
Petri nets were proposed for modelling concurrent, parallel processes and initially used as representation for automata theory problems^{4}. For many years the main area of their applications was computer science or more generally, technical systems, but recently they are more and more often used for modelling and analysis of complex biological systems^{2,17}.
Petri netbased models, that can be found in practical applications (e.g., industrial), usually do not have size comparable to average PPI nets (they are usually smaller). However, in a biological context models based on them can have hundreds of nodes or more.
Despite their popularity, there are only two methods for comparison of Petri netbased models of biological systems^{10,11} (both use matching of invariants or structures based on invariants, and because of that are potentially vulnerable to the state explosion problem^{18}). The reason of this situation can be the fact that it is not an easy task to adapt exact methods of graph comparison to Petri nets. Comparison algorithms for digraphs have higher complexity than their counterparts for undirected graphs. On the other hand, for bipartite graphs there are efficient dedicated methods. For Petri nets both aspects need to be taken under consideration. Because of that, the possibility of exploiting a graphletbased approach is interesting.
It is not difficult to realize that since Petri nets have a structure of a directed bipartite graph, directed graphlets (which will be called digraphlets) are natural candidates for a good starting point for a construction of a Petri nets comparison method^{15,16}.
In 2017 two extensions to directed graphlets were proposed^{15,16} and have been used as a base for graphlets dedicated to Petri nets. Additional modifications like distinction between vertex types (Fig. 7) were needed. Not every digraphlet can occur in a Petri net because of its bipartite structure, i.e., digraphlets with cycles of odd length do not fit this structure and have not participated in further modifications (Figs. 4, 5). It should be mentioned that graphlets with parallel arcs were not considered in this paper.
In result digraphlets \(G^D_0\)–\(G^D_3\), \(G^D_6\)–\(G^D_{17}\) and undirected graphlets \(G_9\), \(G_{10}\), \(G_{11}\), \(G_{16}\), \(G_{20}\) have acceptable structure (see Figs. 4, 5).
Since in Petri nets there are two types of vertices (i.e., places and transitions) each digraphlet suitable for Petri nets comparison can exists in two variants that have the same structure but types of vertices change to opposite (i.e., a place in one variant will be changed to a transition in second one and vice versa), what is shown in Fig. 7. For example, the simplest digraphlet \(G_0^D\) can have two variants, i.e., the one, where the transition has an incoming arc, and the one, where the transition has an outgoing arc. An exception are digraphlets based on graphlet \(G_{{16}}\) which has a form of directed cycle. Graphlets adapted to Petri nets environment will be called pngraphlets. They are shown on Fig. 6.
Figure 7 and Table 1 shows for pngraphlets of sizes in the range [2, 5] the number of all possible structures of a given size. Columns denoted by \(\Sigma\) show the number of pngraphlets or orbits up to the specified size.
GDDA for Petri nets (PNGDDA) is based on the proposed pngraphlets:
where p is the maximal index of an orbit in a pngraphlet and agreements \(A_{pn}^j(G,H)\) are calculated for orbits in such pngraphlets. Since for pngraphlets build on 2, 3, 4 or 5 vertices there is 592 orbits, \(p=591\) if such pngraphlets are considered.
Figure 8 contains an example of a simple Petri net. It is a 7vertex connected net with one cycle, one source transition (\(t_0\)) and one sink transition (\(t_1\)) (vertices that have only incoming or outgoing arc, respectively). For this network Table 2 contains distribution of orbits from the first 6 pngraphlets (\(G^{PN}_0{}G^{PN}_5\)—see Fig. 9). Columns in this table represents orbits while rows correspond to vertices. A value in a given cell is a number of corresponding orbit occurrences in a given vertex. Orbits in pngraphlets are strictly connected to specific type of a vertex (c.f. Fig. 9). For example, orbit 0 can occur only in transitions and because of that its value for occurrence in places will always be 0 (situation highlighted by gray color in Table 2).
Computational experiments
The earlier described GDDA is a frequently used metric for graphletbased comparison of networks^{14}. It represents an interesting approach to graph comparison, which is efficient in comparing large networks, where more standard measures based on isomorphism (e.g., graph distance, maximal common subgraph, graph edit distance) are not. Complexity of graph isomorphism is still open and comparison methods based on isomorphism are limited to relatively small graphs (e.g., finding maximal common subgraph is an NPcomplete problem). Algorithms for finding all (undirected and directed) graphlets have time complexity of \(O (nd^{k1})\)^{16}, while finding all orbits has complexity \(O (ed^{k3})\)^{19} (where d is a maximum vertex degree, n number of vertices, e number of edges), and represent an alternative that can be used in practical applications.
Pngraphlets are modifications of the basic idea of graphlets that might change some known characteristics of the original graphlets. This possibility pressed us to perform tests that could allow to evaluate performance of comparison methods based on the proposed pngraphlets and check an impact of this modifications on GDDA value in comparison of Petri netbased models of biological systems.
Such models share a number of common structural characteristics. Defining them, allow to create random networks that fulfils these characteristics, hence could be used for testing GDDA in a biological context. Using such networks gives possibility to test the method on a large number of cases and check its reactions to specific characteristics, i.e., a high ratio of places to transitions numbers.
To simulate biological models random networks should follow a list of restrictions:

1.
Each of the nets contains both source and sink transitions or non of them. Transitions of those types represent an interface of the modelled system with its environment

2.
A ratio of the number of arcs to the number of vertices is equal to 1.3 (an average ratio). This value is based on Petri netbased models of biological systems available in the literature.

3.
They do not contain read nor inhibitor arcs (a read arc correspond to a pair of bidirected arcs, while an inhibitor arc can be used to prevent transitions from fire in specific net states). While very useful in representation of biological processes, both types of arcs are not compatible with incidence matrix representation, which is a base for many analytical methods. Because of that they are often avoided in biological models.

4.
Arc weight can be greater than 1. Weight greater than one can represent approximate quantitative relationships in the modelled biological system.
In models of biological systems, the number of transitions is usually slightly grater than the number of places. However, in order to check pngraphlet behaviour also in less typical nets, in the tests randomly generated Petri nets with greater differences between the numbers of places and transitions were also used. Because small nets are more vulnerable to differences, nets with size from 20 to 100 vertices, incremented by 5, were used in the experiment. For each pair (n, m), where \(n,m \in [10,50]\) are the numbers of places and transitions, respectively, 100 Petri nets were generated.
PNGDDA and pngraphlets size
The number of Petri net graphlets is smaller than the number of their classical directed counterparts, which allow for usage of 5node pngraphets in all of the performed tests. One of the interesting questions is how a comparison precision changes with the number of pngraphlets used? To answer this question, a value of PNGDDA was calculated for Petri net graphlets of sizes 2, 3, 4 and 5.
On the three diagrams presented in Fig. 10 there can be observed a reduction of an average similarity value depending on the number of pngraphlets used to calculate PNGDDA. Regardless of the size of used pngraphlets it has been observed that an average similarity rises with the size of the tested networks. As a result of this characteristic, the same value of PNGDDA metric for net with size 20 and net with size 100 will represent different scale of similarities. This information needs to be considered in interpretation of the comparison results.
For better representation of local changes of PNGDDA value in response to an increase of a pngraphlet size two heatmaps are presented in Fig. 11. For increase of the size of pngraphlets from 3 to 4 nodes an average reduction of GDDA value of 0.081 is observed (from 0.012 to 0.147). The highest decrease of PNGDDA value was observed for nets with higher number of transitions than places. An increase of pngraphlets size from 4node pngraphlets to 5node ones results in lower decrease of GDDA value (range from 0.022 to 0.095 with average 0.058). The highest changes are visible for nets with size greater than 50 nodes.
The acquired results confirm that a usage of the largest possible set of pngraphlets will increase a difference detection levels, which is what could be expected. At the same time it has been observed, that the scale of precision improvement is related to structural characteristic of compared models (e.g., a ratio between the numbers of transitions and places).
PNGDDA and ADT based differences
An interesting question is how PNGDDA metric reacts to differences, that represent a biologically important substructure? According to Ref.^{21} the ADT (Abstract Dependent Transitions) set is the smallest meaningful element that can be biologically interpreted. Such structure is build around transitions from that set. They correspond to structures based on Maximal Common Transitions sets (MCT sets), which are often used in an analysis of Petri netbased models of biological systems. They allow for a decomposition of a model into a set of subprocesses. A connected variant of ADT is called conADT and they can be members of various graph classes like trees or cycles. We are interested in changes of GDDA in reaction to differences between two compared nets, where difference has a form of ADTbased subgraphs.
In Table 3 a distribution of pngraphlets in the tested conADT subnets has been presented. It shows that a structural characteristic of subnets can be detected from it, e.g., a high similarity between subnets B,C and smaller with A. In Table 4 the results of a comparison using PNGDDA for the mentioned previously subnets have been shown. The results also confirm the observations made on the basis of Table 3. While changes in PNGDDA value are relatively small, it can be observed that a difference in size between the compared pairs can have a significant impact, e.g., PNGDDA between subnets A–B and A–C is smaller that between AF. This is caused by a higher number of nodes in subnets B and C, which implies a larger number of pngraphlets, different numbers of orbits associated with additional nodes and because of that a lower value of PNGDDA. This should be especially visible on small subnets, where each additional node with an arc creates a number of additional pngraphlets.
To resolve this case, a next computational experiment was performed. In this experiment an impact of conADTbased differences on PNGDDA was examined. For each random Petri net its duplicate extended by a specific ADTbased structure was generated. The conADTbased structures used in this experiment are shown in Fig. 13. Structures \(A{}E\) were connected by a single arc with the rest of the generated net. For case F we decided to connected the structure by 2 arcs and check its impact on PNGDDA.
In Fig. 12 the results for each \(A{}F\) extension variant was presented in a separate diagram that represents the behavior of PNGDDA value when the net size increases and the ratio of the numbers of places and transitions changes (the right bottom part of Fig. 12). To better present differences between each two variants a set of heatmaps was added (the left top part of Fig. 12).
An impact of structures \(A{}D\) on the PNGDDA value is very similar, despite that they correspond to different graph structures (D is a cycle, \(A{}C\) are trees). It is a result of very similar sets of pngraphlets that exists in those subnets. The largest level of differences through all test cases was observed between structure F and other extensions. It is caused by multiple connections to the original net. Each of those connections is responsible for new pngraphlets that did not occur initially in the original net nor in its extension (Fig. 13).
To show a significance of multiple connection, a new test was performed. Each of the original test nets was extended multiple times by the same structure A—see Fig. 14. An impact on PNGDDA of three A subnets connected by a single arc with the rest of the net was lower than for single structure F connected by two arcs. To properly describe that impact on PNGDDA let us focus on the smallest case with 10 transitions and 10 places. Even when it was extended by 3 structures (resulting in 47node net), its PNGDDA value with relation to the base net (20node net) was higher than the similarity between the base net and the one extended by structure F (25node net).
The performed test showed that PNGDDA is a measure which properly describes similarities between Petri netbased models of biological systems. It should be noted that such models have some characteristics (e.g., the ratio of the numbers of transitions and places) which models of other types (i.e., models of nonbiological phenomena) may not have. The test also indicated that on the basis of PNGDDA it is not possible to distinguish what is the structure of a difference between the compared Petri nets (however, it is also not possible in the case of the classical GDDA applied to PPI networks comparison).
All diagrams with the results from experiments, were generated using a Python data visualization library Seaborn (v0.11.2)^{20}.
Conclusions
The popularity of Petri nets creates a pressure for new and better analysis methods that could satisfy demands from newly explored areas of science. Unfortunately, Petri nets suffer from shortage of dedicated comparison algorithm and adaptation of graphlets can change this situation.
The process of adaptation to new environments and improvement of graphlets has been already started^{15} and in this paper we continue it by proposing a new variant of pngraphlets dedicated for Petri nets. Moreover, we performed extensive tests to evaluate PNGDDA metric, which resulted in confirmation of its potential and allowed to determine its limitations when used to compare Petri netbased models of biological systems. Additionally, the work on graphlets and its adaptation resulted in a new extension of our application Holmes, which is a platform for modelling, simulating an analysis of Petri nets^{22}.
Data availibility
The datasets generated and/or analysed during the current study are not publicly available due to their large size, but are available from the corresponding author on reasonable request.
References
Klipp, E. et al. Systems Biology. A Textbook (Wiley, 2009).
Koch, I., Reisig, W. & Schreiber, F. Modeling in Systems Biology. The Petri Net Approach Vol. 16 (Springer, London, 2010).
Materi, W. & Wishart, D. S. Computational systems biology in drug discovery and development: Methods and applications. Drug Discov. Today 12, 295–303 (2007).
Murata, T. Petri nets: Properties, analysis and applications. Proc. IEEE 77, 541–580 (1989).
David, R. & Alla, H. Discrete, Continuous, and Hybrid Petri Nets (Springer, 2010).
Reisig, W. Understanding Petri Nets: Modeling Techniques, Analysis Methods, Case Studies (Springer, 2013).
Zelinka, B. On a certain distance between isomorphism classes of graphs. Casopis pro pestovani matematiky 100, 371–373 (1975).
Bunke, H. On a relation between graph edit distance and maximum common subgraph. Pattern Recogn. Lett. 18, 689–694 (1997).
McGregor, J. J. Backtrack search algorithms and the maximal common subgraph problem. Softw. Pract. Exp. 12, 23–34 (1982).
Baldan, P., Cocco, N. & Simeoni, M. Comparison of metabolic pathways by considering potential fluxes. Biological Processes & Petri Nets (2012).
Szawulak, B. & Formanowicz, P. Net decomposition as a base for Petri net comparison (in Polish). In Automatyzacja Procesów Dyskretnych Vol. 2 (eds Świerniak, A. & Krystek, J.) 215–222 (Wydawnictwo Politechniki Ślaskiej, 2018).
Pržulj, N., Corneil, D. G. & Jurisica, I. Modeling interactome: Scalefree or geometric? Bioinformatics 20, 3508–3515 (2004).
Grimaldi, R. P. Discrete and Combinatorial Mathematics; An Applied Introduction (AddisonWesley Longman Publishing Co., Inc., 1985).
Pržulj, N. Biological network comparison using graphlet degree distribution. Bioinformatics 23, e177–e183 (2007).
Aparicio, D., Ribeiro, P. & Silva, F. Extending the applicability of graphlets to directed networks. IEEE/ACM Trans. Comput. Biol. Bioinf. 14, 1302–1315 (2017).
Sarajlić, A., MalodDognin, N., Yaveroğlu, Ö. N. & Pržulj, N. Graphletbased characterization of directed networks. Sci. Rep. 6, 1–14 (2016).
Baldan, P., Cocco, N., Marin, A. & Simeoni, M. Petri nets for modelling metabolic pathways: A survey. Nat. Comput. 9, 955–989 (2010).
Koch, I. & Ackermann, J. On functional module detection in metabolic networks. Metabolites 3, 673–700 (2013).
Hočevar, T. & Demšar, J. Computation of graphlet orbits for nodes and edges in sparse graphs. J. Stat. Softw. 71, 1–24 (2016).
Waskom, M. L. Seaborn: Statistical data visualization. J. Open Source Softw. 6, 3021 (2021).
Heiner, M. Understanding network behavior by structured representations of transition invariants. In Algorith. Bioprocess. (eds Condon, A. et al.) 367–389 (Springer, London, 2009).
Radom, M. et al. Holmes: A graphical tool for development, simulation and analysis of petri net based models of complex biological systems. Bioinformatics 33, 3822–3823 (2017).
Acknowledgements
This work has been partially supported by statutory funds of Poznan University of Technology.
Author information
Authors and Affiliations
Contributions
Conceptualization: B.S.; Methodology: B.S.; Validation: B.S., P.F.; Investigation: B.S., P.F.; Visualization: B.S.; Supervision: P.F.; Writing—original draft preparation: B.S., P.F.; Writing—review and editing: B.S., P.F.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Szawulak, B., Formanowicz, P. Graphlets in comparison of Petri netbased models of biological systems. Sci Rep 12, 20942 (2022). https://doi.org/10.1038/s41598022245355
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598022245355
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.