Rapid-SL identifies synthetic lethal sets with an arbitrary cardinality

Dehghan Manshadi, Mehdi; Setoodeh, Payam; Zare, Habil

doi:10.1038/s41598-022-18177-w

Article
Open access
Published: 18 August 2022

Mehdi Dehghan Manshadi, Payam Setoodeh & Habil Zare

Scientific Reports volume 12, Article number: 14022 (2022)

Abstract

The multidrug resistance of numerous pathogenic microorganisms is a serious challenge that raises global healthcare concerns. Multi-target medications and combinatorial therapeutics are much more effective than single-target drugs due to their synergistic impact on the systematic activities of microorganisms. Designing efficient combinatorial therapeutics can benefit from identification of synthetic lethals (SLs). An SL is a set of non-essential targets (i.e., reactions or genes) that prevent the proliferation of a microorganism when they are “knocked out” simultaneously. To facilitate the identification of SLs, we introduce Rapid-SL, a new multimodal implementation of the Fast-SL method, using the depth-first search algorithm. The advantages of Rapid-SL over Fast-SL include: (a) the enumeration of all SLs that have an arbitrary cardinality, (b) a shorter runtime due to search space reduction, (c) embarrassingly parallel computations, and (d) the targeted identification of SLs. Targeted identification is important because the enumeration of higher order SLs demands the examination of too many reaction sets. Accordingly, we present specific applications of Rapid-SL for the efficient targeted identification of SLs. In particular, we found up to 67% of all quadruple SLs by investigating about 1% of the search space. Furthermore, 307 sextuples, 476 septuples, and over 9000 octuples are found for Escherichia coli genome-scale model, iAF1260.

Introduction

A number of human pathogenic microorganisms show multidrug resistance, which is a serious challenge in the era of global healthcare^1,2. Most of these species benefit from several pathogenicity factors (e.g., the production of antigens) and broad drug-resistance mechanisms (e.g., antibiotic target mutations). Hence, disrupting the activity of only a single gene in these microorganisms is not guaranteed to prevent their growth or the biosynthesis of virulence factors. Furthermore, targeting the essential reactions or genes in some pathogens may cause a significant increase in biofilm-associated reactions. This implies that single essential genes may not be proper targets for these types of microorganisms³. In contrast, multi-target medications and combinatorial therapeutics act synergistically on the microorganisms’ system-level activities; thus, they have been recommended as more effective and less prone to drug resistance than single-target drugs⁴.

Computational systems biology proposes powerful methodologies to address biomedical queries (e.g., human disease metabolism, the identification of potential drug targets) via a multidisciplinary systems-level study that considers multifaceted interactions between many elements in biological networks⁵. Constraint-based models (CBMs) are very influential in this regard. These models are successfully employed as operative mathematical representations of genome-scale metabolic models (GEMMs) by imposing the governing context- and condition-specific constraints on genome-scale metabolic network reconstructions (GENREs). CBMs can comprehensively analyze metabolic activities and examine the physiological properties of biological systems. Deploying CBMs, systematic analyses can be performed by applying the potent class of computational techniques that are available in constraint-based reconstruction and analysis (COBRA) toolbox^6,7,8.

Because in silico studies save significant time and expense, these methods are widely employed to identify the various effects of reaction and gene knockouts on the flux distribution of the metabolic networks of interest. These knockout studies can be implemented to identify new drug targets from three perspectives⁹: (a) targeting virulence factors³, (b) metabolite-centric targeting^10,11,12, and (c) targeting essential reactions and genes^{13,14,15,16,17}. The last perspective is known as the most common method for identifying potential drug targets, and it is not limited to the deletion of only one reaction or gene. Synthetic lethals (SLs) are pairs of non-essential reactions or genes that are deleterious to an organism when they are disrupted simultaneously¹⁸. Similarly, when the number of targets increases, higher order synthetic lethal sets (n > 2) can be obtained¹⁸.

We should note that although the identification of higher order synthetic lethal sets can reveal new targets for combining different drugs in the design of combinatorial therapeutics, this approach is not common in practice¹⁹. However, this concept, though not necessarily by design, might already have been deployed in many drug combination strategies. One example is the combination of daptomycin, cefoperazone, and doxycycline for the eradication of Borrelia burgdorferi through loss of membrane potential as well as inhibition of energy metabolism, cell wall peptidoglycan synthesis, and protein synthesis²⁰. There are other examples in cancer therapeutics, such as the combination of BRAF and EGFR inhibitors, which effectively influences AKT, MEK, and ERK signaling and has been suggested for colon cancer patients with BRAF mutations¹⁹. In these cases, combinatorial therapeutics were more effective than monotherapies due to their synergistic effects on different functionalities of the cells.

Two approaches are used to computationally identify SLs: exhaustive search and search space reduction. Exhaustive search is straightforward and has been used in some studies^17,21, but applying this approach to identify higher order SLs, especially when the cardinality of SLs is greater than three, is not feasible due to computational time problems. Based on the available computational resources, we estimated that the required computational time for the exhaustive search would be over 180 days to obtain all quadruple SLs for Escherichia coli using iAF1260²² GEMM. Therefore, other methods are required to handle such problems by reducing the search space. Depending on the suggested criteria used to reduce the search space, some of these methods can find only a fraction of the higher order SLs¹⁸, while some other methods aim to find all the SLs^{23,24,25,26,27,28}.
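The growth of the exhaustive search space with cardinality can be made concrete with a short calculation. The sketch below simply counts the candidate reaction sets; the network size `N_REACTIONS` is a hypothetical round number chosen for illustration, not the reaction count of iAF1260, and the function name is ours.

```python
from math import comb


def n_candidate_sets(n_reactions: int, max_card: int) -> int:
    """Count the reaction sets with 1..max_card members that an
    exhaustive search would have to test (one LP per set)."""
    return sum(comb(n_reactions, k) for k in range(1, max_card + 1))


# Hypothetical network size, used only to show how fast the count grows.
N_REACTIONS = 2000

if __name__ == "__main__":
    for k in range(1, 5):
        print(f"sets of size {k}: {comb(N_REACTIONS, k)}")
    print("total up to quadruples:", n_candidate_sets(N_REACTIONS, 4))
```

Each increase in cardinality multiplies the count by roughly the network size divided by the cardinality, which is why methods that shrink the candidate list matter far more than faster LP solves.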

One of these methods, called “SL Finder,” performs an optimization-based search for the exhaustive and targeted identification of SLs¹⁸. To reduce the search space, this method employs flux-coupling analysis²⁹ so that only one reaction from each set of fully coupled reactions is added to the knockout list. This approach was used to discover all double and triple SLs and to conduct a targeted identification of a few quadruple and quintuple SLs for the iAF1260 GENRE of E. coli.

MCSEnumerator instead finds intervention strategies by enumerating the elementary modes of the dual network³⁰ of the corresponding metabolic network²³. It is a powerful approach, especially for metabolic engineering applications. Further improvements were made on this approach to obtain the generalized framework of MCSEnumerator and accelerate the dual calculations^24,25,26. MCSEnumerator was applied to find all double to quintuple SLs in iAF1260²³. However, the computational time increases exponentially for SLs that have higher cardinalities; therefore, the search procedure needs to be stopped after a predefined number of SLs is found or a time limit is reached. As an alternative, in this paper we propose a targeted enumeration algorithm that aims to increase search efficiency.

Fast-SL is a powerful algorithm that drastically reduces the search space by purging the search space of reactions that are guaranteed not to produce SLs²⁷. Fast-SL computes a flux distribution that maximizes the growth rate using a minimum value for the sum of fluxes (${l}_{1}$-norm) in order to identify flux-carrying reactions. In the next step, the algorithm searches only through these flux-carrying reactions, as well as their combinations, to identify SLs within a reduced search space. The authors reported the identification of 127 new synthetic lethal genes in E. coli, which had not been found by SL Finder. Also, Fast-SL outperforms the MCSEnumerator by finding the same SLs about four times faster. Fast-SL provided a valuable idea for finding SLs in a reduced search space, but the implementation of this method has two major drawbacks. First, the authors developed different procedures in order to obtain the SLs with different cardinalities, up to quadruple SLs. Therefore, to obtain SLs with more than four targets in each set, an entirely new procedure for each cardinality must be developed. Consequently, if one follows the implementation footsteps in the original Fast-SL, the procedure becomes extremely complicated and requires labor-intensive work to develop. The second drawback is that Fast-SL lacks an organized search method; therefore, several duplicated cases are studied in the original Fast-SL. This causes serious problems when searching for SLs with high cardinalities.

Logic transformation of model (LTM) is another method used in this field. This method changes the stoichiometry matrix (i.e., the S matrix) by adding pseudo-metabolites and reactions to consider the gene-protein-reaction associations (GPRs)²⁸. However, the LTM method increases the size of the S matrix, which in turn enlarges the problem size. Thus, more linear programming problems (LPs) must be solved to find SLs. Hence, this method becomes extremely time consuming to perform knockouts regarding higher order SLs.

As mentioned earlier, drug resistance is an important concern, and the identification of new drug targets based on the concept of synthetic lethality can be a suitable solution for this issue. However, comparing the effects of different synthetic lethal sets on the metabolic network and its functionalities reveals that some of the sets with higher cardinalities can have stronger and deeper impacts on the network. For instance, we can categorize the synthetic lethal sets into two types: (a) SLs that yield auxotrophic strains and (b) SLs that yield strains lacking essential functionalities. The first type of SLs yields strains that are able to restore their growth if the missing nutrients are supplied. In contrast, the strains yielded by the second type cannot restore their growth even if extra components are provided in the growth medium. We expect that SLs of the second type function more effectively and enable us to aim at targets that are harder for pathogens to resist. Based on our in silico observations, higher order SLs provide more of these more effective SLs.

The purpose of the current work is to develop a comprehensive and straightforward reimplementation of the Fast-SL algorithm to facilitate the identification of higher order SLs. We call our implementation Rapid-SL, which has two major steps that are iteratively performed based on the depth-first search (DFS) algorithm³¹: (1) identification of the seed space (i.e., reactions with nonzero fluxes) and (2) searching within the seed space to find the solutions. The main difference between this new implementation and the original Fast-SL is the compartmentalization of the searching process into several branches. This branching allows embarrassing parallelization³² and prevents the examination of duplicate cases. This reduces the search space by about 35–60% compared to Fast-SL. However, in the modern drug discovery process, the target identification is typically the beginning step. Therefore, as in the case of Fast-SL, further analysis on the Rapid-SL results, as a biological hypothesis, is required to reach an approved drug.

In order to examine the performance of the developed method, we compared the results of Rapid-SL and Fast-SL for three microorganisms. Afterwards, we introduced three applications for Rapid-SL that could be effective for the targeted identification of higher order SLs, particularly when the cardinality of SLs is greater than four targets. Accordingly, we can: (1) search among a specific list of reactions chosen consistent with a biological context, (2) apply graph-based search methods, and (3) selectively enumerate the SLs among the DFS branches. Based on our in silico experiments in the current work, over 9000 octuple (n = 8) SL reactions were reported for E. coli using iAF1260 GEMM. We hope that the identification of higher order synthetic lethal sets using efficient tools such as Rapid-SL paves the way for systematic designing of effective combinatorial therapeutics in future studies.

Materials and methods

To make the identification of the higher order SLs feasible, we must reduce the search space. The knockout of a reaction set that includes only non-flux-carrying reactions does not change the flux through the biomass formation reaction²⁷. Therefore, to reduce the search space, we first identify and focus on the set of flux-carrying reactions, which we denote as the seed space (J_nz) in this work. In the second step, we search for the SLs within the seed space. Moreover, each non-lethal subset of the seed space defines a new proliferating mutant strain. Using a DFS approach, we repeat the first and second steps for each of the new mutant strains (Fig. 1). This iterative process continues until certain stopping conditions are met. Each step of the process, as well as the stopping conditions, is described in the following.

First step: identification of the seed space

We denote the flux of reaction j by ν_j. In the first step, two flux-balance-analysis (FBA)-related LPs³³ are considered and solved. These LPs lead to the identification of a flux distribution that maximizes the flux of the biomass objective function (ν_bio), while the ${l}_{1}$-norm of the fluxes is set to its minimum value. The first LP is defined as

$$\begin{aligned} & \min \sum_{j} \left| \nu_{j} \right| \\ & \text{s.t.} \\ & \nu_{bio} = \nu_{bio,WT} \\ & \text{S}\,\nu = 0 \\ & \nu_{\text{lb}} \le \nu \le \nu_{\text{ub}} \end{aligned} \tag{1}$$

where ν_bio,WT is the growth of the wild-type strain calculated by solving the following LP problem:

$$\begin{aligned} & \max \; \nu_{bio} \\ & \text{s.t.} \\ & \text{S}\,\nu = 0 \\ & \nu_{\text{lb}} \le \nu \le \nu_{\text{ub}} \end{aligned} \tag{2}$$

The goal of computing this flux distribution is to characterize the flux-carrying reactions, or the seed space.
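The absolute values in the objective of LP (1) are not linear as written. One standard way to pose the problem as an LP, given here as a sketch rather than as the authors' exact formulation, is to introduce an auxiliary variable $t_j$ (a symbol assumed for this illustration) that bounds each $\left| \nu_j \right|$ from above:

$$\begin{aligned} & \min \sum_{j} t_{j} \\ & \text{s.t.} \\ & -t_{j} \le \nu_{j} \le t_{j} \quad \forall j \\ & \nu_{bio} = \nu_{bio,WT} \\ & \text{S}\,\nu = 0 \\ & \nu_{\text{lb}} \le \nu \le \nu_{\text{ub}} \end{aligned}$$

Because each $t_j$ is minimized, $t_j = \left| \nu_j \right|$ holds at the optimum, so the two problems have the same optimal fluxes. With an optimal flux vector $\nu^{*}$, the seed space can then be read off as $\text{J}_{\text{nz}} = \{ j : \left| \nu_j^{*} \right| > \varepsilon \}$, where $\varepsilon$ is a small numerical tolerance; its value is an assumption here, since the text does not specify one.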

Applying flux-variability analysis (FVA)³⁴ instead of minimizing the ${l}_{1}$-norm of the fluxes would provide us with more information about the effect of each reaction on the biomass objective function. However, FVA is a time-consuming process, and using it repeatedly would make the whole procedure prohibitively slow.

Second step: searching within the seed space

All combinations of the reactions in the seed space have the potential to form SLs; therefore, the exhaustive search is performed in the second step. However, when an SL is found in this step, the corresponding supersets are excluded to prevent the investigation of duplicated cases or the production of trivial answers. Furthermore, each non-lethal set identified in this step defines a proliferating mutant (i.e., a new virtual strain). This second step also includes the listing of all non-lethal sets to investigate their related proliferating mutants by removing more potential reactions in the next level of the search. Figure 2 depicts these explanations using a toy model. This step is performed in a parallel loop in the first level for the wild-type strain to decrease the wall-clock time.
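The second step can be sketched as a level-by-level enumeration that skips every superset of a lethal set already found. The code below is a minimal illustration under stated assumptions: `is_lethal` stands in for the FBA test that a real implementation would run, and the toy oracle and reaction names are hypothetical.

```python
from itertools import combinations
from typing import Callable, FrozenSet, Iterable, List, Tuple


def search_seed_space(
    seed: Iterable[str],
    is_lethal: Callable[[FrozenSet[str]], bool],
    max_card: int,
) -> Tuple[List[FrozenSet[str]], List[FrozenSet[str]]]:
    """Test subsets of the seed space by increasing size.

    Returns (lethal sets, non-lethal sets). Supersets of a known lethal
    set are skipped, so only minimal lethal sets are reported.
    """
    lethal: List[FrozenSet[str]] = []
    non_lethal: List[FrozenSet[str]] = []
    for k in range(1, max_card + 1):
        for combo in combinations(sorted(seed), k):
            candidate = frozenset(combo)
            if any(known <= candidate for known in lethal):
                continue  # trivial answer: contains a smaller lethal set
            if is_lethal(candidate):
                lethal.append(candidate)
            else:
                non_lethal.append(candidate)  # defines a new virtual strain
    return lethal, non_lethal


def toy_oracle(knocked: FrozenSet[str]) -> bool:
    """Hypothetical lethality rule replacing the LP-based growth test."""
    return {"A", "B"} <= knocked or {"B", "C", "D"} <= knocked


if __name__ == "__main__":
    sls, mutants = search_seed_space({"A", "B", "C", "D"}, toy_oracle, 3)
    print("lethal sets:", [sorted(s) for s in sls])
    print("non-lethal sets to extend:", len(mutants))
```

The non-lethal list is what feeds the next level of the search: each entry is a mutant whose own seed space is computed in the first step.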

Backtracking and the stopping conditions

As described by Pratapa et al.²⁷, removing a set of reactions that includes only non-flux-carrying reactions has no effect on the flux of the biomass formation reaction; therefore, at least one reaction in the seed space of the wild-type strain (J_nz) should participate in each SL. Here, we generalize this statement from the wild-type strain to any virtual strain obtained during our search procedure. In other words, each reaction designated for removal in subsequent steps of the DFS algorithm should originate from the seed space of the parent virtual strain. Therefore, after we evaluate the first and second steps for the wild-type strain, we repeat these two steps for all the virtual strains identified in the second step of the previous level. Each of these mutants is treated the same as the wild-type strain; therefore, we face an iterative problem, which is handled using the DFS algorithm (the associated pseudocode is available in Supplementary Note B). Note that other organized search algorithms, such as breadth-first search³⁵ and best-first search³⁶, can easily be used instead of DFS in our implementation.

The search proceeds from the root node, which consists of a non-lethal set. As an example, consider a general non-lethal set Δ_m with m members, which is derived from the evaluation of the second step for the wild-type strain. Let ${\text{J}}_{\text{nz}}^{\Delta{\text{m}}}$ be the seed space of the mutant strain that results from the removal of Δ_m. This set is obtained by passing the corresponding mutant to the first step. Because all the reactions in J_nz and their combinations are studied in the other branches, only the flux-carrying reactions of this mutant that do not belong to J_nz are considered at this level. If there are any such reactions (i.e., ${\text{J}}_{{{\text{nz}}}}^{{\Delta {\text{m}}}} - {\text{J}}_{{{\text{nz}}}} \ne \emptyset$), the second step is triggered for all the members of ${\text{J}}_{\text{nz}}^{\Delta{\text{m}}} - {\text{J}}_{\text{nz}}$. In Rapid-SL, backtracking occurs, and a branch is not extended further, in three cases:

(a) when a set is found to be lethal.
(b) when no new reaction gains non-zero flux after removing a set (i.e., ${\text{J}}_{{{\text{nz}}}}^{{\Delta {\text{m}}}} - {\text{J}}_{{{\text{nz}}}} = \emptyset$).
(c) when the size of the examined set reaches the maximum desired cardinality.

Pratapa et al.²⁷ state that Fast-SL is not an embarrassingly parallel algorithm; however, they provide a parallel version of the Fast-SL only for the evaluation of quadruple SLs. This parallel version performs parallel calculations for some specific parts of the Fast-SL algorithm. Unlike Fast-SL, Rapid-SL is an embarrassingly parallel algorithm because it is possible to evaluate the branches of the DFS algorithm using a parallel procedure.

Figure 3 shows an example that illustrates the backtracking process. In this toy example, the maximum cardinality is five (n = 5).
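The interplay between seed-space identification and backtracking can be sketched on a toy network. The code below is a simplified illustration and not the authors' pseudocode from Supplementary Note B: each DFS child knocks out one further reaction from the current mutant's seed space, a visited set prevents duplicate cases, and stopping conditions (a) and (c) are applied. Condition (b) belongs to the level-wise bookkeeping of the full method and does not arise in this simplification. The `ROUTES` model and the function `seed_space`, which replaces the two LPs, are hypothetical.

```python
from typing import FrozenSet, List, Optional, Set

# Hypothetical toy network: the cell grows if at least one route is intact.
ROUTES: List[FrozenSet[str]] = [
    frozenset({"R1", "R2"}),
    frozenset({"R1", "R3"}),
    frozenset({"R4"}),
]


def seed_space(knocked: FrozenSet[str]) -> Optional[FrozenSet[str]]:
    """Stand-in for the two LPs: return the flux-carrying reactions of the
    mutant, or None when the mutant cannot grow."""
    for route in ROUTES:
        if not (route & knocked):
            return route
    return None


def dfs_lethal_sets(max_card: int) -> List[FrozenSet[str]]:
    """Enumerate minimal lethal sets with at most max_card reactions."""
    lethal: Set[FrozenSet[str]] = set()
    visited: Set[FrozenSet[str]] = set()

    def visit(knocked: FrozenSet[str]) -> None:
        if knocked in visited:
            return  # identical case already examined in another branch
        visited.add(knocked)
        seed = seed_space(knocked)
        if seed is None:  # case (a): the set is lethal, backtrack
            lethal.add(knocked)
            return
        if len(knocked) == max_card:  # case (c): cardinality limit reached
            return
        for rxn in sorted(seed):
            visit(knocked | {rxn})

    visit(frozenset())
    # Drop lethal sets that contain a smaller lethal set.
    minimal = [s for s in lethal if not any(t < s for t in lethal)]
    return sorted(minimal, key=lambda s: (len(s), sorted(s)))


if __name__ == "__main__":
    for sl in dfs_lethal_sets(3):
        print(sorted(sl))
```

Restricting each extension to the current seed space is what keeps the tree small: a reaction that carries no flux in the mutant cannot turn a growing strain into a non-growing one, so it is never tried at that node.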

Enumeration of synthetic lethal gene sets

To enumerate the synthetic lethal gene sets, the same procedure is employed, except that those non-zero-flux reactions obtained in each part are converted to the functioning genes using GPR rules (Supplementary Note C). In this work we focused on the enhancement of the identification of SL reactions, which is the main step in the process of finding SL genes. To find SL genes, other improvements can be made by involving and translating GPR rules to make further reduction in the search space prior to the main identification process. Methods such as gMCS³⁷ effectively use this feature for identification of synthetic lethal genes.
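The conversion between reactions and genes through GPR rules can be illustrated with a small sketch. It assumes GPR rules are written as Boolean strings with `and`/`or`, as is common in COBRA-style models; the gene and reaction identifiers below are hypothetical, and the authors' actual procedure is described in Supplementary Note C.

```python
import re
from typing import Dict, Iterable, Set

# Hypothetical GPR rules; identifiers are illustrative only.
GPR: Dict[str, str] = {
    "RXN1": "(g1 and g2) or g3",
    "RXN2": "g4",
    "RXN3": "",  # reaction without gene association
}

# Matches gene identifiers, skipping the Boolean operators.
_GENE = re.compile(r"\b(?!and\b|or\b)\w+\b")


def genes_of(reactions: Iterable[str]) -> Set[str]:
    """Collect the genes associated with a set of flux-carrying reactions."""
    genes: Set[str] = set()
    for rxn in reactions:
        genes.update(_GENE.findall(GPR[rxn]))
    return genes


def reaction_active(rxn: str, knocked_genes: Set[str]) -> bool:
    """Evaluate the GPR rule of a reaction for a given gene knockout set."""
    rule = GPR[rxn]
    if not rule.strip():
        return True  # no gene association: knockouts cannot disable it
    expr = _GENE.sub(lambda m: str(m.group(0) not in knocked_genes), rule)
    return bool(eval(expr, {"__builtins__": {}}, {}))


if __name__ == "__main__":
    print(sorted(genes_of(["RXN1", "RXN2"])))
    print(reaction_active("RXN1", {"g1"}))
    print(reaction_active("RXN1", {"g1", "g3"}))
```

In a gene-level search, the candidate list at each node would be the output of `genes_of` applied to the seed space, and a gene knockout set would be translated into disabled reactions with `reaction_active` before the growth test.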

Results

We present our results in two parts. First, the performances of Fast-SL and Rapid-SL were compared in the identification of SLs for three microorganisms (see the Supplementary Note D for the comparison between Rapid-SL and duality-based methods). Then, we report the results of the three applications of Rapid-SL for the targeted enumeration of the higher order SLs. The overall computation time of the Fast-SL and Rapid-SL is mostly dependent on the time that is spent on solving the LPs. Therefore, to ensure a fair comparison between Fast-SL and Rapid-SL, we reported the number of LP problems that were solved by each approach. Furthermore, a comparison of wall-clock runtime is provided in Supplementary Note E. The results were obtained using a workstation with a 2.2 GHz Intel Xeon E5-2696 v4 processor, which has 12 cores available for computation.

Synthetic-lethals of the three microorganisms

Table 1 compares the number of LPs solved by our implementation and by the original Fast-SL to identify SLs with different cardinalities (up to quadruples). Since both methods identified the same SLs, the table does not report the number of SLs found by each method.

Table 1 Comparison of the number of LPs solved by Rapid-SL vs. Fast-SL for three GEMMs.

Table 1 indicates that our new implementation explores about 40–65% of the search space of the original Fast-SL, and it does not omit any potential cases (Supplementary Files S1–S3). This reduction in the search space is achieved by preventing the investigation of identical cases produced in different branches.

Applications of Rapid-SL

As the maximum desired cardinality of SLs increases, there is an exponential increase in both the search space and the number of cases to be examined in order to find all possible SLs. As a result, it is not feasible to find all possible SLs with high cardinalities (e.g., octuple SLs) using the algorithms that are currently available^23,27. Therefore, we take advantage of our new implementation to effectively investigate these large search spaces. Here we introduce three applications of Rapid-SL to perform the targeted enumeration of higher order SLs.

Searching a list of specific targets

The simplest method to find a fraction of solutions is to specify only a limited group of reactions. However, it is not clear what reactions should be selected. These reactions may be selected from a specific subsystem or pathway that has been diagnosed as important for the growth of the microorganism. For example, we performed a search to find octuple SLs (i.e., with eight reactions in a set) among 65 core reactions introduced by Hädicke and Klamt for generating a core model from iJO1366^39,40. The results obtained in this application are presented in Supplementary File S4.

Our new implementation makes this analysis feasible. At first sight, Rapid-SL may seem unnecessary here, because the small number of reactions involved suggests that an exhaustive search would suffice. However, the search space of Rapid-SL is about 50 times smaller than that of the exhaustive search, which is extremely time consuming even for a small number of reactions. Also, it is not feasible to perform this analysis using the original Fast-SL, because a separate algorithm would have to be devised for each cardinality.

Applying constraints on the branching of the DFS

Since Rapid-SL applies the DFS algorithm to investigate the search space, it is possible to define thresholds or conditions to limit the branching and search only the more probable parts of the corresponding tree. For example, we established a criterion in which sets are allowed to branch only if their deletion reduces the growth rate of the corresponding strain by at least 1%. The results obtained by applying this criterion to the process of identifying octuple SLs (n = 8) for iAF1260 are presented in the Supplementary File S5. Here, the critical value of 1% was selected based on trial and error. Other values could be employed based on the studied GEMM and the growth medium. Also, other types of constraints could be defined, such as the change in the pool of a specific metabolite or the fluxes of other reactions.
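A branching criterion of this kind reduces to a one-line test at each node. The sketch below assumes the growth reduction is measured relative to the parent strain of the node; the function and parameter names are ours, and the default of 1% is the value reported above.

```python
def should_branch(parent_growth: float, mutant_growth: float,
                  min_drop: float = 0.01) -> bool:
    """Return True if knocking out the set lowers growth by at least
    min_drop (as a fraction of the parent strain's growth rate)."""
    if parent_growth <= 0:
        return False  # parent does not grow, nothing to extend
    relative_drop = (parent_growth - mutant_growth) / parent_growth
    return relative_drop >= min_drop


if __name__ == "__main__":
    print(should_branch(1.0, 0.95))   # 5% drop
    print(should_branch(1.0, 0.999))  # 0.1% drop
```

In a DFS such as Rapid-SL, a test like this would be evaluated before a non-lethal set is passed on to the next level, so that branches with negligible growth impact are pruned early.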

Selective enumeration among the DFS branches

Consider the process of seeking the quadruple SLs of E. coli using iAF1260. If we group the branches of the Rapid-SL algorithm based on the number of reactions in the starting node of each branch, it is evident that the number of LPs solved in each group substantially increases as the cardinality of the starting node increases. On the other hand, the number of identified SLs per LP solved dramatically decreases (Table 2). Therefore, a large portion of the SLs can be identified by performing a lethality analysis on a limited number of branches.

Table 2 The number of SLs and corresponding LPs solved in each group of branches of the Rapid-SL, while searching for single lethals to quadruple SLs in iAF1260. The branches are grouped based on the number of members in their starting node. Evaluating only the first group of branches identifies over 34% of SLs, while only about 0.65% of LPs must be examined.

According to Table 2, extending only the branches in Group (I) identifies over 34% of all SLs (excluding single lethals), while only about 0.65% of all LPs are examined. It should be noted that the 254 SLs identified in Group (I) consist of 74 double, 98 triple, and 82 quadruple SLs. The same analyses were performed for the same microorganism (E. coli) with a different genome-scale model (i.e., iJO1366) and also for a different microorganism (i.e., Klebsiella pneumoniae, iYL1228⁴¹) to check the generalizability of this observation (Table 3).

Table 3 SLs identified by evaluation of only the branches with one reaction in the starting node (Group I).

It could be inferred from Table 3 that an evaluation of the branches in Group (I) is a reliable approach to find a considerable fraction of all SLs. For the GEMMs that were studied, we found up to 67% of all SLs (i.e. including double, triple and quadruple SLs) using the illustrated method by examining only about 1% of the search space that must be evaluated to find all quadruple SLs. We applied this method to find the octuple SLs of iAF1260 to investigate the efficiency of this approach for identification of higher order SLs with more than four targets in each set (Supplementary File S6).

Table 4 summarizes the results of the three applications of Rapid-SL. According to this table, over 9000 octuple SLs were found using the third application. Depending on the size of the GEMM and the maximum desired cardinality of SLs, it is possible to consider other groups of branches. For instance, Table 2 shows that evaluating both Groups (I) and (II) for iAF1260 reduces the search space by over 90% while identifying over 63% of all SLs.

Table 4 Results of three introduced applications.

Discussion

In this paper, we introduce Rapid-SL as a new implementation of Fast-SL that enables the algorithm to find higher order SLs with arbitrary cardinalities. Unlike Fast-SL, this new implementation fully supports embarrassingly parallel computations. Furthermore, compared to Fast-SL, the application of the DFS algorithm (a structured search method) decreased the number of evaluated LP problems by about 35–60%. The original implementation of Fast-SL is not embarrassingly parallel and suffers from time-consuming sequential computations in some of its steps. Accordingly, for larger models and higher order SLs, the difference between the computational times of Fast-SL and Rapid-SL increases, and the advantage of Rapid-SL grows. Although Rapid-SL is not limited in terms of the cardinality of SLs, it is not feasible to seek all SLs with higher cardinalities, especially when n > 4, without using computer clusters. On a single conventional computer, the runtime of this examination may extend to several months because of the tremendous number of potential cases. Owing to its structured implementation, Rapid-SL can find a considerable portion of higher order SLs by searching only a relatively small fraction of potential cases. Accordingly, three Rapid-SL applications were introduced: (a) searching among a selected list of potential reactions, (b) applying constraints on the branching of the DFS, and (c) selective enumeration among the DFS branches. These applications identified up to 67% of quadruple SLs by searching about 1% of the potential cases. In particular, over 9000 octuple synthetic lethal reaction sets were found for iAF1260 in the third application. Accordingly, Rapid-SL can be effective for investigating large models, such as genome-scale metabolic models of human cells, to find drug targets with high cardinalities.

Although the first two applications find fewer SLs than the third application, they may still be useful for seeking SLs with specific biological considerations. The importance of this feature becomes clearer when one notes that a single organism may have several thousand SLs with high cardinalities, and experimental validation of all of these SLs is not feasible due to the immense number of required experiments. Therefore, both scenarios require the consideration of biological criteria when searching for useful SL sets. In future work, we will focus on defining new criteria to reduce the number of potential drug-targetable SLs.

Conclusion

Although combinatorial therapeutics are expected to be effective against drug-resistant pathogens and higher order SLs can potentially nominate candidates for simultaneously attacking multiple targets, it is still challenging to determine which combinations would be practical and most useful. For example, possible negative synergistic effects narrow down the practical drug combinations. Furthermore, as the number of targets in an SL increases, the chance of finding a set in which all targets are druggable decreases. Therefore, it would be desirable to devise a systematic pipeline that takes the identified higher order SLs as the starting point and derives a combinatorial therapeutic design as the outcome.

Data availability

Rapid-SL is publicly available at https://github.com/CSBLaboratory/RapidSL (Supplementary File S7).

Acknowledgements

We sincerely thank Vincente LeCornu for textual edits.

Author information

These authors contributed equally: Payam Setoodeh and Habil Zare.

Authors and Affiliations

Department of Chemical Engineering, School of Chemical, Petroleum and Gas Engineering, Shiraz University, Shiraz, Iran
Mehdi Dehghan Manshadi & Payam Setoodeh
Glenn Biggs Institute for Alzheimer’s & Neurodegenerative Diseases, 7400 Merton Minter, San Antonio, TX, 78229, USA
Habil Zare
Department of Cell Systems and Anatomy, University of Texas Health Science Center, San Antonio, San Antonio, TX, USA
Habil Zare

Contributions

M.D.M., P.S., and H.Z. developed the algorithms. M.D.M. implemented the algorithms. P.S. and H.Z. analyzed the results. All authors wrote and reviewed the manuscript.

Corresponding authors

Correspondence to Payam Setoodeh or Habil Zare.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information 1.

Supplementary Information 2.

Supplementary Information 3.

Supplementary Information 4.

Supplementary Information 5.

Supplementary Information 6.

Supplementary Information 7.

Supplementary Information 8.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Dehghan Manshadi, M., Setoodeh, P. & Zare, H. Rapid-SL identifies synthetic lethal sets with an arbitrary cardinality. Sci Rep 12, 14022 (2022). https://doi.org/10.1038/s41598-022-18177-w

Received: 11 February 2022
Accepted: 05 August 2022
Published: 18 August 2022
DOI: https://doi.org/10.1038/s41598-022-18177-w
