IMGMD: A platform for the integration and standardisation of In silico Microbial Genome-scale Metabolic Models

Ye, Chao; Xu, Nan; Dong, Chuan; Ye, Yuannong; Zou, Xuan; Chen, Xiulai; Guo, Fengbiao; Liu, Liming

doi:10.1038/s41598-017-00820-6

Download PDF

Article
Open access
Published: 07 April 2017

IMGMD: A platform for the integration and standardisation of In silico Microbial Genome-scale Metabolic Models

Chao Ye^1,3,
Nan Xu^1,3,
Chuan Dong²,
Yuannong Ye^4,5,
Xuan Zou^1,3,
Xiulai Chen^1,3,
Fengbiao Guo² &
…
Liming Liu^1,3

Scientific Reports volume 7, Article number: 727 (2017) Cite this article

3009 Accesses
8 Citations
12 Altmetric
Metrics details

Subjects

Abstract

Genome-scale metabolic models (GSMMs) constitute a platform that combines genome sequences and detailed biochemical information to quantify microbial physiology at the system level. To improve the unity, integrity, correctness, and format of data in published GSMMs, a consensus IMGMD database was built in the LAMP (Linux + Apache + MySQL + PHP) system by integrating and standardizing 328 GSMMs constructed for 139 microorganisms. The IMGMD database can help microbial researchers download manually curated GSMMs, rapidly reconstruct standard GSMMs, design pathways, and identify metabolic targets for strategies on strain improvement. Moreover, the IMGMD database facilitates the integration of wet-lab and in silico data to gain an additional insight into microbial physiology. The IMGMD database is freely available, without any registration requirements, at http://imgmd.jiangnan.edu.cn/database.

A workflow for generating multi-strain genome-scale metabolic models of prokaryotes

Article 20 December 2019

Reconstruction of a catalogue of genome-scale metabolic models with enzymatic constraints using GECKO 2.0

Article Open access 30 June 2022

ODFM, an omics data resource from microorganisms associated with fermented foods

Article Open access 20 April 2021

Introduction

Genome-scale metabolic models (GSMMs) are a kind of a mathematical model that integrates multiple types of omics data, such as genomics, transcriptomics, proteomics, and metabolomics. GSMMs can clarify the relations among genes, proteins, and reactions. Models can be used to describe all biochemical reactions, metabolites, and genes involved in the metabolism of a specific organism. They have been used to decipher metabolic, regulatory, and signaling networks at the whole-organism level^1,2,3. Since the first GSMM of Haemophilus influenzae Rd was constructed in 1999⁴, more than 300 GSMMs for over 100 organisms have been built^{5, 6}.

GSMMs have been developed for over 15 years, and four steps are involved in their construction: creation of a draft model, manual refinement, conversion to a mathematical format, and network evaluation^6,7,8,9. Nonetheless, owing to differences in nomenclature¹⁰, integrity, and correctness¹¹ as well as the format¹² of published GSMMs, these models cannot be directly applied by other researchers. To reduce the manual labour needed for model construction and to increase the quality of GSMMs, some databases and tools, such as BIGG Models¹¹ and MetaNetX¹³, have been developed. Nonetheless, they are limited by the quality and quantity of models. For example, BIGG Models includes only 80 high-quality GSMMs (as of 30 November 2016), which is far from the number of published models. In MetaNetX, only 24 (14.7%) models have been published, which were validated by experimental results (http://www.metanetx.org/cgi-bin/mnxweb/repository).

In this study, we built a database named In silico Microbial Genome-scale Metabolic Models (IMGMD) in the LAMP (Linux + Apache + MySQL + PHP) system. It provides a platform for integration and standardisation of all published microbial GSMMs. In IMGMD, users are not only able to browse and download standardised GSMMs but can also reconstruct GSMMs automatically. In addition to pathway mining and mutation library functions, users can access information that can guide pathway design and metabolic target identification.

Results and Discussion

Database content and web interface

IMGMD (http://imgmd.jiangnan.edu.cn/database/) has a user-friendly website for the following applications: (1) It can be used to download standardised GSMMs; this module integrates model-related information, such as gene–protein–reaction relations, genome information, and references (Fig. 1A). (2) It enables auto-reconstruction of GSMMs; this tool is based on homology alignments, and only sequences that meet a threshold are used for model construction. Additionally, transport proteins and sub-cellular location are identified for further model refinement (Fig. 1B). (3) It can be applied to explore potential pathways; using this function, users can explore the potential pathways from one metabolite to another in a certain GSMM (Fig. 1C). (4) It guides metabolic engineering; the mutation library includes in silico and in vivo metabolic engineering results, and accordingly, it provides guidance for target searches (Fig. 1D).

All web interfaces of the IMGMD database were tested in various browsers, such as Google Chrome, Mozilla Firefox, Internet Explorer, Opera, and Safari on Windows or Linux platforms. Despite minor differences in appearance, all tools functioned normally in all the tested browsers and on all platforms. Among the browsers tested, Google Chrome and Mozilla Firefox provided the best user experience. Hence, we recommend that users access the database using one of these two browsers.

The ‘model browse’ function in IMGMD

Using model browse, users can browse, search, and download almost all published microorganism models. From the main page of model browse, basic model information, such as the number of genes, reactions, and metabolites can be accessed. Using the search bar, models can be queried by organism name, model name, kingdom, or year of publication. We chose Saccharomyces cerevisiae as an example to demonstrate the use of ‘Search for organism’. All 8 S. cerevisiae models are returned. Then, by clicking on ‘Saccharomyces cerevisiae S288c’ for model iND750, a user can find detailed information about the organism (e.g., strain, genome information, and ORFs), model (e.g., model name, cell compartments, model download, and in silico media for simulation), and reference (e.g. reference name, journal name, and publication date; Fig. 2). The genome information is linked to the NCBI database¹⁴, which contains the genome assembly and annotation report for a microorganism. ORFs are linked to the protein sequence downloaded from the UniProt database¹⁵. The in silico media are linked to the MediaDB database¹⁶, a database of microbial growth conditions in defined media, which can be applied as the constraint condition for metabolic model growth. These standardised GSMMs in IMGMD can be further applied to many analyses using the COBRA Toolbox^17,18,19,20. The ‘model browse’ module attempts to integrate scattered data on organisms, models, and literature, and promotes the establishment of GSMM standardisation.

The ‘model auto-construction’ function in IMGMD

Five steps are needed to construct a model in IMGMD: (1) choosing three models for reference; (2) uploading the genome sequence; (3) choosing a threshold (eukaryotic: identity ≥40%, identity ≤10E-30; prokaryotic: identity ≥30%, identity ≤10E-6); (4) entering an e-mail address to receive results (optional); (5) submitting the job to the IMGMD database. Once the job is complete, the results contain three parts, including the model, transport proteins, and prediction of protein subcellular localisation (Fig. 3). Model construction is automatically implemented on the basis of the sequence alignment results. After protein sequences are submitted, the local BLASTP program will calculate the sequence similarity. Sequences that meet the established threshold are automatically screened using a Python script written in our lab. Based on the local Blast results, genes with high similarity are replaced in the reference models. Additionally, transport proteins are identified according to the alignment results, using the TCDB database²¹. For eukaryotic organisms, WoLF PSORT²² was chosen, whereas for prokaryotic organisms (gram-positive, gram-negative, or Archaea), PSORTb²³ was employed to predict protein subcellular localisation.

Although some software or platforms for model auto-reconstruction have been developed, including ModelSEED²⁴, RAVEN²⁵, COBRA Toolbox²⁶, SuBliMinal²⁷, these tools have their advantages and disadvantages¹². For instance, ModelSEED (http://modelseed.org/) is a Web service that includes the RAST genome annotation tool. Based on the annotation results, a model for a specific organism can be reconstructed automatically. Given that the RAST service (http://rast.nmpdr.org/rast.cgi) can annotate only prokaryotes, ModelSEED has limited applicability to eukaryotes. Besides, model construction by ModelSEED will take a long time, according to the job numbers. IMGMD is also a web platform that serves for model construction. It is based on the results of genome homologous alignment. Users can upload a target organism’s genome sequence and choose relevant parameters. After submission of the job to IMGMD, results will be returned within 1 day. Nonetheless, a model constructed by IMGMD is a draft model. It still needs to be further processed to obtain a GSMM. The COBRA Toolbox is based on the Matlab platform, which is commonly used for model construction. The COBRA Toolbox requires users to have basic Matlab knowledge and an advanced computer configuration for model analysis (Table 1).

Table 1 Selected characteristics of software platforms for reconstruction and simulation of metabolic networks.

Full size table

Pathway mining function in IMGMD

In this module, users can explore metabolic pathways at three levels. (1) According to the input metabolites as substrates and products, total pathways from a substrate to product in a GSMM can be output. For example, in the Mortierella alpina model iCY1106²⁸, 21 pathways exist from glucose to pyruvate, indicating that in addition to the basic glycolysis pathway in M. alpina (according to the KEGG pathway²⁹), other pathways also could generate pyruvate. On the web page of pathway-mining results, information about the substrate and production can be linked to some metabolic databases, like KEGG²⁹, ModelSEED²⁴, ChEBI³⁰, and PubChem³¹. Besides, on the page of detailed pathway information, reactions participating in a pathway are shown, including Reaction ID, Formula, Genes, Subsystem, and EC numbers (Fig. 4). (2) Comparisons between two or more GSMMs help to understand phenotypic characteristics based on metabolic pathway differences. When comparing the pathway differences between two Archaea, Methanococcus maripaludis (iMM518)³² and Methanosarcina barkeri (iMG746)³³, there were 8 and 12 pathways from glucose to pyruvate, respectively. (3) Pathways that generate highly valuable products may exist in typical organisms. To mine these potential pathways, users can choose all collected models for the search, and then choose reactions in which species and corresponding genes can serve as references for a target strain to guide strain design. Considering these three levels, the function of pathway mining may be useful in synthetic biology and systems metabolic engineering.

Mutation library function in IMGMD

The pathway prediction tool enables new pathway design for metabolic engineering; additionally, the mutation library function can be used for optimisation of the host strain. It can help to identify targets that couple cell growth with product formation, e.g., targets for gene upregulation, downregulation, and gene deletion^{34, 35}. In IMGMD, a library that combines in vivo and in silico results to guide metabolic engineering was created.

Organisms, models, and genes can be used as keywords to search for mutation information. For example, in a search for mutation information with model iAF1260, 217 results can be found. The effect of a knockout of b4025, which encodes glucosephosphate isomerase (pgi, EC: 5.3.1.9) in E. coli, the growth rate and production rate can be viewed on another webpage (Fig. 5). According to the information on this new page, when galactose serves as a carbon source, the in silico growth decreases by 36.1%, while the in vivo growth rate increases by 12.0%³⁶ (Table 2). Furthermore, the amino acid sequence and nucleic acid sequence of gene b4025 were also included (Fig. 5). The EC number of 5.3.1.9 is linked to BRENDA database for more detailed information.

Table 2 Results of the b4025 deletion in E. coli collected by IMGMD³⁶.

Full size table

In this module, 950 total mutation results were collected by literature mining. Additionally, 885 results (93.2%) were related to various knockout strategies, involving different algorithms, such as OptKnock¹⁹, GDLS³⁷, ReacKnock³⁸, DBFBA³⁹, BAFBA⁴⁰, and RobustKnock⁴¹. The remaining results are related to gene upregulation or downregulation⁴². Combined with the pathway mining and mutation library modules, IMGMD can be used to guide systems metabolic engineering, for both pathway screening and for target identification.

Methods

Data collection

GSMM data were collected from existing databases (http://systemsbiology.ucsd.edu/InSilicoOrganisms/OtherOrganisms, http://synbio.tju.edu.cn/GSMNDB/gsmndb.htm) and by scrutinizing the primary scientific literature (Web of Science, PubMed, and Google Scholar; Fig. 6). As of November 30th, 2016, 328 GSMMs covering 139 microorganisms have been collected. Of these models, bacteria, eukaryotes, and archaea account for 82.1%, 15.2%, and 2.7%, respectively. Additionally, based on literature mining results from 3,296 articles concerning GSMMs, a mutation library containing 950 mutations in 683 genes from 31 microorganisms was generated.

Data processing

After collecting information on 328 models, 58 models could not be found, and 270 downloadable models were classified by format according to their written language, i.e., Systems Biology Markup Language (SBML), Microsoft Excel, Microsoft Word, or PDF. To read these models using the COBRA Toolbox, models in all formats were rewritten in the Excel format. Word and PDF files were manually transformed into Excel files. The SBML models were transformed into Excel, using the COBRA Toolbox on the Matlab platform. Nonetheless, during this process, some models written in SBML could not be read using the COBRA Toolbox. Eventually, 265 of total 328 (80.8%) published models were standardised in the Excel and SBML formats and can be downloaded from our database.

GSMMs consist of metabolite lists and reaction lists. Since the GSMMs were constructed by different researchers, metabolites can be represented in various forms. For example, in E. coli model iAF1260⁴³, pyruvate was represented as pyr. In Saccharomyces cerevisiae model Yeast 1.0⁴⁴, and Yarrowia lipolytica model iNL895⁴⁵, it was indicated by PYR and s_1277, respectively. In the IMGMD database, according to their unique IDs in various biochemical databases (KEGG, SEED, ChEBI, and PubChem), the metabolites from different models were unified using IMGMD metabolite IDs. Then, 8367 metabolites from these different models were integrated. Additionally, 77.65% of metabolites can be linked to at least one of these databases (Table 3).

Table 3 Distribution of metabolites from different metabolite databases.

Full size table

A list of reactions, including the Gene–Protein–Reaction relations for the models, should contain 15 columns of information, e.g., a reaction description, formula, and genes²⁶. For the formula column, metabolites are first replaced and rearranged according to their unified IMGMD database metabolite IDs. Additionally, because some information was lacking, data (e.g., gene data) were collected by referring to information such as EC numbers, reaction descriptions, and formulas, in the other columns for the models. Finally, 17 of 19 (89.5%) GSMMs were filled out, except for models of Streptomyces lividans (GMD-TK24) and Pseudomonas putida (PpuMBEL1071). For the Formula column, metabolites were first replaced with the unified IMGMD metabolite IDs. For example, the reaction catalysed by alcohol dehydrogenase (EC: 1.1.1.1) was unified as ‘M00442[c] + M00003[c] 〈=〉 M00083[c] + M00079[c] + M00004[c]’. In all models, to arrange metabolites on the left and right of ‘〈=〉 ’ in order, a MATLAB script developed in our lab was used. Lastly, the reaction was unified as ‘M00003[c] + M00442[c] 〈=〉 M00004[c] + M00079[c] + M00083[c]’. If we ignore the cell compartments and do not count transport and exchange reactions, the database contains 21436 reactions.

During the process of literature mining, mutation information is stored in an Excel file. Information such as organisms, models, genes, operations, in vivo or in silico production, and in vivo or in silico growth rate is collected. Additionally, amino acid and nucleic acid sequences of related genes collected from the KEGG database are also stored in this Excel file.

Database design and implementation

All processed data are stored in a MySQL database and are available through a Web server built in the standard LAMP (Linux + Apache + MySQL + PHP) system to provide fast and secure data access. XAMPP for Linux 5.6.15 (https://www.apachefriends.org/index.html) was installed on CentOS Linux 5.8 (https://www.centos.org/download/). BLAST 2.2.28 (ftp://ftp.ncbi.nlm.nih.gov/blast/executables/release) and Python 2.7.11 (https://www.python.org/) were used for model auto-reconstruction, and a C++ script based on a depth optimisation algorithm was written to explore the metabolic pathways in a particular GSMM.

Conclusion

The IMGMD database (http://imgmd.jiangnan.edu.cn/database) provides a platform that integrates the names of metabolites and metabolic reactions from common biochemical databases and existing model repositories. This database includes 328 models for 139 microorganisms and provides 265 standardised models for downloading. Based on a homologous sequence alignment method, models can be reconstructed automatically in the IMGMD database, which can accelerate the process of model construction. Furthermore, IMGMD provides a pathway mining tool for pathway design and a mutation library for strain optimisation.

Compared with other GSMM databases, the IMGMD database is specific for microorganisms. It is user-friendly and feature-rich; accordingly, the scientific community can easily use and extend the knowledge base. Thus, IMGMD will be a useful database for the design phase of systems metabolic engineering. Future developments include integration of the COBRA Toolbox, which will allow users to directly simulate gene deletion or over-expression, on the IMGMD platform. Besides, the IMGMD database is maintained by our lab and will be updated annually, to keep pace with the advances of GSMMs.

References

Liu, L. M., Agren, R., Bordel, S. & Nielsen, J. Use of genome-scale metabolic models for understanding microbial physiology. FEBS Lett. 584, 2556–2564 (2010).
Article CAS PubMed Google Scholar
Bordbar, A., Monk, J. M., King, Z. A. & Palsson, B. O. Constraint-based models predict metabolic and associated cellular functions. Nat. Rev. Genet. 15, 107–120 (2014).
Article CAS PubMed Google Scholar
O’Brien, E. J., Monk, J. M. & Palsson, B. O. Using Genome-scale Models to Predict Biological Capabilities. Cell 161, 971–987 (2015).
Article PubMed PubMed Central Google Scholar
Edwards, I. J. Systems Properties of the Haemophilus influenzae Rd Metabolic Genotype. J. Biol. Chem. 274, 17410–17416 (1999).
Article CAS PubMed Google Scholar
Monk, J., Nogales, J. & Palsson, B. O. Optimizing genome-scale network reconstructions. Nat. Biotechnol. 32, 447–452 (2014).
Article CAS PubMed Google Scholar
Kim, T. Y., Sohn, S. B., Bin Kim, Y., Kim, W. J. & Lee, S. Y. Recent advances in reconstruction and applications of genome-scale metabolic models. Curr. Opin. Biotechnol. 23, 617–623 (2012).
Article CAS PubMed Google Scholar
Thiele, I. & Palsson, B. Ø. A protocol for generating a high-quality genome-scale metabolic reconstruction. Nat. Protoc. 5, 93–121 (2010).
Article CAS PubMed PubMed Central Google Scholar
Fondi, M. & Liò, P. In Bacterial Pangenomics: Methods and Protocols (eds Alessio Mengoni, Marco Galardini & Marco Fondi) 233–256 (Springer New York, 2015).
Nogales, J. In A Practical Protocol for Genome-Scale Metabolic Reconstructions 1–25 (Humana Press, 2014).
Ganter, M., Bernard, T., Moretti, S., Stelling, J. & Pagni, M. MetaNetX.org: a website and repository for accessing, analysing and manipulating metabolic networks. Bioinformatics 29, 815–816 (2013).
Article CAS PubMed PubMed Central Google Scholar
King, Z. A. et al. BiGG Models: A platform for integrating, standardizing and sharing genome-scale models. Nucleic Acids Res. 44, D515–D522 (2016).
Article PubMed Google Scholar
Ravikrishnan, A. & Raman, K. Critical assessment of genome-scale metabolic networks: the need for a unified standard. Brief. Bioinform. 16, 1057–1068 (2015).
Article PubMed Google Scholar
Moretti, S. et al. MetaNetX/MNXref - reconciliation of metabolites and biochemical reactions to bring together genome-scale metabolic networks. Nucleic Acids Res. 44, D523–D526 (2016).
Article PubMed Google Scholar
Agarwala, R. et al. Database resources of the National Center for Biotechnology Information. Nucleic Acids Res. 44, D7–D19 (2016).
Article Google Scholar
Apweiler, R. et al. Activities at the Universal Protein Resource (UniProt). Nucleic Acids Res. 42, D191–D198 (2014).
Article Google Scholar
Richards, M. A. et al. MediaDB: A Database of Microbial Growth Conditions in Defined Media. Plos One 9 (2014).
Orth, J. D., Thiele, I. & Palsson, B. O. What is flux balance analysis? Nat. Biotechnol. 28, 245–248 (2010).
Article CAS PubMed PubMed Central Google Scholar
Oh, Y. K., Palsson, B. O., Park, S. M., Schilling, C. H. & Mahadevan, R. Genome-scale reconstruction of metabolic network in Bacillus subtilis based on high-throughput phenotyping and gene essentiality data. J. Biol. Chem. 282, 28791–28799 (2007).
Article CAS PubMed Google Scholar
Burgard, A. P., Pharkya, P. & Maranas, C. D. OptKnock: A bilevel programming framework for identifying gene knockout strategies for microbial strain optimization. Biotechnol. Bioeng. 84, 647–657 (2003).
Article CAS PubMed Google Scholar
Ranganathan, S., Suthers, P. F. & Maranas, C. D. OptForce: An Optimization Procedure for Identifying All Genetic Manipulations Leading to Targeted Overproductions. PLoS Comput. Biol. 6 (2010).
Saier, M. H. et al. The Transporter Classification Database (TCDB): recent advances. Nucleic Acids Res. 44, D372–D379 (2015).
Article PubMed PubMed Central Google Scholar
Horton, P. et al. WoLF PSORT: protein localization predictor. Nucleic Acids Res. 35, W585–W587 (2007).
Article PubMed PubMed Central Google Scholar
Nancy, Y. Y. et al. PSORTb 3.0: improved protein subcellular localization prediction with refined localization subcategories and predictive capabilities for all prokaryotes. Bioinformatics 26, 1608–1615 (2010).
Article Google Scholar
Overbeek, R. et al. The SEED and the Rapid Annotation of microbial genomes using Subsystems Technology (RAST). Nucleic Acids Res. 42, D206–D214 (2014).
Article CAS PubMed Google Scholar
Agren, R. et al. The RAVEN Toolbox and Its Use for Generating a Genome-scale Metabolic Model for Penicillium chrysogenum. PLoS Comput. Biol. 9, e1002980 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Schellenberger, J. et al. Quantitative prediction of cellular metabolism with constraint-based models: the COBRA Toolbox v2. 0. Nat. Protoc. 6, 1290–1307 (2011).
Article CAS PubMed PubMed Central Google Scholar
Swainston, N., Smallbone, K., Mendes, P., Kell, D. & Paton, N. The SuBliMinaL Toolbox: automating steps in the reconstruction of metabolic networks. J. Integr. Bioinform. 8, 186–202 (2011).
PubMed Google Scholar
Ye, C. et al. Reconstruction and analysis of a genome-scale metabolic model of the oleaginous fungus Mortierella alpina. BMC Syst. Biol. 9, 1–11 (2015).
Article PubMed PubMed Central Google Scholar
Kanehisa, M., Sato, Y., Kawashima, M., Furumichi, M. & Tanabe, M. KEGG as a reference resource for gene and protein annotation. Nucleic Acids Res. 44, D457–D462 (2016).
Article PubMed Google Scholar
Hastings, J. et al. ChEBI in 2016: Improved services and an expanding collection of metabolites. Nucleic Acids Res. 44, D1214–D1219 (2016).
Article PubMed Google Scholar
Kim, S. et al. PubChem Substance and Compound databases. Nucleic Acids Res. 44, D1202–D1213 (2016).
Article PubMed Google Scholar
Goyal, N., Widiastuti, H., Karimi, I. A. & Zhou, Z. A genome-scale metabolic model of Methanococcus maripaludis S2 for CO2 capture and conversion to methane. Mol. Biosyst. 10, 1043–1054 (2014).
Article CAS PubMed Google Scholar
Gonnerman, M. C., Benedict, M. N., Feist, A. M., Metcalf, W. W. & Price, N. D. Genomically and biochemically accurate metabolic reconstruction of Methanosarcina barkeri Fusaro, iMG746. Biotech. J. 8, 1070–1079 (2013).
Article CAS Google Scholar
Dai, Z. & Nielsen, J. Advancing metabolic engineering through systems biology of industrial microorganisms. Curr. Opin. Biotechnol. 36, 8–15 (2015).
Article PubMed Google Scholar
Kim, B., Kim, W. J., Kim, D. I. & Lee, S. Y. Applications of genome-scale metabolic network model in metabolic engineering. J. Ind. Microbiol. Biotechnol. 42, 339–348 (2015).
Article CAS PubMed Google Scholar
Seppala, J. J. et al. Prospecting hydrogen production of Escherichia coli by metabolic network modeling. Int. J. Hydrogen. Energ. 38, 11780–11789 (2013).
Article CAS Google Scholar
Kim, J. & Reed, J. L. Refining metabolic models and accounting for regulatory effects. Curr. Opin. Biotechnol. 29, 34–38 (2014).
Article CAS PubMed Google Scholar
Xu, Z., Zheng, P., Sun, J. & Ma, Y. ReacKnock: Identifying Reaction Deletion Strategies for Microbial Strain Optimization Based on Genome-Scale Metabolic Network. Plos One 8 (2013).
Choon, Y. W. et al. Differential Bees Flux Balance Analysis with OptKnock for In Silico Microbial Strains Optimization. Plos One 9 (2014).
Choon, Y. W., Mohamad, M. S., Deris, S. & Illias, R. M. A hybrid of bees algorithm and flux balance analysis (BAFBA) for the optimisation of microbial strains. Int. J. Data Min. Bioin. 10, 225–238 (2014).
Article Google Scholar
Tepper, N. & Shlomi, T. Predicting metabolic engineering knockout strategies for chemical production: accounting for competing pathways. Bioinformatics 26, 536–543 (2010).
Article CAS PubMed Google Scholar
Bhan, N., Xu, P., Khalidi, O. & Koffas, M. A. Redirecting carbon flux into malonyl-CoA to improve resveratrol titers: proof of concept for genetic interventions predicted by OptForce computational framework. Chem. Eng. Sci. 103, 109–114 (2013).
Article CAS Google Scholar
Feist, A. M. et al. A genome-scale metabolic reconstruction for Escherichia coli K-12 MG1655 that accounts for 1260 ORFs and thermodynamic information. Mol. Syst. Biol. 3, 121–128 (2007).
Article PubMed PubMed Central Google Scholar
Dobson, P. D. et al. Further developments towards a genome-scale metabolic model of yeast. BMC Syst. Biol. 4 (2010).
Loira, N., Dulermo, T., Nicaud, J. M. & Sherman, D. J. A genome-scale metabolic model of the lipid-accumulating yeast Yarrowia lipolytica. BMC Syst. Biol. 6, 35 (2012).
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This work was supported by the National High Technology Research and Development Program of China (No. 2014AA021501, 2014AA021701), the National Natural Science Foundation of China (No. 21422602, 31660320), the Research Program for State Key Laboratory of Food Science and Technology of Jiangnan University (SKLF-ZZA-201602).

Author information

Authors and Affiliations

State Key Laboratory of Food Science and Technology, Jiangnan University, 1800 Lihu Road, Wuxi, Jiangsu, 214122, China
Chao Ye, Nan Xu, Xuan Zou, Xiulai Chen & Liming Liu
Key Laboratory for Neuroinformation of Ministry of Education, University of Electronic Science and Technology of China, No. 4, 2nd Section, North Jianshe Road, Chengdu, Sichuan, 610054, China
Chuan Dong & Fengbiao Guo
The Key Laboratory of Industrial Biotechnology, Ministry of Education, Jiangnan University, 1800 Lihu Road, Wuxi, Jiangsu, 214122, China
Chao Ye, Nan Xu, Xuan Zou, Xiulai Chen & Liming Liu
School of Biology and Engineering, Guizhou Medical University, Dongqing Road, Huaxi District, Guiyang, Guizhou, 550025, China
Yuannong Ye
School of Big Health, Guizhou Medical University, Dongqing Road, Huaxi District, Guiyang, Guizhou, 550025, China
Yuannong Ye

Authors

Chao Ye
View author publications
You can also search for this author in PubMed Google Scholar
Nan Xu
View author publications
You can also search for this author in PubMed Google Scholar
Chuan Dong
View author publications
You can also search for this author in PubMed Google Scholar
Yuannong Ye
View author publications
You can also search for this author in PubMed Google Scholar
Xuan Zou
View author publications
You can also search for this author in PubMed Google Scholar
Xiulai Chen
View author publications
You can also search for this author in PubMed Google Scholar
Fengbiao Guo
View author publications
You can also search for this author in PubMed Google Scholar
Liming Liu
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

C.Y., N.X., C.D. and Y.Y. designed and developed database, X.Z. and X.C. performed data collecting and analysis. C.Y., F.G. and L.L. wrote the manuscript with contributions of all authors. All authors reviewed the manuscript.

Corresponding author

Correspondence to Liming Liu.

Ethics declarations

Competing Interests

The authors declare that they have no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Ye, C., Xu, N., Dong, C. et al. IMGMD: A platform for the integration and standardisation of In silico Microbial Genome-scale Metabolic Models. Sci Rep 7, 727 (2017). https://doi.org/10.1038/s41598-017-00820-6

Download citation

Received: 22 September 2016
Accepted: 16 March 2017
Published: 07 April 2017
DOI: https://doi.org/10.1038/s41598-017-00820-6

This article is cited by

In silico cell factory design driven by comprehensive genome-scale metabolic models: development and challenges
- Jiangong Lu
- Xinyu Bi
- Long Liu
Systems Microbiology and Biomanufacturing (2023)
Genome-scale metabolic network models: from first-generation to next-generation
- Chao Ye
- Xinyu Wei
- Wei Zou
Applied Microbiology and Biotechnology (2022)
Network reduction methods for genome-scale metabolic models
- Dipali Singh
- Martin J. Lercher
Cellular and Molecular Life Sciences (2020)
Genomic sequencing, genome-scale metabolic network reconstruction, and in silico flux analysis of the grape endophytic fungus Alternaria sp. MG1
- Yao Lu
- Chao Ye
- Junling Shi
Microbial Cell Factories (2019)
Genome-scale biological models for industrial microbial systems
- Nan Xu
- Chao Ye
- Liming Liu
Applied Microbiology and Biotechnology (2018)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.