High-throughput generation, optimization and analysis of genome-scale metabolic models

Henry, Christopher S; DeJongh, Matthew; Best, Aaron A; Frybarger, Paul M; Linsay, Ben; Stevens, Rick L

doi:10.1038/nbt.1672

Resource
Published: 29 August 2010

High-throughput generation, optimization and analysis of genome-scale metabolic models

Christopher S Henry¹,
Matthew DeJongh²,
Aaron A Best³,
Paul M Frybarger^2,3,
Ben Linsay⁴ &
…
Rick L Stevens^4,5

Nature Biotechnology volume 28, pages 977–982 (2010)Cite this article

15k Accesses
692 Citations
19 Altmetric
Metrics details

Subjects

Abstract

Genome-scale metabolic models have proven to be valuable for predicting organism phenotypes from genotypes. Yet efforts to develop new models are failing to keep pace with genome sequencing. To address this problem, we introduce the Model SEED, a web-based resource for high-throughput generation, optimization and analysis of genome-scale metabolic models. The Model SEED integrates existing methods and introduces techniques to automate nearly every step of this process, taking ∼48 h to reconstruct a metabolic model from an assembled genome sequence. We apply this resource to generate 130 genome-scale metabolic models representing a taxonomically diverse set of bacteria. Twenty-two of the models were validated against available gene essentiality and Biolog data, with the average model accuracy determined to be 66% before optimization and 87% after optimization.

Access through your institution

Buy or subscribe

This is a preview of subscription content, access via your institution

Access options

Access through your institution

Buy this article

Purchase on Springer Link
Instant access to full article PDF

Buy now

Prices may be subject to local taxes which are calculated during checkout

**Figure 1: Model SEED genome-scale metabolic reconstruction pipeline.**

**Figure 2: Properties of SEED models organized by taxonomy.**

**Figure 3: Properties of SEED models predicted using FBA.**

**Figure 4: Accuracy of models generated by the Model SEED pipeline.**

Highly accurate protein structure prediction with AlphaFold

Article Open access 15 July 2021

John Jumper, Richard Evans, … Demis Hassabis

Three million images and morphological profiles of cells treated with matched chemical and genetic perturbations

Article Open access 09 April 2024

Srinivas Niranj Chandrasekaran, Beth A. Cimini, … Anne E. Carpenter

Inferring gene regulatory networks from single-cell multiome data using atlas-scale external data

Article Open access 12 April 2024

Qiuyue Yuan & Zhana Duren

References

Yus, E. et al. Impact of genome reduction on bacterial metabolism and its regulation. Science 326, 1263–1268 (2009).
Article CAS Google Scholar
Kumar, V.S. & Maranas, C.D. GrowMatch: an automated method for reconciling in silico/in vivo growth predictions. PLoS Comput. Biol. 5, e1000308 (2009).
Article Google Scholar
Feist, A.M. & Palsson, B.O. The growing scope of applications of genome-scale metabolic reconstructions using Escherichia coli. Nat. Biotechnol. 26, 659–667 (2008).
Article CAS Google Scholar
Thiele, I. & Palsson, B. A protocol for generating a high-quality genome-scale metabolic reconstruction. Nat. Protoc. 5, 93–121 (2010).
Article CAS Google Scholar
Overbeek, R., Disz, T. & Stevens, R. The SEED: A peer-to-peer environment for genome annotation. Commun. ACM 47, 46–51 (2004).
Article Google Scholar
Aziz, R.K. et al. The RAST Server: rapid annotations using subsystems technology. BMC Genomics 9, 75 (2008).
Article Google Scholar
DeJongh, M. et al. Toward the automated generation of genome-scale metabolic networks in the SEED. BMC Bioinformatics 8, 139 (2007).
Article Google Scholar
Jankowski, M.D., Henry, C.S., Broadbelt, L.J. & Hatzimanikatis, V. Group contribution method for thermodynamic analysis of complex metabolic networks. Biophys. J. 95, 1487–1499 (2008).
Article CAS Google Scholar
Henry, C.S., Zinner, J., Cohoon, M. & Stevens, R. iBsu1103: a new genome scale metabolic model of B. subtilis based on SEED annotations. Genome Biol. 10, R69 (2009).
Article Google Scholar
Suthers, P.F. et al. A genome-scale metabolic reconstruction of Mycoplasma genitalium, iPS189. PLOS Comput. Biol. 5, e1000285 (2009).
Article Google Scholar
Notebaart, R.A., van Enckevort, F.H., Francke, C., Siezen, R.J. & Teusink, B. Accelerating the reconstruction of genome-scale metabolic networks. BMC Bioinformatics 7, 296 (2006).
Article Google Scholar
Tsoka, S., Simon, D. & Ouzounis, C.A. Automated metabolic reconstruction for Methanococcus jannaschii. Archaea 1, 223–229 (2004).
Article CAS Google Scholar
Moriya, Y., Itoh, M., Okuda, S., Yoshizawa, A.C. & Kanehisa, M. KAAS: an automatic genome annotation and pathway reconstruction server. Nucleic Acids Res. 35, W182–W185 (2007).
Article Google Scholar
Pramanik, J. & Keasling, J.D. Effect of Escherichia coli biomass composition on central metabolic fluxes predicted by a stoichiometric model. Biotechnol. Bioeng. 60, 230–238 (1998).
Article CAS Google Scholar
Satish Kumar, V., Dasika, M.S. & Maranas, C.D. Optimization based automated curation of metabolic reconstructions. BMC Bioinformatics 8, 212 (2007).
Article Google Scholar
Kanehisa, M. & Goto, S. KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 28, 27–30 (2000).
Article CAS Google Scholar
Kanehisa, M., Goto, S., Kawashima, S. & Nakaya, A. The KEGG databases at GenomeNet. Nucleic Acids Res. 30, 42–46 (2002).
Article CAS Google Scholar
Feist, A.M. et al. A genome-scale metabolic reconstruction for Escherichia coli K-12 MG1655 that accounts for 1260 ORFs and thermodynamic information. Mol. Syst. Biol. 3, 121 (2007).
Article Google Scholar
Reed, J.L., Vo, T.D., Schilling, C.H. & Palsson, B.O. An expanded genome-scale model of Escherichia coli K-12 (iJR904 GSM/GPR). Genome Biol. 4, R54 (2003).
Article Google Scholar
Durot, M. et al. Iterative reconstruction of a global metabolic model of Acinetobacter baylyi ADP1 using high-throughput growth phenotype and gene essentiality data. BMC Syst. Biol. 2, 85 (2008).
Article Google Scholar
Oh, Y.K., Palsson, B.O., Park, S.M., Schilling, C.H. & Mahadevan, R. Genome-scale reconstruction of metabolic network in Bacillus subtilis based on high-throughput phenotyping and gene essentiality data. J. Biol. Chem. 282, 28791–28799 (2007).
Article CAS Google Scholar
Goelzer, A. et al. Reconstruction and analysis of the genetic and metabolic regulatory networks of the central metabolism of Bacillus subtilis. BMC Syst. Biol. 2, 20 (2008).
Article Google Scholar
Schilling, C.H. et al. Genome-scale metabolic model of Helicobacter pylori 26695. J. Bacteriol. 184, 4582–4593 (2002).
Article CAS Google Scholar
Oliveira, A.P., Nielsen, J. & Forster, J. Modeling Lactococcus lactis using a genome-scale flux model. BMC Microbiol. 5, 39 (2005).
Article Google Scholar
Feist, A.M., Scholten, J.C., Palsson, B.O., Brockman, F.J. & Ideker, T. Modeling methanogenesis with a genome-scale metabolic reconstruction of. Methanosarcina barkeri. Mol. Syst. Biol. 2, 2006 0004 (2006).
PubMed Google Scholar
Jamshidi, N. & Palsson, B.O. Investigating the metabolic capabilities of Mycobacterium tuberculosis H37Rv using the in silico strain iNJ661 and proposing alternative drug targets. BMC Syst. Biol. 1, 26 (2007).
Article Google Scholar
Nogales, J., Palsson, B.O. & Thiele, I. A genome-scale metabolic reconstruction of Pseudomonas putida KT2440: iJN746 as a cell factory. BMC Syst. Biol. 2, 79 (2008).
Article Google Scholar
Duarte, N.C., Herrgard, M.J. & Palsson, B.O. Reconstruction and validation of Saccharomyces cerevisiae iND750, a fully compartmentalized genome-scale metabolic model. Genome Res. 14, 1298–1309 (2004).
Article CAS Google Scholar
Becker, S.A. & Palsson, B.O. Genome-scale reconstruction of the metabolic network in Staphylococcus aureus N315: an initial draft to the two-dimensional annotation. BMC Microbiol. 5, 8 (2005).
Article Google Scholar
Douglas, A.E. Nutritional interactions in insect-microbial symbioses: aphids and their symbiotic bacteria Buchnera. Annu. Rev. Entomol. 43, 17–37 (1998).
Article CAS Google Scholar
Mahadevan, R. & Schilling, C.H. The effects of alternate optimal solutions in constraint-based genome-scale metabolic models. Metab. Eng. 5, 264–276 (2003).
Article CAS Google Scholar
Konstantinidis, K.T. & Tiedje, J.M. Trends between gene content and genome size in prokaryotic species with larger genomes. Proc. Natl. Acad. Sci. USA 101, 3160–3165 (2004).
Article CAS Google Scholar
von Eiff, C. et al. Phenotype microarray profiling of Staphylococcus aureus menD and hemB mutants with the small-colony-variant phenotype. J. Bacteriol. 188, 687–693 (2006).
Article CAS Google Scholar
Bochner, B.R. Global phenotypic characterization of bacteria. FEMS Microbiol. Rev. 33, 191–205 (2009).
Article CAS Google Scholar
Keymer, D.P., Miller, M.C., Schoolnik, G.K. & Boehm, A.B. Genomic and phenotypic diversity of coastal Vibrio cholerae strains is linked to environmental factors. Appl. Environ. Microbiol. 73, 3705–3714 (2007).
Article CAS Google Scholar
Gerdes, S. et al. Essential genes on metabolic maps. Curr. Opin. Biotechnol. 17, 448–456 (2006).
Article CAS Google Scholar
Zhang, R., Ou, H.Y. & Zhang, C.T. DEG: a database of essential genes. Nucleic Acids Res. 32, D271–D272 (2004).
Article CAS Google Scholar
Nakahigashi, K. et al. Systematic phenome analysis of Escherichia coli multiple-knockout mutants reveals hidden reactions in central carbon metabolism. Mol. Syst. Biol. 5, 306 (2009).
Article Google Scholar
Karp, P.D., Riley, M., Paley, S.M. & Pellegrini-Toole, A. The MetaCyc Database. Nucleic Acids Res. 30, 59–61 (2002).
Article CAS Google Scholar

Download references

Acknowledgements

This work was supported by the US Department of Energy under contract DE-ACO2-06CH11357, by the National Institute of Allergy and Infectious Diseases under contract HHSN266200400042C and by the National Science Foundation under grants MCB-0745100, CCF-0829929 and DBI-0850546. M.D. was also supported by the Argonne National Laboratory Guest Faculty Program and the United States Fulbright Scholarship Program. We acknowledge the SEED annotators and development team for producing the annotations and computational infrastructure that make this work possible. We thank R. Overbeek, V. Vonstein, R. Olson, T. Disz, S. Devoid, F. Xia and T. Paczian for assistance with the use of the SEED genome annotation and analysis tools.

Author information

Authors and Affiliations

Mathematics and Computer Science Division, Argonne National Laboratory, Argonne, Illinois, USA
Christopher S Henry
Department of Computer Science, Hope College, Holland, Michigan, USA
Matthew DeJongh & Paul M Frybarger
Department of Biology, Hope College, Holland, Michigan, USA
Aaron A Best & Paul M Frybarger
Computer Science Department and Computation Institute, University of Chicago, Chicago, Illinois, USA
Ben Linsay & Rick L Stevens
Computing, Environment, and Life Sciences Directorate, Argonne National Laboratory, Argonne, Illinois, USA
Rick L Stevens

Authors

Christopher S Henry
View author publications
You can also search for this author in PubMed Google Scholar
Matthew DeJongh
View author publications
You can also search for this author in PubMed Google Scholar
Aaron A Best
View author publications
You can also search for this author in PubMed Google Scholar
Paul M Frybarger
View author publications
You can also search for this author in PubMed Google Scholar
Ben Linsay
View author publications
You can also search for this author in PubMed Google Scholar
Rick L Stevens
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

C.S.H. developed and operated the Model SEED resource and the related algorithms with assistance from all other authors. All authors participated in curation and testing of the SEED models. M.D. contributed significantly to reaction network annotation, model testing and model curation. A.A.B. contributed significantly to template biomass reaction design, model curation and selection of organisms for reconstruction. P.M.F. contributed to reaction network annotation. R.L.S. contributed to model curation and the design of pipeline and algorithms. B.L. contributed to Model SEED web interface development. All authors contributed to the writing and revision of the manuscript.

Corresponding author

Correspondence to Christopher S Henry.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Supplementary Text and Figures

Supplementary Methods and Supplementary Fig. 1 (PDF 676 kb)

Supplementary Tables

Supplementary Tables S1–S9 (XLS 4084 kb)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Henry, C., DeJongh, M., Best, A. et al. High-throughput generation, optimization and analysis of genome-scale metabolic models. Nat Biotechnol 28, 977–982 (2010). https://doi.org/10.1038/nbt.1672

Download citation

Received: 22 February 2010
Accepted: 30 July 2010
Published: 29 August 2010
Issue Date: September 2010
DOI: https://doi.org/10.1038/nbt.1672

This article is cited by

The sustainable approach of microbial bioremediation of arsenic: an updated overview
- A. Khan
- I. Asif
- M. A. E. Naiel
International Journal of Environmental Science and Technology (2024)
Genome insights into the plant growth-promoting bacterium Saccharibacillus brassicae ATSA2T
- Lingmin Jiang
- Jiyoon Seo
- Jiyoung Lee
AMB Express (2023)
Amino acid auxotrophies in human gut bacteria are linked to higher microbiome diversity and long-term stability
- Svenja Starke
- Danielle M M Harris
- Silvio Waschina
The ISME Journal (2023)
Genome-scale metabolic reconstruction of 7,302 human microorganisms for personalized medicine
- Almut Heinken
- Johannes Hertel
- Ines Thiele
Nature Biotechnology (2023)
A genome-scale metabolic model of the effect of dissolved oxygen on 1,3-propanediol fermentation by Klebsiella pneumoniae
- Yang Zhang
- Menglei Yang
- Heng Gong
Bioprocess and Biosystems Engineering (2023)