An updated genome-scale metabolic network reconstruction of Pseudomonas aeruginosa PA14 to characterize mucin-driven shifts in bacterial metabolism

Payne, Dawson D.; Renz, Alina; Dunphy, Laura J.; Lewis, Taylor; Dräger, Andreas; Papin, Jason A.

doi:10.1038/s41540-021-00198-2

Published: 08 October 2021

npj Systems Biology and Applications volume 7, Article number: 37 (2021)

Abstract

Mucins are present in mucosal membranes throughout the body and play a key role in microbe clearance and infection prevention. Understanding the metabolic responses of pathogens to mucins will further enable the development of protective approaches against infections. We update the genome-scale metabolic network reconstruction (GENRE) of one such pathogen, Pseudomonas aeruginosa PA14, through metabolic coverage expansion, format update, extensive annotation addition, and literature-based curation to produce iPau21. We then validate iPau21 through MEMOTE, growth rate, carbon source utilization, and gene essentiality testing to demonstrate its improved quality and predictive capabilities. Finally, we integrate the GENRE with transcriptomic data to generate context-specific models of P. aeruginosa metabolism. The contextualized models recapitulate known phenotypes of unaltered growth and a differential utilization of fumarate metabolism, while also revealing an increased utilization of propionate metabolism upon MUC5B exposure. This work serves to validate iPau21 and demonstrate its utility for providing biological insights.

Introduction

The mucosal barrier is a hydrated mucus gel that lines wet epithelial cells throughout the body, including eyes, mouth, lungs, and the gastrointestinal and urogenital tracts^1,2. It serves as a key mechanism of protection against pathogens. The component responsible for the gel-like properties of the mucosal layer is the glycoprotein mucin³. The dysregulation of mucins underlies diseases like cystic fibrosis⁴ and chronic obstructive pulmonary disorders². As mucins are involved in the clearance of microbes⁵, a dysregulation of mucins can result in pathogen overgrowth and severe infections⁶. While some bacterial species, including pathogenic strains of Pseudomonas³, are capable of residing within the mucosal layer, mucins typically impair the formation of biofilms and surface attachment⁷. Furthermore, mucins are reported to downregulate virulence genes involved in siderophore biosynthesis, quorum sensing, and toxin secretion¹. By disturbing these key mechanisms of infection, mucins attenuate the virulence and infective potential of P. aeruginosa.

Elucidating the metabolic responses of P. aeruginosa to mucins can enable the development of protective approaches against infection⁸. Genome-scale metabolic network reconstructions (GENREs) and associated genome-scale metabolic models (GEMs) are well suited for this purpose as they can enable the prediction of cellular behavior under different biological conditions such as the absence or presence of different mucins in an environment⁹. A GENRE can also be used to contextualize high-throughput data, such as transcriptomics or proteomics data¹⁰. Gene expression data can, for example, be used to constrain specific predicted metabolic fluxes¹¹ and thereby increase the predictive value of the model. Metabolically active pathways under different conditions can be identified by integrating high-throughput data with a metabolic network¹².

P. aeruginosa is a critical bacterial species in the ‘Priority Pathogens List’ for research and development of new antibiotics published by the World Health Organization (WHO)¹³. However, the lack of novel antibiotics^14,15 emphasizes the need for the development of innovative and protective therapeutic approaches. This pressing need for protective strategies, coupled with new insights from recent research, presents an opportunity to further refine the GENRE of the highly virulent strain UCBPP-PA14 by Bartell et al.¹⁶. An updated GENRE can be used to better understand the metabolic underpinnings of P. aeruginosa infections and ultimately develop new therapeutic strategies from those insights.

Here, we present iPau21, an updated GENRE of P. aeruginosa strain UCBPP-PA14 metabolism. We improve predictions of carbon source utilization and growth yields in order to better recapitulate the behavior of the pathogen. Metabolic network coverage is expanded through the addition of genes, reactions, and metabolites supported by literature evidence. The quality of the reconstruction was improved through an update of standardized formatting, improved annotation, and the addition of binning metabolites representing macromolecular categories to assist with analysis. The metabolic network model was validated by comparing phenotypic predictions to experimental datasets^16,17,18,19,20,21, and the quality of the reconstruction was assessed with the MEMOTE benchmarking software²². This updated reconstruction was further contextualized with recently published transcriptomic data¹ in order to demonstrate its utility in elucidating the metabolic shifts of P. aeruginosa after exposure to mucins. The validated reconstructions will serve as a key resource for the Pseudomonas and microbial metabolic modeling communities, and the insights into mucin-driven metabolic shifts in P. aeruginosa may serve to inform the future development of therapeutic strategies.

Results

An updated network reconstruction of Pseudomonas aeruginosa metabolism

A metabolic network reconstruction of P. aeruginosa PA14 (iPau1129) was previously published¹⁶ and served as a starting point for an updated reconstruction (iPau21). The metabolic coverage of the reconstruction was expanded, the format and annotations were updated, and an ATP-generating loop was resolved in order to produce a refined model with improved accuracy and extensive annotation.

We expanded iPau1129 by 40 genes, 24 metabolites, and 76 reactions (Fig. 1a) through manual curation based on literature evidence (Supplementary Data 1). Many of these additions served to increase the utility of the reconstruction for simulation (such as the addition of 33 exchange reactions), while others expanded metabolic pathways for amino acid metabolism and glycerophospholipid metabolism. A periplasmic compartment containing hydrogen was added to the reconstruction to better represent the electron transport chain and ATP synthase, which eliminated all ATP-generating loops in the metabolic network. The format was updated from SBML Level 2²³ to Level 3²⁴, which enables additional functionality such as the utilization of several extension packages and the transfer of information content to dedicated new data structures. Annotations from various databases were added to metabolites, reactions, and genes where possible.

**Fig. 1: Characteristics and MEMOTE benchmarking of iPau21.**

The overall quality of the updated reconstruction was assessed using MEMOTE²², a recently developed GENRE test suite. The MEMOTE score of iPau21 improved in all subcategories when compared to iPau1129 resulting in an increase of the overall score from 30% to 89% (Fig. 1b and Supplementary Materials). The scores in annotation subcategories were increased by adding annotations and SBO terms to metabolites, reactions, and genes in the updated GENRE. The consistency of the metabolic network was improved through the correction of imbalanced reactions and the resolution of energy generating cycles that were present in iPau1129.

The biomass objective function (BOF) was updated to better reflect the macromolecular components found experimentally in P. aeruginosa, including lipopolysaccharide^25,26,27. BOF substrates were organized into corresponding macromolecular categories (i.e., DNA, RNA, protein, lipid) to better represent the categories of components that are required for growth.

Model validation

Validation of iPau21 was performed by comparing in silico predictions of biomass flux, carbon source utilization, and gene essentiality to experimental data. Biomass flux and subsequent doubling time predictions in simulated lysogeny broth (LB), synthetic cystic fibrosis media (SCFM), and glucose minimal media were compared to experimental values found in literature (Fig. 2a, Supplementary Data 2)^17,18,19. Doubling time predictions of iPau21 were 25%, 19%, and 22% more accurate than those of iPau1129 in simulated LB, SCFM, and glucose minimal media, respectively. Compared to the original model, iPau21 doubling times are higher, which reflects the resolution of the ATP-generating loop that previously allowed the model to costlessly convert ADP to ATP. The iPau21 doubling time prediction on glucose minimal media of 40.2 min showed agreement with experimental data, falling within the range of experimentally determined values¹⁹. Model doubling time predictions on LB and SCFM were faster than observed experimentally, which is consistent with metabolic network models that are structured to predict the optimal growth of an organism.
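The conversion from a predicted biomass flux to a doubling time can be sketched as follows. This is a minimal illustration rather than the authors' code: it assumes the biomass flux is a specific growth rate in units of 1/h (the usual convention for flux balance models) and that growth is exponential, so the doubling time is ln(2) divided by the growth rate. The function names are hypothetical.

```python
import math

def doubling_time_minutes(mu_per_hour: float) -> float:
    """Convert a specific growth rate (1/h) into a doubling time in minutes.

    Assumes exponential growth, i.e. t_d = ln(2) / mu.
    """
    if mu_per_hour <= 0:
        raise ValueError("growth rate must be positive")
    return math.log(2) / mu_per_hour * 60.0

def growth_rate_per_hour(t_d_minutes: float) -> float:
    """Inverse conversion: doubling time in minutes to growth rate in 1/h."""
    if t_d_minutes <= 0:
        raise ValueError("doubling time must be positive")
    return math.log(2) / (t_d_minutes / 60.0)

# Growth rate implied by the 40.2 min doubling time reported in the text.
mu_glucose = growth_rate_per_hour(40.2)
```

Under these assumptions, the 40.2 min doubling time on glucose minimal media corresponds to a biomass flux slightly above 1 per hour.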

**Fig. 2: Updated reconstruction of *P. aeruginosa* enables accurate growth rate, gene essentiality, and carbon source utilization predictions.**

Carbon source utilization predictions were compared to previously collected experimental results across 91 carbon sources¹⁶. Utilization was predicted by iPau21 with an accuracy of 89% and a Matthews correlation coefficient (MCC) of 0.78, while iPau1129 demonstrated an accuracy of 80% and an MCC of 0.62 (Fig. 2b and Supplementary Data 3). This increase in accuracy was achieved through the completion of pathways that allow for the utilization of more carbon sources and the removal of an unsupported reaction that previously allowed for the utilization of D-malate. Carbon source predictions of iPau21 remain incorrect for 10 carbon sources. Five of the incorrect predictions are due to the absence of metabolic pathways required for growth on certain carbon sources. Our literature survey did not provide sufficient evidence for these pathways, so we opted not to gapfill them and the predictions remain incorrect. To correct these predictions, summary reactions could be added to the reconstruction, but these reactions would lack the mechanistic granularity of associated genes and could have negative impacts on other aspects of the reconstruction. The other five incorrect predictions were caused by the presence of metabolic pathways that erroneously allow growth on the associated carbon sources. In each of these cases, the pathway was investigated and the corresponding genes were verified through the KEGG²⁸ and ModelSEED²⁹ databases, but there was not strong enough evidence to warrant changes in the reconstruction^28,30. Some of these discrepancies may be due to considerations that are outside of the scope of the network, such as transcriptional processes. For example, in the case of D-serine, PA14 has the ability to metabolize this carbon source, but expression of the relevant gene (dsdA) is not triggered by the presence of D-serine, so PA14 is unable to grow on it as a single carbon source in vitro³¹. These inaccurate predictions could be improved by modifying constraints in the metabolic network model. However, since the gene-protein-reaction (GPR) rules were found to be valid and the prediction error could be due to unaccounted-for regulatory control, we opted to leave the pathways intact. Overall, we were able to increase carbon source utilization prediction accuracy by 9 percentage points in comparison to the previously published model.

Gene essentiality predictions were compared to a published dataset comprising the overlap of essential genes identified through the growth of strains PAO1 and PA14 transposon insertion mutants in LB media^20,21. The number of genes accounted for by iPau21 was expanded to 1169 and the gene essentiality prediction accuracy was maintained at 91%, which is equivalent to iPau1129 (Supplementary Data 4). Gene essentiality was predicted by iPau21 with an MCC of 0.50, compared to a value of 0.44 by iPau1129. Three genes labeled as “SPONTANEOUS,” “unassigned,” and “Unassigned” were removed from the reconstruction given that these labels did not correspond to genes belonging to P. aeruginosa. Gene essentiality data was not used for curation of the metabolic network given the variability in gene essentiality screens and the resultant challenges with data interpretation³². Instead, model predictions were compared to gene essentiality data as one facet of validation. As a reference, the 91% gene essentiality prediction accuracy of iPau21 is near the 93% accuracy of iML1515, a well-curated reconstruction of Escherichia coli³³.
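In silico gene essentiality screens of this kind typically rely on evaluating each reaction's GPR rule after a gene is removed. The sketch below illustrates only that step and is not the authors' implementation: the rule syntax (gene identifiers joined by "and", "or", and parentheses), the gene names, and the function name are all assumptions made for illustration. A reaction disabled by a knockout would then have its flux bounds set to zero before growth is simulated.

```python
import re

def reaction_active(gpr_rule: str, knocked_out: set) -> bool:
    """Return True if a reaction can still be catalyzed after gene knockouts.

    'and' models a protein complex (all subunits required); 'or' models
    isozymes (any one suffices). An empty rule means no gene is required.
    """
    if not gpr_rule.strip():
        return True
    parts = []
    for token in re.findall(r"\(|\)|[^\s()]+", gpr_rule):
        if token in ("(", ")"):
            parts.append(token)
        elif token.lower() in ("and", "or"):
            parts.append(token.lower())
        else:
            # Any other token is treated as a gene identifier.
            parts.append("False" if token in knocked_out else "True")
    # Only True/False, and/or, and parentheses reach eval; builtins are disabled.
    return bool(eval(" ".join(parts), {"__builtins__": {}}, {}))

# Hypothetical rule: a two-subunit complex with an alternative isozyme.
rule = "(geneA and geneB) or geneC"
still_active = reaction_active(rule, {"geneA"})           # isozyme geneC remains
lost = reaction_active(rule, {"geneA", "geneC"})          # no route left
```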

Transcriptome-guided modeling of P. aeruginosa metabolism in the presence of human mucins

Mucins are the primary macromolecules in mucosal layers known to modulate microbial phenotypes². In order to investigate how the metabolism of P. aeruginosa shifts when it comes into contact with mucins, in vitro transcriptomic data was integrated with iPau21 to generate contextualized models that offer more biologically accurate representations of associated metabolic phenotypes. Analysis of the structure and pathway utilization in these transcriptome-guided models offers insights into the metabolic shifts that arise when P. aeruginosa is exposed to mucins.

Transcriptomic profiles of P. aeruginosa PAO1 grown in agrobacterium minimal medium with thiamine, glucose, and casamino acids (ABTGC) supplemented with MUC5AC, MUC5B, or mucin-glycans were collected from literature^1,34. MUC5AC and MUC5B are mucin types found both individually and together at different sites of the human body that P. aeruginosa is known to infect⁸. The mucin-glycans used in the published experiments were isolated from the backbone of MUC5AC. The experiments were performed with strain PAO1, which has a highly similar genome to strain PA14³⁵. The main difference between the strains is the presence of additional gene clusters in PA14 (most linked to virulence) that we would not expect to have a large effect on overall metabolism. PAO1 genes in the transcriptomic dataset were mapped to PA14 orthologs and the data was then integrated with iPau21 using the RIPTiDe algorithm³⁶. RIPTiDe uses transcriptomic evidence to create context-specific metabolic models representative of a parsimonious metabolism consistent with the transcriptional investments of an organism. This analysis resulted in four contextualized models that more accurately represent the metabolism of P. aeruginosa when grown without mucin exposure (ABTGC) and when exposed to MUC5AC, MUC5B, and glycans.
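The ortholog mapping step described above can be sketched as a simple table lookup. This is an illustrative sketch only: the gene identifiers are placeholders rather than real locus tags, the function name is hypothetical, and summing values when several PAO1 genes map to one PA14 gene is an assumption that the source does not specify.

```python
def map_expression_to_orthologs(expression: dict, ortholog_table: dict):
    """Re-key PAO1 expression values by their PA14 ortholog.

    Returns the mapped expression dict and the list of PAO1 genes that
    have no ortholog in the table (these are dropped).
    """
    mapped, unmapped = {}, []
    for gene, value in expression.items():
        target = ortholog_table.get(gene)
        if target is None:
            unmapped.append(gene)
        else:
            # Assumption: many-to-one mappings are combined by summation.
            mapped[target] = mapped.get(target, 0) + value
    return mapped, unmapped

# Placeholder identifiers and counts, for illustration only.
pao1_expression = {"pao1_gene_a": 120, "pao1_gene_b": 15, "pao1_gene_c": 7}
orthologs = {"pao1_gene_a": "pa14_gene_a", "pao1_gene_b": "pa14_gene_b"}
pa14_expression, dropped = map_expression_to_orthologs(pao1_expression, orthologs)
```

The re-keyed dictionary is the kind of input that a transcriptome integration method would then consume alongside the reconstruction.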

Flux samples were generated for each model and BOF flux did not vary significantly among the contextualized models (less than five percent change), recapitulating the phenotype that was observed experimentally¹. The flux distributions underlying the BOF values were compared across models using non-metric multidimensional scaling (NMDS) in order to compare the metabolic mechanisms of growth utilized by the condition-specific metabolisms (Fig. 3a). The fluxes from the 378 consensus reactions (shared across all models) were used for this analysis. NMDS analysis revealed that among the tested conditions, the sampled flux distributions from the MUC5B model clustered the furthest from the ABTGC condition. This result indicates that although there was not a significant difference in the BOF value, exposure to MUC5B caused the largest shift in the metabolic pathways utilized for growth. MUC5AC clustered the second furthest away, while Glycans clustered most closely to the ABTGC model, showing that there was a variable metabolic response to different mucins and glycans by P. aeruginosa. Mucin-glycans do not contain the same level of structural and biochemical complexity as MUC5AC and MUC5B, which may account for the slight metabolic shift observed in the Glycans model relative to the MUC5AC and MUC5B models. MUC5AC and MUC5B are known to differ from each other in terms of charge, shape, and glycosylation³⁷. These differences could explain the variable metabolic response they elicit in P. aeruginosa. Additionally, of the two, only MUC5B has been shown to be critical for murine mucociliary transport and antibacterial defense³⁸. One mechanism of MUC5B antibacterial effects could be through modulation of pathogen metabolism, which would explain the larger shift in metabolism observed when P. aeruginosa was exposed to MUC5B. The conserved BOF flux values and separation observed between clusters of flux samples suggest that while P. aeruginosa metabolism is modulated by the presence of mucins, its versatility allows for the utilization of alternative metabolic pathways in order to avoid a growth defect.

**Fig. 3: Contextualization of updated reconstruction shows shifts in *P. aeruginosa* metabolism in response to mucins and mucin components.**

The differences between networks were further investigated through the metabolites that were produced and consumed by models in silico. This analysis offers a snapshot of the substrates used and byproducts of particular metabolic states, which can be informative of the metabolism underlying that state. All models were found to consume the same metabolites, with some small differences in specific flux values; however, there were key differences in the metabolites that the models produced (Supplementary Data 5). The graded differences between models seen in NMDS were highlighted by the production of formate by the models. The ABTGC and Glycans models produced substantially higher amounts of formate than the MUC5AC model, while the MUC5B model did not contain the formate exchange reaction. Therefore, with our model, we are able to predict subtle shifts in P. aeruginosa metabolism in response to different environmental mucins.

Human mucins shift P. aeruginosa metabolism

Further analysis was conducted on the contextualized models to better understand the shifts in metabolism that resulted in the observed dissimilarities in the NMDS analysis. Reactions not shared across all models (non-consensus reactions) were identified and compared to investigate how network structure varies across models (Fig. 3b). This analysis revealed a set of 13 reactions shared by the ABTGC, MUC5B, and Glycans models but absent from the MUC5AC network. This result suggests that while MUC5B displayed the largest functional differences in metabolism, MUC5AC is the most structurally unique of our models. Additionally, we found that there was no correlation between network structure and utilization among our contextualized models (Fig. 4, p-value = 0.92). Since the NMDS analysis revealed that the ABTGC and MUC5B models had the largest difference in functional metabolism, these two models were further investigated to find key attributes that underlie these large differences. Random forest analysis was conducted on the flux samples from consensus reactions of the ABTGC and MUC5B models to find which reactions were most differentially utilized between the two cases (Fig. 5). Two reactions corresponding to fumarate transport were in the top seven most discriminating reactions between models, suggesting that there was a differential utilization of reactions involved in fumarate metabolism. The MUC5B model utilized the fumarate reactions more highly than the ABTGC model and contained a fumarase reaction that was not present in the ABTGC model, which further suggests that fumarate metabolism is a key point of difference between the models. This observation recapitulates what was noted in the original paper that produced the transcript data used for contextualization¹. Of the top six most discriminating reactions, five corresponded to propionate metabolism and were more highly utilized by the MUC5B model than the ABTGC model. While there was no propionate in the simulated (or in vitro) media, it is a known byproduct of mucin fermenters and has anti-lipogenic and anti-inflammatory properties in humans^39,40. This analysis revealed that the exposure to MUC5B elicited the largest shift in metabolism compared to MUC5AC and Glycans. Further, an increased utilization of fumarate and propionate metabolism during simulated growth was responsible for this shift.

**Fig. 4: Network utilization does not correlate with network structure.**

**Fig. 5: Random forest analysis between ABTGC and MUC5B shows the networks differ most in terms of propionate and fumarate metabolism utilization.**

Discussion

We generated an updated network reconstruction of P. aeruginosa PA14 metabolism with considerable improvements in model annotation and accuracy of growth rate and carbon source utilization predictions. The metabolic reaction coverage of the reconstruction was expanded, the format and annotations were updated to be consistent with current best practices, and an ATP-generating loop was resolved. Model improvements were quantified through various metrics such as accuracy of growth yield and carbon source utilization predictions as well as MEMOTE benchmarking²².

The updated network reconstruction was contextualized using transcriptomic data in order to investigate the shifts in metabolism that occur when P. aeruginosa is exposed to mucins present in the human body. This analysis recapitulated an unaltered growth rate and differential fumarate metabolism that has been reported in literature and also revealed an increased utilization of propionate metabolism in the presence of mucins. Propionate is a short chain fatty acid with beneficial effects to human health such as anti-lipogenic, anti-inflammatory, and anti-carcinogenic action^39,41. While propionate is not present in the ABTGC medium, it is known to be produced by bacteria such as Akkermansia muciniphila when they come into contact with and catabolize mucins⁴⁰. This shift of P. aeruginosa metabolism towards propionate metabolism may indicate a cross-feeding mechanism where MUC5B mucins signal to Pseudomonas to prepare to metabolize the propionate produced by other microbes as they break down the mucins. Once validated, this insight could be used to develop therapeutic strategies for P. aeruginosa infections of body sites containing MUC5B such as the lung, oral cavity, and middle ear¹. Antibiotics could be designed to target proteins for propionate metabolism in order to combat drug-resistant strains that cannot be treated with traditional antibiotics.

While the updates broadly improved model accuracy, some incorrect predictions about carbon source utilization, gene essentiality, and growth rate could not be addressed. Some incorrect predictions are due to a lack of literature evidence, such as incorrect carbon source utilization predictions that are due to the absence of metabolic pathways in the model. Other incorrect predictions are due to factors that are outside of the scope of the model, such as the incorrect prediction of growth on D-serine that is caused by the transcriptional regulation of dsdA³¹. There are other opportunities for further curation that would result in additional improvements to the MEMOTE score, which can be further interrogated by uploading the iPau21 reconstruction to the MEMOTE website (memote.io).

The transcriptomic data that was used to contextualize the model was collected through experiments with P. aeruginosa strain PAO1. Therefore, the genes in the transcriptomic dataset were mapped to their PA14 orthologs before being integrated with the network reconstruction. While the genomes of PAO1 and PA14 are highly similar, the PA14 genome (6.5 Mb) is slightly larger than that of PAO1 (6.3 Mb) and contains gene clusters that are not present in PAO1³⁵. The genes absent in PAO1 therefore would not be accounted for in the transcriptomic dataset. However, since most of these genes are linked to virulence, they should not have large effects on overall metabolism as simulated here. Therefore, we expect that this application of the model would allow the identification of broad shifts in metabolism due to exposure to mucins irrespective of the specific strain simulated.

The improvements in the P. aeruginosa metabolic network reconstruction were made to reconcile key disagreements between in silico predictions and in vitro results, ultimately producing a higher quality metabolic network reconstruction. Through the update process, we identified key predictions that remain incorrect and offer targets for further curation. The application of the model to investigate metabolic shifts that occur upon exposure to mucins recapitulated phenotypes observed in literature and offered mechanistic insights that would be difficult to delineate experimentally. This application of the reconstruction serves as an example of how the reconstruction and associated models can provide insights into context-specific metabolism. Ultimately, this reconstruction can serve as a resource for investigating the metabolism of P. aeruginosa in a variety of settings and conditions.

Methods

Genome-scale metabolic reconstructions and models (GENREs and GEMs)

GENREs are network reconstructions that represent the metabolic capabilities of an organism and can be analyzed for various applications. An organism’s genes are connected to the proteins they code for and the reactions that those proteins catalyze. These associations are stored as gene-protein-reaction (GPR) relationships with the reactants and products of each reaction cataloged in a stoichiometric matrix. Metabolites in the reconstruction are assigned to compartments that mirror biologically discrete spaces such as the cytosol and the extracellular space. Exchange and transport reactions allow metabolites to flow between the compartments in the reconstruction. A GENRE is turned into a GEM (Genome-Scale Metabolic Model) by adding reaction bounds that capture the flux constraints and the reversibility of reactions. The flux bounds dictate the amount and direction of flux that a reaction can carry. Objective functions (OFs) that represent metabolic goals are added to the model to simulate biological processes. GEMs can be analyzed using flux balance analysis (FBA)-based methods to investigate and gain insights into the metabolic state of a network⁴². The updated GENRE was named iPau21 according to the community standard naming convention⁴³.
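The roles of the stoichiometric matrix, flux bounds, and objective function can be illustrated on a toy network. The sketch below is not flux balance analysis itself (no optimization is performed); it only checks whether a candidate flux vector satisfies the steady-state condition S·v = 0 and the flux bounds, and evaluates a linear objective. The three-reaction network, the bound values, and all names are made up for illustration.

```python
# Toy network: EX_A imports A, R1 converts A to B, BIOMASS consumes B.
# Rows are metabolites (A, B); columns are reactions (EX_A, R1, BIOMASS).
S = [
    [1, -1, 0],   # A: produced by EX_A, consumed by R1
    [0, 1, -1],   # B: produced by R1, consumed by BIOMASS
]
LOWER = [0.0, 0.0, 0.0]          # all reactions irreversible in this toy case
UPPER = [10.0, 1000.0, 1000.0]   # uptake of A is capped at 10 flux units
OBJECTIVE = [0.0, 0.0, 1.0]      # objective: flux through BIOMASS

def is_feasible(S, v, lower, upper, tol=1e-9):
    """Check steady state (S.v = 0) and flux bounds for a flux vector v."""
    for row in S:
        if abs(sum(s * flux for s, flux in zip(row, v))) > tol:
            return False  # a metabolite would accumulate or be depleted
    return all(lo - tol <= flux <= hi + tol
               for flux, lo, hi in zip(v, lower, upper))

def objective_value(c, v):
    """Linear objective c.v, here the biomass flux."""
    return sum(ci * vi for ci, vi in zip(c, v))

v_ok = [10.0, 10.0, 10.0]
feasible = is_feasible(S, v_ok, LOWER, UPPER)
growth = objective_value(OBJECTIVE, v_ok)
```

In this linear chain, the uptake bound on A is what limits the objective, which is the kind of constraint a linear programming solver exploits when FBA maximizes biomass flux.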

Adding annotations

Initially, the PA14 reconstruction did not contain extensive annotations for metabolites, reactions, or genes. ModelPolisher⁴⁴ can be used to annotate metabolites and reactions of a metabolic model. To do so, identifiers of the BiGG database⁴⁵ (BiGG-IDs) are required as metabolite or reaction identifiers, respectively. Since the identifiers of the model were obtained from the ModelSEED database³⁰, BiGG-IDs needed to be determined. For each metabolite, the BiGG-IDs were assessed manually. Since this is a very time-consuming procedure, the BiGG-IDs for the reactions were resolved in a semi-automated way: The cross-references of the ModelSEED database to other databases, such as BiGG or KEGG²⁸, were used to automatically obtain the BiGG-IDs for the respective ModelSEED reaction identifier. If more than one BiGG-ID was returned, the correct identifier was determined by manual inspection of the respective reaction. The BiGG-IDs of the metabolites and reactions were added as biological qualifier (‘BQB_IS’) annotations to the model using libSBML Version 5.17.0⁴⁶. The annotations were added in accordance with the MIRIAM guidelines⁴⁷. After adding the BiGG-IDs to the model, ModelPolisher was used for further annotations of the model’s reactions and metabolites for references to other databases, such as KEGG, MetaNetX⁴⁸, or MetaCyc⁴⁴.

For the reactions, the obtained KEGG annotations were used to further add all pathways that are associated with the respective reaction to the model. The pathways were obtained using the KEGG-ID and KEGG API to request all associated pathways. The pathways were then added to the respective reactions using the biological qualifier “BQB_OCCURS_IN” in libSBML.

The identifiers of the model genes are from the KEGG database. With the help of libSBML, the KEGG gene annotation was added to the model. For further gene annotations, the KEGG API was used to request NCBI⁴⁹ Protein IDs and Uniprot⁵⁰ IDs, which were subsequently added as respective annotations to the model. Additionally, the ID mapper from PATRIC⁵¹ was used to request RefSeq and NCBI⁴⁹ gene identifiers, as well as identifiers of the ASAP database.

Systems Biology Ontology (SBO)⁵² terms can give semantic information or be used for annotation purposes. In our network reconstruction, all genes were labeled as genes with the SBO-term “SBO:0000243”. All metabolites without a valid SBO-term were labeled as simple chemicals with the SBO-term “SBO:0000247”. Transport reactions were divided into (1) active transport if ATP is required for the respective transport reaction (SBO:0000657), (2) passive transport if no external energy is required (SBO:0000658), (3) symporter-mediated transport if two or more molecules are transported into the same relative direction across a membrane (SBO:0000659), or (4) antiporter-mediated transport if two or more molecules are transported in relative opposite directions across a membrane (SBO:0000660). All metabolic reactions were labeled as biochemical reactions with the SBO-term “SBO:0000176”.
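The transport classification above can be written as a small decision function. This is a hedged sketch: the SBO identifiers are those listed in the text, but the order in which the criteria are checked (ATP requirement first, then the number and direction of transported molecules) and the function signature are assumptions, since the source does not state how overlapping cases were resolved.

```python
SBO_ACTIVE_TRANSPORT = "SBO:0000657"
SBO_PASSIVE_TRANSPORT = "SBO:0000658"
SBO_SYMPORTER = "SBO:0000659"
SBO_ANTIPORTER = "SBO:0000660"

def transport_sbo_term(requires_atp: bool, n_transported: int,
                       same_direction: bool = True) -> str:
    """Pick an SBO term for a transport reaction.

    Assumed precedence: ATP-driven transport is labeled active; otherwise
    co-transport of two or more molecules is labeled symport or antiport
    by relative direction; everything else is labeled passive.
    """
    if n_transported < 1:
        raise ValueError("a transport reaction moves at least one molecule")
    if requires_atp:
        return SBO_ACTIVE_TRANSPORT
    if n_transported >= 2:
        return SBO_SYMPORTER if same_direction else SBO_ANTIPORTER
    return SBO_PASSIVE_TRANSPORT

# Example: two molecules moved in opposite directions without ATP.
term = transport_sbo_term(requires_atp=False, n_transported=2,
                          same_direction=False)
```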

Upgrading SBML version

The initial PA14 reconstruction was represented in SBML Level 2 Version 1⁵³. The current reconstruction was updated to the latest SBML edition (Level 3)⁵⁴. With the help of libSBML, both the fbc-plugin⁵⁵ and the groups-plugin⁵⁶ were enabled. Initially, the chemical formulas and charges of the metabolites were stored in the notes field. With the fbc-plugin, the charges were added as features of the metabolites to the reconstruction. The fbc-plugin also enables the addition of gene products to the reconstruction. In the initial reconstruction, the subsystems of the reactions were saved in the notes field. With libSBML and the groups-plugin, the subsystems were extracted from the notes field and added as groups to the reconstruction. For each subsystem, a list of reactions associated with that pathway according to the notes was created and added to the subsystem as members.

Correcting charge and mass imbalances

A list of all mass- and charge-imbalanced reactions was extracted from the reconstruction. From this list, all exchange, sink, demand and biomass reactions were excluded. Each remaining reaction was manually checked by looking up the reaction-ID in ModelSEED²⁹: (1) If the reaction status in ModelSEED was balanced (“OK”), but differed from the reaction equation in the reconstruction, the reaction was adapted according to ModelSEED and again checked for imbalances. (2) If the reaction in ModelSEED also had an imbalanced reaction status, other databases like MetaCyc⁵⁷, BiGG⁴⁵, or KEGG²⁸ were explored and the reactions were adapted according to the respective reactions in the external databases. Where required, chemical formulas, charges, and coefficients were corrected, or chemical compounds were added or subtracted from the reactions according to the respective database reaction. All changed reactions are listed in Supplementary Data 1.
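A mass and charge balance check of the kind described can be sketched as follows. This is an illustration, not the procedure used for iPau21: the formula parser handles only simple formulas without parentheses or generic groups, the metabolite identifiers are placeholders, and the formulas and charges for the hexokinase example are the commonly used protonation states rather than values taken from the reconstruction.

```python
import re
from collections import Counter

def parse_formula(formula: str) -> Counter:
    """Count atoms in a simple formula such as 'C6H12O6' (no parentheses)."""
    counts = Counter()
    for element, number in re.findall(r"([A-Z][a-z]?)(\d*)", formula):
        counts[element] += int(number) if number else 1
    return counts

def imbalance(stoichiometry: dict, metabolites: dict):
    """Return (net atoms per element, net charge); ({}, 0) means balanced.

    stoichiometry maps metabolite id to coefficient (negative = reactant).
    metabolites maps metabolite id to (formula, charge).
    """
    net_elements, net_charge = Counter(), 0
    for met_id, coeff in stoichiometry.items():
        formula, charge = metabolites[met_id]
        for element, n in parse_formula(formula).items():
            net_elements[element] += coeff * n
        net_charge += coeff * charge
    return {el: n for el, n in net_elements.items() if n != 0}, net_charge

METABOLITES = {
    "glc": ("C6H12O6", 0),
    "atp": ("C10H12N5O13P3", -4),
    "g6p": ("C6H11O9P", -2),
    "adp": ("C10H12N5O10P2", -3),
    "h": ("H", 1),
}
# Hexokinase: glucose + ATP -> glucose 6-phosphate + ADP + H+
hexokinase = {"glc": -1, "atp": -1, "g6p": 1, "adp": 1, "h": 1}
balanced = imbalance(hexokinase, METABOLITES)

# Dropping the proton leaves the reaction one H and one charge unit short.
missing_proton = {k: v for k, v in hexokinase.items() if k != "h"}
unbalanced = imbalance(missing_proton, METABOLITES)
```

A non-empty result points to the element or charge that needs correction, which matches the kind of fix listed in the text (adding or removing compounds such as protons, or correcting formulas and charges).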

Assessing the quality of the reconstruction

MEMOTE is open-source software that provides a measure of model quality²². Every change and improvement of the model was continuously documented and quality-assessed using MEMOTE Version 0.9.11. Full MEMOTE reports are provided for iPau1129, iPau21, and iML1515 (Supplementary Materials). Gene essentiality predictions were compared to a published dataset that was originally used to validate iPau1129¹⁶. This dataset comprises the overlap of essential genes identified through the growth of PAO1 and PA14 transposon insertion mutants in LB media²⁰,²¹. Carbon source utilization predictions were compared to previously collected experimental results¹⁶. Prediction accuracy was calculated as the number of correct predictions divided by the total number of predictions. The Matthews correlation coefficient (MCC) was calculated to assess the quality of predictions⁵⁸. Biomass flux and subsequent doubling time predictions in lysogeny broth (LB), synthetic cystic fibrosis media (SCFM), and glucose minimal media were compared to experimental values found in the literature (Fig. 1c)¹⁷,¹⁸,¹⁹.
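The two prediction metrics, and the conversion from biomass flux to doubling time, can be written compactly. This is a minimal sketch: the function names are illustrative, the MCC is the standard confusion-matrix form, and the doubling-time conversion assumes the biomass flux is a specific growth rate in h⁻¹ (the usual convention, not stated explicitly in the text).

```python
import math


def accuracy(tp, tn, fp, fn):
    """Fraction of predictions that agree with the experiment."""
    return (tp + tn) / (tp + tn + fp + fn)


def matthews_corrcoef(tp, tn, fp, fn):
    """Matthews correlation coefficient from confusion-matrix counts."""
    denominator = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denominator == 0:
        # Conventional value when a row or column of the matrix is empty.
        return 0.0
    return (tp * tn - fp * fn) / denominator


def doubling_time(growth_rate_per_hour):
    """Doubling time in hours, assuming exponential growth at the given rate."""
    return math.log(2) / growth_rate_per_hour
```

With made-up counts of 40 true positives, 40 true negatives, 10 false positives, and 10 false negatives, the accuracy is 0.8 and the MCC is 0.6. Unlike accuracy, the MCC stays informative when essential and non-essential genes are strongly imbalanced.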

Literature-based updates

Previous work identified multiple areas where the original reconstruction (iPau1129) was unable to accurately recapitulate experimental data. This assessment included 18 incorrect carbon source predictions¹⁶ and several incorrect gene essentiality predictions⁵⁹. Pathways and gene-protein-reaction rules related to each incorrect prediction were manually curated to reflect the most recent evidence from literature, KEGG, and MetaCyc. In the absence of sufficient evidence, no changes were made, even if this absence of a change meant a prediction would remain uncorrected.

Evaluating and updating the BOF

Macromolecular categories represented in the dry weight of P. aeruginosa were identified through a literature survey. Metabolites in the biomass objective function (BOF) were organized into these macromolecular categories in order to better represent the components required for growth. During organization, no additional metabolites were added and the ratios of metabolites in the BOF were kept the same.

The BOF was also updated to include lipopolysaccharide (cpd17065) to reflect its presence in Gram-negative bacteria⁶⁰. A metabolite representing biomass was also added to the products of the BOF to represent the accumulation of biomass.

Addition of exchange reactions

A list of all extracellular metabolites in the reconstruction was compiled and compared to a list of all exchange reactions in the reconstruction. Exchange reactions were added for 33 extracellular metabolites that previously did not have one.
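The comparison of the two lists amounts to a set difference. The sketch below assumes each exchange reaction is recorded together with the single metabolite it exchanges; the function name and all identifiers are hypothetical.

```python
def metabolites_missing_exchange(extracellular_metabolites, exchange_reactions):
    """Return extracellular metabolites that no exchange reaction covers.

    exchange_reactions -- {reaction_id: exchanged metabolite id}
    """
    covered = set(exchange_reactions.values())
    return sorted(set(extracellular_metabolites) - covered)


# Hypothetical identifiers for illustration only.
extracellular = ["met_a_e", "met_b_e", "met_c_e"]
exchanges = {"EX_met_a_e": "met_a_e"}
missing = metabolites_missing_exchange(extracellular, exchanges)
```

In this toy case `missing` lists the two metabolites that would each receive a new exchange reaction.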

Removal of energy generating cycles

Exchange reactions were closed and the objective function was set, in turn, to energy dissipation reactions for energy and electron carriers (ATP, NADH, NADPH, FADH₂, and H⁺). With all exchanges closed, no substrate can enter the model, so any nonzero dissipation flux indicates an energy generating cycle. The model was able to generate flux only for the ATP energy dissipation objective function, which indicated that such a cycle existed. The cycle was resolved by adding a periplasm compartment to contain hydrogen involved in the electron transport chain and by correcting the reversibility of four participating reactions.

RIPTiDe contextualization & analysis

Published transcriptomic data was integrated with the model using RIPTiDe³⁶. The transcriptomic data was normalized and then translated from PAO1 genes to the orthologous PA14 genes prior to integration⁶¹. ABTGC medium was simulated in silico and applied to the model (Supplementary Data 2). RIPTiDe was then used to produce the contextualized models for the in vitro media conditions.

NMDS analysis was conducted on flux samples from each contextualized model (n = 500 samples per model) using the Vegan package in R⁶². Only consensus reactions across all four contextualized models were included in the flux sample dataset and a constant was added to each flux value in the dataset to make all data points positive to facilitate comparison. Median fluxes for every reaction in each model are provided in Supplementary Data 6.
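The preprocessing of the flux samples before ordination can be sketched in plain Python. The function name and data layout are assumptions, each model's samples are assumed to share one reaction set, and the constant is chosen here so that the smallest value becomes zero; the text does not state which constant was used.

```python
def consensus_shifted_fluxes(samples_by_model):
    """Keep reactions shared by all models and shift fluxes to be non-negative.

    samples_by_model -- {model_name: [{reaction_id: flux, ...}, ...]}
    Returns (consensus reaction IDs, shift, [(model_name, [fluxes]), ...]).
    """
    # Reactions present in every contextualized model.
    reaction_sets = [set(samples[0]) for samples in samples_by_model.values()]
    consensus = sorted(set.intersection(*reaction_sets))

    # One constant for the whole dataset, chosen so the minimum becomes zero.
    minimum = min(
        sample[reaction]
        for samples in samples_by_model.values()
        for sample in samples
        for reaction in consensus
    )
    shift = -minimum if minimum < 0 else 0.0

    rows = [
        (model, [sample[reaction] + shift for reaction in consensus])
        for model, samples in samples_by_model.items()
        for sample in samples
    ]
    return consensus, shift, rows
```

Adding a single constant to the whole dataset preserves the differences between samples while satisfying dissimilarity measures that require non-negative input.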

Random forest analysis was conducted on flux sampling data (n = 500 samples per model) from the consensus reactions of the ABTGC and MUC5B models using the randomForest package in R⁶³. Reactions that were differentially present in contextualized models were identified and connected to their corresponding metabolic pathways manually.

The Jaccard distance of network structures was calculated by comparing the reactions contained in pairs of networks⁶⁴. The NMDS distance was calculated as the distance between the median NMDS coordinates of network pairs. Spearman’s correlation was used to calculate a p-value for the relationship between network structure and network utilization across all pairs of networks.
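Both distances can be illustrated with standard-library Python. The function names are illustrative, and two details are assumptions because the text does not specify them: the median is taken per NMDS axis, and the distance between medians is Euclidean.

```python
import math
from statistics import median


def jaccard_distance(reactions_a, reactions_b):
    """1 - |A ∩ B| / |A ∪ B| for two sets of reaction IDs."""
    a, b = set(reactions_a), set(reactions_b)
    union = a | b
    if not union:
        return 0.0
    return 1.0 - len(a & b) / len(union)


def median_point_distance(points_a, points_b):
    """Euclidean distance between the axis-wise medians of two point clouds."""
    center_a = [median(axis) for axis in zip(*points_a)]
    center_b = [median(axis) for axis in zip(*points_b)]
    return math.dist(center_a, center_b)
```

For instance, two networks sharing two of four distinct reactions have a Jaccard distance of 0.5. Computing both quantities for every pair of contextualized models yields the paired values that the rank correlation is then applied to.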

Data availability

All data for this project is available on GitHub (https://github.com/dawsonpayne/iPau21). The genome-scale metabolic model iPau21 is available in the BioModels Database⁶⁵ as an SBML Level 3 Version 1 file within a COMBINE Archive OMEX file⁶⁶ including the contextualized models and metadata⁶⁷ at https://identifiers.org/biomodels.db/MODEL2106110001.

Code availability

Code used for the GENRE update and analyses has been archived in a GitHub repository (https://github.com/dawsonpayne/iPau21).

References

Wheeler, K. M. et al. Mucin glycans attenuate the virulence of Pseudomonas aeruginosa in infection. Nat. Microbiol. 4, 2146–2154 (2019).
Co, J. Y. et al. Mucins trigger dispersal of Pseudomonas aeruginosa biofilms. NPJ Biofilms Microbiomes 4, 1–8 (2018).
Bansil, R. & Turner, B. S. Mucin structure, aggregation, physiological functions and biomedical applications. Curr. Opin. Colloid Interface Sci. 11, 164–170 (2006).
Henke, M. O., John, G., Germann, M., Lindemann, H. & Rubin, B. K. MUC5AC and MUC5B mucins increase in cystic fibrosis airway secretions during pulmonary exacerbation. Am. J. Respiratory Crit. Care Med. 175, 816–821 (2007).
Tilley, A. E., Walters, M. S., Shaykhiev, R. & Crystal, R. G. Cilia dysfunction in lung disease. Annu. Rev. Physiol. 77, 379–406 (2015).
Rose, M. C. & Voynow, J. A. Respiratory tract mucin genes and mucin glycoproteins in health and disease. Physiological Rev. 86, 245–278 (2006).
Caldara, M. et al. Mucin biopolymers prevent bacterial aggregation by retaining cells in the free-swimming state. Curr. Biol. 22, 2325–2330 (2012).
Hoffman, C. L., Lalsiamthara, J. & Aballay, A. Host mucin is exploited by Pseudomonas aeruginosa to provide monosaccharides required for a successful infection. mBio 11 (2020).
Fan, J., Qiu, J. & Wei, Q. Extremely rare case of intravascular solitary fibrous tumour in the inferior vena cava with review of the literature. Diagnostic Pathol. 14, 1–6 (2019).
Oberhardt, M. A., Palsson, B. Ø. & Papin, J. A. Applications of genome‐scale metabolic reconstructions. Mol. Syst. Biol. 5, 320 (2009).
Shlomi, T., Cabili, M. N., Herrgård, M. J., Palsson, B. Ø. & Ruppin, E. Network-based prediction of human tissue-specific metabolism. Nat. Biotechnol. 26, 1003–1010 (2008).
Usaite, R., Patil, K. R., Grotkjær, T., Nielsen, J. & Regenberg, B. Global transcriptional and physiological responses of Saccharomyces cerevisiae to ammonium, L-alanine, or L-glutamine limitation. Appl. Environ. Microbiol. 72, 6194–6203 (2006).
World Health Organization WHO Publishes List of Bacteria for Which New Antibiotics Are Urgently Needed (WHO, 2017).
Kmietowicz, Z. Few novel antibiotics in the pipeline, WHO warns. Br. Med. J. (Online) 358 (2017).
Carlet, J. et al. Society’s failure to protect a precious resource: antibiotics. Lancet 378, 369–371 (2011).
Bartell et al. Reconstruction of the metabolic network of Pseudomonas aeruginosa to interrogate virulence factor synthesis. Nat. Commun. 8, 1–13 (2017).
Kwon, T., Huse, H. K., Vogel, C., Whiteley, M. & Marcotte, E. M. Protein-to-mRNA ratios are conserved between Pseudomonas aeruginosa strains. J. Proteome Res. 13, 2370–2380 (2014).
LaBauve, A. E. & Wargo, M. J. Growth and laboratory maintenance of Pseudomonas aeruginosa. Curr. Protoc. Microbiol. 25, 6E–1 (2012).
Oberhardt, M. A., Puchałka, J., Fryer, K. E., Dos Santos, V. A. M. & Papin, J. A. Genome-scale metabolic network analysis of the opportunistic pathogen Pseudomonas aeruginosa PAO1. J. Bacteriol. 190, 2790–2803 (2008).
Liberati, N. T. et al. An ordered, nonredundant library of Pseudomonas aeruginosa strain PA14 transposon insertion mutants. Proc. Natl Acad. Sci. USA 103, 2833–2838 (2006).
Jacobs, M. A. et al. Comprehensive transposon mutant library of Pseudomonas aeruginosa. Proc. Natl Acad. Sci. USA 100, 14339–14344 (2003).
Lieven, C. et al. MEMOTE for standardized genome-scale metabolic model testing. Nat. Biotechnol. 38, 272–276 (2020).
Finney, A. & Hucka, M. Systems biology markup language: Level 2 and beyond. Biochem. Soc. Trans. 31, 1472–1473 (2003).
Keating, S. M. et al. SBML Level 3: an extensible format for the exchange and reuse of biological models. Mol. Syst. Biol. 16, e9110 (2020).
Falla, J. A., Bauda, P. & Block, J. C. Isolation of cell enveloped layers of Pseudomonas fluorescens. J. Microbiological Methods 7, 285–294 (1988).
Yeager, C. M., Bottomley, P. J. & Arp, D. J. Requirement of DNA repair mechanisms for survival of Burkholderia cepacia G4 upon degradation of trichloroethylene. Appl. Environ. Microbiol. 67, 5384–5391 (2001).
Huszczynski, S. M., Lam, J. S. & Khursigara, C. M. The role of Pseudomonas aeruginosa lipopolysaccharide in bacterial pathogenesis and physiology. Pathogens 9, 6 (2020).
Kanehisa, M., Furumichi, M., Sato, Y., Ishiguro-Watanabe, M. & Tanabe, M. KEGG: integrating viruses and cellular organisms. Nucleic Acids Res. 49, D545–D551 (2021).
Seaver, S. M. D. et al. The ModelSEED Biochemistry Database for the integration of metabolic annotations and the reconstruction, comparison and analysis of metabolic models for plants, fungi and microbes. Nucleic Acids Res. 49, D575–D588 (2021).
Henry, C. S. et al. High-throughput generation, optimization and analysis of genome-scale metabolic models. Nat. Biotechnol. 28, 977–982 (2010).
Li, G. & Lu, C. D. The cryptic dsdA gene encodes a functional D-Serine dehydratase in Pseudomonas aeruginosa PAO1. Curr. Microbiol. 72, 788–794 (2016).
Blazier, A. S. & Papin, J. A. Reconciling high-throughput gene essentiality data with metabolic network reconstructions. PLoS Computational Biol. 15, e1006507 (2019).
Monk, J. M. et al. iML1515, a knowledgebase that computes Escherichia coli traits. Nat. Biotechnol. 35, 904–908 (2017).
Tan, S. Y. et al. Identification of five structurally unrelated quorum-sensing inhibitors of Pseudomonas aeruginosa from a natural-derivative database. Antimicrobial Agents Chemother. 57, 5629–5641 (2013).
Lee et al. Genomic analysis reveals that Pseudomonas aeruginosa virulence is combinatorial. Genome Biol. 7, 1–14 (2006).
Jenior, M. L., Moutinho, T. J. Jr, Dougherty, B. V. & Papin, J. A. Transcriptome-guided parsimonious flux analysis improves predictions with metabolic networks in complex environments. PLoS Computational Biol. 16, e1007099 (2020).
Bonser, L. R. & Erle, D. J. Airway mucus and asthma: the role of MUC5AC and MUC5B. J. Clin. Med. 6, 112 (2017).
Roy et al. Muc5b is required for airway defence. Nature 505, 412–416 (2014).
Hosseini, E., Grootaert, C., Verstraete, W. & Van de Wiele, T. Propionate as a health-promoting microbial metabolite in the human gut. Nutr. Rev. 69, 245–258 (2011).
El Hage, R., Hernandez-Sanabria, E., Calatayud Arroyo, M., Props, R. & Van de Wiele, T. Propionate-producing consortium restores antibiotic-induced dysbiosis in a dynamic in vitro model of the human intestinal microbial ecosystem. Front. Microbiol. 10, 1206 (2019).
Vinolo, M. A., Rodrigues, H. G., Nachbar, R. T. & Curi, R. Regulation of inflammation by short chain fatty acids. Nutrients 3, 858–876 (2011).
Raman, K. & Chandra, N. Flux balance analysis of biological systems: applications and challenges. Brief. Bioinforma. 10, 435–449 (2009).
Carey, M. A., Dräger, A., Beber, M. E., Papin, J. A. & Yurkovich, J. T. Community standards to facilitate development and address challenges in metabolic modeling. Mol. Syst. Biol. 16, e9235 (2020).
Römer et al. ZBIT bioinformatics toolbox: a web-platform for systems biology and expression data analysis. PLoS ONE 11, e0149263 (2016).
Norsigian et al. BiGG Models 2020: multi-strain genome-scale models and expansion across the phylogenetic tree. Nucleic Acids Res. 48, D402–D406 (2020).
Bornstein, B. J., Keating, S. M., Jouraku, A. & Hucka, M. LibSBML: an API library for SBML. Bioinformatics 24, 880–881 (2008).
Juty, N., Le Novere, N. & Laibe, C. Identifiers.org and MIRIAM Registry: community resources to provide persistent identification. Nucleic Acids Res. 40, D580–D586 (2012).
Moretti, S., Tran, V. D. T., Mehl, F., Ibberson, M. & Pagni, M. MetaNetX/MNXref: unified namespace for metabolites and biochemical reactions in the context of metabolic models. Nucleic Acids Res. 49, D570–D574 (2021).
Sayers et al. Database resources of the national center for biotechnology information. Nucleic Acids Res. 47, D23 (2019). Database issue.
The UniProt Consortium. UniProt: The universal protein knowledgebase in 2021. Nucleic Acids Res. 49, D480–D489 (2021).
Wattam et al. Improvements to PATRIC, the all-bacterial bioinformatics database and analysis resource center. Nucleic Acids Res. 45, D535–D542 (2017).
Courtot et al. Controlled vocabularies and semantics in systems biology. Mol. Syst. Biol. 7, 543 (2011).
Finney, A., Hucka, M., & Le Novère, N. Systems Biology Markup Language (SBML) Level 2: Structures and Facilities for Model Definitions. Retrieved from COMBINE (2006), http://identifiers.org/combine.specifications/sbml.level-2.version-2.
Hucka et al. The systems biology markup language (SBML): Language specification for level 3 version 1 core. J. Integr. Bioinform. 15 (2018).
Olivier, B. G. & Bergmann, F. T. SBML level 3 package: flux balance constraints version 2. J. Integr. Bioinform. 15 (2018).
Hucka, M. & Smith, L. P. SBML level 3 package: groups, version 1 release 1. J. Integr. Bioinform. 13, 290 (2016).
Caspi et al. The MetaCyc database of metabolic pathways and enzymes-a 2019 update. Nucleic Acids Res. 48, D445–D453 (2020).
Matthews, B. W. Comparison of the predicted and observed secondary structure of T4 phage lysozyme. Biochim. Biophys. Acta (BBA)-Protein Struct. 405, 442–451 (1975).
Dunphy, L. J., Yen, P. & Papin, J. A. Integrated experimental and computational analyses reveal differential metabolic functionality in antibiotic-resistant Pseudomonas aeruginosa. Cell Syst. 8, 3–14 (2019).
Darveau, R. P. & Hancock, R. E. Procedure for isolation of bacterial lipopolysaccharides from both smooth and rough Pseudomonas aeruginosa and Salmonella typhimurium strains. J. Bacteriol. 155, 831 (1983).
Winsor et al. Enhanced annotations and features for comparing thousands of Pseudomonas genomes in the Pseudomonas genome database. Nucleic Acids Res. 44, D646–D653 (2016).
Oksanen et al. Community ecology package. R package version, 2 (2013).
Liaw, A. & Wiener, M. Classification and regression by randomForest. R News 2, 18–22 (2002).
Hardle, W. K. & Simar, L. In Applied Multivariate Statistical Analysis (eds. Hardle, W. K. & Simar, L.) 3rd edn. (Springer, 2012).
Malik-Sheriff, R. S. et al. BioModels—15 years of sharing computational models in life science. Nucleic Acids Res. 48, D407–D415 (2020).
Bergmann, F. T. et al. COMBINE archive and OMEX format: One file to share all information to reproduce a modeling project. BMC Bioinform. 15, 369 (2014).
Neal, M. L. et al. Harmonizing semantic annotations for computational models in biology. Brief. Bioinform. 20, 540–550 (2019).

Acknowledgements

The authors would like to thank our colleagues Jennifer Bartell and Anna Blazier for their previous work on iPau1129 and their helpful suggestions for our curation of iPau21. This work is supported with U.S. federal funds from the National Institutes of Health (R01 AI154242) and funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Germany’s Excellence Strategy–EXC 2124–390838134 within the Cluster of Excellence CMFI (Controlling Microbes to Fight Infections). A.D. is funded by the German Center for Infection Research (DZIF) within the Deutsche Zentren der Gesundheitsforschung (BMBF-DZG, German Centers for Health Research of the Federal Ministry of Education and Research), grant no. 8020708703.

Author information

These authors contributed equally: Alina Renz, Laura J. Dunphy.

Authors and Affiliations

Department of Biomedical Engineering, University of Virginia, Charlottesville, VA, USA
Dawson D. Payne, Laura J. Dunphy & Jason A. Papin
Institute for Bioinformatics and Medical Informatics (IBMI), University of Tübingen, Tübingen, Germany
Alina Renz & Andreas Dräger
Department of Computer Science, University of Tübingen, Tübingen, Germany
Alina Renz & Andreas Dräger
Cluster of Excellence ‘Controlling Microbes to Fight Infections’, University of Tübingen, Tübingen, Germany
Alina Renz & Andreas Dräger
Department of Chemical Engineering, University of Massachusetts Amherst, Amherst, MA, USA
Taylor Lewis
German Center for Infection Research (DZIF) partner site, Tübingen, Germany
Andreas Dräger

Contributions

L.D. and J.P. conceived the project. D.P., A.R., L.D. and T.L. researched for GENRE updates. D.P., A.R. and L.D. prepared the GENRE. D.P. ran simulations with the GENRE and produced/analyzed contextualized models. D.P., L.D. and J.P. interpreted the results. D.P., A.R. and L.D. wrote the manuscript. All authors critically revised the manuscript and approved the final version.

Corresponding author

Correspondence to Jason A. Papin.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Data 1

Supplementary Data 2

Supplementary Data 3

Supplementary Data 4

Supplementary Data 5

Supplementary Data 6

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Payne, D.D., Renz, A., Dunphy, L.J. et al. An updated genome-scale metabolic network reconstruction of Pseudomonas aeruginosa PA14 to characterize mucin-driven shifts in bacterial metabolism. npj Syst Biol Appl 7, 37 (2021). https://doi.org/10.1038/s41540-021-00198-2

Received: 10 March 2021
Accepted: 17 September 2021
Published: 08 October 2021
DOI: https://doi.org/10.1038/s41540-021-00198-2
