Genome-scale reconstructions of the mammalian secretory pathway predict metabolic costs and limitations of protein secretion

Gutierrez, Jahir M.; Feizi, Amir; Li, Shangzhong; Kallehauge, Thomas B.; Hefzi, Hooman; Grav, Lise M.; Ley, Daniel; Baycin Hizal, Deniz; Betenbaugh, Michael J.; Voldborg, Bjorn; Faustrup Kildegaard, Helene; Min Lee, Gyun; Palsson, Bernhard O.; Nielsen, Jens; Lewis, Nathan E.

doi:10.1038/s41467-019-13867-y

Download PDF

Article
Open access
Published: 02 January 2020

Genome-scale reconstructions of the mammalian secretory pathway predict metabolic costs and limitations of protein secretion

Jahir M. Gutierrez^1,2^na1,
Amir Feizi³^na1,
Shangzhong Li^1,2,
Thomas B. Kallehauge⁴,
Hooman Hefzi^1,2,
Lise M. Grav⁴,
Daniel Ley^4,5,
Deniz Baycin Hizal⁶,
Michael J. Betenbaugh⁷,
Bjorn Voldborg⁴,
Helene Faustrup Kildegaard⁴,
Gyun Min Lee ORCID: orcid.org/0000-0002-9876-7378⁴,
Bernhard O. Palsson ORCID: orcid.org/0000-0003-2357-6785^1,2,4,8,
Jens Nielsen ORCID: orcid.org/0000-0002-9955-6003^3,4 &
…
Nathan E. Lewis ORCID: orcid.org/0000-0001-7700-3654^1,2,8

Nature Communications volume 11, Article number: 68 (2020) Cite this article

16k Accesses
66 Citations
68 Altmetric
Metrics details

Subjects

Abstract

In mammalian cells, >25% of synthesized proteins are exported through the secretory pathway. The pathway complexity, however, obfuscates its impact on the secretion of different proteins. Unraveling its impact on diverse proteins is particularly important for biopharmaceutical production. Here we delineate the core secretory pathway functions and integrate them with genome-scale metabolic reconstructions of human, mouse, and Chinese hamster ovary cells. The resulting reconstructions enable the computation of energetic costs and machinery demands of each secreted protein. By integrating additional omics data, we find that highly secretory cells have adapted to reduce expression and secretion of other expensive host cell proteins. Furthermore, we predict metabolic costs and maximum productivities of biotherapeutic proteins and identify protein features that most significantly impact protein secretion. Finally, the model successfully predicts the increase in secretion of a monoclonal antibody after silencing a highly expressed selection marker. This work represents a knowledgebase of the mammalian secretory pathway that serves as a novel tool for systems biotechnology.

Pooled multicolour tagging for visualizing subcellular protein dynamics

Article Open access 19 April 2024

scGPT: toward building a foundation model for single-cell multi-omics using generative AI

Article 26 February 2024

Luciferase- and HaloTag-based reporter assays to measure small-molecule-induced degradation pathway in living cells

Article 18 April 2024

Introduction

To interact with their environment, cells produce numerous signaling proteins, hormones, receptors, and structural proteins. In mammals, these include at least 2641 secreted proteins (e.g., enzymes, hormones, antibodies, extracellular matrix proteins) and >5500 membrane proteins¹, most of which are synthesized and processed in the secretory pathway.

The secretory pathway consists of a complex series of processes that predominantly take place in the endoplasmic reticulum (ER), Golgi apparatus, and the endomembrane system. This pathway is particularly important in biotechnology and the biopharmaceutical industry, since most therapeutic proteins are produced in mammalian cell lines such as HEK-293, PerC6, NS0, and Chinese hamster ovary (CHO) cells, which are capable of folding and adding the necessary post-translational modifications (PTMs) to the target product². For any given biotherapeutic, different machinery in the secretory pathway may be needed, and each step can exert a non-negligible metabolic demand on the cells. The complexity of this pathway, however, makes it unclear how the biosynthetic cost and cellular needs vary for different secreted proteins, each of which exerts different demands for cellular resources. Therefore, a detailed understanding of the biosynthetic costs of the secretory pathway could guide efforts to engineer host cells and bioprocesses for any desired product. The energetic and material demands of the mammalian secretory pathway can be accounted for by substantially extending the scope of metabolic models. Indeed, recent studies have incorporated portions of the secretory pathway in metabolic models of yeast^3,4,5. Furthermore, Lund et al.⁶ reconstructed a genetic interaction network of the mouse secretory pathway and the unfolded protein response and analyzed it in the context of CHO cells. However, such a network does not encompass a stoichiometric reconstruction of the biochemical reactions involved in the secretory pathway nor it is coupled to existing metabolic networks of mammalian cells.

Here, we present the first genome-scale stoichiometric reconstructions and computational models of mammalian metabolism coupled to protein secretion. Specifically, we constructed these for human, mouse, and CHO cells, called RECON2.2s, iMM1685s, and iCHO2048s, respectively. We first derive an expression for computing the energetic cost of synthesizing and secreting a product in terms of molecules of ATP equivalents per protein molecule. We use this expression and analyze how the energetic burden of protein secretion has led to an overall suppression of more expensive secreted host cell proteins in mammalian cells. Given its dominant role in biotherapeutic production, we further focus on the biosynthetic capabilities of CHO cells. We then demonstrate that product-specific secretory pathway models can be built to estimate CHO cell growth rates given the specific productivity of the recombinant product as a constraint. We identify the features of secreted proteins that have the highest impact on protein cost and productivity rates. Finally, we use our model to identify proteins that compete for cell resources, thereby presenting targets for cell engineering. Through this study we demonstrate that a systems-view of the secretory pathway now enables the analysis of many biomolecular mechanisms controlling the efficacy and cost of protein expression in mammalian cells. We envision our models as valuable tools for the study of normal physiological processes and engineering cell bioprocesses in biotechnology. All models and data used in this study are freely available at https://github.com/LewisLabUCSD/MammalianSecretoryRecon.

Results

A stoichiometric expression of protein secretion energetics

In any cell, the secretory machinery is concurrently processing thousands of secreted and membrane proteins, which all compete for secretory pathway resources and pose a metabolic burden. To quantify this burden, we estimated the energetic cost of synthesizing and/or secreting 5641 and 3538 endogenous proteins in the CHO and human secretome and membrane proteome in terms of total number of ATP equivalent molecules consumed (see Methods). These protein costs were compared to the cost of five recombinant proteins commonly produced in CHO cells (Fig. 1a). To refine estimates, we predicted signal peptides⁷, GPI anchor attachment signals⁸, and experimentally measured the number of N-linked glycans in the CHO proteome and integrated published numbers of O-linked glycans in CHO proteomic data⁹. Across the CHO secretome, protein synthesis cost varies substantially, and recombinant products are on average more expensive (Fig. 1a). For example, Factor 8 (F8) is a difficult-to-express protein in CHO cells due to its propensity to aggregate in the ER, which promotes its premature degradation^10,11. Our analysis further highlights that each molecule of F8 requires a large amount of ATP for its production (9488 ATP molecules). This imposes a significant burden to the secretory machinery of CHO cells, which typically express much less expensive endogenous proteins.

**Fig. 1: Mammalian secretory cells preferentially suppress more expensive proteins.**

Recombinant cells suppress expression of expensive proteins

With the broad range of biosynthetic costs for different proteins, we wondered if gene expression in mammalian cells that are tasked with high levels of protein secretion have been influenced by the ATP cost of secreted proteins. That is, have these secretory cells suppressed their protein expression to more efficiently allocate nutrients? To test this, we first looked at CHO cells, which have undergone extensive selection to obtain cells that secrete recombinant proteins at high titer, and then compared different human tissues with a range of secretory capacity.

Unless specific proteins are essential, CHO cells may preferentially suppress energetically expensive proteins. Thus, we analyzed ribosomal profiling (Ribo-Seq) data from a recombinant CHO cell line¹² and compared translation of each transcript against the ATP cost of the associated secreted protein (see Methods). Indeed, there was a significant negative correlation of −0.43 (Spearman R_s, p-value < 1 × 10⁻²⁰) between ribosomal occupancy and ATP cost during early exponential growth phase of culture (Fig. 1b). Wondering if the reduced translation was regulated transcriptionally, we further analyzed RNA-Seq data from the same recombinant cell line and from another, non-recombinant CHO-K1 cell line¹³. The RNA expression also negatively correlated with ATP cost (see Supplementary Fig. 2).

To evaluate if this is a general trend in mammalian secretory cells, we analyzed RNA-Seq data from human tissues and immortalized cell lines in the Human Protein Atlas (HPA)¹. For all RNA-Seq datasets in the HPA, there was a negative correlation between mRNA expression levels and ATP cost (Fig. 1d). Interestingly, we found that highly secretory tissues such as liver, pancreas and salivary gland had the strongest correlations, although none as strong as that of the recombinant CHO cells, which have undergone selection of high secretion. Feizi et al.¹⁴ recently found that these tissues fine-tune the expression of protein disulfide isomerase genes, suggesting that a similar regulatory process may take place in the ER of CHO cells as the secreted monoclonal antibody (mAb) contains a relatively high number (17) of disulfide bonds. In conclusion, there is a clear preference in CHO and native secretory tissues to suppress the expression and translation of proteins that are costly to synthesize, fold, and secrete.

In silico reconstruction of the mammalian secretory pathway

We mapped out the core processes involved in the synthesis of secreted and membrane proteins in mammalian cells (i.e., human, mouse, and Chinese hamster). This included 261 components (gene products) in CHO cells and 271 components in both human and mouse. The components are involved in secretory reactions across 12 subsystems (i.e., functional modules of the secretory pathway; Fig. 2a). These components represent the core secretory machinery needed in the transition of a target protein from its immature state in the cytosol (i.e., right after translation) to its final form (i.e., when it contains all post-translational modifications and is secreted to the extracellular space). Each component in the reconstruction either catalyzes a chemical modification on the target protein (e.g., N-linked glycosylation inside ER lumen/Golgi) or participates in a multi-protein complex that promotes protein folding and/or transport. This distinction between catalytic enzymes and complex-forming components is important for modeling purposes as a catalytic component consumes or produces metabolites that are directly connected to the metabolic network (e.g., ATP, sugar nucleotides). As all components of the core secretory pathway were conserved across human, mouse and hamster (Fig. 2b), we generated species-specific secretory pathway reconstructions and used them to expand the respective genome-scale metabolic networks (Recon 2.2¹⁵, iMM1415¹⁶, iCHO1766¹⁷). Following the naming convention of their metabolic counterparts, we named these new metabolic-secretory reconstructions as follows: iMM1685s, iCHO2048s, and Recon 2.2s, which account for 1685, 2048, and 1946 genes, respectively. A detailed list of the components, reactions and the associated genes can be found in the Supplementary Data 1.

**Fig. 2: Components in the reconstruction of the secretory pathway in mammalian cells.**

Validation of iCHO2048s growth and productivity predictions

We first validated the accuracy of iCHO2048s predictions using growth and specific productivity rates of IgG-producing CHO cell lines from two independent studies^12,18. For this, we built an IgG-secreting iCHO2048s model using the information in the PSIM matrix for the therapeutic mAb Rituximab. We then constrained the model’s Rituximab-specific secretory pathway with the reported productivity value in each study and used FBA to predict growth (Fig. 3a). Later, to assess the ability of iCHO2048s to predict growth rates in cases when CHO cells are producing non-antibody proteins, we collected data from two batch culture experiments using Enbrel- and C1-inhibitor-producing isogenic CHO cell lines. We constructed two iCHO2048s models for each case and predicted growth rates during the early exponential growth phase of culture while constraining the protein secretion rate to the measured specific productivity value (Fig. 3b). The model predictions agreed well with the reported and measured values. There were cases where iCHO2048s predicted a much higher growth rate than what was measured in the first days of batch culture (Fig. 3b). Since FBA computes theoretical maximum growth rates given a set of constraints, these over-prediction cases point at situations where CHO cells do not direct resources towards biomass production (during very early stages of culture), a discrepancy that is attenuated in later stages of culture. In conclusion, these results confirm the ability of protein-specific reconstructions to capture the specific energetic requirements that each recombinant product imposes on CHO cell metabolism.

**Fig. 3: Recombinant protein-producing models of iCHO2048s predict measured growth rates.**

Protein composition impacts predicted productivity

To produce a specific product, CHO cells may utilize different modules of the secretory pathway based on the protein attributes and post-translational modifications (PTMs). For example, the synthesis of a mAb requires the use of multiple processes and consumes several different metabolites, such as amino acids for protein translation, redox equivalents for forming disulfide bonds, ATP equivalents for vesicular transport, and sugar nucleotides for protein glycosylation (Fig. 2c). Therefore, we generated eight product-specific secretory pathway models for biotherapeutics commonly produced in CHO cells (Fig. 4a): bone morphogenetic proteins 2 and 7 (BMP2, BMP7), erythropoietin (EPO), Enbrel, factor VIII (F8), interferon beta 1a (IFNB1), Rituximab, and tissue plasminogen activator (tPA). The resulting iCHO2048s models were used to compute Pareto optimality frontiers between maximum cell growth (μ) and specific productivity (q_P), given the same measured glucose and amino acid uptake rates for each model¹⁷ (see Supplementary Data 3).

**Fig. 4: Construction of product-specific iCHO2048s models.**

We computed the trade-off between growth rate (h^–1) and specific productivity (picogram of protein produced per cell per day, or PCD) as a Pareto optimal curve for each protein (Fig. 4b). This curve defines the frontier of maximum specific productivity and maximum growth rates under the assumption that CHO cells can utilize all available resources towards production of biomass and recombinant protein only. The hinges in some of the curves are indicative of a transition between regions that are limited by distinct protein requirements (e.g., amino acids).

An analysis of the Pareto optimal curves for the eight biotherapeutics demonstrates that under the measured growth conditions, maximum productivities vary from 20–100 PCD at common growth rates (Fig. 4b, shaded region) to 70–150 PCD for senescent CHO cells. Neither the molecular weight (MW) nor product length can explain the twofold range differences in maximum productivity for different proteins. For example, the curves show tPA (MW = 61,917 Da) can express at higher PCD than BMP2 (MW = 44,702 Da) despite being larger, because the N-glycans in BMP2 reduce productivity due to the higher cost of synthesizing core N-glycans (see Table 1), consistent with previous observations in yeast⁵. Furthermore, the degree and directionality of these effects will depend on the nutrient uptake rates (Fig. 4c and Supplementary Fig. 1), highlighting the need in CHO bioprocessing to tailor culture media in a host cell and product-specific manner. Thus, while intuitively larger proteins would be expected to exert more bioenergetic cost on protein secretion, we find that specific compositional attributes of both the recombinant protein and the culture media significantly impact biosynthetic capacity. An in-depth analysis of the effects of PTMs on predicted productivities is provided in Jupyter Notebook C.

Table 1 Protein-specific information matrix of biotherapeutics secreted in eight iCHO2048s models.

Full size table

To further evaluate what functions of the secretory pathway had the greatest impact on the cost of protein synthesis and secretion, we computed secretion rates for 5461 proteins in the CHO secretome (see Methods) using iCHO2048s and its parent metabolic reconstruction iCHO1766¹⁷. While iCHO2048s captures all the required steps for protein synthesis, modification and secretion, the secretion reactions in iCHO1766 only account for the basic synthesis of the target protein in cytoplasm, and the synthesis of necessary precursors (N-linked glycans, O-linked glycans, and GPI anchors). We found that the secretory pathway had non-negligible costs on most proteins (Supplementary Fig. 3b). Furthermore, protein features associated with secreted proteins that differ in cost by >15% beyond the amino acid and glycan costs show a statistical enrichment (under the Hypergeometric test) for O-linked glycans (p = 0.0065), GPI anchors (p = 0.0216), transmembrane domains (p = 0.0326), and proteins destined to the ER lumen (p = 0.0142), the Golgi membrane (p = 0.0065), or the plasma membrane (p = 0.0186, see Supplementary Fig. 3d and Jupyter Notebook E). Thus, these PTMs and transmembrane domains exert additional costs to their demands.

iCHO2048s recapitulates results following gene knock-down

In a recent study, Kallehauge et al.¹² demonstrated that a CHO-DG44 cell line producing an antiviral mAb¹⁹ also expressed high levels of the neoR selection-marker gene (Fig. 5a, b). Upon neoR knock-down, the titer and maximum viable cell densities of the CHO-DG44 cell line were increased. To test if iCHO2048s could replicate these results, we constructed a model for the Kallehauge et al. DG44 cell line and measured exometabolomics, and dry cell weight to parameterize the model. Since expression of neoR uses resources that could be used for antibody production, we predicted how much additional antibody could be synthesized with the elimination of the neoR gene. We simulated antibody production following a complete knockout of neoR (see Table 2 and Fig. 5b) and predicted that the deletion of neoR could increase specific productivity by up to 4% and 29% on days 3 (early exponential phase) and 6 (late phase) of culture, respectively (Fig. 5c). This was qualitatively consistent with the experimentally observed values of 2% and 14% when neoR mRNA was knocked down by 80–85%. We then computed the Pareto optimality curves for both the control and the neoR in silico knockout conditions on day 6. We found that the length of the curve (denoted by Δ) increased by 18% when neoR production is eliminated (Fig. 5d). Thus, iCHO2048s can quantify how much non-essential gene knockouts can boost growth and productivity in CHO cells by freeing energetic and secretory resources. In fact, the ribosome-profiling data from Kallehauge et al. revealed that only 30 secretory proteins in CHO cells account for more than 50% of the ribosomal load directed towards translation of protein bearing a signal peptide (Fig. 4e). Indeed, we recently found that substantial resources can be liberated and recombinant protein titers can be increased when 14 high-abundance host cell proteins were knocked out²⁰. An analysis of other potential host cell gene knockouts using the method proposed here can be found in Supplementary Data 4.

**Fig. 5: iCHO2048s recapitulates experimental results of *neoR* knock-down in silico.**

Table 2 Experimental data used for validation of iCHO2048s predictive capabilities.

Full size table

Discussion

Mammalian cells synthesize and process thousands of proteins through their secretory pathway. Many of these proteins, including hormones, enzymes, and receptors, are essential for mediating mammalian cell interactions with their environment. Therefore, many have therapeutic importance either as drugs or as targets. The expression and secretion of recombinant proteins represents a significant anabolic demand that drains several substrates from cellular metabolism (e.g., amino acids, sugar nucleotides, ATP)^21,22. Furthermore, the recombinant proteins demand adequate expression of supporting proteins involved in their transcription, translation, folding, modification, and secretion. Thus, there has been an increasing interest in engineering the mammalian secretory pathway to boost protein production^23,24,25,26. Despite important advances in the field²⁷, current strategies to engineer the secretory pathway have remained predominantly empirical^28,29. Recent modeling approaches, however, have enabled the analysis of the metabolic capabilities of important eukaryotic cells under different genetic and environmental conditions^17,30,31,32. With the development of genome-scale models of protein-producing cells, such as CHO¹⁷, HEK-293³³, and hybridomas^34,35, it is now possible to gain a systems-level understanding of the mammalian protein production phenotype³⁶.

Efforts have been underway to enumerate the machinery needed for protein production. For example, Lund et al.⁶ recently reconstructed a comprehensive genetic network of the mouse secretory pathway. By comparing the mouse and CHO-K1 genomes and mapping CHO gene expression data onto this network, the authors identified potential targets for CHO cell engineering, demonstrating the potential of systems biology to interrogate and understand protein secretion in animal cells. This genetic network reconstruction, although useful for contextualizing omics data (e.g., RNA-Seq), is not set up for simulations of protein production, nor integrated with additional cellular processes such as metabolism. Therefore, our work is complementary in that it allows one to also to quantify the cost and cellular capacity for protein production by delineating the mechanisms of all biosynthetic steps and bioenergetic processes in the cell.

Here, we presented the first genome-scale reconstruction of the secretory pathway in mammalian cells coupled to metabolism. We connected this to current metabolic networks, yielding models of protein secretion and metabolism for human, mouse and CHO cells. These models compile decades of research in biochemistry and cell biology of higher eukaryotes and present it in a mathematical model. Using our model, we quantitatively estimated the energetic cost of producing several therapeutic proteins and all proteins in the CHO cell and human secretomes. We also identified factors limiting the secretion of individual products and observed that these depend on both the complexity of the product and the composition of the culture media. Furthermore, by integrating ribosomal profiling data with our model we found that CHO cells have selectively suppressed the expression of energetically expensive secreted proteins. Expanding upon this observation, we demonstrated that specific productivities can be predictably increased following the knock-down of an energetically expensive, non-essential protein. Furthermore, consistent with this, we have recently shown more than 50% reductions in total host cell protein production, along with increases in mAb titer when deleting 14 highly abundant proteins in CHO cells. Future studies will likely further explore how much of the CHO cell proteome can be deleted to further enhance recombinant protein secretion²⁰.

It is important to note that while our models capture major features of secreted proteins, there are additional PTMs (e.g., phosphorylation, gamma carboxylation), pathway machinery (e.g., chaperones), and cell processes that could possibly be captured in further expansions of the modeling framework⁶ (e.g., the unfolded protein response). These could be included as energetic costs associated with building and maintaining the secretory machinery (chaperones³, disulfide oxidoreductases³⁷, glycosyltransferases³⁸); protein stability and turnover rates³⁹; solubility constraints⁴⁰ and molecular crowding effects⁴¹. As these are captured by the models in a protein product-specific manner, predictions of protein production capacity will improve, and the models could provide further insights for cell engineering for biotechnology or to obtain a deeper understanding of mechanisms underlying amyloid diseases. Finally, a simplification of our secretory model is that it only computes the bioenergetic cost of synthesizing and attaching single representative N- and O-linked glycans to secreted proteins (i.e., it does not include the microheterogeneity and diversity of glycan structures of different proteins). Thus, an immediate potential expansion of our secretory model would involve coupling it to existing computational models of protein glycosylation^42,43. For example, given an N-glycan reaction network that captures the glycoform complexity of a target protein⁴⁴, one could build secretory reactions for the specific glycoforms of interest and compute the metabolic demands associated with each of them to identify potential targets and nutrient supplementations for glycoengineering.

In conclusion, the results of our study have important implications regarding the ability to predict protein expression based on protein-specific attributes and energetic requirements. The secretory pathway models here stand as novel tools to study mammalian cells and the energetic trade-off between growth and protein secretion in a product- and cell-specific manner. We presented algorithms that provide novel insights with our models, and expect that many other methods can be developed to answer a wide array of questions surrounding the secretory pathway, as seen for metabolism⁴⁵. To facilitate further use of these models, we provide our code and detailed instructions on how to construct protein-specific models in the Jupyter Notebooks available at https://github.com/LewisLabUCSD/MammalianSecretoryRecon.

Methods

Reconstruction of the mammalian secretory pathway

A list of proteins and enzymes in the mammalian secretory pathway was compiled from literature curation, UniProt, NCBI Gene, NCBI Protein and CHOgenome.org (see Supplementary Data 1). To facilitate the reconstruction process, the secretory pathway was divided into twelve subsystems or functional modules (Fig. 1) to sort the components according to their function. These subsystems correspond to the major steps required to process and secrete a protein. The components from a prior yeast secretory pathway reconstruction³ were used as a starting reference. To build species-specific models, orthologs for human, mouse and the Chinese hamster were identified and used, while yeast components and subsystems that are not present in the mammalian secretory pathway were removed. Additional subsystems were added when unique to higher eukaryotes, such as the calnexin-calreticulin cycle in the ER⁴⁶. These were constructed de novo and added to the reconstruction. The databases and literature were then consulted to identify the remaining components involved in each subsystem of the mammalian secretory pathway. Since most components in the mammalian secretory pathway have been identified in mouse and human, BLAST was utilized to identify the corresponding Chinese hamster orthologs by setting human as the reference organism and a cutoff of 60% of sequence identity. See Supplementary Discussion for an overview of the mammalian secretory pathway and its comparison with the yeast secretory pathway.

Protein-specific information matrix (PSIM)

The PSIM (Supplementary Data 2) contains the necessary information to construct a protein-specific secretory model from the template reactions in our reconstruction. The columns in the PSIM are presence of a signal peptide (SP), number of disulfide bonds (DSB), presence of Glycosylphosphatidylinositol (GPI) anchors, number of N-linked (NG) and O-linked (OG) glycans, number of transmembrane domains (TMD), subcellular location, protein length, and molecular weight. For most proteins, the information in the PSIM was obtained from the Uniprot database. When necessary, computational tools were used to predict signal peptides (PrediSi⁷) and GPI anchors (GPI-SOM⁸). Finally, additional information on the number of O-linked glycosylation sites of certain proteins were obtained from experimental data in previous studies^9,47. The PSIMs of the CHO and human secretomes are a subset of the full PSIM and contains only the proteins with a signal peptide (predicted or confirmed in Uniprot). The distribution of all PTMs across the human, mouse and CHO proteomes can be found in Jupyter Notebook D. For analyzing secretomes, a total of 3378 human proteins were picked based on the presence of a signal peptide in their sequence according to their annotation in the UniProt database. Similarly, 5641 CHO proteins were picked based on the presence of a signal peptide in their sequence and/or for being localized in the cell membrane according to the UniProt database.

Detection of N-linked glycosylation sites in CHO proteome

The number of N-linked glycosylation sites in the PSIM was determined experimentally as follows. CHO-K1 cells (ATCC) were lysed, denatured, reduced, alkylated and digested by trypsin. Desalted peptides were incubated with 10 mM sodium periodate in dark for 1 h before coupling to 50 μL of (50% slurry) hydrazide resins. After incubation overnight, non-glycosylated peptides were washed with 1.5 M NaCl and water. The N-glycosylated peptides were released with PNGaseF at 37 °C and desalted by using a C18 SepPak column. Strong cation exchange (SCX) chromatography was used to separate the sample into 8 fractions. Each fraction was analyzed on an LTQ-Orbitrap Velos (Thermo Electron, Bremen, Germany) mass spectrometer. During the mass spectrometry data analysis, carbamidomethylation was set as a fixed modification while oxidation, pyroglutamine, and deamidation were variable modifications.

Construction of models and constraint-based analysis

We wrote a Jupyter Notebook in Python (see Jupyter Notebook A) that takes a row from the PSIM as input to produce an expanded iCHO2048s, Recon 2.2s, or iMM1685s metabolic model with the product-specific secretory pathway of the corresponding protein. Flux balance analysis (FBA⁴⁸) and all other constraint-based analyses were done using the COBRA toolbox v2.0⁴⁹ in MATLAB R2015b and the Gurobi solver version 6.0.0. The analyses in Figs. 2, 3, 4 were done using the constraints in the Supplementary Data 3. For the iCHO2048s models secreting human proteins, we set the same constraints in all models and computed the theoretical maximum productivity (max_qp) while maintaining a growth rate (in units of inverse hours) of 0.01. Finally, since the exact glycoprofiles of most proteins in CHO are unknown and some even change over time in culture⁵⁰, we simplified our models by only adding the core N-linked and O-linked glycans to the secreted proteins.

Batch cultivation

Two isogenic CHO-S cell lines (Thermo Fisher Scientific, USA) adapted to grow in suspension, one producing Enbrel (Etanercept) and the other producing human plasma protease C1-inhibitor (C1INH), were seeded at 3 × 10⁵ cells per mL in 60 mL CD-CHO medium (Thermo Fisher Scientific, USA) supplemented with 8 mM l-Glutamine (Lonza) and 1 μL per mL anti-clumping agent (Life Technologies), in 250 mL Erlenmeyer shake flasks. Cells were incubated in a humidified incubator at 37 °C, 5% CO₂ at 120 rpm. Viable cell density and viability were monitored every 24 h for 7 days using the NucleoCounter NC-200 Cell Counter (ChemoMetec). Daily samples of spent media were taken for extracellular metabolite concentration and titer measurements by drawing 0.8 mL from each culture, centrifuging it at 1000 × g for 10 min and collecting the supernatant and discarding the cell pellet.

Titer determination

To quantify Enbrel and C1INH titers, biolayer interferometry was performed using an Octet RED96 (Pall Corporation, Menlo Park, CA). ProA biosensors (Fortebio 18–5013) were hydrated in phosphate-buffered saline (PBS) and preconditioned in 10 mM glycine pH 1.7. A calibration curve was prepared using Enbrel (Pfizer) or C1INH at 200, 100, 50, 25, 12.5, 6.25, 3.13, 1.56, 0.78 μg per mL. Culture spent media samples were collected after centrifugation and association was performed for 120 s with a shaking speed of 200 rpm at 30 °C. Octet System Data Analysis 7.1 software was used to calculate binding rates and absolute protein concentrations.

Extracellular metabolite concentration measurements

The concentrations of glucose, lactate, ammonium (NH₄⁺), and glutamine in spent media were measured using the BioProfile 400 (Nova Biomedical). Amino acid concentrations were determined via High-Performance Liquid Chromatography using the Dionex Ultimate 3000 autosampler at a flow rate of 1 mL per minute. Briefly, samples were diluted 10 times using 20 μL of sample, 80 μL MiliQ water, and 100 μL of an internal amino acid standard. Derivatized amino acids were monitored using a fluorescence detector. OPA-derivatized amino acids were detected at 340ex and 450em nm and FMOC-derivatized amino acids at 266ex and 305em nm. Quantifications were based on standard curves derived from dilutions of a mixed amino acid standard (250 ug per mL). The upper and lower limits of quantification were 100 and 0.5 μg per mL, respectively.

Estimation of protein secretion cost

We estimated the energetic cost of synthesizing and secreting all 5641 endogenous CHO cell proteins and 3538 endogenous human proteins. These proteins were chosen for containing a signal peptide in their sequence and/or for being localized in the cell membrane (according to the UniProt database). The energetic cost (in units of number of ATP equivalents) of secreting each protein (length L) was computed using the following formulas and assumptions.

Energy cost of translation: For each protein molecule produced, 2 L ATP molecules are cleaved to AMP during charging of the transfer RNA with a specific amino acid; 1 GTP molecule is consumed during initiation and 1 GTP molecule for termination; L-1 GTP molecules are required for the formation of L-1 peptide bonds; L-1 GTP molecules are necessary for L-1 ribosomal translocation steps. Thus, the total cost of translation (assuming no proofreading) is 4 L.

Average cost of signal peptide degradation: On average, signal peptides have a length of 22 amino acids. Thus, the average cost of degrading all peptide bonds in the signal peptide is 22. This average cost was assigned to all proteins analyzed.

Energetic cost of translocation across the ER membrane: During activation of the translocon, two cytosolic GTP molecules are hydrolyzed. From there, a GTP molecule bound to the folding-assisting chaperone BiP is hydrolyzed to GDP for every 40 amino acids that pass through the translocon pore⁴⁶. Thus, the cost of translocation is (L ÷ 40) + 2.

Energetic cost of vesicular transport and secretion: We used published data^51,52,53 (see Supplementary Data 1) to compute stoichiometric coefficients for reactions involving vesicular transport. That is, the number of GTP molecules bound to RAB and coat proteins in each type of vesicle (COPII and secretory vesicles). We found that a total of 192 and 44 GTPs must be hydrolyzed to transport one COPII or secretory (i.e., clathrin coated) vesicle from the origin membrane to the target membrane, respectively. Since vesicles do not transport a single protein molecule at a time, we estimated the number of secreted protein molecules that would fit inside a spherical vesicle (see estimated and assumed diameters in the Supplementary Data 1). For that, we assumed that the secreted protein is globular and has a volume V_P (nm³) that is directly proportional to its molecular weight MW⁵⁴:

$$\begin{array}{*{20}{c}} {V_{\mathrm{P}} = {\mathrm{MW}} \times 0.00121\ } \end{array}$$

(1)

Finally, we assumed that only 70 percent of the vesicular volume can be occupied by the target protein. Thus, the cost of vesicular transport via COPII vesicles with Volume V_COPII is:

$$\begin{array}{*{20}{c}} {192\;{\mathrm{GTPs}} \div \left( {V_{{\mathrm{COPII}}} \times 0.7 \div V} \right)\ } \end{array}$$

(2)

Similarly, the cost of vesicular secretion is:

$$\begin{array}{*{20}{c}} {44\;{\mathrm{GTPs}} \div \left( {V_{{\mathrm{Secretory}}} \times 0.7 \div V} \right)\ } \end{array}$$

(3)

Constraints used in models and Pareto optimality frontiers

All models were constrained using different sets of experimental uptake rates, which can be found in Supplementary Data 3. To construct Pareto optimality frontiers, we used the robustAnalysis function from the COBRA Toolbox v2.0 in Matlab 2015b using biomass as the control and secretion of the recombinant protein as the objective reactions, respectively.

Analysis of gene expression versus protein cost

Ribosome-profiling data¹² were used to quantify the ribosomal occupancy of each transcript in CHO cells. A cutoff of 1 RPKM was used to remove genes with low expression (10,045 genes removed from day 3 analysis and 10,411 from day 6 analysis). We used Spearman correlation to assess the variation of expression levels with respect to protein ATP cost.

CHO-DG44 model and prediction of neoR knockout effect

Ribosome-profiling data, specific productivity, product sequence, and growth rates of an IgG-producing CHO-DG44 cell line were obtained from a previous publication¹². From the same cultures, we obtained further cell dry-weight and metabolomic data from spent culture medium for this study. The mCADRE algorithm^55,56 was used to construct a DG44 cell line-specific iCHO2048s model. The specific productivity and the RPKM values of the secreted IgG were used to estimate the translation rate for the neoR selection-marker gene. We assumed that the flux (in units of mmol per gram dry-weight per hour) through the neoR translation reaction (v_neoR) should be proportional to that of the IgG translation rate (v_IgG, calculated from the measured specific productivity) and related to their expression ratios (i.e., the RPKM values of their genes in the ribosome-profiling data).

$$\begin{array}{*{20}{c}} {v_{neoR} = \frac{{{\mathrm{RPKM}}_{{\mathrm{neoR}}}}}{{2\left( {{\mathrm{RPKM}}_{{\mathrm{light}}} \, + \; {\mathrm{RPKM}}_{{\mathrm{heavy}}}} \right)}}v_{{\mathrm{IgG}}}\ } \end{array}$$

(4)

Finally, a reaction of neoR peptide translation (which is expressed in the cytosol and is not processed in the secretory pathway) was added to construct a neoR-specific iCHO2048s model. Uptake and secretion rates of relevant metabolites on days 3 and 6 of cell culture were used to constrain our model. As recombinant proteins represent 20% of total cell protein⁵⁷, we scaled the coefficients of all 20 amino acids in the model’s biomass reaction accordingly (i.e., each coefficient was multiplied by 0.8). We then used FBA to predict the specific productivity of IgG with or without neoR.

Cell dry-weight measurements

For cell dry-weight measurements, six tubes containing 2 mL of culture samples of known viable cell density and viability were freeze dried, weighed, washed in PBS, and weighed again. The difference in weight was used to calculate the mass per cell. The procedure resulted in an average cell dry-weight of 456 pg per cell. As a simplification, we assumed that cell dry-weight does not significantly differ from this average measured value during culture and thus was used when computing flux distributions in all simulations.

Calculation of growth and productivity rates

Supplementary Data 3 contains the experimental uptake and secretion rates used to constrain the iCHO2048s models^12,22,23. When rates were not explicitly stated in the studies we consulted, we used a method we developed previously²⁷. Briefly, appropriate viable cell density, titer, and metabolite concentration plots were digitized using WebPlot Digitizer software and we computed the corresponding rates as follows:

Growth rate (in units of inverse hours):

$$\mu = \frac{1}{{{\mathrm{VCD}}}}\frac{d}{{dt}}{\mathrm{VCD}}$$

(5)

Where VCD is the viable cell density (in units of cells per milliliter) Specific productivity (in units of picograms per cell per hour):

$$q_{\mathrm{p}} = \frac{1}{{{\mathrm{VCD}}}}\frac{d}{{dt}}{\mathrm{Titer}}$$

(6)

Consumption or production rate v_x of metabolite x (in units of millimoles per gram dry-weight per hour):

$$v_x = \frac{1}{{{\mathrm{VCD}}}}\frac{{d\left[ x \right]}}{{dt}}$$

(7)

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

All data that support the findings of this study, including the models, tables, and Jupyter Notebooks, are available at https://github.com/LewisLabUCSD/MammalianSecretoryRecon, as well as in the Supplementary Data and Source Data files. The Ribo-Seq and RNA-Seq data from the study by Kallehauge et al.¹² is available on the Gene Expression Omnibus with GEO accession Number GSE79512. The RNA sequencing data for human tissue is freely available at the Human Protein Atlas website (https://www.proteinatlas.org/about/download).

Code availability

All code used to generate the results of this study, including Jupyter Notebooks, MATLAB, and Python scripts, are freely accessible at https://github.com/LewisLabUCSD/MammalianSecretoryRecon

References

Uhlén, M. et al. Tissue-based map of the human proteome. Science 347, 1260419 (2015).
Article CAS PubMed Google Scholar
Walsh, G. Biopharmaceutical benchmarks 2018. Nat. Biotechnol. 36, 1136–1145 (2018).
Article CAS PubMed Google Scholar
Feizi, A., Österlund, T., Petranovic, D., Bordel, S. & Nielsen, J. Genome-scale modeling of the protein secretory machinery in yeast. PLoS ONE 8, e63284 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Liu, L., Feizi, A., Österlund, T., Hjort, C. & Nielsen, J. Genome-scale analysis of the high-efficient protein secretion system of Aspergillus oryzae. BMC Syst. Biol. 8, 73 (2014).
Article PubMed PubMed Central CAS Google Scholar
Irani, Z. A., Kerkhoven, E. J., Shojaosadati, S. A. & Nielsen, J. Genome-scale metabolic model of Pichia pastoris with native and humanized glycosylation of recombinant proteins. Biotechnol. Bioeng. 113, 961–969 (2016).
Article CAS PubMed Google Scholar
Lund, A. M. et al. Network reconstruction of the mouse secretory pathway applied on CHO cell transcriptome data. BMC Syst. Biol. 11, 37 (2017).
Article PubMed PubMed Central CAS Google Scholar
Hiller, K., Grote, A., Scheer, M., Munch, R. & Jahn, D. PrediSi: prediction of signal peptides and their cleavage positions. Nucl. Acids Res. 32, W375–W379 (2004).
Article CAS PubMed PubMed Central Google Scholar
Fankhauser, N. & Maser, P. Identification of GPI anchor attachment signals by a Kohonen self-organizing map. Bioinformatics 21, 1846–1852 (2005).
Article CAS PubMed Google Scholar
Yang, Z. et al. The GalNAc-type O-glycoproteome of CHO cells characterized by the SimpleCell strategy. Mol. Cell. Proteom. 13, 3224–3235 (2014).
Article CAS Google Scholar
Kaufman, R. J. et al. Effect of von Willebrand factor coexpression on the synthesis and secretion of factor VIII in Chinese hamster ovary cells. Mol. Cell. Biol. 9, 1233–1242 (1989).
Article CAS PubMed PubMed Central Google Scholar
Pipe, S. W., Morris, J. A., Shah, J. & Kaufman, R. J. Differential interaction of coagulation factor VIII and factor V with protein chaperones calnexin and calreticulin. J. Biol. Chem. 273, 8537–8544 (1998).
Article CAS PubMed Google Scholar
Kallehauge, T. B. et al. Ribosome profiling-guided depletion of an mRNA increases cell growth rate and protein secretion. Sci. Rep. 7, 40388 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
van Wijk, X. M. et al. Whole-genome sequencing of invasion-resistant cells identifies laminin α2 as a host factor for bacterial invasion. MBio. 8, e02128–16 (2017).
PubMed PubMed Central Google Scholar
Feizi, A., Gatto, F., Uhlen, M. & Nielsen, J. Human protein secretory pathway genes are expressed in a tissue-specific pattern to match processing demands of the secretome. NPJ Syst. Biol. Appl. 3, 22 (2017).
Article PubMed PubMed Central Google Scholar
Swainston, N. et al. Recon 2.2: from reconstruction to model of human metabolism. Metabolomics 12, 109 (2016).
Article PubMed PubMed Central CAS Google Scholar
Sigurdsson, M. I., Jamshidi, N., Steingrimsson, E., Thiele, I. & Palsson, B. O. A detailed genome-wide reconstruction of mouse metabolism based on human Recon 1. BMC Syst. Biol. 4, 140 (2010).
Article PubMed PubMed Central CAS Google Scholar
Hefzi, H. et al. A consensus genome-scale reconstruction of chinese hamster ovary cell metabolism. Cell Syst. 3, 434–443.e8 (2016).
Article CAS PubMed PubMed Central Google Scholar
Templeton, N., Dean, J., Reddy, P. & Young, J. D. Peak antibody production is associated with increased oxidative metabolism in an industrially relevant fed-batch CHO cell culture. Biotechnol. Bioeng. 110, 2013–2024 (2013).
Article CAS PubMed Google Scholar
Kim, S. J., Kim, N. S., Ryu, C. J., Hong, H. J. & Lee, G. M. Characterization of chimeric antibody producing CHO cells in the course of dihydrofolate reductase-mediated gene amplification and their stability in the absence of selective pressure. Biotechnol. Bioeng. 58, 73–84 (1998).
Article CAS PubMed Google Scholar
Kol, S. et al. Multiplex secretome engineering enhances recombinant protein production and purity. Preprint at https://www.biorxiv.org/content/10.1101/647214v1 (2019).
Gu, M. B., Todd, P. & Kompala, D. S. Metabolic burden in recombinant CHO cells: effect of dhfr gene amplification and lacZ expression. Cytotechnology 18, 159–166 (1996).
Article Google Scholar
Gu, M. B., Todd, P. & Kompala, D. S. Analysis of foreign protein overproduction in recombinant CHO cells. Effect of growth kinetics and cell cycle traverse. Ann. N. Y. Acad. Sci. 721, 194–207 (1994).
Article ADS CAS PubMed Google Scholar
Hansen, H. G., Pristovšek, N., Kildegaard, H. F. & Lee, G. M. Improving the secretory capacity of Chinese hamster ovary cells by ectopic expression of effector genes: Lessons learned and future directions. Biotechnol. Adv. 35, 64–76 (2017).
Article CAS PubMed Google Scholar
Delic, M., Göngrich, R., Mattanovich, D. & Gasser, B. Engineering of protein folding and secretion—strategies to overcome bottlenecks for efficient production of recombinant proteins. Antioxid. Redox Signal. 21, 414–437 (2014).
Article CAS PubMed Google Scholar
Le Fourn, V., Girod, P. -A., Buceta, M., Regamey, A. & Mermod, N. CHO cell engineering to prevent polypeptide aggregation and improve therapeutic protein secretion. Metab. Eng. 21, 91–102 (2014).
Article PubMed CAS Google Scholar
Kuo, C. C. et al. The emerging role of systems biology for engineering protein production in CHO cells. Curr. Opin. Biotechnol. 51, 64–69 (2018).
Article CAS PubMed Google Scholar
Golabgir, A. et al. Quantitative feature extraction from the Chinese hamster ovary bioprocess bibliome using a novel meta-analysis workflow. Biotechnol. Adv. 34, 621–633 (2016).
Article CAS PubMed Google Scholar
Borth, N., Mattanovich, D., Kunert, R. & Katinger, H. Effect of increased expression of protein disulfide isomerase and heavy chain binding protein on antibody secretion in a recombinant CHO cell line. Biotechnol. Prog. 21, 106–111 (2008).
Article CAS Google Scholar
Ku, S. C. Y., Ng, D. T. W., Yap, M. G. S. & Chao, S. -H. Effects of overexpression of X-box binding protein 1 on recombinant protein production in Chinese hamster ovary and NS0 myeloma cells. Biotechnol. Bioeng. 99, 155–164 (2008).
Article CAS PubMed Google Scholar
Yusufi, F. N. K. et al. Mammalian systems biotechnology reveals global cellular adaptations in a recombinant CHO cell line. Cell Syst. 4, 530–542.e6 (2017).
Article CAS PubMed Google Scholar
Selvarasu, S. et al. Combined in silico modeling and metabolomics analysis to characterize fed-batch CHO cell culture. Biotechnol. Bioeng. 109, 1415–1429 (2012).
Article CAS PubMed Google Scholar
Gutierrez, J. M. & Lewis, N. E. Optimizing eukaryotic cell hosts for protein production through systems biotechnology and genome-scale modeling. Biotechnol. J. 10, 939–949 (2015).
Article CAS PubMed Google Scholar
Quek, L. -E. et al. Reducing Recon 2 for steady-state flux analysis of HEK cell culture. J. Biotechnol. 184, 172–178 (2014).
Article CAS PubMed Google Scholar
Selvarasu, S., Karimi, I. A., Ghim, G. -H. & Lee, D.- Y. Genome-scale modeling and in silico analysis of mouse cell metabolic network. Mol. BioSyst. 6, 152–161 (2009).
Article PubMed Google Scholar
Sheikh, K., Förster, J. & Nielsen, L. K. Modeling hybridoma cell metabolism using a generic genome-scale metabolic model of mus musculus. Biotechnol. Prog. 21, 112–121 (2008).
Article CAS Google Scholar
Galleguillos, S. N. et al. What can mathematical modelling say about CHO metabolism and protein glycosylation? Comput. Struct. Biotechnol. J. 15, 212–221 (2017).
CAS Google Scholar
Araki, K. & Inaba, K. Structure, mechanism, and evolution of Ero1 family enzymes. Antioxid. Redox Signal. 16, 790–799 (2012).
Article CAS PubMed Google Scholar
Jimenez Del Val, I., Polizzi, K. M. & Kontoravdi, C. A theoretical estimate for nucleotide sugar demand towards Chinese Hamster Ovary cellular glycosylation. Sci. Rep. 6, 28547 (2016).
O’Brien, E. J., Lerman, J. A., Chang, R. L., Hyduke, D. R. & Palsson, B. O. Genome-scale models of metabolism and gene expression extend and refine growth phenotype prediction. Mol. Syst. Biol. 9, 693–693 (2014).
Article CAS Google Scholar
Vazquez, A. & Oltvai, Z. N. Macromolecular crowding explains overflow metabolism in cells. Sci. Rep. 6, 31007 (2016).
Beg, Q. K. et al. Intracellular crowding defines the mode and sequence of substrate uptake by Escherichia coli and constrains its metabolic activity. Proc. Natl Acad. Sci. USA 104, 12663–12668 (2007).
Article ADS CAS PubMed PubMed Central Google Scholar
Spahn, P. N. & Lewis, N. E. Systems glycobiology for glycoengineering. Curr. Opin. Biotechnol. 30C, 218–224 (2014).
Article CAS Google Scholar
Tejwani, V., Andersen, M. R., Nam, J. H. & Sharfstein, S. T. Glycoengineering in CHO cells: advances in systems biology. Biotechnol. J. 13, 1700234 (2018).
Article CAS Google Scholar
Spahn, P. N. et al. A Markov chain model for N-linked protein glycosylation - towards a low-parameter tool for model-driven glycoengineering. Metab. Eng. 33, 52–66 (2016).
Article CAS PubMed Google Scholar
Lewis, N. E., Nagarajan, H. & Palsson, B. O. Constraining the metabolic genotype-phenotype relationship using a phylogeny of in silico methods. Nat. Rev. Microbiol. 10, 291–305 (2012).
Article CAS PubMed PubMed Central Google Scholar
Araki, K. & Nagata, K. Protein folding and quality control in the ER. Cold Spring Harb. Perspect. Biol. 3, a007526–a007526 (2011).
Article PubMed PubMed Central CAS Google Scholar
Baycin-Hizal, D. et al. Proteomic analysis of Chinese hamster ovary cells. J. Proteome Res. 11, 5265–5276 (2012).
Article CAS PubMed PubMed Central Google Scholar
Orth, J. D., Thiele, I. & Palsson, B. Ø. What is flux balance analysis? Nat. Biotechnol. 28, 245–248 (2010).
Article CAS PubMed PubMed Central Google Scholar
Schellenberger, J. et al. Quantitative prediction of cellular metabolism with constraint-based models: the COBRA Toolbox v2.0. Nat. Protoc. 6, 1290–1307 (2011).
Article CAS PubMed PubMed Central Google Scholar
Grainger, R. K. & James, D. C. CHO cell line specific prediction and control of recombinant monoclonal antibody N-glycosylation. Biotechnol. Bioeng. 110, 2970–2983 (2013).
Article CAS PubMed Google Scholar
Borner, G. H. H. et al. Multivariate proteomic profiling identifies novel accessory proteins of coated vesicles. J. Cell Biol. 197, 141–160 (2012).
Article CAS PubMed PubMed Central Google Scholar
Cheng, Y., Boll, W., Kirchhausen, T., Harrison, S. C. & Walz, T. Cryo-electron tomography of clathrin-coated vesicles: structural implications for coat assembly. J. Mol. Biol. 365, 892–899 (2007).
Article CAS PubMed Google Scholar
Takamori, S. et al. Molecular anatomy of a trafficking organelle. Cell 127, 831–846 (2006).
Article CAS PubMed Google Scholar
Liu, J. K. et al. Reconstruction and modeling protein translocation and compartmentalization in Escherichia coli at the genome-scale. BMC Syst. Biol. 8, 110 (2014).
Article PubMed PubMed Central CAS Google Scholar
Opdam, S. et al. A systematic evaluation of methods for tailoring genome-scale metabolic models. Cell Syst. 4, 318–329.e6 (2017).
Article CAS PubMed PubMed Central Google Scholar
Wang, Y., Eddy, J. A. & Price, N. D. Reconstruction of genome-scale metabolic models for 126 human tissues using mCADRE. BMC Syst. Biol. 6, 153 (2012).
Article PubMed PubMed Central Google Scholar
González-Leal, I. J. et al. Use of a Plackett-Burman statistical design to determine the effect of selected amino acids on monoclonal antibody production in CHO cells. Biotechnol. Prog. 27, 1709–1717 (2011).
Article PubMed CAS Google Scholar
Uhlen, M. et al. The human secretome. Sci. Signal. 12, 609 (2019).
Martínez, V. S., Buchsteiner, M., Gray, P., Nielsen, L. K. & Quek, L.-E. Dynamic metabolic flux analysis using B-splines to study the effects of temperature shift on CHO cell metabolism. Metab. Eng. Commun. 2, 46–57 (2015).
Article PubMed PubMed Central Google Scholar
Karpusas, M. et al. The crystal structure of human interferon beta at 2.2-A resolution. Proc. Natl Acad. Sci. USA 94, 18813–18818 (1997).
Article Google Scholar
Gong, L. et al. Crystal structure of the michaelis complex between tissue-type plasminogen activator and plasminogen activators inhibitor-1. J. Biol. Chem. 290, 25795–25804 (2015).
Article CAS PubMed PubMed Central Google Scholar
Saphire, E. et al. Crystal structure of a neutralizing human IGG against HIV-1: a template for vaccine design. Science 293, 1155–1159 (2001).
Article CAS PubMed Google Scholar
Mukai, Y. et al. Solution of the structure of the TNF-TNFR2 Complex. Sci. Signal 3, ra83–ra83 (2010).
Article CAS PubMed Google Scholar
Groppe, J. et al. Structural basis of BMP signalling inhibition by the cystine knot protein noggin. Nature 420, 636–642 (2002).
Article ADS CAS PubMed Google Scholar
Webber, D. et al. A silent H-bond can be mutationally activated for high-affinity interaction of BMP-2 and activin type IIB receptor. BMC Struct. Biol. 7, 6 (2007).
Article CAS Google Scholar
Svensson, L. et al. Evaluation of the metal binding sites in a recombinant coagulation factor Viii identifies two sites with unique metal binding properties. Biol. Chem. 394, 761 (2013).
Article CAS PubMed Google Scholar
Syed, R. et al. Efficiency of signalling through cytokine receptors depends critically on receptor orientation. Nature 395, 511–516 (1998).
Article ADS CAS PubMed Google Scholar

Download references

Acknowledgements

We would like to thank Philipp Spahn, Austin Chiang, and Chih-Chung Kuo for their insightful comments on this manuscript. This work was supported by generous funding from the Novo Nordisk Foundation provided to the Center for Biosustainability at the Technical University of Denmark (NNF10CC1016517 and NNF16CC0021858), and from NIGMS (R35 GM119850), and a fellowship from the Government of Mexico (CONACYT) and the University of California Institute for Mexico and the United States (UC-MEXUS).

Author information

These authors contributed equally: Jahir M. Gutierrez, Amir Feizi.

Authors and Affiliations

Department of Bioengineering, University of California, San Diego, La Jolla, CA, 92093, USA
Jahir M. Gutierrez, Shangzhong Li, Hooman Hefzi, Bernhard O. Palsson & Nathan E. Lewis
Novo Nordisk Foundation Center for Biosustainability at the University of California, San Diego, School of Medicine, La Jolla, CA, 92093, USA
Jahir M. Gutierrez, Shangzhong Li, Hooman Hefzi, Bernhard O. Palsson & Nathan E. Lewis
Department of Biology and Biological Engineering, Kemivägen 10, Chalmers University of Technology, SE-412 96, Gothenburg, Sweden
Amir Feizi & Jens Nielsen
Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, 2800, Kongens Lyngby, Denmark
Thomas B. Kallehauge, Lise M. Grav, Daniel Ley, Bjorn Voldborg, Helene Faustrup Kildegaard, Gyun Min Lee, Bernhard O. Palsson & Jens Nielsen
Department of Systems Biology, Technical University of Denmark, Kongens Lyngby, Denmark
Daniel Ley
Pharmaceutical R&D Department, Turgut Illaclari A.S, Istanbul, Turkey
Deniz Baycin Hizal
Department of Chemical & Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, 21218-2686, USA
Michael J. Betenbaugh
Department of Pediatrics, University of California, San Diego, School of Medicine, La Jolla, CA, 92093, USA
Bernhard O. Palsson & Nathan E. Lewis

Authors

Jahir M. Gutierrez
View author publications
You can also search for this author in PubMed Google Scholar
Amir Feizi
View author publications
You can also search for this author in PubMed Google Scholar
Shangzhong Li
View author publications
You can also search for this author in PubMed Google Scholar
Thomas B. Kallehauge
View author publications
You can also search for this author in PubMed Google Scholar
Hooman Hefzi
View author publications
You can also search for this author in PubMed Google Scholar
Lise M. Grav
View author publications
You can also search for this author in PubMed Google Scholar
Daniel Ley
View author publications
You can also search for this author in PubMed Google Scholar
Deniz Baycin Hizal
View author publications
You can also search for this author in PubMed Google Scholar
Michael J. Betenbaugh
View author publications
You can also search for this author in PubMed Google Scholar
Bjorn Voldborg
View author publications
You can also search for this author in PubMed Google Scholar
Helene Faustrup Kildegaard
View author publications
You can also search for this author in PubMed Google Scholar
Gyun Min Lee
View author publications
You can also search for this author in PubMed Google Scholar
Bernhard O. Palsson
View author publications
You can also search for this author in PubMed Google Scholar
Jens Nielsen
View author publications
You can also search for this author in PubMed Google Scholar
Nathan E. Lewis
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

J.M.G., A.F. and N.E.L. conceived the project and designed the experiments. J.M.G., A.F., S.L., T.B.K, H.H., L.G., D.L collected data. J.M.G. and D.B.H. performed experiments. J.M.G. performed research, analyzed the data, and interpreted the results. J.M.G., A.F. and N.E.L. drafted the article. M.J.B., B.V., H.F.K., G.M.L., B.O.P, J.N. and N.E.L. critically revised and approved the article.

Corresponding author

Correspondence to Nathan E. Lewis.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Communications thanks the anonymous reviewer(s) for their contribution to the peer review of this work.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Description of Additional Supplementary Files

Supplementary Data 1

Supplementary Data 2

Supplementary Data 3

Supplementary Data 4

Reporting Summary

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Gutierrez, J.M., Feizi, A., Li, S. et al. Genome-scale reconstructions of the mammalian secretory pathway predict metabolic costs and limitations of protein secretion. Nat Commun 11, 68 (2020). https://doi.org/10.1038/s41467-019-13867-y

Download citation

Received: 19 July 2017
Accepted: 22 November 2019
Published: 02 January 2020
DOI: https://doi.org/10.1038/s41467-019-13867-y

This article is cited by

OxPhos defects cause hypermetabolism and reduce lifespan in cells and in patients with mitochondrial diseases
- Gabriel Sturm
- Kalpita R. Karan
- Martin Picard
Communications Biology (2023)
Improving recombinant protein production by yeast through genome-scale modeling using proteome constraints
- Feiran Li
- Yu Chen
- Jens Nielsen
Nature Communications (2022)
Energy metabolism in Chinese hamster ovary (CHO) cells: Productivity and beyond
- Jong Uk Park
- Hye-Jin Han
- Jong Youn Baik
Korean Journal of Chemical Engineering (2022)
Translational landscape and protein biogenesis demands of the early secretory pathway in Komagataella phaffii
- Troy R. Alva
- Melanie Riera
- Justin W. Chartron
Microbial Cell Factories (2021)
Correcting for sparsity and interdependence in glycomics by accounting for glycan biosynthesis
- Bokan Bao
- Benjamin P. Kellman
- Nathan E. Lewis
Nature Communications (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.