- Laboratory of Systems and Synthetic Biology, Department of Agrotechnology and Food Sciences, Wageningen University & Research, Wageningen, Netherlands
Genome-scale metabolic models of Mycobacterium tuberculosis (Mtb), the causative agent of tuberculosis, have been envisioned as a platform for drug discovery. By systematically probing the networks that underpin such models, the reactions that are essential for Mtb are identified. A majority of these reactions are catalyzed by enzymes and thus represent candidate drug targets to fight an Mtb infection. Nevertheless, this is complicated by the limited knowledge on the environment that Mtb encounters during infection. Modeling the behavior of the bacteria during infection requires knowledge of the so-called biomass reaction that represents bacterial biomass composition. This composition varies in different environments or bacterial growth phases. Accurate modeling of the metabolic state requires a precise biomass reaction for the described condition. In recent years, additional insights in the in-host environment occupied by Mtb have been gained as transcript abundance data of interacting host and pathogen have become available. Therefore, we used transcript abundance data and developed a straightforward and systematic method to obtain a condition-specific biomass reaction for Mtb during in vitro growth and during infection of its host. The method described herein is virtually free of any pre-set assumptions on uptake rates of nutrients, making it suitable for exploring environments with limited accessibility. The condition-specific biomass reaction represents the “metabolic objective” of Mtb in a given environment (in-host growth and growth on defined medium) at a specific time point, and as such allows modeling the bacterial metabolic state in these environments. Five different biomass reactions were used to predict nutrient uptake rates and gene essentiality. Predictions were subsequently compared to available experimental data. Our results show that nutrient uptake can accurately be predicted. Gene essentiality can also be predicted but accurate predictions remain difficult to obtain. In conclusion, a viable strategy to model Mtb metabolism in hard-to-access environments that is virtually free of pre-set assumptions is provided.
Introduction
Constraint-based genome-scale metabolic models (GEMs) enable prediction of metabolic states. A metabolic state is defined as a vector of all fluxes or conversion rates (in mmol h−1) throughout metabolism per weight unit of biomass (usually 1 gram dry weight, gDw). GEMs comprise linear equations describing conversions among metabolites, uptake or secretion processes, and transport processes over different compartments. These equations are referred to as flux balance constraints and are founded on an underlying metabolic network wherein all metabolites are interconnected by conversion and transport reactions. The flux balance constraints are captured in a stoichiometric matrix (Kauffman et al., 2003). GEMs may comprise additional constraints as well, such as reversibility and capacity constraints. The whole of all possible fluxes that satisfy all constraints of a GEM is referred to as the solution space (Bordbar et al., 2014). Additional constraints present an opportunity to further limit the size of the solution space, which results in a more accurate calculation of the metabolic state. A suitable way to increase the amount of constraints is to measure uptake and/or secretion rates of metabolites/nutrients. Knowledge of a few of these rates can considerably shrink the solution space (Reed, 2012).
Given the stoichiometric matrix, the most straightforward approach for calculating a metabolic state is to simulate conditions wherein the organism is in a steady state physiological condition, meaning that there is no net intracellular accumulation of metabolites. Under this assumption, it is possible to construct a Flux Balance Analysis (FBA) problem. FBA finds the optimal (maximum or minimum) value of a selected function, the so-called objective function, while satisfying all constraints. Solution of the FBA problem leads to a vector of reaction fluxes that represents a calculated metabolic state of the organism. This calculated metabolic state is more likely to represent the actual metabolic state as the solution space is shrunk by additional constraints (Raman and Chandra, 2009; Orth et al., 2010).
The metabolic state is, among others, dependent on the objective function. Metabolic states have been accurately predicted for several bacteria in recent years, using objective functions such as maximizing the flux through the biomass reaction to represent growth rate, maximizing ATP production or minimizing enzyme usage among others (Schuetz et al., 2007).
However, in some conditions measuring uptake and/or secretion rates can be notoriously difficult, if not impossible. Such is the case for intracellular Mycobacterium tuberculosis (Mtb), a pathogenic bacterium able to withstand the harsh environment of the phagosome. Mtb is even capable of halting the maturation of the phagosome inside immune cells and providing a niche for the bacterium to thrive (Gengenbacher and Kaufmann, 2012; Zondervan et al., 2018). Genome-scale metabolic models of Mtb, have been envisioned as a platform for drug discovery (Jamshidi and Palsson, 2007; Rienksma et al., 2014).
In addition to uptake rates, other measurements can serve to estimate or approach (a part of) the metabolic state of a cell, such as transcript profiles (Hoppe, 2012). For Mtb, a major difficulty with these measurements is the large size difference between the eukaryotic host cell and the prokaryotic pathogen, which results in metabolites and transcripts from the host vastly outnumbering those of the pathogen (Fels et al., 2017). With regard to transcript abundance experimental methods have been developed to increase the ratio of pathogen mRNA to host mRNA (Mangan et al., 2002). This enrichment in pathogen transcripts renders differences between intracellular and extracellular pathogen transcript abundance apparent. We recently published a dataset of Mycobacterium bovis BCG and THP-1 cells using a dual RNA-sequencing strategy (Rienksma et al., 2015). However, such an enrichment method is not available for metabolites, which are more closely related to fluxes as compared to transcripts. Moreover, metabolites, unlike transcripts, cannot be assigned to host or pathogen unless they only occur in one of said host or pathogen (Zimmermann et al., 2017).
Transcript abundance data can be used to constrain models in environments where knowledge regarding nutrient availability and objective(s) is limited. Methods such as iMAT (Shlomi et al., 2008), MADE (Jensen and Papin, 2011), GIMME (Becker and Palsson, 2008), E-flux (Colijn et al., 2009), TRFBA (Motamedian et al., 2017), and others (Lewis et al., 2012) limit the solution space by using expression values as a proxy for flux. These methods allow for the explanation of phenomena that cannot be derived solely from the models, such as the prediction of the Crabtree effect in yeast (Rossell et al., 2013). These model and data integration methods limit the solution space within the ranges of expression data, thereby effectively generating condition specific models. Shrinking the solution space by limiting fluxes based on gene expression seems an obvious choice, but it is not at all obvious how this should be done. Methods for model and data integration have been thoroughly evaluated (Machado and Herrgard, 2014). The evaluation showed that no method outperforms the others for all tested models and datasets. Finally, this condition-specific model building can hamper exploration of metabolic states that arise from perturbations of the environment, from which the gene expression data was originally derived. These adapted models would only allow changes to the metabolic state that fit within the boundaries of what was originally measured. Such a rigid model appears a poor choice for predictive modeling.
A modeling approach focused on an accurate description of the objective of Mtb during infection appears to be a better strategy to make new predictions because it does not limit the solution space or metabolic flexibility beforehand. Previous approaches have relied on adapting the biomass reaction to represent the composition on mycobacterial cells during infection. Bordbar and colleagues adjusted the biomass reaction based on differential gene expression (Bordbar et al., 2010). This approach is biased by the biomass reaction that is present in the model prior to the tailoring process and the potential synthesis of other metabolites specifically during infection, is overlooked. Shi et al. (2010) proposed a biomass reaction comprising trehalose dimycolate, triacylglycerol (TAG), and polyglutamate/glutamine to reflect a minimal cell wall composition. The logical assumption applied was that during a “non-growth state”, Mtb utilizes metabolites produced in pathways of which gene expression is elevated and does not, or to a lesser extent, utilize metabolites produced in pathways of which gene expression is suppressed. Shi and colleagues used qPCR to monitor gene expression (Shi et al., 2010). This requires a pre-selection of target genes based on experience and experimental output and does not accommodate unbiased exploration of the transcriptional landscape.
Here, we integrate a constraint-based model of Mtb metabolism and RNA sequencing data to provide condition-specific biomass reactions during host infection and during growth on Middlebrook 7H9 medium. The genome-scale nature of this approach ensures all known pathways and biomass precursors are taken into account, whereas the nature of the used data (RNAseq) ensures unbiased assumptions on the types and quantities of metabolic precursors. During on-going infection mycobacterial cells might enter a non-growth state on which maximal growth rate is not the metabolic objective. However, still minimal macromolecular components need to be synthesized and energetic requirements need to be fulfilled to ensure survival. The condition-specific biomass reaction representing infection combines both aspects as it reflects the composition of mycobacterial cells during infection, and it also represents the metabolic requirements for its survival and interaction with the host which are incorporated in the RNAseq data as well. To simulate the metabolic state of the bacteria upon infection, flux through the condition-specific biomass reaction is maximized, while the total usage of enzymes is minimized. As Mtb faces several types of stress and adverse conditions imposed by the host's immune system during infection of the host (Fontán et al., 2008), it is assumed that Mtb does not squander its resources, and makes optimal use of available enzymes. From a modeling perspective, this can be seen as a bi-objective optimization problem wherein two competing objectives, i.e., maximization of biomass production on the one hand, and minimization of enzyme usage on the other hand, are simultaneously considered.
The goal of multi-objective optimizations is to find Pareto optimal solutions (also called non-dominated solutions) (Papalambros and Wilde, 2000). A solution is Pareto optimal if no other solution exists that better satisfies all objectives. In other words, a solution is Pareto optimal if an improvement in one objective requires a degradation of another. Multiple methods have been developed to obtain Pareto optimal solutions in multi-objective optimization problems such as the normal constraint method (Messac et al., 2003) that has been used to explore tradeoffs between hepatic metabolic functions (Nagrath et al., 2007). Here, we tackle the problem by using a weighted sum method in which weight factors are attributed to each objective: fb and fe, i, for biomass and enzyme usage for each of the i = {1, …, m} reactions, respectively. This approach ensures that the obtained solutions are Pareto optimal (Suárez et al., 2008). Parsimonious enzyme usage FBA (pFBA) has been proposed to explore the tradeoffs between maximizing growth and minimizing enzyme utilization (Bordbar et al., 2010). In pFBA there is an initial maximization of the biomass reaction followed by a minimization of enzyme usage. In our approach, this Pareto optimal solution would correspond to the extreme case on which the weight of the biomass objective is much higher than that of the enzyme minimization objectives (fb >> fe, i,). By changing these weight factors, a ratio between these two factors, fr, is established that enables the prediction of metabolic genes essential to Mtb within the macrophage as well as metabolites that are sequestered by Mtb from the phagosome. Comparison of these predictions with experimentally obtained data (Beste et al., 2013; Mendum et al., 2015) reveals that by using a condition-specific objective function inferred from transcript abundance data the metabolic state of Mtb upon infection can be accurately predicted.
Methods
Mtb Metabolic Model
We used our genome-scale metabolic model of M. tuberculosis called sMtb, in silico M. tuberculosis (Rienksma et al., 2014). We made some minor corrections to this model regarding among others the respiration chain, and added six reactions to improve the functioning of the model. This improved sMtb model can be found in Supplementary Data Sheet 1 (as systems biology markup language file, SBML) and in Supplementary Data Sheet 2 (as excel file) together with a small summary of the aforementioned changes in Supplementary Data Sheet 3. A list of metabolites that could be present in the phagosome was collected from literature (Beste et al., 2013), similarly a list of metabolites in Middlebrook 7H9 medium was collected (Supplementary Table 1).
Constraining sMtb With Gene Expression Data
Raw sequence read data supporting the results of this article are available in the EMBL-EBI European Nucleotide Archive under the Accession No. PRJEB6552, http://www.ebi.ac.uk/ena/data/view/PRJEB6552 for both M. bovis BCG grown on Middlebrook 7H9 medium and M. bovis BCG cells infecting THP-1 cells. RNA sequencing reads were aligned to the M. bovis BCG genome as described before (Rienksma et al., 2015). For each gene present in sMtb, the number of reads aligning to it was summed. A cutoff value of 100 counts per million (cpm) was used to identify lowly expressed genes, that were assigned a count value of zero. The resulting gene count values were subsequently transferred to their corresponding reactions, summing the counts for reactions catalyzed by isozymes. For reactions catalyzed by a protein complex, the smallest number of counts of every gene that encodes a part of such a complex was assigned to the reaction. For reactions that can be catalyzed by several different protein complexes, the smallest number of counts assigned to one of the genes encoding a part of each complex was identified and subsequently the total of all these smallest numbers of counts was assigned to the reaction. Reactions that received no counts using this method were not allowed to carry any flux. Afterwards, the total number of counts assigned to each reaction was normalized by dividing this total number of counts by the largest number of counts assigned to any reaction in sMtb, resulting in a value ranging between 0 and 1 for each enzyme-catalyzed reaction. This procedure is called the E-flux algorithm and is explained in greater detail by Colijn et al. (2009).
Obtaining Condition-Specific Biomass Reactions
The workflow applied to model sMtb is generally depicted in Figure 1. This workflow was applied twice: for gene expression data of M. bovis BCG grown on Middlebrook 7H9 medium (medium condition) and for gene expression data of M. bovis BCG cells infecting THP-1 cells (infection condition). Firstly, upper bounds on unidirectional (forward) reactions, and upper and lower bounds on bidirectional reactions were replaced by the normalized counts assigned to that reaction (Figure 1A). The resulting sMtb model constrained by gene expression data was further constrained by setting uptake rates of unavailable metabolites to zero in the given condition and allowing unconstrained uptake of available nutrients, based on nutrient availability data (Figure 1B). The nutrient availability in the phagosome and in Middlebrook 7H9 medium is given in Supplementary Table 1. Afterwards, a general list of biomass precursors was obtained from sMtb (Supplementary Data Sheets 1, 2). For each biomass precursor, a sink reaction was added and the flux of each of these sink reactions was individually maximized, effectively maximizing the flux toward the respective precursor (Figure 1C). Subsequently, the obtained maximum value for each biomass precursor was normalized such that the total molecular weight of these precursors equaled one gram, resulting in a condition-specific biomass reaction of Mtb during infection, growing in phagosomal conditions, CSI, and a condition-specific objective function of Mtb growing in Middlebrook 7H9 medium, CSM (Figure 1D). These condition-specific objective functions were subsequently added to sMtb and constraints derived from the gene expression were removed (Figure 1E).
Figure 1. Workflow to create a model with a condition-specific biomass reaction. (A) An exemplary constraint-based genome-scale metabolic model (GEM) comprising a metabolic network with metabolites (yellow diamonds) and reactions (arrows), including uptake reactions (green arrows) and secretion reactions (red arrows), is depicted in a microorganism (rounded square). Gene expression data (blue wave-shaped lines) is used to constrain maximal flux values according to the E-flux algorithm (Colijn et al., 2009), to obtain a condition-specific GEM having a shrunken solution space. (B) The condition-specific (blue background) GEM is subsequently combined with nutrient availability data (graph) and uptake of unavailable nutrients is constrained to zero. (C) Biomass precursor data (bar plot) is used to pinpoint biomass precursors in the condition-specific GEM with blocked transport reactions (red crosses), and the flux through the flux limiting reaction for each precursor is selected by maximizing flux toward each biomass precursor (blue diamonds) individually. (D) The sum of all precursor fluxes is normalized to one gram biomass dry weight (1 gDw) and a condition-specific biomass reaction (green diamond) is obtained. (E) All constraints placed on the GEM in the previous steps (A–D), are removed and a GEM with a condition-specific biomass reaction is obtained.
Calculating Nutrient Uptake for Various Objective Functions
We compared five different objective functions for their ability to correctly predict nutrient uptake rates by Mtb in the phagosome. The following biomass reactions were used: CSI, CSM, the regular biomass reaction from model sMtb representing growth (REB) (Rienksma et al., 2014), the biomass reaction from model sMtb representing in vivo growth (IVB) (Rienksma et al., 2014) and a reaction representing Mtb in a non-replicative state (NRC) (Shi et al., 2010).
The bounds on uptake rates of all nutrients representing phagosomal conditions (Supplementary Table 1) were unconstrained, with the sole exception of constraining the oxygen uptake rate to 0.01 mmol gDw−1 h−1. Subsequently, each of the five objective functions was maximized while the sum of all other enzymatically catalyzed reactions was minimized (Equations 1.1 - 1.3). The weight factor for the biomass reaction, fb, was varied while keeping the weight factor for enzymatically catalyzed reactions, fe, i, constant at 0.001 for all reactions i, hence effectively varying fr, the ratio between fb and the sum of all fe, i-values (Equation 2). Subsequently, a flux variability analysis was performed, wherein the maximum objective function value, w (Figure 2, right panel), was set as a constraint and the nutrient uptake rates were minimized and maximized individually, resulting in maximal and minimal uptake rate boundaries for various nutrients for each objective function (Figures 3–5) (Gudmundsson and Thiele, 2010; Orth et al., 2010).
Figure 2. Tradeoff between biomass production (growth rate) and enzyme utilization in the metabolic model. Predicted values of the flux through the biomass synthesis reaction (left), average flux through all enzymatically catalyzed reactions (middle) and the objective function value (right), i.e., combination of total enzymatic reaction minimization and biomass reaction maximization for various fr-values. Five different biomass reactions are shown: CSI (green), CSM (blue), IVB (yellow), REB (red), and NRC (black). The dashed line indicates fr = 0.8.
Figure 3. Predicted amino acid uptake rates. Maximum and minimum predicted uptake rates for alanine, aspartate, asparagine, and glutamate using five different biomass reactions: CSI (green), CSM (blue), IVB (yellow), REB (red), and NRC (black) for varying values of the biomass weight factor fr. The dashed line indicates fr = 0.8. Two lines of the same color indicate upper and lower limits of the prediction. Note that negative values of uptake rates denote excretion of that metabolite.
Figure 4. Predicted lipid uptake rates. Maximum and minimum predicted uptake rates for diacylglycerol, glycerol-3-phosphate, and triacylglycerol using five different biomass reactions: CSI (green), CSM (blue), IVB (yellow), REB (red), and NRC (black) for varying values of the biomass weight factor fr. The dashed line indicates fr = 0.8. Two lines of the same color indicate upper and lower limits of the prediction. Note that negative values of uptake rates denote excretion of that metabolite.
Figure 5. Predicted oxygen, carbon dioxide, and nitric oxide uptake rates. Maximum and minimum predicted uptake rates for oxygen, carbon dioxide, and nitric oxide using five different biomass reactions: CSI (green), CSM (blue), IVB (yellow), REB (red), and NRC (black) for varying values of the biomass weight factor fr. The dashed line indicates fr = 0.8. Two lines of the same color indicate upper and lower limits of the prediction. Note that negative values of uptake rates denote excretion of that metabolite.
Comparison of Gene Essentialities for Various Objective Functions
At log(fr) > −0.5 flux is channeled through all (condition-specific) biomass reactions (Figure 2, right panel). At log(fr) < 0.3 minimizing flux through the sum of all enzymatically catalyzed reactions is still relevant and the restrictions of the oxygen uptake threshold are not yet overcome. Beyond this point, the (condition-specific) biomass reaction weight factor, fb, is so large as compared to the sum of all fe, i-values, that an optimal objective function value, w, is obtained by solely minimizing enzyme usage, and ignoring maximizing flux through the (condition-specific) biomass reaction. An fr-value of 0.8 was chosen from Figures 2–5 as asparagine, alanine and glutamate in addition to glycerol-3-phosphate and CO2 are taken up from the host, which is likely to occur during infection (Beste et al., 2013) and this value log(0.8) ≈ − 0.1 is centered between the boundaries of log(fr) = 0.5 and log(fr) = 0.3. The growth rates (i.e., the in silico calculated flux of CSI, CSM, IVB, REB, and NRC) were maximized indirectly by maximizing the aforementioned bi-objective optimization problem. Each biomass reaction will always obtain its maximal value using this approach. Thereafter, using the COBRA toolbox (Schellenberger et al., 2011), genes and their corresponding reactions where deleted one by one and the resulting specific growth rates were computed by maximizing the aforementioned bi-objective optimization problem. These growth rates were divided by the wild-type growth rate, resulting in a number between 0 and 1 for each knocked-out gene, representing the relative specific growth rate. We applied a 95% reduction in the relative growth rate as a threshold to indicate essential genes as described before (Rienksma et al., 2014).
Mendum and colleagues infected human dendritic cells with an Mtb transposon library to identify genes that are required for in vivo survival after 3 days and after 7 days (Mendum et al., 2015). These experimentally identified essential genes were compared to the predicted essential genes using the aforementioned five different objective functions. Subsequently, the accuracy, sensitivity and specificity of the predictions, were calculated for all five objective functions and for both experimental time points.
Results
We created two condition-specific biomass reactions (CSI and CSM) based on transcript abundance data in two conditions. The term “biomass reaction” is perhaps not the most suitable term as these reactions not only cover synthesis of metabolites used for biomass production, but also synthesis of excreted enzymes and small molecules, repair of damaged lipid membranes and other metabolites involved in host-pathogen interaction. Even though these processes themselves are largely unknown, transcript abundance data indirectly reflects these processes and combined with a GEM can give a picture of required metabolic precursors for these processes.
Condition-Specific Biomass Reactions
The creation of a condition-specific biomass reaction requires a GEM, a list of available nutrients in the given condition, a list of metabolic precursors for synthesis of macromolecules, and transcript abundance data. We used model sMtb, a comprehensive model of Mtb metabolism (Rienksma et al., 2014), with minor corrections and additions (Supplementary Data Sheets 1–3). Transcript abundance data was obtained from a dual RNA-sequencing experiment wherein transcript abundances of M. bovis BCG, a close relative of Mtb having a highly similar genome (Garnier et al., 2003), were measured under two conditions (Rienksma et al., 2015). In the first condition M. bovis BCG infects THP-1 cells, and in the second condition M. bovis BCG grows on Middlebrook 7H9 medium. The sMtb model was used as a platform to integrate the expression data and to calculate two condition-specific biomass reactions of Mtb, CSI (condition-specific infection) and CSM (condition-specific medium), for both aforementioned conditions, respectively. A list of all metabolites known or expected to be present in the phagosome was assembled (Supplementary Table 1). Availability of these metabolites was simulated by enabling their free uptake in the model. In addition, a list of all known biomass precursors was generated based on the sMtb model (Supplementary Table 2).
The flux toward each biomass precursor was maximized one by one, while limiting the maximum flux through enzymatically-catalyzed reactions based on the transcript abundance for the present condition (Figure 1). The ratio of biomass precursors obtained for both conditions represents the two condition-specific biomass reactions (CSM and CSI). The contributions of each class of precursors to these two biomass reactions are shown in Table 1 (see Supplementary Table 2 for a more detailed breakdown). The largest differences in the biomass reactions of both conditions entails the fraction of amino acids, which is approximately doubled in the host as compared to in vitro growth on Middlebrook 7H9 medium, which is in accordance with previous predictions (Zimmermann et al., 2017). The fraction of carbohydrates on the other hand, is substantially reduced from 20.1 to 9.9%.
Simulating Mtb Metabolism: Balance Between Growth and Enzyme Utilization
To predict the in vivo metabolic state, reflecting Mtb's intracellular behavior, we compared the performance of five different biomass reactions: the in vitro biomass growth reaction (IVB) and a regular biomass growth reaction (REB), both present in sMtb (Rienksma et al., 2014), a biomass reaction representing non-replicating cells (NRC) (Shi et al., 2010), the condition-specific biomass reaction representing growth on Middlebrook 7H9 medium (CSM) and the condition-specific biomass reaction representing growth within the host's phagosome (CSI).
Simulation of the metabolic state of Mtb in the phagosome is complicated by a lack of knowledge on the rate at which nutrients are acquired from the host. However, various studies have shown that the phagosomal environment is likely to be hypoxic (Schnappinger et al., 2003). Therefore, we chose to limit the oxygen uptake rate at a relatively low value of 0.01 mmol gDW−1 h−1 while keeping unrestricted the uptakes of all other nutrients that were assumed to be present in the host. Even with such a restriction, nutrients were predicted to be taken up in unrealistically large quantities. This behavior can be traced back to anaerobic reactions in the model that result in ATP generation, followed by the artificial generation of oxygen at the cost of high amounts of energy in the form of ATP to ADP conversion. In addition, limiting the oxygen uptake rate all the way to 0 mmol gDW−1 h−1 resulted in zero flux through the (condition-specific) biomass reaction, and was therefore an unsuitable strategy as well.
To overcome such difficulty, the assumption was made that Mtb utilizes its resources parsimoniously when in a hostile environment. This can be modeled by minimization of enzyme usage while maximizing the flux through the biomass reaction. This bi-objective optimization was performed using a weighted sum method in which the following FBA problem with a weighted objective was solved:
subject to:
Wherein w is the objective function value, ve, i represents the flux or rate of reaction i catalyzed by at least one enzyme; fe, i represents the weight factor for reaction i; vb represents the specific growth rate (or biomass reaction flux value), i.e., the flux through one of the five aforementioned (condition-specific) biomass reactions; fb represents the weight factor for the biomass reaction; n is the total number of reactions catalyzed by at least one enzyme; S represents the stoichiometric matrix; v represents a vector with all fluxes (comprising ve, i and vb); b represents a vector with zeros; l represents a vector with lower bounds for all fluxes and u represents a vector with upper bounds for all fluxes. The weight factor ratio, fr, between growth and total enzyme utilization is given by:
Each reaction in the model catalyzed by one or multiple enzymes was given the same weight factor (fe) and the weight factor (fb) of the (condition-specific) biomass reaction was varied such that log(fr) varied around a value of 0. A log(fr) value of 0 entails that the numerator and denominator of Equation (2) are of equal size and reflects a balanced weight distribution between minimization of enzyme usage (i.e., maximization of the negative values) and maximization of growth. By changing the weight factor ratio, the relative importance of enzyme usage minimization and biomass reaction maximization changes (Figure 2). If too much weight is put on the minimization of enzyme usage, i.e., fr becomes too low, the biomass reaction flux value, vb, becomes irrelevant and its value drops to zero, this can be seen at the left hand panel of Figure 2, where the graphs equal zero. The reason that the average flux through enzymatically catalyzed reactions, ve, does not drop to zero when too much emphasis is put on enzyme usage minimization, as can be seen in the middle panel of Figure 2, is because there is a small (0.1 mmol gDW−1 h−1) growth related maintenance coefficient enforcing a small minimum flux of ATP to ADP conversion.
Prediction of Uptake Rates
Figures 3–5 show predicted uptake rates for the five different biomass reactions. As fr increases, unrealistically high uptake rates are predicted to overcome the restrictions of the oxygen uptake threshold (0.01 mmol gDW−1 h−1). As can be seen in Figure 2 (black line), the graph representing NRC biomass reaction (non-replicating cells) is slightly shifted as compared to the other objectives. The reason for this is that the total molecular weight of biomass precursors for this objective as obtained from Shi and colleagues is not normalized to one gram. Its value is actually higher, resulting in a larger objective function value at a smaller fr-value (Figure 2, right panel). For the four other biomass reactions a balance exists between maximization of growth and minimization of enzyme usage between approximately log(fr) = −0.5 and log(fr) = 0.3 (0.3 ≤ fr ≤ 2.0). Beyond log(fr) = 0.3 the restrictions of the oxygen uptake threshold are overcome, and vb and vi-values jump to infinite (for the NRC biomass reaction, this point is reached earlier). An appropriate value for fr was selected from Figures 2–5 based on the consideration that uptake of asparagine, alanine and glutamate in addition to glycerol-3-phosphate and CO2 from the host is likely to occur during infection (Beste et al., 2013). In addition, nitric oxide is not produced in high amounts by THP-1 cells, and thus not a likely source of nutrition (Fontán et al., 2008), further justifying an fr-value >0.3 [log(fr) > −0.5], when hardly any nitric oxide is predicted to be taken up (Figure 5, right panel). At fr = 0.8 [log(fr) = −0.1, dashed vertical lines], uptake of glutamate and glycerol-3-phosphate is predicted for all biomass reactions except for NRC, the biomass reaction describing non-replicating cells. For this biomass reaction uptake of glutamate is not predicted. In addition, at this point (fr = 0.8) uptake of asparagine is predicted for the condition specific biomass reaction of infection (CSI) and predicted to be likely (the average of minimum and maximum uptake rates is above zero) for the other four objectives. The uptake of alanine at this point is predicted to be likely for all five objective functions.
As can be seen in Figures 3–5, the predicted uptake rates are very similar for all five biomass reactions. Therefore, the biomass reaction itself seems of minor importance for the prediction of uptake rates. The uptake of glutamate appears as especially high for a relatively small fr-value, regardless of the chosen biomass reaction.
Beste and colleagues determined that the amino acids asparagine, alanine and glutamate are likely taken up during infection. Acetate- or acetyl-CoA-derived from β-oxidation of host lipids and CO2 is utilized intracellularly and glycerol-3-phosphate could be a potential carbon source as well (Beste et al., 2013). Regardless of the objective used, sMtb is able to reproduce these observations (Figures 3–5). In general, glutamate is taken up at low fr-values, while asparagine becomes more important at higher fr-values. The routes of glutamate toward most metabolic precursors are shorter than those of asparagine, which is predicted to be taken up at a higher fr-value. In this way the change of the uptake rates with the fr-value reflects the metabolic versatility of each component.
Lipid uptake rates show that glycerol-3-phosphate is likely to be taken up, while diacylglycerol and triacylglycerol are possibly taken up. Cholesterol is not predicted to be used as a carbon source at any fr-value, in contrast to mounting evidence that cholesterol plays an important role as a nutrient for Mtb in the host (Wipperman et al., 2014; Rienksma et al., 2015). Currently, the cholesterol degradation pathway of Mtb is partly unknown, therefore only a partial degradation pathway exists in sMtb and the double ringed product (ring C and D of the cholesterol molecule) can only be excreted in sMtb. Partial degradation results in suboptimal yield of energy carrying metabolites derived from the cholesterol molecule compared to other molecules and therefore it is not predicted to be taken up. As knowledge on the cholesterol degradation pathway advances, the complete pathway will eventually be known. Integrating this complete pathway into sMtb will likely yield different results regarding cholesterol uptake.
The prediction of CO2 uptake is complicated, as it is a nutrient that is excreted and possibly taken up, unlike the other nutrients in Figures 3–5. With FBA only a prediction of the difference between CO2 excretion and uptake can be obtained. On average, CO2 is predicted to be excreted throughout the entire fr range.
Gene Essentiality Within the Host
Gene essentiality predictions are often used to assess the predictive power of GEMs. Gene essentiality predictions can be simulated with in silico gene knock out (KO) mutants and comparing the maximal predicted growth rate of the wild type strain with the KO mutant. A reduction in the predicted specific growth rate of 95% or more is generally accepted as a threshold value for gene essentiality (Beste et al., 2007; Jamshidi and Palsson, 2007; Rienksma et al., 2014).
Here this approach will not provide satisfactory results, as there are too few constraints on the uptake rates of individual nutrients, only on the whole of enzymatically catalyzed reactions, resulting in an excess of unrealistic metabolic routes that could circumvent the deficiency caused by the deletion of the gene. We therefore optimized the aforementioned weighted bi-objective using fe, i = 0.001 for all i with and without deleting the corresponding gene. Afterwards, both results were compared and a reduction of the specific growth rate, vb, by 95% was marked as an essential gene.
These gene essentiality predictions were performed for each of the biomass reactions. We subsequently compared these predictions with experimental data obtained by Mendum et al. (2015) and evaluated the accuracy, sensitivity and specificity of the predictions obtained with each of the five biomass reactions was calculated (Table 2).
Table 2. Gene essentiality predictions made using sMtb with five objective functions compared with experimental data obtained 3 and 7 days after infection (Mendum et al., 2015).
Discussion
We have created condition-specific biomass reactions based on transcript abundance data, thereby ensuring that the obtained biomass compositions represent the organism's needs in the corresponding conditions. By limiting the availability of nutrients to those known or estimated to be present in the phagosome and restricting the uptake of all other nutrients, we were able to capture the metabolic state of Mtb during infection.
Methods such as iMAT (Shlomi et al., 2008), MADE (Jensen and Papin, 2011), or GIMME (Becker and Palsson, 2008), aim at developing condition specific models maximizing the agreement between flux predictions and expression measurements methods. The flexibility of these models is reduced, and this can limit their predictive power. If, for example, certain reactions are perturbed by the effect of drugs, perhaps the system shifts to another metabolic state to accommodate the effect of such perturbation. However, due to the fitting of the gene expression data, it might happen that this effect cannot be accounted for, as the predicted metabolic state is biased to represent the gene expression data. In our approach, we initially constrain the reaction bounds in the model with the gene expression data. The constrained model is used to derive a condition specific biomass reaction. The obtained coefficients of the biomass precursors contain information on the network wide impact of the gene expression data. The constraints in the model are then removed while the newly defined condition specific biomass reaction is used to provide an indirect representation of the metabolic state corresponding to the expression data. Our goal was to retain flexibility in the model, while incorporating the experimental data.
We reasoned that the enzymes encoded by transcripts and involved in metabolism, which were present at a given moment in Mtb, should roughly reflect the flux through these enzymes at that specific condition and time point. Even though transcript abundance is not linearly correlated to enzyme abundance or flux (i.e., the reaction rate of an enzyme) (Bordel et al., 2010), for larger systems, such as pathways or the entire metabolism, a correlation is likely to exist. On average, metabolic transcript abundance data should reflect the optimal quantity of a given enzyme that is sufficient to perform its metabolic task. Production of an excess of metabolic enzymes would be a waste of energy, and thus unfavorable for an organism residing in a hostile environment.
The synthesis routes toward amino acids are predicted to carry more flux during host infection as compared to in vitro growth, which is in agreement with other predictions (Zimmermann et al., 2017). This is represented in Table 1 by the higher (doubled) fraction of amino acids required. This suggests that protein synthesis is increased upon infection. Mtb is known to excrete proteins during infection, which could explain this predicted increase (Gengenbacher and Kaufmann, 2012). At the same time, the predicted lipid synthesis requirement is lower during infection than during growth on Middlebrook medium, confirming the lipid-rich diet that Mtb encounters in the host environment (Schnappinger et al., 2003; Gengenbacher and Kaufmann, 2012). Another major difference is the lower carbohydrate synthesis. Following the same reasoning, carbohydrates should be more abundant in the host environment, but it is generally assumed that Mtb has poor access to carbohydrates in this environment (Kalscheuer et al., 2010; Fullam et al., 2016). A possible explanation could be that Mtb does not synthesize carbohydrates as the synthesis of other metabolites are preferred within the host as compared to growth on Middlebrook medium.
We have used a bi-objective optimization approach to simultaneously take into account growth requirements and parsimonious enzyme utilization. The tradeoff between both objectives is apparent in Figure 2. Still the comparison between the uptake profiles in Figures 3–5 led us to conclude that a ratio between both objectives, fr, of 0.8 [corresponding to log(fr) = −0.1] is likely to represent the metabolic state in the host. This suggests that, under these conditions, growth represents a major sink to cellular resources. Here we have selected an equal fe, i for all enzymatic reactions i, however this could be modified to account for differences in enzymes, such as size (molecular weight), activity or degradation rates.
Finally, it should be borne in mind that the transcriptomics data do not represent later infectious states, but a single time point 24 h post infection, before the onset of growth arrest. As can be seen from Figures 3–5, the profiles of uptake rates of different nutrients are quite similar for all five (condition-specific) biomass reactions, even though these reactions are very different. Production of a variety of precursors is apparently possible using a more or less fixed set of nutrients. The predicted combination of nutrients that Mtb acquires during infection is surprising from a modeling point of view. As uptake of one nutrient and subsequent production of energy carrying metabolites (ATP, NADH), biomass precursor(s), and excretion of byproducts, will always be more favorable than that of another metabolite in terms of its potential to sustain growth. The result is that the one nutrient is always favored above another and uptake of multiple nutrients normally does not occur without setting quantitative arbitrary boundaries on uptake rates. This preferential substrate utilization is often regulated at multiple levels, and it should be considered that this type of models does not explicitly account for regulation. Still, the energy and metabolite precursor gain from each nutrient is very balanced using sMtb and the bi-objective optimization, which indicates that enough regulatory information is retained in the transcript data.
A major advantage of the simulations performed within this study is that virtually no assumptions on quantitative uptake rates are required. The only limitation on uptake rates, apart from not allowing uptake of metabolites that are not known or likely to be available in the phagosome, is set on the uptake of oxygen. The phagosome is likely a hypoxic environment (Schnappinger et al., 2003; Gengenbacher and Kaufmann, 2012) and the oxygen uptake rate was therefore (arbitrarily) set to 1% (0.01 mmol gDW−1 h−1) of the rate used in previous predictions on Mtb metabolism (Jamshidi and Palsson, 2007).
The predictions of essential genes using sMtb and the five different (condition-specific) biomass reactions are not overwhelmingly accurate. In general, the specificity (the correct prediction of non-essential genes) is quite good, but the sensitivity (the correct prediction of essential genes) is very poor. This is rather remarkable, as such a long list of biomass precursors (Supplementary Table 2) is likely to result in a high number of genes predicted to be essential, as there is ample opportunity to disrupt synthesis routes toward many precursors by an in silico knockout. Possibly, there are even more metabolic precursors that should be taken into account when creating biomass reactions for Mtb.
Although the biomass reaction representing non-replicating cells, NRC, has the highest accuracy, its sensitivity is the poorest of all biomass reactions, due to its low number of biomass precursors. If one is interested in developing novel therapeutic intervention strategies, the essential genes are arguably the most interesting. In general, the amount of genes that are predicted to be essential is lower than the measured number. This could imply that the list of 108 biomass precursors is still too short. Given that there are 2,500 different lipids identified in Mtb up till now (Sartain et al., 2011), the total number of different metabolites is probably a lot higher. Even if metabolic intermediates are omitted, it is still likely that the total number of biomass precursors is well above 108.
The Mtb genome roughly contains 4,000 genes, of which a quarter has an unknown function (Qin et al., 2013). Model sMtb currently contains 930 genes, which is approximately one-third of the genome having a known function. Extrapolating these figures would mean that there are still an estimated 300 unknown genes in the Mtb genome that are involved in metabolism. So, an estimated quarter of model sMtb is missing. This will undoubtedly affect predictions made with sMtb.
Another, more fundamental problem lies in the possibility that Mtb and the host continuously interact and a steady state is not easily obtained (Garton and Hare, 2013). As the foundation of constraint-based metabolic models is the stoichiometric matrix, wherein a steady state (i.e., synthesis and degradation rates for each metabolite are equal) is assumed for all metabolites, a non-steady state situation might negatively impact the predictions made using sMtb.
The poor prediction of genes essential to survival of Mtb within the host is in stark contrast to in vitro predictions previously made using sMtb where accuracies of 80% were reached (Rienksma et al., 2014). Remarkably, the biomass reactions seem to have limited influence on gene essentiality predictions within the host. As the general list of biomass precursors of model sMtb is primarily derived from in vitro data of Mtb, or close relatives of Mtb, the list of biomass precursors could be overfitted to in vitro growth conditions.
In addition, the condition-specific biomass reactions could be incorrectly inferred. As the biomass precursors are maximized individually one at a time, information regarding their interdependency is not taken into account. One could for example envision maximizing the sum of the flux toward all biomass precursors at the same time, while minimizing the difference between the overall flux profile and the gene expression profile, instead of the approach taken here. Nevertheless, such a strategy is at risk of ignoring precursors and corresponding synthesis pathways that are relatively lowly expressed, and ending up only a few precursors in the biomass reaction.
Another explanation is that important constraints are missing. For example, the influence of metal cofactors such as iron and zinc on the metabolic state is ignored, while these cofactors are crucial for intracellular survival, and many metabolic enzymes do not function without these cofactors (Zondervan et al., 2018).
Taken together, the lack of predictive power of sMtb regarding in-host essential gene predictions could be caused by several problems, one of the most fundamental problems being the absence of a steady state situation. The gene essentiality measurements from Mendum and colleagues show a similar picture, as only 78–80% of the metabolic genes essential for survival are shared between 3 days and 7 days after infection (Mendum et al., 2015). This figure is not strikingly low, but it does point in the direction of a lack of a steady state situation. The effect that a non-steady state situation would have on the predictions of essential genes and the metabolic state is difficult to quantify.
Although Mtb is very similar to M. bovis BCG, there are obvious differences. First of all, Mtb is highly pathogenic to humans, while M. bovis BCG is a relatively safe organism. From a metabolic point of view, both organisms are highly similar, although there are some notable differences (Lofthouse et al., 2013). Moreover, it is not unimaginable that metabolic differences during infection are highlighted as M. bovis BCG is eventually eradicated within human immune cells, while Mtb is able to withstand and thrive within such cells. Another aspect is that the gene essentiality measurements are made 3 days and 7 days after infection while the dual RNA-seq data is derived from an experiment 1 day after infection.
We developed a method of modeling the metabolism of M. tuberculosis during infection of the host's immune cells. The method has the advantage that, unlike previously applied host-pathogen modeling approaches (Bordbar et al., 2010), it is virtually free from any artificially placed constraints on metabolite uptake and secretion rates. In addition, our method does not require a pre-composed biomass reaction. The only requirements are: knowledge of nutrient availability, a genome-scale dataset of transcript abundances (such as an RNA-sequencing dataset), a detailed list of biomass precursors, and a genome-scale constraint-based model of metabolism. A relatively small amount of data is required for this method, and it is therefore suited to explore metabolic states of microorganisms in difficult to access environments where an efficient usage of resources is likely to occur.
Our method allows accurate prediction of nutrients from the host, apart from cholesterol uptake, which was not predicted to take place, likely due to lack of knowledge on the complete degradation pathway. A doubled amino acid synthesis requirement was predicted using our method, suggesting an increased synthesis rate of proteins relative to other metabolic precursors during host infection. Lipid synthesis was predicted to decrease during infection, confirming the predominant lipid diet encountered by Mtb within the host.
Flux predictions obtained with the condition specific biomass reactions, without any further constraints show poor correlation with the transcriptomics data (lower than 0.1). This value is similar to the values obtained using the other four biomass reactions. Poor correlation between transcriptomics data and proteomics measurements has been shown in a wide number of publications (Maier et al., 2009; Payne, 2015; Edfors et al., 2016). In addition, accurate predictions would also require inclusion of enzyme turnover data (Sánchez Benjamín et al., 2017). This further confirms that fitting the model to the gene expression data might lead to an over-constrained model.
It is important to notice that during the onset of infection not only the bacterium undergoes metabolic changes, but also the host environment it thrives in most likely undergoes changes as the host responds to infection. This interplay between the host and the pathogen has not been taken into account as here only the bacterium is modeled. Another reason for the inaccurate gene essentiality predictions could be that many enzymes play additional roles in the synthesis of precursors that are not required during in vitro growth or that the list of precursors is not comprehensive. The latter explanation would be plausible, as the predictions on nutrient uptake are quite accurate, suggesting that nutrient uptake is driven by energy efficiency constraints.
Author Contributions
RR drafted the manuscript, conceived the study and performed the data analysis. MS-D participated in drafting the manuscript and helped with the data analysis. PS and VM participated in drafting the manuscript. All authors contributed to the study design and critically read, revised and approved the manuscript.
Funding
This work has been supported by Framework Program 7 of the European Research Council, SysteMTb Collaborative Project (HEALTH-2009-2.1.1-1-241587) and by the European Union's Horizon 2020 research and innovation programme under grant agreement No. 634942 (MycoSynVac).
Conflict of Interest Statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Supplementary Material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fcimb.2018.00264/full#supplementary-material
References
Becker, S. A., and Palsson, B. O. (2008). Context-specific metabolic networks are consistent with experiments. PLoS Comput. Biol. 4:e1000082. doi: 10.1371/journal.pcbi.1000082
Beste, D. J., Hooper, T., Stewart, G., Bonde, B., Avignone-Rossa, C., Bushell, M. E., et al. (2007). GSMN-TB: a web-based genome-scale network model of Mycobacterium tuberculosis metabolism. Genome Biol. 8:R89. doi: 10.1186/gb-2007-8-5-r89
Beste, D. J., Nöh, K., Niedenführ, S., Mendum, T. A., Hawkins, N. D., Ward, J. L., et al. (2013). 13C-Flux spectral analysis of host-pathogen metabolism reveals a mixed diet for intracellular Mycobacterium tuberculosis. Chem. Biol. 20, 1012–1021. doi: 10.1016/j.chembiol.2013.06.012
Bordbar, A., Lewis, N. E., Schellenberger, J., Palsson, B. O., and Jamshidi, N. (2010). Insight into human alveolar macrophage and M. tuberculosis interactions via metabolic reconstructions. Mol. Syst. Biol. 6:422. doi: 10.1038/msb.2010.68
Bordbar, A., Monk, J. M., King, Z., and Palsson, B. O. (2014). Constraint-based models predict metabolic and associated cellular functions. Nat. Rev. Genet. 15, 107–120. doi: 10.1038/nrg3643
Bordel, S., Agren, R., and Nielsen, J. (2010). Sampling the solution space in genome-scale metabolic networks reveals transcriptional regulation in key enzymes. PLoS Comput. Biol. 6:e1000859. doi: 10.1371/journal.pcbi.1000859
Colijn, C., Brandes, A., Zucker, J., Lun, D. S., and Weiner, B. (2009). Interpreting expression data with metabolic flux models: predicting Mycobacterium tuberculosis mycolic acid production. PLoS Comput. Biol. 5:e1000489. doi: 10.1371/journal.pcbi.1000489
Edfors, F., Danielsson, F., Hallström, B. M., Käll, L., Lundberg, E., Pontén, F., et al. (2016). Gene–specific correlation of RNA and protein levels in human cells and tissues. Mol. Syst. Biol. 12:883. doi: 10.15252/msb.20167144
Fels, U., Gevaert, K., and Van Damme, P. (2017). Proteogenomics in aid of host–pathogen interaction studies: a bacterial perspective. Proteomes 5:26. doi: 10.3390/proteomes5040026
Fontán, P., Aris, V., Ghanny, S., Soteropoulos, P., and Smith, I. (2008). Global transcriptional profile of Mycobacterium tuberculosis during THP-1 human macrophage infection. Infect. Immun. 76, 717–725. doi: 10.1128/IAI.00974-07
Fullam, E., Prokes, I., Fütterer, K., and Besra, G. S. (2016). Structural and functional analysis of the solute-binding protein UspC from Mycobacterium tuberculosis that is specific for amino sugars. Open Biol. 6:160105. doi: 10.1098/rsob.160105
Garnier, T., Eiglmeier, K., Camus, J. C., Medina, N., Mansoor, H., Pryor, M., et al. (2003). The complete genome sequence of Mycobacterium bovis. Proc. Natl. Acad. Sci. U.S.A. 100, 7877–7882. doi: 10.1073/pnas.1130426100
Hare, H. M. (2013). Tuberculosis: feeding the enemy. Chem. Biol. 20(8): 971–972. doi: 10.1016/j.chembiol.2013.08.001
Gengenbacher, M., and Kaufmann, S. H. (2012). Mycobacterium tuberculosis: success through dormancy. FEMS Microbiol. Rev. 36, 514–532. doi: 10.1111/j.1574-6976.2012.00331.x
Gudmundsson, S., and Thiele, I. (2010). Computationally efficient flux variability analysis. BMC Bioinformatics 11:489. doi: 10.1186/1471-2105-11-489
Hoppe, A. (2012). What mRNA abundances can tell us about metabolism. Metabolites 2, 614–631. doi: 10.3390/metabo2030614
Jamshidi, N., and Palsson, B. (2007). Investigating the metabolic capabilities of Mycobacterium tuberculosis H37Rv using the in silico strain iNJ661 and proposing alternative drug targets. BMC Syst. Biol. 1:26. doi: 10.1186/1752-0509-1-26
Jensen, P. A., and Papin, J. A. (2011). Functional integration of a metabolic network model and expression data without arbitrary thresholding. Bioinformatics 27, 541–547. doi: 10.1093/bioinformatics/btq702
Kalscheuer, R., Weinrick, B., Veeraraghavan, U., Besra, G. S., and Jacobs, W. R. (2010). Trehalose-recycling ABC transporter LpqY-SugA-SugB-SugC is essential for virulence of Mycobacterium tuberculosis. Proc. Natl. Acad. Sci. U.S.A. 107, 21761–21766. doi: 10.1073/pnas.1014642108
Kauffman, K. J., Prakash, P., and Edwards, J. S. (2003). Advances in flux balance analysis. Curr. Opin. Biotechnol. 14, 491–496. doi: 10.1016/j.copbio.2003.08.001
Lewis, N. E., Nagarajan, H., and Palsson, B. O. (2012). Constraining the metabolic genotype–phenotype relationship using a phylogeny of in silico methods. Nat. Rev. Micro. 10, 291–305. doi: 10.1038/nrmicro2737
Lofthouse, E. K., Wheeler, P. R., Beste, D. J., Khatri, B., Wu, H., Mendum, T. A., et al. (2013). Systems-based approaches to probing metabolic variation within the Mycobacterium tuberculosis complex. PLoS ONE 8:e75913. doi: 10.1371/journal.pone.0075913
Machado, D., and Herrgård, M. (2014). Systematic evaluation of methods for integration of transcriptomic data into constraint-based models of metabolism. PLoS Comput. Biol. 10:e1003580. doi: 10.1371/journal.pcbi.1003580
Maier, T., Güell, M., and Serrano, L. (2009). Correlation of mRNA and protein in complex biological samples. FEBS Lett. 583, 3966–3973. doi: 10.1016/j.febslet.2009.10.036
Mangan, J. A., Monahan, I. M., and Butcher, P. D. (2002). “Gene expression during host—pathogen interactions: Approaches to bacterial mRNA extraction and labelling for microarray analysis„” in Methods in Microbiology, Vol. 33, ed N. D. Brendan Wren (London: Academic Press), 137–151.
Mendum, T. A., Wu, H., Kierzek, A. M., and Stewart, G. R. (2015). Lipid metabolism and type vii secretion systems dominate the genome scale virulence profile of Mycobacterium tuberculosis in human dendritic cells. BMC Genomics 16:372. doi: 10.1186/s12864-015-1569-2
Messac, A., Ismail-Yahaya, A., and Mattson, C. A. (2003). The normalized normal constraint method for generating the Pareto frontier. Struct. Multidiscip. Optim. 25, 86–98. doi: 10.1007/s00158-002-0276-1
Motamedian, E., Mohammadi, M., and Shojaosadati, S. A. Heydari, M. (2017). TRFBA: an algorithm to integrate genome-scale metabolic and transcriptional regulatory networks with incorporation of expression data. Bioinformatics 33, 1057–1063. doi: 10.1093/bioinformatics/btw772
Nagrath, D., Avila-Elchiver, M., Berthiaume, F., Tilles, A. W., Messac, A., and Yarmush, M. L. (2007). Integrated energy and flux balance based multiobjective framework for large-scale metabolic networks. Ann. Biomed. Eng. 35, 863–885. doi: 10.1007/s10439-007-9283-0
Orth, J. D., and Thiele, I. Palsson, B. Ø. (2010). What is flux balance analysis? Nat. Biotechnol. 28, 245–248. doi: 10.1038/nbt.1614
Papalambros, P. Y., and Wilde, D. J. (2000). Principles of Optimal Design: Modeling and Computation. The Press Syndicate of the University of Cambridge.
Payne, S. H. (2015). The utility of protein and mRNA correlation. Trends Biochem. Sci. 40, 1–3. doi: 10.1016/j.tibs.2014.10.010
Qin, L., Gao, S., Wang, J., Zheng, R., Lu, J., and Hu, Z. (2013). The conservation and application of three hypothetical protein coding gene for direct detection of Mycobacterium tuberculosis in sputum specimens. PLoS ONE 8:e73955. doi: 10.1371/journal.pone.0073955
Raman, K., and Chandra, N. (2009). Flux balance analysis of biological systems: applications and challenges. Brief. Bioinform. 10, 435–449. doi: 10.1093/bib/bbp011
Reed, J. L. (2012). Shrinking the metabolic solution space using experimental datasets. PLoS Comput. Biol. 8:e1002662. doi: 10.1371/journal.pcbi.1002662
Rienksma, R. A., Suarez-Diez, M., Mollenkopf, H. J., Dolganov, G. M., Dorhoi, A., Schoolnik, G. K., et al. (2015). Comprehensive insights into transcriptional adaptation of intracellular mycobacteria by microbe-enriched dual RNA sequencing. BMC Genomics 16:34. doi: 10.1186/s12864-014-1197-2
Rienksma, R. A., Suarez-Diez, M., Spina, L., and Schaap, P. J. (2014). Systems-level modeling of mycobacterial metabolism for the identification of new (multi-)drug targets. Semin. Immun. 26, 610–622. doi: 10.1016/j.smim.2014.09.013
Rossell, S., Huynen, M. A., and Notebaart, R. A. (2013). Inferring metabolic states in uncharacterized environments using gene-expression measurements. PLoS Comput. Biol. 9:e1002988. doi: 10.1371/journal.pcbi.1002988
Sánchez Benjamín, J., Zhang, C., Nilsson, A., Lahtvee, P. J., Kerkhoven Eduard, J., and Nielsen, J. (2017). Improving the phenotype predictions of a yeast genome–scale metabolic model by incorporating enzymatic constraints. Mol. Syst. Biol. 3:935. doi: 10.15252/msb.20167411
Sartain, M. J., Dick, D. L., Rithner, C. D., Crick, D. C., and Belisle, J. T. (2011). Lipidomic analyses of Mycobacterium tuberculosis based on accurate mass measurements and the novel Mtb LipidDB. J. Lipid Res. 52, 861–872. doi: 10.1194/jlr.M010363
Schellenberger, J., Que, R., Fleming, R. M., Thiele, I., Orth, J. D., Feist, A. M., et al. (2011). Quantitative prediction of cellular metabolism with constraint-based models: the COBRA Toolbox v2.0. Nat. Protoc. 6, 1290–1307. doi: 10.1038/nprot.2011.308
Schnappinger, D., Ehrt, S., Voskuil, M. I., Liu, Y., Mangan, J. A., et al. (2003). Transcriptional adaptation of Mycobacterium tuberculosis within Macrophages: insights into the phagosomal environment. J. Exp. Med. 198, 693–704. doi: 10.1084/jem.20030846
Schuetz, R., Kuepfer, L., and Sauer, U. (2007). Systematic evaluation of objective functions for predicting intracellular fluxes in Escherichia coli. Mol. Syst. Biol. 3:119. doi: 10.1038/msb4100162
Shi, L., Sohaskey, C. D., Pfeiffer, C., Datta, P., Parks, M., McFadden, J., et al. (2010). Carbon flux rerouting during Mycobacterium tuberculosis growth arrest. Mol. Microbiol. 78, 1199–1215. doi: 10.1111/j.1365-2958.2010.07399.x
Shlomi, T., Cabili, M. N. Herrgård, M. J., and Ruppin, E. (2008). Network-based prediction of human tissue-specific metabolism. Nat. Biotech. 26, 1003–1010. doi: 10.1038/nbt.1487
Suárez, M., Tortosa, P., Carrera, J., and Jaramillo, A. (2008). Pareto optimization in computational protein design with multiple objectives. J. Comput. Chem. 29, 2704–2711. doi: 10.1002/jcc.20981
Wipperman, M. F., Sampson, N. S., and Thomas, S. T. (2014). Pathogen roid rage: cholesterol utilization by Mycobacterium tuberculosis. Crit. Rev. Biochem. Mol. Biol. 49, 269–293. doi: 10.3109/10409238.2014.895700
Zimmermann, M., Kogadeeva, M., Gengenbacher, M., McEwen, G., Mollenkopf, J. H., Zamboni, N., et al. (2017). Integration of metabolomics and transcriptomics reveals a complex diet of sMycobacterium tuberculosis during early macrophage infection. mSystems 2:e00057-17. doi: 10.1128/mSystems.00057-17
Keywords: metabolic model, Mycobacterium tuberculosis, systems biology, host-pathogen interaction, condition specific, flux balance analysis
Citation: Rienksma RA, Schaap PJ, Martins dos Santos VAP and Suarez-Diez M (2018) Modeling the Metabolic State of Mycobacterium tuberculosis Upon Infection. Front. Cell. Infect. Microbiol. 8:264. doi: 10.3389/fcimb.2018.00264
Received: 09 March 2018; Accepted: 13 July 2018;
Published: 03 August 2018.
Edited by:
Tunahan Cakir, Gebze Technical University, TurkeyReviewed by:
Christopher Scott Henry, Argonne National Laboratory (DOE), United StatesSayed-Amir Marashi, University of Tehran, Iran
Leonid Chindelevitch, Simon Fraser University, Canada
Copyright © 2018 Rienksma, Schaap, Martins dos Santos and Suarez-Diez. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Maria Suarez-Diez, bWFyaWEuc3VhcmV6ZGllekB3dXIubmw=