Book a Demo!
CoCalc Logo Icon
StoreFeaturesDocsShareSupportNewsAboutPoliciesSign UpSign In
Download
29547 views
1
2
3
4
5
Introduction
6
The knowledge of a complete genome sequence holds the
7
potential to reveal the 'blueprints' for cellular life. The
8
genome sequence contains the information to propagate the
9
living system, and this information exists as open reading
10
frames (ORFs) and regulatory information. Computational
11
approaches have been developed (and are continuously being
12
improved) to decipher the information encoded in the DNA [
13
1, 2, 3, 4, 5, 6, 7]. However, it is becoming evident that
14
cellular functions are intricate and the integrated
15
function of biological systems involves many complex
16
interactions among the molecular components within the
17
cell. To understand the complexity inherent in cellular
18
networks, approaches that focus on the systemic properties
19
of the network are also required.
20
The complexity of integrated cellular systems leads to
21
an important point, namely that the properties of complex
22
biological processes cannot be analyzed or predicted based
23
solely on a description of the individual components, and
24
integrated systems based approaches must be applied [ 8].
25
The focus of such research represents a departure from the
26
classical
27
reductionist approach in the
28
biological sciences, and moves toward the
29
integrated approach to understanding
30
the interrelatedness of gene function and the role of each
31
gene in the context of multi genetic cellular functions or
32
genetic circuits [ 8, 9, 10].
33
The engineering approach to analysis and design of
34
complex systems is to have a mathematical or computer
35
model; e.g. a dynamic simulator of a cellular process that
36
is based on fundamental physicochemical laws and
37
principles. Herein, we will analyze the integrated function
38
of the metabolic pathways, and there has been a long
39
history of mathematical modeling of metabolic networks in
40
cellular systems, which dates back to the 1960s [ 11, 12].
41
While the ultimate goal is the development of dynamic
42
models for the complete simulation of cellular metabolism,
43
the success of such approaches has been severely hampered
44
by the lack of kinetic information on the dynamics and
45
regulation of metabolism. However, in the absence of
46
kinetic information it is still possible to assess the
47
theoretical capabilities and operative modes of metabolism
48
using flux balance analysis (FBA) [ 10, 13, 14, 15, 16,
49
17].
50
We have developed an
51
in silico representation of
52
Escherichia coli (E. coli in
53
silico) to describe the bacterium's metabolic
54
capabilities [ 18].
55
E. coli in silico was derived based
56
on the annotated genetic sequence [ 19], biochemical
57
literature [ 20], and the online bioinformatic databases [
58
21, 22, 23]. The properties of
59
E. coli in silico were analyzed and
60
compared to the
61
in vivo properties of
62
E. coli, and it was shown that
63
E. coli in silico can be used to
64
interpret the metabolic phenotype of many
65
E. coli mutants [ 18]. However, the
66
utilization of the metabolic genes is dependent on the
67
carbon source and the substrate availability [ 24, 25].
68
Thus, the mutant phenotype is also dependent on specific
69
environmental parameters. Therefore, herein we have
70
utilized
71
E. coli in silico to computationally
72
examine the condition dependent optimal metabolic pathway
73
utilization, and we will show that the FBA can be used to
74
analyze and interpret the metabolic behavior of wildtype
75
and mutant
76
E. coli strains.
77
78
79
Materials and Methods
80
81
Flux balance analysis
82
All biological processes are subjected to physico
83
chemical constraints (such as mass balance, osmotic
84
pressure, electro neutrality, thermodynamic, and other
85
constraints). As a result of decades of metabolic
86
research and the recent genome sequencing projects, the
87
mass balance constraints on cellular metabolism can be
88
assigned on a genome scale for a number of organisms.
89
Methods have been developed to analyze the metabolic
90
capabilities of a cellular system based on the mass
91
balance constraints and this approach is known as flux
92
balance analysis (FBA) [ 13, 14, 16] (see the
93
supplementary information for an FBA primer). The mass
94
balance constraints in a metabolic network can be
95
represented mathematically by a matrix equation:
96
S • v = 0 Equation 1
97
The matrix
98
S is the
99
mxn stoichiometric matrix, where
100
m is the number of metabolites and
101
n is the number of reactions in the
102
network (The
103
E. coli stoichiometric matrix is
104
available in matrix format in the supplementary
105
information and in a reaction list in Appendices 1-3).
106
The vector
107
v represents all fluxes in the
108
metabolic network, including the internal fluxes,
109
transport fluxes and the growth flux.
110
For the
111
E. coli metabolic network
112
represented by Eqn. 1, the number of fluxes was greater
113
than the number of mass balance constraints; thus, there
114
were multiple feasible flux distributions that satisfied
115
the mass balance constraints, and the solutions (or
116
feasible metabolic flux distributions) were confined to
117
the nullspace of the matrix
118
S .
119
In addition to the mass balance constraints, we
120
imposed constraints on the magnitude of individual
121
metabolic fluxes.
122
α
123
124
i
125
≤ v
126
127
i
128
≤ β
129
130
i
131
Equation 2
132
The linear inequality constraints were used to enforce
133
the reversibility of each metabolic reaction and the
134
maximal flux in the transport reactions. The
135
reversibility constraints for each reaction are indicated
136
online. The transport flux for inorganic phosphate,
137
ammonia, carbon dioxide, sulfate, potassium, and sodium
138
was unrestrained (α
139
140
i
141
= -∞ and β
142
143
i
144
= ∞). The transport flux for the other metabolites,
145
when available in the
146
in silico medium, was constrained
147
between zero and the maximal level (0 ≤ v
148
149
i
150
≤ v
151
152
i
153
max). The v
154
155
i
156
157
max values used in the simulations
158
are noted for each simulation (Fig. 1). When a metabolite
159
was not available in the medium, the transport flux was
160
constrained to zero. The transport flux for metabolites
161
capable of leaving the metabolic network (i.e. acetate,
162
ethanol, lactate, succinate, formate, and pyruvate) was
163
always unconstrained in the net outward direction.
164
The intersection of the nullspace and the region
165
defined by the linear inequalities defined a region in
166
flux space that we will refer to as the feasible set, and
167
the feasible set defined the capabilities of the
168
metabolic network subject to the imposed cellular
169
constraints. It should be noted that every vector
170
v within the feasible set is not
171
reachable by the cell under a given condition due to
172
other constraints not considered in the analysis (i.e.
173
maximal internal fluxes and gene regulation). The
174
feasible set can be further reduced by imposing
175
additional constraints (i.e. kinetic or gene regulatory
176
constraints), and in the limiting condition where all
177
constraints are known, the feasible set may reduce to a
178
single point.
179
A particular metabolic flux distribution within the
180
feasible set (vector
181
v which satisfies the constraints in
182
Eqns. 1 and 2) was found using linear programming (LP). A
183
commercially available LP package was used (LINDO, Lindo
184
Systems Inc., Chicago, II). LP identified a solution that
185
minimized a metabolic objective function (subject to the
186
imposed constraints- Eqns. 1 and 2) [ 16, 48, 49], and
187
was formulated as shown below:
188
Minimize -Z
189
where
190
Z = Σ
191
c
192
193
i
194
v
195
196
i
197
= <c • v> Equation 3
198
The vector
199
c was used to select a linear
200
combination of metabolic fluxes to include in the
201
objective function [ 50]. Herein,
202
c was defined as the unit vector in
203
the direction of the growth flux, and the growth flux was
204
defined in terms of the biosynthetic requirements:
205
(Equation 4)
206
where
207
d
208
209
m
210
is the biomass composition of metabolite X
211
212
m
213
(we used a constant biomass composition defined from
214
the literature [ 51] (see Appendix 4)), and the growth
215
flux was modeled as a single reaction that converts all
216
the biosynthetic precursors into biomass.
217
218
219
Phenotype Phase Plane Analysis
220
All feasible
221
E. coli in silico metabolic flux
222
distributions are mathematically confined to the feasible
223
set, which is a region in flux space ( n), where each
224
solution in this space corresponds to a feasible
225
metabolic flux distribution.
226
Phenotype Phase Plane (PhPP): A PhPP is a
227
two-dimensional projection of the feasible set, and below
228
we will briefly discuss the formalism for constructing
229
the PhPP. Two parameters that describe the growth
230
conditions (such as substrate and oxygen uptake rates)
231
were defined as the two axes of the two dimensional
232
space. The optimal flux distribution was calculated
233
(using LP) for all points in this plane by solving the LP
234
problem while adjusting the exchange flux constraints
235
(defining the two-dimensional space). A finite number of
236
qualitatively different patterns of metabolic pathway
237
utilization were identified in such a plane, and lines
238
were drawn to demarcate these regions. Each region is
239
denoted by Pn
240
x, y , where 'P' indicates that the
241
region was defined by a phenotype phase plane analysis,
242
'n' denotes the number of the demarcated phase (as shown
243
in a particular PhPP figure), and 'x, y' denotes the two
244
uptake rates on the axis of the PhPP. PhPPs were also
245
generated for mutant genotypes; represented as P genen
246
x, y .
247
One demarcation line in the PhPP was defined as the
248
line of optimality (LO). The LO represents the optimal
249
relation between exchange fluxes defined on the axes of
250
the PhPP.
251
252
253
Alterations of the genotype
254
FBA and
255
E. coli in silico were used to
256
examine the systemic effects of in
257
silico gene deletions. The genes
258
involved in the central metabolic pathways (glycolysis,
259
pentose phosphate pathway, TCA cycle, electron transport)
260
were subjected to removal from
261
E. coli in silico. To simulate a
262
gene deletion, all metabolic reactions catalyzed by a
263
given gene product were simultaneously constrained to
264
zero. Some metabolic reactions were catalyzed by more
265
than one enzyme, and all genes that code for enzymes that
266
catalyze a given reaction were simultaneously removed
267
(i.e.
268
rpiAB ). Furthermore, all genes
269
that make up an enzyme complex were also simultaneously
270
removed (i.e.
271
sdhABCD ).
272
The optimal metabolic flux distribution for the
273
generation of biomass was calculated for each
274
in silico deletion strain. The
275
in silico gene deletion analysis
276
was performed with the transport flux constraints defined
277
by the wild-type PhPP. The constraints imposed for each
278
simulation are noted in Fig. 1.
279
For each
280
in silico deletion strain, the
281
optimal production of the twelve biosynthetic precursors
282
and the metabolic cofactors was also calculated to
283
identify auxotrophic requirements and impaired functions
284
in the metabolic network (Table 1). The optimal
285
production of the biosynthetic precursors was calculated
286
by setting the objective function to the drain of a
287
single metabolite (i.e., ATP → ADP + P
288
i , or PEP →). The numerical value of
289
the objective function for each
290
in silico deletion strain was
291
reported as a fraction of the wild-type optimal value
292
(Table 1).
293
294
295
296
Results
297
We have previously described the construction of
298
phenotype phase planes (PhPPs) (see materials and methods)
299
and the analysis of the glucose-oxygen PhPP. We have
300
previously described the effect of
301
in silico 'gene deletions' on the
302
ability of
303
E. coli in silico to 'grow' under a
304
single condition [ 18]. Since the utilization of the
305
metabolic pathways is condition dependent, herein, we have
306
investigated the link between the environmental conditions
307
and the optimal metabolic pathway utilization
308
in silico by: 1. studying the effects
309
of gene deletions in all phases of the glucose-oxygen PhPP,
310
and 2. broadening the analysis of
311
in silico deletion strains by
312
comparing PhPPs from isogenic
313
in silico strains.
314
315
Gene Deletions : A point within each
316
phase of the glucose-oxygen PhPP was chosen to define the
317
transport flux constraints (indicated in Fig. 1) for the
318
FBA simulations. At each point, the growth characteristics
319
of all
320
in silico gene deletion strains (of
321
central metabolic pathway genes) were examined. Based on
322
the results, the genes were categorized as;
323
essential (growth under the defined
324
condition requires the activity of the corresponding gene
325
product),
326
critical (growth at a reduced yield
327
(< 95% of wild-type)), or
328
non-essential (growth at near
329
wild-type yield (> 95%)). The effects of the
330
in silico gene deletions were
331
phase-dependent, allowing us to identify optimal growth
332
phenotypes for each growth condition. Additionally, the
333
optimal production of the 12 biosynthetic precursors,
334
high-energy phosphate bonds, and redox potential was
335
calculated for each
336
in silico deletion strain (Table 1)
337
to determine the specific effect of the gene deletion on
338
the metabolic capabilities. For instance, the
339
in silico acnAB' strain was unable to
340
synthesize α-ketoglutarate under all simulated growth
341
conditions, and thus,
342
acnAB was defined as essential for
343
growth in a glucose minimal media (Table 1).
344
The optimal utilization of the metabolic pathways was
345
dependent on the specific transport flux constraints, and
346
the qualitative shifts in optimal metabolic behavior as a
347
function of two transport fluxes are shown in the PhPP. The
348
optimal biomass yield and biosynthetic precursor production
349
capabilities were calculated for each
350
E. coli in silico deletion strain for
351
a point within each region of the PhPP, and the optimal
352
values were normalized to the wild-type (Fig. 1). The
353
condition dependent metabolic phenotypes were
354
computationally analyzed, and the results are organized by
355
the overall metabolic phenotype;
356
essential, conditionally
357
essential, or
358
non-essential genes.
359
360
Essential genes : The gene products
361
that were essential for growth with conditions defined by
362
the line of optimality (LO) (see materials and methods)
363
were also identified as essential within all other phases (
364
365
acnAB, gapAC, gltA, icdA, pgk, rpiAB,
366
tktAB ). Specifically, the
367
gltA -,
368
icdA -, and
369
acnAB -
370
in silico deletion strains were
371
unable to produce one biosynthetic precursor
372
(α-ketoglutarate, Table 1), and retained the capability to
373
synthesize the remaining biosynthetic precursors and
374
cofactors nearly equivalent to the wild-type. This
375
prediction is consistent with the defined media required
376
for the cultivation of
377
aglt -
378
E. coli mutant strain (glucose
379
minimal media supplemented with glutamine or proline) [
380
26]. Furthermore, the essential glycolytic gene products (
381
pgk, gapAC ) were required for the
382
synthesis of oxaloacetate, succinyl-CoA, α-ketoglutarate,
383
pyruvate, phosphoenolpyruvate (PEP), and 3-phosphoglycerate
384
within all conditions, and were unable to synthesize all
385
biosynthetic precursors under anaerobic growth conditions.
386
The remaining two essential gene products were in the
387
pentosephosphate pathway (
388
tktAB, rpiAB ). The
389
tktAB and
390
rpiAB gene products were required for
391
the synthesis of erythrose 4-phosphate in all phases
392
(aromatic amino acid supplement required for the
393
cultivation of
394
tkt -
395
E. coli mutant strains [ 27]).
396
Additionally,
397
rpiAB -strains were identified as
398
ribose auxotrophs by the
399
in silico analysis, which was
400
consistent with experimental data [ 28].
401
402
Conditionally essential genes :
403
During the growth simulations with external parameters
404
defined by the LO, there were genes defined as critical for
405
growth; however, many of these genes were essential for
406
cellular growth upon oxygen limitations (
407
fba, pfkAB, tpiA, eno, gpmAB ). These
408
genes were termed conditionally essential. The
409
fba -,
410
pfkAB -, and
411
tpiA -
412
in silico deletion strains had a
413
limited capability to synthesize glyceraldehyde
414
3-phosphate, 3-phosphoglycerate, phosphoenolpyruvate,
415
pyruvate, acetyl-CoA, α-ketoglutarate, succinyl-CoA,
416
oxaloacetate, and high-energy phosphate bonds in all
417
phases, and were completely unable to synthesize many of
418
the biosynthetic precursors in phases 46 (Table 1) (
419
tpi -
420
in silico strain discussed below).
421
The growth potential of the
422
eno -and
423
gpmAB -
424
in silico deletion strains was
425
theoretically maintained under aerobic conditions by the
426
synthesis and degradation of serine, and without the serine
427
degradation pathway, the
428
eno -and
429
gpmAB -gene products were defined as
430
essential. However, the
431
eno -and
432
gpmAB -
433
in silico deletion strains were
434
limited in their production capability of high-energy
435
phosphate bonds under all conditions, and were unable to
436
produce any of the biosynthetic precursors in phase 6 even
437
with the serine degradation pathway.
438
Additionally, several LO non-essential gene products
439
were essential (
440
sdhABCD, ppc, frdABCD ) for growth
441
within other phases. The
442
in silico analysis suggested that the
443
444
sdhABCD and
445
frdABCD gene products were required
446
for anaerobic pyrimidine biosynthesis. Additionally, the
447
frdABCD gene products were essential
448
for the anaerobic synthesis of the NAD cofactor. However,
449
these
450
in silico results could be due to
451
inaccurate stoichiometric information with respect to
452
cofactor utilization and should be critically examined.
453
Finally, the
454
ppc gene product was required for the
455
anaerobic synthesis of oxaloacetate and α-ketoglutarate,
456
but the
457
in silico analysis suggests that this
458
gene product is not essential for growth in aerobic
459
conditions where the glyoxylate by-pass has the potential
460
to replenish the biosynthetic precursors [ 29].
461
462
Non-essential genes : Several genes
463
that are critical for growth in conditions defined by the
464
LO were non-essential for growth in other phases (
465
nuo ,
466
cyoABCD, fumABC ). The
467
in silico nuo -and
468
cyoABCD -deletion strains were
469
limited in their production capabilities of high-energy
470
phosphate bonds for aerobic growth; however, under
471
anaerobic conditions high-energy phosphate bonds were
472
produced by substrate level phosphorylation. The production
473
capabilities of the
474
fumABC -
475
in silico deletion strain was not
476
limited with respect to the biosynthetic precursors shown
477
in the table (other than a slight limitation of ATP
478
production in P1
479
glucose, oxygen ). However, the
480
fumABC -
481
in silico deletion strain was limited
482
in its production capabilities of several amino acids (arg,
483
gly, his- not shown in table), but under anaerobic
484
conditions, these capabilities were not limited with
485
respect to the wild-type.
486
Several LO non-essential gene products were critical (
487
pgi, pta, ackAB ) for growth within
488
other phases. The
489
in silico pgi deletion strain had a
490
reduced capacity to produce all the biosynthetic precursors
491
under oxygen limitation, and this resulted in a decreased
492
normalized growth yield of this
493
in silico deletion strain. The
494
pta and
495
ackAB gene products participate in
496
the metabolic pathway leading to the formation of acetate.
497
Acetate was predicted as a metabolic by-product upon oxygen
498
limitations (all phases below the LO). Under conditions
499
defined by P5-6
500
glucose, oxygen , the production
501
capabilities of several of the biosynthetic precursors
502
(glucose 6-phosphate, fructose 6-phosphate, ribose
503
5-phosphate, erythrose 5-phosphate, glyceraldehyde
504
3-phosphate) were limited in the
505
pta and
506
ackAB in silico deletion strains (
507
pta -
508
In silico deletion strain discussed
509
below).
510
This sub-section illustrated the condition-dependent
511
effect of gene deletions on the metabolic
512
genotype-phenotype relation. The results covered the range
513
of substrate uptake rates and defined the optimal metabolic
514
pathway utilization of isogenic strains
515
in silico under different
516
combinations of environmental parameters. The optimal
517
utilization of the metabolic pathways was dependent on the
518
metabolic genotype; thus, different metabolic genotypes are
519
characterized by different PhPPs. The results presented
520
above provide insight into the genotype phenotype relation.
521
Next, we will compare the PhPPs from
522
in silico deletion strains to the
523
wild-type to provide a more complete definition of optimal
524
phenotypes.
525
526
in silico Deletion Strain Phenotype Phase
527
Plane Analysis : Comparative analysis of the phase
528
planes for several mutant strains (
529
tpi -
530
, pta -
531
, and
532
zwf ) were performed. These case
533
studies were chosen to further investigate the metabolic
534
genotype-phenotype relation
535
in silico and to demonstrate the use
536
of FBA to interpret and analyze cellular metabolism.
537
538
tpi : The
539
tpi -PhPP showed 3 distinct optimal
540
metabolic phenotypes- one glucose limited phase (P
541
tpi 2
542
glucose, oxygen ), and two futile phases
543
(Fig. 2A). Futile phases are characterized by a negative
544
effect of one of the substrates on the objective function.
545
One of the futile phases was due to excess oxygen (P
546
tpi 1
547
glucose, oxygen ) and the other was due
548
to excess glucose (P
549
tpi 3
550
glucose, oxygen ). Although the
551
tpi -
552
in silico metabolic genotype
553
theoretically supported biomass production, the feasible
554
steady states were restricted to a limited phase of the
555
phase plane and the flexibility of the metabolic network
556
was reduced to one dimension.
557
The optimal utilization of
558
the tpi -metabolic network under
559
environmental conditions defined by the LO
560
tpi was characterized by increased
561
PPP fluxes to bypass the TPI block. The PPP operated
562
cyclically; thus, leading to a high production of NADPH.
563
Due to the high NADPH production in the PPP, the TCA cycle
564
flux was optimally reduced and functioned only to produce
565
the biosynthetic precursors.
566
The
567
in silico analysis suggests that the
568
tpi -metabolic network was restricted
569
by the ability to regenerate phosphoenolpyruvate (PE) for
570
the PTS, and the
571
in silico analysis identified 3
572
metabolic 'routes' for the regeneration of PEP. Two of the
573
'routes' were equivalent (alternate optimal solutions), (1)
574
The PEP was regenerated by the phosphoenolpyruvate synthase
575
(PPS), or (2) the glactose transporter was used for the
576
transport of glucose which was subsequently phosphorylated
577
by the glucokinase reaction. These two routes were
578
equivalent with respect to the objective function (although
579
they were structurally different). The third PEP
580
regeneration route involved the glyoxylate bypass and the
581
phosphoenolpyruvate carboxykinase, and this route was
582
characterized by a 38% reduction in the optimal biomass
583
yield. Furthermore, experimentally it was shown that
584
constitutive expression of the glyoxylate bypass suppressed
585
the PEP deficient phenotype [ 30, 31]. The PEP regeneration
586
routes (discussed above) theoretically allow the
587
tpi -to grow, and one of these
588
solutions was required for the growth of the
589
tpi -
590
in silico strain.
591
592
zwf :
593
zwf codes for glucose-6-phosphate
594
dehydrogenase (G6PDH), the first enzyme in the oxidative
595
branch of the PPP.
596
zwf has been shown to be a
597
non-essential gene for the growth of
598
E. coli in glucose minimal media, and
599
600
zwf strains grow at near wild-type
601
growth rates [ 32].
602
zwf was predicted by FBA to be a
603
non-essential gene for growth in glucose minimal media
604
(Fig. 1). We conducted a phenotype phase plane analysis of
605
the
606
zwf strain and examined the systemic
607
metabolic function of
608
zwf and its relation to the
609
environmental conditions
610
in silico (Fig. 2B). The slope of the
611
LO
612
zwf slightly increased (relative to
613
the wild-type), indicating a higher oxygen:glucose ratio
614
for optimal growth. Removing the G6PDH from the metabolic
615
network eliminated all metabolic pathways that utilized the
616
oxidative branch of the PPP. Therefore, the
617
zwf PhPP was significantly changed in
618
the phases that utilized the oxidative branch of the PPP (P
619
620
zwf 2
621
glucose, oxygen and P
622
zwf 3
623
glucose, oxygen ) but was unchanged in
624
phases that did not optimally utilize the
625
zwf gene product (P
626
zwf 4
627
glucose, oxygen ).
628
629
pta : Acetate excretion is a common
630
characteristic of
631
E. coli metabolism and several
632
approaches have been applied to reduce acetate production
633
to improve the productivity of engineering strains [ 33,
634
34, 35]. Acetate production can be interpreted using FBA [
635
36, 37], and we have used a phase plane analysis to
636
quantitatively analyze the conditions for which acetate
637
excretion optimally occurs. Acetate was optimally excreted
638
from the cell within all phases of the glucose-oxygen PhPP
639
below the LO. We have generated the
640
pta -PhPP and analyzed the metabolic
641
characteristics of the in
642
silico pta -strain (Fig. 2C). The
643
pta -PhPP indicated that this mutant
644
strain maintained the potential to support growth (both
645
aerobically and anaerobically). Experimentally,
646
the pta -
647
E. coli strain has been shown to grow
648
aerobically and anaerobically on glucose minimal media [
649
38]. The
650
in silico analysis predicted that
651
the pta -strain optimally shifted the
652
carbon flux from acetate to ethanol in P
653
pta 3. However, in P
654
pta 4, the optimal metabolic
655
by-products included lactate, ethanol, and pyruvate, and
656
under completely anaerobic conditions, succinate was also
657
optimally produced as a metabolic byproduct. These
658
metabolic byproducts were qualitatively consistent with
659
experimental observations in the
660
pta -strain [ 38].
661
662
663
Discussion
664
The rapid development of bioinformatic databases is
665
resulting in extensive information about the molecular
666
composition and function of several single cellular
667
organisms. These genetic and biochemical databases [ 21,
668
23, 39] have now been developed to the point where the
669
methods of systems science need to be used to analyze,
670
interpret, and predict the integrated behavior of complex
671
multigeneic biological processes. Herein, we have utilized
672
an
673
in silico representation of
674
E. coli to study the condition
675
dependent phenotype of
676
E. coli and central metabolism gene
677
deletion strains. We have shown that a computational
678
analysis of the metabolic behavior can provide valuable
679
insight into cellular metabolism. The results presented
680
herein address a pressing question in the post-genome era;
681
how can genome sequence information be
682
used to analyze integrated cellular functions? Given
683
the central importance of this question, we will discuss
684
the general applicability, limitations, and future
685
prospects for FBA and functional genomics.
686
The FBA metabolic modeling framework is different than
687
other well-known metabolic modeling approaches. FBA can
688
more accurately be defined as a metabolic constraining
689
approach, this is because FBA defines the 'best' the cell
690
can do, rather than predicting the metabolic behavior. To
691
accomplish this, we have constrained metabolic function
692
based on the most reliable information, the metabolic
693
stoichiometry (the stoichiometry is well known for the vast
694
majority of the metabolic processes). However, FBA does
695
have predictive capabilities when a physiologically
696
meaningful objective function can be defined, and the
697
E. coli FBA results, with maximal
698
growth rate as the objective function, have been shown to
699
be consistent with experimental data under nutritionally
700
rich conditions [ 40]. It should be mentioned that FBA does
701
not directly consider regulation, or the regulatory
702
constraints on the metabolic network, but rather FBA
703
assumes that the regulation is such that metabolic behavior
704
is optimal. This assumption produces results that are
705
generally consistent with experimental data, however, this
706
assumption is only valid for a system that has evolved
707
toward optimality. In mutant strains, the regulation of the
708
metabolic network has not evolved to operate in an optimal
709
fashion. Therefore, the optimal utilization of the mutant
710
metabolic network does not necessarily correspond to the
711
in vivo utilization of the metabolic
712
network. Computational analysis of metabolic processes,
713
coupled to an experimental program may provide valuable
714
information regarding the regulatory structure of metabolic
715
networks, and will provide a challenge for future
716
computational studies coupled to highly parallel
717
experimental programs, such as large-scale mutation studies
718
[ 41].
719
Currently, about one-third of the
720
E. coli open reading frames do not
721
have a functional assignment. Thus, the metabolic network
722
studied here is incomplete and does not account for all the
723
metabolic processes carried out by
724
E. coli. However, we have used the
725
biochemical literature to refine the
726
in silico metabolic genotype and
727
given the long history of
728
E. coli metabolic research [ 20], a
729
large percentage of the
730
E. coli metabolic capabilities have
731
likely been identified. However, when additional metabolic
732
capabilities are discovered [ 42], the
733
E. coli stoichiometric matrix can be
734
updated, leading to an iterative model building process.
735
Furthermore, inconsistencies between the model and
736
experimental data may help point to unidentified metabolic
737
functions. Additionally, the
738
in silico analysis can help identify
739
missing or incorrect functional assignments; for example,
740
by identifying sets of metabolic reactions that are not
741
connected to the metabolic network by the mass balance
742
constraints.
743
The study presented herein is an example of the rapidly
744
growing field of
745
in silico biology. It is clear that
746
computer modeling and simulations must be used iteratively
747
with an experimental program to continually improve
748
in silico models and to develop
749
systemic understanding of cellular functions. Thus, an
750
in silico analysis can be used to
751
define an experimental program. For example, the ability to
752
construct well-defined knockout strains of
753
E. coli [ 43] opens the possibility
754
to critically evaluate the relation between the
755
in silico representation of mutant
756
behavior and the
757
in vivo metabolic network under
758
well-defined genetic and environmental conditions for
759
strategically chosen genes. This possibility is
760
particularly timely, given the increasing number of genome
761
scale measurements that are now possible, through 2D gels [
762
44, 45] and DNA array technology [ 46, 47].
763
764
765
Conclusions
766
Herein, we have utilized an
767
in silico representation of
768
E. coli to study the condition
769
dependent phenotype of
770
E. coli and central metabolism gene
771
deletion strains. We have shown that a computational
772
analysis of the metabolic behavior can provide valuable
773
insight into cellular metabolism. The present
774
in silico study builds on the ability
775
to define metabolic genotypes in bacteria and mathematical
776
methods to analyze the possible and optimal phenotypes that
777
they can express. These capabilities open the possibility
778
to perform
779
in silico deletion studies to help
780
sort out the complexities of
781
E. coli mutant phenotypes.
782
783
784
785
786