Endy:Chassis engineering/Computational load modeling: Difference between revisions

From OpenWetWare
Jump to navigationJump to search
No edit summary
 
(17 intermediate revisions by the same user not shown)
Line 5: Line 5:
#What goals should I set for the modeling work to get some benefit from it without devoting a long period of time to it?
#What goals should I set for the modeling work to get some benefit from it without devoting a long period of time to it?
#What modeling approach should I adopt?
#What modeling approach should I adopt?
#What species should I be considering in the model; what is the scope of the model?


These questions are discussed below:
These questions are discussed below:
Line 11: Line 12:
*Build a simple model of gene expression that considers the finite resources of the cellular chassis and the fraction of those resources consumed by the gene expression process.
*Build a simple model of gene expression that considers the finite resources of the cellular chassis and the fraction of those resources consumed by the gene expression process.
*Construct the simple model of gene expresssion in a modular fashion such that it can be used to model a genetic network.
*Construct the simple model of gene expresssion in a modular fashion such that it can be used to model a genetic network.
*Use the model to examine the benefits of using dedicated systems.
*Use the model to test the network dynamics of a number of possible feedback control configurations for [[VM|VM2.0]].
*Use the model to test the network dynamics of a number of possible feedback control configurations for [[VM|VM2.0]].


Line 16: Line 18:


===Modeling Approach===
===Modeling Approach===
Two main modeling approaches exist - deterministic, continuous models based on differential equations and discrete, stochastic simulation of individual biochemical reactionsBoth of these approaches offer some advantages and disadvantages for me.
Based on the goals laid out above it makes sense to use a continuous, deterministic modeling approachHere are some reasons why -


====Continuous deterministic modeling====
*I have more experience with continuous models than discrete models.
<font color="green">
*Analytical solutions, which I can obtain only from a continuous model, give greater insight.
*I have more experience with this.
*I can do this in Matlab quickly.
*Analytical solutions give greater insight.
*Can do this in Matlab, quite quickly.
*Computationally efficient - easier to get to steady state of the system which is probably the most interesting state.
*Computationally efficient - easier to get to steady state of the system which is probably the most interesting state.
</font>
*I'm not necessarily interested in stochastic effects of small molecule numbers.
<font color="red">
*Hard to model some of the things I'd like to model.  For example if I want to explicitly model the numbers of ribosomes on the transcript I need to create species and reactions like the following '''mRNA.(Ribosome)<sub>n</sub>+Ribosome -> mRNA.(Ribosome)<sub>n+1</sub>'''
*The drawbacks of continuous modeling when the molecule numbers are potentially small.</font>
**<font color="green">A less expicit, but probably adequate approach is just to include the following species - '''RBS''', '''RBS.Ribosome''', '''Ribosome<sub>elong</sub>'''.  This doesn't explicitly model an mRNA with varying numbers of ribosomes on it but the number of '''Ribosome<sub>elong</sub>''' would implicitly tell me how many ribosomes are on each transcript.  Thanks [[Sri Kosuri|Sri]].
</font>


====Discrete stochastic simulation====
===Model Scope===
<font color="green">
It seems to make sense with the smallest, simplest model I can imagine.  I can then build that out, adding species and reactions as appropriate until I get to a sensible stopping point.
*[[TABASCO]]
*Explicitly models a gene expression system's usage of the chassis' resources.
*Accurate simulation of dynamics of small molecule numbers.
</font>
<font color="red">
*I think this would require me to simulate the time evolution of the system even if I only wanted to look at the steady state.  I would also have to run many simulations.
*java.language.ignorance.Barry
*I might gain less intuition and insight from this approach since there would be no analytical solutions.  I could try and do both; simulate discretely while at the same time analytically solving some parts of the system.
</font>


If the dynamics of [[VM|VM2.0]] are the most important thing to get out of the modeling, then a continuous model is probably the best way to go, if I want to do a thorough job of the modeling that lets me look closely at the demand of whatever system I'm modeling, then a discrete simulation might be better.
Initially, I'm going to build a model of the constitutive expression of a gene, incorporating a minimal list of species. I may begin to fill that out somewhat or I may instead begin to add regulation so that I can start to build a model of a gene network, as I need to do for the design of [[Endy:Chassis engineering/VM2.0|VM2.0]].


==Transcription==
==Model Development==
*[[Endy:Computational modeling of demand/species|Species]]
In keeping with the scope of the model, proposed above, I'm starting to fill out the list of species and reactions that I'm going to look at.
*[[Endy:Computational modeling of demand/Species and reactions|Species and reactions]]
*[[Endy:Computational modeling of demand/Parameterization|Model parameterization]]
 
==Results==
===3/22/06===
[[Image:BCantonSimpleModel1.jpg|thumb|right|300px|Simple model of gene expression for E7103 in an ''E. coli'' cell.  Chassis file - BL21VM1.m.  System file - E7104.mat.]]
Using the current version of the model I can model simple gene expression in a chassis with one synthesis channel and one reporter gene.  I can produce time courses for each species and rate of interest.  An example is shown on the right.  The model was parameterized to model the operation of [http://parts.mit.edu/r/parts/partsdb/view.cgi?part_id=5761 E7104] on a 100 copy plasmid in an ''E.coli'' cell containing 1800 T7 polymerases, 5000 ribosomes, with a doubling time of 30mins.
<br style="clear:both" />
 
==Literature==
[[Endy:Computational modeling of demand/Literature]]

Latest revision as of 14:55, 7 March 2007

Introduction

Based on my thesis committee meeting and subsequent conversations with Drew, we decided that aspects of my project would benefit from some modeling work. Firstly, we believe that modeling could better inform the design of feedback control for the dedicated systems of VM2.0. Secondly, we currently lack a clear model of how the demands of an engineered system vary with the system parameters, such as DNA copy number, promoter PoPS, RBS RiPS etc.

Initial questions

  1. What goals should I set for the modeling work to get some benefit from it without devoting a long period of time to it?
  2. What modeling approach should I adopt?
  3. What species should I be considering in the model; what is the scope of the model?

These questions are discussed below:

Modeling Objectives

  • Build a simple model of gene expression that considers the finite resources of the cellular chassis and the fraction of those resources consumed by the gene expression process.
  • Construct the simple model of gene expresssion in a modular fashion such that it can be used to model a genetic network.
  • Use the model to examine the benefits of using dedicated systems.
  • Use the model to test the network dynamics of a number of possible feedback control configurations for VM2.0.

While the third of these goals might be the most immediately important it might be worth proceeding in the order listed here to have a more powerful tool in the longer term. The objectives I choose will partly determine the best modeling approach to use.

Modeling Approach

Based on the goals laid out above it makes sense to use a continuous, deterministic modeling approach. Here are some reasons why -

  • I have more experience with continuous models than discrete models.
  • Analytical solutions, which I can obtain only from a continuous model, give greater insight.
  • I can do this in Matlab quickly.
  • Computationally efficient - easier to get to steady state of the system which is probably the most interesting state.
  • I'm not necessarily interested in stochastic effects of small molecule numbers.

Model Scope

It seems to make sense with the smallest, simplest model I can imagine. I can then build that out, adding species and reactions as appropriate until I get to a sensible stopping point.

Initially, I'm going to build a model of the constitutive expression of a gene, incorporating a minimal list of species. I may begin to fill that out somewhat or I may instead begin to add regulation so that I can start to build a model of a gene network, as I need to do for the design of VM2.0.

Model Development

In keeping with the scope of the model, proposed above, I'm starting to fill out the list of species and reactions that I'm going to look at.

Results

3/22/06

Simple model of gene expression for E7103 in an E. coli cell. Chassis file - BL21VM1.m. System file - E7104.mat.

Using the current version of the model I can model simple gene expression in a chassis with one synthesis channel and one reporter gene. I can produce time courses for each species and rate of interest. An example is shown on the right. The model was parameterized to model the operation of E7104 on a 100 copy plasmid in an E.coli cell containing 1800 T7 polymerases, 5000 ribosomes, with a doubling time of 30mins.

Literature

Endy:Computational modeling of demand/Literature