Team:Minnesota/Modeling

From 2009.igem.org

(Difference between revisions)
 
(5 intermediate revisions not shown)
Line 27: Line 27:
|}
|}
 +
{| style="color:gold;background-color:#800000;" cellpadding="3" cellspacing="1" border="1" bordercolor="#fff" width="90%" align="center"
{| style="color:gold;background-color:#800000;" cellpadding="3" cellspacing="1" border="1" bordercolor="#fff" width="90%" align="center"
{| style="color:gold;background-color:#800000;" cellpadding="3" cellspacing="1" border="1" bordercolor="#fff" width="90%" align="center"
!align="center"|[[Team:Minnesota|<font color="gold">Home</font>]]
!align="center"|[[Team:Minnesota|<font color="gold">Home</font>]]
!align="center"|[[Team:Minnesota/Team|<font color="gold">The Team</font>]]
!align="center"|[[Team:Minnesota/Team|<font color="gold">The Team</font>]]
!align="center"|[[Team:Minnesota/Project|<font color="gold">The Project</font>]]
!align="center"|[[Team:Minnesota/Project|<font color="gold">The Project</font>]]
-
!align="center"|[[Team:Minnesota/Parts|<font color="gold">Submitted Parts</font>]]
+
!align="center"|[[Team:Minnesota/Designer|<font color="gold">SynBioSS Designer</font>]]
!align="center"|[[Team:Minnesota/Modeling|<font color="gold">Modeling</font>]]
!align="center"|[[Team:Minnesota/Modeling|<font color="gold">Modeling</font>]]
-
!align="center"|[[Team:Minnesota/Designer|<font color="gold">SynBioSS Designer</font>]]
+
!align="center"|[[Team:Minnesota/Notebook|<font color="gold">Experimental</font>]]
-
!align="center"|[[Team:Minnesota/Parts Characterization|<font color="gold">Parts Characterization</font>]]
+
!align="center"|[[Team:Minnesota/Parts Characterization|<font color="gold">Competition Requirements</font>]]
-
!align="center"|[[Team:Minnesota/Notebook|<font color="gold">Experiments and Calendar</font>]]
+
-
 
+
|}
|}
Line 43: Line 42:
[[Image:ModelingLab2.jpg|300px|right|The modelers in their natural environment.]]
[[Image:ModelingLab2.jpg|300px|right|The modelers in their natural environment.]]
-
Create mathematical models of our experimental results. Our computational output will guide future research and is the key to determining what is happening at the molecular level of our logical AND gate.
+
The primary goal of the modeling group is to create simulations of our experimental results. Our computational output will guide future research and helps us to understand what is happening at the molecular level of our logical AND gate.
-
It is important to be able to observe the fidelity of the AND gate <i>in silico</i> as well as <i>in vivo</i>.  Using our in-house software suite, SynBioSS, we used networks of sixty or more reactions to model the AND gate with different constructs and mutations. Similar to the experimental procedures, we quantified the fidelity of our AND gate with GFP. We also used a MATLAB Graphical User Interface (GUI) to analyze our experimental data. We used our empirical data from the wet lab to perfect our model.
+
It is important to be able to observe the fidelity of the AND gate <i>in silico</i> as well as <i>in vivo</i>.  Using our in-house software suite, SynBioSS, we will create reaction networks describing the AND gate behavior with different constructs (e.g. TTN, TTL, TNN) and mutations. These reaction networks can then used to create the files necessary for simulation on the supercomputer. Certain kinetic constants in the reaction network can be adjusted to fit the simulation to experimental data. By observing which kinetic constants change with AND-gate construction or mutation we will then be able to understand some of the ways in which the AND-gate functions.
<h2>Mathematical Modeling</h2>
<h2>Mathematical Modeling</h2>
<h3>Overview</h3>
<h3>Overview</h3>
-
[[Image:designgui.jpeg|300px|left|thumb|The MATLAB GUI displays all the reactions and chemical species in the network and allows the user to manipulate all aspects of the system such as reaction rates in order to better mimic experimental results.]]<p>In order to create a model, we must first use a program that will allow us to enter in the contents of the reactions network. The reaction network is formatted as .nc file, which is a special format called NetCDF that works particularly well for entering in the required data. The most basic way to enter in a network is in Linux, as a text file and then converts it to a .nc file. However, this is very tedious and error-prone process. In order to more quickly and accurately create the models, we use one of two programs that have been created for this purpose: SynBioSS or a MATLAB GUI. Each of these programs allows us to graphically see what the model looks like. (See screenshots). <br>   
+
[[Image:designgui.jpeg|300px|left|thumb|The MATLAB GUI displays all the reactions and chemical species in the network and allows the user to manipulate all aspects of the system such as reaction rates in order to better mimic experimental results.]]<p>In order to create a model, we must first use a program that will allow us to enter in the contents of the reactions network. The reaction network is formatted as .nc file, which is a special format called NetCDF that can interact with the group's supercomputer algorithms. The reaction network can be inputted directly into Linux as a text file and then converted into an .nc file. However, this is very tedious and error-prone process. In order to more quickly and accurately create the models, we use one of two programs: SynBioSS or a MATLAB GUI. Each of these programs allows us to graphically see what the model looks like and automatically converts the data describing the system into an .nc file. (See screenshots). <br>   
-
Once the model is created, it is sent to a supercomputer where it is run using hybrid stochastic simulation algorithms created for this type of models. This algorithm takes the initial conditions of the model along with the reactions and kinetic constants and determines the state of the system over time. It does this by using the rate constants along with a random number generator to determine the order that reactions happen in the system. After one reaction happens, it changes the state of the system accordingly, and then repeats the algorithm to find the next reaction. This allows the state of the system (the number of molecules of all species) to change over time, just like real cells do. <br>
+
Once the model is created, it is sent to a supercomputer where it is run using hybrid stochastic simulation algorithms created by the Kaznessis group. This algorithm takes the initial conditions of the model along with the reactions and kinetic constants and determines the state of the system over time. A stochastic model is used because the system that is described is very small. Cells are typically on the order of 1E-15 L. Because of this many molecular species exist in small quantities. When there are low numbers of molecules in a system, discrete changes in the number of molecules drastically change the reaction rates. Stochastic models use random number generators to describe the probabilistic nature of stochastic systems. <br>
-
The results of the models are then removed from the supercomputer and analyzed using MATLAB. It is then possible to see if the model ran correctly and if it gave reasonable results. Depending on how the model compares to the experimental data, kinetic constants are changed and then the process of running a model begins again.<br>
+
The results of the models are then retrieved from the supercomputer and analyzed using MATLAB. It is then possible to see if the model ran correctly and if it gave reasonable results. Depending on how the model compares to the experimental data, kinetic constants are changed and then the process of running a model begins again.<br>
-
After many iterations of this process, the model is as accurate as is possible. The accuracy of the model is determined by the difference in reporter gene expression between the model and the experiments. Calculating this value requires several steps to ensure accuracy. First, the models must be rescaled, so that the model data matches with the experimental numerically. In order to do this, the model is compared to another model, called a baseline expression model, which produces the maximum amount of reporter gene the system can produce. This is done by getting rid of any repressors that could slow down this production. Experimentally, a similar procedure is done which results in the baseline reporter gene expression for the actual cells. Using this data, it is possible to rescale the model and the experimental data to the same standard. After this, the error is found by taking the difference between the experiment and model at all points and squaring it. The smaller this number is, the better the model is. <br>
+
The accuracy of the model is determined by the difference in reporter gene expression between the model and the experiments. In order to make reasonable comparison the models must be rescaled, so that the model data matches with the experimental numerically. Basal expression models, which produce the maximum amount of reporter gene, are created for both the experimental and simulation data. This is done by removing the repressor molecules tetR2 and lacR4 from the system. Using this data, it is possible to rescale the model and the experimental data as a percentage of maximum expression. A least squares regression can then be done to determine the difference between the experimental results and the simulation. <br>
-
Once the best model is found, it can be compared to other models and analysis can be done to determine if there is anything to learn about the AND gate. </p>
+
Once the simulation is as close to the experimental results as possible a more accurate description of the AND-gate function can be constructed. This can be done by observing how changes in kinetic constants affect GFP expression. Or, using experimental data for mutant constructs, the simulations can be re-fit and the mutation's affect on the reaction network can be quantified. </p>
Line 60: Line 59:
<h2>Some Basic Modeling</h2>
<h2>Some Basic Modeling</h2>
-
[[Image:reactionlist.jpg|400px|right|The reaction list in the MATLAB GUI with reactions 3 and 54 highlighted.]]
+
[[Image:reactionlist.jpg|200px|right|The reaction list in the MATLAB GUI with reactions 3 and 54 highlighted.]]
We are working with four different combinations of <i>lac</i> and<i>tet</i> (L and T, respectively) operators in our promoter design project. These constructs are: T--, TT-, TTL and -T-. Here, we will demonstrate a little basic modeling we have done of the TTL construct using SynBioSS MATLAB and a design GUI.  
We are working with four different combinations of <i>lac</i> and<i>tet</i> (L and T, respectively) operators in our promoter design project. These constructs are: T--, TT-, TTL and -T-. Here, we will demonstrate a little basic modeling we have done of the TTL construct using SynBioSS MATLAB and a design GUI.  
Line 66: Line 65:
On the SynBioSS wiki, the TTL model is available for download in a .nc format. Upon opening this with the MATLAB GUI, the list of 65 reactions that constitute the repression and induction of transcription. In this model, there are two <i>tet</i> operators TO1 and TO2 and one <i>lac</i> operator LO1. The reactions included in this list account for the repression and induction of transcription in the absence and presence of aTc and IPTG, respectively. In this simple model, we examined reactions 3 and 54, which involve the <i>tet</i> operators being bound by the repressor. (These reactions are shown in the GUI on the left.) We varied the reactions constants from the original 10<sup>8</sup> as low as 0 and as high as 10<sup>9</sup> to see if the fidelity of the AND gate could depend on the reaction rates for repression. We quantified our results by counting molecules of GFP. This not only gave us a clearer idea of what was happening at the molecular level at this promoter site, but also served as a preparation for matching the experimental results computationally.
On the SynBioSS wiki, the TTL model is available for download in a .nc format. Upon opening this with the MATLAB GUI, the list of 65 reactions that constitute the repression and induction of transcription. In this model, there are two <i>tet</i> operators TO1 and TO2 and one <i>lac</i> operator LO1. The reactions included in this list account for the repression and induction of transcription in the absence and presence of aTc and IPTG, respectively. In this simple model, we examined reactions 3 and 54, which involve the <i>tet</i> operators being bound by the repressor. (These reactions are shown in the GUI on the left.) We varied the reactions constants from the original 10<sup>8</sup> as low as 0 and as high as 10<sup>9</sup> to see if the fidelity of the AND gate could depend on the reaction rates for repression. We quantified our results by counting molecules of GFP. This not only gave us a clearer idea of what was happening at the molecular level at this promoter site, but also served as a preparation for matching the experimental results computationally.
 +
<h2>Calendar</h2>
{|  align="center"
{|  align="center"
{| style="color:gold;background-color:#800000;" cellpadding="3" cellspacing="1" border="10" bordercolor="#00CCFF" width="62%" align="center"
{| style="color:gold;background-color:#800000;" cellpadding="3" cellspacing="1" border="10" bordercolor="#00CCFF" width="62%" align="center"
Line 72: Line 72:
!align="center"|{{#calendar: title=Minnesota |year=2009 | month=08}}
!align="center"|{{#calendar: title=Minnesota |year=2009 | month=08}}
!align="center"|{{#calendar: title=Minnesota |year=2009 | month=09}}
!align="center"|{{#calendar: title=Minnesota |year=2009 | month=09}}
-
!align="center"|{{#calendar: title=Minnesota |year=2009 | month=10}}
 
|}
|}
 +
 +
<h2>Results</h2>
 +
Place results of models here.

Latest revision as of 19:22, 7 August 2009


Mnlogo.jpg
Home The Team The Project SynBioSS Designer Modeling Experimental Competition Requirements


Contents

Our Modeling Goals

The modelers in their natural environment.

The primary goal of the modeling group is to create simulations of our experimental results. Our computational output will guide future research and helps us to understand what is happening at the molecular level of our logical AND gate. It is important to be able to observe the fidelity of the AND gate in silico as well as in vivo. Using our in-house software suite, SynBioSS, we will create reaction networks describing the AND gate behavior with different constructs (e.g. TTN, TTL, TNN) and mutations. These reaction networks can then used to create the files necessary for simulation on the supercomputer. Certain kinetic constants in the reaction network can be adjusted to fit the simulation to experimental data. By observing which kinetic constants change with AND-gate construction or mutation we will then be able to understand some of the ways in which the AND-gate functions.

Mathematical Modeling

Overview

The MATLAB GUI displays all the reactions and chemical species in the network and allows the user to manipulate all aspects of the system such as reaction rates in order to better mimic experimental results.

In order to create a model, we must first use a program that will allow us to enter in the contents of the reactions network. The reaction network is formatted as .nc file, which is a special format called NetCDF that can interact with the group's supercomputer algorithms. The reaction network can be inputted directly into Linux as a text file and then converted into an .nc file. However, this is very tedious and error-prone process. In order to more quickly and accurately create the models, we use one of two programs: SynBioSS or a MATLAB GUI. Each of these programs allows us to graphically see what the model looks like and automatically converts the data describing the system into an .nc file. (See screenshots).

Once the model is created, it is sent to a supercomputer where it is run using hybrid stochastic simulation algorithms created by the Kaznessis group. This algorithm takes the initial conditions of the model along with the reactions and kinetic constants and determines the state of the system over time. A stochastic model is used because the system that is described is very small. Cells are typically on the order of 1E-15 L. Because of this many molecular species exist in small quantities. When there are low numbers of molecules in a system, discrete changes in the number of molecules drastically change the reaction rates. Stochastic models use random number generators to describe the probabilistic nature of stochastic systems.
The results of the models are then retrieved from the supercomputer and analyzed using MATLAB. It is then possible to see if the model ran correctly and if it gave reasonable results. Depending on how the model compares to the experimental data, kinetic constants are changed and then the process of running a model begins again.
The accuracy of the model is determined by the difference in reporter gene expression between the model and the experiments. In order to make reasonable comparison the models must be rescaled, so that the model data matches with the experimental numerically. Basal expression models, which produce the maximum amount of reporter gene, are created for both the experimental and simulation data. This is done by removing the repressor molecules tetR2 and lacR4 from the system. Using this data, it is possible to rescale the model and the experimental data as a percentage of maximum expression. A least squares regression can then be done to determine the difference between the experimental results and the simulation.

Once the simulation is as close to the experimental results as possible a more accurate description of the AND-gate function can be constructed. This can be done by observing how changes in kinetic constants affect GFP expression. Or, using experimental data for mutant constructs, the simulations can be re-fit and the mutation's affect on the reaction network can be quantified.




Some Basic Modeling

The reaction list in the MATLAB GUI with reactions 3 and 54 highlighted.

We are working with four different combinations of lac andtet (L and T, respectively) operators in our promoter design project. These constructs are: T--, TT-, TTL and -T-. Here, we will demonstrate a little basic modeling we have done of the TTL construct using SynBioSS MATLAB and a design GUI.

On the SynBioSS wiki, the TTL model is available for download in a .nc format. Upon opening this with the MATLAB GUI, the list of 65 reactions that constitute the repression and induction of transcription. In this model, there are two tet operators TO1 and TO2 and one lac operator LO1. The reactions included in this list account for the repression and induction of transcription in the absence and presence of aTc and IPTG, respectively. In this simple model, we examined reactions 3 and 54, which involve the tet operators being bound by the repressor. (These reactions are shown in the GUI on the left.) We varied the reactions constants from the original 108 as low as 0 and as high as 109 to see if the fidelity of the AND gate could depend on the reaction rates for repression. We quantified our results by counting molecules of GFP. This not only gave us a clearer idea of what was happening at the molecular level at this promoter site, but also served as a preparation for matching the experimental results computationally.

Calendar

June
MTWTFSS
1 2 3 4 5 6 7
8 9 10 11 12 13 14
15 16 17 18 19 20 21
22 23 24 25 26 27 28
29 30
July
MTWTFSS
    1 2 3 4 5
6 7 8 9 10 11 12
13 14 15 16 17 18 19
20 21 22 23 24 25 26
27 28 29 30 31
August
MTWTFSS
          1 2
3 4 5 6 7 8 9
10 11 12 13 14 15 16
17 18 19 20 21 22 23
24 25 26 27 28 29 30
31
September
MTWTFSS
  1 2 3 4 5 6
7 8 9 10 11 12 13
14 15 16 17 18 19 20
21 22 23 24 25 26 27
28 29 30

Results

Place results of models here.