Team:Alberta/Project/Gene Selection
From 2009.igem.org
Rpgguardian (Talk | contribs) |
|||
(27 intermediate revisions not shown) | |||
Line 1: | Line 1: | ||
- | {{:Team:Alberta/ | + | {{:Team:Alberta/TemplateSc}} |
+ | |||
<html> | <html> | ||
<head> | <head> | ||
<style type="text/css"> | <style type="text/css"> | ||
.b1f, .b2f, .b3f, .b4f{font-size:1px; overflow:hidden; display:block;} | .b1f, .b2f, .b3f, .b4f{font-size:1px; overflow:hidden; display:block;} | ||
- | .b1f {height:1px; background:# | + | .b1f {height:1px; background:#e1e1e1; margin:0 5px;} |
- | .b2f {height:1px; background:# | + | .b2f {height:1px; background:#e1e1e1; margin:0 3px;} |
- | .b3f {height:1px; background:# | + | .b3f {height:1px; background:#e1e1e1; margin:0 2px;} |
- | .b4f {height:2px; background:# | + | .b4f {height:2px; background:#e1e1e1; margin:0 1px;} |
- | .content {background: # | + | .content {background: #e1e1e1;} |
.content div {margin-left: 5px;} | .content div {margin-left: 5px;} | ||
</style> | </style> | ||
Line 26: | Line 27: | ||
<div class="Outreach"> | <div class="Outreach"> | ||
<div style="height: 400; background:#FFFFFF; colorou line-height:100% padding: 3px 0px;"> | <div style="height: 400; background:#FFFFFF; colorou line-height:100% padding: 3px 0px;"> | ||
- | <h1>BioBytes Essential Gene List</h1> | + | <h1>Literature Review</h1> |
+ | |||
+ | <font size="2"> | ||
+ | <P> | ||
+ | Four primary literature sources were used to determine the essential genome. The genes reported in these sources were analyzed to construct a preliminary essential gene list.</P> | ||
+ | <b>Literature Gene List Data</b> | ||
+ | <TABLE BORDER> | ||
+ | <TR> | ||
+ | <TH>Essential Gene List from the literature</TH> | ||
+ | <TH>Method</TH> | ||
+ | <TH># of Genes considered essential</TH> | ||
+ | <TH>% of E. coli genes considered essential</TH> | ||
+ | <TH># of genes unique to that list</TH> | ||
+ | </TR> | ||
+ | <TR> | ||
+ | <TD>Baba et al. 2006</TD> | ||
+ | <TD>Single gene knockout </TD> | ||
+ | <TD>303</TD> | ||
+ | <TD> 6.4%</TD> | ||
+ | <TD> 36</TD> | ||
+ | </TR> | ||
+ | <TR> | ||
+ | <TD>Gerdes et al. 2003</TD> | ||
+ | <TD>Transposon insertions to inactivate single genes</TD> | ||
+ | <TD>617</TD> | ||
+ | <TD>13.0%</TD> | ||
+ | <TD> 379 </TD> | ||
+ | </TR> | ||
+ | <TR> | ||
+ | <TD>Gil et al. 2004</TD> | ||
+ | <TD>Gene conservation and literature review</TD> | ||
+ | <TD>203</TD> | ||
+ | <TD>4.3%</TD> | ||
+ | <TD>53</TD> | ||
+ | </TR> | ||
+ | <TR> | ||
+ | <TD>Profiling of E. coli Chromosome (PEC) database</TD> | ||
+ | <TD>Literature review</TD> | ||
+ | <TD>302</TD> | ||
+ | <TD>6.3%</TD> | ||
+ | <TD>126</TD> | ||
+ | </TR> | ||
+ | </TABLE> | ||
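<P>The percentage column can be checked with a short calculation. This is a minimal sketch, assuming the percentages are taken relative to the total of 4762 E. coli genes quoted in the statistics section of this page; the variable and function names are illustrative only.</P>

```python
# Total number of E. coli genes, as quoted in the statistics section of this page.
# Assumption: this is the denominator used for the percentage column.
TOTAL_GENES = 4762

# Number of genes each literature source considers essential (from the table).
essential_counts = {
    "Baba et al. 2006": 303,
    "Gerdes et al. 2003": 617,
    "Gil et al. 2004": 203,
    "PEC database": 302,
}


def percent_essential(count, total=TOTAL_GENES):
    """Return count as a percentage of total, rounded to one decimal place."""
    return round(100.0 * count / total, 1)


percentages = {
    source: percent_essential(n) for source, n in essential_counts.items()
}

for source, pct in percentages.items():
    print(f"{source}: {pct}%")
```

<P>Under that assumption the computed values agree with the table (6.4%, 13.0%, 4.3% and 6.3%).</P>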
+ | |||
+ | |||
+ | <P>The gene lists were determined by different methods, and their results show very little consistency. The number of genes varies greatly from source to source, and when the lists are compared to one another there is very little overlap, as shown below: </P> | ||
+ | |||
+ | <b>Venn Diagram of the Number of Essential Genes Shared Between Lists in the Literature</b> | ||
+ | |||
+ | <center><img src="https://static.igem.org/mediawiki/2009/a/a1/Uofa_Venn_of_literature.png" width="450" height="450"></center> | ||
+ | |||
+ | <P>The largest overlap between any two literature lists is 205 genes, between Baba et al. 2006 and Gerdes et al. 2003. Only 48 genes were present in all four lists. This lack of consensus means that no single literature list can be relied on to define an essential genome. Still, the lists provide an important foundation for the basic components required in an essential gene list.</P> | ||
+ | |||
+ | |||
+ | </font></div> | ||
+ | |||
+ | </div></div> | ||
+ | <b class="b4f"></b><b class="b3f"></b><b class="b2f"></b><b class="b1f"></b> | ||
+ | </td> | ||
+ | </tr> | ||
+ | |||
+ | <tr> | ||
+ | <td style="height: 400; padding-left: 10px; padding-right: 10px; padding-top: 11px;"> | ||
+ | <b class="b1f"></b><b class="b2f"></b><b class="b3f"></b><b class="b4f"></b> | ||
+ | <div class="Presentations"> | ||
+ | <div style="height: 400; background:#FFFFFF; colorou line-height:100% padding: 3px 0px;"> | ||
+ | <h1>Constructing the BioBytes Preliminary Essential Gene List</h1> | ||
<!-- <div align="justify" style="padding-left:20px; padding-right:20px"> --> | <!-- <div align="justify" style="padding-left:20px; padding-right:20px"> --> | ||
Line 32: | Line 99: | ||
<font size="2"> | <font size="2"> | ||
- | |||
+ | <p> The preliminary essential gene list is based on literature sources. As described in the modeling section of this wiki, the metabolic genes from this preliminary list were used as a starting point for the computer model and were greatly altered based on the model's suggestions. The following criteria were used for selecting genes from a literature source: | ||
+ | |||
+ | <ul> | ||
+ | <li>Genes must be present in more than one literature list unless there is a particular reason to suspect they are essential.</li> | ||
+ | <li>The BioBytes metabolism is modeled after the minimal metabolism proposed by Gil et al. 2004, with the addition of cell wall, fatty acid, heme and ubiquinone synthesis, which Gil et al. omitted on the assumption that they would not be necessary in a Mycoplasma-like minimal cell.</li> | ||
+ | <li>Additional genes required for metabolism were selected based on pathway information in the EcoCyc database. Pathway redundancy is likely why these genes do not appear essential in Baba, Gerdes and PEC.</li> | ||
+ | <li>Antitoxin genes are not essential as toxin genes would not be present.</li> | ||
+ | </ul> | ||
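<P>The first and last criteria above can be expressed as a simple filter over the literature lists. The following is a minimal sketch, not the procedure actually used by the team: the function name, its parameters and the placeholder gene names are all hypothetical, and the toy lists do not reflect the real contents of any source.</P>

```python
from collections import Counter


def select_candidates(literature_lists, manual_additions=(), excluded=()):
    """Keep genes found in at least two lists, then apply manual overrides.

    literature_lists: dict mapping a source name to a set of gene names.
    manual_additions: genes kept despite appearing in fewer than two lists
                      (a particular reason to suspect they are essential).
    excluded:         genes dropped regardless of support (e.g. antitoxins).
    """
    # Count in how many lists each gene appears.
    support = Counter(
        gene for genes in literature_lists.values() for gene in set(genes)
    )
    selected = {gene for gene, n in support.items() if n >= 2}
    selected |= set(manual_additions)
    selected -= set(excluded)
    return selected


# Toy data with placeholder names (hypothetical, for illustration only).
example_lists = {
    "Baba": {"geneA", "geneB", "geneC", "antitoxinX"},
    "Gerdes": {"geneA", "geneB", "geneD"},
    "Gil": {"geneA", "geneC", "geneE"},
    "PEC": {"geneA", "geneB", "antitoxinX"},
}

chosen = select_candidates(
    example_lists,
    manual_additions={"geneE"},  # supported by one list, added by hand
    excluded={"antitoxinX"},     # antitoxin, not needed without its toxin
)
print(sorted(chosen))
```

<P>Keeping the manual additions and exclusions as explicit arguments makes every departure from the "more than one list" rule visible and easy to review.</P>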
+ | <p> The basic functional groups of the selected genes are shown in the next chart, followed by a detailed list of the processes that were included.</p> | ||
+ | <b>Essential Gene Functions</b> | ||
+ | <center><img src="https://static.igem.org/mediawiki/2009/c/c4/Uofa_Function_chart.png"></center> | ||
+ | |||
+ | <h3>Genes for the following processes were included:</h3> | ||
+ | <ul> | ||
+ | <li>DNA replication and cell division, but no DNA repair</li> | ||
+ | <li>Chaperones, but no heat shock or membrane stress response system</li> | ||
+ | <li>Transcription</li> | ||
+ | <li>Translation</li> | ||
+ | <li>Glycolysis</li> | ||
+ | <li>Proton motive force (PMF) generation via ATP synthase, which consumes ATP to export protons</li> | ||
+ | <li>Synthesis of acetyl-CoA from pyruvate</li> | ||
+ | <li>Fatty acid synthesis</li> | ||
+ | <li>Methylerythritol pathway (for undecaprenyl phosphate and a ubiquinone side chain)</li> | ||
+ | <li>Synthesis of phosphatidylethanolamine, but no other phospholipids</li> | ||
+ | <li>Pentose phosphate pathway (converts 6- or 3-carbon sugars to 5-carbon sugars, such as those needed in nucleotide biosynthesis)</li> | ||
+ | <li>Lipoprotein synthesis (Int and lolB are lipoproteins and essential)</li> | ||
+ | <li>Synthesis of nucleotides (ribonucleotides and deoxyribonucleotides) from nucleosides</li> | ||
+ | <li>Attaching lipid and biotin groups to protein</li> | ||
+ | <li>Transport:</li> | ||
+ | <ul> | ||
+ | <li>PTS (phosphotransferase system) transport (imports and phosphorylates glucose)</li> | ||
+ | <li>Inorganic phosphate transport</li> | ||
+ | <li>Nucleoside transport</li> | ||
+ | <li>Sec system (exports proteins to the periplasm), including SRP for cotranslational membrane insertion. The secB chaperone does not appear essential. There is NO Tat system, which would be used to export cofactor-containing folded proteins.</li> | ||
+ | <li>Lipoprotein transport to the outer membrane</li> | ||
+ | <li>Glutathione transport </li> | ||
+ | </ul> | ||
+ | <li>Cofactor synthesis: </li> | ||
+ | <ul> | ||
+ | <li>Riboflavin from GTP and ribulose-5-phosphate </li> | ||
+ | <li>FAD from riboflavin</li> | ||
+ | <li>NAD from nicotinamide</li> | ||
+ | <li>NADPH from NAD</li> | ||
+ | <li>CoA from pantothenic acid</li> | ||
+ | <li>Methylenetetrahydrofolate (mTHF) from folic acid</li> | ||
+ | <li>S-adenosylmethionine (SAM) from methionine</li> | ||
+ | <li>Thiamine diphosphate (TPP) from thiamine</li> | ||
+ | <li>Pyridoxal-5-phosphate (PLP) from pyridoxal </li> | ||
+ | <li>Heme from glutamate </li> | ||
+ | <li>Ubiquinone </li> | ||
+ | </ul> | ||
+ | </ul> | ||
+ | |||
+ | <h3>RNAs:</h3> | ||
+ | <P>The rrnC operon supplies all the rRNAs and three of the tRNAs. This operon was selected because it includes the greatest number of tRNAs. To select the other tRNAs, all tRNAs listed as essential in PEC were first included. One tRNA was then selected for each anticodon that differed in one of the last two bases. Differences in the first anticodon base can be accommodated by anticodon 'wobble'. At least one tRNA was included for each amino acid. </P> | ||
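<P>The tRNA selection rule described above can be sketched as follows. This is an illustrative reading of the rule, not the team's actual script: tRNAs are grouped by amino acid together with the last two anticodon bases (grouping by amino acid is an assumption added here so that tRNAs for different amino acids are never merged), required tRNAs are always kept, and one further tRNA is taken for each group not yet covered. The tRNA names and the function name are hypothetical.</P>

```python
def pick_trnas(trnas, required=()):
    """Select tRNAs so that each (amino acid, last two anticodon bases)
    group is represented, always keeping the required tRNAs.

    trnas:    list of (name, amino_acid, anticodon) tuples, with the
              anticodon written 5'->3' so that its first base is the
              wobble position.
    required: names that must be kept (e.g. those listed as essential).
    """
    required = set(required)
    selected = []
    covered = set()

    # Pass 1: keep every required tRNA and record the groups they cover.
    for name, amino_acid, anticodon in trnas:
        if name in required:
            selected.append(name)
            covered.add((amino_acid, anticodon[1:]))

    # Pass 2: add one tRNA for each group that is still uncovered.
    for name, amino_acid, anticodon in trnas:
        key = (amino_acid, anticodon[1:])
        if name not in required and key not in covered:
            selected.append(name)
            covered.add(key)

    return selected


# Toy input with hypothetical tRNA names.
example_trnas = [
    ("tRNA-Phe-1", "Phe", "GAA"),
    ("tRNA-Leu-1", "Leu", "CAA"),
    ("tRNA-Leu-2", "Leu", "UAA"),
    ("tRNA-Leu-3", "Leu", "CAG"),
    ("tRNA-Leu-4", "Leu", "GAG"),
]

picked = pick_trnas(example_trnas, required={"tRNA-Leu-2"})
print(picked)
```

<P>In this toy input, tRNA-Leu-1 and tRNA-Leu-4 are skipped because each differs from an already selected tRNA only in the first (wobble) anticodon base.</P>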
+ | |||
+ | |||
+ | <P>The complete list of essential RNAs can be found <a href="https://2009.igem.org/Image:Uofa_RNAs_essential.xls"> here </a>. </P> | ||
+ | |||
+ | |||
+ | </font></div> | ||
+ | |||
+ | </div></div> | ||
+ | <b class="b4f"></b><b class="b3f"></b><b class="b2f"></b><b class="b1f"></b> | ||
+ | </td> | ||
+ | </tr> | ||
+ | |||
+ | |||
+ | <tr> | ||
+ | <td style="height: 400; padding-left: 10px; padding-right: 10px; padding-top: 11px;"> | ||
+ | <b class="b1f"></b><b class="b2f"></b><b class="b3f"></b><b class="b4f"></b> | ||
+ | <div class="Survey"> | ||
+ | <div style="height: 400; background:#FFFFFF; line-height:100% padding: 3px 0px;"> | ||
+ | <h1>Statistics on BioBytes Preliminary Essential Gene List</h1> | ||
+ | |||
+ | <!-- <div align="justify" style="padding-left:20px; padding-right:20px"> --> | ||
+ | <div align="justify"> | ||
+ | |||
+ | <font size="2"> | ||
+ | |||
+ | <P> Total genes in E. coli: 4762 </P> | ||
+ | <P> Total protein coding genes in BioBytes preliminary essentials list: 332 </P> | ||
+ | <P> Total number of RNA genes in BioBytes preliminary essentials list: 29 </P> | ||
+ | |||
+ | |||
+ | </font></div> | ||
+ | |||
+ | </div></div> | ||
+ | <b class="b4f"></b><b class="b3f"></b><b class="b2f"></b><b class="b1f"></b> | ||
+ | </td> | ||
+ | </tr> | ||
+ | |||
+ | <tr> | ||
+ | <td style="height: 400; padding-left: 10px; padding-right: 10px; padding-top: 11px;"> | ||
+ | <b class="b1f"></b><b class="b2f"></b><b class="b3f"></b><b class="b4f"></b> | ||
+ | <div class="Outreach"> | ||
+ | <div style="height: 400; background:#FFFFFF; colorou line-height:100% padding: 3px 0px;"> | ||
+ | <h1>Modeling Genes</h1> | ||
+ | |||
+ | <font size="2"> | ||
+ | <P> The selection of individual genes by the model is described under the Modeling tab of the Bioinformatics section. The model identified 116 additional genes as essential. Because the model contains only the metabolic genes of the <i>E. coli</i> MG1655 genome, all other types of genes were selected solely from literature sources. Many of the model's additions reflect the complex nature of metabolic pathways: deleting a single gene and testing whether the organism remains viable is not sufficient to establish essentiality. Genes often act in complexes, or become essential only when other genes are deleted (for example, in redundant processes where two genes fulfill the same essential function). The modeling work fills these gaps by identifying genes that are required for life but are missed by single-gene studies. Many of the added genes function in the transport of small metabolic compounds. Although some new pathways were added, most of the added genes belong to pathways already identified as essential in the literature search. This suggests that our preliminary list contained many of the correct pathways, but that further analysis was required to identify all of the essential genes in them.</P> | ||
+ | |||
+ | <tr> | ||
+ | <td style="height: 400; padding-left: 10px; padding-right: 10px; padding-top: 11px;"> | ||
+ | <b class="b1f"></b><b class="b2f"></b><b class="b3f"></b><b class="b4f"></b> | ||
+ | <div class="Outreach"> | ||
+ | <div style="height: 400; background:#FFFFFF; colorou line-height:100% padding: 3px 0px;"> | ||
+ | <h1>BioBytes Final Essential Gene List</h1> | ||
+ | |||
+ | <!-- <div align="justify" style="padding-left:20px; padding-right:20px"> --> | ||
+ | <div align="justify"> | ||
+ | |||
+ | <font size="2"> | ||
+ | <b>A final list of essential genes was produced from the literature review and the computer model. </b> | ||
+ | |||
+ | </font></div> | ||
<P>Number of genes in list created from literature: 332 </P> | <P>Number of genes in list created from literature: 332 </P> | ||
- | <P>Number of additional genes suggested by model: | + | <P>Number of additional genes suggested by model: 116 </P> |
<P>Number of genes in final essential genes list: 448 </P> | <P>Number of genes in final essential genes list: 448 </P> | ||
<P>Number of genes in our essential gene list not classified as essential in the literature: 117 | <P>Number of genes in our essential gene list not classified as essential in the literature: 117 | ||
</P> | </P> | ||
+ | <h3> To view the complete list of Literature Genes click <a href="https://2009.igem.org/Image:UofA_LiteratureSearchGenes.xls"> here </a>. To view the complete list of Metabolic Model genes, click <a href="https://2009.igem.org/Image:UofA_ModelingGenes.xls"> here </a>.</h3> | ||
+ | <p>Additionally, the University of Lethbridge team has constructed a series of diagrams visualizing some of the essential metabolic genes in our list. Please click <a href="https://2009.igem.org/Team:Lethbridge/Collaboration"> here </a> to see these figures. When our lists are compared to the literature lists of essential genes, the overlap is limited. In fact, our BioBytes Essential Gene List differs by 40%. </p> | ||
+ | <b>Correlation of BioBytes Essential Gene List to Literature Lists</b> | ||
+ | <center><img src="https://static.igem.org/mediawiki/2009/a/a6/Uofa_Essential_genes_uniqueness.png"></center> | ||
+ | <b>Number of Genes Found in Common in Literature and BioBytes Essential Gene Lists</b> | ||
+ | <center><img src="https://static.igem.org/mediawiki/2009/8/83/Uofa_Lists_pie_chart.png"></center> | ||
+ | <p>We expect this list to give a much greater chance of success in producing a minimal genome than many of the sources that are presently available. With this list completed, our BioBytes approach can be used to assemble these genes into constructs and eventually produce the genome. Together, our modeling work and BioBytes serve as a genome construction toolkit which anyone can use.</p> | ||
+ | </div></div> | ||
+ | <b class="b4f"></b><b class="b3f"></b><b class="b2f"></b><b class="b1f"></b> | ||
+ | </td> | ||
+ | </tr> | ||
+ | <tr> | ||
+ | <td style="height: 400; padding-left: 10px; padding-right: 10px; padding-top: 11px;"> | ||
+ | <b class="b1f"></b><b class="b2f"></b><b class="b3f"></b><b class="b4f"></b> | ||
+ | <div class="Outreach"> | ||
+ | <div style="height: 400; background:#FFFFFF; colorou line-height:100% padding: 3px 0px;"> | ||
+ | <h1>Standardization of Gene Regulation Components</h1> | ||
- | </font></div> | + | <!-- <div align="justify" style="padding-left:20px; padding-right:20px"> --> |
+ | <div align="justify"> | ||
+ | |||
+ | <font size="2"> | ||
+ | <p>In order to produce a well-characterized and standardized minimal genome, several components have been standardized, including promoters, terminators, and ribosome binding sites (RBS). These have been incorporated into the BioBytes system either as individual parts (promoters and terminators) or as components of our unique plasmids pAB and pBA (the RBS). Microarray data were also used to identify the relative amount of transcript produced by each essential gene, and therefore which promoter to pair with each gene.</p> | ||
+ | <p align=right><a href="https://2009.igem.org/Team:Alberta/Project/Promoters_&_Terminators"> Click here for more...</a> </P> | ||
+ | </div></div> | ||
+ | <b class="b4f"></b><b class="b3f"></b><b class="b2f"></b><b class="b1f"></b> | ||
+ | </td> | ||
+ | </tr> | ||
+ | <tr> | ||
+ | <td style="height: 400; padding-left: 10px; padding-right: 10px; padding-top: 11px;"> | ||
+ | <b class="b1f"></b><b class="b2f"></b><b class="b3f"></b><b class="b4f"></b> | ||
+ | <div class="Outreach"> | ||
+ | <div style="height: 400; background:#FFFFFF; colorou line-height:100% padding: 3px 0px;"> | ||
+ | <h1>Incorporation Into pAB/pBA</h1> | ||
+ | |||
+ | <!-- <div align="justify" style="padding-left:20px; padding-right:20px"> --> | ||
+ | <div align="justify"> | ||
+ | |||
+ | <font size="2"> | ||
+ | <p>To build the minimal genome, each gene must first be amplified. PCR was used to produce genes with distinct ends, allowing insertion into the pAB or pBA plasmids used in genome construction. 188 of the PCR primers have been tested and added to the parts registry (please see the Achievements section for the parts list). </p> | ||
+ | <p align=right><a href="https://2009.igem.org/Team:Alberta/Project/Primer_Design"> Click here for more...</a> </P> | ||
+ | </div></div> | ||
+ | <b class="b4f"></b><b class="b3f"></b><b class="b2f"></b><b class="b1f"></b> | ||
+ | </td> | ||
+ | </tr> | ||
+ | </table> | ||
+ | </div> | ||
+ | </HTML> |
Latest revision as of 22:11, 20 October 2009