From 2009.igem.org

(Difference between revisions)

Latest revision as of 01:43, 22 October 2009

Notebook HEARTBEAT

Welcome to the notebook of the HEARTBEAT (Heidelberg Artificial Transcription Factor Binding Sites Assembly and Engineering Tool) project. This notebook comprises the work on three sublanes: HEARTBEAT database (DB), HEARTBEAT graphical user interface (GUI) and HEARTBEAT fuzzy modeling (FN) as well as some additional work on logo as well as wiki design. Have fun!

July

7-27-2009

Meeting with Oliver Pelz
- Discuss general ideas of our Database Structure and Content
- An introduction into PromoterSweep (LINK). PromoterSweep screens a given sequence for conserved regions giving us consensus sequences and moreover screens them for TFBS by using database search (TRANSFAC, Jasper) (LINK)
- Our new database should contain following informations: promoter sequence, TFs, TFBS, position of TFBS, number of binding TFBS, "host organism"
- We decide to choose MySQL as a appropiate language solving this challenge which allows us also a graphical representation of the database on the web later.
- GUI on wiki: which language? php? javascript?
- Problems: access to PromoterSweep (Husar Bioinformatics Group, DKFZ), choice of Promoter Database (DoOP, UCSC, EnsEMBL) (LINK)

aim: create database until end of August

@@ Line 6: / Line 6: @@
 |-valign="top" border="0" style="margin-left: 2px;"
 |width="650px" style="padding: 0 15px 15px 20px; background-color:#ede8e2"|
-='''Notebook HEARTBEAT'''=
+= Notebook HEARTBEAT =
 Welcome to the notebook of the HEARTBEAT (Heidelberg Artificial Transcription Factor Binding Sites Assembly and Engineering Tool) project. This notebook comprises the work on three sublanes: HEARTBEAT database (DB), HEARTBEAT graphical user interface (GUI) and HEARTBEAT fuzzy modeling (FN) as well as some additional work on logo as well as wiki design. Have fun!
 =='''Contents'''==
-* [[Team:Heidelberg/Notebook_modeling#July| July]]
+* <span style="font-size:5mm;">[[Team:Heidelberg/Notebook_modeling#July| July]]</span>
-* [[Team:Heidelberg/Notebook_modeling#August| August]]
+* <span style="font-size:5mm;">[[Team:Heidelberg/Notebook_modeling#August| August]]</span>
-* [[Team:Heidelberg/Notebook_modeling#September| September]]
+* <span style="font-size:5mm;">[[Team:Heidelberg/Notebook_modeling#September| September]]</span>
-* [[Team:Heidelberg/Notebook_modeling#October| October]]
+* <span style="font-size:5mm;">[[Team:Heidelberg/Notebook_modeling#October| October]]</span>
 == July ==
@@ Line 140: / Line 139: @@
 * Start working with MySQL
 * request UNIX/HUSAR/HPC access at DKFZ (Nao)
-* first contact with several databases: EmsEMBL, Compara, cisRED, DoOP, TiProD, contra (LINKS)
+* first contact with several databases: EmsEMBL, Compara, cisRED, DoOP, TiProD, contra
 === 8-6-2009 ===
@@ Line 165: / Line 164: @@
 * installation of R, R editor and perl editor
 * further configuration of our internal server / mediawiki
+* writing first perl program - "Hi there"
 [[https://2009.igem.org/Team:Heidelberg/Notebook_modeling TOP]]
@@ Line 185: / Line 185: @@
 *** '''idea 1''': ''error checking'' of affinity: compare expectation to experimental results and figure out where the error is hiding
 *** '''idea 2''': ''create&visualize fancy and fuzzy data from ''in silico'' simulation
-** combine: promoter, output and graphic representation ('''GRAFIK'''!)
+** combine: promoter, output and graphic representation
 ** next meeting with Marti: end of next week.
@@ Line 200: / Line 200: @@
 * trying out some DoOP queries
-* download fasta sequences from UCSC gene browser (LINK)
+* download fasta sequences from UCSC gene browser
 * mapping of NCBI Entrez Gene IDs with RefSeq IDs
 * configure perl working environment on Windows XP
-* contact Endre Sebestyen concerning the perl module Bio-DoOP-DoOP (LINK)
+* contact Endre Sebestyen concerning the perl module Bio-DoOP-DoOP
 === 8-14-2009 ===
+* parse UCSC fasta sequences according to our selection
+* write parsed sequences into multifasta format
 * start PromoterSweep Analysis over Weekend
@@ Line 223: / Line 226: @@
 ** MAY contain: position, heartbeat, prediction, assembly, eukaryotes
 ** and still more keywords to come
+*establishing local@host access to mysql
 === 8-19-2009 ===
-* parse Promotersweep xml file into tab-separated text file ('''PERL CODE'''?)
+* parse Promotersweep xml file into tab-separated text file
 ** the text file should contain: RefSeq ID, TF name, TFBS position, TF motif sequence, TFBS Quality, TSS, Entrez ID, EnsEMBL ID, further gene description.
 ** this provided us with several programming problems concerning working with multiple arrays, hashes and their combinations (arrays of hashes, hashes of hashes, etc.) thus
 * studying structure and basic concepts of hash & key
+*including parsed data into mysql database
 === 8-20-2009 ===
@@ Line 293: / Line 298: @@
 * Meeting with Daniela (Nao): ''Cell Profiler'' for capturing biological images & data analysis based on MATLAB
-* working with R module '''RMySQL''' (LINK) for using the pipeline between R and MySQL
+* working with R module '''RMySQL''' for using the pipeline between R and MySQL
 * create a list of useful RMySQL commands
@@ Line 313: / Line 318: @@
 ** difference in consensus sequences (also ask Anna-Lena)
 ** different PWM types (vertebrates, plant, insect, fungi, bacteria, nematodes...)
-** positive control: when histograms are generated and plotted, check distribution of Sp1 (LINK)
+** positive control: when histograms are generated and plotted, check distribution of Sp1
 * so far we have 3640 promoter sequences "sweeped"!
+*access from R to mysql at the local@host server established
 === 8-28-2009 ===
+* dealing with perl - introduce transition of variables between perl and R
 [[https://2009.igem.org/Team:Heidelberg/Notebook_modeling TOP]]
@@ Line 389: / Line 397: @@
 * derive transcription factor data using R and MySQL
 * plot HEARTBEAT TF hit distribution as histograms & density functions for different PWM subsets (all, vertebrates only, single matrices and joined TFs)
+*further completion of the database
 === 9-2-2009 ===
@@ Line 419: / Line 428: @@
 === 9-4-2009 ===
 * work with Transfac PWM: structure, description, and using consensus sequence
-* write script for generating consensus sequence based on Transfac PWM and replacing ambiguity code with A, C, G or T <pre>Getconsensus.pl, MakeConsensus.pl</pre> '''(CODE)?'''
+* write script to get the ID's and frequencies for all co-occuring TFBS of VDR and SREBP
+* write script for generating consensus sequence based on Transfac PWM and replacing ambiguity code with A, C, G or T <pre>Getconsensus.pl, MakeConsensus.pl</pre>
 * Wiki Meeting (Nao)
@@ Line 436: / Line 446: @@
 * aim: creation of an automatic designing tool for synthetic promoters which include sequence design, transfac search as well as filling the sequence up with spacer sequences.
 [[https://2009.igem.org/Team:Heidelberg/Notebook_modeling TOP]]
 === 9-7-2009 ===
-* check designed sequences for restriction sites <pre>CheckRestrictionsites.pl</pre> '''CODE'''?
+* check designed sequences for restriction sites <pre>CheckRestrictionsites.pl</pre>
 * finish creating sequences
 * consider CMV core promoter into the calculation of the relative position of TFBS to the TSS
@@ Line 446: / Line 457: @@
 ** pure TFBS free sequence
 ** sequences with TFBS at minima of the density function
+* checking for all sequences for further binding sites with the Transfac match tool
 === 9-8-2009 ===
@@ Line 462: / Line 474: @@
 * Phone conference with Kai Ludwig, Logo & Web Design (Nao)
-* official Team Meeting '''(LINK)'''
+* official Team Meeting
 * wiki closure on Oct 21st!
 === 9-9-2009 ===
+* modify synthetic promoter sequences to be ready for ordering
+* Sweep more promoter sequences using Promotersweep
+* start Modeling
+* revise and improve HEARTBEAT
+* discuss differences between PWMs
+=== 9-10-2009 ===
+* still modifying synthetic sequences to be ready for shipping
+* we have altogether 25 designed promoter sequences!
+=== 9-11-2009 ===
+* Software Meeting (Stephen, Tim, Nao)
+** compartibility with mediawiki: HTML, perl, php, R, java?
+** GUI design
+*** simple interface: single TF, auxiliary TFs, #TFBS, sequence length
+*** "interactive": multiple TF, choosing auxiliary TFs, additional information (see [[Team:Heidelberg/Eukaryopedia|Eukaryopedia]]), density function plot & histogram
+*** "hyper-interactive" step-by-step design & creation
+* Modeling Meeting with Marti and Anna-Lena (Tim, Nao)
+** aim: fancy visualization to show expectation & prediction providing pathway insights
+** TODO/QUESTIONS
+*** what is the stimulus? collect possible inputs!
+*** measurable outcome: experiments & pathways
+*** quality of synthetic sequence: error checking
+**** we need to define the '''quality''' of our sequences
+** LEVELS of modeling
+*** (1) DNA (2) expression/transcriptional activity (3) output
+*** each with corresponding measurement
+* general modeling scheme: input - "What we are affecting" - possible outcomes
+* how? We use fuzzy logic
+[[https://2009.igem.org/Team:Heidelberg/Notebook_modeling TOP]]
+=== 9-14-2009 ===
+* collect input for inducing the system (e.g. p53: CPT, Pifithrin-alpha; NFkB: TNF-alpha etc.)
+* phone conference with Kai Ludwig
+* learn how to include Perl code into html code
+** learn how to use embperl
+** configure apache2 server such that embperl can be interpreted
+** try to make offline use of embperl working
+*try to find nice html editor for ubuntu - (seamonkey, Amaya)
+=== 9-15-2009 ===
+* create network picture for meeting tomorrow
+* Logo discussion
+* Read paper: ''Fuzzy Logic Modeling of Signaling Networks'' (Aldridge 2009)
+* learn data management of virtual server
+* get an overview about the apache2 file and security system
+=== 9-16-2009 ===
+* Modeling Meeting with Marti (Douaa, Tim, Nao)
+** update on available drugs/sequences
+** decide what to model: (A) error checking, and (B) differential expression?
+** use natural promoters to build up model for prediction of activity of synthetic promoters
+** Discussion of TF score
+*** Transfac sequence alignment score
+*** promotersweep binding site quality
+*** relative position to TSS: How?
+**** (A) peak width & amplitude, (B) distance to maximal peak & position, (C) number of PEAK, (D) "sliding window" and calculate area under curve, (E) #TFBS (also for comparison of different synthetic promoters)
+*** biophysical affinity using TRAP
+** first model: build up either on CMV or on JeT
+** potential: integrate many stimuli -> find out crosstalks of pathways?
+* TODO (meeting)
+** collect data
+** define WHAT we want to model
+** summarize available sequences
+** try to formulate IF ... THEN "sentences"
+** check MATLAB & MATLAB Fuzzy Logic Toolbox availability
+=== 9-17-2009 ===
+* internal Team Meeting
+* find error.log files on the server and learn how to use it
+=== 9-18-2009 ===
+[[https://2009.igem.org/Team:Heidelberg/Notebook_modeling TOP]]
+=== 9-20-2009 ===
+* learn how to use tag language of embperl
+** learn how to write loops with embperl
+** access of input variables in embperl -- using the %fdat hash
+=== 9-21-2009 ===
+* struggling with how to use R from embperl
+=== 9-22-2009 ===
+* Wiki Meeting (Dani, Cori, Nao)
+** install image processing tool
+** design wiki, brainstorming for possible navigation bars
+* Wiki Phone Meeting with Kai Ludwig (Nao)
+** design header & presentation-master as well as team shirts
+* Seminar: '''Martijn Luijsterburg''' (Karolinska Institute) - ''Heterochromatin Protein 1 is involved in the DNA damage response''. Host: Thomas Höfer, Bioquant
+=== 9-23-2009 ===
+* Modeling Meeting with Marti, Anna-Lena (Tim, Nao)
+** contact database group (TP3)
+** statistics: characterizing peaks
+*** we go for area under the curve and affinity. optionally we can choose Transfac sequence score and peak height & width
+** strategy to convince the wetlab people from the importance of modeling during the meeting on upcoming friday.
+** MATLAB license?
+** logical gates: try to start creating model topology after Friday
+* Presentation: '''Marti Bernado Faura''' (Bioquant, University of Heidelberg): ''Data-driven Fuzzy Logic modeling of Programmed Cell Death''
+** intro into fuzzy logic
+** system development & work flow of fuzzy logic
+** fuzzy inference & model prediction
+** model types: MISO / MIMO
+* Wrap-up meeting: Team HEARTBEAT (Tim, Nao)
+** split up computational work into three tracks: HEARTBEAT DB, HEARTBEAT GUI and modeling
+*** database: documentation (until Oct 18), peak characterization, calculate absolute density function
+*** GUI: based on ''embperl'', design according to our new wiki
+*** modeling: MATLAB license, collect sequences & input data, develop network model, include pathways
+* literature work
+=== 9-24-2009 ===
+* prepare slides for meeting tomorrow
+* pathway search: TNF-alpha/NFkB, VDR, SREBP and crosstalks. NFkB has a lot of pathway crosstalks, while SREBP and VDR show a interesting connection. Upon induction, SREBP activates VDR.
+=== 9-25-2009 ===
+* Team Meeting (Wetlab, Nao)
+** short progress report of all of us
+** modeling: discussing scheme, modeling elements and strategies
+[[https://2009.igem.org/Team:Heidelberg/Notebook_modeling TOP]]
+=== 9-28-2009 ===
+* Wiki Phone Meeting with Kai Ludwig (Nao)
+=== 9-29-2009 ===
+* designed synthetic promoters (HB_0001 - HB_0025) will be joined to CMV core promoter since JeT core promoter contains a Sp1 site in it. All other sequences (random synthesized, e.g.) are coupled with JeT core promoter.
+* literature studies on combinatorial ''cis''-regulation as well as on modelig of the lambda-switch
+* prepare slides for the next modeling meeting
+=== 9-30-2009 ===
+* Wiki Meeting (Dani, Nao)
+* MATLAB license order (Jens)
+* postpone Yara meeting (Wetlab, Tim)
+* got sequences from Lars
+* got qRT-PCR setup from Chenchen
+* Modeling Meeting with Marti & Anna-Lena (Tim, Nao)
+** still need to collect FACS and microscopy results
+** discuss our network prediction model using TNF-alpha as aｎ example
+** maybe we can use the lambda switch paper as a good starting point for our modeling
 [[https://2009.igem.org/Team:Heidelberg/Notebook_modeling TOP]]
@@ Line 531: / Line 695: @@
 === 10-1-2009 ===
+* Wiki & Presentation Meeting with Dani (Nao)
 === 10-2-2009 ===
+* some wiki work
 [[https://2009.igem.org/Team:Heidelberg/Notebook_modeling TOP]]
@@ Line 539: / Line 705: @@
 === 10-6-2009 ===
+* Internal Team Meeting
+** check out number and measurement plans of randomly assembled synthetic promoters (5x NFkB, 5x p53, 2x pPARg, 2x SREBP)
+* Wiki Meeting (Corinna, Daniela, Nao)
+** discuss design of the top page and possible features
+** try out CSS design
 === 10-7-2009 ===
+* Wiki Design (Nao)
+* Wiki Phone Meeting with Kai Ludwig (Nao)
+* MATLAB has arrived!
+* literature work
+* Wetlab Meeting: progress report on measurement of random assembled synthetic promoters
+* make thoughts about the whole storyboard of our presentation at the jamboree
 === 10-8-2009 ===
+* Short Meeting with Roland
+* image processing work for wiki
 === 10-9-2009 ===
@@ Line 550: / Line 732: @@
 === 10-13-2009 ===
+* Measurement discussion with Lars: REU/RMPU, defining equations for mammalian systems
+* literature work on PoPS paper (Kelly JR et al.) and apply their equations
+* Marti Modeling Meeting (Anna-Lena, Tim, Nao)
+** Journal Club (Tim, Nao)
+** summary of meeting from Team Meeting from last thursday
+* Marti: start modeling using MATLAB and Fuzzy Logic Toolbox (FLT), playing around with FLT and tutorial
+=== 10-14-2009 ===
+Nao
+* develop first test fuzzy inference system (FIS) for testing
+* Marti Modeling Meeting, specify model topology
+* collect data: FACS (Cori), Microscopy (Hannah), Sequence & TECAN (Lars)
+* start calculating position score using R
+* translating project abstract
+=== 10-15-2009 ===
+Nao
+* calculate affinity score using TRAP (Anna-Lena)
+* collect ideas for integrating TFwise scores in order to calculate final position/affinity score for one sequence: median, mean, maximum, weighted mean?
+* all data analysis is stored in three sheets (SequenceAnalysis, ResultSummary and CalculateTRAP)
+* from now on we concentrate on FACS measurements because they are the most reliable ones (TECAN used only for scanning)
+* fill up TRAP data with missing transcription factors
+=== 10-16-2009 ===
+Nao
+* Anna-Lena Meeting: discuss how to integrate sequence scores
+* get & check p53, pPARg and random SREBP sequences
+* go through FACS results
+* add HEARTBEAT sequences for data analysis
+* modeling documentation
+* parsing experimental setups for modeling use
+* Chenchen qRT-PCR results
+* define possible modeling layers
+** first layer: input
+*** drug type, pathway, drug mode of action, drug concentration, targeted cells, incubation time
+*** sequence type, position score, affinity score
+*** we choose position & affinity score, sequence type and the presence of stimulation. Time as well as different concentration (unfortunately no data available) can be added in future
+** second layer: promoters
+*** 6 constitutives, 3 standards, 6 inducible available
+*** data analysis narrows this to 5 constitutives, 3 standards and 4 inducible
+*** HEARTBEAT sequences have to be measured a.s.a.p.
+* Marti Modeling Meeting
+** try to define some fuzzy rules
+** we assume better binding -> better expression
+** define membership functions
+** start modeling with NFkB results
+All
+* Internal Team Meeting
+** reminder: wiki task, wiki to do
+* Official Team Meeting
+=== 10-17-2009 ===
+* final decision: we go for maximum of position and affinity score
+* added HB sequences for data analysis table; as soon as results are there we can model designed synthetic promoters
+* define shape of membership functions
+* literature search for missing activity values?
+* still TODO: check out p53 results since the p53-NFkB crosstalk is really interesting!
+=== 10-18-2009 ===
+Nao
+* SREBP/VDR paper arrived
+* finish data analysis
+* study & playing around with MATLAB FLT, programming from both FLT GUI and MATLAB command line
+* define our work to be (i) error checking and (ii) exclusive pathway modeling
+* high potential of this model lies in its plug'n'play structrue, with a high capacity of integrating more inputs, outputs and also the middle layer (promoter diversity)
+[[https://2009.igem.org/Team:Heidelberg/Notebook_modeling TOP]]
+=== 10-19-2009 ===
+Nao
+* define final network structure
+* wiki work
+* reading RFC documentation and correction
+* we call this project HEARTBEAT fuzzy network (FN)
+* HB FN documentation and first results!
+** creating two fuzzy controllers: inducible NFkB and constitutive
+* how do we integrate the data? combine via Simulink!
+=== 10-20-2009 ===
+* Creating, developing, integrating and combining fuzzy network modeling (MATLAB, Simulink)
+* first analysis of HB sequences
+* HEARTBEAT FN documentation
+=== 10-21-2009 ===
+=== 10-22-2009 ===
+* FROZEN WIKI!!!!
 [[https://2009.igem.org/Team:Heidelberg/Notebook_modeling TOP]]
-|width="250px" style="background-color:#d8d5d0"|
+|width="250px" style="padding: 0 20px 15px 15px; background-color:#d8d5d0"|
 |}

Team:Heidelberg/Notebook modeling

From 2009.igem.org

Latest revision as of 01:43, 22 October 2009

Notebook HEARTBEAT

Contents

July

7-27-2009

August

8-3-2009

8-4-2009

8-5-2009

8-6-2009

8-7-2009

8-10-2009

8-11-2009

8-12-2009

8-13-2009

8-14-2009

8-18-2009

8-19-2009

8-20-2009

8-21-2009

8-24-2009

8-25-2009

8-26-2009

8-27-2009

8-28-2009

8-31-2009

September

9-1-2009

9-2-2009

9-3-2009

9-4-2009

9-5-2009

9-6-2009

9-7-2009

9-8-2009

9-9-2009

9-10-2009

9-11-2009

9-14-2009

9-15-2009

9-16-2009

9-17-2009

9-18-2009

9-20-2009

9-21-2009

9-22-2009

9-23-2009

9-24-2009

9-25-2009

9-28-2009

9-29-2009

9-30-2009

October

10-1-2009

10-2-2009

10-5-2009

10-6-2009

10-7-2009

10-8-2009

10-9-2009

10-12-2009

10-13-2009

10-14-2009

10-15-2009

10-16-2009

10-17-2009

10-18-2009

10-19-2009

10-20-2009

10-21-2009

10-22-2009