Team:Heidelberg/Project Measurement

From 2009.igem.org

|width="650px" style="padding: 0 15px 15px 20px; background-color:#ede8e2"|
__NOTOC__
{{Template_HD}}

== Synthetic Promoters ==

'''The central question of the synthetic promoter project is: Are we able to make specific promoters by predicting their sequence ''in silico''?'''

Or, going even further: Are we able to develop a standard method for creating promoters of
* '''Defined strength'''
* '''Defined response'''
* '''Defined pathway integration'''

How this is supposed to work... Read on!

=== Abstract ===

Promoters are the key regulators of gene expression. Possessing promoters which are active under a desired condition, at a desired strength and in a specified tissue is of great value for plant biotechnology, gene therapy and fundamental research in bioscience. Therefore, there is growing interest in synthetically constructing promoters responsive to a variety of pathways. We explore two routes to the synthesis of promoters: On the one hand, we have developed a bioinformatic model and database (HEARTBEAT) describing the structure of promoters responsive to user-defined inputs. On the other hand, we have developed a biochemical method for the synthesis of randomized promoter libraries. Using this method, we have created a library of constitutive promoters of varying strength. We have also created libraries of promoters putatively responsive to a variety of pathways. We have screened these libraries for functional, pathway-responsive promoters and present a detailed characterization of an NF-κB responsive promoter of our making. Finally, we discuss ways to combine randomized biochemical synthesis and bioinformatic modeling into a method for generating promoters with complex regulation (i.e. regulation by multiple pathways).
=== Introduction ===

Promoters are the key regulators of gene expression. Possessing promoters which are active under a desired condition, at a desired strength and in a specified tissue is of great value for plant biotechnology, gene therapy and fundamental research in bioscience. Most efforts to obtain such promoters focus on cloning them from nature. In eukaryotes, this approach is flawed for three reasons: First, promoters in eukaryotic cells are regulated in a very complex manner by a wide variety of transcription factors, and thus pathways [[Team:Heidelberg/Project_Synthetic_promoters#References|[1]]]. Therefore, natural promoters cannot be used reliably as transcriptional assays. Second, a promoter might be required to be active under a set of conditions for which no natural promoter exists. Third, for precise control of gene expression levels, promoters of human-defined transfer functions and expression strengths are required.<br>

Therefore, efforts have emerged to synthetically construct promoters. Two concepts of synthetic promoters in mammalian cells co-exist independently of each other. One is the concept of "genetic switches" (see [[Team:Heidelberg/Project_Synthetic_promoters#References|[10]]] for a recent review): promoters which can be specifically induced by a stimulus to which mammalian cells are usually insensitive, e.g. tetracycline [[Team:Heidelberg/Project_Synthetic_promoters#References|[11]]]. Far less effort has been put into developing promoters sensitive to ''endogenous'' signals (referred to as "synthetic promoters" in the rest of this article). Such promoters are of very high value for a broad variety of applications, as three examples demonstrate. First, in virotherapy for cancer and other diseases, it is desirable to express toxic genes only in affected cells (reviewed in [[Team:Heidelberg/Project_Synthetic_promoters#References|[12]]]). For example, breast cancer cells are characterized by high levels of estrogen receptor (ER). Constructing a promoter which is active only at high estrogen receptor levels (plus, maybe, only in cells which are irradiated, as ER can also be very active in other tissues of the female reproductive tract) might therefore help in developing novel breast cancer therapies. Second, biologists studying pathway interactions need transcriptional assays, that is, promoters which are specifically activated by a single transcription factor. Third, the concept can be transferred to plants, where synthetic promoters can be very valuable, as plant biotechnology is always in need of novel tissue- or development-specific promoters.<br>

Three approaches exist to construct synthetic promoters responsive to endogenous factors. First, the structure of promoters is modeled by generating large data sets describing the relative spacing and coincidence of transcription factors (reviewed in [[Team:Heidelberg/Project_Synthetic_promoters#References|[4]]]). To our knowledge, such predictions have not been tested ''in vivo''. Second, promoters are generated by randomly or repeatedly cloning response elements upstream of a core promoter. To our knowledge, repeated cloning of response elements works well [[Team:Heidelberg/Project_Synthetic_promoters#References|[5]]] and is frequently applied, but no suggestions exist on how to apply this strategy to the generation of more complexly regulated promoters. The random creation of promoters works well to generate constitutive promoters [[Team:Heidelberg/Project_Synthetic_promoters#References|[6]]] and was even applied to broadly identify activating elements [[Team:Heidelberg/Project_Synthetic_promoters#References|[2]]], but no promoters of specific regulation have been described for this approach. A third approach is the randomization of spacer elements between transcription factor binding sites, which is applied to generate libraries of promoters of varying strength [[Team:Heidelberg/Project_Synthetic_promoters#References|[3]]], [[Team:Heidelberg/Project_Synthetic_promoters#References|[8]]].

In order to be able to design synthetic promoters, an understanding of natural promoters is required. Mammalian promoters can be subdivided into several "domains". The ''core promoter'' is the binding site of the basal transcription machinery, i.e. RNA polymerase and associated factors. Core promoters differ in composition, but are more or less similar for most genes (reviewed in [[Team:Heidelberg/Project_Synthetic_promoters#References|[9]]]). The main regulatory domain is the ''proximal promoter'', which is where regulatory elements bind. It can be very large (4 kb), meaning that some transcription factors regulate transcription despite being very far away from the RNA polymerase. This is mainly possible because of the three-dimensional structure the DNA adopts. In addition, there are even more distal elements that are referred to as "enhancers" and "silencers". A further challenge is that some transcription factors are not able to initiate transcription on their own, but require other transcription factors for their activity.
== Results ==

=== RA-PCR, a method for the generation of randomized promoter libraries ===

[[Image:HD09_rapcr.jpg|none|thumb|650px|'''Figure 1: The method of RA-PCR''']]

We have developed a standard method (termed "Random Assembly PCR / RA-PCR") for the construction of randomized promoter libraries. We modified Assembly PCR [[Team:Heidelberg/Project_Synthetic_promoters#References|[7]]] to create randomized promoters instead of ordered genes by using different oligos containing a transcription factor binding site (or random DNA) plus two annealing sequences (see Figure 1 for a comprehensive explanation of the method). We use two sets of oligos, one for the top strand and one for the bottom strand. The oligos for each strand have the same annealing sequences (which are complementary to the annealing sequences of the other strand). If these oligos are pooled, they will randomly anneal to each other, thus generating randomized repeats of the transcription factor binding sites of interest at varying spacing. In order to be able to clone the construct, we also add two stop oligos (termed stop 5' and stop 3') which contain only one annealing sequence, plus a cutsite (SpeI 5', HindIII 3'). Double-stranded DNA is created by running a seven-cycle PCR, and amplified by a 25-cycle PCR. Then, the resulting (proximal) promoter is cloned 5' of a core promoter (we used the core promoter of JeT [[Team:Heidelberg/Project_Synthetic_promoters#References|[8]]]) by inserting it into [[Team:Heidelberg/Project_Measurement#A_promoter_measurement_kit_for_use_in_mammalian_systems|pSMB_MEASURE]], the promoter measurement plasmid we developed (from there, it can be excised like any standard biological part in a submission plasmid). Thus, a mixture of different promoters in the same plasmid backbone is generated. These can then be transformed into bacteria. Each colony represents a single putative promoter, which can then be transfected into mammalian cells under the conditions of interest, plus control conditions. Promoters active under the desired conditions, but not under control conditions, are selected for further characterization. <br>

Please see the detailed protocol for RA-PCR [[Team:Heidelberg/Project_Synthetic_promoters#RA-PCR_protocol|below]].
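The random assembly step can be illustrated with a small simulation. This is only a minimal sketch under simplifying assumptions, not software we used: each oligo is treated as an abstract element, elements are drawn at random in proportion to their amount in the pool, and a chain ends as soon as a stop oligo is incorporated. The pool composition, the single <code>STOP</code> entry standing in for both stop oligos, and the names <code>assemble_promoter</code> and <code>make_library</code> are hypothetical.

```python
import random

# Hypothetical pool: element name -> relative amount (e.g. pipetted volume in µL).
# "STOP" stands in for both stop oligos; this is a simplification.
POOL = {"NFkB": 6.0, "spacer": 3.0, "activator": 2.0, "STOP": 0.8}


def assemble_promoter(pool, rng, max_elements=50):
    """Draw elements at random (weighted by amount) until a stop oligo is drawn."""
    names = list(pool)
    weights = [pool[name] for name in names]
    elements = []
    while len(elements) < max_elements:
        pick = rng.choices(names, weights=weights, k=1)[0]
        if pick == "STOP":
            break
        elements.append(pick)
    return elements


def make_library(pool, n_clones, seed=0):
    """Simulate n_clones independent assemblies with a reproducible seed."""
    rng = random.Random(seed)
    return [assemble_promoter(pool, rng) for _ in range(n_clones)]


if __name__ == "__main__":
    for i, clone in enumerate(make_library(POOL, 5), start=1):
        print(f"clone {i}: {'-'.join(clone) or '(empty)'}")
```

In this toy model the number of elements per clone follows a (truncated) geometric distribution, so a larger share of stop oligo in the pool gives shorter promoters on average. Real assembly products additionally depend on annealing and extension efficiency, which the sketch ignores.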
=== Generation of a library of constitutive promoters ===

[[Image:HD09_constitutive.png|left|thumb|350px|'''Figure 2: A library of constitutive promoters created by RA-PCR''' Promoters were analyzed by the standards developed in the [[Team:Heidelberg/Project_Measurement#Measuring REU_by_flow_cytometry_and_image_analysis|Measurement]] part of our project in HeLa cells.]]

As a first application of RA-PCR, we have created a library of constitutive promoters. We performed RA-PCR on oligos containing binding sites for some well-known, generally activating transcription factors (Sp1, Ap1, CREB, NF-Y), which we identified from a literature search [[Team:Heidelberg/Project_Synthetic_promoters#References|[2]]],[[Team:Heidelberg/Project_Synthetic_promoters#References|[6]]],[[Team:Heidelberg/Project_Synthetic_promoters#References|[8]]]. We also added NF-&kappa;B responsive oligos, as NF-&kappa;B has non-specific activity and is therefore used by a variety of viral constitutive promoters, e.g. the HIV promoter [[Team:Heidelberg/Project_Synthetic_promoters#References|[13]]]. We picked 24 colonies, two of which we dismissed after a test digest (not shown). Figure 3 shows the sequence analysis of some randomly selected clones and demonstrates that RA-PCR is able to generate randomized repeats of oligos. We then measured the activity of the clones we picked by applying the [[Team:Heidelberg/Project_Measurement|Concept of Relative Expression Units (REU)]] we developed. Figure 2 shows that we have been able to create a library of promoters of varying strength, some of which have an expression strength higher than JeT (which was not accomplished by JeT's developers, although attempted [[Team:Heidelberg/Project_Synthetic_promoters#References|[8]]]). Such a library is of great value for fine-tuning gene expression levels.
[[Image:HD09_0109_const.jpg|left|thumb|350px|'''Figure 3: RA-PCR generates randomized repeats of transcription factor binding sites.''' Sequence analysis of clones of constitutive promoters generated by RA-PCR. Transcription factor binding sites are marked in color, random sequences in light grey.]]
=== Generation and screening of a library of promoters putatively responsive to NF-&kappa;B ===

[[Image:HD09_nfkb.png|left|thumb|350px|'''Figure 4: A library of putative NF-&kappa;B responsive promoters created by RA-PCR''' Promoters were induced by TNF-&alpha; in U2OS cells and screened by TECAN (automated plate reader).]]
<br>

RA-PCR was conducted with oligos containing an NF-&kappa;B binding site, plus a small number of "general activators" (NF-Y, Sp1, Ap1, CREB). Box 1 demonstrates how the oligos were designed from a frequency matrix. 33 clones were picked, miniprepped and transfected. NF-&kappa;B was then induced by the addition of TNF-&alpha; (2.5µM) for 10 hours, and left uninduced as a control. The plate was then scanned by TECAN, an automated fluorescence plate reader. TECAN is very imprecise on eukaryotic cells, and the arbitrary fluorescence we measured is not proportional to [[Team:Heidelberg/Project_Measurement|REU]] or another precise measure of mammalian promoter activity, but it can serve as a rough indicator of promoter induction. The result (Fig. 4) shows that most clones appear not to be induced by NF-&kappa;B, whereas others are induced at varying levels of strength. Considering the sequence analysis of some randomly selected clones (Fig. 5), this result is not intuitive, as most sequences contain an NF-&kappa;B binding site, but it demonstrates that simply cloning repeats of a transcription factor binding site in front of a core promoter will not necessarily work.<br>

We picked clone 31 for further characterization in REU.
[[Image:HD09_nfkbseq.jpg|left|thumb|350px|'''Figure 5: Sequence analysis of putative NF-&kappa;B responsive promoters''']]

[[Image:HD09_nfkbmatrix.jpg|left|thumb|350px|'''Box 1: RA-PCR allows for synthesis of promoters responsive to imprecisely described transcription factors.''' Considering the graphical representation of NF-kappaB's frequency matrix shown above (source: [http://jaspar.cgb.ki.se/ JASPAR]), the oligo can be designed to represent this matrix, instead of a static NF-kappaB binding site. A sensible representation of this matrix would be GGGRHTTYCC (for the IUPAC nucleotide code, refer to [http://www.bioinformatics.org/sms/iupac.html bioinformatics.org]). Most oligonucleotide manufacturers provide the option to synthesize such mixtures of individual oligos at no additional cost. As our method is PCR-based (unlike other methods such as [[Team:Heidelberg/Project_Synthetic_promoters#References|[5]]] and [[Team:Heidelberg/Project_Synthetic_promoters#References|[6]]]), we are able to synthesize even promoters responsive to badly described transcription factors.]]
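To make the degenerate oligo in Box 1 concrete, the following minimal sketch expands an IUPAC-coded sequence into the individual oligos it stands for. The function name <code>expand_degenerate</code> is our own choice for illustration. Note that the mixture contains each variant in equal proportion, so it only approximates the weights of the frequency matrix.

```python
from itertools import product

# Standard IUPAC nucleotide codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def expand_degenerate(oligo):
    """Return every concrete sequence encoded by a degenerate (IUPAC) oligo."""
    choices = [IUPAC[base] for base in oligo.upper()]
    return ["".join(combination) for combination in product(*choices)]


variants = expand_degenerate("GGGRHTTYCC")
print(len(variants))  # 12 = 2 (R) * 3 (H) * 2 (Y)
for sequence in variants:
    print(sequence)
```

The binding-site oligo GGGRHTTYCC thus corresponds to a mixture of 12 individual sequences.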
=== Characterization of a NF-&kappa;B responsive promoter ===

'''Hannah and Corinna, I need your data. Possibly also ibidi cotransfection, video etc.'''

=== RA-PCR can generate promoters responsive to a variety of pathways ===

We performed RA-PCR to construct promoters putatively responsive to transcription factors as diverse as p53 (DNA damage sensor), pPAR&gamma; (metabolism and diabetes), SREBP (sterol nutrition), HIF (hypoxia) and estrogen receptor. While screening these promoters we found the following:
[[Image:HD09_ppary_data.png|left|thumb|220px|'''Figure 6: pPAR&gamma; responsive promoters induced by Thiazolidinedione in U2OS cells.''']]
* For pPAR&gamma;, screening identified two clones which appear to be responsive to the anti-diabetes drug [http://en.wikipedia.org/wiki/Thiazolidinedione Thiazolidinedione]. We roughly characterized these promoters by a triple TECAN read relative to JeT (Fig. 6).
* For p53, induction of the pathway by the topoisomerase inhibitor [http://en.wikipedia.org/wiki/Camptothecin Camptothecin] (an anti-cancer drug) turned out to be difficult, as it severely harms the cells and makes promoter induction levels difficult to assess. We therefore attempted to normalize screening conditions to the number of living cells by Hoechst staining. We found that some promoters appeared to be strongly downregulated by Camptothecin and therefore experimented with a variety of conditions inducing p53 by different pathways, at different phosphorylation sites, but were unable to obtain a conclusive picture.
* For HIF, we failed to induce the conditions sufficiently to achieve promoter activation. We [[Team:Heidelberg/Project_Synthetic_promoters#Improving_RA-PCR|discuss]] below how screening can be improved.
* For SREBP and estrogen receptor, we encountered technical problems during promoter synthesis (probably a damaged HindIII enzyme) and were therefore unable to produce enough clones for a sufficient screening. For SREBP, we therefore cloned two natural, SREBP-upregulated promoters we had at hand and submitted them to the registry (where a characterization can be found).
=== HEARTBEAT, a model describing promoter structure ===
[[Project_heartbeat|Main article: HEARTBEAT]]<br>

[[Image:HD09_VDRspatial.png|left|thumb|220px|'''Figure 7: Probability density function for the distribution of VDR-Binding sites along an ideal promoter as modelled by HEARTBEAT''']]Based on the assumption that transcription factors (TFs) have a spatial preference for binding to natural promoter sequences with respect to the distance to the transcriptional start site (TSS) [[Team:Heidelberg/Project_Synthetic_promoters#References|[14]]], we developed HEARTBEAT (Heidelberg Artificial Transcription Factor Binding Site Engineering and Assembly Tool). In a first step, 4395 human promoter sequences spanning 1000 bp upstream of the TSS, obtained from the UCSC genome browser, were analysed by the program "PromoterSweep" [[Team:Heidelberg/Project_Synthetic_promoters#References|[15]]]. PromoterSweep is able to assign transcription factor binding sites (TFBS) to a given sequence by retrieving and combining information from three homology databases (EnsEMBL Compara, NCBI HomoloGene, DoOP database), five promoter databases (e.g. EPD, DBTSS), six sequence motif identification tools (e.g. Meme, Gibbs Motif Sampler) and two matrix profile databases (Jaspar Core Library, Transfac Professional Library). Each TFBS motif is further classified as weak, conserved or reliable according to the quality of the assignment. The final result of PromoterSweep can be divided into general spatial information about the TFBS and the consensus sequence on the one hand, and further detailed facts about the associated gene on the other.<br>

[[Image:HD09_VDRcoin.png|left|thumb|220px|'''Figure 8: Frequency of other transcription factors occurring together with VDR. 680 transcription factors were examined, of which the displayed 340 show coincidence at least once.''']] In Figure 7, the spatial distribution of VDR (Vitamin D receptor) binding sites within 140 natural promoter sequences is shown as an example. The height of each bin equals the number of VDR TFBS within a range of 20 bp. The solid line represents the probability density function (pdf). Here, the maximum of the pdf is located 54 bp upstream of the TSS, as indicated by the vertical line. Natural promoter sequences usually exhibit multiple TFBS, which implies dependencies between different TFs with respect to their DNA binding behaviour. Figure 8 shows the frequency distribution of TFBS that occur together with VDR. The highest peak represents VDR itself. The next three highest peaks are Kid3 (inhibitory), WT1 and AP-2 (stimulatory). In total, over 300 different TFBS are coincidentally present together with VDR. Both plots represent data deduced from the HEARTBEAT database, which enables a well-defined synthesis of promoter sequences.
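The kind of analysis behind Figure 7 can be sketched as follows: binding-site distances are counted in 20 bp bins, a smooth density is estimated, and its maximum is located. This is only an illustration. We assume a Gaussian kernel density estimate here, which is not necessarily the estimator used in HEARTBEAT, and the positions, the bandwidth and all function names are made up for the example.

```python
import math
from collections import Counter


def bin_positions(positions, bin_size=20):
    """Count binding sites per bin; the key is the bin's lower edge (bp upstream of the TSS)."""
    return Counter((p // bin_size) * bin_size for p in positions)


def gaussian_kde(positions, bandwidth):
    """Return a function that estimates the probability density at distance x."""
    norm = 1.0 / (len(positions) * bandwidth * math.sqrt(2.0 * math.pi))

    def pdf(x):
        return norm * sum(math.exp(-0.5 * ((x - p) / bandwidth) ** 2) for p in positions)

    return pdf


def density_peak(positions, bandwidth=15.0, start=0, stop=1000):
    """Distance (in whole bp) at which the estimated density is highest."""
    pdf = gaussian_kde(positions, bandwidth)
    return max(range(start, stop + 1), key=pdf)


# Made-up distances (bp upstream of the TSS), for illustration only.
example = [30, 42, 48, 51, 55, 58, 60, 63, 71, 90, 140, 310, 620]
print(sorted(bin_positions(example).items()))
print(density_peak(example))
```

The bandwidth controls how strongly the density is smoothed. It is a free parameter of this sketch and would have to be chosen to suit the real data.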
=== An ''in vivo'' test of predicted promoter sequences ===

''ongoing work''

== Discussion ==

The results shown above demonstrate the potential of RA-PCR for the synthesis of a wide range of promoters. Even by analyzing a modest number of clones for each individual pathway, we were able to obtain promoters of widely varying strength and inducibility. We were also able to obtain constitutive promoters of greater strength than JeT, which has not been possible before [[Team:Heidelberg/Project_Synthetic_promoters#References|[8]]].<br>

Many insights into promoter regulation can be gained by analyzing different promoters created by RA-PCR. For example, clone 3 and clone 11 (see Figures 4 and 5) differ only in the positioning of the single response element (RE), yet their induction strength differs threefold. This gives hints about NF-&kappa;B's binding preference. A systematic study of promoters generated by RA-PCR and their strength could therefore be used to develop a comprehensive model of transcriptional regulation. '''Nao, can your model be linked here and described elegantly in one or two sentences?'''
=== Improving RA-PCR ===

''Screening conditions and induction strength:'' <br>

As noted [[Team:Heidelberg/Project_Synthetic_promoters#RA-PCR_can_generate_promoters_responsive_to_a_variety_of_pathways|above]], we experienced difficulties inducing some of the pathways (namely, HIF and p53). From our cell culture work, we learned that finding the ideal timepoint of induction for a certain pathway and the ideal conditions is very difficult even with literature at hand. Also, one would expect a much higher induction than the one observed for the NF-&kappa;B responsive clone we describe. Our induction levels might be low because NF-&kappa;B has a high constant activity '''link to nfkb review here''', especially if the cells encounter rough cell culture conditions. We therefore suggest that for future screening, a library of siRNAs for the transcription factors of interest should be compiled. In addition, a library of transcription factors mutated to be constantly active is required. With these libraries at hand, individual transcription factors can be knocked down or specifically activated, ideally at close to 100% efficiency. This will greatly facilitate screening and parts characterization.<br>

''Generation of down-regulated promoters''<br>

As shown, we were able to generate a set of promoters upregulated by certain factors. For several applications, promoters of a high constant strength, which become down-regulated by a signal, are required. We think it might be possible to construct such promoters by performing an RA-PCR with oligos containing weak binding sites for generally activating transcription factors (that is, binding sites which deviate from the consensus sequence), and adding some oligos containing very strong binding sites for the transcription factor of interest (say, NF-&kappa;B). If this factor is not active, the general activators will be able to bind to the DNA and activate transcription. Upon activation of the factor, the general activators will be displaced. If the binding site is then in a position where it does not initiate transcription (as for some of the clones (32 etc.) shown in Figures 4 and 5), the promoter will be downregulated instead of upregulated. This hypothesis remains to be tested.

=== M-RA-PCR, a model-guided biochemical method for synthesis of complex promoters ===

[[Image:HD09_dual.jpg|thumb|left|250px|'''Figure 9: RA-PCR can be modified to reflect probability density curves in vitro''']] RA-PCR can be modified to reflect modeled probability density curves. If a promoter regulated by multiple pathways, for example VDR (Vitamin D receptor) and SREBP (Sterol regulatory element binding protein), is to be constructed, the density curves obtained from the model (Figure 9) can give clues about its construction. A working VDR/SREBP promoter requires VDR and/or SREBP Response Elements (REs) in the close vicinity of the TSS (at approx. 850). It might require SREBP REs between 300 and 700, and VDR REs between 0 and 300. This distribution can be reflected by setting up 3 RA-PCRs with varying concentrations of VDR-responsive, SREBP-responsive and spacer oligos (compare figure B2.1). If a 3' stop oligo containing a NheI cutsite and a 5' stop oligo containing a SpeI cutsite (or any combination of cutsites yielding compatible ends) are used, in principle any number of RA-PCR products can be assembled and cloned in front of a core promoter (having a SpeI cutsite 5').<br>
We believe that this technique, termed Model-guided Random Assembly PCR, or M-RA-PCR, is the way forward to constructing the promoters of complex regulation described in the [[Team:Heidelberg/Project_Synthetic_promoters#Introduction|Introduction]].
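How modeled site distributions could be translated into oligo pools for the separate RA-PCRs can be sketched as follows. This is a hypothetical illustration, not a tested procedure: we assume that the share of each transcription factor's oligos in a segment's pool is simply proportional to the number of its binding sites falling into that segment, with a fixed share reserved for spacer oligos. The site positions, the segment borders, the spacer share and the function name <code>segment_mix</code> are all assumptions.

```python
def segment_mix(sites_by_tf, segments, spacer_share=0.2):
    """For each segment, return the fraction of the oligo pool assigned to each TF.

    sites_by_tf:  TF name -> list of binding-site positions (bp)
    segments:     list of (start, end) intervals, end exclusive
    spacer_share: fraction of every pool reserved for spacer oligos (assumed)
    """
    mixes = []
    for start, end in segments:
        counts = {
            tf: sum(start <= p < end for p in positions)
            for tf, positions in sites_by_tf.items()
        }
        total = sum(counts.values())
        # A segment without any sites is filled with spacer oligos only.
        mix = {"spacer": spacer_share if total else 1.0}
        for tf, count in counts.items():
            mix[tf] = (1.0 - spacer_share) * count / total if total else 0.0
        mixes.append(mix)
    return mixes


# Made-up site positions, for illustration only.
sites = {"VDR": [50, 100, 150, 800], "SREBP": [400, 500, 850]}
segments = [(0, 300), (300, 700), (700, 1000)]
for (start, end), mix in zip(segments, segment_mix(sites, segments)):
    print(f"{start}-{end}: {mix}")
```

Each resulting mix would correspond to one RA-PCR. The products would then be joined via the compatible cutsites as described above.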
=== Final remarks ===

We have developed two independent methods for the generation of truly synthetic promoters for use in mammalian cells and discussed possibilities for their combination and improvement. We are looking forward to continuing this work and generating promoters which can be used in medical or biotechnological applications, such as transcriptional targeting in virotherapy or [[Team:Heidelberg/Project_SaO|a reporter cell line]].

== Methods ==

=== RA-PCR protocol ===
* '''All oligos we used can be found in [[Notebook/material#Oligos_used_for_RA-PCR|Material and Methods]]'''
* Obtain density curves for the distribution of your TF of interest from our model. If this density curve shows a decisive peak at a distance >250 bp from the Transcriptional Start Site (TSS), continue with Box 2 (M-RA-PCR). If a peak is present close to the TSS, or if data is insufficient, continue here.<br>How our model was developed is detailed on the model page.<br>
* Check our model for transcription factors coinciding with your transcription factor of interest.<br>

* Design two annealing sites, each 15-18 base pairs long. Annealing sites should be void of transcription factor binding sites. Calculate the reverse complement of both sequences. We used the following sequences:
{| class="wikitable centered" border="2" rules="rows" style="border-color:white;"
|-
!
! Forward (F)
! Reverse Complement (RC)
|-
|Annealing Sequence 1 (AS1)
|GGGTGACGGGTTCA
|AGTGAACCCGTCACCC
|-
|Annealing Sequence 2 (AS2)
|GCGATCGGCAGATCA
|TGATCTGCCGATCGC
|-
|}
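The reverse complements in the table can be checked with a few lines of code. The sketch below is only a helper for verification; the function name <code>reverse_complement</code> is our own. Note that the AS1 entries as listed differ in length (14 versus 16 nt): the listed reverse complement carries two additional bases at its 5' end, so the sequences should be compared with the oligo list in Material and Methods before ordering.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(sequence):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return sequence.upper().translate(COMPLEMENT)[::-1]


# Annealing sequences as listed in the table: (forward, listed reverse complement).
ANNEALING = {
    "AS1": ("GGGTGACGGGTTCA", "AGTGAACCCGTCACCC"),
    "AS2": ("GCGATCGGCAGATCA", "TGATCTGCCGATCGC"),
}

for name, (forward, listed_rc) in ANNEALING.items():
    computed = reverse_complement(forward)
    status = "match" if computed == listed_rc else "differs from listed RC"
    print(f"{name}: {len(forward)} nt forward, {len(listed_rc)} nt RC, computed {computed} ({status})")
```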
* Design a 5' stop oligo containing a cutsite (SpeI) and AS1_F.
* Design a 3' stop oligo containing a cutsite (HindIII) and AS1_RC.
* Design forward and reverse oligos for each transcription factor of interest. Forward oligos contain AS2_F, the transcription factor binding site (TFBS) and AS1_F. Reverse oligos contain AS2_RC, the TFBS and AS1_RC. The TFBS should be designed to represent the matrix describing the factor's binding preferences (Box 1).
* Design forward and/or reverse oligos for coinciding transcription factors identified in step 2 in the same way as described in step 6.<br>
* Design forward and/or reverse oligos for general activators.
* Design forward and reverse spacer oligos, which contain a stretch of 10-15 N (random nucleotides) instead of a TFBS.
* Order oligos at 100µM. Pool the oligos. As a general rule, use 0.8µL each of Stop 5' and Stop 3', ~4µL of the transcription factor (forward), ~4µL of the transcription factor (reverse), 1-2µL each of the forward and reverse spacer oligo, ~1µL of coinciding transcription factors and a total of 0.5µL of general activators. For the examples shown below, we used the following mixtures of oligos:<br>
{| class="wikitable centered" width="800px" border="2" rules="rows" style="border-color:white;"
|-
! p53
! NFkB II
! HIF
! Activator Mix
|-
|width="200px"|6µL p53 (O.91)<br>
5µL p53 reverse (O.188)<br>
1µL random (O.56)<br>
1µL Activator Mix<br>
0.8µL Stop 5 new (O.187)<br>
0.8µL Stop 3 (O.58)<br>
|width="200px"|3µL NFkB-1 (O.93)<br>
3µL NFkB-2 (O.94)<br>
4µL NFkB-rev (O.194)<br>
3µL Random (O.56)<br>
2µL Activator Mix<br>
0.8µL Stop 5 new (O.187)<br>
0.8µL Stop 3 (O.58)<br>
|width="200px"|2.5µL HIF-1 (O.53)<br>
2.5µL HIF-2 (O.54)<br>
1µL CREB (O.89)<br>
3µL HIF-rev (O.189)<br>
3µL Random (O.56)<br>
1µL Stop 5 new (O.187)<br>
1µL Stop 3 (O.58)<br>
0.2µL each Ap1, Sp1 (O.55, O.57)
|width="200px"|2µL each Ap1, Sp1, CREB (O.55, O.57, O.89)<br>
1µL each NFY, Empty (O.90, O.95)<br>
Water to 30µL
|-
! SREBP
! AHR
! pPAR&gamma;
! Estrogen receptor
|-
|width="200px"|5µL SREBP (O.208)<br>
4µL SREBP reverse (O.209)<br>
1µL Sp1 (O.57)<br>
2µL random (O.56)<br>
1µL Activator Mix<br>
0.8µL Stop 5 new (O.187)<br>
0.8µL Stop 3 (O.58)<br>
|width="200px"|5µL AHR (O.212)<br>
4µL AHR reverse (O.213)<br>
2µL random (O.56)<br>
1.5µL Activator Mix<br>
0.8µL Stop 5 new (O.187)<br>
0.8µL Stop 3 (O.58)<br>
|width="200px"|5µL pPAR&gamma; (O.210)<br>
4µL pPAR&gamma; reverse (O.211)<br>
2µL random (O.56)<br>
1.5µL Activator Mix<br>
0.8µL Stop 5 new (O.187)<br>
0.8µL Stop 3 (O.58)<br>
|width="200px"|5µL Estrogen receptor (O.210)<br>
4µL Estrogen receptor reverse (O.211)<br>
2µL random (O.56)<br>
1.5µL Activator Mix<br>
0.8µL Stop 5 new (O.187)<br>
0.8µL Stop 3 (O.58)<br>
|}
* Introduce the pooled oligos into a PCR reaction at a final dilution of 1:200-1:500. We used Phusion MasterMix 2x (Finnzymes) as PCR reagent. Set up this PCR reaction twice in order to achieve greater heterogeneity.<br>
* Run the PCR, 7-10 cycles, with the following setup:<br>
**1 cycle initial denaturing, 5 minutes 95°C<br>
**7-10 cycles assembly: 30 seconds 95°C, 45 seconds 58°C, 45 seconds 72°C<br>
**Terminal hold, 4°C, forever<br>
* Remove oligonucleotides by performing a PCR purification using a PCR purification kit (QIAGEN) or a gel extraction using a gel extraction kit (QIAGEN).<br>
* Add PCR reagent (Phusion MasterMix 2x) again. Add 5' stop oligo and 3' stop oligo, 25pmol (1µL of 1:4 diluted stock). <br>
* Run the PCR, 25 cycles, with the following setup:<br>
**1 cycle initial denaturing, 5 minutes 95°C<br>
**25 cycles amplification: 30 seconds 95°C, 45 seconds 68°C, 60 seconds 72°C<br>
**Terminal hold, 4°C, forever<br>
* Gel purify PCR products to exclude everything <200 bp. Use a 1% agarose gel at 50V for at least 2h to achieve a good resolution.<br>
* Digest with HindIII and SpeI (or whatever cutsites were included in steps 4 and 5). Digest a reporter plasmid containing a core promoter and a reporter gene with the same enzymes. We used the plasmids (containing GFP as a reporter) for this task. Make sure to perform a thorough digest; in addition, treat the plasmid with shrimp alkaline phosphatase or calf intestine phosphatase afterwards. Gel purify the plasmid backbone and PCR purify the digested PCR products.<br>
* Ligate. Perform a thorough ligation to increase transformation efficiency. We used Fermentas T4 DNA Ligase for 5h at 19°C or overnight at 16°C.<br>
* Transform into competent ''E. coli'' cells and plate out. Pick no more than 20 colonies per individual PCR reaction. If more putative promoters are desired, set up several PCR reactions.<br>
* Isolate plasmid DNA from the selected colonies. We used a QIAGEN Miniprep kit for this task.<br>
* Recommended step: Test-digest miniprep DNA with the same enzymes used in step 17 to make sure you get plasmids with synthesized promoters of varying length. The length of the inserts (that is, synthetic promoters) should be between 100 and 600 base pairs. If this is not the case, vary the stop oligo concentration in step 10, improve the gel purification setup in step 16 or alter the PCR conditions in steps 12 and 15.<br>
* Perform a screening to select functioning clones. For example, transfect clones in triplicate into eukaryotic cells on a 96-well plate by using transfection agents such as EFFECTENE or Lipofectamine. Then, induce the conditions of interest in one replicate, shut them off in a second replicate, and leave control medium on the third replicate. When the pathway is fully active, read fluorescence (or luminescence, if a luciferase reporter is used) by a plate reader (TECAN) or other automated methods. We used the following conditions for promoter screening:<br>
'' to be added ''
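The amounts given in the protocol can be checked with simple arithmetic. The following sketch is only a convenience calculation; the function names are hypothetical. It converts a diluted stock volume into pmol and computes the volume share of each component in a pool, using the NFkB II mix from the table above as input. Since the Activator Mix is itself a diluted mixture, its share is a volume share only.

```python
def pmol_in_volume(stock_uM, dilution_factor, volume_uL):
    """pmol of oligo in volume_uL of a stock diluted 1:dilution_factor (1 µM = 1 pmol/µL)."""
    return stock_uM / dilution_factor * volume_uL


def pool_fractions(volumes_uL):
    """Volume share of each component in a pool."""
    total = sum(volumes_uL.values())
    return {name: volume / total for name, volume in volumes_uL.items()}


# NFkB II mix from the table above (volumes in µL).
NFKB_II = {
    "NFkB-1": 3, "NFkB-2": 3, "NFkB-rev": 4, "Random": 3,
    "Activator Mix": 2, "Stop 5 new": 0.8, "Stop 3": 0.8,
}

print(pmol_in_volume(100, 4, 1))  # 25.0 pmol, as stated for the stop oligos
for name, share in pool_fractions(NFKB_II).items():
    print(f"{name}: {share:.1%}")
```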
=== References ===
[1] Alberts, B. et al. Molecular Biology of the Cell (5th edition). New York: Garland Science, p. 432-453.

[2] Edelman, G.M. et al. Synthetic promoter elements obtained by nucleotide sequence variation and selection for activity. PNAS 97, 3038-43 (2000).

[3] Ellis, T. et al. Diversity-based, model-guided construction of synthetic gene networks with predicted functions. Nature Biotechnology 27, 465-471 (2009).

[4] Venter, M. Synthetic promoters: genetic control through cis engineering. Trends in Plant Science 12, 118-124 (2007). (and the references cited therein)

[5] Rushton, P.J. et al. Synthetic plant promoters containing defined regulatory elements provide novel insights into pathogen- and wound-induced signalling. Plant Cell 14, 749–762 (2002).

[6] Ogawa, R. Construction of strong mammalian promoters by random cis-acting element elongation. Biotechniques 42, 628-632 (2007).

[7] Stemmer, W.P.C. et al. Single-step assembly of a gene and entire plasmid from large numbers of oligodeoxyribonucleotides. Gene 164, 49-53 (1995).

[8] Tornoe, J. Generation of a synthetic mammalian promoter library by modification of sequences spacing transcription factor binding sites. Gene 297, 21-32 (2002).

[9] Heintzman, N.D., Ren, B. The gateway to transcription: identifying, characterizing and understanding promoters in the eukaryotic genome. Cellular and Molecular Life Sciences 64, 386-400 (2007).

[10] Fussenegger, M., Weber, W. Engineering of Synthetic Mammalian Gene Networks. Chemistry and Biology 16, 287-297 (2009).

[11] Gossen, M., Bujard, H. Tight control of gene expression in mammalian cells by tetracycline-responsive promoters. PNAS 89, 5547-5551 (1992).

[12] Dorer, D.E., Nettelbeck, D. Targeting cancer by transcriptional control in cancer gene therapy and viral oncolysis. Advanced Drug Delivery Reviews 61, 554-557 (2009).

[13] Rattner, A. NF-kappa B activates the HIV promoter in neurons. EMBO J. 12, 4261–4267 (1993).

[14] Yokoyama, K.D. et al. Measuring spatial preferences at fine-scale resolution identifies known and novel cis-regulatory element candidates and functional motif-pair relationships. Nucleic Acids Res., 1-21 (2009).

[15] del Val, C. et al. PromoterSweep: a tool for identification of transcription factor binding site. Theor Chem Acc (in press)
 +
|width="250px" style="padding: 0 20px 15px 15px; background-color:#d8d5d0"|
|width="250px" style="padding: 0 20px 15px 15px; background-color:#d8d5d0"|
|}
|}

Revision as of 10:43, 17 October 2009



Synthetic Promoters

The central question of the synthetic promoter project is: Are we able to make specific promoters by predicting their sequence in silico?

Or, going even further: Are we able to develop a standard method for creating promoters of

  • Defined strength
  • Defined response
  • Defined pathway integration

How this is supposed to work... Read on!

Abstract

Promoters are the key regulators of gene expression. Promoters that are active under a desired condition, at a desired strength and in a specified tissue are of great value for plant biotechnology, gene therapy and fundamental research in bioscience. There is therefore considerable interest in synthetically constructing promoters responsive to a variety of pathways. We explore two routes to promoter synthesis: On the one hand, we have developed a bioinformatical model and database (HEARTBEAT) describing the structure of promoters responsive to user-defined inputs. On the other hand, we have developed a biochemical method for the synthesis of randomized promoter libraries. Using this method, we have created a library of constitutive promoters of varying strength, as well as libraries of promoters putatively responsive to a variety of pathways. We have screened these libraries for functional, pathway-responsive promoters and present a detailed characterization of an NF-κB-responsive promoter of our making. Finally, we discuss ways to combine randomized biochemical synthesis and bioinformatical modeling into a method for generating promoters with complex regulation (i.e. by multiple pathways).

Introduction

Promoters are the key regulators of gene expression. Promoters that are active under a desired condition, at a desired strength and in a specified tissue are of great value for plant biotechnology, gene therapy and fundamental research in bioscience. Most efforts to obtain such promoters focus on cloning them from nature. In eukaryotes, this approach is flawed for three reasons: First, promoters in eukaryotic cells are regulated in a very complex manner by a wide variety of transcription factors, and thus pathways [1]. Therefore, natural promoters cannot be used reliably as transcriptional assays. Second, a promoter might be required to be active under a set of conditions for which no natural promoter exists. Third, for precise control of gene expression levels, promoters of human-defined transfer functions and expression strengths are required.

Therefore, efforts have emerged to synthetically construct promoters. Two concepts of synthetic promoters in mammalian cells co-exist independently of each other. One is the concept of "genetic switches" (see [10] for a recent review): promoters which can specifically be induced by a stimulus mammalian cells are usually insensitive to, e.g. tetracycline [11]. Far less effort has been put into developing promoters sensitive to endogenous signals (referred to as "synthetic promoters" in the rest of this article). Such promoters are of very high value for a broad variety of applications, as three examples demonstrate. First, in virotherapy for cancer and other diseases, it has become desirable to express toxic genes only in affected cells (reviewed in [12]). For example, breast cancer cells are characterized by high levels of estrogen receptor. Constructing a promoter which is active only at high estrogen receptor levels (plus, maybe, only in cells which are irradiated, as ER can also be very active in other tissues of the female reproductive tract) might therefore help in developing novel breast cancer therapies. Second, biologists studying pathway interactions are in need of transcriptional assays, that is, promoters which are specifically activated by a single transcription factor. Third, the concept can be transferred to plants, where synthetic promoters can be very valuable, as plant biotechnology is always in need of novel tissue- or development-specific promoters.

Three approaches exist to construct synthetic promoters responsive to endogenous factors. First, the structure of promoters is modeled by generating large data sets describing the relative spacing and coincidence of transcription factors (reviewed in [4]). To our knowledge, such predictions have not been tested in vivo. Second, promoters are generated by randomly or repeatedly cloning response elements upstream of a core promoter. To our knowledge, repeated cloning of response elements works well [5] and is frequently applied, but no suggestions exist on how to apply this strategy to the generation of more complexly regulated promoters. The random creation of promoters works well to generate constitutive promoters [6] and was even applied to broadly identify activating elements [2], but no promoters of specific regulation have been described for this approach. A third approach is the randomization of spacer elements between transcription factor binding sites, which is applied to generate libraries of promoters of varying strength [3], [8].

In order to be able to design synthetic promoters, an understanding of natural promoters is required. Mammalian promoters can be subdivided into several "domains". The core promoter is the binding site of the basal transcription machinery, i.e. RNA polymerase and associated factors. Core promoters differ in composition, but are more or less similar for most genes (reviewed in [9]). The main regulatory domain is the proximal promoter, which is where regulatory elements bind. It can be very large (4kb), meaning that some transcription factors regulate transcription despite being very far away from the RNA polymerase. This is mainly possible because of the three-dimensional structure the DNA adopts. In addition to this, there are even more distal elements that are referred to as "enhancers" and "silencers". A further challenge is that some transcription factors are not able to initiate transcription on their own, but rather they require other transcription factors for their activity.

Results

RA-PCR, a method for the generation of randomized promoter libraries

Figure 1: The method of RA-PCR

We have developed a standard method (termed "Random Assembly PCR / RA-PCR") for the construction of randomized promoter libraries. We modified Assembly PCR [7] to create randomized promoters instead of ordered genes by using different oligos containing a transcription factor binding site (or random DNA) plus two annealing sequences (see Figure 1 for a comprehensive explanation of the method). We use two sets of oligos, one for the top strand, one for the bottom strand. The oligos for each strand have the same annealing sequences (which are complementary to the annealing sequences of the other strand). If these oligos are pooled, they will randomly anneal to each other, thus generating randomized repeats of the transcription factor binding sites of interest at varying spacing. In order to be able to clone the construct, we also add two stop oligos (termed stop 5' and stop 3') which contain only one annealing sequence, plus a cutsite (SpeI 5', HindIII 3'). Double-stranded DNA is created by running a seven-cycle PCR, and amplified by a 25-cycle PCR. Then, the resulting (proximal) promoter is cloned 5' of a core promoter (we used the core promoter of JeT [8]) by inserting it into pSMB_MEASURE, the promoter measurement plasmid we developed (from there, it can be excised like any standard biological part in a submission plasmid). Thus, a mixture of different promoters in the same plasmid backbone is generated. These can then be transformed into bacteria. Each colony represents a single putative promoter, which can then be transfected into mammalian cells under the conditions of interest, plus control conditions. Promoters active under the desired conditions, but not under control conditions, are selected for further characterization.
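The random assembly described above can be illustrated with a small toy simulation. This is only a sketch under strong assumptions, not a model of the actual annealing chemistry: each extension step is treated as a weighted random draw from the oligo pool, drawing a stop oligo ends the chain, and the weights (loosely based on the volumes of the NFkB II pool in the protocol) as well as the names `POOL`, `assemble_promoter` and `simulate_library` are hypothetical.

```python
import random

# Relative amounts (µL) loosely following the "NFkB II" pool of the protocol.
# Treating pipetted volume as a sampling weight is an assumption of this toy model.
POOL = {"NF-kB": 10.0, "spacer": 3.0, "activator": 2.0, "STOP": 1.6}


def assemble_promoter(pool, rng, max_elements=50):
    """Grow one chain by drawing elements until a stop oligo is drawn."""
    names = list(pool)
    weights = [pool[name] for name in names]
    chain = []
    while len(chain) < max_elements:
        element = rng.choices(names, weights=weights, k=1)[0]
        if element == "STOP":
            break  # a stop oligo carries only one annealing sequence
        chain.append(element)
    return chain


def simulate_library(pool, n_clones, seed=0):
    """Return a list of simulated clones (each a list of element names)."""
    rng = random.Random(seed)
    return [assemble_promoter(pool, rng) for _ in range(n_clones)]


if __name__ == "__main__":
    for i, clone in enumerate(simulate_library(POOL, 5), start=1):
        print(f"clone {i}: {len(clone)} elements: {'-'.join(clone) or '(empty)'}")
```

In this toy model, raising the weight of `STOP` shortens the simulated promoters on average, which mirrors the protocol's advice to vary the stop oligo concentration when insert lengths are off.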

Please see the detailed protocol for RA-PCR below.

Generation of a library of constitutive promoters

Figure 2: A library of constitutive promoters created by RA-PCR. Promoters were analyzed in HeLa cells by the standards developed in the Measurement part of our project.

As a first application of RA-PCR, we have created a library of constitutive promoters. We performed RA-PCR on oligos containing binding sites for some well-known generally activating transcription factors (Sp1, Ap1, CREB, NF-Y), which we identified through a literature search [2],[6],[8]. We also added NF-κB-responsive oligos, as NF-κB has non-specific activity and is therefore used by a variety of viral constitutive promoters, e.g. the HIV promoter [13]. We picked 24 colonies, two of which we dismissed after a test digest (not shown). Figure 3 shows the sequence analysis of some randomly selected clones and demonstrates that RA-PCR is able to generate randomized repeats of oligos. We then measured the activity of the clones we picked by applying the concept of Relative Expression Units (REU) we developed. Figure 2 shows that we have been able to create a library of promoters of varying strength, some of which have an expression strength higher than JeT (which was not accomplished by JeT's developers, although attempted [8]). Such a library is of great value for fine-tuning gene expression levels.

Figure 3: RA-PCR generates randomized repeats of transcription factor binding sites. Sequence analysis of clones of constitutive promoters generated by RA-PCR. Transcription factor binding sites are marked in color, random sequences in light grey.

Generation and screening of a library of promoters putatively responsive to NF-κB

Figure 4: A library of putative NF-κB responsive promoters created by RA-PCR. Promoters were induced by TNF-α in U2OS cells and screened by TECAN (automated plate reader).


RA-PCR was conducted with oligos containing an NF-κB binding site, plus a small number of "general activators" (NF-Y, Sp1, Ap1, CREB). Box 1 demonstrates how the oligos were designed from a frequency matrix. 33 clones were picked, miniprepped and transfected. NF-κB was then induced by the addition of TNF-α (2.5µM) for 10 hours, or left uninduced as a control. The plate was then scanned by TECAN, an automated fluorescence plate reader. TECAN is very imprecise on eukaryotic cells, and the arbitrary fluorescence we measured is not proportional to REU or another precise measure of mammalian promoter activity, but it can serve as a rough indicator of promoter induction. The result (Fig. 4) shows that most clones appear not to be induced by NF-κB, whereas others are induced at varying strengths. Considering the sequence analysis of some randomly selected clones (Fig. 5), this result is not intuitive, as most sequences contain an NF-κB binding site, but it demonstrates that simply cloning repeats of a transcription factor binding site in front of a core promoter will not necessarily work.

We picked clone 31 for further characterization in REU.
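The comparison of induced and uninduced wells in such a screen can be sketched as a simple fold-induction ranking. The plate reads below are hypothetical numbers in arbitrary fluorescence units, not our measurements, and the function names and the optional background subtraction are assumptions made for illustration only.

```python
from statistics import mean


def fold_induction(induced, uninduced, background=0.0):
    """Ratio of mean induced to mean uninduced signal after background subtraction."""
    baseline = mean(uninduced) - background
    if baseline <= 0:
        raise ValueError("uninduced signal does not exceed background")
    return (mean(induced) - background) / baseline


def rank_clones(reads, background=0.0):
    """Sort clones by fold induction, strongest first."""
    folds = {
        name: fold_induction(induced, uninduced, background)
        for name, (induced, uninduced) in reads.items()
    }
    return sorted(folds.items(), key=lambda item: item[1], reverse=True)


# Hypothetical plate reads (arbitrary fluorescence units), NOT measured data.
# Each entry: (induced replicates, uninduced replicates).
example_reads = {
    "clone A": ([300, 320, 310], [100, 110, 90]),
    "clone B": ([105, 95, 100], [100, 98, 102]),
}

if __name__ == "__main__":
    for name, fold in rank_clones(example_reads):
        print(f"{name}: {fold:.2f}-fold")
```

Because the plate reader signal is only a rough indicator, a ranking like this is best used to shortlist clones for a proper REU measurement rather than as a quantitative result.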

Figure 5: Sequence analysis of putative NF-κB responsive promoters
Box 1: RA-PCR allows for the synthesis of promoters responsive to imprecisely described transcription factors. Considering the graphical representation of NF-κB's frequency matrix shown above (source: JASPAR, http://jaspar.cgb.ki.se/), the oligo can be designed to represent this matrix instead of a static NF-κB binding site. A sensible representation of this matrix would be GGGRHTTYCC (for the IUPAC nucleotide code, refer to bioinformatics.org, http://www.bioinformatics.org/sms/iupac.html). Most oligonucleotide manufacturers provide the option to synthesize such mixtures of individual oligos at no additional cost. As our method is PCR-based (unlike other methods such as [5] and [6]), we are able to synthesize even promoters responsive to poorly described transcription factors.
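To make the degenerate notation in Box 1 concrete, the following sketch expands an IUPAC-coded site into the individual sequences that a mixed-base synthesis would contain. The mapping is the standard IUPAC nucleotide code; the function name `expand_degenerate` is ours and purely illustrative.

```python
from itertools import product

# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def expand_degenerate(site):
    """Return every concrete sequence encoded by a degenerate IUPAC site."""
    choices = [IUPAC[base] for base in site.upper()]
    return ["".join(combo) for combo in product(*choices)]


variants = expand_degenerate("GGGRHTTYCC")

if __name__ == "__main__":
    # R (2 options) x H (3 options) x Y (2 options) = 12 sequences
    print(len(variants))
    for seq in variants:
        print(seq)
```

For GGGRHTTYCC the pool therefore contains 12 distinct binding-site variants, which RA-PCR then distributes randomly over the library.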

Characterization of a NF-κB responsive promoter

Hannah and Corinna, I need your data. Possibly also ibidi cotransfection, video, etc.

RA-PCR can generate promoters responsive to a variety of pathways

We performed RA-PCR to construct promoters putatively responsive to transcription factors as diverse as p53 (DNA damage sensor), pPARγ (metabolism & diabetes), SREBP (sterol nutrition), HIF (hypoxia) and estrogen receptor. While screening these promoters, we found the following:

Figure 6: pPARγ responsive promoters induced by Thiazolidinedione in U2OS cells.
  • For pPARγ, we identified two clones by screening which appear to be responsive to the anti-diabetic drug Thiazolidinedione (http://en.wikipedia.org/wiki/Thiazolidinedione). We roughly characterized these promoters by a triple TECAN read relative to JeT (Fig. 6).
  • For p53, induction of the pathway by the topoisomerase inhibitor Camptothecin (http://en.wikipedia.org/wiki/Camptothecin), an anti-cancer drug, turned out to be difficult, as it severely harms the cells and makes promoter induction levels difficult to assess. We therefore attempted to normalize screening conditions to the number of living cells by Hoechst staining. We found that some promoters appeared to be strongly downregulated by Camptothecin and therefore experimented with a variety of conditions inducing p53 via different pathways, at different phosphorylation sites, but were unable to obtain a conclusive picture.
  • For HIF, we failed to induce the conditions sufficiently to achieve promoter activation. We discuss below how screening can be improved.
  • For SREBP and estrogen receptor, we encountered technical problems during promoter synthesis (probably a damaged HindIII enzyme) and therefore were unable to produce enough clones for a sufficient screening. For SREBP, we therefore cloned two natural, SREBP-upregulated promoters we had at hand and submitted them to the registry (where a characterization can be found).

HEARTBEAT, a model describing promoter structure

Main article: HEARTBEAT

Figure 7: Probability density function for the distribution of VDR binding sites along an ideal promoter as modelled by HEARTBEAT
Based on the assumption that transcription factors (TFs) have a spatial preference for binding to natural promoter sequences with respect to the distance to the transcriptional start site (TSS) [14], we developed HEARTBEAT (Heidelberg Artificial Transcription Factor Binding Site Engineering and Assembly Tool). In a first step, 4395 human promoter sequences covering 1000 bp upstream of the TSS, obtained from the UCSC genome browser, were analysed by the program "Promotersweep" [15]. Promotersweep is able to assign transcription factor binding sites (TFBS) to a given sequence by retrieving and combining information from three homology databases (EnsEMBL Compara, NCBI HomoloGene, DoOP database), five promoter databases (EPD, DBTSS), six sequence motif identification tools (e.g. Meme, Gibbs Motif Sampler) and two matrix profile databases (Jaspar Core Library, Transfac Professional Library). Each TFBS motif is further classified as weak, conserved or reliable according to the quality of the assignment. The final result of Promotersweep can be divided into general spatial information about the TFBS and the consensus sequence on the one hand, and further detailed facts about the associated gene on the other.
Figure 8: Frequency of other transcription factors occurring together with VDR. 680 transcription factors were examined, of which the displayed 340 show coincidence at least once.
In Figure 7, the spatial distribution of VDR (Vitamin D receptor) binding sites within 140 natural promoter sequences is shown as an example. The height of each bin equals the number of VDR TFBS within a range of 20 bp. The solid line represents the probability density function (pdf). Here, the maximum of the pdf is located 54 bp upstream of the TSS, as indicated by the vertical line. Natural promoter sequences usually exhibit multiple TFBS, which implies dependencies between different TFs in their binding behaviour to the DNA. Figure 8 shows the frequency distribution of TFBS that co-occur with VDR. The highest peak represents VDR itself. The next three highest peaks are Kid3 (inhibitory), WT1 and AP-2 (stimulatory). In total, over 300 different TFBS are present together with VDR. Both plots represent data deduced from the HEARTBEAT database, which enables a well-defined synthesis of promoter sequences.
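The binning and density estimate behind a plot like Figure 7 can be sketched as follows. How HEARTBEAT actually computes its probability density function is not described here, so the Gaussian kernel, the bandwidth and all function names are assumptions, and the positions are hypothetical values rather than data from the database.

```python
import math
from collections import Counter


def histogram(positions, bin_width=20):
    """Count binding sites per bin; keys are lower bin edges (bp upstream of the TSS)."""
    counts = Counter((p // bin_width) * bin_width for p in positions)
    return dict(sorted(counts.items()))


def gaussian_kde(positions, bandwidth):
    """Return a pdf built from one Gaussian kernel per observed position."""
    norm = 1.0 / (len(positions) * bandwidth * math.sqrt(2 * math.pi))

    def pdf(x):
        return norm * sum(
            math.exp(-0.5 * ((x - p) / bandwidth) ** 2) for p in positions
        )

    return pdf


def pdf_maximum(pdf, start=0, stop=1000):
    """Position (integer bp) at which the pdf is largest within [start, stop]."""
    return max(range(start, stop + 1), key=pdf)


# Hypothetical TFBS positions (bp upstream of the TSS), NOT HEARTBEAT data.
positions = [12, 35, 48, 50, 52, 55, 57, 60, 61, 75, 130, 410, 780]

if __name__ == "__main__":
    print(histogram(positions))
    density = gaussian_kde(positions, bandwidth=15)
    print("pdf maximum at", pdf_maximum(density), "bp upstream of the TSS")
```

The bandwidth controls how strongly isolated sites are smoothed into the main peak; it would need to be chosen with the real data in hand.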

An in vivo test of predicted promoter sequences

ongoing work

Discussion

The results shown above demonstrate the potential of RA-PCR towards the synthesis of any promoter. Even by analyzing a modest number of clones for each individual pathway, we were able to obtain promoters of a wide variety of strength and inducibility. Also, we were able to obtain constitutive promoters of greater strength than JeT, which has not been possible before [8].

Many insights into promoter regulation are possible by analyzing different promoters created by RA-PCR. For example, clone 3 and clone 11 (see Figures 4 and 5) differ only in the positioning of the single response element (RE), but still, induction strength differs threefold. This gives hints about NF-κB's binding preference. A systematic study of promoters generated by RA-PCR and their strength could therefore be used to develop a comprehensive model of transcriptional regulation. Nao, can your model be linked here and described elegantly in one or two sentences?

Improving RA-PCR

Screening conditions and induction strength:

As noted above, we experienced difficulties inducing some of the pathways (namely, HIF and p53). From our cell culture work, we learned that finding the ideal timepoint of induction for a certain pathway and the ideal conditions is very difficult even with literature at hand. Also, one would expect a much higher induction than the one observed for the NF-κB responsive clone we describe. Our induction levels might be low because NF-κB has a high constant activity (link to NF-κB review here), especially if the cells encounter rough cell culture conditions. Therefore, we suggest that for future screening, a library of siRNAs for the transcription factors of interest should be compiled. Also, a library of transcription factors mutated to be constantly active is required. With these libraries at hand, individual transcription factors can be knocked down, and activated specifically at 100% efficiency. This will greatly facilitate screening and parts characterization.

Generation of down-regulated promoters

As shown, we were able to generate a set of promoters upregulated by certain factors. For several applications, promoters of a high constant strength, which become down-regulated by a signal, are required. We think it might be possible to construct such promoters by performing RA-PCR with oligos containing weak binding sites for generally activating transcription factors (that is, binding sites which deviate from the consensus sequence), and adding some oligos containing very strong binding sites for the transcription factor of interest (say, NF-κB). If this factor is not active, the general activators will be able to bind to the DNA and activate transcription. Upon factor activation, the general activators will be displaced. If the binding site is then in a position where it does not initiate transcription (as for some of the clones (32 etc.) shown in Figures 4 and 5), the promoter will be downregulated instead of upregulated. This hypothesis remains to be tested.

M-RA-PCR, a model-guided biochemical method for synthesis of complex promoters

Figure 9: RA-PCR can be modified to reflect probability density curves in vitro
RA-PCR can be modified to reflect modeled probability density curves. If a promoter regulated by multiple pathways, for example VDR (Vitamin D receptor) and SREBP (Sterol regulatory element binding protein), is to be constructed, considering the density curves obtained from the model (Figure 9) can give clues about its construction. A working VDR/SREBP promoter requires VDR and/or SREBP Response Elements (REs) in the close vicinity of the TSS (at approx. 850). It might require SREBP REs between 300 and 700, and VDR REs between 0 and 300. This distribution can be reflected by setting up 3 RA-PCRs with varying concentrations of VDR-responsive, SREBP-responsive and spacer oligos (compare figure B2.1). If a 3' stop oligo containing a NheI cutsite and a 5' stop oligo containing a SpeI cutsite (or any combination of cutsites yielding compatible ends) are used, an arbitrary number of RA-PCR products can be assembled and cloned in front of a core promoter (having a SpeI cutsite 5').

We believe that this technique, termed Model-guided Random Assembly PCR, or M-RA-PCR, is the way forward to constructing the promoters of complex regulation described in the Introduction.
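Why NheI and SpeI ends can be joined, and why the joined site is not cut again by either enzyme, can be checked with a few lines of code. The recognition sites and cut positions below are the published ones for these enzymes; the helper names and the restriction to palindromic sites with 5' overhangs are simplifying assumptions of this sketch.

```python
# Recognition site and top-strand cut position (number of bases before the cut).
# All listed enzymes cut palindromic sites and leave a 4-nt 5' overhang.
ENZYMES = {
    "SpeI": ("ACTAGT", 1),     # A^CTAGT
    "NheI": ("GCTAGC", 1),     # G^CTAGC
    "XbaI": ("TCTAGA", 1),     # T^CTAGA
    "AvrII": ("CCTAGG", 1),    # C^CTAGG
    "HindIII": ("AAGCTT", 1),  # A^AGCTT
}


def overhang(enzyme):
    """Single-stranded 5' overhang left by a symmetric cut in a palindromic site."""
    site, cut = ENZYMES[enzyme]
    return site[cut:len(site) - cut]


def compatible(enzyme_a, enzyme_b):
    """True if ends produced by the two enzymes can anneal and be ligated."""
    return overhang(enzyme_a) == overhang(enzyme_b)


def ligation_junction(upstream_enzyme, downstream_enzyme):
    """Sequence formed when an upstream fragment end is ligated to a downstream one."""
    up_site, up_cut = ENZYMES[upstream_enzyme]
    down_site, down_cut = ENZYMES[downstream_enzyme]
    return up_site[:up_cut] + overhang(upstream_enzyme) + down_site[len(down_site) - down_cut:]


if __name__ == "__main__":
    print("NheI/SpeI compatible:", compatible("NheI", "SpeI"))
    scar = ligation_junction("NheI", "SpeI")
    recut = any(scar == site for site, _ in ENZYMES.values())
    print("junction:", scar, "| recognised by a listed enzyme:", recut)
```

The NheI/SpeI junction reads GCTAGT, which matches neither recognition site, so an assembled block keeps a single SpeI site at its 5' end and can take another block in the next round.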

Final remarks

We have developed two independent methods for the generation of truly synthetic promoters for use in mammalian cells and discussed possibilities for their combination and improvement. We are looking forward to continuing this work and generating promoters which can be used in medical or biotechnological applications, such as transcriptional targeting in virotherapy or a reporter cell line.

Methods

RA-PCR protocol

  • All oligos we used can be found in Material and Methods.
  • Obtain density curves for the distribution of your TF of interest from our model. If this density curve shows a decisive peak at a distance >250 bp from the Transcriptional Start Site (TSS), continue with Box 2 (M-RA-PCR). If a peak is present close to the TSS, or if data is insufficient, continue here.
    How our model was developed is detailed on the model page.
  • Check our model for transcription factors coinciding with your transcription factor of interest.
  • Design two annealing sites, each 15-18 base pairs long. Annealing sites should be void of transcription factor binding sites. Calculate the reverse complement of both sequences. We used the following sequences:
                                 Forward (F)        Reverse Complement (RC)
    Annealing Sequence 1 (AS1)   GGGTGACGGGTTCA     AGTGAACCCGTCACCC
    Annealing Sequence 2 (AS2)   GCGATCGGCAGATCA    TGATCTGCCGATCGC
  • Design a 5' stop oligo containing a cutsite (SpeI) and AS1_F.
  • Design a 3' stop oligo containing a cutsite (HindIII) and AS1_RC.
  • Design forward and reverse oligos for each transcription factor of interest. Forward oligos contain AS2_F, the transcription factor binding site and AS1_F. Reverse oligos contain AS2_RC, the TFBS and AS1_RC. The TFBS should be designed to represent the matrix describing the factor's binding preferences (Box 1).
  • Design forward and/or reverse oligos for coinciding transcription factors identified in step 2 in the same way as described in step 6.
  • Design forward and/or reverse oligos for general activators.
  • Design forward and reverse spacer oligos, which contain 10-15*N (random nucleotide) instead of a TFBS.
  • Order oligos at 100µM. Pool the oligos. As a general rule, use 0.8µL each of Stop 5' and Stop 3'; ~4µL of the transcription factor (forward), ~4µL of the transcription factor (reverse), 1-2µL each of the forward and reverse spacer oligo, ~1µL of coinciding transcription factors and a total of 0.5µL of general activators. For the examples shown below, we used the following mixtures of oligos:
    • p53: 6µL p53 (O.91); 5µL p53 reverse (O.188); 1µL random (O.56); 1µL Activator Mix; 0.8µL Stop 5 new (O.187); 0.8µL Stop 3 (O.58)
    • NFkB II: 3µL NFkB-1 (O.93); 3µL NFkB-2 (O.94); 4µL NFkB-rev (O.194); 3µL Random (O.56); 2µL Activator Mix; 0.8µL Stop 5 new (O.187); 0.8µL Stop 3 (O.58)
    • HIF: 2.5µL HIF-1 (O.53); 2.5µL HIF-2 (O.54); 1µL CREB (O.89); 3µL HIF-rev (O.189); 3µL Random (O.56); 1µL Stop 5 new (O.187); 1µL Stop 3 (O.58); .2µL each Ap1, Sp1 (O.55, O.57)
    • Activator Mix: 2µL each Ap1, Sp1, CREB (O.55, O.57, O.89); 1µL each NFY, Empty (O.90, O.95); water to 30µL
    • SREBP: 5µL SREBP (O.208); 4µL SREBP reverse (O.209); 1µL Sp1 (O.57); 2µL random (O.56); 1µL Activator Mix; 0.8µL Stop 5 new (O.187); 0.8µL Stop 3 (O.58)
    • AHR: 5µL AHR (O.212); 4µL AHR reverse (O.213); 2µL random (O.56); 1.5µL Activator Mix; 0.8µL Stop 5 new (O.187); 0.8µL Stop 3 (O.58)
    • pPARγ: 5µL pPARγ (O.210); 4µL pPARγ reverse (O.211); 2µL random (O.56); 1.5µL Activator Mix; 0.8µL Stop 5 new (O.187); 0.8µL Stop 3 (O.58)
    • Estrogen receptor: 5µL Estrogen receptor (O.210); 4µL Estrogen receptor reverse (O.211); 2µL random (O.56); 1.5µL Activator Mix; 0.8µL Stop 5 new (O.187); 0.8µL Stop 3 (O.58)

  • Introduce the pooled oligos into a PCR reaction at a final dilution of 1:200-1:500. We used Phusion MasterMix 2x (Finnzymes) as PCR reagent. Set up this PCR reaction twice in order to achieve greater heterogeneity.
  • Run the PCR, 7-10 cycles, with the following setup:
    • 1 cycle initial denaturing: 5 minutes 95°C
    • 7-10 cycles assembly: 30 seconds 95°C, 45 seconds 58°C, 45 seconds 72°C
    • Terminal hold: 4°C, forever
  • Remove oligonucleotides by performing a PCR purification using a PCR purification kit (QIAGEN) or a gel extraction using a Gel extraction kit (QIAGEN).
  • Add PCR reagent (Phusion MasterMix 2x) again. Add 5' stop oligo and 3' stop oligo, 25pmol each (1µL of 1:4 diluted stock).
  • Run the PCR, 25 cycles, with the following setup:
    • 1 cycle initial denaturing: 5 minutes 95°C
    • 25 cycles amplification: 30 seconds 95°C, 45 seconds 68°C, 60 seconds 72°C
    • Terminal hold: 4°C, forever
  • Gel purify PCR products to exclude everything <200 bp. Use a 1% agarose gel at 50V for at least 2h to achieve a good resolution.
  • Digest with HindIII and SpeI (or whatever cutsites were included in step 4 and 5). Digest a reporter plasmid containing a core promoter and a reporter gene with the same enzymes. We used the plasmids (containing GFP as a reporter) for this task. Make sure to perform a thorough digest; in addition, treat the plasmid with shrimp alkaline phosphatase or calf intestine phosphatase afterwards. Gel purify the plasmid backbone and PCR purify the digested PCR products.
  • Ligate. Perform a thorough ligation to increase transformation efficiency. We used Fermentas T4 DNA Ligase for 5h at 19°C or overnight at 16°C.
  • Transform into competent E. coli cells and plate out. Pick no more than 20 colonies per individual PCR reaction. If more putative promoters are desired, set up several PCR reactions.
  • Isolate plasmid DNA from the selected colonies. We used a QIAGEN Miniprep kit for this task.
  • Recommended step: Test-digest miniprep DNA with the same enzymes used in step 17 to make sure you get plasmids with synthesized promoters of varying length. The length of the inserts (that is, synthetic promoters) should be between 100 and 600 base pairs. If this is not the case, vary stop oligo concentration in step 10, improve gel purification setup in step 16 or alter PCR conditions in step 12 and 15.
  • Perform a screening to select functioning clones. For example, transfect clones in triplicate into eukaryotic cells on a 96-well plate by using transfection agents such as EFFECTENE or Lipofectamine. Then, induce the conditions of interest in one replicate, shut them off in a second replicate, and leave control medium on the third replicate. When the pathway is fully active, read fluorescence (or luminescence, if a luciferase reporter is used) with a plate reader (TECAN) or other automated methods. We used the following conditions for promoter screening:

to be added
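The oligo design and stock arithmetic in the protocol above can be sketched in a few helper functions. This is an illustration under assumptions, not the exact design we ordered: the parts are concatenated 5' to 3' in the order the protocol lists them, and all function names are hypothetical. Note also that AS1 forward as printed in the table is two bases shorter than the reverse complement of the listed AS1_RC; the sketch derives AS1_F from AS1_RC, which is an assumption about which of the two entries is complete.

```python
# Complement table including IUPAC ambiguity codes (R<->Y, K<->M, B<->V, D<->H).
COMPLEMENT = str.maketrans("ACGTRYKMSWBDHVN", "TGCAYRMKSWVHDBN")


def reverse_complement(seq):
    """Reverse complement of a DNA sequence; IUPAC ambiguity codes are supported."""
    return seq.upper().translate(COMPLEMENT)[::-1]


# Annealing sequences from the protocol table.
AS1_RC = "AGTGAACCCGTCACCC"
AS2_F = "GCGATCGGCAGATCA"
AS2_RC = "TGATCTGCCGATCGC"
# Assumption: AS1_F is taken as the exact reverse complement of the listed AS1_RC.
AS1_F = reverse_complement(AS1_RC)


def forward_oligo(tfbs):
    """Forward oligo: AS2_F, binding site, AS1_F (assumed 5'->3' order)."""
    return AS2_F + tfbs + AS1_F


def reverse_oligo(tfbs):
    """Reverse oligo: AS2_RC, binding site, AS1_RC (assumed 5'->3' order)."""
    return AS2_RC + tfbs + AS1_RC


def spacer_oligo(n_random=12):
    """Forward spacer oligo with a run of N (random nucleotide) instead of a TFBS."""
    return forward_oligo("N" * n_random)


def pmol_in_volume(stock_uM, dilution_factor, volume_uL):
    """Amount of oligo in pmol; 1 µM corresponds to 1 pmol per µL."""
    return stock_uM / dilution_factor * volume_uL


if __name__ == "__main__":
    print("AS2 pair consistent:", reverse_complement(AS2_F) == AS2_RC)
    print("NF-kB forward oligo:", forward_oligo("GGGRHTTYCC"))
    print("Stop oligo per reaction:", pmol_in_volume(100, 4, 1), "pmol")
```

The last line reproduces the protocol's figure for the amplification step: 1µL of a 1:4 dilution of a 100µM stock contains 25pmol of oligo.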

References

[1] Alberts, B. et al. Molecular Biology of the Cell (5th edition). New York: Garland Science, p. 432-453.

[2] Edelman, G.M. et al. Synthetic promoter elements obtained by nucleotide sequence variation and selection for activity. PNAS 97, 3038-3043 (2000).

[3] Ellis, T. et al. Diversity-based, model-guided construction of synthetic gene networks with predicted functions. Nature Biotechnology 27, 465-471 (2009).

[4] Venter, M. Synthetic promoters: genetic control through cis engineering. Trends in Plant Science 12, 118-124 (2007). (and the references cited therein)

[5] Rushton, P.J. et al. Synthetic plant promoters containing defined regulatory elements provide novel insights into pathogen- and wound-induced signalling. Plant Cell 14, 749–762 (2002).

[6] Ogawa, R. Construction of strong mammalian promoters by random cis-acting element elongation. Biotechniques 42, 628-632 (2007).

[7] Stemmer, W.P.C. et al. Single-step assembly of a gene and entire plasmid from large numbers of oligodeoxyribonucleotides. Gene 164, 49-53 (1995).

[8] Tornoe, J. Generation of a synthetic mammalian promoter library by modification of sequences spacing transcription factor binding sites. Gene 297, 21-32 (2002).

[9] Heintzman, N.D., Ren, B. The gateway to transcription: identifying, characterizing and understanding promoters in the eukaryotic genome. Cellular and Molecular Life Sciences 64, 386-400 (2007).

[10] Fussenegger, M., Weber, W. Engineering of Synthetic Mammalian Gene Networks. Chemistry and Biology 16, 287-297 (2009).

[11] Gossen, M., Bujard, H. Tight control of gene expression in mammalian cells by tetracycline-responsive promoters. PNAS 89, 5547-5551 (1992).

[12] Dorer, D.E., Nettelbeck, D. Targeting cancer by transcriptional control in cancer gene therapy and viral oncolysis. Advanced Drug Delivery Reviews 61, 554-557 (2009).

[13] Rattner, A. NF-kappa B activates the HIV promoter in neurons. EMBO J. 12, 4261–4267 (1993).

[14] Yokoyama KD et al. Measuring spatial preferences at fine-scale resolution identifies known and novel cis-regulatory element candidates and functional motif-pair relationships. Nuc Acids Res, 1-21 (2009).

[15] del Val C. et al. PromoterSweep: a tool for identification of transcription factor binding site. Theor Chem Acc (in press)