Team:Heidelberg/Eucaryopedia

{|
 * -valign="top" border="0" style="margin-left: 2px;"
 * width="650px" style="padding: 0 15px 15px 20px; background-color:#ede8e2"|

= Eukaryopedia =

As most synthetic biologists and iGEM teams work with Escherichia Coli, the use of other model systems can create confusion. We hope to ease the legibility of our project descriptions by creating eukaryopedia, an overview about transcription factors and cell lines we used in our studies, as well as general molecular biology issues that affect our work. We hope it can help you find guidance in the jungle that mammalian molecular biology is at the moment.

Contents
Cell lines

HeLa - MCF-7 - U2-OS

Transcription factors

AP-1 - AP-2 - CREB - HIF-1 - NFAT - NF-&kappa;B - pPAR&gamma; - p53 - RAR - Sp1 - SREBP - Vitamin D receptor - WT1 - ZF5 - Kid3

Proteins

Apo A IV - CYP1A1 - EGF - HMG CoA synthase - Hsp70 - LDL receptor - PUMA - TNF-alpha

Molecular and Cellular Biology

Post-transcriptional modification / mRNA processing in eukaryotes - Regulation of transcription in eukaryotic organisms

Drugs

Camptothecin

Cellular components as tools

GPI - Sar-1 - Myrpalm - NLS - GFP

MCF-7
MCF-7 is a hormone-dependent, poorly invasive human breast cancer cell line [1]. Originally, the cell line was derived from a postmenopausal woman with metastatic breast cancer at the Michigan Cancer Foundation. It was observed, however, that cell lines used in different laboratories vary greatly in their biological characteristics, so that it is suggested that they were derived from different patients [2]. MCF-7 cells are estrogen-receptor positive and require estrogen for tumorigenesis in vivo. 17β-estradiol induces an TGFα-like activity [3], which promotes tumor growth and progression [4]. Furthermore the cells express receptors for and respond to several other hormones including androgen, progesterone, glucocorticoids, insulin, epidermal growth factor, insulin-like growth factor, prolactin and thyroid hormone [2].

HeLa
The cells were originally derived in 1952 from Henrietta Lacks, who suffered from an adenocarcinoma of the cervix. The HeLa cells were the first human epithelial cells established in long-term culture [5]. There are three main characteristics of the genome of HeLa by which they can be recognized: hypertriploid chromosome number (3n+), 20 clonally abnormal chromosomes and the integration of multiple copies of HPV18 (Human Papilloma Virus) at various sites [6]. It has been shown, that the HeLa genome has been remarkably stable after years of subcultivation [6], but it is also possible to select strains of HeLa cells with certain properties by putting them under selection pressure [7].

U2-OS
U2-OS, formerly known as 2T cell line [8], were derived from a 15-year-old girl with a moderately differentiated osteogenic sarcoma of the shinbone. Cell culture of U2-OS started at the time of amputation of the left leg on September 3, 1964 [9]. U2-OS cells express adhesion molecules such as integrins, Ig-CAMs and chemokine receptors as well as growth factors which are either constitutively expressed (such as IL-7) or inducible (such as TNF) by PMA (phorbol ester) or ionomycine. The adhesion molecules and growth factors support the growth of CD34 progenitor cells [10].

NF-&kappa;B
NF-κB (nuclear factor kappa-light-chain-enhancer of activated B cells) is a transcription factor (TF) which regulates many different target genes resulting in the expression of various proteins. In most cell types (with the exception of B cells and Dentritic cells) NF-κB is bound to the Inhibitor of κB (IκB) withholding NF-κB from entering the nucleus. When the cell becomes activated by an extra cellular stimluli, IκB is degraded and NF-κB can enter the nucleus. Within the nucleus NF-κB is able to enhance transcription of genes which are involved in immune response, cell proliferation or cell survival, depending on cell type and extra cellular stimuli [11]. In many cells NF-κB regulates anti-apoptotic proteins (e.g. TRAF1/2) and therby preventing cell death. Therefore mutations of NF-κB resulting in a constitutively active form are often associated with unregulated cell proliferation and cancer [12]. In macrophages the NF-κB signalling pathway could be activated by binding of bacterial lipopolysacchride (LPS). There NF-κB activation leads to secretion of cytokines which influence other lymphocytes.

p53
P53 is a transcription factor (TF) which is involved in several physiological processes. One major function of P53 is cell cycle regulation. P53 is often activated through DNA damage or other cellular stresses like cell cycle abnormalities, hypoxia and oxidative stress. In normal cells the P53 level is kept low by the protein HDM2, which attaches ubiqutin to P53 (acts as ubiquitin ligase). The ubiquitylation of P53 leads to its degradation by the proteasome. In response to cellular stress P53 is phosphorylated and changes its conformation in a way preventing HDM2(mdm2 is the analog protein in mouse) binding. Conformational changes also result in the exposion of the DNA binding domain. This activation of P53 leads to farreaching alteration of gene expression. The cell cycle is stopped between G1 and S phase and DNA repair systems are switched on. If the cell damage is intense P53 accumaltion can also lead to apoptosis of the cell. Because of its roles in DNA protection and cell cycle regulation P53 mutation is often correlated with cancer [13].

HIF-1
HIF-1 (hypoxia inducible factor-1) is a transcription factor (TF) which is exclusivly active during hypoxia (low oxygen level). HIF-1 is a heterodimer consisting of a α- and a β-subunit. During normal oxygen conditions the α-subunit is hydroxylated by the HIF prolyl-hydroxylase. The hydroxylated α-subunit is a target for an ubiquitin ligase. Ubiquitylation of the α-subunit leads to its degradation by the proteasome. During hypoxia the degradation of the α-subunit does not occur, since the HIF prolyl-hydroxylase uses oxygen as a cosubstrate. The active TF enhances expression of different genes as for instance genes associated with blood vessel formation.

pPAR&gamma;
Peroxisome proliferator-activated receptor γ (PPAR γ) is a transcription factor belonging to the family of nuclear receptors. PPAR γ plays an important role in glucose metabolism and fatty acid storage. PPAR γ is basically activated by ligands like the prostaglandin PGJ2 and through dimerization with retinoid X receptor (RXR)[14]. The activated heterodimer binds to the DNA consensus sequence AGGTCANAGGTCA resulting in an increased or decreased transcription of the appropriate gene. The genes activated by PPAR γ initiate the uptake of fatty acids and differentiation of cells to adipocytes. Besides its function in metabolism, PPAR γ was also shown to be correlated with several diseases such as cancer and diabetes. Activation of PPAR γ by synthetic PPAR γ ligands result in an increased glucose uptake. These syntethtic ligands are therefore promising agents in diabetes II treatment [15]. Another synthetic ligand of PPAR γ is able to inhibit the proliferation of different cancer types [16].

SREBP
Sterol regulatory element-binding protein (SREBP) is a transcription factor involved in the regulation of sterol metabolism. In cells with high concentration of cholestrol SREBP is present in an inactive form anchored to the endoplasmatic reticulum or the nuclear envelop. If the cholesterol concentration decreases SREBP is cleaved by the proteases site-1 protease and site-2 protease resulting in a release of the aminoterminal domain of SREBP. Two additional proteins (Scap and Insig) are needed to regulate this process in a way that the cleavage occurs exclusively during lack of sterol[17]. The aminoterminal domain of SREBP is translocated into the nucleus and binds to the DNA consensus sequence TCACNCCAC. The binding causes an up regulation of the genes needed for cholesterol synthesis.

Sp1
Specificity protein 1 (Sp1) is a transcription factor which belongs to the zinc-finger protein family. It binds to promoter elements containing a central CpG motive with the following consensus sequence; 5'-G/TGGGCGGG/AG/AC/T-3' [18]. Sp1 is involved in chromatin-remodelling processes [19] as well as in derecruiting repressor proteins from the promoter [20]. For these and for other reasons Sp1 is often considered as a universal transcription supporting protein. Sp1 was shown to regulate various genes responsible for cellular processes like apoptosis, cell growth and differentiation and immune response [21]. It interacts with several well known proteins, such as c-myc, c-Jun and Stat1 [21].

AP-1
Activating Protein1 (AP-1) is a transcription factor which is activated by several pathways and extracellular stimuli like UV radiation, growth factors and bacterial and viral infections. AP-1 is a heterodimer consisting of one member of the Fos and one member of the Jun family. AP-1 composition and activation is mainly controlled by MAP kinase cascades by up regulating the expression of both Fos and Jun proteins. Besides the heterodimerisation process, phosphorylation of the complex is needed to achieve an efficient transcription of the target genes [22]. Activated AP-1 binds to DNA sequences with the consensus sequence 5'-TGAG/CTCA-3' [23]. The genes regulated by AP-1 are involved in cellular processes like apoptosis, cell differentiation, cell proliferation and oncogenic transformation[22].

AP-2
Activating protein 2 (AP-2) is a family of transcription factors being closely related (AP-2alpha, -beta and –gamma). AP-2 proteins are activated through homo or heterodimerisation and bind to GC-rich motives in their target genes [24]. The genes both positively and negatively regulated by AP-2 are manifold. These genes are mainly involved in developmental processes. Mutation of AP-2-beta can lead to the Char syndrome [25]. AP-2 transcription factors regulate also genes playing an important role in cell proliferation and apoptosis [24]. In this context AP-2alpha is thought to act in a tumor suppressive manner in breast tissues [24].

CREB
CAMP responsive element binding protein (CREB) is a transcription factor which binds to cAMP response elements (consensus sequence 5'- TGACGTCA -3' [26]) occurring in many promoter sequences. CREB is activated by MAP kinase cascade but also through the cAMP signalling pathway. Homodimerisation leads finally to an activated complex and binding to DNA occurs via a leucine zipper domain (s. Picture). Many genes are regulated by CREB including the neurotrophin Brain-derived neurotrophic factor, c-fos and some neuropeptides. CREB is thought to be involved in processes like long-term memory [27] and drug addiction [28]. CREB plays also an important role in cell survival [29].

NF-Y
Nuclear factor Y (NF-Y), also called core binding factor (CBF) is a transcription factor which binds to the consensus sequence CCAAT [31] occurring in about 25% of eukaryotic genes [30]. NF-Y is involved in the transcription regulation of several genes including HSP70, albumin, FGF-4, α-collagen, β-actin and several others [31]. NF-Y is a heterotrimeric complex and is evolutionary extremely conserved. It was also shown that NF-Y plays a major role in cellular senescence [30].

Vitamin D receptor
Vitamin D receptor (VDR), also known as calcitriol receptor, is a transcription factor belonging to the family of steroid receptors and to the super family of nuclear receptors. The VDR has a very high affinity towards calcitriol (1α,25(OH)2-cholecalciferol) which is the prohormone of vitamin D3. Binding of calcitriol results in a heterodimerisation of VDR with retinoid-X receptor, the complex is transferred into the nucleus and binds to several promoters (vitamin D response elements) thereby increasing or decreasing the transcription of the appropriate genes. The genes modulated by the VDR-calcitriol complex are involved in activating the immune system, bone formation and protection of cancer [32]. Many studies for example indicate a relationship between vitamin D signalling and reduced breast cancer occurance [33]. Furthermore the transcription of the VDR gene itself is positively regulated by the presence of calcitriol.

ZF5
Zink finger protein 5 (ZF5) is transcription factor playing an important role as a transcriptional repressor. ZF5 is composed of 5 zink finger motives enabling the multimerisation process and the binding of ZF5 to GC- rich DNA elements [34]. Both the HSV thymidine kinase (TK) promoter [35] and the c-myc promoter [34] are targets of the ZF5 transcription factor. The appropriate genes are repressed as a result of the binding process. However, ZF5 is not only a transcription repressor, binding to human immunodeficiency virus (HIV) promoter leads to an enhanced transcription of the appropriate gene.

WT1
Wilms' tumor protein 1 (WT1) is a transcription factor containing three zink finger motives. The WT1 transcription factor regulates genes being involved in developmental processes (for example development of the urogenital system, kidney, blood vessel formation and heart [36]) and cell survival [37]. The many isoforms of WT1 in different tissues are the reason for the multitude of functions. Mutation of the appropriate gene can result in the formation of Wilms' tumor (nephroblastoma).

RAR
Retinoic acid receptor (RAR) is a transcription factor belonging to the family of nuclear receptors. RAR builds a hetereodimer with Retinoid X receptor (RXR), this complex is able to bind to specific response elements. Transcriptional activation of the appropriate gene occurs when the ligands all-trans retinoic acid or 9-cis retinoic acid bind to the complex. Genes regulated by RAR are thought to be involved in developmental processes [38].

NFAT
Nuclear factor of activated T-cells (NFAT) is a transcription factor family consisting of 5 members (NFATc1, NFATc2, NFATc3, NFATc4, and NFAT5). NFATc1 and NFATc4 are sensitive to calcium signalling. A high calcium level leads to the exposure of the nuclear localization signal and the transcription factor is transported into the nucleus. NFAT proteins play important roles in developmental processes and in the immune system [39].

Kid3
Kid3 is one out of the huge amount of C2H2 zink finger proteins, that are known in eukaryotic organisms. Because of their DNA binding zink finger domain they are involved in gene expression in the role of transcription factors, especially in the early embryonal development, cell growth, diﬀerentiation and tumorigenesis. Kid3 has a Krüppel-associated box domain (KRAB) at the N-terminus, that performs a transcription repressing function, and a C-terminal C2H2 zink finger domain. The consensus binding sequence of Kid3 is 5'-CCAC(C/G)-3'[66].

LDL receptor
Low-density lipoprotein receptor (LDL receptor) is a cell surface protein which is responsible for the cholesterol supply of the cell. The receptor recognizes the protein B100 which is part of the LDL particles. Binding of the B100 protein to the LDL receptor leads to endocytosis via clathrin coated pits. The vesicle fuses with an endosome, the resulting shift in the pH value leads to the detachment of the LDL and the receptor can be transported back to the plasma membrane (receptor recycling). Through this process the cell takes up the cholesterol which is associated with LDL. LDL accumulation in the blood is responsible for atherosclerosis and many cardiovascular diseases. The LDL gene is regulated by the intracellular level of cholesterol. A low level of cholesterol leads to the activation whereas a high level results in a decrease of transcription[40].

HMG CoA synthase
3-hydroxy-3-methylglutaryl-CoA synthase (HMG CoA synthase) is an enzyme catalyzing the condensation of Acetyl-CoA and acetoacetyl-CoA to form 3-hydroxy-3-methylglutaryl-CoA (HMG-CoA). There are two distinct HMG CoA synthases, the cytosolic and the mitochondrial form, encoded by two different genes. The reaction catalyzed by the cytosolic enzyme is a part of the biosynthesis of cholesterol. In mitochondria the same reaction is responsible for keton body formation. Sterol regulatory elements in the promoter region of the gene are responsible for transcriptional regulation [41].

PUMA
P53 upregulated modulator of apoptosis (PUMA) is protein belonging to the BH3-only family of pro-apoptotic proteins. P53 plays an important role both in p53 dependent and p53 independent apoptosis. The activation of PUMA leads to mitochondrial dysfunction and caspase activation [1]. PUMA is regulated by many transcription factors (TF), these TFs in turn are regulated by extra and intra cellular stimuli like genotoxic stress, toxins, oncogene expression, redox status and growth factors [42].

Hsp70
Heat shock protein 70 (Hsp70) is an enzyme helping other proteins to fold. Hsp70 proteins are found not only in the cytosol but also in mitochondria and in the endoplasmatic reticulum. The protein binds, together with the cochaperone Hsp40, to newly synthesised (hydrophobic) amino acid residues and prevents the aggregation of those. During cellular stress e.g. oxidative or thermal stress proteins may unfold, Hsp70 binds to the hydrophobic regions of the proteins and prevents further unfolding, aggregation and apoptosis [43].

Apo A-IV
Apolipoprotein A-IV (Apo A-IV) is a glycoprotein secreted by the small intestine in humans. The production of Apo A-IV is activated through lipid absorption (Chylomicrons). Several studies indicate that Apo A-IV protects against atherosclerosis. It is also thought to be involved in regulation of food intake [44].

CYP1A1
Cytochrome P450 1A1 (CYP1A1) is an enzyme which is regulated by the aryl hydrocarbon receptor (AhR) signalling pathway. The transcription is also influenced by metal ions and oxidative stress. CYP1A1 catalyzes two of three critical steps in transformation of benz[a]pyren to the carcinogen BP-7,8-dihydrodiol-9,10-epoxide. Furthermore CYP1A1 is involved in processes like xenobiotic metabolism and drug degradation [45].

EGF
Epidermal Growth Factor (EGF) is a 6045 Da protein discovered by Stanley Cohen in 1986, which won him a Nobel Prize in Physiology and Medicine. EGF regulates cell proliferation by binding to the epidermal growth factor receptors (EGFRs) which are located on the cell surface. Upon binding of EGF to its receptor intrinsic tyrosine kinase activity is stimulated inducing a signaling cascade inside the cell which leads to increased calcium levels, glycolyisis and protein synthesis in the cell. This process ultimately leads to the proliferation of the cell. It was recently shown that c-Jun is one of the targets of EGF action. [46][47]

TNF-alpha
Tumor Necrosis Factor-alpha (TNF-alpha) is a cytokine involved the cells inflammatory response. TNF-alpha is a homotrimer that binds to one of its to receptors (TNF-R1/ TNF-R2) which then form a trimer themselves. Trimerization of the receptor induces a conformational change and the dissociation of the inhibitory protein SODD (Silencer of Death Domain protein) from the intracellular death domain of the receptor. The adaptor protein TRADD (TNF Receptor-associated Death Domain protein) can bind now to the death domain and allow other protein factors to bind aswell. The three main signaling pathways initiated in this way are: the NF-kB pathway, the MAPK opathway and the death signaling cascade.[60]

Pifithrin-α
Pifithrin-α (PFTα) is proposed to be a specific inhibitor of p53 signaling. It is not yet clear how exactly PFTα inhibits p53, but it seems to act at a stage after p53 translocation to the nucleus. Temporary suppression in vitro of p53 inhibits apoptosis induced by the damage to DNA and thus increases the fraction of cells surviving the stress. [61][62]

Post-transcriptional modification / mRNA processing in eukaryotes
To express a gene and successfully synthesize the appropriate protein the gene must firstly transcript into mRNA. Unlike in bacteria, this mRNA molecule is not directly ready for translation; the primary transcript is therefore called precursor-mRNA (pre-mRNA). One of the first modifications is a process referred to as 5’-capping. By means of several biochemical steps a 7-methylguanosine molecule is bound to the 5’ end of the pre-mRNA, via a 5’ to 5’ triphoshpate linkage. This 5’ cap has various functions including prevention of 5’ degradation, export from the nucleus and initiation of translation. Not only the 5’ end but also the 3’ end is modified, this process is called polyadenylation. Therefore a Polyadenylation signal is needed (consensus sequence 5'- AAUAAA-3'), further in the 3’ direction occurs a 5’-CA-3’ element, these both sequences are recognized by the enzymes cleavage and polyadenylation specificity factor and cleavage stimulation factor. Together they are attracting many other proteins including Polyadenylate Polymerase (PAP). The protein complex cuts the pre-mRNA at the CA element and the PAP adds about 200 adenine residues to the 3’ end. The function of the poly-A tail is protection against degradation, marking of the end of the transcript and aid in translation initiation. The pre-mRNA contains not only these sequences coding for the protein, so called exons, but also many sequences which are non-coding. These introns have to be removed, that occurs in a process known as splicing. A protein complex called spliceosom connects all the exons thereby cutting out the introns. Responsible for the recognition of the exon-intron borders are small nuclear RNA within the spliceosom. Many genes can be spliced in several ways, an incident termed alternative splicing. [48] [49]

Regulation of transcription in eukaryotic organisms
Cells have to adapt to changes in their environment and must be able to receive and react to extra cellular signals; cells accomplish these requirements by the up and down regulation of certain proteins. The protein expression in eukaryotic cells can be regulated on many different levels, this article concentrate on the regulation of transcriptions. Only a small percentage of the human genomic DNA is transcribed into mRNA. On the opposite, a huge part of the human genome is involved in regulating the transcription of coding sequences. To initiate transcription of a gene eukaryotic RNA-polymerases have to bind to several general transcription factors to establish the so called initiation complex (IC), which is able to bind to the DNA. The binding occurs upstream of the transcriptional start site (TSS) in a region called core promoter, this part of the promoter often contains specific elements like the TATA-Box (consensus sequence, TATAA/TAA/T, about 30 bp upstream of TSS [50]) and the GC-Box (consensus sequence TGTGGCTNNNAGCCAA) app. 80 bp upstream of the TSS [51] to which the IC can bind. Further upstream is a part of the promoter which is referred to as proximal promoter. Containing specific sequence elements, this part of the promoter is highly important for the transcriptional regulation. Transcription factors can bind to these response elements thereby up regulating or down regulating the gene transcription. Proteins methylating or acetylating the DNA are also involved in gene transcription regulation by remodelling of the chromatin structure.

CPT
Camptothecins (CPT) is a cytotoxic quinoline alkaloid and a topoisomerase I inhibitor isolated from the Camptotheca acuminata (Camptotheca or the Happy tree). It was discovered during a screen for natural anti-cancer drugs in 1966 but it is not not used in cancer therapy due to its severe side effects, but there were various derivatives developed to increase the benefits of this drug while decreasing its negative effects.[52] The two CPT analogues have been approved for cancer chemotherapy today are topotecan and irinotecan. CPT acts by binding to the topoisomerase I-DNA complex using hydrogen bonds and thereby preventing DNA-religation, inducing DNA damage and ultimately causing the cell to die. [53]

GPI
Glycosylphosphatidylinositol (GPI) is a glycolipid. During the posttranslational modification in eukaryotic cells, it becomes attached to hydrophobic C-termini of proteins that have a special singnal peptide on them. This signalpeptide leads their translation into the ER, where the hydrophobic C-terminus will be replaced by a GPI anchor. Because of its hydrophobic nature it attaches the bound protein to the cell membrane [54].

Sar-1
Sar-1 GTP-binding proteins direct the transport of molecules inside of veiscles from the ER to the golgi and the other way round. Being an anchor for COPII molecules that cause the budding of vesicles off the membranes, it needs a domain to attach to the ER membrane [55]. The C terminus of the Sar-1 protein fullfills this task. Therefore one can use the C terminus as an ER targeting sequence for other proteins.

Myrpalm
This localization signal is located at the N-terminal end of the amino acid chain. The myrpalm signaling sequence causes a myristilation and palmitolyation of the targeted protein. Both modifications lead to a binding to the cell membrane [56].

NLS
Nuclear localisation signals are peptidesequences that are able to bind to nuclear import receptors. These cause an import of newly synthesized protein through nuclear pores. This feature is caused by several positively charged amino acids. Nuclear localization signals can be located almost anywhere in the peptidechain [57] We used a nuclear localization signal at the C-terminal end of the protein.

GFP
Green Fluorescent Protein (GFP) was first discovered by Shimomura et al. in the Aequorea jellyﬁsh. They described a slightly green colour of a GFP-containing solution that in the sunlight[63]. The same group of scientists investigated the protein in more detail, and have since discovered many characteristics, including the excitation and emission wavelengths. The most important accomplishment was the cloning of the GFP gene into other organisms to make them fluorescent [64] [65]. Many scientists have since worked on GFP and introduced mutations to enhance fluorscence levels or change the spectra. Nowadays flourescent proteins exist in different colours exist increasing their range of application even more.