Team:Warsaw/Modelling/Structural

From 2009.igem.org

Revision as of 08:42, 11 October 2009

Introduction

Because the native conformation of the secretion peptide from hemolysin A has not been determined, we decided to use several computational structure prediction methods to find the three-dimensional structure of this domain. Additionally, we attempted to obtain theoretical models of the proapoptotic fusion proteins and the naturally occurring proteins used in our project.

Fundamental basis

Protein folding

Protein folding is the physical process by which a polypeptide chain folds from a random coil into a highly specific, functional three-dimensional structure. Shortly after translation from mRNA, each protein molecule exists as an unfolded chain with no characteristic conformation. The amino acids then interact with each other to create a well-defined three-dimensional structure known as the native state. The resulting conformation is determined by the amino acid sequence.

Fusion proteins

Fusion proteins are created by joining two or more genes that originally encoded separate polypeptide chains. Expression of the fusion gene results in a single polypeptide with functional properties derived from each of the proteins encoded by the original genes. Recombinant fusion proteins are created artificially via DNA recombination for use in biological research or to produce altered proteins with new features.

In most cases the functionality of the fused proteins is not disrupted. This is possible because of the intrinsic modularity of protein domains. The fragment of the polypeptide that corresponds to a given domain may be removed from or added to the rest of the molecule without destroying its native capabilities.

However, it is highly recommended to predict the three-dimensional structure of the fusion protein or of the artificially attached domains. Knowledge of the spatial organisation of a protein is an extremely useful prerequisite for understanding its function and for modifying it rationally.

Methods

Computation

We chose the following servers to compute the secondary structures and full models for the proteins of interest.

[http://www.bioinfo.pl/ BioInfoBank Meta Server]

This server collects structural models from a set of prediction servers and assesses them using the 3D-Jury consensus approach.

[http://zhang.bioinformatics.ku.edu/I-TASSER/ I-TASSER]

The I-TASSER server is an online service for protein structure and function prediction. Models are built from multiple-threading alignments by LOMETS and iterative TASSER simulations.

[http://robetta.bakerlab.org/ Robetta]

Robetta is a full-chain protein structure prediction server. It parses protein chains into putative domains and models those domains either by homology modeling or by de novo modeling.

[http://www.reading.ac.uk/bioinf/ModFOLD/ The ModFOLD Model Quality Assessment Server]

ModFOLD is a server that provides a single score and a p-value for the predicted quality of a single 3D model of a protein structure, and ranks multiple 3D models of the same protein target by predicted quality. It can also predict local quality within multiple models.

A more detailed description of the methods used is available here.
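The consensus idea behind the meta server can be illustrated with a small sketch. This is a simplified illustration in the spirit of 3D-Jury, not the server's actual implementation: it assumes the models are already superimposed in a common frame and given as equally long lists of C-alpha coordinates, and the names `pair_similarity`, `consensus_scores`, `cutoff` and `min_similarity` are hypothetical. A model scores highly when many of its residues lie close to the corresponding residues in the other models.

```python
import math


def pair_similarity(model_a, model_b, cutoff=3.5):
    """Count corresponding C-alpha positions closer than `cutoff` angstroms.

    Assumption: both models are already superimposed and residue i in
    model_a corresponds to residue i in model_b.
    """
    return sum(1 for p, q in zip(model_a, model_b) if math.dist(p, q) < cutoff)


def consensus_scores(models, cutoff=3.5, min_similarity=0):
    """Score each model by its average similarity to all other models.

    `models` maps a model name to a list of (x, y, z) tuples.
    Pairs whose similarity falls below `min_similarity` contribute nothing;
    both cutoffs are illustrative defaults, not values taken from the server.
    """
    scores = {}
    for name, coords in models.items():
        total = 0
        for other_name, other_coords in models.items():
            if other_name == name:
                continue
            sim = pair_similarity(coords, other_coords, cutoff)
            if sim >= min_similarity:
                total += sim
        # Average over the other models; guard against a single-model input.
        scores[name] = total / max(1, len(models) - 1)
    return scores
```

Models that agree with many others receive a high score, while an outlier that resembles no other prediction scores close to zero.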

Evaluation

We used the following measures of model validity:

Ramachandran plot
RMSD
TM-score
C-score

A more detailed description of these measures is available here.
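As a rough illustration of two of these measures, the sketch below computes RMSD and a TM-score for a pair of structures. It assumes the two structures are already superimposed and given as equally long lists of C-alpha coordinates with residue i matched to residue i; the function names are hypothetical. The published TM-score takes the maximum over superpositions, whereas this sketch evaluates a single fixed one, and the 0.5 Å floor on the distance scale is an assumption made here for short chains.

```python
import math


def rmsd(coords_a, coords_b):
    """Root-mean-square deviation between matched (x, y, z) points."""
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and of equal length")
    total = sum(math.dist(p, q) ** 2 for p, q in zip(coords_a, coords_b))
    return math.sqrt(total / len(coords_a))


def tm_score(coords_model, coords_native):
    """TM-score for one fixed superposition, normalised by the native length."""
    length = len(coords_native)
    if len(coords_model) != length or length == 0:
        raise ValueError("coordinate lists must be non-empty and of equal length")
    # Length-dependent distance scale d0 = 1.24 * (L - 15)^(1/3) - 1.8.
    # Assumption: use a floor of 0.5 angstroms when the chain is short.
    if length > 15:
        d0 = 1.24 * (length - 15) ** (1.0 / 3.0) - 1.8
    else:
        d0 = 0.5
    d0 = max(d0, 0.5)
    total = sum(
        1.0 / (1.0 + (math.dist(p, q) / d0) ** 2)
        for p, q in zip(coords_model, coords_native)
    )
    return total / length
```

RMSD grows without bound and is dominated by the worst-fitting residues, whereas the TM-score is confined to the interval (0, 1] and weights close residue pairs more strongly, which is why the two measures can disagree about the same model.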

Results

2D-predictions

The secondary structures of our proteins of interest were predicted with programs available on bioinformatics meta servers. All structures (except one, which was found by I-TASSER) were predicted using the meta.bioinfo.pl server. Detailed information about the secondary structures is available [https://2009.igem.org/wiki/index.php?title=Team:Warsaw/Modelling/Structural/secondary_structures here].

Full models

Secretion peptide

We were advised to elucidate the three-dimensional structure of the hemolysin A domain responsible for its secretion. For commonly used large protein tags such as GST, it is known that the added domain usually folds autonomously and does not disrupt the native structure of the rest of the molecule. However, no data are available on the influence of the hemolysin A secretion domain on correct folding.

Model quality scores (TM-score calculated by ModFOLD), range 0.3015 – 0.1873:

0.3015 – model1 (Muster)
0.2813 – model2 (Modeller) 
0.2754 – model3 (Modeller)
0.2585 – model4 (Tasser)

The accuracy of the predicted structures is moderate. All generated models resemble each other; however, the RMSD values among them show that the similarity is limited. The global geometry of the modelled domain is preserved across the models, but the spatial positions of individual amino acid residues differ in each case. The mediocre TM-score indicates that the global topology of the models may not correspond to the real structure of the secretion peptide.

Bax with secretion domain secreted

Most of the obtained models are incorrect at first sight and do not form valid proteins. Only the structure predicted by TASSER seems to be physically acceptable. It should be noted that the resolution of these models is low, especially for the secretion domain. Many residues have improper dihedral angle values. In spite of these results, the conformation of the part of the protein that corresponds to Bax appears not to have been altered by the presence of the additional domain. Unfortunately, the calculated TM-score indicates that the global topology of the models may not correspond to the real structure of the fusion protein.

Model quality scores (ModFOLD):

0.2470 – model31
0.2463 – model51
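The dihedral-angle check mentioned for these models relies on the backbone angles phi and psi that make up a Ramachandran plot. The following is a minimal sketch of how those angles can be computed from backbone atom coordinates. The data layout (a list of residues, each a dict with `"N"`, `"CA"` and `"C"` coordinates) and the function names are assumptions made for illustration, not the format used by any of the servers above.

```python
import math


def _sub(a, b):
    return (a[0] - b[0], a[1] - b[1], a[2] - b[2])


def _dot(a, b):
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]


def _cross(a, b):
    return (
        a[1] * b[2] - a[2] * b[1],
        a[2] * b[0] - a[0] * b[2],
        a[0] * b[1] - a[1] * b[0],
    )


def dihedral(p0, p1, p2, p3):
    """Signed dihedral angle in degrees defined by four points."""
    b1, b2, b3 = _sub(p1, p0), _sub(p2, p1), _sub(p3, p2)
    n1, n2 = _cross(b1, b2), _cross(b2, b3)
    # atan2 form keeps the sign and stays numerically stable near 0 and 180 degrees.
    y = math.sqrt(_dot(b2, b2)) * _dot(b1, n2)
    x = _dot(n1, n2)
    return math.degrees(math.atan2(y, x))


def phi_psi(residues):
    """Return a (phi, psi) tuple per residue; undefined terminal angles are None.

    phi(i) = C(i-1), N(i), CA(i), C(i)
    psi(i) = N(i), CA(i), C(i), N(i+1)
    """
    angles = []
    for i, res in enumerate(residues):
        phi = psi = None
        if i > 0:
            phi = dihedral(residues[i - 1]["C"], res["N"], res["CA"], res["C"])
        if i < len(residues) - 1:
            psi = dihedral(res["N"], res["CA"], res["C"], residues[i + 1]["N"])
        angles.append((phi, psi))
    return angles
```

Plotting the resulting (phi, psi) pairs against the sterically allowed regions shows which residues of a model have improper backbone geometry.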

p53 with both signal domains

The best models were created by simple threading programs such as LOMETS, but the structures found by the TASSER server were almost the same. The major difference between the models created by these programs was the topology of the secretion domain. LOMETS correctly recognized the structure of the secretion peptide but was unable to reconstruct the geometry of p53. The physical quality of all obtained models was evaluated with the ModFOLD server. The assessment revealed that the resolution of all predicted structures is low. However, there is a structural similarity between the secretion domain from hemolysin A and the obtained model of that domain. For the p53 core domain the situation is better. The RMSD between the modelled protein core and experimentally resolved structures from the PDB is surprisingly high. Unfortunately, the validity of the other parts of the molecule is below the confidence threshold and appears to have no statistical significance. As mentioned above, in the models created by LOMETS the resolution of the secretion domain is acceptable.

Listeriolysin with secretion domain

Most of the obtained models are physically incorrect, and it is unlikely that they represent valid proteins. Only one structure, predicted by TASSER, seems to be acceptable. It should be noted that the resolution of this model is low, especially for the secretion domain. Many residues have improper dihedral angle values. Despite these findings, the conformation of the part of the protein that corresponds to listeriolysin appears not to have been altered by the presence of the additional domain. Unfortunately, the calculated TM-score suggests that the global topology of the models may not correspond to the real structure of the fusion protein.

Invasin

Because no suitable structural template was available, the best solution was to create a model only for the domain responsible for invasiveness. The full structural model of the invasive domain was built using the crystal structure of the related invasin from Yersinia pseudotuberculosis. Alignment to the known structure of this similar invasin indicates that the global geometry of the two proteins is closely related. It should be emphasised that the spatial organisation of the essential amino acid residues in the part of the molecule that interacts with the integrin receptor is almost the same.

PhoP and PhoQ

These two proteins are responsible for induction of the pH-dependent promoter, which is a pivotal element of our system. Because the full structures of both PhoP and PhoQ were unknown, we decided to find them by means of structural modeling. PhoP was an easy target because very similar proteins are present in the PDB database. Models generated by four programs have the same geometry, and the RMSD between the structures is minimal (<0.25). All structures were validated using the ModFOLD server. The analysis revealed that all of them appear to be correct and that the differences among the best structures are not significant. These findings indicate that the obtained models are notably congruent with the native structure.
