| Literature DB >> 20660476 |
Xiaobai Zhang1, Juan Cui, Daniel Nilsson, Kapila Gunasekera, Astrid Chanfon, Xiaofeng Song, Huinan Wang, Ying Xu, Torsten Ochsenreiter.
Abstract
It has long been known that trypanosomes regulate mitochondrial biogenesis during the life cycle of the parasite; however, the mitochondrial protein inventory (MitoCarta) and its regulation remain unknown. We present a novel computational method for genome-wide prediction of mitochondrial proteins using a support vector machine-based classifier with ∼90% prediction accuracy. Using this method, we predicted the mitochondrial localization of 468 proteins with high confidence and have experimentally verified the localization of a subset of these proteins. We then applied a recently developed parallel sequencing technology to determine the expression profiles and the splicing patterns of a total of 1065 predicted MitoCarta transcripts during the development of the parasite, and showed that 435 of the transcripts significantly changed their expressions while 630 remain unchanged in any of the three life stages analyzed. Furthermore, we identified 298 alternatively splicing events, a small subset of which could lead to dual localization of the corresponding proteins.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20660476 PMCID: PMC2995047 DOI: 10.1093/nar/gkq618
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Prediction performance on 50 testing sets
| MCC | Se | Sp | Acc | |
|---|---|---|---|---|
| Best performance | 0.752 | 0.823 | 0.926 | 0.894 |
| Worst performance | 0.656 | 0.801 | 0.874 | 0.852 |
| Mean | 0.709 | 0.816 | 0.901 | 0.875 |
| Std | 0.029 | 0.014 | 0.018 | 0.014 |
Figure 1.Statistical relationship between the prediction accuracy and the SVM score. The relationship is based on the analysis of 597 (positive set) and 1290 (negative set). The x-axis is the absolute SVM score and the y-axis represents prediction precision.
Comparison of prediction performance of 14 methods applied on 147 mitochondrial and 325 non-mitochondrial proteins
| Method | MCC | Se | Sp | Acc |
|---|---|---|---|---|
| TMPP | 0.752 | 0.823 | 0.926 | 0.894 |
| WoLF PSORT | 0.577 | 0.605 | 0.926 | 0.826 |
| Predotar | 0.543 | 0.612 | 0.901 | 0.811 |
| MitoProt | 0.539 | 0.735 | 0.821 | 0.794 |
| PredSL | 0.532 | 0.558 | 0.923 | 0.809 |
| PSORT | 0.474 | 0.497 | 0.920 | 0.788 |
| TargetP | 0.459 | 0.442 | 0.938 | 0.784 |
| Mitpred | 0.439 | 0.469 | 0.914 | 0.775 |
| MITOPRED | 0.422 | 0.667 | 0.772 | 0.739 |
| iPSORT | 0.419 | 0.497 | 0.886 | 0.765 |
| Proteome Analyst | 0.383 | 0.231 | 0.990 | 0.754 |
| ELSpred | 0.356 | 0.476 | 0.855 | 0.737 |
| PLOC | 0.320 | 0.429 | 0.862 | 0.727 |
| SubLoc | 0.243 | 0.517 | 0.732 | 0.665 |
Figure 2.Localization of predicted MitoCarta proteins. The fluorescence microscopy images from nine GFP tagged proteins that were randomly picked from our prediction are shown. First column GFP, second column Mitotracker, third column, overlay including the DAPI and differential interference contrast image, fourth column western blotts of total cell protein fractionated in a mitochondrial fraction (M) and a cytosolic fraction (C) and probed with antibodies against the cytosolic HSP70, the mitochondrial iron sulfur protein and GFP, fifth column the SDS–PAGE from the fractionated protein extract.
Expression and splicing profile of the MitoCarta in long slender, short stumpy and procyclic form T. brucei
| Long slender (LS) | Short stumpy (SS) | Procyclic (PC) | Total | |
|---|---|---|---|---|
| MitoCarta genes | – | – | – | 1065 |
| Differentially regulated | 109 (LS to SS) | 357 (SS to PC) | 350 (LS to PC) | 435 |
| Unchanged expression | – | – | – | 630 |
| Alternatively spliced | 174 | 186 | 154 | 298 |
| Differentially spliced | 11 (LS to SS) | 50 (SS/PC) | 45 (LS to PC) | – |
| Major internal sites | 88 | 90 | 74 | – |
| Only internal sites | 55 | 51 | 52 | – |
| N-terminal extensions | – | – | – | 23 |
| Genes expressed | 934 | 932 | 945 | – |
| TPM (mean) | 75 | 81 | 87 | – |
| TPM (median) | 24 | 22 | 29 | – |
aTotal number of genes that significantly change expression levels.
bThe major splice site contains less than 60% of the tags.
cSplice site would allow for N-terminal extension of the reading frame.
Figure. 3.Two examples of alternative splice variants potentially leading to changes in localization. Screenshot of a region from chromosome of 2.5 kbp of two genes with SLT data in TPM from procyclic (green), bloodstream long slender (red), short stumpy forms (orange) and potential mitochondrial targeting signals (MTS, white box) and signal peptide sequence (SP, grey box) on the corresponding protein sequences (black line). (A) Tb927.4.4910, the upstream splice site potentially lead to a N-terminal extension of the open reading frame now containing a MTS of 58 amino acids as predicted by MITOPROT (confidence 0.75). The downstream splice site excludes the MTS, instead leading to a SP of 34 amino acids (confidence 0.98, SingnalP). (B) Tb11.01.1040, the uptstream splice site would allow translation from the currently annotated AUG leading to a protein containing a 30 amino acid MTS (confidence 0.99, MITOPROT), while the downstream, internal splice site would exclude the MTS leading to a 345 amino acid protein.