| Literature DB >> 32938368 |
Castrense Savojardo1, Pier Luigi Martelli2, Giacomo Tartari1,3, Rita Casadio1,3.
Abstract
BACKGROUND: The prediction of protein subcellular localization is a key step of the big effort towards protein functional annotation. Many computational methods exist to identify high-level protein subcellular compartments such as nucleus, cytoplasm or organelles. However, many organelles, like mitochondria, have their own internal compartmentalization. Knowing the precise location of a protein inside mitochondria is crucial for its accurate functional characterization. We recently developed DeepMito, a new method based on a 1-Dimensional Convolutional Neural Network (1D-CNN) architecture outperforming other similar approaches available in literature.Entities:
Keywords: Convolutional neural network; Deep learning; Functional annotation; Mitochondrial protein; Subcellular localization; Submitochondrial localization
Mesh:
Substances:
Year: 2020 PMID: 32938368 PMCID: PMC7493403 DOI: 10.1186/s12859-020-03617-z
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
The present status of Subcellular Location (SL) annotations of five reference proteomes
| Human | Mouse | Yeast | Fly | Arabidopsis | |
|---|---|---|---|---|---|
| 20,667 | 22,259 | 6049 | 13,796 | 27,466 | |
| 350 | 120 | 278 | 32 | 244 | |
| 263 | 97 | 251 | 12 | 130 | |
| 6178 | 3561 | 2208 | 895 | 3642 | |
| 9772 | 12,244 | 2022 | 2373 | 8309 | |
| 4104 | 6237 | 1290 | 10,484 | 15,141 |
Compartment-level statistics of experimental (ECO:000269) sub-mitochondrial localizations
| Localization | Human | Mouse | Yeast | Fly | Arabidopsis |
|---|---|---|---|---|---|
| Outer membrane | 61 | 26 | 35 | 3 | 21 |
| Inner membrane | 92 | 41 | 106 | 4 | 57 |
| Intermembrane space | 11 | 4 | 37 | 3 | 6 |
| Matrix | 77 | 21 | 54 | 2 | 41 |
Prediction performance (MCC) of DeepMito on the sets of proteins endowed with experimental (ECO:000269) sub-mitochondrial localization
| Class/Species | Human | Mouse | Yeast | Fly | Arabidopsis | Overall |
|---|---|---|---|---|---|---|
| 0.87 | 0.67 | 0.75 | 1.0 | 0.85 | 0.81 | |
| 0.78 | 0.72 | 0.78 | 0.71 | 0.79 | 0.77 | |
| 0.72 | 0.56 | 0.74 | 0.77 | 0.86 | 0.73 | |
| 0.72 | 0.57 | 0.80 | 0.67 | 0.76 | 0.73 |
(a)Only 12 proteins are included in this dataset
Fig. 1Distributions of combined experimental and predicted sub-mitochondrial compartments for the five species considered in this study
Summary of sub-mitochondrial localizations (annotated and predicted) of the 4307 mitochondrial proteins
| Class/Species | Human | Mouse | Yeast | Fly | Arabidopsis | Overall |
|---|---|---|---|---|---|---|
| 151 | 93 | 82 | 34 | 85 | 445 | |
| 441 | 354 | 321 | 288 | 386 | 1790 | |
| 62 | 31 | 77 | 28 | 31 | 229 | |
| 412 | 293 | 231 | 323 | 638 | 1897 | |
| 1066 | 771 | 711 | 673 | 1140 | 4361 (a) |
(a)The total count is greater than 4307 because of the presence of multi-localizing proteins
Number of proteins annotated with GO-BP and GO-MF in the five proteomes
| GO term | ||
|---|---|---|
| BP | MF | |
| 3668 | 3492 | |
| 15,984 | 10,346 | |
| 3439 | 4527 | |
| 11,053 | 5433 | |
| 30,476 (5381) | 20,306 (2133) | |
(a)Evidence codes: all except IEA
(b)IEA evidence code
(c)Terms uniquely annotated by BAR3.0
Fig. 2Comparison of data contained in different databases of human (a) and mouse (b) mitochondrial genes/proteins. In parentheses the number of unique Ensembl gene ID contained in each database
Fig. 3The DeepMito convolutional network architecture