| Literature DB >> 28640805 |
Stefano Castellana1, Caterina Fusilli1, Gianluigi Mazzoccoli2, Tommaso Biagini1, Daniele Capocefalo1, Massimo Carella3, Angelo Luigi Vescovi4,5, Tommaso Mazza1.
Abstract
24,189 are all the possible non-synonymous amino acid changes potentially affecting the human mitochondrial DNA. Only a tiny subset was functionally evaluated with certainty so far, while the pathogenicity of the vast majority was only assessed in-silico by software predictors. Since these tools proved to be rather incongruent, we have designed and implemented APOGEE, a machine-learning algorithm that outperforms all existing prediction methods in estimating the harmfulness of mitochondrial non-synonymous genome variations. We provide a detailed description of the underlying algorithm, of the selected and manually curated training and test sets of variants, as well as of its classification ability.Entities:
Mesh:
Year: 2017 PMID: 28640805 PMCID: PMC5501658 DOI: 10.1371/journal.pcbi.1005628
Source DB: PubMed Journal: PLoS Comput Biol ISSN: 1553-734X Impact factor: 4.475
Performance evaluation calculated on 864 known mitochondrial non-synonymous variants.
Number of available predictions in last column.
| 120 | 369 | 263 | 100 | 0,58 | 0,54 | 0,57 | 0,31 | 0,68 | 0,11 | 42,61 | 852 | |
| 139 | 288 | 344 | 81 | 0,46 | 0,63 | 0,51 | 0,29 | 0,71 | 0,08 | 49,88 | 852 | |
| 31 | 560 | 81 | 191 | 0,87 | 0,14 | 0,68 | 0,28 | 0,72 | 0,02 | 31,52 | 863 | |
| 0 | 638 | 3 | 222 | 0,99 | 0,00 | 0,74 | 0,00 | 1,00 | -0,03 | 26,07 | 863 | |
| 82 | 473 | 168 | 141 | 0,74 | 0,39 | 0,64 | 0,33 | 0,67 | 0,11 | 35,76 | 863 | |
| 128 | 329 | 312 | 94 | 0,51 | 0,58 | 0,53 | 0,29 | 0,71 | 0,08 | 47,05 | 863 | |
| 130 | 331 | 307 | 87 | 0,52 | 0,59 | 0,54 | 0,29 | 0,70 | 0,11 | 46,08 | 854 | |
| 69 | 432 | 79 | 154 | 0,84 | 0,31 | 0,68 | 0,47 | 0,53 | 0,18 | 31,74 | 734 | |
| 83 | 511 | 130 | 140 | 0,79 | 0,37 | 0,69 | 0,39 | 0,61 | 0,17 | 31,25 | 864 | |
| 69 | 495 | 146 | 154 | 0,77 | 0,31 | 0,65 | 0,32 | 0,68 | 0,08 | 34,72 | 864 | |
| 89 | 353 | 213 | 102 | 0,62 | 0,47 | 0,58 | 0,29 | 0,71 | 0,08 | 41,61 | 757 | |
| 141 | 291 | 350 | 82 | 0,45 | 0,63 | 0,51 | 0,28 | 0,71 | 0,07 | 50,00 | 864 | |
| 128 | 345 | 296 | 95 | 0,54 | 0,57 | 0,55 | 0,30 | 0,69 | 0,09 | 45,25 | 864 | |
| 128 | 340 | 301 | 95 | 0,53 | 0,57 | 0,54 | 0,29 | 0,71 | 0,09 | 45,83 | 864 | |
| 117 | 399 | 242 | 106 | 0,62 | 0,52 | 0,59 | 0,33 | 0,67 | 0,13 | 40,28 | 864 | |
| 85 | 337 | 304 | 138 | 0,53 | 0,38 | 0,49 | 0,23 | 0,78 | -0,08 | 51,16 | 864 | |
| 117 | 374 | 242 | 103 | 0,61 | 0,53 | 0,59 | 0,33 | 0,67 | 0,12 | 41,27 | 836 | |
| 142 | 276 | 365 | 81 | 0,43 | 0,64 | 0,48 | 0,28 | 0,72 | 0,06 | 51,62 | 864 | |
*possibly damaging variants considered as benign
#possibly damaging variants considered as harmful
§low and neutral predictions considered as harmless, while medium and high impact predictions are considered pathogenic.
Fig 1Meta-predictors performance comparisons by receiver operating characteristic curves.
Performance evaluation calculated on 153 known and unbiased mitochondrial non-synonymous variants.
| 115 | 23 | ||
| 5 | 10 | ||
Performance evaluation calculated on 48 known and unbiased mitochondrial non-synonymous variants.
| 30 | 9 | ||
| 2 | 7 | ||
Known variants grouped by mitochondrial gene symbol and OXPHOS complex.
| ATP6 | 78 | 19 (24.4%) | 22 (28.2%) | 62 (79.5%) | |
| ATP8 | 18 | 6 (33.3%) | 7 (38.9%) | 15 (83.3%) | |
| COX1 | 81 | 25 (30.9%) | 25 (30.9%) | 62 (76.5%) | |
| COX2 | 50 | 14 (28%) | 14 (28%) | 37 (74%) | |
| COX3 | 59 | 10 (16.9%) | 10 (16.9%) | 49 (83.1%) | |
| III | CYB | 98 | 33 (33.7%) | 33 (33.7%) | 79 (80.6%) |
| ND1 | 103 | 37 (35.9%) | 38 (36.9%) | 75 (72.8%) | |
| ND2 | 60 | 12 (20%) | 12 (20%) | 49 (81.7%) | |
| ND3 | 26 | 6 (23.1%) | 6 (23.1%) | 22 (84.6%) | |
| ND4 | 74 | 12 (16.2%) | 12 (12%) | 68 (91.9%) | |
| ND4L | 25 | 3 (12%) | 3 (12%) | 23 (92%) | |
| ND5 | 140 | 28 (20%) | 28 (20%) | 117 (83.6%) | |
| ND6 | 52 | 18 (34.6%) | 18 (34.6%) | 41 (78.8%) |
List of assembled predictors and annotations in MitImpact.
| PolyPhen2, SIFT, FatHmm, PROVEAN, MutationAssessor, EFIN, | |
| CAROL, Condel, | |
| dbSNP 144, COSMIC 68, MITOMAP July 2015 | |
| PhyloP100V, PhastCons100V, SiteVar, |