| Literature DB >> 21251333 |
Guido Leoni1, Loredana Le Pera, Fabrizio Ferrè, Domenico Raimondo, Anna Tramontano.
Abstract
BACKGROUND: Analysis of the human genome has revealed that as much as an order of magnitude more of the genomic sequence is transcribed than accounted for by the predicted and characterized genes. A number of these transcripts are alternatively spliced forms of known protein coding genes; however, it is becoming clear that many of them do not necessarily correspond to a functional protein.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21251333 PMCID: PMC3091307 DOI: 10.1186/gb-2011-12-1-r9
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Figure 1Scheme of possible scenarios for comparing different isoforms. Only peptides mapping in the products of shaded regions are considered specific.
Figure 2Distribution of expression level values for specific exons of transcripts included in the noASPos, ASPos and unknown datasets.
Figure 3Venn diagrams showing the number of isoforms predicted to be functional and unlikely to be functional according to each method. (a-h) The number of isoforms predicted to be functional according to each method in the ASPos dataset (a), the noASPos dataset (c), the unknown dataset (e) and the negative dataset (g) and the number of isoforms unlikely to be functional according to each method in the ASPos dataset (b), the noASPos dataset (d), the unknown dataset (f) and the negative dataset (h). AS, preservation of active sites; Pfam, completeness of Pfam domains; St, structural plausibility.
Results of the statistical analysis with respect to the unknown dataset
| Coverage | TP | TN | FP | FN | Accuracy | Sensitivity | Specificity | |
|---|---|---|---|---|---|---|---|---|
| AS | 0.14 | 79 | 31 | 48 | 1 | 0.69 | 0.99 | 0.39 |
| St | 0.26 | 134 | 76 | 69 | 13 | 0.72 | 0.91 | 0.52 |
| Pfam | 0.81 | 480 | 165 | 227 | 23 | 0.72 | 0.95 | 0.41 |
| AS U St | 0.35 | 175 | 101 | 99 | 14 | 0.71 | 0.93 | 0.50 |
| St U Pfam | 0.89 | 490 | 203 | 257 | 35 | 0.71 | 0.93 | 0.44 |
| AS U Pfam | 0.85 | 485 | 186 | 250 | 24 | 0.71 | 0.95 | 0.43 |
| AS U St U Pfam | 0.92 | 494 | 221 | 272 | 36 | 0.70 | 0.93 | 0.45 |
| AS ∩ St | 0.06 | 38 | 6 | 18 | 0 | 0.71 | 1.00 | 0.25 |
| St ∩ Pfam | 0.18 | 124 | 38 | 39 | 1 | 0.80 | 0.99 | 0.49 |
| AS ∩ Pfam | 0.10 | 74 | 10 | 25 | 0 | 0.77 | 1.00 | 0.28 |
| AS ∩ St ∩ Pfam | 0.05 | 37 | 3 | 10 | 0 | 0.80 | 1.00 | 0.23 |
Coverage, accuracy, sensitivity and specificity of the different strategies and their combinations (U = union and ∩ = intersection) with respect to the unknown dataset. AS, preservation of active sites; Pfam, completeness of Pfam domains; St, structural plausibility. The definition of the other parameters is reported in Materials and methods. FN, false negative; FP, false positive; TN, true negative; TP, true positive.
Results of the statistical analysis with respect to the negative dataset
| Coverage | TP | TN | FP | FN | Accuracy | Sensitivity | Specificity | |
|---|---|---|---|---|---|---|---|---|
| St | 0.21 | 134 | 44 | 40 | 13 | 0.77 | 0.91 | 0.52 |
| Pfam | 0.81 | 480 | 117 | 280 | 23 | 0.67 | 0.95 | 0.29 |
| St U Pfam | 0.87 | 490 | 145 | 291 | 35 | 0.66 | 0.93 | 0.33 |
| St ∩ Pfam | 0.15 | 124 | 16 | 29 | 1 | 0.82 | 0.99 | 0.36 |
Coverage, accuracy, sensitivity and specificity of the different strategies and their combinations (U = union and ∩ = intersection) with respect to the negative dataset. Pfam, completeness of Pfam domains; St, structural plausibility. The definition of the other parameters is reported in Materials and methods. FN, false negative; FP, false positive; TN, true negative; TP, true positive.