| Literature DB >> 19811632 |
Petri Törönen1, Petri Pehkonen, Liisa Holm.
Abstract
BACKGROUND: The analysis of over-represented functional classes in a list of genes is one of the most essential bioinformatics research topics. Typical examples of such lists are the differentially expressed genes from transcriptional analysis which need to be linked to functional information represented in the Gene Ontology (GO). Despite the importance of this procedure, there is a little work on consistent evaluation of various GO analysis methods. Especially, there is no literature on creating benchmark datasets for GO analysis tools.Entities:
Mesh:
Year: 2009 PMID: 19811632 PMCID: PMC2762998 DOI: 10.1186/1471-2105-10-319
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Flow chart representation of the first steps of POSGODA. A simplified representation of the flow chart of the function used to select independent classes, to define the multiple testing corrected p-value signal levels, and to define the class outcome closest to the signal level. Input parameters and data are represented with light blue, intermediate steps are represented with green and end states are represented with gray. Steps associated with numbers are explained in the text. For more detailed un-simplified representation see the supplementary text.
Figure 2Flow chart representation of the last steps of POSGODA. A simplified representation of the flow chart of the function used to map genes to the generated gene list. Colouring is similar to fig. 1. The detailed representation is in the supplementary text.
Evaluation of embedded signal levels with two signal normalizations
| Holm | 5 | uncorr. | 2.54E-05 | 2.01E-05 | 1.59E-05 | 9.99E-06 | 5.8E-06 | 4.73E-06 | 4.26E-06 |
| Holm | 10 | corrected | 0.0556 | 0.044 | 0.0348 | 0.0219 | 0.0127 | 0.0104 | 0.0093 |
| Holm | 5 | uncorr. | 2.54E-05 | 2E-05 | 1.58E-05 | 9.99E-06 | 5.58E-06 | 4.73E-06 | 4.18E-06 |
| Holm | 10 | corrected | 0.0555 | 0.0437 | 0.0346 | 0.0218 | 0.0122 | 0.0103 | 0.0091 |
| FDR | 5 | uncorr. | 0.000124 | 0.000107 | 8.69E-05 | 5.38E-05 | 4.04E-05 | 2.75E-05 | 0.000021 |
| FDR | 10 | corrected | 0.054 | 0.0481 | 0.044 | 0.0281 | 0.0191 | 0.0114 | 0.0093 |
| FDR | 5 | uncorr. | 0.000246 | 0.000219 | 0.0002 | 0.000128 | 8.69E-05 | 0.000052 | 4.23E-05 |
| FDR | 10 | corrected | 0.0547 | 0.0468 | 0.0382 | 0.0236 | 0.0177 | 0.0121 | 0.0092 |
Table shows the summary of results with reported percentiles. Signal generation is tested with 5 and 10 positive classes and with signal normalizations that use Holm's method and FDR. Note that 5 percentile limit and 95 percentile limit stay within the required 0.01-0.05 range. Minimum and maximum show deviation from the signal range but the deviation is always only appr. 10% of the allowed signal.
Top scoring GO classes from one of the positive datasets
| 1 | 5.152 | metal ion transporter activity | -0.005 | -0.0078 | -0.0068 | -0.0067 | |
| 2 | 4.8916 | carboxylic acid transport | 0.0266 | -0.0088 | -0.0076 | 0.0165 | |
| 3 | 4.7933 | organic acid transport | 0.0262 | -0.0089 | -0.0077 | 0.0162 | |
| 4 | 4.6381 | tRNA ligase activity | -0.0048 | -0.0075 | -0.0064 | -0.0064 | |
| 5 | 4.6381 | -0.0048 | -0.0075 | -0.0065 | -0.0064 | ||
| 6 | 4.6381 | -0.0048 | -0.0075 | -0.0065 | -0.0064 | ||
| 7 | 4.6381 | ligase activity, forming carbon-oxygen bonds | -0.0048 | -0.0075 | -0.0064 | -0.0064 | |
| 8 | 4.6381 | ligase activity, formingaminoacyl-tRNA and relatedcompounds | -0.0048 | -0.0075 | -0.0064 | -0.0064 | |
| 9 | 4.5219 | -0.0048 | -0.0076 | -0.0065 | -0.0065 | ||
| 10 | 4.2093 | carboxylic acid transporter activity | 0.0269 | -0.0087 | -0.0075 | 0.0168 | |
| 11 | 4.1976 | ligase activity, forming phosphoric ester bonds | -0.005 | -0.0079 | -0.0068 | -0.0068 | |
| 12 | 4.1196 | -0.0056 | -0.0076 | -0.0075 | -0.0075 | ||
| 13 | 4.0201 | -0.0056 | -0.0048 | -0.0048 | -0.0048 | ||
| 14 | 3.9477 | organic acid transporter activity | 0.0258 | -0.0089 | -0.0078 | 0.0159 | |
| 15 | 3.8469 | transition metal ion transporter activity | -0.0042 | -0.0066 | -0.0057 | -0.0057 | |
| 16 | 3.7791 | iron ion transporter activity | -0.0025 | -0.0039 | -0.0034 | -0.0034 | |
| 17 | 3.6126 | tRNA aminoacylation for protein translation | -0.0044 | -0.0068 | -0.0059 | -0.0059 | |
| 18 | 3.6126 | amino acid activation | -0.0044 | -0.0068 | -0.0059 | -0.0059 | |
| 19 | 3.6126 | tRNA aminoacylation | -0.0044 | -0.0068 | -0.0059 | -0.0059 | |
| 20 | 3.5079 | ion transporter activity | -0.0092 | -0.0144 | -0.0125 | -0.0123 | |
| 21 | 3.4751 | amine transport | -0.0055 | 0.0121 | -0.0075 | -0.0074 | |
| 22 | 3.3985 | amino acid transporter activity | -0.0045 | -0.0071 | -0.0061 | -0.0061 | |
| 23 | 3.2561 | cation transporter activity | -0.0085 | -0.0134 | -0.0116 | -0.0115 | |
| 24 | 3.246 | di-, tri-valent inorganic cation transport | 0.0258 | -0.0089 | -0.0078 | 0.0159 | |
| 25 | 3.1866 | transporter activity | -0.0055 | -0.0192 | -0.023 | 0.2493 | 0.2833 |
| 26 | 3.1644 | transition metal ion transport | 0.0291 | -0.0082 | -0.0071 | 0.0186 | |
| 27 | 3.0334 | basic amino acid transporter activity | -0.0022 | -0.0035 | -0.0031 | -0.003 | |
| 28 | 2.9412 | siderophore-iron transporter activity | -0.0016 | -0.0025 | -0.0022 | -0.0021 | |
| 29 | 2.9412 | siderophore transporter activity | -0.0016 | -0.0025 | -0.0022 | -0.0021 | |
| 30 | 2.9035 | metal ion transport | 0.0241 | -0.0094 | -0.0081 | 0.0144 | |
Table shows 30 most over-represented GO classes from the dataset, ordered with the log10(p-values). Dataset was generated with artificial signal embedded to five classes, shown with bold font in the class list. Table shows also correlation of each reported class with these signal classes. The clear strongest correlation is shown here with bold font. Signal classes are marked in columns using their ranking in the result list. Notice the varying number of strong correlations that different signal classes have.
Figure 3Comparison of topology-elimination algorithm and standard ranked list. A performance comparison between standard GO list from DAVID and Ontologizer output with the topology-elimination algorithm. We show the difference in the cumulative sum of positive GO classes across the top ranks. X-axis shows the rank, whereas the Y-axis shows the difference. Positive value indicate better performance by the Ontologizer algorithm and negative value better performance by the DAVID GO list. We summarize the comparison with 20 datasets showing five percentiles for each rank position: median (black line), 25 and 75 percentile (blue line) and minimum and maximum (cyan line). Notice that topology-elimination shows better performance across the top-15 ranks.
Figure 4Comparison of parent-child algorithm and standard ranked list. A performance comparison between standard GO list from DAVID and Ontologizer output with parent-child algorithm. X and Y axis are similar to figure 3. Positive value refers to better performance by Ontologizer algorithm and negative value better performance by DAVID GO list. Percentiles and their colouring is also identical to earlier fig. 4. Here the difference, in favor of the parent-child algorithm seems small.
Evaluation of DAVID and GENERATOR clustering using the generated test datasets
| 1 | 0.3 | 0.09 | 0.6 | 1 | 0.4 | 0.17 | 0.23 |
| 2 | 0.38 | 0.06 | 0.75 | 1 | 0.5 | 0.12 | 0.38 |
| 3 | 0.38 | 0.07 | 0.6 | 0.8 | 0.46 | 0.13 | 0.33 |
| 4 | 0.36 | 0.04 | 0.8 | 0.4 | 0.5 | 0.07 | 0.43 |
| 5 | 0.33 | 0.07 | 0.6 | 1 | 0.43 | 0.14 | 0.29 |
| 6 | 0.23 | 0.05 | 1 | 1 | 0.38 | 0.1 | 0.28 |
| 7 | 0.14 | 0.04 | 0.33 | 0.67 | 0.2 | 0.07 | 0.13 |
| 8 | 0.15 | 0.02 | 0.67 | 0.33 | 0.25 | 0.04 | 0.21 |
| 9 | 0.36 | 0.03 | 0.8 | 0.4 | 0.5 | 0.05 | 0.45 |
| 10 | 0.38 | 0.09 | 0.6 | 1 | 0.46 | 0.16 | 0.3 |
| 11 | 0.67 | 0.07 | 0.8 | 0.8 | 0.73 | 0.13 | 0.59 |
| 12 | 0.42 | 0.05 | 1 | 0.6 | 0.59 | 0.09 | 0.5 |
| 13 | 0.43 | 0.07 | 0.6 | 0.8 | 0.5 | 0.13 | 0.38 |
| 14 | 0.5 | 0.04 | 1 | 0.5 | 0.67 | 0.07 | 0.6 |
| 15 | 0.5 | 0.09 | 1 | 1 | 0.67 | 0.16 | 0.51 |
| 16 | 0.33 | 0.03 | 0.67 | 0.67 | 0.44 | 0.06 | 0.38 |
| 17 | 0.09 | 0.03 | 0.33 | 0.67 | 0.14 | 0.07 | 0.08 |
| 18 | 0.42 | 0.07 | 1 | 1 | 0.59 | 0.13 | 0.46 |
| 19 | 0.38 | 0.05 | 0.75 | 0.75 | 0.5 | 0.09 | 0.41 |
| 20 | 0.13 | 0.03 | 0.33 | 0.67 | 0.18 | 0.06 | 0.12 |
Table presents the Positive Predictive Value (PPV), sensitivity (sens.) and F1 score for GENERATOR(GEN.) and DAVID clustering results. Furthermore, we show the difference of F1 scores between two methods. Notice that the F1 score for GENERATOR is consistently better, as is shown by the difference in the last column. DAVID outperforms GENERATOR only in sensitivity, but this is natural as DAVID created 3-10 times more clusters.
Figure 5A toy example of the filtering using correlation. Circles represent various classes and thick line shows correlation between classes and dotted line an weak correlation. Classes have no correlation, when they lack a connecting line. Figure shows that if we have already class A among selected signal classes the first filtering discards classes B and E. Note that second filtering discards class C as we have a path of strong correlations through class B to it. Class D is left unfiltered. Positioning of circles in the figure is unrelated to the GO structure.