| Literature DB >> 16549014 |
Enrique M Muro1, Carolina Perez-Iratxeta, Miguel A Andrade-Navarro.
Abstract
BACKGROUND: The annotations of Affymetrix DNA microarray probe sets with Gene Ontology terms are carefully selected for correctness. This results in very accurate but incomplete annotations which is not always desirable for microarray experiment evaluation.Entities:
Mesh:
Substances:
Year: 2006 PMID: 16549014 PMCID: PMC1435773 DOI: 10.1186/1471-2105-7-159
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Scheme of the process followed to retrieve GO terms associated to Affymetrix probe sets. Cylinders are databases, boxes are entries, diamonds and circles are attached properties. Plain lines are direct links, whereas arrows indicate a fuzzy relation of inclusion in the direction of the arrow. See text for details of those databases. (a) Sources of GO terms. Source 0: NetAffx. Source 1: links from NetAffx to other databases annotated with GO terms. Sources 2 and 3 use inference from properties associated to linked entries. Source 2 consists of GO terms derived from SwissProt keywords (KW2GO mapping) and MeSH terms (categories A, C, D, and G) from MEDLINE (MeSH2GO mapping). Source 3 consists of GO terms derived from sources 0, 1, and 2, by a mapping between GO terms (GO2GO mapping). (b) General schema for the definition of a mapping. A fuzzy mapping is computed by analysis of co-occurrences of the values of a property "p" attached to a database d1 entry (left) and another property "q" attached to database dentry (right) via any number of intermediate databases. (c) Mappings used in this work: KW2GO, MeSH2GO, and GO2GO. See Methods for details.
Coverage of Affymetrix probe sets with GO terms
| 0 | 22051 | - | 117761 | - | 23206 | - | 121031 | - |
| 1 | 24735 | 2756 | 144952 | 29095 | 26115 | 2978 | 147036 | 28298 |
| 2 | 9468 | 3 | 32332 | 1105 | 13917 | 9 | 41949 | 3705 |
| 3 | 12395 | 0 | 21792 | 3621 | 12657 | 0 | 20191 | 3605 |
| 1,2,3 | 24738 | 2759 | 149749 | 33821 | 26124 | 2987 | 154425 | 35608 |
The total of probe sets in MOE430 and HG-U133 chips are 45,101 and 44,760, respectively. New: number of probe sets or annotations not covered by any source with a lower number. Source 0 are the annotations given by NetAffx; 1 are annotations taken from linked databases; 2 are annotations implied by association to terms in linked databases; 3 are annotations implied by GO terms in 0, 1, or 2. 1,2,3 are the union of the annotations of sources 1, 2, and 3.
Recall and precision of amplified GO annotations respect to NetAffx given GO annotations.
| 1 | 98.3% | 79.9% | 98.1% | 80.7% |
| 2 | 25.4% | 92.7% | 30.0% | 86.6% |
| 3 | 13.3% | 72.0% | 12.2% | 73.4% |
| 1,2,3 | 98.4% | 77.4% | 98.1% | 76.9% |
Recall is defined as the percentage of NetAffx annotations (source 0) that are found in the Probe2GO annotations. Precision is the percentage of Probe2GO annotations that are provided by NetAffx (source 0). 1,2,3: union of the annotations of sources 1, 2, and 3.