| Literature DB >> 16372901 |
Sebastian Noth1, Arndt Benecke.
Abstract
BACKGROUND: Significant inconsistencies between probe-to-gene annotations between different releases of probe set identifiers by commercial microarray platform solutions have been reported. Such inconsistencies lead to misleading or ambiguous interpretation of published gene expression results.Entities:
Mesh:
Substances:
Year: 2005 PMID: 16372901 PMCID: PMC1361791 DOI: 10.1186/1471-2105-6-307
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Venn diagrams comparing the different unique probe and gene sets according to probe-to-gene annotation releases and array versions. For explanation on gene ID suffixes, please refer to paragraph 5. Obsolete gene IDs and nomenclature suffixes. A1.0 = Human Genome Survey Array 1.0; A2.0 = Human Genome Survey Array 2.0; V1 = probe-to-gene annotation release 1; V2 = probe-to-gene annotation release 2.
PANTHER annotation versus disseminated annotation releases 1 & 2. The data from the Supplementary Files 1 & 2 were submitted to the PANTHER website either using the probe IDs or the gene IDs. The differences in identification, display, and absentee calls, between the datasets and the annotation releases are due to the fact that PANTHER is continuously updated, only considers protein coding genes, excludes GenBank-only annotated mRNAs, and retains only the gene ID with the highest suffix.
| V1 | Probe ID | 51 | 30 | 29 | 21 | 0 |
| Gene ID | 51 | 44 | 22 | 2 | 5 | |
| V2 | Probe ID | 110 | 65 | 59 | 45 | 0 |
| Gene ID | 110 | 98 | 44 | 0 | 12 | |
Probe/Gene IDs for MAPKK3. The different probe and gene IDs corresponding to the two annotation releases (V& and V2) where submitted to PANTHER with the listed results. This table summarizes for a single gene the differences and potential ambiguities when submitting gene IDs rather than probe IDs, and illustrates the effect of curation status (obsolete vs. valid) or annotation release. Note that >70% of all gene IDs in the current HGS V2 PGA table carry suffixes, that ~13% of all represented genes have more than one probe ID associated, and that probes often have secondary hits. All of these require different look-up tables to be generated by the user in order to achieve coherency and transparency in the data analysis process.
| 235514 | hCG1993739 | 2 × 10E-23 | hCG1993739 | 2 × 10–23 | hCG1993739 |
| 127877 | hCG1993739.1 | 6 × 10E-80 | hCG28371 | ||
| 182262 | hCG1993739.1 | 3 × 10E-70 | hCG1980405 | ||
| 234450 | hCG1993739.1 | 6 × 10E-58 | hCG1997534 | ||
| 235514 | hCG1993739 | 2 × 10E-23 | hCG1993739 | 2 × 10–23 | hCG1993739 |
| 127877 | hCG1993739.2 | 6 × 10E-80 | hCG28371 | ||
| 182262 | hCG1993739.2 | 3 × 10E-70 | hCG1980405 | ||
| 234450 | hCG1993739.2 | 6 × 10E-58 | hCG1997534 | ||
| 171301 | hCG1993739.2 | - | - | ||