| Literature DB >> 19166590 |
Bart Hj van den Berg1, Jay H Konieczka, Fiona M McCarthy, Shane C Burgess.
Abstract
BACKGROUND: Systems biology modeling from microarray data requires the most contemporary structural and functional array annotation. However, microarray annotations, especially for non-commercial, non-traditional biomedical model organisms, are often dated. In addition, most microarray analysis tools do not readily accept EST clone names, which are abundantly represented on arrays. Manual re-annotation of microarrays is impracticable and so we developed a computational re-annotation tool (ArrayIDer) to retrieve the most recent accession mapping files from public databases based on EST clone names or accessions and rapidly generate database accessions for entire microarrays.Entities:
Mesh:
Year: 2009 PMID: 19166590 PMCID: PMC2636773 DOI: 10.1186/1471-2105-10-30
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1. EST clone names or EST accession numbers from input are searched against the UniGene database (DBS). Accessions without match (N/M = no match) are written to UniGene_NO_Match.xls. Matching accessions (M = Match) are searched against the IPI database. Accessions with no IPI matches are written to IPI_NO_Match.xls. Matching accessions are written to ArrayIDer_FINAL.xls. All data from IPI_NO_Match.xls is also written to ArrayIDer_FINAL.xls, since it contains identifier information retrieved from the UniGene database.
ArrayIDer output description.
| Column Name | Type of information |
| SEQ_TYPE | mRNA or EST |
| UNIGENE_ID | Corresponding NCBI UniGene database identifier |
| GENE_SYMBOL | Gene symbol provided by NCBI |
| GENE_ID | NCBI Entrez gene identifier |
| PROT_GB_ACC | Corresponding NCBI Protein accession number(s) |
| PROT_GI_NO | Corresponding NCBI Protein GI number(s) |
| PEPT_ACC | Corresponding Peptide accession(s) |
| Retrieval DB | Database of additional retrieved protein accession (Swiss-Prot/TrEMBL; RefSEQ, ENSMBLE) |
| DB_ACC | Accession number(s) corresponding to Retrieval DB |
| IPI_ID | Corresponding IPI identifier(s) |
| UNIPROT_ACC | Corresponding UniProtKB accession(s) |
| ENSEMBL_ID | Corresponding ENSEMBL identifier(s) |
| UNIPARC_ID | Corresponding UniParc identifier(s) |
The input information is indicated in bold text and the different types of information retrieved by ArrayIDer are shown. Identifiers can be used to cross-reference several other publicly available databases to retrieve additional information for genes of interest.
Figure 2Result format for ArrayIDer online. (A) The table contains a list with all input matches to the NCBI UniGene and IPI database. The results can be exported in Excel format using the link (B).