Guohui Zhou, Xinyu Wen, Hang Liu, Michael J Schlicht, Martin J Hessner, Peter J Tonellato, Milton W Datta.
Abstract
BACKGROUND: Once specific genes are identified through high-throughput genomics technologies, the final gene list must be sorted down to a manageable size for validation studies. The triaging and sorting of genes often relies on supplemental information related to gene structure, metabolic pathways, and chromosomal location. Yet in disease states where the genes may lack identifiable structural elements, belong to poorly defined metabolic pathways, or have limited chromosomal data, flexible systems for obtaining additional data are necessary. In these situations, a tool for searching the biomedical literature using the list of identified genes while simultaneously defining additional search terms would be useful.
Year: 2004 PMID: 15117422 PMCID: PMC419696 DOI: 10.1186/1471-2105-5-46
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1. GeneInfo user interface
Figure 2. GeneInfo database in Rational Rose
Figure 3. Example output based on the two different word filtering options. The default filter searches the gene names together with the user's search terms. The gene search term splitter separates each gene name into its individual terms, combines each term with the user's search terms, and then merges the results. The default filter returns more references than the gene search term splitter.
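The difference between the two filters can be sketched in a few lines of Python. This is only one plausible reading of the caption, not the GeneInfo implementation: the function names `default_queries` and `splitter_queries`, the phrase quoting, and the use of `AND` to join terms are all assumptions made for illustration.

```python
def quote(term):
    """Wrap a term in double quotes so it is treated as a phrase."""
    return f'"{term}"'


def default_queries(gene_names, user_terms):
    """Assumed default filter: one query per full gene name plus all user terms."""
    extra = [quote(t) for t in user_terms]
    return [" AND ".join([quote(name)] + extra) for name in gene_names]


def splitter_queries(gene_names, user_terms):
    """Assumed splitter: one query per distinct word of each gene name."""
    extra = [quote(t) for t in user_terms]
    queries = []
    for name in gene_names:
        for word in name.split():
            query = " AND ".join([quote(word)] + extra)
            if query not in queries:  # keep only the first occurrence
                queries.append(query)
    return queries


# Gene names taken from the selenium prostate cancer gene table.
names = ["peroxiredoxin 1", "surfeit 5"]
print(default_queries(names, ["selenium"]))
print(splitter_queries(names, ["selenium"]))
```

Under this reading, the splitter issues more, shorter queries per gene name. How the real tool merges their results (and why it ends up with fewer references than the default filter) is not specified in the caption, so the sketch stops at query construction.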
Figure 4. Example output evaluated by individual search queries. In this example, the gene ID generated six gene name queries, each linked to the user-generated search term "antioxidant". The individual queries returned different numbers of references, which were combined in the final result. Each query is hyperlinked to PubMed, allowing the user to determine which queries contribute specific references to the final result.
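As a rough illustration of how per-query results might be combined while keeping track of which query contributed each reference, consider the sketch below. It is not the GeneInfo code: `combine_results` and `pubmed_link` are hypothetical helpers, the reference IDs are placeholders rather than real PMIDs, and the URL pattern is the present-day PubMed web search form, which is an assumption about how such a hyperlink could be built today.

```python
from urllib.parse import urlencode


def pubmed_link(query):
    """Build a PubMed web search URL for a query (assumed URL pattern)."""
    return "https://pubmed.ncbi.nlm.nih.gov/?" + urlencode({"term": query})


def combine_results(per_query):
    """Map each reference ID to the list of queries that returned it."""
    contributors = {}
    for query, ref_ids in per_query.items():
        for ref_id in ref_ids:
            contributors.setdefault(ref_id, []).append(query)
    return contributors


# Placeholder data: query text -> reference IDs returned (not real PMIDs).
per_query = {
    "query A": [101, 102],
    "query B": [102, 103],
    "query C": [],
}

combined = combine_results(per_query)
print("references in final result:", len(combined))
for query, ref_ids in per_query.items():
    print(query, len(ref_ids), pubmed_link(query))
```

Because the combined result is keyed by reference ID, a reference returned by several queries is counted once, while the per-query counts and links remain available for inspection.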
The Selenium Prostate Cancer Gene Table. Each UniGene ID was searched against PubMed using GeneInfo. Additional search terms were used and the number of references was recorded. The default filter was used in all searches.
| Hs.12646 | hypothetical protein FLJ22693 | 1 | 0 | 0 | 0 | 0 | 0 |
| Hs.12705 | hypothetical 43.1 kd protein | ||||||
| Hs.153636 | far upstream element (FUSE) binding protein 3 | ||||||
| Hs.167013 | dynamin 2 | ||||||
| Hs.180909 | peroxiredoxin 1 | 399 | 10 | 2 | 2 | 2 | 357 |
| Hs.19122 | eukaryotic translation initiation factor 4E-like 3 | ||||||
| Hs.19699 | Conserved gene telomeric to alpha globin cluster | 1005 | 10322 | 3434 | 9843 | 1000 | 1000 |
| Hs.21263 | suppressor of potassium transport defect 3 | ||||||
| Hs.25732 | eukaryotic translation initiation factor 4 gamma, 3 | ||||||
| Hs.26395 | erythrocyte membrane protein band 4.1-like 1 | ||||||
| Hs.2799 | cartilage linking protein 1 | 391 | 0 | 0 | 0 | 4 | 4 |
| Hs.3991 | CDC26 subunit of anaphase promoting complex | ||||||
| Hs.42586 | KIAA1560 protein | 1088 | 10322 | 3434 | 9843 | 1000 | 1003 |
| Hs.42959 | KIAA1012 protein | 1000 | 10322 | 3434 | 9843 | 1000 | 1000 |
| Hs.55608 | hypothetical protein MGC955 | ||||||
| Hs.75835 | phosphomannomutase 1 | 9 | 0 | 0 | 0 | 0 | 0 |
| Hs.76917 | F-box only protein 8 | 342 | 4 | 0 | 3 | 5 | 1 |
| Hs.78354 | surfeit 5 | 64 | 0 | 1 | 0 | 0 | 3 |
| Hs.808 | heterogeneous nuclear ribonucleoprotein F | 59 | 0 | 2 | 0 | 0 | 0 |
| Hs.8117 | erbb2 interacting protein | 36 | 0 | 2 | 1 | 1 | 0 |
| Hs.83070 | growth factor receptor-bound protein 14 | ||||||
| Hs.83954 | protein associated with PRK1 | 6 | 0 | 0 | 0 | 0 | 0 |
| Hs.87327 | EST | 51 | 0 | 0 | 0 | 0 | 0 |
| Hs.97477 | lysozyme homolog | 1000 | 1 | 16 | 1 | 73 | 136 |
BEAR GeneInfo search results with different filters. Results are shown for the selenium prostate cancer gene list searched with the additional terms "prostate cancer" and "selenium". Results are presented for the default filter and the gene search term splitter filter, and compared with hand-searched results from EndNote. All numbers are presented as (number of references returned/number of relevant references).
| Hs.12646 | hypothetical protein FLJ22693 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 |
| Hs.12705 | hypothetical 43.1 kd protein | 0/0 | 0/0 | ||||||
| Hs.153636 | far upstream element (FUSE) binding protein 3 | 0/0 | 0/0 | ||||||
| Hs.167013 | dynamin 2 | 2/2 | 0/0 | ||||||
| Hs.180909 | peroxiredoxin 1 | 0/0 | 0/0 | 0/0 | 2/2 | 2/2 | 2/2 | 0/0 | 12/12 |
| Hs.19122 | eukaryotic translation initiation factor 4E-like 3 | 0/0 | 0/0 | ||||||
| Hs.19699 | Conserved gene telomeric to alpha globin cluster | 10322/0 | 10322/0 | 0/0 | 0/0 | 3434/0 | 3434/0 | 0/0 | 0/0 |
| Hs.21263 | suppressor of potassium transport defect 3 | 0/0 | 0/0 | ||||||
| Hs.25732 | eukaryotic translation initiation factor 4 gamma, 3 | 0/0 | 0/0 | ||||||
| Hs.26395 | erythrocyte membrane protein band 4.1-like 1 | 0/0 | 0/0 | ||||||
| Hs.2799 | cartilage linking protein 1 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 |
| Hs.3991 | CDC26 subunit of anaphase promoting complex | 0/0 | 0/0 | ||||||
| Hs.42586 | KIAA1560 protein | 10322/0 | 10322/0 | 76706/0 | 0/0 | 3434/0 | 3434/0 | 76706/0 | 0/0 |
| Hs.42959 | KIAA1012 protein | 10322/0 | 10322/0 | 0/0 | 0/0 | 3434/0 | 3434/0 | 0/0 | 0/0 |
| Hs.55608 | hypothetical protein MGC955 | 0/0 | 0/0 | ||||||
| Hs.75835 | phosphomannomutase 1 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 |
| Hs.76917 | F-box only protein 8 | 4/4 | 4/4 | 0/0 | 1/1 | 0/0 | 0/0 | 0/0 | 0/0 |
| Hs.78354 | surfeit 5 | 0/0 | 0/0 | 7/7 | 1/1 | 1/1 | 1/1 | 8/8 | 1/1 |
| Hs.808 | heterogeneous nuclear ribonucleoprotein F | 0/0 | 0/0 | 0/0 | 3/3 | 0/0 | 0/0 | 0/0 | 0/0 |
| Hs.8117 | erbb2 interacting protein | 0/0 | 0/0 | 0/0 | 1/1 | 0/0 | 0/0 | 0/0 | 0/0 |
| Hs.83070 | growth factor receptor-bound protein 14 | 0/0 | 0/0 | ||||||
| Hs.83954 | protein associated with PRK1 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 |
| Hs.87327 | EST | 37/0 | 2/0 | ||||||
| Hs.97477 | lysozyme homolog | 1/1 | 1/1 | 1/1 | 0/0 | 16/16 | 16/16 | 20/20 | 0/0 |
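The "returned/relevant" cells in the table above lend themselves to a simple precision calculation (relevant references divided by references returned). The following is a minimal sketch; `parse_cell` and `precision` are hypothetical helper names, and treating a "0/0" cell as undefined rather than as zero is a choice made here, not something stated in the source.

```python
def parse_cell(cell):
    """Split a 'returned/relevant' cell into two integers."""
    returned, relevant = cell.strip().split("/")
    return int(returned), int(relevant)


def precision(cell):
    """Fraction of returned references that were relevant, or None if none returned."""
    returned, relevant = parse_cell(cell)
    if returned == 0:
        return None  # no references returned, so precision is undefined
    return relevant / returned


# Cells copied from the table above.
for cell in ["4/4", "10322/0", "37/0", "0/0"]:
    print(cell, precision(cell))
```

Cells such as "10322/0" show why the raw reference count alone is misleading: a query can return thousands of references, none of which are relevant to the gene.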