Kshitish K Acharya, Darshan S Chandrashekar, Neelima Chitturi, Hardik Shah, Varun Malhotra, K S Sreelakshmi, H Deepti, Akhilesh Bajpai, Sravanthi Davuluri, Pranami Bora, Leena Rao.
Abstract
BACKGROUND: In recent years, the number of gene expression profiling reports has risen. Unfortunately, it has not been possible to make full use of the available gene expression data. Many databases and programs can be used to derive the possible expression patterns of mammalian genes from existing data, but these resources have limitations. For example, it is not possible to obtain a list of genes that are expressed under certain conditions. To overcome such limitations, we have adopted a new strategy to predict gene expression patterns from available information, one tissue at a time.
Year: 2010 PMID: 20699007 PMCID: PMC3091663 DOI: 10.1186/1471-2164-11-467
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
The contribution of the number of gene-sets from each resource across species.
| Resources (no. of gene-sets*) | Human | Mouse | Rat |
|---|---|---|---|
| ArrayExpress | 36 (15537) | 9 (11438) | 45 (10452) |
| GEO | 131 (6337) | 302 (6359) | 24 (8594) |
| PubMed | 43 (225) | 138 (3791) | 41 (1116) |
| Total | 210 (7366) | 449 (7196) | 110 (6721) |
*The value in parentheses represents the average gene count across the gene-sets.
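The Total row can be checked with simple arithmetic. The sketch below is not from the paper: it copies the table values into a hypothetical `table` dictionary and recomputes the totals with a hypothetical helper, `total_row`. The gene-set counts are summed, and the parenthesized value in the Total row appears consistent with an unweighted mean of the three per-resource averages, which is an inference from the numbers rather than a method the authors state.

```python
# Values copied from the table above: (number of gene-sets, average gene count).
table = {
    "ArrayExpress": {"Human": (36, 15537), "Mouse": (9, 11438), "Rat": (45, 10452)},
    "GEO": {"Human": (131, 6337), "Mouse": (302, 6359), "Rat": (24, 8594)},
    "PubMed": {"Human": (43, 225), "Mouse": (138, 3791), "Rat": (41, 1116)},
}


def total_row(table):
    """Recompute the Total row (hypothetical helper, not the authors' code).

    Gene-set counts are summed per species. The average gene count is taken
    as the unweighted mean of the per-resource averages; this is an
    assumption that happens to reproduce the published Total row.
    """
    totals = {}
    for species in ("Human", "Mouse", "Rat"):
        counts = [table[resource][species][0] for resource in table]
        averages = [table[resource][species][1] for resource in table]
        totals[species] = (sum(counts), round(sum(averages) / len(averages)))
    return totals


if __name__ == "__main__":
    for species, (count, average) in total_row(table).items():
        print(f"{species}: {count} ({average})")
```

A mean weighted by the number of gene-sets would give a different figure, so the parenthesized totals are best read as a mean over resources, not over individual gene-sets.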
Figure 1. Categorization of gene-sets in MGEx-Tdb based on the 'number of genes per gene-set'. For example, there are 52 small gene-sets (each with fewer than 20 genes), and 51 of these were retrieved from the literature.
Extent of agreement for gene expression pattern of 13 genes, between MCD and the databases.
| Database | Score (maximum = 26) | % agreement with reports on individual gene studies |
|---|---|---|
| HPRD | 20 | 77 |
| MGEx-Tdb | 14 | 54 |
| UniGene | 12 | 46 |
| BioGPS | 11 | 42 |
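
The percentage column follows directly from the score and the stated maximum of 26. As a minimal sketch (the function name `percent_agreement` is hypothetical and the scores are copied from the table above), the values can be reproduced like this:

```python
def percent_agreement(score, maximum):
    """Return the score as a whole-number percentage of the maximum."""
    return round(100 * score / maximum)


# Scores copied from the table above; the maximum of 26 is stated in its header.
scores = {"HPRD": 20, "MGEx-Tdb": 14, "UniGene": 12, "BioGPS": 11}
percentages = {db: percent_agreement(score, 26) for db, score in scores.items()}

if __name__ == "__main__":
    for db, pct in percentages.items():
        print(f"{db}: {pct}%")
```

The same calculation applies to any score reported against a fixed maximum; only the `maximum` argument changes.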
Relative assessment of amount and details of information, and agreement with MCD.
| Database | Information availability and volume of supporting data: score (maximum = 33) | Information availability and volume of supporting data: % score | Agreement with reports on individual gene studies: score (maximum = 11) | Agreement with reports on individual gene studies: % agreement |
|---|---|---|---|---|
| HPRD* | 13 | 39 | 8 | 73 |
| MGEx-Tdb | 27 | 82 | 5 | 45 |
| UniGene | 14 | 42 | 3 | 27 |
| BioGPS | 11 | 33 | 5 | 45 |
Notes:
See additional file 8 for detailed methods;
*HPRD is specific for humans. Hence the score depicted for this database is not an average across the species.
Results in response to queries for expression status of genes under different testicular conditions.
| Database¹ (no. of genes retrieved per query) | Normal testis (human) | Azoospermia (human) | Asthenozoospermia (human) | Testicular cancer² (human) | Adjudin treatment (rat) | Developmental stage: postnatal (mouse) |
|---|---|---|---|---|---|---|
| MGEx-Tdb | 12753 | 5215 | 10 | 16617 | 10982 | 15209 |
| BioGPS³ | 403 | 14 | 0 | 3 | 0 | 2 |
| RefExA⁴ | 92 | 3 | 0 | 3 | NA | NA |
| TissueDistributionDBs⁴ | 16124 | 4 | 0 | 2 | 0 | 0 |
| UniGene⁴ | 18421 | 4 | 0 | 194 | 0 | 1⁵ |
| HPRD | 4249 | 2 | 0 | 0 | NA | NA |
Notes:
See additional file 9 for detailed methods;
NA: Not Applicable, i.e., data restricted to humans only in these databases;
¹The URLs are listed in additional file 4.
²Querying with testis AND cancer retrieved 116 genes in BioGPS; "Germ cell cancer" in testis retrieved 9655 genes (10042 hits) in TissueDistributionDBs; "Germ cell tumor" in testis retrieved 14874 genes (18130 hits) in UniGene and 2 genes in HPRD.
³Includes results from human, mouse and rat (the search cannot be restricted to a specific species).
⁴Number of genes in the results for each query, counted as probe names for RefExA and as cluster IDs for UniGene and TissueDistributionDBs.
⁵Querying with an alternative, equivalent term for postnatal (neonate) retrieved 10467 genes (11519 hits) in UniGene.
Results for expression status of genes at developmental stages in mouse testis from different databases.
| Databases¹ | Postnatal period | Specific postnatal stage (0-6 day/TS27) |
|---|---|---|
| MGEx-Tdb | 21016² | 21195 |
| Bgee | 15899 | 15501 |
| MGI (GXD) | 5954³ | NA |
| MRG | ~8500 | ~8500 |
| 4DXpress | ND | ND |
Notes:
See additional file 9 for detailed methods;
NA: Not Applicable, i.e., no query feature available for early postnatal stages of development; ND: No Data for the postnatal development stage (data available only up to TS 26); TS: Theiler stage (a term used to denote the stage of development of a mouse; Theiler, 1989); Bgee: a dataBase for Gene Expression Evolution; MGI (GXD): Mouse Genome Informatics (Gene Expression Database); MRG: Mammalian Reproductive Genetics; 4DXpress: EXpression database in 4D (four dimensions).
¹The references and URLs are listed in additional file 4.
²MGEx-Tdb results correspond to genes transcribed/dormant on specific days between days 0 and 20, and many genes cancel out because of contradictory results. However, better results can be obtained for a specific stage with this database, as the third column indicates.
³The results include some repeats.