| Literature DB >> 21034472 |
Dennis P Wall1, Rimma Pivovarov, Mark Tong, Jae-Yoon Jung, Vincent A Fusaro, Todd F DeLuca, Peter J Tonellato.
Abstract
BACKGROUND: Disease-specific genetic information has been increasing at rapid rates as a consequence of recent improvements and massive cost reductions in sequencing technologies. Numerous systems designed to capture and organize this mounting sea of genetic data have emerged, but these resources differ dramatically in their disease coverage and genetic depth. With few exceptions, researchers must manually search a variety of sites to assemble a complete set of genetic evidence for a particular disease of interest, a process that is both time-consuming and error-prone.Entities:
Mesh:
Year: 2010 PMID: 21034472 PMCID: PMC2990725 DOI: 10.1186/1755-8794-3-50
Source DB: PubMed Journal: BMC Med Genomics ISSN: 1755-8794 Impact factor: 3.063
Databases integrated within Genotator (Statistics Gathered: June 19, 2010)
| Database | Website | Description | Statistics |
|---|---|---|---|
| GeneCards [ | [ | Searchable database of human genes that provides concise genomic, genetic and functional information | 73,017 Entries, 28,656 of them with symbols approved by the HUGO gene nomenclature committee |
| Genetic Association Database [ | [ | Archive of human genetic association studies of complex diseases and disorders | 2673 genes for 5636 diseases/phenotypes |
| HuGE Navigator [ | [ | Knowledge base including information on gene-disease and gene-gene associations | 9429 genes for 2215 diseases |
| Human Gene Mutation Database [ | [ | Database established for the study of mutational mechanisms in human genes | 72414 mutations in public release |
| Online Mendelian Inheritance in Man [ | [ | NCBI's compendium of human genes and genetic phenotypes | 13, 158 genes for 2799 phenotypes/diseases |
| Your Favorite Gene | [ | Database containing scientific descriptions and overviews of genes and their corresponding proteins with links to the most used gene data-banks | ~ 8595 genes |
| UniProt [ | [ | Central database of protein sequences with accurate, consistent, rich sequence and functional annotation | UniProtKB/Swiss-Prot Release 2010_06 of 18-May-10 of contains 517100 sequence entries |
| UniProtKB/TrEMBL Release 2010_07 of 15-Jun-2010 of contains 11109684 sequence entries | |||
| PharmGKB [ | [ | Database of primary genotype and phenotype data, annotate gene variants and gene-drug-disease relationships | 3,197 diseases, 26,216 genes |
| Entrez Gene | [ | Searchable database of genes, from RefSeq genomes, and defined by sequence and/or located in the NCBI Map Viewer | 42644 genes |
| WikiGenes [ | [ | Non-profit, open access database of articles on genes, proteins and chemical compounds | Global community |
| GenAtlas [ | [ | Database containing relevant information with respect to gene mapping and genetic diseases | 22466 genes/4434 phenotypes |
The characteristics included in Genotator from select resources.
| Database | Characteristics Incorporated |
|---|---|
| Genetic Association Database | List of Genes |
| "Yes" Association (published statement of link between gene and disease phenotype) | |
| "No" Association (published statement of no link between gene and disease) | |
| p-value (from genome wide association studies) | |
| HuGE Navigator | List of Genes (official gene symbols) |
| Gene Prospector Score [ | |
| List of PubMed References | |
| Entrez Gene | List of Genes (official gene symbols) |
| Official Full Name | |
| Symbol Synonyms | |
| Chromosome Number | |
| Location on Chromosome | |
| Gene ID | |
| Ensembl ID |
All other databases listed in Table 1 provided a list of genes only.
Figure 1Genotator algorithm. A user inputs a disease term into the search field of the Genotator web resource http://genotator.hms.harvard.edu to initiate the pipeline. Genotator then simultaneously queries the databases listed in Table 1 to generate an unordered set of genes while resolving synonyms and removing duplicate entries. The algorithm then gathers characteristics from specific databases, principally including GAD, Gene Prospector (from HuGE Navigator), and NCBI, in order to build an attribute vector and to calculate a relevance score (scoring algorithm detailed in the Implementation section, dark red fields are used in calculating the final Genotator score). The complete set of results, including the Genotator score, associated gene attributes and supporting publications, are displayed directly in the website and made available for download.
Figure 2Screen shot of the main components of Genotator. Genotator is freely accessible at http://genotator.hms.harvard.edu. (A) The site consists of a splash page, (B) a disorders/diseases page that stores the complete set of results for all queries in alphabetical order, (C) and a query page that accepts a single disease or comma separated disease list and email address for alerting users when the query completes.
The percent contribution of disease candidate genes from each of the 11 databases totaled over autism, Parkinson's Disease, and Alzheimer Disease.
| Database | % Contribution |
|---|---|
| GeneCards | 76.85 |
| HuGE Navigator | 43.09 |
| WikiGenes | 30.51 |
| Genetic Association Database | 11.02 |
| UniProt | 4.87 |
| Your Favorite Gene | 3.23 |
| Human Gene Mutation Database | 3.20 |
| PharmGKB | 2.88 |
| Online Mendelian Inheritance in Man | 2.27 |
| Entrez Gene | 1.52 |
| GenAtlas | 1.23 |
Figure 3Percentage of unique contribution made by each of the 11 Genotator resources. Although a sizeable percentage of results came from GeneCards, WikiGenes, and HugeNavigator (with each contributing well over 200 unique genes totaled across autism, PD, and AD), all but GeneAtlas and OMIM provided at least 9 unique genes. The total number of genes found in each database for the three diseases are listed above the graph. The percent of unique genes was normalized for each disease.
Top scoring autism genes ranked on Genotator score.
| Gene | Score | Literature Support | Reference | AG Classification |
|---|---|---|---|---|
| SLC6A4 | 136.7 | Linkage and association analysis at the serotonin transporter (SLC6A4) locus in a rigid-compulsive subset of autism. | [ | Probable |
| NRXN1 | 27.2 | Intragenic rearrangements in NRXN1 in three families with autism spectrum disorder, developmental delay, and speech delay. | [ | Promising |
| FMR1 | 22.6 | Association and transmission analysis of the FMR1 IVS10 + 14C-T variant in autism | [ | Probable |
| PTEN | 19.6 | Subset of individuals with autism spectrum disorders and extreme macrocephaly associated with germline PTEN tumour suppressor gene mutations. | [ | Probable |
| FRAXA* | 17.9 | Point mutation analysis of the FMR-1 gene in autism. | [ | |
| FRAXE* | 17.9 | Cognitive, behavioral, and neuroanatomical assessment of two unrelated male children expressing FRAXE. | [ | |
| FRAXF* | 17.3 | Mental impairment and attention deficit hyperactivity disorder in a family with FRAXF. | [ | |
| CNTNAP2 | 15.9 | Linkage, association, and gene-expression analyses identify CNTNAP2 as an autism-susceptibility gene. | [ | Probable |
| UBE3A | 14.5 | Linkage disequilibrium at the Angelman syndrome gene UBE3A in autism families. | [ | Probable |
| CDH10 | 12 | Common genetic variants on 5p14.1 associate with autism spectrum disorders. | [ |
The table lists the title and reference of the research study supporting the gene-disease association together with the official gene symbol. We used the review article by Abrahams and Geschwind (2008) as a source for the most promising autism gene candidates (AG classification). Starred genes are those that did not appear in SFARI Gene. The complete results are available online as Additional file 1.
Top 10 Parkinson's Disease Genes ranked on Genotator score.
| Gene | Score | Literature Support | Reference | WRW Mention |
|---|---|---|---|---|
| LRRK2 | 105.1 | Frequency of LRRK2 mutations in early-and late-onset Parkinson disease. | [ | Y |
| MAPT | 60.8 | Different MAPT haplotypes are associated with Parkinson's disease and progressive supranuclear palsy. | [ | Y |
| SNCA | 59.8 | Genome-wide association study confirms SNPs in SNCA and the MAPT region as common risk factors for Parkinson disease. | [ | Y |
| PARK2 | 59.6 | Case-control study of the parkin gene in early-onset Parkinson disease. | [ | Y |
| APOE | 34.7 | Phenotypic associations of tau and ApoE in Parkinson's disease. | [ | |
| GBA | 21.4 | Genotype-phenotype correlations between GBA mutations and Parkinson disease risk and onset. | [ | Y |
| BDNF | 19.5 | BDNF Val66Met polymorphism is associated with cognitive impairment in Italian patients with Parkinson's disease. | [ | |
| DRD2 | 18.7 | Association of DRD3 and GRIN2B with impulse control and related behaviors in Parkinson's disease. | [ | |
| MAOB | 17.1 | Association of variations in monoamine oxidases A and B with Parkinson's disease subgroups. | [ | |
| PINK1 | 16.8 | Parkin and PINK1 mutations in early-onset Parkinson's disease: comprehensive screening in publicly available cases and control. | [ | Y |
The table lists the title and reference of the research study supporting the gene-disease association together with the official gene symbol. We used the recent review article by Wider, Ross, and Wszolek (2010) as a source for the most promising Parkinson's disease gene candidates (WRW Mention). A complete set of results is available online as Additional file 2.
Top 10 Alzheimer Disease Genes ranked on Genotator score.
| Gene | Score | Literature Support | Reference |
|---|---|---|---|
| APOE | 513 | Effect of APOE genotype on amyloid plaque load and gray matter volume in Alzheimer disease. | [ |
| PSEN1 | 45.4 | Early Onset Alzheimer's Disease with Spastic Paraparesis, Dysarthria and Seizures and N135 S Mutation in PSEN. | [ |
| ACE | 28.4 | Amyloid-beta-Related Genes SORL1 and ACE are Genetically Associated With Risk for Late-onset Alzheimer Disease in the Chinese Population. | [ |
| PRNP | 23.1 | Earlier onset of Alzheimer's disease: risk polymorphisms within PRNP, PRND, CYP46, and APOE genes. | [ |
| BCHE | 22.7 | Dipeptidyl carboxypeptidase 1 (DCP1) and butyrylcholinesterase (BCHE) gene interactions with the apolipoprotein E epsilon4 allele as risk factors in Alzheimer's disease and in Parkinson's disease with coexisting Alzheimer pathology. | [ |
| APOC1 | 20.8 | APOE and APOC1 promoter polymorphisms and the risk of Alzheimer disease in African American and Caribbean Hispanic individuals | [ |
| IL1B | 19.9 | Assessment of Alzheimer's disease case-control associations using family-based methods. | [ |
| IL1A | 19.5 | Pharmacogenomics in Alzheimer's disease. | [ |
| MTHFR | 19.3 | Association of MTHFR gene polymorphism C677T with susceptibility to late-onset Alzheimer's disease. | [ |
| BDNF | 17.5 | Association between BDNF Val66Met polymorphism and Alzheimer disease, dementia with Lewy bodies, and Pick disease. | [ |
The table lists the title and reference of the research study supporting the gene-disease association together with the official gene symbol. A complete set of results is available online as Additional file 3.