| Literature DB >> 23519237 |
Laura K Vaughan1, Vinodh Srinivasasainagendra.
Abstract
With the advent of high throughput data genomic technologies the volume of available data is now staggering. In addition databases that provide resources to annotate, translate, and connect biological data have grown exponentially in content and use. The availability of such data emphasizes the importance of bioinformatics and computational biology in genomics research and has led to the development of thousands of tools to integrate and utilize these resources. When utilizing such resources, the principles of reproducible research are often overlooked. In this manuscript we provide selected case studies illustrating issues that may arise while working with genes and genetic polymorphisms. These case studies illustrate potential sources of error which can be introduced if the practices of reproducible research are not employed and non-concurrent databases are used. We also show examples of a lack of transparency when these databases are concerned when using popular bioinformatics tools. These examples highlight that resources are constantly evolving, and in order to provide reproducible results, research should be aware of and connected to the correct release of the data, particularly when implementing computational tools.Entities:
Keywords: bioinformatics; database; gene mapping; reproducible research
Year: 2013 PMID: 23519237 PMCID: PMC3604632 DOI: 10.3389/fgene.2013.00038
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Inconsistency in gene “names” and locations.
| 9214 | 1 | Fas apoptotic inhibitory molecule 3 | 203466126-203483738 | 205144354-205161966 | 207076633-207095378 | ||
| 5284 | FLJ22667|MGC125361|MGC125362 | 1 | Polymeric immunoglobulin receptor | 203490267-203508202 | 205168495-205186430 | 207101869-207119811 | |
| 83953 | FCA/MR|FKSG87 | 1 | Fc receptor, IgA, IgM, high affinity | Information Not available | 205198027-205210593 | 207131404-207143970 | |
| 148 | ADRA1C|ADRA1L1|ALPHA1AAR | 8 | Adrenergic, alpha-1A-, receptor | 26683139-26778839 | 26683139-26778839 | 26627222-26722922 | |
| 147 | ADRA1|ALPHA1BAR | 5 | Adrenergic, alpha-1B-, receptor | 159276318-159332129 | 159276318-159332595 | 159343740-159400017 | |
| 146 | ADRA1| | 20 | Adrenergic, alpha-1D-, receptor | 4149816-4177659 | 4149278-4177659 | 4201278-4229659 | |
Bold genes indicate terms from original list from collaborator. Annotation information retrieved for the candidate gene list through Entrez gene's GeneInfo. GeneInfo data can be downloaded from Entrez gene's FTP location on September 10th 2010 (.
Discrepancies in the location of SNP rs2844871.
| Affymetrix 6.0 | NetAffx 30 | Hg18 | NCBI36 | Not specified | chr22:14459243 |
| NetAffx 31 | Hg19 | GRCH 37 | 131 | SNP is listed, but no position information given | |
| NetAffx 32 | Hg19 | GRCH 37 | 132 | SNP is listed, but no position information given | |
| UCSC genome browser | July 2011 | Hg18 | NCBI36 | 128 | chr22: 14459242 |
| Hg18 | NCBI36 | 130 | chr14: 19763716 | ||
| chr22: 16079242 | |||||
| Hg 19 | GRCH 37 | 132 | chr14:19763467 | ||
| chr22:16078993 | |||||
| chrUn_gl000244:34403 | |||||
| Hg 19 | GRCH 37 | 135 | chr14:19763467 | ||
| chr22:16078993 | |||||
| chrUn_gl000244:34403 | |||||
| dbSNP | July 2011 | Hg 19 | NCBI 37.1 | 132 | chr2: 125655701 |
| October 2012 | Hg19 | NCBI 37.3 | 137 | chr14: 19763717 | |
| chr22: 16079243 | |||||
| NA | |||||
| chr2:124523528 | |||||
| HapMap | Release 27 | Hg18 | Not specified | Not specified | chr22: 14459243 |
| HapMap | Release 28 | Hg18 | NCBI36 | 126 | chr22: 14459243 |
Genotyped on Affy 6.0 for Phase II samples, no dates or other information was given in HapMart.
Search results for SNP rs284487 from various sources. Search conducted on Jul 11th, 2011 and updated October 29th, 2012.
| Entrez Gene | |
| UCSC Genome Browser | |
| Affymetrix NetAffx Analysis Center | |
| dbSNP | |
| UCSC LiftOver | |
| My Experiment | |
| GenePattern GRRD | |
| Sweave | |
| HUGO | |
| DAVID | |
| GeneCruiser |