Literature DB >> 19877174

Single nucleotide differences (SNDs) in the dbSNP database may lead to errors in genotyping and haplotyping studies.

Lucia Musumeci1, Jonathan W Arthur, Florence S G Cheung, Ashraful Hoque, Scott Lippman, Juergen K V Reichardt.   

Abstract

The creation of single nucleotide polymorphism (SNP) databases (such as NCBI dbSNP) has facilitated scientific research in many fields. SNP discovery and detection has improved to the extent that there are over 17 million human reference (rs) SNPs reported to date (Build 129 of dbSNP). SNP databases are unfortunately not always complete and/or accurate. In fact, half of the reported SNPs are still only candidate SNPs and are not validated in a population. We describe the identification of SNDs (single nucleotide differences) in humans, that may contaminate the dbSNP database. These SNDs, reported as real SNPs in the database, do not exist as such, but are merely artifacts due to the presence of a paralogue (highly similar duplicated) sequence in the genome. Using sequencing we showed how SNDs could originate in two paralogous genes and evaluated samples from a population of 100 individuals for the presence/absence of SNPs. Moreover, using bioinformatics, we predicted as many as 8.32% of the biallelic, coding SNPs in the dbSNP database to be SNDs. Our identification of SNDs in the database will allow researchers to not only select truly informative SNPs for association studies, but also aid in determining accurate SNP genotypes and haplotypes.

Entities:  

Mesh:

Substances:

Year:  2010        PMID: 19877174      PMCID: PMC2797835          DOI: 10.1002/humu.21137

Source DB:  PubMed          Journal:  Hum Mutat        ISSN: 1059-7794            Impact factor:   4.878


  17 in total

1.  dbSNP: the NCBI database of genetic variation.

Authors:  S T Sherry; M H Ward; M Kholodov; J Baker; L Phan; E M Smigielski; K Sirotkin
Journal:  Nucleic Acids Res       Date:  2001-01-01       Impact factor: 16.971

2.  Quality and completeness of SNP databases.

Authors:  David E Reich; Stacey B Gabriel; David Altshuler
Journal:  Nat Genet       Date:  2003-03-24       Impact factor: 38.330

3.  Discrepancies in dbSNP confirmation rates and allele frequency distributions from varying genotyping error rates and patterns.

Authors:  Adele A Mitchell; Michael E Zwick; Aravinda Chakravarti; David J Cutler
Journal:  Bioinformatics       Date:  2004-02-05       Impact factor: 6.937

4.  Complex SNP-related sequence variation in segmental genome duplications.

Authors:  David Fredman; Stefan J White; Susanna Potter; Evan E Eichler; Johan T Den Dunnen; Anthony J Brookes
Journal:  Nat Genet       Date:  2004-07-11       Impact factor: 38.330

Review 5.  A renaissance of "biochemical genetics"? SNPs, haplotypes, function, and complex diseases.

Authors:  Ruty Mehrian-Shai; Juergen K V Reichardt
Journal:  Mol Genet Metab       Date:  2004 Sep-Oct       Impact factor: 4.797

Review 6.  SNP discovery in associating genetic variation with human disease phenotypes.

Authors:  Yousin Suh; Jan Vijg
Journal:  Mutat Res       Date:  2005-06-03       Impact factor: 2.433

Review 7.  Repbase Update, a database of eukaryotic repetitive elements.

Authors:  J Jurka; V V Kapitonov; A Pavlicek; P Klonowski; O Kohany; J Walichiewicz
Journal:  Cytogenet Genome Res       Date:  2005       Impact factor: 1.636

8.  Potential etiologic and functional implications of genome-wide association loci for human diseases and traits.

Authors:  Lucia A Hindorff; Praveen Sethupathy; Heather A Junkins; Erin M Ramos; Jayashri P Mehta; Francis S Collins; Teri A Manolio
Journal:  Proc Natl Acad Sci U S A       Date:  2009-05-27       Impact factor: 11.205

Review 9.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.

Authors:  S F Altschul; T L Madden; A A Schäffer; J Zhang; Z Zhang; W Miller; D J Lipman
Journal:  Nucleic Acids Res       Date:  1997-09-01       Impact factor: 16.971

10.  Current limitations of SNP data from the public domain for studies of complex disorders: a test for ten candidate genes for obesity and osteoporosis.

Authors:  Volodymyr Dvornyk; Ji-Rong Long; Dong-Hai Xiong; Peng-Yuan Liu; Lan-Juan Zhao; Hui Shen; Yuan-Yuan Zhang; Yong-Jun Liu; Sonia Rocha-Sanchez; Peng Xiao; Robert R Recker; Hong-Wen Deng
Journal:  BMC Genet       Date:  2004-02-25       Impact factor: 2.797

View more
  24 in total

1.  FOXO3 gene variants and human aging: coding variants may not be key players.

Authors:  Timothy A Donlon; J David Curb; Qimei He; John S Grove; Kamal H Masaki; Beatriz Rodriguez; Ayako Elliott; D Craig Willcox; Bradley J Willcox
Journal:  J Gerontol A Biol Sci Med Sci       Date:  2012-03-28       Impact factor: 6.053

Review 2.  Variation in the mutation rate across mammalian genomes.

Authors:  Alan Hodgkinson; Adam Eyre-Walker
Journal:  Nat Rev Genet       Date:  2011-10-04       Impact factor: 53.242

Review 3.  Impact of host genetic polymorphisms on vaccine induced antibody response.

Authors:  Janina E Linnik; Adrian Egli
Journal:  Hum Vaccin Immunother       Date:  2016-01-25       Impact factor: 3.452

Review 4.  Computational approaches to study the effects of small genomic variations.

Authors:  Kamil Khafizov; Maxim V Ivanov; Olga V Glazova; Sergei P Kovalenko
Journal:  J Mol Model       Date:  2015-09-08       Impact factor: 1.810

Review 5.  Online tools for bioinformatics analyses in nutrition sciences.

Authors:  Sridhar A Malkaram; Yousef I Hassan; Janos Zempleni
Journal:  Adv Nutr       Date:  2012-09-01       Impact factor: 8.701

6.  High-quality DNA sequence capture of 524 disease candidate genes.

Authors:  Peidong Shen; Wenyi Wang; Sujatha Krishnakumar; Curtis Palm; Aung-Kyaw Chi; Gregory M Enns; Ronald W Davis; Terence P Speed; Michael N Mindrinos; Curt Scharfe
Journal:  Proc Natl Acad Sci U S A       Date:  2011-04-05       Impact factor: 11.205

7.  Exome sequencing for gene discovery: time does not stand still.

Authors:  Mark S LeDoux
Journal:  Ann Neurol       Date:  2012-10       Impact factor: 10.422

Review 8.  Role of aldo-keto reductase family 1 (AKR1) enzymes in human steroid metabolism.

Authors:  Tea Lanišnik Rižner; Trevor M Penning
Journal:  Steroids       Date:  2013-11-01       Impact factor: 2.668

9.  Identification and functional characterization of new missense SNPs in the coding region of the TP53 gene.

Authors:  Flora Doffe; Vincent Carbonnier; Manon Tissier; Bernard Leroy; Isabelle Martins; Johanna S M Mattsson; Patrick Micke; Sarka Pavlova; Sarka Pospisilova; Jana Smardova; Andreas C Joerger; Klas G Wiman; Guido Kroemer; Thierry Soussi
Journal:  Cell Death Differ       Date:  2020-11-30       Impact factor: 15.828

10.  Using false discovery rates to benchmark SNP-callers in next-generation sequencing projects.

Authors:  Rhys A Farrer; Daniel A Henk; Dan MacLean; David J Studholme; Matthew C Fisher
Journal:  Sci Rep       Date:  2013       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.