Literature DB >> 15621661

DNA sequence error rates in Genbank records estimated using the mouse genome as a reference.

Philipp L Wesche1, Daniel J Gaffney, Peter D Keightley.   

Abstract

We estimate DNA sequence error rates in Genbank records containing protein-coding and non-coding DNA sequences by comparing sequences of the inbred mouse strain C57BL/6J, sequenced as part of the mouse genome project and independently by other laboratories. C57BL/6J was produced by more than 100 generations of brother-sister mating, and can be assumed to be virtually free of residual polymorphism and mutational variation, so differences between independent sequences can be attributed to error. The estimated single nucleotide error rate for coding DNA is 0.10% (SE 0.012%), which is substantially lower than previous estimates for error rates in Genbank accessions. The estimated single nucleotide error rate for intronic DNA sequences (0.22%; SE 0.051%) is significantly higher than the rate for coding DNA. Since error rates for the mouse genome sequence are very low, the vast majority of the errors we detected are likely to be in individual Genbank accessions. The frequency of insertion-deletion (indel) errors in non-coding DNA approaches that of single nucleotide errors in non-coding DNA, whereas indel errors are uncommon in coding sequences.

Entities:  

Mesh:

Year:  2004        PMID: 15621661     DOI: 10.1080/10425170400008972

Source DB:  PubMed          Journal:  DNA Seq        ISSN: 1026-7913


  10 in total

1.  A pilot study of bacterial genes with disrupted ORFs reveals a surprising profusion of protein sequence recoding mediated by ribosomal frameshifting and transcriptional realignment.

Authors:  Virag Sharma; Andrew E Firth; Ivan Antonov; Olivier Fayet; John F Atkins; Mark Borodovsky; Pavel V Baranov
Journal:  Mol Biol Evol       Date:  2011-06-14       Impact factor: 16.240

2.  A modular assembly cloning technique (aided by the BIOF software tool) for seamless and error-free assembly of long DNA fragments.

Authors:  Nadezhda A Orlova; Alexandre V Orlov; Ivan I Vorobiev
Journal:  BMC Res Notes       Date:  2012-06-18

3.  Djinn Lite: a tool for customised gene transcript modelling, annotation-data enrichment and exploration.

Authors:  Erdahl T Teber; Edward Crawford; Kent B Bolton; Derek Van Dyk; Peter R Schofield; Vimal Kapoor; W Bret Church
Journal:  BMC Bioinformatics       Date:  2006-01-23       Impact factor: 3.169

4.  A novel mini-DNA barcoding assay to identify processed fins from internationally protected shark species.

Authors:  Andrew T Fields; Debra L Abercrombie; Rowena Eng; Kevin Feldheim; Demian D Chapman
Journal:  PLoS One       Date:  2015-02-03       Impact factor: 3.240

5.  Comparative genomic analysis of vertebrate mitochondrial reveals a differential of rearrangements rate between taxonomic class.

Authors:  Paula Montaña-Lozano; Manuela Moreno-Carmona; Mauricio Ochoa-Capera; Natalia S Medina; Jeffrey L Boore; Carlos F Prada
Journal:  Sci Rep       Date:  2022-03-31       Impact factor: 4.379

6.  Accurate and fast methods to estimate the population mutation rate from error prone sequences.

Authors:  Bjarne Knudsen; Michael M Miyamoto
Journal:  BMC Bioinformatics       Date:  2009-08-11       Impact factor: 3.169

7.  Estimating variation within the genes and inferring the phylogeny of 186 sequenced diverse Escherichia coli genomes.

Authors:  Rolf S Kaas; Carsten Friis; David W Ussery; Frank M Aarestrup
Journal:  BMC Genomics       Date:  2012-10-31       Impact factor: 3.969

8.  Control control control: a reassessment and comparison of GenBank and chromatogram mtDNA sequence variation in Baltic grey seals (Halichoerus grypus).

Authors:  Katharina Fietz; Jeff A Graves; Morten Tange Olsen
Journal:  PLoS One       Date:  2013-08-16       Impact factor: 3.240

9.  Comprehensive assessment of the quality of Salmonella whole genome sequence data available in public sequence databases using the Salmonella in silico Typing Resource (SISTR).

Authors:  James Robertson; Catherine Yoshida; Peter Kruczkiewicz; Celine Nadon; Anil Nichani; Eduardo N Taboada; John Howard Eagles Nash
Journal:  Microb Genom       Date:  2018-01-17

10.  Illumina Next Generation Sequencing for the Analysis of Eimeria Populations in Commercial Broilers and Indigenous Chickens.

Authors:  Ankit T Hinsu; Jalpa R Thakkar; Prakash G Koringa; Vladimir Vrba; Subhash J Jakhesara; Androniki Psifidi; Javier Guitian; Fiona M Tomley; Dharamsibhai N Rank; Muthusamy Raman; Chaitanya G Joshi; Damer P Blake
Journal:  Front Vet Sci       Date:  2018-07-30
  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.