Literature DB >> 19768587

Mining SNPs from DNA sequence data; computational approaches to SNP discovery and analysis.

Jan van Oeveren1, Antoine Janssen.   

Abstract

Single nucleotide polymorphisms (SNPs) are the most abundant form of genetic variation and are the basis for most molecular markers. Before these SNPs can be used for direct sequence-based SNP detection or in a derived SNP assay, they need to be identified. For those regions or species where no validated SNPs are available in the public databases, a good alternative is to mine them from DNA sequences. The alignment of multiple sequence fragments originating from different genotypes representing the same region on the genome will allow for the discovery of sequence variants. The corresponding nucleotide mismatches are likely to be SNPs or insertions/deletions. A large amount of sequence data to be mined is present in the public databases (both expressed sequence tags and genomic sequences) and is free to use without having to do large-scale sequencing oneself. However, with the appearance of the next-generation sequencing machines (Roche GS/454, Illumina GA/Solexa, SOLiD), high-throughput sequencing is becoming widely available. This will allow for the sequencing of polymorphic genotypes on specific target areas and consequent SNP identification. In this paper we discuss the bioinformatics tools required to analyze DNA sequence data for SNP mining. A general approach for the consecutive steps in the mining process is described and commonly used SNP discovery pipelines are presented.

Entities:  

Mesh:

Year:  2009        PMID: 19768587     DOI: 10.1007/978-1-60327-411-1_4

Source DB:  PubMed          Journal:  Methods Mol Biol        ISSN: 1064-3745


  7 in total

Review 1.  The impact of next-generation sequencing on genomics.

Authors:  Jun Zhang; Rod Chiodini; Ahmed Badr; Genfa Zhang
Journal:  J Genet Genomics       Date:  2011-03-15       Impact factor: 4.275

2.  ReCombine: a suite of programs for detection and analysis of meiotic recombination in whole-genome datasets.

Authors:  Carol M Anderson; Stacy Y Chen; Michelle T Dimon; Ashwini Oke; Joseph L DeRisi; Jennifer C Fung
Journal:  PLoS One       Date:  2011-10-25       Impact factor: 3.240

3.  Transcriptome analysis of the oil-rich tea plant, Camellia oleifera, reveals candidate genes related to lipid metabolism.

Authors:  En-Hua Xia; Jian-Jun Jiang; Hui Huang; Li-Ping Zhang; Hai-Bin Zhang; Li-Zhi Gao
Journal:  PLoS One       Date:  2014-08-19       Impact factor: 3.240

4.  Transcriptome sequencing and SNP detection in Phoebe chekiangensis.

Authors:  Bing He; Yingang Li; Zhouxian Ni; Li-An Xu
Journal:  PeerJ       Date:  2017-05-10       Impact factor: 2.984

5.  EVA: Exome Variation Analyzer, an efficient and versatile tool for filtering strategies in medical genomics.

Authors:  Sophie Coutant; Chloé Cabot; Arnaud Lefebvre; Martine Léonard; Elise Prieur-Gaston; Dominique Campion; Thierry Lecroq; Hélène Dauchel
Journal:  BMC Bioinformatics       Date:  2012-09-07       Impact factor: 3.169

6.  Transcriptome characterisation of Pinus tabuliformis and evolution of genes in the Pinus phylogeny.

Authors:  Shi-Hui Niu; Zhe-Xin Li; Hu-Wei Yuan; Xiao-Yang Chen; Yue Li; Wei Li
Journal:  BMC Genomics       Date:  2013-04-18       Impact factor: 3.969

7.  Transcriptomic Analysis of Eucryptorrhynchus chinensis (Coleoptera: Curculionidae) Using 454 Pyrosequencing Technology.

Authors:  Zhen-Kai Liu; Jun-Bao Wen
Journal:  J Insect Sci       Date:  2016-09-12       Impact factor: 1.857

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.