
Automated SNP detection in expressed sequence tags: statistical considerations and application to maritime pine sequences.

Loïck Le Dantec, David Chagné, David Pot, Olivier Cantin, Pauline Garnier-Géré, Frank Bedon, Jean-Marc Frigerio, Philippe Chaumeil, Patrick Léger, Virginie Garcia, Frédéric Laigret, Antoine De Daruvar, Christophe Plomion.

Abstract

We developed an automated pipeline for detecting single nucleotide polymorphisms (SNPs) in expressed sequence tag (EST) data sets by combining three DNA sequence analysis programs: Phred, Phrap and PolyBayes. The pipeline requires access to the individual electropherogram traces. First, a reference set of 65 SNPs was obtained by sequencing 30 gametes in 13 maritime pine (Pinus pinaster Ait.) gene fragments (6671 bp), giving a frequency of 1 SNP every 102.6 bp. Second, the parameters of the three programs were optimized to retrieve as many true SNPs as possible while keeping the false-positive rate as low as possible. Overall, 83.1% of the true SNPs were detected. However, this rate varied widely with the frequency of the rarer SNP allele: it fell to 41% for rare SNP alleles (frequency < 10%) and rose to 98% for allele frequencies above 10%. Third, the detection method was applied to the 18498 assembled maritime pine ESTs, identifying a total of 1400 candidate SNPs in contigs containing between 4 and 20 sequence reads. These genetic resources, described for the first time in a forest tree species, were made available at http://www.pierroton.inra/genetics/Pinesnps. We also derived an analytical expression for the SNP detection probability as a function of the SNP allele frequency, the number of haploid genomes used to generate the EST sequence database, and the sample size of the contigs considered for SNP detection. The frequency of the SNP allele was shown to be the main factor influencing the probability of SNP detection.
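The abstract does not reproduce the analytical expression itself, so the following Python sketch is only an illustration of how the three named factors could enter such a probability. It is not the authors' formula. It assumes a simple two-stage sampling model: each of the haploid genomes behind the EST library carries the rare allele independently with probability equal to the allele frequency, and each read in a contig is then drawn at random, with replacement, from those genomes. The function names, the `min_copies` threshold and the example parameter values are all hypothetical. The sketch also checks the reported SNP density (6671 bp / 65 SNPs).

```python
from math import comb


def snp_density(total_bp: int, n_snps: int) -> float:
    """Average number of base pairs per SNP."""
    return total_bp / n_snps


def binom_pmf(k: int, n: int, p: float) -> float:
    """Probability of exactly k successes in n independent trials."""
    return comb(n, k) * p**k * (1.0 - p) ** (n - k)


def prob_at_least(k_min: int, n: int, p: float) -> float:
    """Probability of at least k_min successes in n independent trials."""
    return 1.0 - sum(binom_pmf(k, n, p) for k in range(k_min))


def detection_probability(
    allele_freq: float, n_genomes: int, n_reads: int, min_copies: int = 2
) -> float:
    """Chance that the rare allele shows up in >= min_copies reads of a contig.

    ASSUMED model (not taken from the paper):
      1. m of the n_genomes haploid genomes carry the rare allele,
         with m ~ Binomial(n_genomes, allele_freq).
      2. Each of the n_reads reads carries the rare allele with
         probability m / n_genomes.
    Simplification: only the presence of the rare allele is tracked; the
    sketch does not also require the common allele to be observed.
    """
    total = 0.0
    for m in range(n_genomes + 1):
        weight = binom_pmf(m, n_genomes, allele_freq)
        total += weight * prob_at_least(min_copies, n_reads, m / n_genomes)
    return total


if __name__ == "__main__":
    # Reported in the abstract: 65 SNPs in 6671 bp.
    print(f"1 SNP every {snp_density(6671, 65):.1f} bp")

    # Hypothetical library of 30 haploid genomes; contig depths of 4 and 20
    # reads match the range quoted in the abstract.
    for freq in (0.05, 0.10, 0.30, 0.50):
        shallow = detection_probability(freq, n_genomes=30, n_reads=4)
        deep = detection_probability(freq, n_genomes=30, n_reads=20)
        print(f"freq={freq:.2f}  4 reads: {shallow:.3f}  20 reads: {deep:.3f}")
```

Under these assumptions the computed probability rises with both allele frequency and contig depth, which is qualitatively in line with the abstract's statement that allele frequency is the main factor. The numbers themselves should not be read as the paper's results.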

Year: 2004    PMID: 15284499    DOI: 10.1023/B:PLAN.0000036376.11710.6f

Source DB: PubMed    Journal: Plant Mol Biol    ISSN: 0167-4412    Impact factor: 4.076


