Literature DB >> 14534179

Modeling sequencing errors by combining Hidden Markov models.

C Lottaz1, C Iseli, C V Jongeneel, P Bucher.   

Abstract

Among the largest resources for biological sequence data is the large amount of expressed sequence tags (ESTs) available in public and proprietary databases. ESTs provide information on transcripts but for technical reasons they often contain sequencing errors. Therefore, when analyzing EST sequences computationally, such errors must be taken into account. Earlier attempts to model error prone coding regions have shown good performance in detecting and predicting these while correcting sequencing errors using codon usage frequencies. In the research presented here, we improve the detection of translation start and stop sites by integrating a more complex mRNA model with codon usage bias based error correction into one hidden Markov model (HMM), thus generalizing this error correction approach to more complex HMMs. We show that our method maintains the performance in detecting coding sequences.

Mesh:

Year:  2003        PMID: 14534179     DOI: 10.1093/bioinformatics/btg1067

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  69 in total

1.  An interactive bovine in silico SNP database (IBISS).

Authors:  Rachel J Hawken; Wesley C Barris; Sean M McWilliam; Brian P Dalrymple
Journal:  Mamm Genome       Date:  2004-10       Impact factor: 2.957

2.  Analysis of 13000 unique Citrus clusters associated with fruit quality, production and salinity tolerance.

Authors:  Javier Terol; Ana Conesa; Jose M Colmenero; Manuel Cercos; Francisco Tadeo; Javier Agustí; Enriqueta Alós; Fernando Andres; Guillermo Soler; Javier Brumos; Domingo J Iglesias; Stefan Götz; Francisco Legaz; Xavier Argout; Brigitte Courtois; Patrick Ollitrault; Carole Dossat; Patrick Wincker; Raphael Morillon; Manuel Talon
Journal:  BMC Genomics       Date:  2007-01-25       Impact factor: 3.969

3.  Phylogenomics reveals deep molluscan relationships.

Authors:  Kevin M Kocot; Johanna T Cannon; Christiane Todt; Mathew R Citarella; Andrea B Kohn; Achim Meyer; Scott R Santos; Christoffer Schander; Leonid L Moroz; Bernhard Lieb; Kenneth M Halanych
Journal:  Nature       Date:  2011-09-04       Impact factor: 49.962

4.  Hidden Markov Models and their Applications in Biological Sequence Analysis.

Authors:  Byung-Jun Yoon
Journal:  Curr Genomics       Date:  2009-09       Impact factor: 2.236

5.  Population-level transcriptome sequencing of nonmodel organisms Erynnis propertius and Papilio zelicaon.

Authors:  Shawn T O'Neil; Jason D K Dzurisin; Rory D Carmichael; Neil F Lobo; Scott J Emrich; Jessica J Hellmann
Journal:  BMC Genomics       Date:  2010-05-17       Impact factor: 3.969

6.  OrthoSelect: a web server for selecting orthologous gene alignments from EST sequences.

Authors:  Fabian Schreiber; Gert Wörheide; Burkhard Morgenstern
Journal:  Nucleic Acids Res       Date:  2009-06-02       Impact factor: 16.971

7.  OrthoSelect: a protocol for selecting orthologous groups in phylogenomics.

Authors:  Fabian Schreiber; Kerstin Pick; Dirk Erpenbeck; Gert Wörheide; Burkhard Morgenstern
Journal:  BMC Bioinformatics       Date:  2009-07-16       Impact factor: 3.169

8.  The effect of sequencing errors on metagenomic gene prediction.

Authors:  Katharina J Hoff
Journal:  BMC Genomics       Date:  2009-11-12       Impact factor: 3.969

9.  Accurate and fast methods to estimate the population mutation rate from error prone sequences.

Authors:  Bjarne Knudsen; Michael M Miyamoto
Journal:  BMC Bioinformatics       Date:  2009-08-11       Impact factor: 3.169

10.  Assembly of 500,000 inter-specific catfish expressed sequence tags and large scale gene-associated marker development for whole genome association studies.

Authors:  Shaolin Wang; Eric Peatman; Jason Abernathy; Geoff Waldbieser; Erika Lindquist; Paul Richardson; Susan Lucas; Mei Wang; Ping Li; Jyothi Thimmapuram; Lei Liu; Deepika Vullaganti; Huseyin Kucuktas; Christopher Murdock; Brian C Small; Melanie Wilson; Hong Liu; Yanliang Jiang; Yoona Lee; Fei Chen; Jianguo Lu; Wenqi Wang; Peng Xu; Benjaporn Somridhivej; Puttharat Baoprasertkul; Jonas Quilang; Zhenxia Sha; Baolong Bao; Yaping Wang; Qun Wang; Tomokazu Takano; Samiran Nandi; Shikai Liu; Lilian Wong; Ludmilla Kaltenboeck; Sylvie Quiniou; Eva Bengten; Norman Miller; John Trant; Daniel Rokhsar; Zhanjiang Liu
Journal:  Genome Biol       Date:  2010-01-22       Impact factor: 13.583

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.