Literature DB >> 12618381

SLAM: cross-species gene finding and alignment with a generalized pair hidden Markov model.

Marina Alexandersson1, Simon Cawley, Lior Pachter.   

Abstract

Comparative-based gene recognition is driven by the principle that conserved regions between related organisms are more likely than divergent regions to be coding. We describe a probabilistic framework for gene structure and alignment that can be used to simultaneously find both the gene structure and alignment of two syntenic genomic regions. A key feature of the method is the ability to enhance gene predictions by finding the best alignment between two syntenic sequences, while at the same time finding biologically meaningful alignments that preserve the correspondence between coding exons. Our probabilistic framework is the generalized pair hidden Markov model, a hybrid of (1). generalized hidden Markov models, which have been used previously for gene finding, and (2). pair hidden Markov models, which have applications to sequence alignment. We have built a gene finding and alignment program called SLAM, which aligns and identifies complete exon/intron structures of genes in two related but unannotated sequences of DNA. SLAM is able to reliably predict gene structures for any suitably related pair of organisms, most notably with fewer false-positive predictions compared to previous methods (examples are provided for Homo sapiens/Mus musculus and Plasmodium falciparum/Plasmodium vivax comparisons). Accuracy is obtained by distinguishing conserved noncoding sequence (CNS) from conserved coding sequence. CNS annotation is a novel feature of SLAM and may be useful for the annotation of UTRs, regulatory elements, and other noncoding features.

Entities:  

Mesh:

Substances:

Year:  2003        PMID: 12618381      PMCID: PMC430255          DOI: 10.1101/gr.424203

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


  20 in total

1.  An assessment of gene prediction accuracy in large DNA sequences.

Authors:  R Guigó; P Agarwal; J F Abril; M Burset; J W Fickett
Journal:  Genome Res       Date:  2000-10       Impact factor: 9.043

2.  Integrating genomic homology into gene structure prediction.

Authors:  I Korf; P Flicek; D Duan; M R Brent
Journal:  Bioinformatics       Date:  2001       Impact factor: 6.937

3.  Analysis of conserved noncoding DNA in Drosophila reveals similar constraints in intergenic and intronic sequences.

Authors:  C M Bergman; M Kreitman
Journal:  Genome Res       Date:  2001-08       Impact factor: 9.043

4.  Computational inference of homologous gene structures in the human genome.

Authors:  R F Yeh; L P Lim; C B Burge
Journal:  Genome Res       Date:  2001-05       Impact factor: 9.043

5.  The conserved exon method for gene finding.

Authors:  V Bafna; D H Huson
Journal:  Proc Int Conf Intell Syst Mol Biol       Date:  2000

6.  Genie--gene finding in Drosophila melanogaster.

Authors:  M G Reese; D Kulp; H Tammana; D Haussler
Journal:  Genome Res       Date:  2000-04       Impact factor: 9.043

7.  Conservation, regulation, synteny, and introns in a large-scale C. briggsae-C. elegans genomic alignment.

Authors:  W J Kent; A M Zahler
Journal:  Genome Res       Date:  2000-08       Impact factor: 9.043

8.  Human and mouse gene structure: comparative analysis and application to exon prediction.

Authors:  S Batzoglou; L Pachter; J P Mesirov; B Berger; E S Lander
Journal:  Genome Res       Date:  2000-07       Impact factor: 9.043

9.  Alignment of whole genomes.

Authors:  A L Delcher; S Kasif; R D Fleischmann; J Peterson; O White; S L Salzberg
Journal:  Nucleic Acids Res       Date:  1999-06-01       Impact factor: 16.971

10.  An apolipoprotein influencing triglycerides in humans and mice revealed by comparative sequencing.

Authors:  L A Pennacchio; M Olivier; J A Hubacek; J C Cohen; D R Cox; J C Fruchart; R M Krauss; E M Rubin
Journal:  Science       Date:  2001-10-05       Impact factor: 47.728

View more
  52 in total

1.  Identification and characterization of multi-species conserved sequences.

Authors:  Elliott H Margulies; Mathieu Blanchette; David Haussler; Eric D Green
Journal:  Genome Res       Date:  2003-12       Impact factor: 9.043

2.  Gene structure conservation aids similarity based gene prediction.

Authors:  Irmtraud M Meyer; Richard Durbin
Journal:  Nucleic Acids Res       Date:  2004-02-04       Impact factor: 16.971

3.  Evolution's cauldron: duplication, deletion, and rearrangement in the mouse and human genomes.

Authors:  W James Kent; Robert Baertsch; Angie Hinrichs; Webb Miller; David Haussler
Journal:  Proc Natl Acad Sci U S A       Date:  2003-09-19       Impact factor: 11.205

4.  GeneWise and Genomewise.

Authors:  Ewan Birney; Michele Clamp; Richard Durbin
Journal:  Genome Res       Date:  2004-05       Impact factor: 9.043

5.  Visualization of multiple genome annotations and alignments with the K-BROWSER.

Authors:  Kushal Chakrabarti; Lior Pachter
Journal:  Genome Res       Date:  2004-04       Impact factor: 9.043

6.  Accurate identification of novel human genes through simultaneous gene prediction in human, mouse, and rat.

Authors:  Colin Dewey; Jia Qian Wu; Simon Cawley; Marina Alexandersson; Richard Gibbs; Lior Pachter
Journal:  Genome Res       Date:  2004-04       Impact factor: 9.043

7.  DNA-energetics-based analyses suggest additional genes in prokaryotes.

Authors:  Garima Khandelwal; Jalaj Gupta; B Jayaram
Journal:  J Biosci       Date:  2012-07       Impact factor: 1.826

8.  Parametric inference for biological sequence analysis.

Authors:  Lior Pachter; Bernd Sturmfels
Journal:  Proc Natl Acad Sci U S A       Date:  2004-11-08       Impact factor: 11.205

9.  Iterative gene prediction and pseudogene removal improves genome annotation.

Authors:  Marijke J van Baren; Michael R Brent
Journal:  Genome Res       Date:  2006-05       Impact factor: 9.043

10.  Graemlin: general and robust alignment of multiple large interaction networks.

Authors:  Jason Flannick; Antal Novak; Balaji S Srinivasan; Harley H McAdams; Serafim Batzoglou
Journal:  Genome Res       Date:  2006-08-09       Impact factor: 9.043

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.