Literature DB >> 12888525

Computational identification of protein coding potential of conserved sequence tags through cross-species evolutionary analysis.

Flavio Mignone1, Giorgio Grillo, Sabino Liuni, Graziano Pesole.   

Abstract

The identification of conserved sequence tags (CSTs) through comparative genome analysis may reveal important regulatory elements involved in shaping the spatio-temporal expression of genetic information. It is well known that the most significant fraction of CSTs observed in human-mouse comparisons correspond to protein coding exons, due to their strong evolutionary constraints. As we still do not know the complete gene inventory of the human and mouse genomes it is of the utmost importance to establish if detected conserved sequences are genes or not. We propose here a simple algorithm that, based on the observation of the specific evolutionary dynamics of coding sequences, efficiently discriminates between coding and non-coding CSTs. The application of this method may help the validation of predicted genes, the prediction of alternative splicing patterns in known and unknown genes and the definition of a dictionary of non-coding regulatory elements.

Entities:  

Mesh:

Substances:

Year:  2003        PMID: 12888525      PMCID: PMC169873          DOI: 10.1093/nar/gkg483

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  20 in total

1.  Integrating genomic homology into gene structure prediction.

Authors:  I Korf; P Flicek; D Duan; M R Brent
Journal:  Bioinformatics       Date:  2001       Impact factor: 6.937

2.  GeneMarkS: a self-training method for prediction of gene starts in microbial genomes. Implications for finding sequence motifs in regulatory regions.

Authors:  J Besemer; A Lomsadze; M Borodovsky
Journal:  Nucleic Acids Res       Date:  2001-06-15       Impact factor: 16.971

3.  SGP-1: prediction and validation of homologous genes based on sequence alignments.

Authors:  T Wiehe; S Gebauer-Jung; T Mitchell-Olds; R Guigó
Journal:  Genome Res       Date:  2001-09       Impact factor: 9.043

4.  Evaluation of gene-finding programs on mammalian sequences.

Authors:  S Rogic; A K Mackworth; F B Ouellette
Journal:  Genome Res       Date:  2001-05       Impact factor: 9.043

5.  Applications of generalized pair hidden Markov models to alignment and gene finding problems.

Authors:  Lior Pachter; Marina Alexandersson; Simon Cawley
Journal:  J Comput Biol       Date:  2002       Impact factor: 1.479

6.  Numerous potentially functional but non-genic conserved sequences on human chromosome 21.

Authors:  Emmanouil T Dermitzakis; Alexandre Reymond; Robert Lyle; Nathalie Scamuffa; Catherine Ucla; Samuel Deutsch; Brian J Stevenson; Volker Flegel; Philipp Bucher; C Victor Jongeneel; Stylianos E Antonarakis
Journal:  Nature       Date:  2002-12-05       Impact factor: 49.962

7.  Improved microbial gene identification with GLIMMER.

Authors:  A L Delcher; D Harmon; S Kasif; O White; S L Salzberg
Journal:  Nucleic Acids Res       Date:  1999-12-01       Impact factor: 16.971

Review 8.  Genomic strategies to identify mammalian regulatory sequences.

Authors:  L A Pennacchio; E M Rubin
Journal:  Nat Rev Genet       Date:  2001-02       Impact factor: 53.242

9.  Active conservation of noncoding sequences revealed by three-way species comparisons.

Authors:  I Dubchak; M Brudno; G G Loots; L Pachter; C Mayor; E M Rubin; K A Frazer
Journal:  Genome Res       Date:  2000-09       Impact factor: 9.043

10.  Evolutionary parameters of the transcribed mammalian genome: an analysis of 2,820 orthologous rodent and human sequences.

Authors:  W Makalowski; M S Boguski
Journal:  Proc Natl Acad Sci U S A       Date:  1998-08-04       Impact factor: 11.205

View more
  15 in total

1.  Discovery of functional elements in 12 Drosophila genomes using evolutionary signatures.

Authors:  Alexander Stark; Michael F Lin; Pouya Kheradpour; Jakob S Pedersen; Leopold Parts; Joseph W Carlson; Madeline A Crosby; Matthew D Rasmussen; Sushmita Roy; Ameya N Deoras; J Graham Ruby; Julius Brennecke; Emily Hodges; Angie S Hinrichs; Anat Caspi; Benedict Paten; Seung-Won Park; Mira V Han; Morgan L Maeder; Benjamin J Polansky; Bryanne E Robson; Stein Aerts; Jacques van Helden; Bassem Hassan; Donald G Gilbert; Deborah A Eastman; Michael Rice; Michael Weir; Matthew W Hahn; Yongkyu Park; Colin N Dewey; Lior Pachter; W James Kent; David Haussler; Eric C Lai; David P Bartel; Gregory J Hannon; Thomas C Kaufman; Michael B Eisen; Andrew G Clark; Douglas Smith; Susan E Celniker; William M Gelbart; Manolis Kellis
Journal:  Nature       Date:  2007-11-08       Impact factor: 49.962

2.  RNAcode: robust discrimination of coding and noncoding regions in comparative sequence data.

Authors:  Stefan Washietl; Sven Findeiss; Stephan A Müller; Stefan Kalkhof; Martin von Bergen; Ivo L Hofacker; Peter F Stadler; Nick Goldman
Journal:  RNA       Date:  2011-02-28       Impact factor: 4.942

3.  Conservation anchors in the vertebrate genome.

Authors:  Ronny Aloni; Doron Lancet
Journal:  Genome Biol       Date:  2005-06-29       Impact factor: 13.583

4.  Identifying alternative hyper-splicing signatures in MG-thymoma by exon arrays.

Authors:  Lilach Soreq; Adi Gilboa-Geffen; Sonia Berrih-Aknin; Paul Lacoste; Ariel Darvasi; Eyal Soreq; Hagai Bergman; Hermona Soreq
Journal:  PLoS One       Date:  2008-06-11       Impact factor: 3.240

5.  Physico-chemical fingerprinting of RNA genes.

Authors:  Ankita Singh; Akhilesh Mishra; Ali Khosravi; Garima Khandelwal; B Jayaram
Journal:  Nucleic Acids Res       Date:  2017-04-20       Impact factor: 16.971

6.  CSTminer: a web tool for the identification of coding and noncoding conserved sequence tags through cross-species genome comparison.

Authors:  Tiziana Castrignanò; Alessandro Canali; Giorgio Grillo; Sabino Liuni; Flavio Mignone; Graziano Pesole
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

7.  Distinguishing protein-coding from non-coding RNAs through support vector machines.

Authors:  Jinfeng Liu; Julian Gough; Burkhard Rost
Journal:  PLoS Genet       Date:  2006-04-28       Impact factor: 5.917

8.  Statistical assessment of discriminative features for protein-coding and non coding cross-species conserved sequence elements.

Authors:  Teresa M Creanza; David S Horner; Annarita D'Addabbo; Rosalia Maglietta; Flavio Mignone; Nicola Ancona; Graziano Pesole
Journal:  BMC Bioinformatics       Date:  2009-06-16       Impact factor: 3.169

9.  Accurate discrimination of conserved coding and non-coding regions through multiple indicators of evolutionary dynamics.

Authors:  Matteo Rè; Graziano Pesole; David S Horner
Journal:  BMC Bioinformatics       Date:  2009-09-08       Impact factor: 3.169

Review 10.  Differentiating protein-coding and noncoding RNA: challenges and ambiguities.

Authors:  Marcel E Dinger; Ken C Pang; Tim R Mercer; John S Mattick
Journal:  PLoS Comput Biol       Date:  2008-11-28       Impact factor: 4.475

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.