Literature DB >> 11779845

The K(A)/K(S) ratio test for assessing the protein-coding potential of genomic regions: an empirical and simulation study.

Anton Nekrutenko1, Kateryna D Makova, Wen-Hsiung Li.   

Abstract

Comparative genomics is a simple, powerful way to increase the accuracy of gene prediction. In this study, we show the utility of a simple test for the identification of protein-coding exons using human/mouse sequence comparisons. The test takes advantage of the fact that in the vast majority of coding regions, synonymous substitutions (K(S)) occur much more frequently than nonsynonymous ones (K(A)) and uses the K(A)/K(S) ratio as the criterion. We show the following: (1) most of the human and mouse exons are sufficiently long and have a suitable degree of sequence divergence for the test to perform reliably; (2) the test is suited for the identification of long exons and single exon genes, which are difficult to predict by current methods; (3) the test has a false-negative rate, lower than most of current gene prediction methods and a false-positive rate lower than all current methods; (4) the test has been automated and can be used in combination with other existing gene-prediction methods.

Entities:  

Mesh:

Substances:

Year:  2002        PMID: 11779845      PMCID: PMC155263          DOI: 10.1101/gr.200901

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


  11 in total

1.  Comparative analysis of noncoding regions of 77 orthologous mouse and human gene pairs.

Authors:  N Jareborg; E Birney; R Durbin
Journal:  Genome Res       Date:  1999-09       Impact factor: 9.043

2.  Evaluation of gene-finding programs on mammalian sequences.

Authors:  S Rogic; A K Mackworth; F B Ouellette
Journal:  Genome Res       Date:  2001-05       Impact factor: 9.043

3.  A codon-based model of nucleotide substitution for protein-coding DNA sequences.

Authors:  N Goldman; Z Yang
Journal:  Mol Biol Evol       Date:  1994-09       Impact factor: 16.240

4.  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice.

Authors:  J D Thompson; D G Higgins; T J Gibson
Journal:  Nucleic Acids Res       Date:  1994-11-11       Impact factor: 16.971

5.  Conservation, regulation, synteny, and introns in a large-scale C. briggsae-C. elegans genomic alignment.

Authors:  W J Kent; A M Zahler
Journal:  Genome Res       Date:  2000-08       Impact factor: 9.043

6.  Active conservation of noncoding sequences revealed by three-way species comparisons.

Authors:  I Dubchak; M Brudno; G G Loots; L Pachter; C Mayor; E M Rubin; K A Frazer
Journal:  Genome Res       Date:  2000-09       Impact factor: 9.043

7.  Human and mouse gene structure: comparative analysis and application to exon prediction.

Authors:  S Batzoglou; L Pachter; J P Mesirov; B Berger; E S Lander
Journal:  Genome Res       Date:  2000-07       Impact factor: 9.043

8.  Evolutionary parameters of the transcribed mammalian genome: an analysis of 2,820 orthologous rodent and human sequences.

Authors:  W Makalowski; M S Boguski
Journal:  Proc Natl Acad Sci U S A       Date:  1998-08-04       Impact factor: 11.205

9.  The sequence of the human genome.

Authors:  J C Venter; M D Adams; E W Myers; P W Li; R J Mural; G G Sutton; H O Smith; M Yandell; C A Evans; R A Holt; J D Gocayne; P Amanatides; R M Ballew; D H Huson; J R Wortman; Q Zhang; C D Kodira; X H Zheng; L Chen; M Skupski; G Subramanian; P D Thomas; J Zhang; G L Gabor Miklos; C Nelson; S Broder; A G Clark; J Nadeau; V A McKusick; N Zinder; A J Levine; R J Roberts; M Simon; C Slayman; M Hunkapiller; R Bolanos; A Delcher; I Dew; D Fasulo; M Flanigan; L Florea; A Halpern; S Hannenhalli; S Kravitz; S Levy; C Mobarry; K Reinert; K Remington; J Abu-Threideh; E Beasley; K Biddick; V Bonazzi; R Brandon; M Cargill; I Chandramouliswaran; R Charlab; K Chaturvedi; Z Deng; V Di Francesco; P Dunn; K Eilbeck; C Evangelista; A E Gabrielian; W Gan; W Ge; F Gong; Z Gu; P Guan; T J Heiman; M E Higgins; R R Ji; Z Ke; K A Ketchum; Z Lai; Y Lei; Z Li; J Li; Y Liang; X Lin; F Lu; G V Merkulov; N Milshina; H M Moore; A K Naik; V A Narayan; B Neelam; D Nusskern; D B Rusch; S Salzberg; W Shao; B Shue; J Sun; Z Wang; A Wang; X Wang; J Wang; M Wei; R Wides; C Xiao; C Yan; A Yao; J Ye; M Zhan; W Zhang; H Zhang; Q Zhao; L Zheng; F Zhong; W Zhong; S Zhu; S Zhao; D Gilbert; S Baumhueter; G Spier; C Carter; A Cravchik; T Woodage; F Ali; H An; A Awe; D Baldwin; H Baden; M Barnstead; I Barrow; K Beeson; D Busam; A Carver; A Center; M L Cheng; L Curry; S Danaher; L Davenport; R Desilets; S Dietz; K Dodson; L Doup; S Ferriera; N Garg; A Gluecksmann; B Hart; J Haynes; C Haynes; C Heiner; S Hladun; D Hostin; J Houck; T Howland; C Ibegwam; J Johnson; F Kalush; L Kline; S Koduru; A Love; F Mann; D May; S McCawley; T McIntosh; I McMullen; M Moy; L Moy; B Murphy; K Nelson; C Pfannkoch; E Pratts; V Puri; H Qureshi; M Reardon; R Rodriguez; Y H Rogers; D Romblad; B Ruhfel; R Scott; C Sitter; M Smallwood; E Stewart; R Strong; E Suh; R Thomas; N N Tint; S Tse; C Vech; G Wang; J Wetter; S Williams; M Williams; S Windsor; E Winn-Deen; K Wolfe; J Zaveri; K Zaveri; J F Abril; R Guigó; M J Campbell; K V Sjolander; B Karlak; A Kejariwal; H Mi; B Lazareva; T Hatton; A Narechania; K Diemer; A Muruganujan; N Guo; S Sato; V Bafna; S Istrail; R Lippert; R Schwartz; B Walenz; S Yooseph; D Allen; A Basu; J Baxendale; L Blick; M Caminha; J Carnes-Stine; P Caulk; Y H Chiang; M Coyne; C Dahlke; A Deslattes Mays; M Dombroski; M Donnelly; D Ely; S Esparham; C Fosler; H Gire; S Glanowski; K Glasser; A Glodek; M Gorokhov; K Graham; B Gropman; M Harris; J Heil; S Henderson; J Hoover; D Jennings; C Jordan; J Jordan; J Kasha; L Kagan; C Kraft; A Levitsky; M Lewis; X Liu; J Lopez; D Ma; W Majoros; J McDaniel; S Murphy; M Newman; T Nguyen; N Nguyen; M Nodell; S Pan; J Peck; M Peterson; W Rowe; R Sanders; J Scott; M Simpson; T Smith; A Sprague; T Stockwell; R Turner; E Venter; M Wang; M Wen; D Wu; M Wu; A Xia; A Zandieh; X Zhu
Journal:  Science       Date:  2001-02-16       Impact factor: 47.728

10.  Statistical methods for detecting molecular adaptation.

Authors: 
Journal:  Trends Ecol Evol       Date:  2000-12-01       Impact factor: 17.712

View more
  116 in total

1.  ETOPE: Evolutionary test of predicted exons.

Authors:  Anton Nekrutenko; Wen-Yu Chung; Wen-Hsiung Li
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

2.  CHOP: visualization of 'wobbling' and isolation of highly conserved regions from aligned DNA sequences.

Authors:  Masato Ohtsuka; Shohei Horiuchi; Jerzy K Kulski; Minoru Kimura; Hidetoshi Inoko
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

3.  Known and novel post-transcriptional regulatory sequences are conserved across plant families.

Authors:  Justin N Vaughn; Sally R Ellingson; Flavio Mignone; Albrecht von Arnim
Journal:  RNA       Date:  2012-01-11       Impact factor: 4.942

4.  Identification of genes with fast-evolving regions in microbial genomes.

Authors:  Yu Zheng; Richard J Roberts; Simon Kasif
Journal:  Nucleic Acids Res       Date:  2004-12-02       Impact factor: 16.971

5.  Identification of novel exons from rat-mouse comparisons.

Authors:  Anton Nekrutenko
Journal:  J Mol Evol       Date:  2004-11       Impact factor: 2.395

6.  Comprehensive identification of Drosophila dorsal-ventral patterning genes using a whole-genome tiling array.

Authors:  Frédéric Biemar; David A Nix; Jessica Piel; Brant Peterson; Matthew Ronshaugen; Victor Sementchenko; Ian Bell; J Robert Manak; Michael S Levine
Journal:  Proc Natl Acad Sci U S A       Date:  2006-08-14       Impact factor: 11.205

7.  Discovery of functional elements in 12 Drosophila genomes using evolutionary signatures.

Authors:  Alexander Stark; Michael F Lin; Pouya Kheradpour; Jakob S Pedersen; Leopold Parts; Joseph W Carlson; Madeline A Crosby; Matthew D Rasmussen; Sushmita Roy; Ameya N Deoras; J Graham Ruby; Julius Brennecke; Emily Hodges; Angie S Hinrichs; Anat Caspi; Benedict Paten; Seung-Won Park; Mira V Han; Morgan L Maeder; Benjamin J Polansky; Bryanne E Robson; Stein Aerts; Jacques van Helden; Bassem Hassan; Donald G Gilbert; Deborah A Eastman; Michael Rice; Michael Weir; Matthew W Hahn; Yongkyu Park; Colin N Dewey; Lior Pachter; W James Kent; David Haussler; Eric C Lai; David P Bartel; Gregory J Hannon; Thomas C Kaufman; Michael B Eisen; Andrew G Clark; Douglas Smith; Susan E Celniker; William M Gelbart; Manolis Kellis
Journal:  Nature       Date:  2007-11-08       Impact factor: 49.962

Review 8.  Models of coding sequence evolution.

Authors:  Wayne Delport; Konrad Scheffler; Cathal Seoighe
Journal:  Brief Bioinform       Date:  2008-10-29       Impact factor: 11.622

9.  Hypothetical proteins present during recovery phase of radiation resistant bacterium Deinococcus radiodurans are under purifying selection.

Authors:  Anubrata D Das; Hari S Misra
Journal:  J Mol Evol       Date:  2013-08-10       Impact factor: 2.395

10.  Revisiting the protein-coding gene catalog of Drosophila melanogaster using 12 fly genomes.

Authors:  Michael F Lin; Joseph W Carlson; Madeline A Crosby; Beverley B Matthews; Charles Yu; Soo Park; Kenneth H Wan; Andrew J Schroeder; L Sian Gramates; Susan E St Pierre; Margaret Roark; Kenneth L Wiley; Rob J Kulathinal; Peili Zhang; Kyl V Myrick; Jerry V Antone; Susan E Celniker; William M Gelbart; Manolis Kellis
Journal:  Genome Res       Date:  2007-11-07       Impact factor: 9.043

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.