
Accurate identification of novel human genes through simultaneous gene prediction in human, mouse, and rat.

Colin Dewey, Jia Qian Wu, Simon Cawley, Marina Alexandersson, Richard Gibbs, Lior Pachter.

Abstract

We describe a new method for simultaneously identifying novel homologous genes with identical structure in the human, mouse, and rat genomes by combining pairwise predictions made with the SLAM gene-finding program. Using this method, we found 3698 gene triples in the human, mouse, and rat genomes that are predicted with exactly the same gene structure. We show, both computationally and experimentally, that the introns of these triples are predicted accurately as compared with the introns of other ab initio gene prediction sets. Computationally, we compared the introns of these gene triples, as well as those from other ab initio gene finders, with known intron annotations. We show that a unique property of SLAM, namely that it predicts gene structures simultaneously in two organisms, is key to producing sets of predictions that are highly accurate in intron structure when combined with other programs. Experimentally, we performed reverse transcription-polymerase chain reaction (RT-PCR) in both human and rat to test the exon pairs flanking introns from a subset of the gene triples for which the human gene had not been previously identified. By performing RT-PCR on orthologous introns in both the human and rat genomes, we additionally explore the validity of using RT-PCR as a method for confirming gene predictions.
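The abstract does not spell out how the pairwise predictions are combined, so the following is only a minimal sketch under stated assumptions, not the authors' implementation. It assumes each pairwise run (human-mouse, human-rat, mouse-rat) yields a set of paired gene structures, that a gene structure can be represented as a tuple of exon coordinates, and that a triple is kept only when all three pairwise runs agree on it. The names `consistent_triples`, `introns`, and `intron_accuracy` are hypothetical. The second half illustrates an exact-match comparison of predicted introns against annotated ones.

```python
from typing import Dict, Iterable, List, Set, Tuple

# Assumption: a gene structure is a tuple of (start, end) exon coordinates,
# 1-based and inclusive, sorted by position.
Gene = Tuple[Tuple[int, int], ...]
Pair = Tuple[Gene, Gene]
Intron = Tuple[int, int]


def consistent_triples(
    hm: Set[Pair], hr: Set[Pair], mr: Set[Pair]
) -> List[Tuple[Gene, Gene, Gene]]:
    """Return (human, mouse, rat) triples supported by all three pairwise sets."""
    # Index the human-rat pairs by human gene for quick lookup.
    rat_by_human: Dict[Gene, Set[Gene]] = {}
    for h, r in hr:
        rat_by_human.setdefault(h, set()).add(r)

    triples = []
    for h, m in sorted(hm):
        for r in sorted(rat_by_human.get(h, ())):
            # Keep the triple only if the mouse-rat run pairs the same genes.
            if (m, r) in mr:
                triples.append((h, m, r))
    return triples


def introns(gene: Gene) -> Set[Intron]:
    """Introns are the gaps between consecutive exons of one gene."""
    return {
        (left_end + 1, right_start - 1)
        for (_, left_end), (right_start, _) in zip(gene, gene[1:])
    }


def intron_accuracy(
    predicted: Iterable[Gene], annotated: Iterable[Gene]
) -> Tuple[float, float]:
    """Return (fraction of predicted introns that are annotated,
    fraction of annotated introns that were predicted), by exact match."""
    pred: Set[Intron] = set().union(*(introns(g) for g in predicted))
    known: Set[Intron] = set().union(*(introns(g) for g in annotated))
    shared = len(pred & known)
    frac_pred = shared / len(pred) if pred else 0.0
    frac_known = shared / len(known) if known else 0.0
    return frac_pred, frac_known
```

With toy coordinates, a human gene paired with the same mouse and rat genes in all three sets survives as a triple, while a gene seen in only one pairwise set is dropped. Requiring agreement across all three sets is a deliberately strict filter chosen for illustration; the published method may define "exactly the same gene structure" differently.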

Year:  2004        PMID: 15060007      PMCID: PMC383310          DOI: 10.1101/gr.1939804

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


References: 17 in total (first 10 shown)

1.  Applications of generalized pair hidden Markov models to alignment and gene finding problems.

Authors:  Lior Pachter; Marina Alexandersson; Simon Cawley
Journal:  J Comput Biol       Date:  2002       Impact factor: 1.479

2.  The human genome browser at UCSC.

Authors:  W James Kent; Charles W Sugnet; Terrence S Furey; Krishna M Roskin; Tom H Pringle; Alan M Zahler; David Haussler
Journal:  Genome Res       Date:  2002-06       Impact factor: 9.043

3.  SLAM: cross-species gene finding and alignment with a generalized pair hidden Markov model.

Authors:  Marina Alexandersson; Simon Cawley; Lior Pachter
Journal:  Genome Res       Date:  2003-03       Impact factor: 9.043

4.  Initial sequencing and comparative analysis of the mouse genome.

Authors:  Robert H Waterston; Kerstin Lindblad-Toh; Ewan Birney; Jane Rogers; Josep F Abril; Pankaj Agarwal; Richa Agarwala; Rachel Ainscough; Marina Alexandersson; Peter An; Stylianos E Antonarakis; John Attwood; Robert Baertsch; Jonathon Bailey; Karen Barlow; Stephan Beck; Eric Berry; Bruce Birren; Toby Bloom; Peer Bork; Marc Botcherby; Nicolas Bray; Michael R Brent; Daniel G Brown; Stephen D Brown; Carol Bult; John Burton; Jonathan Butler; Robert D Campbell; Piero Carninci; Simon Cawley; Francesca Chiaromonte; Asif T Chinwalla; Deanna M Church; Michele Clamp; Christopher Clee; Francis S Collins; Lisa L Cook; Richard R Copley; Alan Coulson; Olivier Couronne; James Cuff; Val Curwen; Tim Cutts; Mark Daly; Robert David; Joy Davies; Kimberly D Delehaunty; Justin Deri; Emmanouil T Dermitzakis; Colin Dewey; Nicholas J Dickens; Mark Diekhans; Sheila Dodge; Inna Dubchak; Diane M Dunn; Sean R Eddy; Laura Elnitski; Richard D Emes; Pallavi Eswara; Eduardo Eyras; Adam Felsenfeld; Ginger A Fewell; Paul Flicek; Karen Foley; Wayne N Frankel; Lucinda A Fulton; Robert S Fulton; Terrence S Furey; Diane Gage; Richard A Gibbs; Gustavo Glusman; Sante Gnerre; Nick Goldman; Leo Goodstadt; Darren Grafham; Tina A Graves; Eric D Green; Simon Gregory; Roderic Guigó; Mark Guyer; Ross C Hardison; David Haussler; Yoshihide Hayashizaki; LaDeana W Hillier; Angela Hinrichs; Wratko Hlavina; Timothy Holzer; Fan Hsu; Axin Hua; Tim Hubbard; Adrienne Hunt; Ian Jackson; David B Jaffe; L Steven Johnson; Matthew Jones; Thomas A Jones; Ann Joy; Michael Kamal; Elinor K Karlsson; Donna Karolchik; Arkadiusz Kasprzyk; Jun Kawai; Evan Keibler; Cristyn Kells; W James Kent; Andrew Kirby; Diana L Kolbe; Ian Korf; Raju S Kucherlapati; Edward J Kulbokas; David Kulp; Tom Landers; J P Leger; Steven Leonard; Ivica Letunic; Rosie Levine; Jia Li; Ming Li; Christine Lloyd; Susan Lucas; Bin Ma; Donna R Maglott; Elaine R Mardis; Lucy Matthews; Evan Mauceli; John H Mayer; Megan McCarthy; W Richard McCombie; Stuart McLaren; 
Kirsten McLay; John D McPherson; Jim Meldrim; Beverley Meredith; Jill P Mesirov; Webb Miller; Tracie L Miner; Emmanuel Mongin; Kate T Montgomery; Michael Morgan; Richard Mott; James C Mullikin; Donna M Muzny; William E Nash; Joanne O Nelson; Michael N Nhan; Robert Nicol; Zemin Ning; Chad Nusbaum; Michael J O'Connor; Yasushi Okazaki; Karen Oliver; Emma Overton-Larty; Lior Pachter; Genís Parra; Kymberlie H Pepin; Jane Peterson; Pavel Pevzner; Robert Plumb; Craig S Pohl; Alex Poliakov; Tracy C Ponce; Chris P Ponting; Simon Potter; Michael Quail; Alexandre Reymond; Bruce A Roe; Krishna M Roskin; Edward M Rubin; Alistair G Rust; Ralph Santos; Victor Sapojnikov; Brian Schultz; Jörg Schultz; Matthias S Schwartz; Scott Schwartz; Carol Scott; Steven Seaman; Steve Searle; Ted Sharpe; Andrew Sheridan; Ratna Shownkeen; Sarah Sims; Jonathan B Singer; Guy Slater; Arian Smit; Douglas R Smith; Brian Spencer; Arne Stabenau; Nicole Stange-Thomann; Charles Sugnet; Mikita Suyama; Glenn Tesler; Johanna Thompson; David Torrents; Evanne Trevaskis; John Tromp; Catherine Ucla; Abel Ureta-Vidal; Jade P Vinson; Andrew C Von Niederhausern; Claire M Wade; Melanie Wall; Ryan J Weber; Robert B Weiss; Michael C Wendl; Anthony P West; Kris Wetterstrand; Raymond Wheeler; Simon Whelan; Jamey Wierzbowski; David Willey; Sophie Williams; Richard K Wilson; Eitan Winter; Kim C Worley; Dudley Wyman; Shan Yang; Shiaw-Pyng Yang; Evgeny M Zdobnov; Michael C Zody; Eric S Lander
Journal:  Nature       Date:  2002-12-05       Impact factor: 49.962

5.  Comparative gene prediction in human and mouse.

Authors:  Genís Parra; Pankaj Agarwal; Josep F Abril; Thomas Wiehe; James W Fickett; Roderic Guigó
Journal:  Genome Res       Date:  2003-01       Impact factor: 9.043

6.  Current methods of gene prediction, their strengths and weaknesses. (Review)

Authors:  Catherine Mathé; Marie-France Sagot; Thomas Schiex; Pierre Rouzé
Journal:  Nucleic Acids Res       Date:  2002-10-01       Impact factor: 16.971

7.  Improving gene recognition accuracy by combining predictions from two gene-finding programs.

Authors:  Sanja Rogic; B F Francis Ouellette; Alan K Mackworth
Journal:  Bioinformatics       Date:  2002-08       Impact factor: 6.937

8.  Genie--gene finding in Drosophila melanogaster.

Authors:  M G Reese; D Kulp; H Tammana; D Haussler
Journal:  Genome Res       Date:  2000-04       Impact factor: 9.043

9.  Prediction of complete gene structures in human genomic DNA.

Authors:  C Burge; S Karlin
Journal:  J Mol Biol       Date:  1997-04-25       Impact factor: 5.469

10.  Comparison of mouse and human genomes followed by experimental verification yields an estimated 1,019 additional genes.

Authors:  Roderic Guigó; Emmanouil T Dermitzakis; Pankaj Agarwal; Chris P Ponting; Genís Parra; Alexandre Reymond; Josep F Abril; Evan Keibler; Robert Lyle; Catherine Ucla; Stylianos E Antonarakis; Michael R Brent
Journal:  Proc Natl Acad Sci U S A       Date:  2003-01-27       Impact factor: 11.205

Cited by: 8 in total

1.  Multiple whole-genome alignments without a reference organism.

Authors:  Inna Dubchak; Alexander Poliakov; Andrey Kislyuk; Michael Brudno
Journal:  Genome Res       Date:  2009-01-28       Impact factor: 9.043

2.  Identification of key factors regulating self-renewal and differentiation in EML hematopoietic precursor cells by RNA-sequencing analysis.

Authors:  Shan Zong; Shuyun Deng; Kenian Chen; Jia Qian Wu
Journal:  J Vis Exp       Date:  2014-11-11       Impact factor: 1.355

3.  Reference based annotation with GeneMapper.

Authors:  Sourav Chatterji; Lior Pachter
Journal:  Genome Biol       Date:  2006-04-05       Impact factor: 13.583

4.  Predicting site-specific human selective pressure using evolutionary signatures.

Authors:  Javad Sadri; Abdoulaye Banire Diallo; Mathieu Blanchette
Journal:  Bioinformatics       Date:  2011-07-01       Impact factor: 6.937

5.  Recognition of unknown conserved alternatively spliced exons.

Authors:  Uwe Ohler; Noam Shomron; Christopher B Burge
Journal:  PLoS Comput Biol       Date:  2005-07-08       Impact factor: 4.475

6.  Novel gene and gene model detection using a whole genome open reading frame analysis in proteomics.

Authors:  Damian Fermin; Baxter B Allen; Thomas W Blackwell; Rajasree Menon; Marcin Adamski; Yin Xu; Peter Ulintz; Gilbert S Omenn; David J States
Journal:  Genome Biol       Date:  2006-04-28       Impact factor: 13.583

7.  Comparative gene finding in chicken indicates that we are closing in on the set of multi-exonic widely expressed human genes.

Authors:  Robert Castelo; Alexandre Reymond; Carine Wyss; Francisco Câmara; Genís Parra; Stylianos E Antonarakis; Roderic Guigó; Eduardo Eyras
Journal:  Nucleic Acids Res       Date:  2005-04-04       Impact factor: 16.971

8.  A method for identifying alternative or cryptic donor splice sites within gene and mRNA sequences. Comparisons among sequences from vertebrates, echinoderms and other groups.

Authors:  Katherine M Buckley; Liliana D Florea; L Courtney Smith
Journal:  BMC Genomics       Date:  2009-07-16       Impact factor: 3.969

