Gene prediction and verification in a compact genome with numerous small introns.

Aaron E Tenney, Randall H Brown, Charles Vaske, Jennifer K Lodge, Tamara L Doering, Michael R Brent.

Abstract

The genomes of clusters of related eukaryotes are now being sequenced at an increasing rate, creating a need for accurate, low-cost annotation of exon-intron structures. In this paper, we demonstrate that reverse transcription-polymerase chain reaction (RT-PCR) and direct sequencing based on predicted gene structures satisfy this need, at least for single-celled eukaryotes. The TWINSCAN gene prediction algorithm was adapted for the fungal pathogen Cryptococcus neoformans by using a precise model of intron lengths in combination with ungapped alignments between the genome sequences of the two closely related Cryptococcus varieties. This approach resulted in approximately 60% of known genes being predicted exactly right at every coding base and splice site. When previously unannotated TWINSCAN predictions were tested by RT-PCR and direct sequencing, 75% of targets spanning two predicted introns were amplified and produced high-quality sequence. When targets spanning the complete predicted open reading frame were tested, 72% of them amplified and produced high-quality sequence. We conclude that sequencing a small number of expressed sequence tags (ESTs) to provide training data, running TWINSCAN on an entire genome, and then performing RT-PCR and direct sequencing on all of its predictions would be a cost-effective method for obtaining an experimentally verified genome annotation.
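The "predicted exactly right" figure above is a gene-level measure: a known gene counts as a hit only if some prediction reproduces every coding exon boundary. The following is a minimal sketch of how such a measure could be computed, not the authors' evaluation code. The function name `exact_gene_sensitivity`, the interval representation, and the toy coordinates are all hypothetical, and the sketch assumes all genes lie on one sequence and one strand.

```python
from typing import Dict, List, Tuple

# Hypothetical representation: a coding exon is an inclusive (start, end) pair.
Exon = Tuple[int, int]


def exact_gene_sensitivity(
    known: Dict[str, List[Exon]], predicted: List[List[Exon]]
) -> float:
    """Return the fraction of known genes matched exactly by some prediction.

    A match requires identical coding exon coordinates, which implies
    identical start/stop positions and splice sites.
    """
    if not known:
        return 0.0
    # Normalise each prediction to a sorted tuple so it can be looked up in a set.
    predicted_set = {tuple(sorted(exons)) for exons in predicted}
    hits = sum(
        1 for exons in known.values() if tuple(sorted(exons)) in predicted_set
    )
    return hits / len(known)


# Toy data (invented for illustration only).
known = {
    "geneA": [(100, 200), (260, 400)],
    "geneB": [(1000, 1100), (1150, 1300), (1360, 1500)],
    "geneC": [(2000, 2500)],
}
predicted = [
    [(100, 200), (260, 400)],                    # identical to geneA
    [(1000, 1100), (1150, 1290), (1360, 1500)],  # one splice site differs from geneB
    [(2000, 2500)],                              # identical to geneC
]

result = exact_gene_sensitivity(known, predicted)
print(f"{result:.3f}")
```

In this toy case two of the three known genes are reproduced exactly; the prediction for geneB is wrong at a single splice site and therefore does not count, which is why the exact-gene measure is much stricter than per-base or per-exon accuracy.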


Year: 2004 · PMID: 15479946 · PMCID: PMC525692 · DOI: 10.1101/gr.2816704

Source DB: PubMed · Journal: Genome Res · ISSN: 1088-9051 · Impact factor: 9.043


