Gene prediction with a hidden Markov model and a new intron submodel.

Mario Stanke, Stephan Waack.

Abstract

MOTIVATION: The problem of finding the genes in eukaryotic DNA sequences by computational methods is still not satisfactorily solved. Gene-finding programs have achieved relatively high accuracy on short genomic sequences but do not perform well on longer sequences containing an unknown number of genes. On such sequences, existing programs tend to predict many false exons.
RESULTS: We have developed a new program, AUGUSTUS, for the ab initio prediction of protein-coding genes in eukaryotic genomes. The program is based on a hidden Markov model and integrates a number of known methods and submodels. It employs a new way of modeling intron lengths. We use a new donor splice site model and a new model for a short region directly upstream of the donor splice site model that takes the reading frame into account, and we apply a method that allows better GC-content-dependent parameter estimation. On longer sequences, AUGUSTUS accurately predicts far more human and Drosophila genes than the ab initio gene prediction programs we compared it with, while at the same time being more specific.
AVAILABILITY: A web interface for AUGUSTUS and the executable program are located at http://augustus.gobics.de.

Year:  2003        PMID: 14534192     DOI: 10.1093/bioinformatics/btg1080

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  593 in total

1.  The First Draft Genome Assembly of Snow Sheep (Ovis nivicola).

Authors:  Maulik Upadhyay; Andreas Hauser; Elisabeth Kunz; Stefan Krebs; Helmut Blum; Arsen Dotsev; Innokentiy Okhlopkov; Vugar Bagirov; Gottfried Brem; Natalia Zinovieva; Ivica Medugorac
Journal:  Genome Biol Evol       Date:  2020-08-01       Impact factor: 3.416

2.  AGenDA: gene prediction by cross-species sequence comparison.

Authors:  Leila Taher; Oliver Rinner; Saurabh Garg; Alexander Sczyrba; Burkhard Morgenstern
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

3.  AUGUSTUS: a web server for gene finding in eukaryotes.

Authors:  Mario Stanke; Rasmus Steinkamp; Stephan Waack; Burkhard Morgenstern
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

4.  A beginner's guide to eukaryotic genome annotation (Review).

Authors:  Mark Yandell; Daniel Ence
Journal:  Nat Rev Genet       Date:  2012-04-18       Impact factor: 53.242

5.  The Solanum commersonii Genome Sequence Provides Insights into Adaptation to Stress Conditions and Genome Evolution of Wild Potato Relatives.

Authors:  Riccardo Aversano; Felice Contaldi; Maria Raffaella Ercolano; Valentina Grosso; Massimo Iorizzo; Filippo Tatino; Luciano Xumerle; Alessandra Dal Molin; Carla Avanzato; Alberto Ferrarini; Massimo Delledonne; Walter Sanseverino; Riccardo Aiese Cigliano; Salvador Capella-Gutierrez; Toni Gabaldón; Luigi Frusciante; James M Bradeen; Domenico Carputo
Journal:  Plant Cell       Date:  2015-04-14       Impact factor: 11.277

6.  Genome of the Chinese tree shrew.

Authors:  Yu Fan; Zhi-Yong Huang; Chang-Chang Cao; Ce-Shi Chen; Yuan-Xin Chen; Ding-Ding Fan; Jing He; Hao-Long Hou; Li Hu; Xin-Tian Hu; Xuan-Ting Jiang; Ren Lai; Yong-Shan Lang; Bin Liang; Sheng-Guang Liao; Dan Mu; Yuan-Ye Ma; Yu-Yu Niu; Xiao-Qing Sun; Jin-Quan Xia; Jin Xiao; Zhi-Qiang Xiong; Lin Xu; Lan Yang; Yun Zhang; Wei Zhao; Xu-Dong Zhao; Yong-Tang Zheng; Ju-Min Zhou; Ya-Bing Zhu; Guo-Jie Zhang; Jun Wang; Yong-Gang Yao
Journal:  Nat Commun       Date:  2013       Impact factor: 14.919

7.  The Oxytricha trifallax macronuclear genome: a complex eukaryotic genome with 16,000 tiny chromosomes.

Authors:  Estienne C Swart; John R Bracht; Vincent Magrini; Patrick Minx; Xiao Chen; Yi Zhou; Jaspreet S Khurana; Aaron D Goldman; Mariusz Nowacki; Klaas Schotanus; Seolkyoung Jung; Robert S Fulton; Amy Ly; Sean McGrath; Kevin Haub; Jessica L Wiggins; Donna Storton; John C Matese; Lance Parsons; Wei-Jen Chang; Michael S Bowen; Nicholas A Stover; Thomas A Jones; Sean R Eddy; Glenn A Herrick; Thomas G Doak; Richard K Wilson; Elaine R Mardis; Laura F Landweber
Journal:  PLoS Biol       Date:  2013-01-29       Impact factor: 8.029

8.  Revisiting the protein-coding gene catalog of Drosophila melanogaster using 12 fly genomes.

Authors:  Michael F Lin; Joseph W Carlson; Madeline A Crosby; Beverley B Matthews; Charles Yu; Soo Park; Kenneth H Wan; Andrew J Schroeder; L Sian Gramates; Susan E St Pierre; Margaret Roark; Kenneth L Wiley; Rob J Kulathinal; Peili Zhang; Kyl V Myrick; Jerry V Antone; Susan E Celniker; William M Gelbart; Manolis Kellis
Journal:  Genome Res       Date:  2007-11-07       Impact factor: 9.043

9.  The sequence and de novo assembly of the giant panda genome.

Authors:  Ruiqiang Li; Wei Fan; Geng Tian; Hongmei Zhu; Lin He; Jing Cai; Quanfei Huang; Qingle Cai; Bo Li; Yinqi Bai; Zhihe Zhang; Yaping Zhang; Wen Wang; Jun Li; Fuwen Wei; Heng Li; Min Jian; Jianwen Li; Zhaolei Zhang; Rasmus Nielsen; Dawei Li; Wanjun Gu; Zhentao Yang; Zhaoling Xuan; Oliver A Ryder; Frederick Chi-Ching Leung; Yan Zhou; Jianjun Cao; Xiao Sun; Yonggui Fu; Xiaodong Fang; Xiaosen Guo; Bo Wang; Rong Hou; Fujun Shen; Bo Mu; Peixiang Ni; Runmao Lin; Wubin Qian; Guodong Wang; Chang Yu; Wenhui Nie; Jinhuan Wang; Zhigang Wu; Huiqing Liang; Jiumeng Min; Qi Wu; Shifeng Cheng; Jue Ruan; Mingwei Wang; Zhongbin Shi; Ming Wen; Binghang Liu; Xiaoli Ren; Huisong Zheng; Dong Dong; Kathleen Cook; Gao Shan; Hao Zhang; Carolin Kosiol; Xueying Xie; Zuhong Lu; Hancheng Zheng; Yingrui Li; Cynthia C Steiner; Tommy Tsan-Yuk Lam; Siyuan Lin; Qinghui Zhang; Guoqing Li; Jing Tian; Timing Gong; Hongde Liu; Dejin Zhang; Lin Fang; Chen Ye; Juanbin Zhang; Wenbo Hu; Anlong Xu; Yuanyuan Ren; Guojie Zhang; Michael W Bruford; Qibin Li; Lijia Ma; Yiran Guo; Na An; Yujie Hu; Yang Zheng; Yongyong Shi; Zhiqiang Li; Qing Liu; Yanling Chen; Jing Zhao; Ning Qu; Shancen Zhao; Feng Tian; Xiaoling Wang; Haiyin Wang; Lizhi Xu; Xiao Liu; Tomas Vinar; Yajun Wang; Tak-Wah Lam; Siu-Ming Yiu; Shiping Liu; Hemin Zhang; Desheng Li; Yan Huang; Xia Wang; Guohua Yang; Zhi Jiang; Junyi Wang; Nan Qin; Li Li; Jingxiang Li; Lars Bolund; Karsten Kristiansen; Gane Ka-Shu Wong; Maynard Olson; Xiuqing Zhang; Songgang Li; Huanming Yang; Jian Wang; Jun Wang
Journal:  Nature       Date:  2009-12-13       Impact factor: 49.962

10.  Chromosome-level assembly of the mustache toad genome using third-generation DNA sequencing and Hi-C analysis.

Authors:  Yongxin Li; Yandong Ren; Dongru Zhang; Hui Jiang; Zhongkai Wang; Xueyan Li; Dingqi Rao
Journal:  Gigascience       Date:  2019-09-01       Impact factor: 6.524
