Literature DB >> 22047402

RNA-Seq improves annotation of protein-coding genes in the cucumber genome.

Zhen Li1, Zhonghua Zhang, Pengcheng Yan, Sanwen Huang, Zhangjun Fei, Kui Lin.   

Abstract

BACKGROUND: As more and more genomes are sequenced, genome annotation becomes increasingly important in bridging the gap between sequence and biology. Gene prediction, which is at the center of genome annotation, usually integrates various resources to compute consensus gene structures. However, many newly sequenced genomes have limited resources for gene predictions. In an effort to create high-quality gene models of the cucumber genome (Cucumis sativus var. sativus), based on the EVidenceModeler gene prediction pipeline, we incorporated the massively parallel complementary DNA sequencing (RNA-Seq) reads of 10 cucumber tissues into EVidenceModeler. We applied the new pipeline to the reassembled cucumber genome and included a comparison between our predicted protein-coding gene sets and a published set.
RESULTS: The reassembled cucumber genome, annotated with RNA-Seq reads from 10 tissues, has 23, 248 identified protein-coding genes. Compared with the published prediction in 2009, approximately 8, 700 genes reveal structural modifications and 5, 285 genes only appear in the reassembled cucumber genome. All the related results, including genome sequence and annotations, are available at http://cmb.bnu.edu.cn/Cucumis_sativus_v20/.
CONCLUSIONS: We conclude that RNA-Seq greatly improves the accuracy of prediction of protein-coding genes in the reassembled cucumber genome. The comparison between the two gene sets also suggests that it is feasible to use RNA-Seq reads to annotate newly sequenced or less-studied genomes.

Entities:  

Mesh:

Substances:

Year:  2011        PMID: 22047402      PMCID: PMC3219749          DOI: 10.1186/1471-2164-12-540

Source DB:  PubMed          Journal:  BMC Genomics        ISSN: 1471-2164            Impact factor:   3.969


  50 in total

1.  The TIGR Plant Repeat Databases: a collective resource for the identification of repetitive sequences in plants.

Authors:  Shu Ouyang; C Robin Buell
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

2.  Basic local alignment search tool.

Authors:  S F Altschul; W Gish; W Miller; E W Myers; D J Lipman
Journal:  J Mol Biol       Date:  1990-10-05       Impact factor: 5.469

3.  Advancing RNA-Seq analysis.

Authors:  Brian J Haas; Michael C Zody
Journal:  Nat Biotechnol       Date:  2010-05       Impact factor: 54.908

4.  tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence.

Authors:  T M Lowe; S R Eddy
Journal:  Nucleic Acids Res       Date:  1997-03-01       Impact factor: 16.971

5.  A computational screen for methylation guide snoRNAs in yeast.

Authors:  T M Lowe; S R Eddy
Journal:  Science       Date:  1999-02-19       Impact factor: 47.728

6.  TigrScan and GlimmerHMM: two open source ab initio eukaryotic gene-finders.

Authors:  W H Majoros; M Pertea; S L Salzberg
Journal:  Bioinformatics       Date:  2004-05-14       Impact factor: 6.937

7.  BLAST2GENE: a comprehensive conversion of BLAST output into independent genes and gene fragments.

Authors:  Mikita Suyama; David Torrents; Peer Bork
Journal:  Bioinformatics       Date:  2004-03-22       Impact factor: 6.937

8.  Rfam: annotating non-coding RNAs in complete genomes.

Authors:  Sam Griffiths-Jones; Simon Moxon; Mhairi Marshall; Ajay Khanna; Sean R Eddy; Alex Bateman
Journal:  Nucleic Acids Res       Date:  2005-01-01       Impact factor: 16.971

9.  Complete reannotation of the Arabidopsis genome: methods, tools, protocols and the final release.

Authors:  Brian J Haas; Jennifer R Wortman; Catherine M Ronning; Linda I Hannick; Roger K Smith; Rama Maiti; Agnes P Chan; Chunhui Yu; Maryam Farzad; Dongying Wu; Owen White; Christopher D Town
Journal:  BMC Biol       Date:  2005-03-22       Impact factor: 7.431

10.  Gene finding in novel genomes.

Authors:  Ian Korf
Journal:  BMC Bioinformatics       Date:  2004-05-14       Impact factor: 3.169

View more
  70 in total

1.  Genome-Wide Mapping of Structural Variations Reveals a Copy Number Variant That Determines Reproductive Morphology in Cucumber.

Authors:  Zhonghua Zhang; Linyong Mao; Huiming Chen; Fengjiao Bu; Guangcun Li; Jinjing Sun; Shuai Li; Honghe Sun; Chen Jiao; Rachel Blakely; Junsong Pan; Run Cai; Ruibang Luo; Yves Van de Peer; Evert Jacobsen; Zhangjun Fei; Sanwen Huang
Journal:  Plant Cell       Date:  2015-05-22       Impact factor: 11.277

2.  Improved structural annotation of protein-coding genes in the Meloidogyne hapla genome using RNA-Seq.

Authors:  Yuelong Guo; David McK Bird; Dahlia M Nielsen
Journal:  Worm       Date:  2014-05-16

3.  QTL-seq identifies an early flowering QTL located near Flowering Locus T in cucumber.

Authors:  Hongfeng Lu; Tao Lin; Joël Klein; Shenhao Wang; Jianjian Qi; Qian Zhou; Jinjing Sun; Zhonghua Zhang; Yiqun Weng; Sanwen Huang
Journal:  Theor Appl Genet       Date:  2014-05-21       Impact factor: 5.699

4.  Seqping: gene prediction pipeline for plant genomes using self-training gene models and transcriptomic data.

Authors:  Kuang-Lim Chan; Rozana Rosli; Tatiana V Tatarinova; Michael Hogan; Mohd Firdaus-Raih; Eng-Ti Leslie Low
Journal:  BMC Bioinformatics       Date:  2017-01-27       Impact factor: 3.169

5.  Fine mapping of the pleiotropic locus B for black spine and orange mature fruit color in cucumber identifies a 50 kb region containing a R2R3-MYB transcription factor.

Authors:  Yuhong Li; Changlong Wen; Yiqun Weng
Journal:  Theor Appl Genet       Date:  2013-05-21       Impact factor: 5.699

6.  QTL mapping for downy mildew resistance in cucumber inbred line WI7120 (PI 330628).

Authors:  Yuhui Wang; Kyle VandenLangenberg; Todd C Wehner; Peter A G Kraan; Jos Suelmann; Xiangyang Zheng; Ken Owens; Yiqun Weng
Journal:  Theor Appl Genet       Date:  2016-05-04       Impact factor: 5.699

7.  A genomic variation map provides insights into the genetic basis of cucumber domestication and diversity.

Authors:  Jianjian Qi; Xin Liu; Di Shen; Han Miao; Bingyan Xie; Xixiang Li; Peng Zeng; Shenhao Wang; Yi Shang; Xingfang Gu; Yongchen Du; Ying Li; Tao Lin; Jinhong Yuan; Xueyong Yang; Jinfeng Chen; Huiming Chen; Xingyao Xiong; Ke Huang; Zhangjun Fei; Linyong Mao; Li Tian; Thomas Städler; Susanne S Renner; Sophien Kamoun; William J Lucas; Zhonghua Zhang; Sanwen Huang
Journal:  Nat Genet       Date:  2013-10-20       Impact factor: 38.330

8.  The draft genome of watermelon (Citrullus lanatus) and resequencing of 20 diverse accessions.

Authors:  Shaogui Guo; Jianguo Zhang; Honghe Sun; Jerome Salse; William J Lucas; Haiying Zhang; Yi Zheng; Linyong Mao; Yi Ren; Zhiwen Wang; Jiumeng Min; Xiaosen Guo; Florent Murat; Byung-Kook Ham; Zhaoliang Zhang; Shan Gao; Mingyun Huang; Yimin Xu; Silin Zhong; Aureliano Bombarely; Lukas A Mueller; Hong Zhao; Hongju He; Yan Zhang; Zhonghua Zhang; Sanwen Huang; Tao Tan; Erli Pang; Kui Lin; Qun Hu; Hanhui Kuang; Peixiang Ni; Bo Wang; Jingan Liu; Qinghe Kou; Wenju Hou; Xiaohua Zou; Jiao Jiang; Guoyi Gong; Kathrin Klee; Heiko Schoof; Ying Huang; Xuesong Hu; Shanshan Dong; Dequan Liang; Juan Wang; Kui Wu; Yang Xia; Xiang Zhao; Zequn Zheng; Miao Xing; Xinming Liang; Bangqing Huang; Tian Lv; Junyi Wang; Ye Yin; Hongping Yi; Ruiqiang Li; Mingzhu Wu; Amnon Levi; Xingping Zhang; James J Giovannoni; Jun Wang; Yunfu Li; Zhangjun Fei; Yong Xu
Journal:  Nat Genet       Date:  2012-11-25       Impact factor: 38.330

9.  Towards an improved apple reference transcriptome using RNA-seq.

Authors:  Yang Bai; Laura Dougherty; Kenong Xu
Journal:  Mol Genet Genomics       Date:  2014-02-16       Impact factor: 3.291

10.  A high-quality cucumber genome assembly enhances computational comparative genomics.

Authors:  Paweł Osipowski; Magdalena Pawełkowicz; Michał Wojcieszek; Agnieszka Skarzyńska; Zbigniew Przybecki; Wojciech Pląder
Journal:  Mol Genet Genomics       Date:  2019-10-16       Impact factor: 3.291

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.