Literature DB >> 23431329

Incorporating the human gene annotations in different databases significantly improved transcriptomic and genetic analyses.

Geng Chen1, Charles Wang, Leming Shi, Xiongfei Qu, Jiwei Chen, Jianmin Yang, Caiping Shi, Long Chen, Peiying Zhou, Baitang Ning, Weida Tong, Tieliu Shi.   

Abstract

Human gene annotation is crucial for conducting transcriptomic and genetic studies; however, the impacts of human gene annotations in diverse databases on related studies have been less evaluated. To enable full use of various human annotation resources and better understand the human transcriptome, here we systematically compare the human annotations present in RefSeq, Ensembl (GENCODE), and AceView on diverse transcriptomic and genetic analyses. We found that the human gene annotations in the three databases are far from complete. Although Ensembl and AceView annotated more genes than RefSeq, more than 15,800 genes from Ensembl (or AceView) are within the intergenic and intronic regions of AceView (or Ensembl) annotation. The human transcriptome annotations in RefSeq, Ensembl, and AceView had distinct effects on short-read mapping, gene and isoform expression profiling, and differential expression calling. Furthermore, our findings indicate that the integrated annotation of these databases can obtain a more complete gene set and significantly enhance those transcriptomic analyses. We also observed that many more known SNPs were located within genes annotated in Ensembl and AceView than in RefSeq. In particular, 1033 of 3041 trait/disease-associated SNPs involved in about 200 human traits/diseases that were previously reported to be in RefSeq intergenic regions could be relocated within Ensembl and AceView genes. Our findings illustrate that a more complete transcriptome generated by incorporating human gene annotations in diverse databases can strikingly improve the overall results of transcriptomic and genetic studies.

Entities:  

Mesh:

Year:  2013        PMID: 23431329      PMCID: PMC3677258          DOI: 10.1261/rna.037473.112

Source DB:  PubMed          Journal:  RNA        ISSN: 1355-8382            Impact factor:   4.942


  45 in total

1.  Gene index analysis of the human genome estimates approximately 120,000 genes.

Authors:  F Liang; I Holt; G Pertea; S Karamycheva; S L Salzberg; J Quackenbush
Journal:  Nat Genet       Date:  2000-06       Impact factor: 38.330

2.  How to count ... human genes.

Authors:  S A Aparicio
Journal:  Nat Genet       Date:  2000-06       Impact factor: 38.330

3.  An expressed pseudogene regulates the messenger-RNA stability of its homologous coding gene.

Authors:  Shinji Hirotsune; Noriyuki Yoshida; Amy Chen; Lisa Garrett; Fumihiro Sugiyama; Satoru Takahashi; Ken-ichi Yagami; Anthony Wynshaw-Boris; Atsushi Yoshiki
Journal:  Nature       Date:  2003-05-01       Impact factor: 49.962

4.  Bioinformatics. Gene counters struggle to get the right answer.

Authors:  Elizabeth Pennisi
Journal:  Science       Date:  2003-08-22       Impact factor: 47.728

Review 5.  Oncogenomics and the development of new cancer therapies.

Authors:  Robert L Strausberg; Andrew J G Simpson; Lloyd J Old; Gregory J Riggins
Journal:  Nature       Date:  2004-05-27       Impact factor: 49.962

Review 6.  An assessment of the sequence gaps: unfinished business in a finished human genome.

Authors:  Evan E Eichler; Royden A Clark; Xinwei She
Journal:  Nat Rev Genet       Date:  2004-05       Impact factor: 53.242

7.  Human genome: end of the beginning.

Authors:  Lincoln D Stein
Journal:  Nature       Date:  2004-10-21       Impact factor: 49.962

8.  Differential expression in RNA-seq: a matter of depth.

Authors:  Sonia Tarazona; Fernando García-Alcalde; Joaquín Dopazo; Alberto Ferrer; Ana Conesa
Journal:  Genome Res       Date:  2011-09-08       Impact factor: 9.043

9.  Gene prediction with a hidden Markov model and a new intron submodel.

Authors:  Mario Stanke; Stephan Waack
Journal:  Bioinformatics       Date:  2003-10       Impact factor: 6.937

10.  AceView: a comprehensive cDNA-supported gene and transcripts annotation.

Authors:  Danielle Thierry-Mieg; Jean Thierry-Mieg
Journal:  Genome Biol       Date:  2006-08-07       Impact factor: 13.583

View more
  10 in total

1.  Incomplete annotation has a disproportionate impact on our understanding of Mendelian and complex neurogenetic disorders.

Authors:  David Zhang; Sebastian Guelfi; Sonia Garcia-Ruiz; Beatrice Costa; Regina H Reynolds; Karishma D'Sa; Wenfei Liu; Thomas Courtin; Amy Peterson; Andrew E Jaffe; John Hardy; Juan A Botía; Leonardo Collado-Torres; Mina Ryten
Journal:  Sci Adv       Date:  2020-06-10       Impact factor: 14.136

Review 2.  Sequencing XMET genes to promote genotype-guided risk assessment and precision medicine.

Authors:  Yaqiong Jin; Geng Chen; Wenming Xiao; Huixiao Hong; Joshua Xu; Yongli Guo; Wenzhong Xiao; Tieliu Shi; Leming Shi; Weida Tong; Baitang Ning
Journal:  Sci China Life Sci       Date:  2019-05-20       Impact factor: 6.038

3.  Dissecting the Characteristics and Dynamics of Human Protein Complexes at Transcriptome Cascade Using RNA-Seq Data.

Authors:  Geng Chen; Jiwei Chen; Caiping Shi; Leming Shi; Weida Tong; Tieliu Shi
Journal:  PLoS One       Date:  2013-06-18       Impact factor: 3.240

4.  Re-annotation of presumed noncoding disease/trait-associated genetic variants by integrative analyses.

Authors:  Geng Chen; Dianke Yu; Jiwei Chen; Ruifang Cao; Juan Yang; Huan Wang; Xiangjun Ji; Baitang Ning; Tieliu Shi
Journal:  Sci Rep       Date:  2015-03-30       Impact factor: 4.379

5.  RNA-seq analysis of impact of PNN on gene expression and alternative splicing in corneal epithelial cells.

Authors:  Debra Akin; Jeremy R B Newman; Lauren M McIntyre; Stephen P Sugrue
Journal:  Mol Vis       Date:  2016-01-16       Impact factor: 2.367

6.  Identification of Tissue-Specific Protein-Coding and Noncoding Transcripts across 14 Human Tissues Using RNA-seq.

Authors:  Jinhang Zhu; Geng Chen; Sibo Zhu; Suqing Li; Zhuo Wen; Yuanting Zheng; Leming Shi
Journal:  Sci Rep       Date:  2016-06-22       Impact factor: 4.379

7.  Roadblock: improved annotations do not necessarily translate into new functional insights.

Authors:  Nicola A L Hall; Becky C Carlyle; Wilfried Haerty; Elizabeth M Tunbridge
Journal:  Genome Biol       Date:  2021-11-22       Impact factor: 13.583

8.  A comprehensive evaluation of ensembl, RefSeq, and UCSC annotations in the context of RNA-seq read mapping and gene quantification.

Authors:  Shanrong Zhao; Baohong Zhang
Journal:  BMC Genomics       Date:  2015-02-18       Impact factor: 3.969

9.  Arkas: Rapid reproducible RNAseq analysis.

Authors:  Anthony R Colombo; Timothy J Triche; Giridharan Ramsingh
Journal:  F1000Res       Date:  2017-04-27

10.  Incomplete annotation has a disproportionate impact on our understanding of Mendelian and complex neurogenetic disorders.

Authors:  David Zhang; Sebastian Guelfi; Sonia Garcia-Ruiz; Beatrice Costa; Regina H Reynolds; Karishma D'Sa; Wenfei Liu; Thomas Courtin; Amy Peterson; Andrew E Jaffe; John Hardy; Juan A Botía; Leonardo Collado-Torres; Mina Ryten
Journal:  Sci Adv       Date:  2020-06-10       Impact factor: 14.136

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.