Literature DB >> 31930398

AnnoGen: annotating genome-wide pragmatic features.

Quanhu Sheng1, Hui Yu2, Olufunmilola Oyebamiji2, Jiandong Wang3, Danqian Chen4, Scott Ness2, Ying-Yong Zhao4, Yan Guo2.   

Abstract

MOTIVATION: Genome annotation is an important step for all in-depth bioinformatics analysis. It is imperative to augment quantity and diversity of genome-wide annotation data for the latest reference genome to promote its adoption by ongoing and future impactful studies.
RESULTS: We developed a python toolkit AnnoGen, which at the first time, allows the annotation of three pragmatic genomic features for the GRCh38 genome in enormous base-wise quantities. The three features are chemical binding Energy, sequence information Entropy and Homology Score. The Homology Score is an exceptional feature that captures the genome-wide homology through single-base-offset tiling windows of 100 continual nucleotide bases. AnnoGen is capable of annotating the proprietary pragmatic features for variable user-interested genomic regions and optionally comparing two parallel sets of genomic regions. AnnoGen is characterized with simple utility modes and succinct HTML report of informative statistical tables and plots.
AVAILABILITY AND IMPLEMENTATION: https://github.com/shengqh/annogen.
© The Author(s) 2020. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Mesh:

Year:  2020        PMID: 31930398      PMCID: PMC7203733          DOI: 10.1093/bioinformatics/btaa027

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  11 in total

Review 1.  The thermodynamics of DNA structural motifs.

Authors:  John SantaLucia; Donald Hicks
Journal:  Annu Rev Biophys Biomol Struct       Date:  2004

2.  Estimating the entropy of DNA sequences.

Authors:  A O Schmitt; H Herzel
Journal:  J Theor Biol       Date:  1997-10-07       Impact factor: 2.691

3.  Improvements and impacts of GRCh38 human reference on high throughput sequencing data analysis.

Authors:  Yan Guo; Yulin Dai; Hui Yu; Shilin Zhao; David C Samuels; Yu Shyr
Journal:  Genomics       Date:  2017-01-26       Impact factor: 5.736

4.  GREAT improves functional interpretation of cis-regulatory regions.

Authors:  Cory Y McLean; Dave Bristor; Michael Hiller; Shoa L Clarke; Bruce T Schaar; Craig B Lowe; Aaron M Wenger; Gill Bejerano
Journal:  Nat Biotechnol       Date:  2010-05-02       Impact factor: 54.908

5.  ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data.

Authors:  Kai Wang; Mingyao Li; Hakon Hakonarson
Journal:  Nucleic Acids Res       Date:  2010-07-03       Impact factor: 16.971

6.  beRBP: binding estimation for human RNA-binding proteins.

Authors:  Hui Yu; Jing Wang; Quanhu Sheng; Qi Liu; Yu Shyr
Journal:  Nucleic Acids Res       Date:  2019-03-18       Impact factor: 16.971

7.  GLANET: genomic loci annotation and enrichment tool.

Authors:  Burçak Otlu; Can Firtina; Sündüz Keles; Oznur Tastan
Journal:  Bioinformatics       Date:  2017-09-15       Impact factor: 6.937

8.  Fast computation and applications of genome mappability.

Authors:  Thomas Derrien; Jordi Estellé; Santiago Marco Sola; David G Knowles; Emanuele Raineri; Roderic Guigó; Paolo Ribeca
Journal:  PLoS One       Date:  2012-01-19       Impact factor: 3.240

9.  Evaluation of GRCh38 and de novo haploid genome assemblies demonstrates the enduring quality of the reference assembly.

Authors:  Valerie A Schneider; Tina Graves-Lindsay; Kerstin Howe; Nathan Bouk; Hsiu-Chuan Chen; Paul A Kitts; Terence D Murphy; Kim D Pruitt; Françoise Thibaud-Nissen; Derek Albracht; Robert S Fulton; Milinn Kremitzki; Vincent Magrini; Chris Markovic; Sean McGrath; Karyn Meltz Steinberg; Kate Auger; William Chow; Joanna Collins; Glenn Harden; Timothy Hubbard; Sarah Pelan; Jared T Simpson; Glen Threadgold; James Torrance; Jonathan M Wood; Laura Clarke; Sergey Koren; Matthew Boitano; Paul Peluso; Heng Li; Chen-Shan Chin; Adam M Phillippy; Richard Durbin; Richard K Wilson; Paul Flicek; Evan E Eichler; Deanna M Church
Journal:  Genome Res       Date:  2017-04-10       Impact factor: 9.043

10.  On the length, weight and GC content of the human genome.

Authors:  Allison Piovesan; Maria Chiara Pelleri; Francesca Antonaros; Pierluigi Strippoli; Maria Caracausi; Lorenza Vitale
Journal:  BMC Res Notes       Date:  2019-02-27
View more
  1 in total

1.  Non-canonical RNA-DNA differences and other human genomic features are enriched within very short tandem repeats.

Authors:  Hui Yu; Shilin Zhao; Scott Ness; Huining Kang; Quanhu Sheng; David C Samuels; Olufunmilola Oyebamiji; Ying-Yong Zhao; Yan Guo
Journal:  PLoS Comput Biol       Date:  2020-06-08       Impact factor: 4.475

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.