Literature DB >> 26007227

Epigenomic k-mer dictionaries: shedding light on how sequence composition influences in vivo nucleosome positioning.

Raffaele Giancarlo1, Simona E Rombo1, Filippo Utro1.   

Abstract

MOTIVATION: Information-theoretic and compositional analysis of biological sequences, in terms of k-mer dictionaries, has a well established role in genomic and proteomic studies. Much less so in epigenomics, although the role of k-mers in chromatin organization and nucleosome positioning is particularly relevant. Fundamental questions concerning the informational content and compositional structure of nucleosome favouring and disfavoring sequences with respect to their basic building blocks still remain open.
RESULTS: We present the first analysis on the role of k-mers in the composition of nucleosome enriched and depleted genomic regions (NER and NDR for short) that is: (i) exhaustive and within the bounds dictated by the information-theoretic content of the sample sets we use and (ii) informative for comparative epigenomics. We analize four different organisms and we propose a paradigmatic formalization of k-mer dictionaries, providing two different and complementary views of the k-mers involved in NER and NDR. The first extends well known studies in this area, its comparative nature being its major merit. The second, very novel, brings to light the rich variety of k-mers involved in influencing nucleosome positioning, for which an initial classification in terms of clusters is also provided. Although such a classification offers many insights, the following deserves to be singled-out: short poly(dA:dT) tracts are reported in the literature as fundamental for nucleosome depletion, however a global quantitative look reveals that their role is much less prominent than one would expect based on previous studies.
AVAILABILITY AND IMPLEMENTATION: Dictionaries, clusters and Supplementary Material are available online at http://math.unipa.it/rombo/epigenomics/. CONTACT: simona.rombo@unipa.it SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Substances:

Year:  2015        PMID: 26007227     DOI: 10.1093/bioinformatics/btv295

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  5 in total

1.  Evolutionary mechanism and biological functions of 8-mers containing CG dinucleotide in yeast.

Authors:  Yan Zheng; Hong Li; Yue Wang; Hu Meng; Qiang Zhang; Xiaoqing Zhao
Journal:  Chromosome Res       Date:  2017-02-09       Impact factor: 5.239

2.  Probabilistic variable-length segmentation of protein sequences for discriminative motif discovery (DiMotif) and sequence embedding (ProtVecX).

Authors:  Ehsaneddin Asgari; Alice C McHardy; Mohammad R K Mofrad
Journal:  Sci Rep       Date:  2019-03-05       Impact factor: 4.379

3.  FEDRO: a software tool for the automatic discovery of candidate ORFs in plants with c →u RNA editing.

Authors:  Fabio Fassetti; Claudia Giallombardo; Ofelia Leone; Luigi Palopoli; Simona E Rombo; Adolfo Saiardi
Journal:  BMC Bioinformatics       Date:  2019-04-18       Impact factor: 3.169

4.  Intrinsic laws of k-mer spectra of genome sequences and evolution mechanism of genomes.

Authors:  Zhenhua Yang; Hong Li; Yun Jia; Yan Zheng; Hu Meng; Tonglaga Bao; Xiaolong Li; Liaofu Luo
Journal:  BMC Evol Biol       Date:  2020-11-23       Impact factor: 3.260

5.  Prediction of nucleosome positioning by the incorporation of frequencies and distributions of three different nucleotide segment lengths into a general pseudo k-tuple nucleotide composition.

Authors:  Akinori Awazu
Journal:  Bioinformatics       Date:  2016-08-25       Impact factor: 6.937

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.