Literature DB >> 25058025

Classification of selectively constrained DNA elements using feature vectors and rule-based classifiers.

Dimitris Polychronopoulos1, Emanuel Weitschek2, Slavica Dimitrieva3, Philipp Bucher4, Giovanni Felici5, Yannis Almirantis6.   

Abstract

Scarce work has been done in the analysis of the composition of conserved non-coding elements (CNEs) that are identified by comparisons of two or more genomes and are found to exist in all metazoan genomes. Here we present the analysis of CNEs with a methodology that takes into account word occurrence at various lengths scales in the form of feature vector representation and rule based classifiers. We implement our approach on both protein-coding exons and CNEs, originating from human, insect (Drosophila melanogaster) and worm (Caenorhabditis elegans) genomes, that are either identified in the present study or obtained from the literature. Alignment free feature vector representation of sequences combined with rule-based classification methods leads to successful classification of the different CNEs classes. Biologically meaningful results are derived by comparison with the genomic signatures approach, and classification rates for a variety of functional elements of the genomes along with surrogates are presented.
Copyright © 2014 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Alignment-free classification; Classification; Conserved noncoding elements (CNEs); Feature vector representation; Rule-based classifiers; Ultraconserved elements (UCEs)

Mesh:

Substances:

Year:  2014        PMID: 25058025     DOI: 10.1016/j.ygeno.2014.07.004

Source DB:  PubMed          Journal:  Genomics        ISSN: 0888-7543            Impact factor:   5.736


  9 in total

1.  On avoided words, absent words, and their application to biological sequence analysis.

Authors:  Yannis Almirantis; Panagiotis Charalampopoulos; Jia Gao; Costas S Iliopoulos; Manal Mohamed; Solon P Pissis; Dimitris Polychronopoulos
Journal:  Algorithms Mol Biol       Date:  2017-03-14       Impact factor: 1.405

2.  Next generation sequencing reads comparison with an alignment-free distance.

Authors:  Emanuel Weitschek; Daniele Santoni; Giulia Fiscon; Maria Cristina De Cola; Paola Bertolazzi; Giovanni Felici
Journal:  BMC Res Notes       Date:  2014-12-03

3.  matK-QR classifier: a patterns based approach for plant species identification.

Authors:  Ravi Prabhakar More; Rupali Chandrashekhar Mane; Hemant J Purohit
Journal:  BioData Min       Date:  2016-12-09       Impact factor: 2.522

4.  Comparisons of forecasting for hepatitis in Guangxi Province, China by using three neural networks models.

Authors:  Ruijing Gan; Ni Chen; Daizheng Huang
Journal:  PeerJ       Date:  2016-11-08       Impact factor: 2.984

Review 5.  Conserved non-coding elements: developmental gene regulation meets genome organization.

Authors:  Dimitris Polychronopoulos; James W D King; Alexander J Nash; Ge Tan; Boris Lenhard
Journal:  Nucleic Acids Res       Date:  2017-12-15       Impact factor: 16.971

6.  Decision Tree Algorithm-Generated Single-Nucleotide Polymorphism Barcodes of rbcL Genes for 38 Brassicaceae Species Tagging.

Authors:  Cheng-Hong Yang; Kuo-Chuan Wu; Li-Yeh Chuang; Hsueh-Wei Chang
Journal:  Evol Bioinform Online       Date:  2018-03-05       Impact factor: 1.625

7.  CNEFinder: finding conserved non-coding elements in genomes.

Authors:  Lorraine A K Ayad; Solon P Pissis; Dimitris Polychronopoulos
Journal:  Bioinformatics       Date:  2018-09-01       Impact factor: 6.937

8.  Introduction of 'Generalized Genomic Signatures' for the quantification of neighbour preferences leads to taxonomy- and functionality-based distinction among sequences.

Authors:  Konstantinos Apostolou-Karampelis; Dimitris Polychronopoulos; Yannis Almirantis
Journal:  Sci Rep       Date:  2019-02-08       Impact factor: 4.379

9.  LAF: Logic Alignment Free and its application to bacterial genomes classification.

Authors:  Emanuel Weitschek; Fabio Cunial; Giovanni Felici
Journal:  BioData Min       Date:  2015-12-08       Impact factor: 2.522

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.