Literature DB >> 34432282

Ensemble Classifiers for Multiclass MicroRNA Classification.

Luise Odenthal1, Jens Allmer2, Malik Yousef3.   

Abstract

Gene regulation is of utmost importance to cell homeostasis; thus, any dysregulation in it often leads to disease. MicroRNAs (miRNAs) are involved in posttranscriptional gene regulation and consequently, their dysregulation has been associated with many diseases.MiRBase version 21 contains microRNAs from about 200 species organized into about 70 clades. It has been shown that not all miRNAs collected in the database are likely to be real and, therefore, novel routes to delineate between correct and false miRNAs should be explored. We introduce a novel approach based on k-mer frequencies and machine learning that assigns an unknown/unlabeled miRNA to its most likely clade/species of origin. A simple way to filter new data would be to ensure that the novel miRNA categorizes closely to the species it is said to originate from. For that, an ensemble classifier of multiple two-class random forest classifiers was designed, where each random forest was trained on one species-clade pair. The approach was tested with different sampling methods on a dataset that was taken from miRBase version 21 and it was evaluated using a hierarchical F-measure. The approach predicted 81% to 94% of the test data correctly, depending on the sampling method. This is the first classifier that can classify miRNAs to their species of origin. This method will aid in the evaluation of miRNA database integrity and analysis of noisy miRNA samples.
© 2022. Springer Science+Business Media, LLC, part of Springer Nature.

Entities:  

Keywords:  Categorization; Machine learning; miRNA

Mesh:

Substances:

Year:  2022        PMID: 34432282     DOI: 10.1007/978-1-0716-1170-8_12

Source DB:  PubMed          Journal:  Methods Mol Biol        ISSN: 1064-3745


  2 in total

1.  maTE: discovering expressed interactions between microRNAs and their targets.

Authors:  Malik Yousef; Loai Abdallah; Jens Allmer
Journal:  Bioinformatics       Date:  2019-10-15       Impact factor: 6.937

2.  Phylo.io: Interactive Viewing and Comparison of Large Phylogenetic Trees on the Web.

Authors:  Oscar Robinson; David Dylus; Christophe Dessimoz
Journal:  Mol Biol Evol       Date:  2016-04-19       Impact factor: 16.240

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.