Literature DB >> 30439480

Genome classification improvements based on k-mer intervals in sequences.

Gyu-Bum Han1, Dong-Ho Cho2.   

Abstract

Given the vast amount of genomic data, alignment-free sequence comparison methods are required due to their low computational complexity. k-mer based methods can improve comparison accuracy by extracting an effective feature of the genome sequences. The aim of this paper is to extract k-mer intervals of a sequence as a feature of a genome for high comparison accuracy. In the proposed method, we calculated the distance between genome sequences by comparing the distribution of k-mer intervals. Then, we identified the classification results using phylogenetic trees. We used viral, mitochondrial (MT), microbial and mammalian genome sequences to perform classification for various genome sets. We confirmed that the proposed method provides a better classification result than other k-mer based methods. Furthermore, the proposed method could efficiently be applied to long sequences such as human and mouse genomes.
Copyright © 2018 Elsevier Inc. All rights reserved.

Entities:  

Mesh:

Year:  2018        PMID: 30439480     DOI: 10.1016/j.ygeno.2018.11.001

Source DB:  PubMed          Journal:  Genomics        ISSN: 0888-7543            Impact factor:   5.736


  4 in total

1.  Specificity Analysis of Genome Based on Statistically Identical K-Words With Same Base Combination.

Authors:  Hyein Seo; Yong-Joon Song; Kiho Cho; Dong-Ho Cho
Journal:  IEEE Open J Eng Med Biol       Date:  2020-07-14

2.  A systematic evaluation of bioinformatics tools for identification of long noncoding RNAs.

Authors:  You Duan; Wanting Zhang; Yingyin Cheng; Mijuan Shi; Xiao-Qin Xia
Journal:  RNA       Date:  2020-10-14       Impact factor: 4.942

3.  Phylogenetic Analysis of HIV-1 Genomes Based on the Position-Weighted K-mers Method.

Authors:  Yuanlin Ma; Zuguo Yu; Runbin Tang; Xianhua Xie; Guosheng Han; Vo V Anh
Journal:  Entropy (Basel)       Date:  2020-02-23       Impact factor: 2.524

4.  mLoc-mRNA: predicting multiple sub-cellular localization of mRNAs using random forest algorithm coupled with feature selection via elastic net.

Authors:  Prabina Kumar Meher; Anil Rai; Atmakuri Ramakrishna Rao
Journal:  BMC Bioinformatics       Date:  2021-06-24       Impact factor: 3.169

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.