Literature DB >> 27455882

PR2S2Clust: Patched RNA-seq read segments' structure-oriented clustering.

Ashis Kumer Biswas1, Jean X Gao1.   

Abstract

RNA-seq, the next generation sequencing platform, enables researchers to explore deep into the transcriptome of organisms, such as identifying functional non-coding RNAs (ncRNAs), and quantify their expressions on tissues. The functions of ncRNAs are mostly related to their secondary structures. Thus by exploring the clustering in terms of structural profiles of the corresponding read-segments would be essential and this fuels in our motivation behind this research. In this manuscript we proposed PR2S2Clust, Patched RNA-seq Read Segments' Structure-oriented Clustering, which is an analysis platform to extract features to prepare the secondary structure profiles of the RNA-seq read segments. It provides a strategy to employ the profiles to annotate the segments into ncRNA classes using several clustering strategies. The system considers seven pairwise structural distance metrics by considering short-read mappings onto each structure, which we term as the "patched structure" while clustering the segments. In this regard, we show applications of both classical and ensemble clusterings of the partitional and hierarchical variations. Extensive real-world experiments over three publicly available RNA-seq datasets and a comparative analysis over four competitive systems confirm the effectiveness and superiority of the proposed system. The source codes and dataset of PR2S2Clust are available at the http://biomecis.uta.edu/~ashis/res/PR2S2Clust-suppl/ .

Keywords:  Patched RNA-seq read segments; ensemble clustering; hierarchical clustering; partitional clustering

Mesh:

Substances:

Year:  2016        PMID: 27455882     DOI: 10.1142/S021972001650027X

Source DB:  PubMed          Journal:  J Bioinform Comput Biol        ISSN: 0219-7200            Impact factor:   1.122


  2 in total

1.  Characterization of the Genomic Diversity of Norovirus in Linked Patients Using a Metagenomic Deep Sequencing Approach.

Authors:  Neda Nasheri; Nicholas Petronella; Jennifer Ronholm; Sabah Bidawid; Nathalie Corneau
Journal:  Front Microbiol       Date:  2017-01-31       Impact factor: 5.640

2.  Linear space string correction algorithm using the Damerau-Levenshtein distance.

Authors:  Chunchun Zhao; Sartaj Sahni
Journal:  BMC Bioinformatics       Date:  2020-12-09       Impact factor: 3.169

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.