Model-based clustering for RNA-seq data.

Yaqing Si, Peng Liu, Pinghua Li, Thomas P Brutnell.

Abstract

MOTIVATION: RNA-seq technology has been widely adopted as an attractive alternative to microarray-based methods to study global gene expression. However, robust statistical tools to analyze these complex datasets are still lacking. By grouping genes with similar expression profiles across treatments, cluster analysis provides insight into gene functions and networks, and hence is an important technique for RNA-seq data analysis.
RESULTS: In this manuscript, we derive clustering algorithms based on appropriate probability models for RNA-seq data. We describe an expectation-maximization (EM) algorithm and two stochastic variants of it. In addition, we propose a likelihood-based initialization strategy to improve the clustering algorithms. We also present a model-based hybrid-hierarchical clustering method that generates a tree structure, which allows the relationships among clusters to be visualized and gives flexibility in choosing the number of clusters. Results from both simulation studies and the analysis of a maize RNA-seq dataset show that our proposed methods provide better clustering results than alternative methods that are not based on probability models, such as the K-means algorithm and hierarchical clustering.
AVAILABILITY AND IMPLEMENTATION: An R package, MBCluster.Seq, has been developed to implement our proposed algorithms. This R package provides fast computation and is publicly available at http://www.r-project.org
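
As a rough illustration of what an expectation-maximization clustering of count data can look like, the sketch below fits a mixture of multinomial expression profiles to a small genes-by-treatments count table. It is not the MBCluster.Seq implementation: the abstract does not specify the probability model, so the multinomial profile model (each gene's total count is treated as fixed), the function name `em_multinomial_mixture`, the pseudocount, and the evenly spaced starting genes are all assumptions made here for illustration.

```python
import math


def em_multinomial_mixture(counts, k, n_iter=50, pseudo=0.5):
    """Cluster rows of `counts` (genes x treatments) into k groups by EM.

    Assumed model (illustrative only): given its total count, each gene's
    counts follow a multinomial whose probabilities are the expression
    profile of the cluster the gene belongs to.
    """
    n, t = len(counts), len(counts[0])

    # Simple deterministic start (an assumption, not the likelihood-based
    # strategy of the paper): evenly spaced genes supply initial profiles.
    seeds = [counts[(i * n) // k] for i in range(k)]
    profiles = [
        [(y + pseudo) / (sum(row) + pseudo * t) for y in row] for row in seeds
    ]
    weights = [1.0 / k] * k
    resp = []

    for _ in range(n_iter):
        # E-step: posterior probability of each cluster for each gene,
        # computed on the log scale and normalized stably.
        resp = []
        for row in counts:
            logp = [
                math.log(weights[j])
                + sum(y * math.log(p) for y, p in zip(row, profiles[j]))
                for j in range(k)
            ]
            top = max(logp)
            unnorm = [math.exp(v - top) for v in logp]
            total = sum(unnorm)
            resp.append([v / total for v in unnorm])

        # M-step: update mixing weights and cluster profiles from the
        # responsibility-weighted counts (pseudocount avoids log(0)).
        for j in range(k):
            nj = sum(r[j] for r in resp)
            weights[j] = max(nj / n, 1e-12)
            tot = [
                sum(resp[g][j] * counts[g][c] for g in range(n)) + pseudo
                for c in range(t)
            ]
            z = sum(tot)
            profiles[j] = [v / z for v in tot]

    # Hard assignment: most probable cluster for each gene.
    labels = [max(range(k), key=lambda j: r[j]) for r in resp]
    return labels, profiles, weights


if __name__ == "__main__":
    # Toy data: three genes peaking in treatment 1, three in treatment 2.
    toy = [
        [100, 10, 10], [210, 18, 25], [48, 6, 5],
        [10, 95, 12], [22, 180, 20], [5, 52, 4],
    ]
    labels, profiles, weights = em_multinomial_mixture(toy, k=2)
    print(labels)
```

Conditioning on each gene's total count means genes are grouped by the shape of their expression profile across treatments rather than by overall abundance, which is one plausible reading of "similar expression profiles across treatments". A stochastic variant would replace the soft responsibilities with sampled cluster labels.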

Year:  2013        PMID: 24191069     DOI: 10.1093/bioinformatics/btt632

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  50 in total

1.  A Survey of Data Mining and Deep Learning in Bioinformatics. (Review)

Authors:  Kun Lan; Dan-Tong Wang; Simon Fong; Lian-Sheng Liu; Kelvin K L Wong; Nilanjan Dey
Journal:  J Med Syst       Date:  2018-06-28       Impact factor: 4.460

2.  Discovery and Characterization of the 3-Hydroxyacyl-ACP Dehydratase Component of the Plant Mitochondrial Fatty Acid Synthase System.

Authors:  Xin Guan; Yozo Okazaki; Andrew Lithio; Ling Li; Xuefeng Zhao; Huanan Jin; Dan Nettleton; Kazuki Saito; Basil J Nikolau
Journal:  Plant Physiol       Date:  2017-02-15       Impact factor: 8.340

3.  Model-Based Feature Selection and Clustering of RNA-Seq Data for Unsupervised Subtype Discovery.

Authors:  David K Lim; Naim U Rashid; Joseph G Ibrahim
Journal:  Ann Appl Stat       Date:  2021-03-18       Impact factor: 2.083

4.  Nitrogen-Sparing Mechanisms in Chlamydomonas Affect the Transcriptome, the Proteome, and Photosynthetic Metabolism.

Authors:  Stefan Schmollinger; Timo Mühlhaus; Nanette R Boyle; Ian K Blaby; David Casero; Tabea Mettler; Jeffrey L Moseley; Janette Kropat; Frederik Sommer; Daniela Strenkert; Dorothea Hemme; Matteo Pellegrini; Arthur R Grossman; Mark Stitt; Michael Schroda; Sabeeha S Merchant
Journal:  Plant Cell       Date:  2014-04-18       Impact factor: 11.277

5.  Global Transcriptome Profiling of Developing Leaf and Shoot Apices Reveals Distinct Genetic and Environmental Control of Floral Transition and Inflorescence Development in Barley.

Authors:  Benedikt Digel; Artem Pankin; Maria von Korff
Journal:  Plant Cell       Date:  2015-08-25       Impact factor: 11.277

6.  Conditional Depletion of the Chlamydomonas Chloroplast ClpP Protease Activates Nuclear Genes Involved in Autophagy and Plastid Protein Quality Control.

Authors:  Silvia Ramundo; David Casero; Timo Mühlhaus; Dorothea Hemme; Frederik Sommer; Michèle Crèvecoeur; Michèle Rahire; Michael Schroda; Jannette Rusch; Ursula Goodenough; Matteo Pellegrini; Maria Esther Perez-Perez; José Luis Crespo; Olivier Schaad; Natacha Civic; Jean David Rochaix
Journal:  Plant Cell       Date:  2014-05-30       Impact factor: 11.277

7.  The mitochondrial protein PGAM5 suppresses energy consumption in brown adipocytes by repressing expression of uncoupling protein 1.

Authors:  Sho Sugawara; Yusuke Kanamaru; Shiori Sekine; Lila Maekawa; Akinori Takahashi; Tadashi Yamamoto; Kengo Watanabe; Takao Fujisawa; Kazuki Hattori; Hidenori Ichijo
Journal:  J Biol Chem       Date:  2020-03-06       Impact factor: 5.157

8.  Time-Course Transcriptomics Analysis Reveals Key Responses of Submerged Deepwater Rice to Flooding.

Authors:  Anzu Minami; Kenji Yano; Rico Gamuyao; Keisuke Nagai; Takeshi Kuroha; Madoka Ayano; Masanari Nakamori; Masaya Koike; Yuma Kondo; Yoko Niimi; Keiko Kuwata; Takamasa Suzuki; Tetsuya Higashiyama; Yumiko Takebayashi; Mikiko Kojima; Hitoshi Sakakibara; Atsushi Toyoda; Asao Fujiyama; Nori Kurata; Motoyuki Ashikari; Stefan Reuscher
Journal:  Plant Physiol       Date:  2018-02-23       Impact factor: 8.340

9.  Flexible model-based clustering of mixed binary and continuous data: application to genetic regulation and cancer.

Authors:  Fatin N Zainul Abidin; David R Westhead
Journal:  Nucleic Acids Res       Date:  2017-04-20       Impact factor: 16.971

10.  Reproductive developmental transcriptome analysis of Tripidium ravennae (Poaceae).

Authors:  Nathan Maren; Fangzhou Zhao; Rishi Aryal; Darren Touchell; Wusheng Liu; Thomas Ranney; Hamid Ashrafi
Journal:  BMC Genomics       Date:  2021-06-28       Impact factor: 3.969
