
Sparse group factor analysis for biclustering of multiple data sources.

Kerstin Bunte, Eemeli Leppäaho, Inka Saarinen, Samuel Kaski.

Abstract

MOTIVATION: Modelling methods that find structure in data are necessary given the current large volumes of genomic data, and there have been various efforts to find subsets of genes exhibiting consistent patterns over subsets of treatments. These biclustering techniques have focused on a single data source, often gene expression data. We present a Bayesian approach for joint biclustering of multiple data sources, extending a recent method, Group Factor Analysis, to have a biclustering interpretation with additional sparsity assumptions. The resulting method enables data-driven detection of linear structure present in parts of the data sources.
RESULTS: Our simulation studies show that the proposed method reliably infers biclusters from heterogeneous data sources. We tested the method on data from the NCI-DREAM drug sensitivity prediction challenge and obtained excellent prediction accuracy. Moreover, the predictions are based on several biclusters which provide insight into the data sources, in this case gene expression, DNA methylation, protein abundance, exome sequence, functional connectivity fingerprints and drug sensitivity.
AVAILABILITY AND IMPLEMENTATION: http://research.cs.aalto.fi/pml/software/GFAsparse/
CONTACT: kerstin.bunte@googlemail.com or samuel.kaski@aalto.fi.
© The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

Year:  2016        PMID: 27153643     DOI: 10.1093/bioinformatics/btw207

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  10 in total

Review 1.  Applications of machine learning in drug discovery and development.

Authors:  Jessica Vamathevan; Dominic Clark; Paul Czodrowski; Ian Dunham; Edgardo Ferran; George Lee; Bin Li; Anant Madabhushi; Parantu Shah; Michaela Spitzer; Shanrong Zhao
Journal:  Nat Rev Drug Discov       Date:  2019-06       Impact factor: 84.694

Review 2.  It is time to apply biclustering: a comprehensive review of biclustering applications in biological and biomedical data.

Authors:  Juan Xie; Anjun Ma; Anne Fennell; Qin Ma; Jing Zhao
Journal:  Brief Bioinform       Date:  2019-07-19       Impact factor: 11.622

Review 3.  Artificial intelligence and machine learning in precision and genomic medicine.

Authors:  Sameer Quazi
Journal:  Med Oncol       Date:  2022-06-15       Impact factor: 3.738

4.  Machine Learning in Drug Discovery: A Review.

Authors:  Suresh Dara; Swetha Dhamercherla; Surender Singh Jadav; Ch Madhu Babu; Mohamed Jawed Ahsan
Journal:  Artif Intell Rev       Date:  2021-08-11       Impact factor: 9.588

5.  miRSM: an R package to infer and analyse miRNA sponge modules in heterogeneous data.

Authors:  Junpeng Zhang; Lin Liu; Taosheng Xu; Wu Zhang; Chunwen Zhao; Sijing Li; Jiuyong Li; Nini Rao; Thuc Duy Le
Journal:  RNA Biol       Date:  2021-04-06       Impact factor: 4.652

6.  Understanding the Molecular Drivers of Disease Heterogeneity in Crohn's Disease Using Multi-omic Data Integration and Network Analysis.

Authors:  Padhmanand Sudhakar; Bram Verstockt; Jonathan Cremer; Sare Verstockt; João Sabino; Marc Ferrante; Séverine Vermeire
Journal:  Inflamm Bowel Dis       Date:  2021-05-17       Impact factor: 5.325

7.  Multiple co-clustering based on nonparametric mixture models with heterogeneous marginal distributions.

Authors:  Tomoki Tokuda; Junichiro Yoshimoto; Yu Shimizu; Go Okada; Masahiro Takamura; Yasumasa Okamoto; Shigeto Yamawaki; Kenji Doya
Journal:  PLoS One       Date:  2017-10-19       Impact factor: 3.240

8.  Multi-Omics Factor Analysis-a framework for unsupervised integration of multi-omics data sets.

Authors:  Ricard Argelaguet; Britta Velten; Damien Arnol; Sascha Dietrich; Thorsten Zenz; John C Marioni; Florian Buettner; Wolfgang Huber; Oliver Stegle
Journal:  Mol Syst Biol       Date:  2018-06-20       Impact factor: 11.429

9.  LMSM: A modular approach for identifying lncRNA related miRNA sponge modules in breast cancer.

Authors:  Junpeng Zhang; Taosheng Xu; Lin Liu; Wu Zhang; Chunwen Zhao; Sijing Li; Jiuyong Li; Nini Rao; Thuc Duy Le
Journal:  PLoS Comput Biol       Date:  2020-04-23       Impact factor: 4.475

10.  Identification of associations between genotypes and longitudinal phenotypes via temporally-constrained group sparse canonical correlation analysis.

Authors:  Xiaoke Hao; Chanxiu Li; Jingwen Yan; Xiaohui Yao; Shannon L Risacher; Andrew J Saykin; Li Shen; Daoqiang Zhang
Journal:  Bioinformatics       Date:  2017-07-15       Impact factor: 6.937

