Literature DB >> 35358296

Two-stage linked component analysis for joint decomposition of multiple biologically related data sets.

Huan Chen1, Brian Caffo1, Genevieve Stein-O'Brien2, Jinrui Liu3, Ben Langmead4, Carlo Colantuoni5, Luo Xiao6.   

Abstract

Integrative analysis of multiple data sets has the potential of fully leveraging the vast amount of high throughput biological data being generated. In particular such analysis will be powerful in making inference from publicly available collections of genetic, transcriptomic and epigenetic data sets which are designed to study shared biological processes, but which vary in their target measurements, biological variation, unwanted noise, and batch variation. Thus, methods that enable the joint analysis of multiple data sets are needed to gain insights into shared biological processes that would otherwise be hidden by unwanted intra-data set variation. Here, we propose a method called two-stage linked component analysis (2s-LCA) to jointly decompose multiple biologically related experimental data sets with biological and technological relationships that can be structured into the decomposition. The consistency of the proposed method is established and its empirical performance is evaluated via simulation studies. We apply 2s-LCA to jointly analyze four data sets focused on human brain development and identify meaningful patterns of gene expression in human neurogenesis that have shared structure across these data sets.
© The Author 2022. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Keywords:  Integrative methods; Joint decomposition; Low rank models; Multiview data; Principal component analysis

Mesh:

Year:  2022        PMID: 35358296      PMCID: PMC9566367          DOI: 10.1093/biostatistics/kxac005

Source DB:  PubMed          Journal:  Biostatistics        ISSN: 1465-4644            Impact factor:   5.279


  24 in total

1.  Convex Banding of the Covariance Matrix.

Authors:  Jacob Bien; Florentina Bunea; Luo Xiao
Journal:  J Am Stat Assoc       Date:  2016-08-18       Impact factor: 5.033

2.  Are clusterings of multiple data views independent?

Authors:  Lucy L Gao; Jacob Bien; Daniela Witten
Journal:  Biostatistics       Date:  2020-10-01       Impact factor: 5.899

3.  CORTECON: a temporal transcriptome analysis of in vitro human cerebral cortex development from human embryonic stem cells.

Authors:  Joyce van de Leemput; Nathan C Boles; Thomas R Kiehl; Barbara Corneo; Patty Lederman; Vilas Menon; Changkyu Lee; Refugio A Martinez; Boaz P Levi; Carol L Thompson; Shuyuan Yao; Ajamete Kaykas; Sally Temple; Christopher A Fasano
Journal:  Neuron       Date:  2014-07-02       Impact factor: 17.173

4.  projectR: an R/Bioconductor package for transfer learning via PCA, NMF, correlation and clustering.

Authors:  Gaurav Sharma; Carlo Colantuoni; Loyal A Goff; Elana J Fertig; Genevieve Stein-O'Brien
Journal:  Bioinformatics       Date:  2020-06-01       Impact factor: 6.937

5.  BIDIMENSIONAL LINKED MATRIX FACTORIZATION FOR PAN-OMICS PAN-CANCER ANALYSIS.

Authors:  Eric F Lock; Jun Young Park; Katherine A Hoadley
Journal:  Ann Appl Stat       Date:  2022-03-28       Impact factor: 1.959

6.  Spatiotemporal gene expression trajectories reveal developmental hierarchies of the human cortex.

Authors:  Tomasz J Nowakowski; Aparna Bhaduri; Alex A Pollen; Beatriz Alvarado; Mohammed A Mostajo-Radji; Elizabeth Di Lullo; Maximilian Haeussler; Carmen Sandoval-Espinosa; Siyuan John Liu; Dmitry Velmeshev; Johain Ryad Ounadjela; Joe Shuga; Xiaohui Wang; Daniel A Lim; Jay A West; Anne A Leyrat; W James Kent; Arnold R Kriegstein
Journal:  Science       Date:  2017-12-08       Impact factor: 47.728

7.  Integrative multi-view regression: Bridging group-sparse and low-rank models.

Authors:  Gen Li; Xiaokang Liu; Kun Chen
Journal:  Biometrics       Date:  2019-03-29       Impact factor: 2.571

8.  Semiparametric partial common principal component analysis for covariance matrices.

Authors:  Bingkai Wang; Xi Luo; Yi Zhao; Brian Caffo
Journal:  Biometrics       Date:  2020-10-10       Impact factor: 2.571

9.  gEAR: Gene Expression Analysis Resource portal for community-driven, multi-omic data exploration.

Authors:  Joshua Orvis; Brian Gottfried; Jayaram Kancherla; Ricky S Adkins; Yang Song; Amiel A Dror; Dustin Olley; Kevin Rose; Elena Chrysostomou; Michael C Kelly; Beatrice Milon; Maggie S Matern; Hela Azaiez; Brian Herb; Carlo Colantuoni; Robert L Carter; Seth A Ament; Matthew W Kelley; Owen White; Hector Corrada Bravo; Anup Mahurkar; Ronna Hertzano
Journal:  Nat Methods       Date:  2021-08       Impact factor: 47.990

10.  Sparse multiple co-Inertia analysis with application to integrative analysis of multi -Omics data.

Authors:  Eun Jeong Min; Qi Long
Journal:  BMC Bioinformatics       Date:  2020-04-15       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.