Literature DB >> 33381253

THREE-WAY CLUSTERING OF MULTI-TISSUE MULTI-INDIVIDUAL GENE EXPRESSION DATA USING SEMI-NONNEGATIVE TENSOR DECOMPOSITION.

Miaoyan Wang1, Jonathan Fischer1, Yun S Song1.   

Abstract

The advent of high-throughput sequencing technologies has led to an increasing availability of large multi-tissue data sets which contain gene expression measurements across different tissues and individuals. In this setting, variation in expression levels arises due to contributions specific to genes, tissues, individuals, and interactions thereof. Classical clustering methods are ill-suited to explore these three-way interactions and struggle to fully extract the insights into transcriptome complexity contained in the data. We propose a new statistical method, called MultiCluster, based on semi-nonnegative tensor decomposition which permits the investigation of transcriptome variation across individuals and tissues simultaneously. We further develop a tensor projection procedure which detects covariate-related genes with high power, demonstrating the advantage of tensor-based methods in incorporating information across similar tissues. Through simulation and application to the GTEx RNA-seq data from 53 human tissues, we show that MultiCluster identifies three-way interactions with high accuracy and robustness.

Entities:  

Keywords:  15A69; 62H30; Primary 62P10; clustering; gene expression; secondary 62H25; tensor decomposition; tensor projection

Year:  2019        PMID: 33381253      PMCID: PMC7771883          DOI: 10.1214/18-aoas1228

Source DB:  PubMed          Journal:  Ann Appl Stat        ISSN: 1932-6157            Impact factor:   2.083


  22 in total

1.  Human genetics: GTEx pilot quantifies eQTL variation across tissues and individuals.

Authors:  Orli G Bahcall
Journal:  Nat Rev Genet       Date:  2015-06-16       Impact factor: 53.242

2.  Statistical models for meta-analysis: A brief tutorial.

Authors:  George A Kelley; Kristi S Kelley
Journal:  World J Methodol       Date:  2012-08-26

3.  OPERATOR NORM INEQUALITIES BETWEEN TENSOR UNFOLDINGS ON THE PARTITION LATTICE.

Authors:  Miaoyan Wang; Khanh Dao Duc; Jonathan Fischer; Yun S Song
Journal:  Linear Algebra Appl       Date:  2017-01-17       Impact factor: 1.401

4.  JOINT AND INDIVIDUAL VARIATION EXPLAINED (JIVE) FOR INTEGRATED ANALYSIS OF MULTIPLE DATA TYPES.

Authors:  Eric F Lock; Katherine A Hoadley; J S Marron; Andrew B Nobel
Journal:  Ann Appl Stat       Date:  2013-03-01       Impact factor: 2.083

5.  G protein-coupled receptor 26 immunoreactivity in intranuclear inclusions associated with polyglutamine and intranuclear inclusion body diseases.

Authors:  Fumiaki Mori; Kunikazu Tanji; Yasuo Miki; Yasuko Toyoshima; Mari Yoshida; Akiyoshi Kakita; Hitoshi Takahashi; Jun Utsumi; Hidenao Sasaki; Koichi Wakabayashi
Journal:  Neuropathology       Date:  2015-08-24       Impact factor: 1.906

6.  Tensor decomposition for multiple-tissue gene expression experiments.

Authors:  Victoria Hore; Ana Viñuela; Alfonso Buil; Julian Knight; Mark I McCarthy; Kerrin Small; Jonathan Marchini
Journal:  Nat Genet       Date:  2016-08-01       Impact factor: 38.330

7.  Visualizing the structure of RNA-seq expression data using grade of membership models.

Authors:  Kushal K Dey; Chiaowen Joyce Hsiao; Matthew Stephens
Journal:  PLoS Genet       Date:  2017-03-23       Impact factor: 5.917

8.  The protocadherin 11X/Y (PCDH11X/Y) gene pair as determinant of cerebral asymmetry in modern Homo sapiens.

Authors:  Thomas H Priddle; Timothy J Crow
Journal:  Ann N Y Acad Sci       Date:  2013-04-18       Impact factor: 5.691

9.  Context Specific and Differential Gene Co-expression Networks via Bayesian Biclustering.

Authors:  Chuan Gao; Ian C McDowell; Shiwen Zhao; Christopher D Brown; Barbara E Engelhardt
Journal:  PLoS Comput Biol       Date:  2016-07-28       Impact factor: 4.475

10.  Genic insights from integrated human proteomics in GeneCards.

Authors:  Simon Fishilevich; Shahar Zimmerman; Asher Kohn; Tsippi Iny Stein; Tsviya Olender; Eugene Kolker; Marilyn Safran; Doron Lancet
Journal:  Database (Oxford)       Date:  2016-04-05       Impact factor: 3.451

View more
  5 in total

1.  Optimal Sparse Singular Value Decomposition for High-Dimensional High-Order Data.

Authors:  Anru Zhang; Rungang Han
Journal:  J Am Stat Assoc       Date:  2019-03-20       Impact factor: 5.033

2.  Comparison of sparse biclustering algorithms for gene expression datasets.

Authors:  Kath Nicholls; Chris Wallace
Journal:  Brief Bioinform       Date:  2021-11-05       Impact factor: 11.622

3.  Identification of genes associated with altered gene expression and m6A profiles during hypoxia using tensor decomposition based unsupervised feature extraction.

Authors:  Sanjiban Sekhar Roy; Y-H Taguchi
Journal:  Sci Rep       Date:  2021-04-26       Impact factor: 4.379

4.  Modal clustering of matrix-variate data.

Authors:  Federico Ferraccioli; Giovanna Menardi
Journal:  Adv Data Anal Classif       Date:  2022-05-05

5.  Dissect Relationships Between Gene Co-expression and Functional Connectivity in Human Brain.

Authors:  Xue Zhang; Yingying Xie; Jie Tang; Wen Qin; Feng Liu; Hao Ding; Yuan Ji; Bingbing Yang; Peng Zhang; Wei Li; Zhaoxiang Ye; Chunshui Yu
Journal:  Front Neurosci       Date:  2021-12-09       Impact factor: 4.677

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.