Literature DB >> 17146048

Multivariate regression analysis of distance matrices for testing associations between gene expression patterns and related variables.

Matthew A Zapala1, Nicholas J Schork.   

Abstract

A fundamental step in the analysis of gene expression and other high-dimensional genomic data is the calculation of the similarity or distance between pairs of individual samples in a study. If one has collected N total samples and assayed the expression level of G genes on those samples, then an N x N similarity matrix can be formed that reflects the correlation or similarity of the samples with respect to the expression values over the G genes. This matrix can then be examined for patterns via standard data reduction and cluster analysis techniques. We consider an alternative to conventional data reduction and cluster analyses of similarity matrices that is rooted in traditional linear models. This analysis method allows predictor variables collected on the samples to be related to variation in the pairwise similarity/distance values reflected in the matrix. The proposed multivariate method avoids the need for reducing the dimensions of a similarity matrix, can be used to assess relationships between the genes used to construct the matrix and additional information collected on the samples under study, and can be used to analyze individual genes or groups of genes identified in different ways. The technique can be used with any high-dimensional assay or data type and is ideally suited for testing subsets of genes defined by their participation in a biochemical pathway or other a priori grouping. We showcase the methodology using three published gene expression data sets.

Mesh:

Year:  2006        PMID: 17146048      PMCID: PMC1748243          DOI: 10.1073/pnas.0609333103

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  31 in total

Review 1.  From patterns to pathways: gene expression data analysis comes of age.

Authors:  Donna K Slonim
Journal:  Nat Genet       Date:  2002-12       Impact factor: 38.330

2.  Statistical significance for genomewide studies.

Authors:  John D Storey; Robert Tibshirani
Journal:  Proc Natl Acad Sci U S A       Date:  2003-07-25       Impact factor: 11.205

3.  Gene regulation and DNA damage in the ageing human brain.

Authors:  Tao Lu; Ying Pan; Shyan-Yuan Kao; Cheng Li; Isaac Kohane; Jennifer Chan; Bruce A Yankner
Journal:  Nature       Date:  2004-06-09       Impact factor: 49.962

4.  A concordance correlation coefficient to evaluate reproducibility.

Authors:  L I Lin
Journal:  Biometrics       Date:  1989-03       Impact factor: 2.571

5.  Cluster analysis and display of genome-wide expression patterns.

Authors:  M B Eisen; P T Spellman; P O Brown; D Botstein
Journal:  Proc Natl Acad Sci U S A       Date:  1998-12-08       Impact factor: 11.205

6.  Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization.

Authors:  P T Spellman; G Sherlock; M Q Zhang; V R Iyer; K Anders; M B Eisen; P O Brown; D Botstein; B Futcher
Journal:  Mol Biol Cell       Date:  1998-12       Impact factor: 4.138

7.  Expression monitoring by hybridization to high-density oligonucleotide arrays.

Authors:  D J Lockhart; H Dong; M C Byrne; M T Follettie; M V Gallo; M S Chee; M Mittmann; C Wang; M Kobayashi; H Horton; E L Brown
Journal:  Nat Biotechnol       Date:  1996-12       Impact factor: 54.908

8.  The tissue plasminogen activator-plasminogen proteolytic cascade accelerates amyloid-beta (Abeta) degradation and inhibits Abeta-induced neurodegeneration.

Authors:  Jerry P Melchor; Robert Pawlak; Sidney Strickland
Journal:  J Neurosci       Date:  2003-10-01       Impact factor: 6.167

9.  A factor analysis model for functional genomics.

Authors:  Rafal Kustra; Romy Shioda; Mu Zhu
Journal:  BMC Bioinformatics       Date:  2006-04-21       Impact factor: 3.169

10.  Visualising very large phylogenetic trees in three dimensional hyperbolic space.

Authors:  Timothy Hughes; Young Hyun; David A Liberles
Journal:  BMC Bioinformatics       Date:  2004-04-29       Impact factor: 3.169

View more
  102 in total

1.  Wavelet-based functional clustering for patterns of high-dimensional dynamic gene expression.

Authors:  Bong-Rae Kim; Timothy McMurry; Wei Zhao; Rongling Wu; Arthur Berg
Journal:  J Comput Biol       Date:  2010-08       Impact factor: 1.479

2.  Curve-based multivariate distance matrix regression analysis: application to genetic association analyses involving repeated measures.

Authors:  Rany M Salem; Daniel T O'Connor; Nicholas J Schork
Journal:  Physiol Genomics       Date:  2010-04-27       Impact factor: 3.107

3.  Human behavioral informatics in genetic studies of neuropsychiatric disease: multivariate profile-based analysis.

Authors:  Cinnamon S Bloss; Kelly M Schiabor; Nicholas J Schork
Journal:  Brain Res Bull       Date:  2010-04-28       Impact factor: 4.077

Review 4.  Genomic similarity and kernel methods I: advancements by building on mathematical and statistical foundations.

Authors:  Daniel J Schaid
Journal:  Hum Hered       Date:  2010-07-03       Impact factor: 0.444

5.  Associating Multivariate Quantitative Phenotypes with Genetic Variants in Family Samples with a Novel Kernel Machine Regression Method.

Authors:  Qi Yan; Daniel E Weeks; Juan C Celedón; Hemant K Tiwari; Bingshan Li; Xiaojing Wang; Wan-Yu Lin; Xiang-Yang Lou; Guimin Gao; Wei Chen; Nianjun Liu
Journal:  Genetics       Date:  2015-10-19       Impact factor: 4.562

6.  Structural connectivity patterns associated with the putative visual word form area and children's reading ability.

Authors:  Qiuyun Fan; Adam W Anderson; Nicole Davis; Laurie E Cutting
Journal:  Brain Res       Date:  2014-08-22       Impact factor: 3.252

7.  5-HT₁A receptor binding is increased after recovery from bulimia nervosa compared to control women and is associated with behavioral inhibition in both groups.

Authors:  Ursula F Bailer; Cinnamon S Bloss; Guido K Frank; Julie C Price; Carolyn C Meltzer; Chester A Mathis; Mark A Geyer; Angela Wagner; Carl R Becker; Nicholas J Schork; Walter H Kaye
Journal:  Int J Eat Disord       Date:  2010-09-24       Impact factor: 4.861

8.  A multivariate distance-based analytic framework for connectome-wide association studies.

Authors:  Zarrar Shehzad; Clare Kelly; Philip T Reiss; R Cameron Craddock; John W Emerson; Katie McMahon; David A Copland; F Xavier Castellanos; Michael P Milham
Journal:  Neuroimage       Date:  2014-02-28       Impact factor: 6.556

9.  Genetic background comparison using distance-based regression, with applications in population stratification evaluation and adjustment.

Authors:  Qizhai Li; Sholom Wacholder; David J Hunter; Robert N Hoover; Stephen Chanock; Gilles Thomas; Kai Yu
Journal:  Genet Epidemiol       Date:  2009-07       Impact factor: 2.135

10.  Untangling the relatedness among correlations, part I: Nonparametric approaches to inter-subject correlation analysis at the group level.

Authors:  Gang Chen; Yong-Wook Shin; Paul A Taylor; Daniel R Glen; Richard C Reynolds; Robert B Israel; Robert W Cox
Journal:  Neuroimage       Date:  2016-05-17       Impact factor: 6.556

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.