Literature DB >> 29040382

Generalized correlation measure using count statistics for gene expression data with ordered samples.

Y X Rachel Wang1, Ke Liu2, Elizabeth Theusch3, Jerome I Rotter4, Marisa W Medina3, Michael S Waterman5, Haiyan Huang2, Oliver Stegle.   

Abstract

Motivation: Capturing association patterns in gene expression levels under different conditions or time points is important for inferring gene regulatory interactions. In practice, temporal changes in gene expression may result in complex association patterns that require more sophisticated detection methods than simple correlation measures. For instance, the effect of regulation may lead to time-lagged associations and interactions local to a subset of samples. Furthermore, expression profiles of interest may not be aligned or directly comparable (e.g. gene expression profiles from two species).
Results: We propose a count statistic for measuring association between pairs of gene expression profiles consisting of ordered samples (e.g. time-course), where correlation may only exist locally in subsequences separated by a position shift. The statistic is simple and fast to compute, and we illustrate its use in two applications. In a cross-species comparison of developmental gene expression levels, we show our method not only measures association of gene expressions between the two species, but also provides alignment between different developmental stages. In the second application, we applied our statistic to expression profiles from two distinct phenotypic conditions, where the samples in each profile are ordered by the associated phenotypic values. The detected associations can be useful in building correspondence between gene association networks under different phenotypes. On the theoretical side, we provide asymptotic distributions of the statistic for different regions of the parameter space and test its power on simulated data. Availability and implementation: The code used to perform the analysis is available as part of the Supplementary Material. Contact: msw@usc.edu or hhuang@stat.berkeley.edu. Supplementary information: Supplementary data are available at Bioinformatics online. © Crown copyright 2017.

Entities:  

Mesh:

Year:  2018        PMID: 29040382      PMCID: PMC5860612          DOI: 10.1093/bioinformatics/btx641

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  29 in total

1.  Biclustering of expression data.

Authors:  Y Cheng; G M Church
Journal:  Proc Int Conf Intell Syst Mol Biol       Date:  2000

2.  Cluster analysis of gene expression dynamics.

Authors:  Marco F Ramoni; Paola Sebastiani; Isaac S Kohane
Journal:  Proc Natl Acad Sci U S A       Date:  2002-06-24       Impact factor: 11.205

3.  Identifying time-lagged gene clusters using gene expression data.

Authors:  Liping Ji; Kian-Lee Tan
Journal:  Bioinformatics       Date:  2004-09-16       Impact factor: 6.937

4.  Biclustering of linear patterns in gene expression data.

Authors:  Qinghui Gao; Christine Ho; Yingmin Jia; Jingyi Jessica Li; Haiyan Huang
Journal:  J Comput Biol       Date:  2012-06       Impact factor: 1.479

5.  Efficient statistical significance approximation for local similarity analysis of high-throughput time series data.

Authors:  Li C Xia; Dongmei Ai; Jacob Cram; Jed A Fuhrman; Fengzhu Sun
Journal:  Bioinformatics       Date:  2012-11-23       Impact factor: 6.937

6.  Clustering of gene expression data using a local shape-based similarity measure.

Authors:  Rajarajeswari Balasubramaniyan; Eyke Hüllermeier; Nils Weskamp; Jörg Kämper
Journal:  Bioinformatics       Date:  2004-10-28       Impact factor: 6.937

7.  Cluster analysis and display of genome-wide expression patterns.

Authors:  M B Eisen; P T Spellman; P O Brown; D Botstein
Journal:  Proc Natl Acad Sci U S A       Date:  1998-12-08       Impact factor: 11.205

8.  Phenotypic predictors of response to simvastatin therapy among African-Americans and Caucasians: the Cholesterol and Pharmacogenetics (CAP) Study.

Authors:  Joel A Simon; Feng Lin; Stephen B Hulley; Patricia J Blanche; David Waters; Stephen Shiboski; Jerome I Rotter; Deborah A Nickerson; Huiying Yang; Mohammed Saad; Ronald M Krauss
Journal:  Am J Cardiol       Date:  2006-01-27       Impact factor: 2.778

9.  Estimating mutual information using B-spline functions--an improved similarity measure for analysing gene expression data.

Authors:  Carsten O Daub; Ralf Steuer; Joachim Selbig; Sebastian Kloska
Journal:  BMC Bioinformatics       Date:  2004-08-31       Impact factor: 3.169

10.  Similarity queries for temporal toxicogenomic expression profiles.

Authors:  Adam A Smith; Aaron Vollrath; Christopher A Bradfield; Mark Craven
Journal:  PLoS Comput Biol       Date:  2008-07-18       Impact factor: 4.475

View more
  3 in total

1.  Network Modeling in Biology: Statistical Methods for Gene and Brain Networks.

Authors:  Y X Rachel Wang; Lexin Li; Jingyi Jessica Li; Haiyan Huang
Journal:  Stat Sci       Date:  2021-02       Impact factor: 2.901

2.  GeneFishing to reconstruct context specific portraits of biological processes.

Authors:  Ke Liu; Elizabeth Theusch; Yun Zhou; Tal Ashuach; Andrea C Dose; Peter J Bickel; Marisa W Medina; Haiyan Huang
Journal:  Proc Natl Acad Sci U S A       Date:  2019-09-04       Impact factor: 11.205

Review 3.  Gene Co-Expression Network Tools and Databases for Crop Improvement.

Authors:  Rabiatul-Adawiah Zainal-Abidin; Sarahani Harun; Vinothienii Vengatharajuloo; Amin-Asyraf Tamizi; Nurul Hidayah Samsulrizal
Journal:  Plants (Basel)       Date:  2022-06-21
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.