Literature DB >> 28369202

Row versus column correlations: avoiding the ecological fallacy in RNA/protein expression studies.

Jonathon J O'Brien1, Harsha P Gunawardena2, Bahjat F Qaqish3.   

Abstract

Biomedical researchers are often interested in computing the correlation between RNA and protein abundance. However, correlations can be computed between rows of a data matrix or between columns, and the results are not the same. The belief that these two types of correlation are estimating the same phenomenon is a special case of a well-known logical error called the ecological fallacy. In this article, we review different uses of correlation found in the literature, explain the differences between row and column correlations and argue that one of them has an undesirable interpretation in most applications. Through simulation studies and theoretical derivations, we show that the commonly used Pearson's coefficient, computed from protein and transcript data from a single sample, is only loosely related to the biological correlation that most researchers will be interested in studying. Beyond our basic exploration of the ecological fallacy, we examine how correlations are affected by relative quantification proteomics data and common normalization procedures, finding that double normalization is capable of completely masking true correlative relationships. We conclude with guidelines for properly identifying and computing consistent correlation coefficients.

Mesh:

Substances:

Year:  2018        PMID: 28369202      PMCID: PMC6171494          DOI: 10.1093/bib/bbx021

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  12 in total

1.  Ecological correlations and the behavior of individuals.

Authors:  W S Robinson
Journal:  Int J Epidemiol       Date:  2009-01-28       Impact factor: 7.196

2.  Global quantification of mammalian gene expression control.

Authors:  Björn Schwanhäusser; Dorothea Busse; Na Li; Gunnar Dittmar; Johannes Schuchhardt; Jana Wolf; Wei Chen; Matthias Selbach
Journal:  Nature       Date:  2011-05-19       Impact factor: 49.962

3.  Batch effect removal methods for microarray gene expression data integration: a survey.

Authors:  Cosmin Lazar; Stijn Meganck; Jonatan Taminau; David Steenhoff; Alain Coletta; Colin Molter; David Y Weiss-Solís; Robin Duque; Hugues Bersini; Ann Nowé
Journal:  Brief Bioinform       Date:  2012-07-31       Impact factor: 11.622

4.  Proteogenomic characterization of human colon and rectal cancer.

Authors:  Bing Zhang; Jing Wang; Xiaojing Wang; Jing Zhu; Qi Liu; Zhiao Shi; Matthew C Chambers; Lisa J Zimmerman; Kent F Shaddox; Sangtae Kim; Sherri R Davies; Sean Wang; Pei Wang; Christopher R Kinsinger; Robert C Rivers; Henry Rodriguez; R Reid Townsend; Matthew J C Ellis; Steven A Carr; David L Tabb; Robert J Coffey; Robbert J C Slebos; Daniel C Liebler
Journal:  Nature       Date:  2014-07-20       Impact factor: 49.962

5.  Absolute protein expression profiling estimates the relative contributions of transcriptional and translational regulation.

Authors:  Peng Lu; Christine Vogel; Rong Wang; Xin Yao; Edward M Marcotte
Journal:  Nat Biotechnol       Date:  2006-12-24       Impact factor: 54.908

6.  Protein abundances are more conserved than mRNA abundances across diverse taxa.

Authors:  Jon M Laurent; Christine Vogel; Taejoon Kwon; Stephanie A Craig; Daniel R Boutz; Holly K Huse; Kazunari Nozue; Harkamal Walia; Marvin Whiteley; Pamela C Ronald; Edward M Marcotte
Journal:  Proteomics       Date:  2010-12       Impact factor: 3.984

7.  Defining the transcriptome and proteome in three functionally different human cell lines.

Authors:  Emma Lundberg; Linn Fagerberg; Daniel Klevebring; Ivan Matic; Tamar Geiger; Juergen Cox; Cajsa Algenäs; Joakim Lundeberg; Matthias Mann; Mathias Uhlen
Journal:  Mol Syst Biol       Date:  2010-12-21       Impact factor: 11.429

8.  Sequence signatures and mRNA concentration can explain two-thirds of protein abundance variation in a human cell line.

Authors:  Christine Vogel; Raquel de Sousa Abreu; Daijin Ko; Shu-Yun Le; Bruce A Shapiro; Suzanne C Burns; Devraj Sandhu; Daniel R Boutz; Edward M Marcotte; Luiz O Penalva
Journal:  Mol Syst Biol       Date:  2010-08-24       Impact factor: 11.429

9.  Deep proteome and transcriptome mapping of a human cancer cell line.

Authors:  Nagarjuna Nagaraj; Jacek R Wisniewski; Tamar Geiger; Juergen Cox; Martin Kircher; Janet Kelso; Svante Pääbo; Matthias Mann
Journal:  Mol Syst Biol       Date:  2011-11-08       Impact factor: 11.429

10.  Protein synthesis rate is the predominant regulator of protein expression during differentiation.

Authors:  Anders R Kristensen; Joerg Gsponer; Leonard J Foster
Journal:  Mol Syst Biol       Date:  2013       Impact factor: 11.429

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.