Maria Chikina1, Elena Zaslavsky1, Stuart C Sealfon1. 1. Department of Computational and Systems Biology, University of Pittsburgh, Pittsburgh, PA 15217, USA and Department of Neurology, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA.
Abstract
MOTIVATION: Identifying alterations in gene expression associated with different clinical states is important for the study of human biology. However, clinical samples used in gene expression studies are often derived from heterogeneous mixtures with variable cell-type composition, complicating statistical analysis. Considerable effort has been devoted to modeling sample heterogeneity, and presently, there are many methods that can estimate cell proportions or pure cell-type expression from mixture data. However, there is no method that comprehensively addresses mixture analysis in the context of differential expression without relying on additional proportion information, which can be inaccurate and is frequently unavailable. RESULTS: In this study, we consider a clinically relevant situation where neither accurate proportion estimates nor pure cell expression is of direct interest, but where we are rather interested in detecting and interpreting relevant differential expression in mixture samples. We develop a method, Cell-type COmputational Differential Estimation (CellCODE), that addresses the specific statistical question directly, without requiring a physical model for mixture components. Our approach is based on latent variable analysis and is computationally transparent; it requires no additional experimental data, yet outperforms existing methods that use independent proportion measurements. CellCODE has few parameters that are robust and easy to interpret. The method can be used to track changes in proportion, improve power to detect differential expression and assign the differentially expressed genes to the correct cell type.
MOTIVATION: Identifying alterations in gene expression associated with different clinical states is important for the study of human biology. However, clinical samples used in gene expression studies are often derived from heterogeneous mixtures with variable cell-type composition, complicating statistical analysis. Considerable effort has been devoted to modeling sample heterogeneity, and presently, there are many methods that can estimate cell proportions or pure cell-type expression from mixture data. However, there is no method that comprehensively addresses mixture analysis in the context of differential expression without relying on additional proportion information, which can be inaccurate and is frequently unavailable. RESULTS: In this study, we consider a clinically relevant situation where neither accurate proportion estimates nor pure cell expression is of direct interest, but where we are rather interested in detecting and interpreting relevant differential expression in mixture samples. We develop a method, Cell-type COmputational Differential Estimation (CellCODE), that addresses the specific statistical question directly, without requiring a physical model for mixture components. Our approach is based on latent variable analysis and is computationally transparent; it requires no additional experimental data, yet outperforms existing methods that use independent proportion measurements. CellCODE has few parameters that are robust and easy to interpret. The method can be used to track changes in proportion, improve power to detect differential expression and assign the differentially expressed genes to the correct cell type.
Authors: Shai S Shen-Orr; Robert Tibshirani; Purvesh Khatri; Dale L Bodian; Frank Staedtler; Nicholas M Perry; Trevor Hastie; Minnie M Sarwal; Mark M Davis; Atul J Butte Journal: Nat Methods Date: 2010-03-07 Impact factor: 28.547
Authors: Noa Novershtern; Aravind Subramanian; Lee N Lawton; Raymond H Mak; W Nicholas Haining; Marie E McConkey; Naomi Habib; Nir Yosef; Cindy Y Chang; Tal Shay; Garrett M Frampton; Adam C B Drake; Ilya Leskov; Bjorn Nilsson; Fred Preffer; David Dombkowski; John W Evans; Ted Liefeld; John S Smutko; Jianzhu Chen; Nir Friedman; Richard A Young; Todd R Golub; Aviv Regev; Benjamin L Ebert Journal: Cell Date: 2011-01-21 Impact factor: 41.582
Authors: Kate L Jeffrey; Tilman Brummer; Michael S Rolph; Sue M Liu; Nuria A Callejas; Raelene J Grumont; Corine Gillieron; Fabienne Mackay; Shane Grey; Montserrat Camps; Christian Rommel; Steve D Gerondakis; Charles R Mackay Journal: Nat Immunol Date: 2006-02-12 Impact factor: 25.606
Authors: Helder I Nakaya; Jens Wrammert; Eva K Lee; Luigi Racioppi; Stephanie Marie-Kunze; W Nicholas Haining; Anthony R Means; Sudhir P Kasturi; Nooruddin Khan; Gui-Mei Li; Megan McCausland; Vibhu Kanchan; Kenneth E Kokko; Shuzhao Li; Rivka Elbein; Aneesh K Mehta; Alan Aderem; Kanta Subbarao; Rafi Ahmed; Bali Pulendran Journal: Nat Immunol Date: 2011-07-10 Impact factor: 25.606
Authors: Dirk Repsilber; Sabine Kern; Anna Telaar; Gerhard Walzl; Gillian F Black; Joachim Selbig; Shreemanta K Parida; Stefan H E Kaufmann; Marc Jacobsen Journal: BMC Bioinformatics Date: 2010-01-14 Impact factor: 3.169
Authors: Nadine Provençal; Janine Arloth; Annamaria Cattaneo; Christoph Anacker; Nadia Cattane; Tobias Wiechmann; Simone Röh; Maik Ködel; Torsten Klengel; Darina Czamara; Nikola S Müller; Jari Lahti; Katri Räikkönen; Carmine M Pariante; Elisabeth B Binder Journal: Proc Natl Acad Sci U S A Date: 2019-08-09 Impact factor: 11.205
Authors: Chang Liu; Maria Chikina; Rahul Deshpande; Ashley V Menk; Ting Wang; Tracy Tabib; Erin A Brunazzi; Kate M Vignali; Ming Sun; Donna B Stolz; Robert A Lafyatis; Wei Chen; Greg M Delgoffe; Creg J Workman; Stacy G Wendell; Dario A A Vignali Journal: Immunity Date: 2019-07-23 Impact factor: 31.745
Authors: Jaclyn N Taroni; Peter C Grayson; Qiwen Hu; Sean Eddy; Matthias Kretzler; Peter A Merkel; Casey S Greene Journal: Cell Syst Date: 2019-05-22 Impact factor: 10.304
Authors: Matthew C Altman; Elizabeth Whalen; Alkis Togias; George T O'Connor; Leonard B Bacharier; Gordon R Bloomberg; Meyer Kattan; Robert A Wood; Scott Presnell; Petra LeBeau; Katy Jaffee; Cynthia M Visness; William W Busse; James E Gern Journal: J Allergy Clin Immunol Date: 2018-03-05 Impact factor: 10.793
Authors: Paolo Cravedi; Miguel Fribourg; Weijia Zhang; Zhengzi Yi; Elena Zaslavsky; German Nudelman; Lisa Anderson; Susan Hartzell; Sophie Brouard; Peter S Heeger Journal: Am J Transplant Date: 2020-05-27 Impact factor: 8.086
Authors: Anna B Morris; Clara R Farley; David F Pinelli; Layne E Adams; Mark S Cragg; Jeremy M Boss; Christopher D Scharer; Miguel Fribourg; Paolo Cravedi; Peter S Heeger; Mandy L Ford Journal: Immunity Date: 2020-01-14 Impact factor: 31.745
Authors: Hyun Jae Lee; Athina Georgiadou; Michael Walther; Davis Nwakanma; Lindsay B Stewart; Michael Levin; Thomas D Otto; David J Conway; Lachlan J Coin; Aubrey J Cunnington Journal: Sci Transl Med Date: 2018-06-27 Impact factor: 17.956