Fumihiko Takeuchi1, Norihiro Kato2. 1. Department of Gene Diagnostics and Therapeutics, Research Institute, National Center for Global Health and Medicine (NCGM), 1-21-1 Toyama, Shinjuku-ku, Tokyo, 162-8655, Japan. fumihiko@takeuchi.name. 2. Department of Gene Diagnostics and Therapeutics, Research Institute, National Center for Global Health and Medicine (NCGM), 1-21-1 Toyama, Shinjuku-ku, Tokyo, 162-8655, Japan.
Abstract
BACKGROUND: Epigenome-wide association studies (EWAS) and differential gene expression analyses are generally performed on tissue samples, which consist of multiple cell types. Cell-type-specific effects of a trait, such as disease, on the omics expression are of interest but difficult or costly to measure experimentally. By measuring omics data for the bulk tissue, cell type composition of a sample can be inferred statistically. Subsequently, cell-type-specific effects are estimated by linear regression that includes terms representing the interaction between the cell type proportions and the trait. This approach involves two issues, scaling and multicollinearity. RESULTS: First, although cell composition is analyzed in linear scale, differential methylation/expression is analyzed suitably in the logit/log scale. To simultaneously analyze two scales, we applied nonlinear regression. Second, we show that the interaction terms are highly collinear, which is obstructive to ordinary regression. To cope with the multicollinearity, we applied ridge regularization. In simulated data, nonlinear ridge regression attained well-balanced sensitivity, specificity and precision. Marginal model attained the lowest precision and highest sensitivity and was the only algorithm to detect weak signal in real data. CONCLUSION: Nonlinear ridge regression performed cell-type-specific association test on bulk omics data with well-balanced performance. The omicwas package for R implements nonlinear ridge regression for cell-type-specific EWAS, differential gene expression and QTL analyses. The software is freely available from https://github.com/fumi-github/omicwas.
BACKGROUND: Epigenome-wide association studies (EWAS) and differential gene expression analyses are generally performed on tissue samples, which consist of multiple cell types. Cell-type-specific effects of a trait, such as disease, on the omics expression are of interest but difficult or costly to measure experimentally. By measuring omics data for the bulk tissue, cell type composition of a sample can be inferred statistically. Subsequently, cell-type-specific effects are estimated by linear regression that includes terms representing the interaction between the cell type proportions and the trait. This approach involves two issues, scaling and multicollinearity. RESULTS: First, although cell composition is analyzed in linear scale, differential methylation/expression is analyzed suitably in the logit/log scale. To simultaneously analyze two scales, we applied nonlinear regression. Second, we show that the interaction terms are highly collinear, which is obstructive to ordinary regression. To cope with the multicollinearity, we applied ridge regularization. In simulated data, nonlinear ridge regression attained well-balanced sensitivity, specificity and precision. Marginal model attained the lowest precision and highest sensitivity and was the only algorithm to detect weak signal in real data. CONCLUSION: Nonlinear ridge regression performed cell-type-specific association test on bulk omics data with well-balanced performance. The omicwas package for R implements nonlinear ridge regression for cell-type-specific EWAS, differential gene expression and QTL analyses. The software is freely available from https://github.com/fumi-github/omicwas.
Authors: Simon Anders; Davis J McCarthy; Yunshun Chen; Michal Okoniewski; Gordon K Smyth; Wolfgang Huber; Mark D Robinson Journal: Nat Protoc Date: 2013-08-22 Impact factor: 13.491
Authors: Lindsay M Reynolds; Jackson R Taylor; Jingzhong Ding; Kurt Lohman; Craig Johnson; David Siscovick; Gregory Burke; Wendy Post; Steven Shea; David R Jacobs; Hendrik Stunnenberg; Stephen B Kritchevsky; Ina Hoeschele; Charles E McCall; David Herrington; Russell P Tracy; Yongmei Liu Journal: Nat Commun Date: 2014-11-18 Impact factor: 14.919
Authors: Brooke Rhead; Calliope Holingue; Michael Cole; Xiaorong Shao; Hong L Quach; Diana Quach; Khooshbu Shah; Elizabeth Sinclair; John Graf; Thomas Link; Ruby Harrison; Elior Rahmani; Eran Halperin; Wei Wang; Gary S Firestein; Lisa F Barcellos; Lindsey A Criswell Journal: Arthritis Rheumatol Date: 2017-03 Impact factor: 10.995
Authors: Gregor Sturm; Francesca Finotello; Florent Petitprez; Jitao David Zhang; Jan Baumbach; Wolf H Fridman; Markus List; Tatsiana Aneichyk Journal: Bioinformatics Date: 2019-07-15 Impact factor: 6.931
Authors: Eugene Andres Houseman; William P Accomando; Devin C Koestler; Brock C Christensen; Carmen J Marsit; Heather H Nelson; John K Wiencke; Karl T Kelsey Journal: BMC Bioinformatics Date: 2012-05-08 Impact factor: 3.169
Authors: Devin C Koestler; Meaghan J Jones; Joseph Usset; Brock C Christensen; Rondi A Butler; Michael S Kobor; John K Wiencke; Karl T Kelsey Journal: BMC Bioinformatics Date: 2016-03-08 Impact factor: 3.169