Literature DB >> 33752591

Nonlinear ridge regression improves cell-type-specific differential expression analysis.

Fumihiko Takeuchi1, Norihiro Kato2.   

Abstract

BACKGROUND: Epigenome-wide association studies (EWAS) and differential gene expression analyses are generally performed on tissue samples, which consist of multiple cell types. Cell-type-specific effects of a trait, such as disease, on the omics expression are of interest but difficult or costly to measure experimentally. By measuring omics data for the bulk tissue, cell type composition of a sample can be inferred statistically. Subsequently, cell-type-specific effects are estimated by linear regression that includes terms representing the interaction between the cell type proportions and the trait. This approach involves two issues, scaling and multicollinearity.
RESULTS: First, although cell composition is analyzed in linear scale, differential methylation/expression is analyzed suitably in the logit/log scale. To simultaneously analyze two scales, we applied nonlinear regression. Second, we show that the interaction terms are highly collinear, which is obstructive to ordinary regression. To cope with the multicollinearity, we applied ridge regularization. In simulated data, nonlinear ridge regression attained well-balanced sensitivity, specificity and precision. Marginal model attained the lowest precision and highest sensitivity and was the only algorithm to detect weak signal in real data.
CONCLUSION: Nonlinear ridge regression performed cell-type-specific association test on bulk omics data with well-balanced performance. The omicwas package for R implements nonlinear ridge regression for cell-type-specific EWAS, differential gene expression and QTL analyses. The software is freely available from https://github.com/fumi-github/omicwas.

Entities:  

Keywords:  Cell type; Differential gene expression analysis; Epigenome-wide association study; Nonlinear regression; Ridge regression; eQTL; mQTL

Mesh:

Year:  2021        PMID: 33752591      PMCID: PMC7986289          DOI: 10.1186/s12859-021-03982-3

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  32 in total

1.  Count-based differential expression analysis of RNA sequencing data using R and Bioconductor.

Authors:  Simon Anders; Davis J McCarthy; Yunshun Chen; Michal Okoniewski; Gordon K Smyth; Wolfgang Huber; Mark D Robinson
Journal:  Nat Protoc       Date:  2013-08-22       Impact factor: 13.491

2.  Robust ridge regression estimators for nonlinear models with applications to high throughput screening assay data.

Authors:  Changwon Lim
Journal:  Stat Med       Date:  2014-12-10       Impact factor: 2.373

Review 3.  Epigenome-wide association studies for common human diseases.

Authors:  Vardhman K Rakyan; Thomas A Down; David J Balding; Stephan Beck
Journal:  Nat Rev Genet       Date:  2011-07-12       Impact factor: 53.242

4.  GLINT: a user-friendly toolset for the analysis of high-throughput DNA-methylation array data.

Authors:  Elior Rahmani; Reut Yedidim; Liat Shenhav; Regev Schweiger; Omer Weissbrod; Noah Zaitlen; Eran Halperin
Journal:  Bioinformatics       Date:  2017-06-15       Impact factor: 6.937

5.  Age-related variations in the methylome associated with gene expression in human monocytes and T cells.

Authors:  Lindsay M Reynolds; Jackson R Taylor; Jingzhong Ding; Kurt Lohman; Craig Johnson; David Siscovick; Gregory Burke; Wendy Post; Steven Shea; David R Jacobs; Hendrik Stunnenberg; Stephen B Kritchevsky; Ina Hoeschele; Charles E McCall; David Herrington; Russell P Tracy; Yongmei Liu
Journal:  Nat Commun       Date:  2014-11-18       Impact factor: 14.919

6.  Rheumatoid Arthritis Naive T Cells Share Hypermethylation Sites With Synoviocytes.

Authors:  Brooke Rhead; Calliope Holingue; Michael Cole; Xiaorong Shao; Hong L Quach; Diana Quach; Khooshbu Shah; Elizabeth Sinclair; John Graf; Thomas Link; Ruby Harrison; Elior Rahmani; Eran Halperin; Wei Wang; Gary S Firestein; Lisa F Barcellos; Lindsey A Criswell
Journal:  Arthritis Rheumatol       Date:  2017-03       Impact factor: 10.995

7.  Comprehensive evaluation of transcriptome-based cell-type quantification methods for immuno-oncology.

Authors:  Gregor Sturm; Francesca Finotello; Florent Petitprez; Jitao David Zhang; Jan Baumbach; Wolf H Fridman; Markus List; Tatsiana Aneichyk
Journal:  Bioinformatics       Date:  2019-07-15       Impact factor: 6.931

8.  DNA methylation arrays as surrogate measures of cell mixture distribution.

Authors:  Eugene Andres Houseman; William P Accomando; Devin C Koestler; Brock C Christensen; Carmen J Marsit; Heather H Nelson; John K Wiencke; Karl T Kelsey
Journal:  BMC Bioinformatics       Date:  2012-05-08       Impact factor: 3.169

9.  Improving cell mixture deconvolution by identifying optimal DNA methylation libraries (IDOL).

Authors:  Devin C Koestler; Meaghan J Jones; Joseph Usset; Brock C Christensen; Rondi A Butler; Michael S Kobor; John K Wiencke; Karl T Kelsey
Journal:  BMC Bioinformatics       Date:  2016-03-08       Impact factor: 3.169

10.  Genetic effects on gene expression across human tissues.

Authors:  Alexis Battle; Christopher D Brown; Barbara E Engelhardt; Stephen B Montgomery
Journal:  Nature       Date:  2017-10-11       Impact factor: 49.962

View more
  1 in total

1.  Characterizing the properties of bisulfite sequencing data: maximizing power and sensitivity to identify between-group differences in DNA methylation.

Authors:  Jonathan Mill; Eilis Hannon; Dorothea Seiler Vellame; Isabel Castanho; Aisha Dahir
Journal:  BMC Genomics       Date:  2021-06-15       Impact factor: 3.969

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.