Weiwei Zhang1, Hao Wu2, Ziyi Li3. 1. School of Science, East China University of Technology, Nanchang, Jiangxi 330013, China. 2. Department of Biostatistics and Bioinformatics, Emory University, Atlanta, GA 30322, USA. 3. Department of Biostatistics, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA.
Abstract
MOTIVATION: It is a common practice in epigenetics research to profile DNA methylation on tissue samples, which is usually a mixture of different cell types. To properly account for the mixture, estimating cell compositions has been recognized as an important first step. Many methods were developed for quantifying cell compositions from DNA methylation data, but they mostly have limited applications due to lack of reference or prior information. RESULTS: We develop Tsisal, a novel complete deconvolution method which accurately estimate cell compositions from DNA methylation data without any prior knowledge of cell types or their proportions. Tsisal is a full pipeline to estimate number of cell types, cell compositions and identify cell-type-specific CpG sites. It can also assign cell type labels when (full or part of) reference panel is available. Extensive simulation studies and analyses of seven real datasets demonstrate the favorable performance of our proposed method compared with existing deconvolution methods serving similar purpose. AVAILABILITY AND IMPLEMENTATION: The proposed method Tsisal is implemented as part of the R/Bioconductor package TOAST at https://bioconductor.org/packages/TOAST. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
MOTIVATION: It is a common practice in epigenetics research to profile DNA methylation on tissue samples, which is usually a mixture of different cell types. To properly account for the mixture, estimating cell compositions has been recognized as an important first step. Many methods were developed for quantifying cell compositions from DNA methylation data, but they mostly have limited applications due to lack of reference or prior information. RESULTS: We develop Tsisal, a novel complete deconvolution method which accurately estimate cell compositions from DNA methylation data without any prior knowledge of cell types or their proportions. Tsisal is a full pipeline to estimate number of cell types, cell compositions and identify cell-type-specific CpG sites. It can also assign cell type labels when (full or part of) reference panel is available. Extensive simulation studies and analyses of seven real datasets demonstrate the favorable performance of our proposed method compared with existing deconvolution methods serving similar purpose. AVAILABILITY AND IMPLEMENTATION: The proposed method Tsisal is implemented as part of the R/Bioconductor package TOAST at https://bioconductor.org/packages/TOAST. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Authors: Vitor Onuchic; Ryan J Hartmaier; David N Boone; Michael L Samuels; Ronak Y Patel; Wendy M White; Vesna D Garovic; Steffi Oesterreich; Matt E Roth; Adrian V Lee; Aleksandar Milosavljevic Journal: Cell Rep Date: 2016-11-15 Impact factor: 9.423
Authors: E Riboli; K J Hunt; N Slimani; P Ferrari; T Norat; M Fahey; U R Charrondière; B Hémon; C Casagrande; J Vignat; K Overvad; A Tjønneland; F Clavel-Chapelon; A Thiébaut; J Wahrendorf; H Boeing; D Trichopoulos; A Trichopoulou; P Vineis; D Palli; H B Bueno-De-Mesquita; P H M Peeters; E Lund; D Engeset; C A González; A Barricarte; G Berglund; G Hallmans; N E Day; T J Key; R Kaaks; R Saracci Journal: Public Health Nutr Date: 2002-12 Impact factor: 4.022
Authors: Scott L Carter; Kristian Cibulskis; Elena Helman; Aaron McKenna; Hui Shen; Travis Zack; Peter W Laird; Robert C Onofrio; Wendy Winckler; Barbara A Weir; Rameen Beroukhim; David Pellman; Douglas A Levine; Eric S Lander; Matthew Meyerson; Gad Getz Journal: Nat Biotechnol Date: 2012-05 Impact factor: 54.908
Authors: Sean P Pitroda; Bassam T Wakim; Ravi F Sood; Mara G Beveridge; Michael A Beckett; Dhara M MacDermed; Ralph R Weichselbaum; Nikolai N Khodarev Journal: BMC Med Date: 2009-11-05 Impact factor: 8.775
Authors: Chen Lyu; Manyan Huang; Nianjun Liu; Zhongxue Chen; Philip J Lupo; Benjamin Tycko; John S Witte; Charlotte A Hobbs; Ming Li Journal: Bioinformatics Date: 2022-07-04 Impact factor: 6.931