Weiwei Zhang [1,2], Hao Feng [3], Hao Wu [3], Xiaoqi Zheng [1]
1. Department of Mathematics, Shanghai Normal University, Shanghai 200234, China.
2. School of Science, East China University of Technology, Nanchang, Jiangxi 330013, China.
3. Department of Biostatistics and Bioinformatics, Rollins School of Public Health, Emory University, Atlanta, GA 30322, USA.
Abstract
MOTIVATION: Tumor sample classification has long been an important task in cancer research. Classifying tumors into subtypes greatly benefits therapeutic development and facilitates the application of precision medicine to patients. In practice, solid tumor tissue samples obtained in clinical settings are always mixtures of cancer and normal cells, so the data obtained from these samples are mixed signals. The 'tumor purity', or the percentage of cancer cells in a cancer tissue sample, will bias the clustering results if not properly accounted for.
RESULTS: In this article, we developed a model-based clustering method and an R function that use DNA methylation microarray data to infer tumor subtypes while accounting for tumor purity. Simulation studies and analyses of The Cancer Genome Atlas data demonstrate improved results compared with existing methods.
AVAILABILITY AND IMPLEMENTATION: InfiniumClust is part of the R package InfiniumPurify, which is freely available from CRAN (https://cran.r-project.org/web/packages/InfiniumPurify/index.html).
CONTACT: hao.wu@emory.edu or xqzheng@shnu.edu.cn.
SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.