| Literature DB >> 33230549 |
Xiaoyu Lu1, Szu-Wei Tu1, Wennan Chang2, Changlin Wan2, Jiashi Wang3, Yong Zang4, Baskar Ramdas5, Reuben Kapur5, Xiongbin Lu6, Sha Cao7, Chi Zhang8.
Abstract
Deconvolution of mouse transcriptomic data is challenged by the fact that mouse models carry various genetic and physiological perturbations, making it questionable to assume fixed cell types and cell type marker genes for different data set scenarios. We developed a Semi-Supervised Mouse data Deconvolution (SSMD) method to study the mouse tissue microenvironment. SSMD is featured by (i) a novel nonparametric method to discover data set-specific cell type signature genes; (ii) a community detection approach for fixing cell types and their marker genes; (iii) a constrained matrix decomposition method to solve cell type relative proportions that is robust to diverse experimental platforms. In summary, SSMD addressed several key challenges in the deconvolution of mouse tissue data, including: (i) varied cell types and marker genes caused by highly divergent genotypic and phenotypic conditions of mouse experiment; (ii) diverse experimental platforms of mouse transcriptomics data; (iii) small sample size and limited training data source and (iv) capable to estimate the proportion of 35 cell types in blood, inflammatory, central nervous or hematopoietic systems. In silico and experimental validation of SSMD demonstrated its high sensitivity and accuracy in identifying (sub) cell types and predicting cell proportions comparing with state-of-the-arts methods. A user-friendly R package and a web server of SSMD are released via https://github.com/xiaoyulu95/SSMD.Entities:
Keywords: cancer microenvironment; mouse omics data; semi-supervised learning; tissue data deconvolution
Mesh:
Substances:
Year: 2021 PMID: 33230549 PMCID: PMC8294548 DOI: 10.1093/bib/bbaa307
Source DB: PubMed Journal: Brief Bioinform ISSN: 1467-5463 Impact factor: 13.994