Literature DB >> 33432005

Data-driven detection of subtype-specific differentially expressed genes.

Lulu Chen1, Yingzhou Lu1, Chiung-Ting Wu1, Robert Clarke2, Guoqiang Yu1, Jennifer E Van Eyk3, David M Herrington4, Yue Wang5.   

Abstract

Among multiple subtypes of tissue or cell, subtype-specific differentially-expressed genes (SDEGs) are defined as being most-upregulated in only one subtype but not in any other. Detecting SDEGs plays a critical role in the molecular characterization and deconvolution of multicellular complex tissues. Classic differential analysis assumes a null hypothesis whose test statistic is not subtype-specific, thus can produce a high false positive rate and/or lower detection power. Here we first introduce a One-Versus-Everyone Fold Change (OVE-FC) test for detecting SDEGs. We then propose a scaled test statistic (OVE-sFC) for assessing the statistical significance of SDEGs that applies a mixture null distribution model and a tailored permutation test. The OVE-FC/sFC test was validated on both type 1 error rate and detection power using extensive simulation data sets generated from real gene expression profiles of purified subtype samples. The OVE-FC/sFC test was then applied to two benchmark gene expression data sets of purified subtype samples and detected many known or previously unknown SDEGs. Subsequent supervised deconvolution results on synthesized bulk expression data, obtained using the SDEGs detected from the independent purified expression data by the OVE-FC/sFC test, showed superior performance in deconvolution accuracy when compared with popular peer methods.

Entities:  

Year:  2021        PMID: 33432005      PMCID: PMC7801594          DOI: 10.1038/s41598-020-79704-1

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.379


  30 in total

1.  Iterative normalization of cDNA microarray data.

Authors:  Yue Wang; Jianping Lu; Richard Lee; Zhiping Gu; Robert Clarke
Journal:  IEEE Trans Inf Technol Biomed       Date:  2002-03

2.  PUGSVM: a caBIG™ analytical tool for multiclass gene selection and predictive classification.

Authors:  Guoqiang Yu; Huai Li; Sook Ha; Ie-Ming Shih; Robert Clarke; Eric P Hoffman; Subha Madhavan; Jianhua Xuan; Yue Wang
Journal:  Bioinformatics       Date:  2010-12-24       Impact factor: 6.937

3.  Analysis of variance: is there a difference in means and what does it mean?

Authors:  Lillian S Kao; Charles E Green
Journal:  J Surg Res       Date:  2007-10-22       Impact factor: 2.192

4.  fdrtool: a versatile R package for estimating local and tail area-based false discovery rates.

Authors:  Korbinian Strimmer
Journal:  Bioinformatics       Date:  2008-04-25       Impact factor: 6.937

5.  Proteomic Architecture of Human Coronary and Aortic Atherosclerosis.

Authors:  David M Herrington; Chunhong Mao; Sarah J Parker; Zongming Fu; Guoqiang Yu; Lulu Chen; Vidya Venkatraman; Yi Fu; Yizhi Wang; Timothy D Howard; Goo Jun; Caroline F Zhao; Yongmei Liu; Georgia Saylor; Weston R Spivia; Grace B Athas; Dana Troxclair; James E Hixson; Richard S Vander Heide; Yue Wang; Jennifer E Van Eyk
Journal:  Circulation       Date:  2018-06-19       Impact factor: 29.690

6.  limma powers differential expression analyses for RNA-sequencing and microarray studies.

Authors:  Matthew E Ritchie; Belinda Phipson; Di Wu; Yifang Hu; Charity W Law; Wei Shi; Gordon K Smyth
Journal:  Nucleic Acids Res       Date:  2015-01-20       Impact factor: 16.971

7.  Differential expression analysis of multifactor RNA-Seq experiments with respect to biological variation.

Authors:  Davis J McCarthy; Yunshun Chen; Gordon K Smyth
Journal:  Nucleic Acids Res       Date:  2012-01-28       Impact factor: 16.971

8.  A self-directed method for cell-type identification and separation of gene expression microarrays.

Authors:  Neta S Zuckerman; Yair Noam; Andrea J Goldsmith; Peter P Lee
Journal:  PLoS Comput Biol       Date:  2013-08-22       Impact factor: 4.475

9.  xCell: digitally portraying the tissue cellular heterogeneity landscape.

Authors:  Dvir Aran; Zicheng Hu; Atul J Butte
Journal:  Genome Biol       Date:  2017-11-15       Impact factor: 13.583

10.  Mathematical modelling of transcriptional heterogeneity identifies novel markers and subpopulations in complex tissues.

Authors:  Niya Wang; Eric P Hoffman; Lulu Chen; Li Chen; Zhen Zhang; Chunyu Liu; Guoqiang Yu; David M Herrington; Robert Clarke; Yue Wang
Journal:  Sci Rep       Date:  2016-01-07       Impact factor: 4.379

View more
  2 in total

1.  swCAM: estimation of subtype-specific expressions in individual samples with unsupervised sample-wise deconvolution.

Authors:  Lulu Chen; Chiung-Ting Wu; Chia-Hsiang Lin; Rujia Dai; Chunyu Liu; Robert Clarke; Guoqiang Yu; Jennifer E Van Eyk; David M Herrington; Yue Wang
Journal:  Bioinformatics       Date:  2021-12-14       Impact factor: 6.937

2.  PASSer2.0: Accurate Prediction of Protein Allosteric Sites Through Automated Machine Learning.

Authors:  Sian Xiao; Hao Tian; Peng Tao
Journal:  Front Mol Biosci       Date:  2022-07-11
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.