| Literature DB >> 29949994 |
Francesca Petralia1,2, Li Wang1,2,3, Jie Peng4, Arthur Yan1,2, Jun Zhu1,2,3, Pei Wang1,2.
Abstract
Motivation: Tumor tissue samples often contain an unknown fraction of stromal cells. This problem is widely known as tumor purity heterogeneity (TPH) was recently recognized as a severe issue in omics studies. Specifically, if TPH is ignored when inferring co-expression networks, edges are likely to be estimated among genes with mean shift between non-tumor- and tumor cells rather than among gene pairs interacting with each other in tumor cells. To address this issue, we propose Tumor Specific Net (TSNet), a new method which constructs tumor-cell specific gene/protein co-expression networks based on gene/protein expression profiles of tumor tissues. TSNet treats the observed expression profile as a mixture of expressions from different cell types and explicitly models tumor purity percentage in each tumor sample.Entities:
Mesh:
Year: 2018 PMID: 29949994 PMCID: PMC6022554 DOI: 10.1093/bioinformatics/bty280
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 2.Correlation between prior estimate and true purity (left plot) and correlation between estimated and true purity (right plot) for each value of δ
Fig. 3.Performance of mixNet and TSNet evaluated on synthetic data for partially overlapping tumor and normal networks and independently generated tumor and normal networks. (a, b) Average of ROC curves over 10 different replicates involving 1000 genes for different sample size, i.e. (N = 200, N = 400) resulting from mixNet and TSNet. ROC curves were obtained by varying penalty parameters which control the dimension of the estimated co-expression networks. (c) Boxplot of true-positive rate and false positive rate over 10 replicates for the model minimizing the Bayesian information criteria. Performance of both mixNet and TSNet are evaluated for simulation scenarios involving different number of genes (p = 500, p = 1000) and different number of samples (N = 200, N = 400)
Fig. 4.Pearson’s correlation of tumor purity from TSNet, ABSOLUTE (Carter ) and ESTIMATE (Yoshihara ) with methylation-based estimates of the fraction of leukocytes in tumor tissue
Fig. 5.(a) Biggest independent component in TSNet-tumor network. The number of edges specific to TSNet-tumor network is 366 (red); while the number of edges shared with mixNet network is 341 (blue). (b) Hub genes (i.e. highly connected node) in TSNet-networks (TSNet-tumor and TSNet-normal) which are poorly connected in mixNet network. For each gene, the red bar shows the number of connecting edges in TSNet-tumor plus the number of connecting edges in TSNet-normal divided by the sum of total number of edges of the two networks; while the blue bar shows the number of connecting edges in mixNet network divided by the total number of edges in the network. (c) Enriched pathways in the second-order neighborhood of tumor repressor HIC1 for TSNet-tumor and TSNet-normal. Enrichment analysis was carried out using the software Enrichr (Chen )
Fig. 6.Pathway enrichment of hub-structure. (a) Degree plot for two enriched pathways in mixNet and TSNet-Normal, i.e. ‘Kegg ECM Receptor Interaction’ and stromal genes. The degree of gene g in a given network is defined as the number of connecting edges of gene g. To allow comparison across different networks, degrees have been normalized by dividing them for the total number of edges in the network. (b) Number of enriched categories for TSNet-Tumor, TSNet-Normal and mixNet networks. List of pathways enriched in TSNet-tumor but not in mixNet and vice versa
Fig. 7.Pathway enrichment of topological structure. (a) Number of enriched pathways at 1% false discovery rate cut-off for TSNet-tumor, TSNet-normal and mixNet networks. (b) Top enriched pathways in TSNet-tumor which are not enriched in mixNet. (c) Top enriched pathways in mixNet which are not enriched in TSNet-tumor