| Literature DB >> 34663427 |
Christopher Yogodzinski1,2,3, Abolfazl Arab4,5,6, Justin R Pritchard7, Hani Goodarzi4,5,6, Luke A Gilbert8,9,10.
Abstract
BACKGROUND: Advances in cancer biology are increasingly dependent on integration of heterogeneous datasets. Large-scale efforts have systematically mapped many aspects of cancer cell biology; however, it remains challenging for individual scientists to effectively integrate and understand this data.Entities:
Keywords: Data integration; Functional genomics; Multiomics; Synthetic lethality
Mesh:
Year: 2021 PMID: 34663427 PMCID: PMC8524992 DOI: 10.1186/s13073-021-00987-8
Source DB: PubMed Journal: Genome Med ISSN: 1756-994X Impact factor: 15.266
Fig. 4CanDI enables rapid integration of external datasets to reveal immunotherapy targets. A A graph showing genes that are upregulated in KRAS-mutant NSCLC cell lines relative to primary human bronchial epithelial cells. A cell membrane protein localization score is shown for each gene. Higher protein localization scores indicate higher confidence annotations. B A scatter plot showing gene expression for genes that encode cell surface proteins in KRAS-mutant NSCLC cell lines and primary human bronchial epithelial cells. N = 46 for KRAS-mutant NSCLC cell lines and N = 4 for primary human bronchial epithelial cells. C A graph showing genes that are upregulated in EGFR-mutant NSCLC cell lines relative to primary human bronchial epithelial cells. A cell membrane protein localization score is shown for each gene. Higher protein localization scores indicate higher confidence annotations. D A scatter plot showing gene expression for genes that encode cell surface proteins in EGFR-mutant NSCLC cell lines and primary human bronchial epithelial cells. N = 21 for EGFR-mutant NSCLC cell lines and N = 4 for primary human bronchial epithelial cells. E A graph showing genes that are upregulated in NSCLC cell lines relative to primary human bronchial epithelial cells. A cell membrane protein localization score is shown for each gene. Higher protein localization scores indicate higher confidence annotations. F A scatter plot showing gene expression for genes that encode cell surface proteins in NSCLC cell lines and primary human bronchial epithelial cells. N = 141 for NSCLC cell lines and N = 4 for primary human bronchial epithelial cells
Fig. 1CanDI integrates multiomics data enabling discovery. A A schematic showing human cell models integrated by CanDI. B A schematic illustrating types of data integrated by CanDI. C A cartoon of a genome-scale CRISPR screen to identify genes that modulate response to PARP inhibition by Olaparib. D A schematic depicting data feature inputs parsed by CanDI. E Essentiality of Fanconi anemia genes in ovarian and breast cancer cell lines separated by BRCA mutation status. Gene essentiality (CERES gene effect score) is displayed by a heat map. N = 6 BRCA1-mutant ovarian cancer, N = 51 BRCA1-wildtype ovarian cancer, N = 4 BRCA1-mutant breast cancer, N = 39 BRCA1-wildtype breast cancer
Fig. 2CanDI enables a global analysis of conditional essentiality in cancer. A Average gene essentiality for KRAS and EGFR in groups of NSCLC cell lines stratified by KRAS mutation status. N = 61 for KRAS-wildtype shown in blue. N = 33 for KRAS-mutant shown in blue. Gene essentiality is the average gene effect (CERES score) across all cell lines for the given group. B Average gene essentiality for KRAS and EGFR in groups of NSCLC cell lines stratified by EGFR mutation status. N = 77 for EGFR-wildtype shown in blue, N = 17 for EGFR-mutant shown in blue. Gene essentiality is the average gene effect (CERES score) of all cell lines within a given group. C P values from chi-square tests of gene essentiality and nonsense mutations. D P values from chi-square tests of gene essentiality and missense mutations. E A scatter plot showing effect size of the change in gene essentiality with select missense mutations and the − Log10(P value) of each essentiality/mutation pair. F A scatter plot showing effect size of the change in gene essentiality with select nonsense mutations and the − Log10(P value) of each essentiality/mutation pair. G A scatter plot showing effect size of the change in gene essentiality with all mutations and the − Log10(P value) of each essentiality/mutation pair
Fig. 3CanDI reveals female and male context-specific essential genes. A Differential gene expression and differential gene essentiality in male and female CRC cell lines. For differential expression analysis: N = 24 male cell lines and N = 14 female cell lines. For differential essentiality analysis: and N = 18 male cell lines and N = 9 female cell lines. B The distribution of gene effect CERES scores in male and female CRC cell lines. The top seven and bottom three differentially essential genes are shown in violin plots split by the sex of the cell lines. C Differential gene expression and differential gene essentiality in male and female NSCLC cell lines. For differential expression analysis: N = 30 male cell lines and N = 13 female cell lines. For differential essentiality analysis and N = 23 male cell lines and N = 9 female cell lines. D The distribution of gene effect CERES scores in male and female NSCLC cell lines. The top seven and bottom three differentially essential genes are shown in violin plots split by the sex of the cell lines. E Differential gene expression and differential gene essentiality in male and female PDAC cancer cell lines. For differential expression analysis: N = 26 male cell lines and N = 19 female cell lines. For differential essentiality analysis: N = 24 male cell lines and N = 12 female cell lines. F The distribution of gene essentiality CERES scores in male and female PDAC cell lines. The top seven and bottom three differentially essential genes are shown in violin plots split by the sex of the cell lines