| Literature DB >> 33073517 |
Sumana Sharma1,2, Cansu Dincer1, Paula Weidemüller1, Gavin J Wright2, Evangelia Petsalaki1.
Abstract
An emerging theme from large-scale genetic screens that identify genes essential for cell fitness is that essentiality of a given gene is highly context-specific. Identification of such contexts could be the key to defining gene function and also to develop novel therapeutic interventions. Here, we present Context-specific Essentiality Network-tools (CEN-tools), a website and python package, in which users can interrogate the essentiality of a gene from large-scale genome-scale CRISPR screens in a number of biological contexts including tissue of origin, mutation profiles, expression levels and drug responses. We show that CEN-tools is suitable for the systematic identification of genetic dependencies and for more targeted queries. The associations between genes and a given context are represented as dependency networks (CENs), and we demonstrate the utility of these networks in elucidating novel gene functions. In addition, we integrate the dependency networks with existing protein-protein interaction networks to reveal context-dependent essential cellular pathways in cancer cells. Together, we demonstrate the applicability of CEN-tools in aiding the current efforts to define the human cellular dependency map.Entities:
Keywords: zzm321990CRISPRzzm321990; NRAS-mutant melanoma; context-specific essentiality; networks; omics integration
Mesh:
Substances:
Year: 2020 PMID: 33073517 PMCID: PMC7569414 DOI: 10.15252/msb.20209698
Source DB: PubMed Journal: Mol Syst Biol ISSN: 1744-4292 Impact factor: 13.068
Figure 1CEN‐tools identifies core essential genes involved in regular housekeeping functions of a cell
Venn diagram for prediction of core essential genes by CEN‐tools using both the BROAD and SANGER projects, and ADaM novel core fitness genes (Behan et al, 2019).
Box plot for Silhouette Scores (si‐scores) of core essential genes predicted by CEN‐tools and ADaM for the two projects BROAD and SANGER. For each project, the core essential genes were predicted separately. The centre of each box plot represents the sample median; and the ends of the box are the upper and lower quartiles; the whiskers extend to the smallest and largest observations within 1.5 times the interquartile range of the quartiles. Observations lying outside the whiskers are shown as individual data points. Box plots were drawn based on si‐scores from the following numbers of genes for each project: BROAD (ADaM, SANGER, BROAD: 146, SANGER, BROAD: 373, only BROAD: 423); SANGER (ADaM, SANGER, BROAD: 146, SANGER, BROAD: 373, only SANGER: 111).
Pie chart for percentages of core essential genes predicted by both ADaM and CEN‐tools and only CEN‐tools by using both projects. The pie chart on the right panel represents the percentages of novel core genes from CEN‐Tools, known core genes in pluripotent stem cells and core biological processes, and also therapeutically tractable genes annotated by the Project Score. (Behan et al, 2019).
Bar Plot of protein‐complex enrichment of core genes. Y axis represents the significance of the enrichment; colours of the bars represent the percentages of the novel core genes in the complexes. The red line represents the adjusted P‐value of 0.01.
Box plot for the log value of the basal expression levels of core genes in BAGEL (Hart & Moffat, 2016), ADaM, CEN‐tools predictions, and non‐essential BAGEL and “Not core” CEN‐tools genes. The centre of each box plot represents the sample median; the ends of the box are the upper and lower quartiles; the whiskers extend to the smallest and largest observations within 1.5 times the interquartile range of the quartiles. Observations lying outside the whiskers are shown as individual data points. Box plots were drawn based on expression from the following numbers of genes for each group: Not‐core : 15,330, BAGEL essential : 606, ADaM, SANGER & BROAD : 145, ADaM : 346, SANGER & BROAD : 367.
Figure 2Interrogation of contexts from CEN‐tools identifies novel gene–gene relationships
An example of a CEN from CEN‐tools. In this example, components of the BRAF co‐essential genes from PICKLES were extracted from the CEN‐tools BRAF‐centric CEN network in Skin. Edges with confidence level of 2 (P‐value < 0.01, correlation score 0.6 are depicted).
CENs for genes that have restricted expression and essentiality in different tissue types. The majority of the genes in this network are TFs important for lineage specification of a cell line to a particular tissue type. The width of the line in (A) and (B) denotes the confidence of association.
The CEN of transcription factors in skin tissue. This CEN was generated from the BROAD data set.
Schematic of the Cignal® lentiviral reporter construct for assessing the activity of the serum response factor (SRF) transcription factor. The construct expresses GFP under a control of a basal promoter element (TATA box) together with multiple tandem repeats of serum response element (SRE). This construct was used to generate a reporter Cas9 expressing A375 cell line for SRF activity.
Representative histograms depicting the GFP expression from the parental and the reporter line under different perturbations. For gRNA transductions, polyclonal lines were used to assess the GFP expression 6 days post‐transduction. GFP expressions of trametinib‐treated cells were measured 2 days post‐treatment.
Bar graph depicting the GFP fold change distribution compared with that from parental line GFP distribution from three independent gRNA transductions targeting the indicated genes. Polyclonal knockout lines were used for quantification. The height of the bar graph represents the mean of fold change obtained from three replicates and the error bars depict the standard deviation. P‐values were obtained from unpaired t‐test; **P < 0.01; ns: not significant. Representative raw‐FACS plots from one of the three replicates are also shown in Appendix Fig S5B.
Figure 3CENs can identify cell essential processes for ‐mutant melanoma cell lines
The essentialities of RAF1 (upper panel) and SHOC2 (lower panel) are higher in NRAS‐mutant melanoma cell lines (six samples) compared with NRAS‐WT melanoma cell lines (27 samples). The centre of each box plot represents the sample median; ends of the box are the upper and lower quartiles; the whiskers extend to the smallest and largest observations within 1.5 times the interquartile range of the quartiles. Individual data points are overlayed onto the box plots. The dotted line shows the median essentiality across all cell lines displayed (regardless of group). P‐value is obtained from a Wilcoxon test; ***P ≤ 0.001, ****P ≤ 0.0001.
Protein‐protein interaction network of essential components of NRAS‐mutant melanoma cell lines. All nodes in the network are genes whose essentiality is significantly higher in NRAS‐mutant melanoma compared with NRAS‐WT melanoma cell lines. The colour intensity represents the median‐change in essential scores of NRAS in melanoma cell lines compared with that of NRAS‐WT cell lines.
Enriched pathways in the network in (B).
Box plots depicting the higher essentiality of IGF1R in NRAS‐mutant melanoma cell lines (6 samples) compared with non‐NRAS-mutant melanoma (27 samples, upper panel) and lower essentiality of IGF1R in BRAF mutant melanoma cell lines (20 samples) compared with BRAF WT melanoma lines (13 samples, lower panel). The centre of each box plot represents the sample median; ends of the box are the upper and lower quartiles; the whiskers extend to the smallest and largest observations within 1.5 times the interquartile range of the quartiles. Individual data points are overlayed onto the box plots. The dotted line shows the median essentiality across all cell lines displayed (regardless of group). P‐value is obtained from a Wilcoxon test; **P < 0.01, ****P ≤ 0.0001.
| Resource | Resource Information | Used for | Reference |
|---|---|---|---|
|
| |||
| Project ScoreDependency Map (DepMap) | Genome‐wide CRISPR screens (Essentiality scores‐ Gene depletion fold change (logFC) matrix)For BROAD, DepMap Achilles 19Q2 was used*For INTEGRATED, DepMap Achilles 19Q3 was used | Identification of Core essential geneEssentiality scores for statistical tests | Meyers |
| Cancer Cell Line Encyclopedia (CCLE) | Annotations of cell lines and Mutation, CNV, expression information for BROAD and INTEGRATED projectsFor BROAD, DepMap Achilles 19Q2 was used*For INTEGRATED, DepMap Achilles 19Q3 was used | Identification of contexts | Meyers |
| The Genotype‐Tissue Expression (GTEx) | Median gene‐level TPM by tissue (Analysis v8—RNASeq v.1.1.9, 2017‐06‐05) | Basal Level Expressions of the genes in cell lines | Aguet |
| Genomics of Drug Sensitivity in Cancer (CancerRx) | Database for drug response and therapeutic biomarkers of cancer cell lines:GDSC1: 15Oct19 version and GDSC2: 15Oct19 version | Identification of contexts | Yang |
| Cell line passports | Annotations of cell lines from SANGER project (v.20020610) Mutation information of cell lines, MSS/MSI status annotation, gene expression and CNV data sets (v.20191101) | Identification of contextsCell line ID mapping | van der Meer |
| BAGEL (Bayesian Analysis of Gene Essentiality) | Gold‐standard essential and non‐essential gene sets from the BAGELR implementation | TrainingBenchmarking | Hart and Moffat ( |
| ADaM (Adaptive daisy model) | An algorithm for identification of core fitness and context‐specific essential genes in large‐scale CRISPR‐Cas9 screens | Comparison | Behan |
| HPSCs (Human Pluripotent Stem Cells) | Stem cell core gene set | ComparisonCore Annotation | Hart and Moffat ( |
| TRRUST | Human TF database | Network Annotation | Han |
| Surfaceome | Human Surface protein database | Network Annotation | Bausch‐Fluck |
| SLCs (solute carrier proteins) | A group of membrane transport proteins | Network Annotation | César‐Razquin |
| Kinases | Enzymes responsible for phosphorylation (important for signalling) | Network Annotation | Invergo |
| BioMart | A data mining tool for Ensembl genes, transcripts, proteins and also external information | Conversion of Ensembl IDs to official gene names | Kinsella |
| Cancer Genome Interpreter | Annotation of the genes having validated oncogenic mutations | Gene Annotation | Tamborero |
|
| |||
| dplyr (version 0.8.5) | R package for data processing | CEN‐tools web server implementation | Wickham |
| ggplot2 (version 3.3.0) | R package for plotting | CEN‐tools web server implementation | Wickham ( |
| ggpubr (version 0.2.5) | R package for plotting | CEN‐tools web server implementation | Kassambara ( |
| gridExtra (version 2.3) | R package for plotting | CEN‐tools web server implementation | Auguie ( |
| httr (version 1.4.1) | R package for API access | CEN‐tools web server implementation | Wickham ( |
| jsonlite (version 1.6.1) | R package for handling JSON data | CEN‐tools web server implementation | preprint: Ooms ( |
| magrittr (version 1.5) | R package for data processing | CEN‐tools web server implementation | Bache and Wickham ( |
| plotly (version 4.9.2.1) | R package for interactive plotting | CEN‐tools web server implementation | Sievert ( |
| plyr (version 1.8.6) | R package for data processing | CEN‐tools web server implementation | Wickham ( |
| R (version 3.6.2) | R language | CEN‐tools web server implementation | R Core Team ( |
| rprojroot (version 1.3‐2) | R package for data loading | CEN‐tools web server implementation | Müller ( |
| shiny (version 1.4.0.2) | R package to develop shiny apps | CEN‐tools web server implementation | Chang |
| shinyalert (version 1.0) | R package to develop shiny apps | CEN‐tools web server implementation | Attali and Edwards ( |
| shinycssloaders (version 0.3) | R package to develop shiny apps | CEN‐tools web server implementation | Sali and Attali ( |
| shinydashboard (version 0.7.1) | R package to develop shiny apps | CEN‐tools web server implementation | Chang and Ribeiro ( |
| shinyhelper (version 0.3.2) | R package to develop shiny apps | CEN‐tools web server implementation | Mason‐Thom ( |
| shinythemes (version 1.1.2) | R package to develop shiny apps | CEN‐tools web server implementation | Chang ( |
| shinyWidgets (version 0.5.1) | R package to develop shiny apps | CEN‐tools web server implementation | Perrier |
| stringr (version 1.4.0) | R package for text operations | CEN‐tools web server implementation | Wickham ( |
| V8 (version 3.2.1) | R package for handling javascript | CEN‐tools web server implementation | Ooms ( |
| visNetwork (version 2.0.9) | R package for visualising networks | CEN‐tools web server implementation | Almende |
| tidyr (version 0.8.3) | R package for data processing | Construction of t‐sne plot for interactive cell line selector | Wickham and Henry ( |
| Rtsne (version 0.15) | R package for t‐sne calculation | Construction of t‐sne plot for interactive cell line selector | Krijthe ( |
| Python (version 3.6.9) | Python language | CEN‐tools data curation and analysis | van Rossum and de Boer ( |
| NumPy | Python package for scientific computing | CEN‐tools data curation and analysis | Oliphant ( |
| pandas | Python package for data analysis and manipulation | CEN‐tools data curation and analysis | The Pandas Development Team ( |
| SciPy | Python package for mathematics, science, and engineering | CEN‐tools data curation and analysis | Virtanen |
| scikit‐learn | Python package for predictive data analysis | CEN‐tools data curation and analysis | Pedregosa |
| argparse | Python package for writing command‐line interfaces | Proving user input for pyCEN | Pedregosa |
| pickle | Python package for serialising and de‐serialising of python objects | CEN‐tools data curation and analysis | van Rossum and Team ( |
| matplotlib | Python package for visualisations | Data visualisation | Hunter ( |
| seaborn | Python package for visualisations | Data visualisation | Hunter ( |
| os | Python package for enabling operating system dependent functionality | CEN‐tools data curation and analysis | van Rossum and Team ( |