| Literature DB >> 30053271 |
Petr Smirnov1,2, Victor Kofia1, Alexander Maru1, Mark Freeman1, Chantal Ho1, Nehme El-Hachem1, George-Alexandru Adam1,3, Wail Ba-Alawi1,2, Zhaleh Safikhani1,2, Benjamin Haibe-Kains1,2,3,4.
Abstract
Recent cancer pharmacogenomic studies profiled large panels of cell lines against hundreds of approved drugs and experimental chemical compounds. The overarching goal of these screens is to measure sensitivity of cell lines to chemical perturbations, correlate these measures to genomic features, and thereby develop novel predictors of drug response. However, leveraging these valuable data is challenging due to the lack of standards for annotating cell lines and chemical compounds, and quantifying drug response. Moreover, it has been recently shown that the complexity and complementarity of the experimental protocols used in the field result in high levels of technical and biological variation in the in vitro pharmacological profiles. There is therefore a need for new tools to facilitate rigorous comparison and integrative analysis of large-scale drug screening datasets. To address this issue, we have developed PharmacoDB (pharmacodb.pmgenomics.ca), a database integrating the largest cancer pharmacogenomic studies published to date. Here, we describe how the curation of cell line and chemical compound identifiers maximizes the overlap between datasets and how users can leverage such data to compare and extract robust drug phenotypes. PharmacoDB provides a unique resource to mine a compendium of curated cancer pharmacogenomic datasets that are otherwise disparate and difficult to integrate.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30053271 PMCID: PMC5753377 DOI: 10.1093/nar/gkx911
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Specifications of all the datasets included in PharmacoDB
| Dataset | # Drugs | # Cells | # Experiments | Viability Assay | Available Molecular Data | Citations |
|---|---|---|---|---|---|---|
| CCLE | 22 | 1061 | 11 670 | CellTiter Glo | mRNA expression, CNV, Mutation | ( |
| GDSC1000 | 250 | 1109 | 225 480 | Syto60 | mRNA expression, CNV, Mutation, Methylation | ( |
| gCSI | 16 | 754 | 6455 | CellTiter Glo | mRNA expression, CNV, Mutation | ( |
| GRAY | 90 | 84 | 9413 | CellTiter Glo | mRNA expression, CNV, RPPA, Methylation, ExomeSeq | ( |
| FIMM | 52 | 50 | 2561 | CellTiter Glo | None | ( |
| CTRPv2 | 544 | 888 | 395 263 | CellTiter Glo | None | ( |
| UHNBreast | 4 | 84 | 52 | SRB | RNAseq, RPPA, Mut | (Safikhani |
|
|
|
|
|
Viability assay: Syto60: Proliferation; fluorescent DNA stain (Invitrogen); CellTiter Glo: Viability, membrane integrity, ATP (Promega); SRB: Sulforhodamine B colorimetric. Available molecular data: Mut: targeted mutation data; ExomeSeq: Whole exome-sequencing data; mRNA: gene expression data; methylation: methylation microarray data; CNV: copy number variation data; RPPA: protein expression data using reverse phase protein lysate microarray.
Figure 1.Main functionalities of PharmacoDB, displaying (A) the interfaces to query the database through searching or exploring available entities, (B) the five primary data types with respective profile pages, (C) the main visualizations of the aggregated data in PharmacoDB and (D) the link to PharmacoGx for extensive computational analysis of pharmacogenomic data.
Figure 2.The total number of identifier matches between pairs of datasets for (A) cell lines and (B) compounds before curation, after automatically removing differences in capitalization and white space and manually reviewing matches within edit distance of 2, and after undergoing subsequent manual curation.
Figure 3.An example profile page from PharmacoDB for a cell line. The page is organized so that the left column contains textual information, and the right column contains plots. Panel (A) is the information card for the cell line (MCF7). Panel (B) contains tables listing the available data profiles for this cell. Panel (C) contains summary plots about the drug screening performed in each dataset and the waterfall plots of the cell response to treatment with compounds.
Figure 4.An example of drug dose response curve plot with (A) the A549 lung cancer cell line treated with the MEK inhibitor AZD6244, and (B) the corresponding table of summary statistics. IC50: dose of 50% inhibition of cell viability; EC50: dose at which 50% of the maximum response is observed; AAC: area above curve; Einf: maximum theoretical inhibition; DSS: drug sensitivity score.