| Literature DB >> 34472587 |
Ziaurrehman Tanoli1, Jehad Aldahdooh1, Farhan Alam1, Yinyin Wang1, Umair Seemab1, Maddalena Fratelli2, Petr Pavlis3, Marian Hajduch3, Florence Bietrix4, Philip Gribbon5, Andrea Zaliani5, Matthew D Hall6, Min Shen6, Kyle Brimacombe6, Evgeny Kulesskiy7, Jani Saarela7, Krister Wennerberg8, Markus Vähä-Koskela7, Jing Tang1.
Abstract
Chemosensitivity assays are commonly used for preclinical drug discovery and clinical trial optimization. However, data from independent assays are often discordant, largely attributed to uncharacterized variation in the experimental materials and protocols. We report here the launching of Minimal Information for Chemosensitivity Assays (MICHA), accessed via https://micha-protocol.org. Distinguished from existing efforts that are often lacking support from data integration tools, MICHA can automatically extract publicly available information to facilitate the assay annotation including: 1) compounds, 2) samples, 3) reagents and 4) data processing methods. For example, MICHA provides an integrative web server and database to obtain compound annotation including chemical structures, targets and disease indications. In addition, the annotation of cell line samples, assay protocols and literature references can be greatly eased by retrieving manually curated catalogues. Once the annotation is complete, MICHA can export a report that conforms to the FAIR principle (Findable, Accessible, Interoperable and Reusable) of drug screening studies. To consolidate the utility of MICHA, we provide FAIRified protocols from five major cancer drug screening studies as well as six recently conducted COVID-19 studies. With the MICHA web server and database, we envisage a wider adoption of a community-driven effort to improve the open access of drug sensitivity assays.Entities:
Keywords: FAIR research data; data integration tools; drug discovery; drug sensitivity assays
Year: 2022 PMID: 34472587 PMCID: PMC8769689 DOI: 10.1093/bib/bbab350
Source DB: PubMed Journal: Brief Bioinform ISSN: 1467-5463 Impact factor: 11.622
Figure 1User interface and workflow of MICHA. Users start with compound annotation by uploading a list of compounds with names and standard InChiKeys. MICHA will return the pharmacological (drug targets, disease indications) and physiochemical properties of the compounds via an integrative web server and database, available under the ‘Compounds’ tab. Users then may click on Samples, Reagents or Data processing tabs to annotate their drug screening protocols. Auto suggestions are provided to avoid spelling mistakes or terminology conflicts. Finally, users can download the summary reports containing input data as well as integrated information provided by MICHA.
FAIRified protocols by MICHA
| Protocol name | Type | Comp-ounds | Cell lines | Detection technology | Dilution fold | Plate type | Min concentration (nM) | Max concentration (nM) | Metric | References |
|---|---|---|---|---|---|---|---|---|---|---|
| GDSC | Cancer | 345 | 987 | Fluorescence | 2 | 384 and 96 | 0.03 | 4000 000 | IC50 | [ |
| CCLE | Cancer | 24 | 504 | Luminescence | 3.16 | 1536 | 2.5 | 8000 | IC50, EC50 | [ |
| CTRPv2 | Cancer | 203 | 242 | Luminescence | 2 | 384 | 0.56 | 592,000 | IC50 | [ |
| FIMM | Cancer | 528 | 4 | Fluorescence | 10 | 384 | 0.0 | 1000 000 | DSS | [ |
| Mario Negri | Cancer | 1 | 16 | Spectrophotometry, luminescence | 10 | 96 | 0.0 | 10 000 | AUC | [ |
| NCATS | COVID-19 | 5430 | 1 | Luminescence | 2–5 | 384 | 0.0 | 120 000 | AC50 | [ |
| Ellinger | COVID-19 | 103 | 1 | Label free | 3.33 | 384 | 20 | 20 000 | IC50, CC50 | [ |
| Gordon | COVID-19 | 73 | 1 | Spectrophotometry | 96 | 1 | 10 000 | IC50 | [ | |
| Jeon | COVID-19 | 43 | 1 | Microscopy | 2 | 384 | 50 | 50 000 | IC50 | [ |
| Touret | COVID-19 | 83 | 1 | qPCR | 2 | 96 | 600 | 40 000 | EC50 | [ |
| Weston | COVID-19 | 43 | 1 | Luminescence | 2 | 96 | 50 | 5000 | IC50 | [ |
Figure 2Average number of targets for compounds in multiple databases. (A) Approved drugs. (B) Investigational compounds.
Comparison between protocols of CCLE, GDSC and CTRPv2
| CCLE | GDSC | CTRPv2 | |
|---|---|---|---|
| Plate types | 1536 | 384, 96 | 384-Opaque white |
| Experimental mediums | NA | DMEM, RPMI | ALPHAMEM, DMEM, DMEMF, EMEM, HAMSF, IMDM, L15, McCoys5A, MCDB, MEM, RPMI, Waymouth, WilliamsE |
| Detection technology | Luminescence | Fluorescence | Luminescence |
| Cell density | 250 | NA | 500, 1000, 2000, 3000, 5000, 10 000 |
| Treatment time (h) | 72–84 | 72 | 72 |
| Analysis metrics | IC50, EC50 | IC50 | IC50 |
| Compounds | 24 | 345 | 203 |
| Number of cell lines | 504 | 987 | 242 |
NA indicates that the information is unavailable.
Figure 3Overlapping data between CCLE, GDSC and CTRPv2 database. (A) Overlapping compounds. (B) Overlapping cell lines.
Figure 4(A) Clinical phase of FAIRified compounds. (B) Tissue types for FAIRified cell lines.