| Literature DB >> 35640971 |
Blas Chaves-Urbano1, Barbara Hernando1, Maria J Garcia1, Geoff Macintyre1.
Abstract
SUMMARY: Selecting the optimal cancer cell line for an experiment can be challenging given the diversity of lines available. Here, we present CNpare, which identifies similar cell line models based on genome-wide DNA copy number. AVAILABILITY: CNpare is available as an R package at https://github.com/macintyrelab/CNpare. All analysis performed in the manuscript can be reproduced via the code found at https://github.com/macintyrelab/CNpare_analyses. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.Entities:
Year: 2022 PMID: 35640971 PMCID: PMC9272807 DOI: 10.1093/bioinformatics/btac371
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.931
Fig. 1.Overview of CNpare and an example of a cell line match. (A) Schematic providing a high-level overview of CNpare’s workflow and computation. The user inputs an absolute copy number profile (left) and CNpare compares this to a precomputed database of cell line copy number from the CCLE and GDSC projects, using a series of different comparison metrics (center). Output is in the form of a list of cell lines ranked based on the strength of match to the input profile. Included in the output is a graphical representation of differences between the genomes and an estimate on the percentage genome difference. (B) Example of a CNpare workflow using OVKATE cell line as the input cell line. The top plot shows the copy number profile of the OVKATE ovarian cancer cell line used as input to CNpare. On the left side, the copy number profile of the top hit found using Pearson’s r and Manhattan distance is displayed in blue, along with the OVKATE line in red. Underneath the percentage genome difference between the profiles is listed. On the right-hand side, the results of matching the OVKATE cell line based on copy number signatures are displayed, showing the results of clustering all ovarian cancer cell lines based on seven copy number signatures (Macintyre ). For visualization purposes, the two signatures with the highest variation across cluster means are shown. Each large dot represents the cluster centroid and each small dot represents a cell line. The cluster containing the OVKATE cell line is indicated (A color version of this figure appears in the online version of this article.)