| Literature DB >> 31208319 |
Daniel G Piqué1, Grasiella A Andriani2, Elaine Maggi2, Samuel E Zimmerman1, John M Greally2, Cristina Montagna3,4, Jessica C Mar5,6,7.
Abstract
BACKGROUND: Numerical chromosomal variation is a hallmark of populations of malignant cells. Identifying the factors that promote numerical chromosomal variation is important for understanding mechanisms of carcinogenesis. However, the ability to quantify and visualize differences in chromosome number between experimentally-defined groups (e.g. control vs treated) obtained from single-cell experiments is currently limited by the lack of user-friendly software.Entities:
Keywords: Aneuploidy; Automated hypothesis testing; Fluorescence in situ hybridization; Multi-experiment comparison; Numerical chromosomal variation; Single-cell genomics; Visualization
Mesh:
Year: 2019 PMID: 31208319 PMCID: PMC6580570 DOI: 10.1186/s12859-019-2842-1
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1Overview of the aneuvis workflow for analyzing numerical chromosomal variation. a An experiment begins with the quantification of number of chromosomes per cell using either FISH, SKY, or sc-WGS. b Next, the number of chromosomes per cell within each treatment group is stored as a cell x chromosome matrix, where the entries indicate the number of inferred copies of a chromosome in a cell. c Aneuvis incorporates information from the experimental design as well as from chromosomal copy number matrices to determine whether differences exist between treatment groups. A table of descriptive statistics summarized by group and by chromosome is automatically generated and available for download. Visual representations of the relationship in aneuploidy between different groups are also automatically generated. A permutation-based approach allows the user to conclude whether there is a statistically significant difference in the ploidy characteristics between treatment groups
Fig. 2Screenshot of the aneuvis data upload interface. The top half of the screen is where the user uploads the data into aneuvis. The bottom half of the interface contains a guide that describes how each file should be formatted for input into aneuvis (here, a guide for formatting sc-WGS data processed by Ginkgo)
Scores and indices used to quantify aneuploidy
| Method | Equation | Support | %-based? | Reflects magnitude of copy number changes? | Reflects variability in chromosomal states? | Meaning and Considerations |
|---|---|---|---|---|---|---|
| Instability index ( |
| No | No | Yes | Calculates the sum of the percentage of abnormal chromosome counts across each chromosome. | |
| Consider: | ||||||
| ANCA ( |
| No | No | No | ANCA = Average Number of Copy Number Alterations per cell. | |
| Consider: | ||||||
| Normalized ANCA ( |
| No | No | No | ||
| Aneuploidy score ( |
| No | Yes | No | ||
| Consider: Even a few cells with very large numbers of chromosomes will cause | ||||||
| Heterogeneity score ( |
| No | No | Yes | Increases with greater ‘spread’ in the number of observed chromosomal states | |
| Consider: | ||||||
| Ploidy proportion | Yes | No | Yes |
For the instability index [7], = the number of unique chromosomes examined. , where is the number of cells containing the modal number of the chromosome. is the number of cells examined. For the ANCA score [8, 9], is the number of non-diploid chromosomes observed in the cell. For the aneuploidy score [5], is the copy number of the cell at the chromosome. is the euploid copy number at the chromosome. For the heterogeneity score [5], is the total number of copy number states. is the number of cells with copy number state at bin . is ordered such that … . For the ploidy proportion, is the proportion of cells that are aneuploid, is the proportion of cells that are diploid, and is the proportion of cells that are polyploid
Fig. 3Screenshot of table summary available in aneuvis. Each row displays summary statistics for quantifying numerical chromosome variation per chromosome, treatment, and experimental type. Tables are interactive and can be filtered, sorted, or downloaded
Fig. 4Screenshot of scatterplot and ternary plot using data generated from the table summary. a The relationship between the aneuploidy score (x-axis) and the heterogeneity score (y-axis) is shown for data derived from FISH (circles) and sc-WGS (triangles). Senescent fibroblasts are colored red and young cells are colored turquoise. b A ternary plot showing the percentage of cells that are diploid, polyploid, and aneuploid per experimental approach and per treatment category
Fig. 5Screenshot of the aneuvis bivariate percentage heatmaps from 2-chromosome FISH data. The percentage of (a) young (N = 406) or (b) senescent (N = 396) fibroblasts within each chromosomal state is shown. The depth of the red color indicates that more cells exist within the indicated state. The axes for the diploid state (2 copies of each chromosome) are highlighted in bold
Active applications for analyzing and visualizing copy number variation data with graphical user interfaces (GUI)
| Application | Reference &/or URL | Application Type | Data type | QDV |
|---|---|---|---|---|
| CNVinspector | [ | Web application | Array CGH | No |
| Ginkgo | [ | Web application | sc- WGS | No |
| GenomeCAT | [ | Java application | Array CGH, WGS | No |
| SNPitty | [ | Docker container | WGS | No |
| Aneuvis | [ | Web application | sc-WGSa, SKY, FISH | Yes |
Each row contains data for a separate application, and columns specify features of each application. The URL for the application or source code, application type, and data type used are listed for each application. The capacity of the application to quantify the degree of numerical chromosomal variation (QDV) within and between treatment groups is also listed
Requires preprocessing from aligned .bam or .bed files into a copy number state matrix by an application such as Ginkgo
Fig. 6Screenshots of Ginkgo visualization and output from sc-WGS input from 83 single cells. a Dendrogram representation based on unsupervised hierarchical clustering using Ward linkage and the Euclidian distance of copy number states between individual cells. The lower right side of panel A contains links to download the summarized copy number data. b An example of a copy number heatmap constructed by aneuvis. Each row corresponds to an individual cell, and each column corresponds to a genomic bin. c Summary section showing various quality control summaries alongside graphical outputs of the copy number state across the genome of each cell