| Literature DB >> 28915574 |
Anna Babayan1, Malik Alawi2,3, Michael Gormley4, Volkmar Müller5, Harriet Wikman1, Ryan P McMullin6, Denis A Smirnov4, Weimin Li4, Maria Geffken7, Klaus Pantel1, Simon A Joosse1.
Abstract
BACKGROUND: Whole genome amplification (WGA) is required for single cell genotyping. Effectiveness of currently available WGA technologies in combination with next generation sequencing (NGS) and material preservation is still elusive.Entities:
Keywords: CellSave; NGS; SNP; WGA; allelic dropout
Year: 2016 PMID: 28915574 PMCID: PMC5593545 DOI: 10.18632/oncotarget.10701
Source DB: PubMed Journal: Oncotarget ISSN: 1949-2553
Mean DNA yield (Table 1A) and PCR quality control success rate (Table 1B) for single SK-BR-3 cells and CTCs extracted from EDTA and CellSave preservation tubes, and FFPE material, after amplification with Ampli1, PicoPlex, and REPLI-g WGA kits
| Table 1A: DNA output, μg | |||||
|---|---|---|---|---|---|
| WGA kit | WGA output, mean ± st. dev., μg | ||||
| SK-BR-3 EDTA | SK-BR-3 CellSave | SK-BR-3 FFPE | CTC EDTA | Average | |
| 7.067 ± 1.082 | 5.857 ± 2.226 | 6.738 ± 1.608 | 4.688 ± 3.187 | 6.088 ± 2.285 | |
| 2.864 ± 1.137 | 3.392 ± 2.320 | 4.710 ± 0.406 | 4.013 ± 1.236 | 3.745 ± 1.555 | |
| 15.394 ± 1.353 | 11.374 ± 1.252 | 77.966 ± 30.820 | 31.410 ± 12.841 | 34.036 ± 31.232 | |
CTC – circulating tumor cell; st.dev – standard deviation.
Figure 1Distribution of identified known SNPs between datasets
(A) Known SNPs identified in single cells, amplified with Ampli1, PicoPlex, and REPLI-g WGA kits and obtained from EDTA-preserved blood in comparison to unamplified DNA. (B) Known SNPs identified in single cells, amplified with Ampli1 or PicoPlex and obtained from EDTA- and CellSave-preserved blood in comparison to unamplified DNA from unfixed cells. (C) Known SNPs identified in single CTCs, amplified with PicoPlex in comparison to each other.
The counts and statistics of SNP and indel calls in SK-BR-3 individual cells, obtained from EDTA-collected blood, amplified with Ampli1, PicoPlex, and REPLI-g WGA kits and sequenced with Illumina's HiSeq2000 and ThermoFisher's IonProton NGS platforms
| Groups | WGA kit | Ampli1 | PicoPlex | REPLI-g | Reference exome | |||
|---|---|---|---|---|---|---|---|---|
| NGS platform | HiSeq2000 | IonProton | HiSeq2000 | IonProton | HiSeq2000 | IonProton | HiSeq2000 | |
| Total SNPs | 9944 | 1986 | 9948 | 1695 | 403 | 64 | 17659 | |
| Known SNP | 9679 | 1936 | 5237 | 1157 | 200 | 31 | 17251 | |
| Known SNP, % | 97.3 | 97.5 | 52.6 | 68.3 | 49.6 | 48.4 | 97.7 | |
| SNP novel | 265 | 50 | 4711 | 538 | 203 | 33 | 408 | |
| ADO, % | 9.0 | 19.8 | 24.0 | 42.4 | 100.0 | 100.0 | na | |
| Common SNPs with known SNPs in reference | 7125 | 1525 | 4680 | 1073 | 173 | 30 | 17251 | |
| Sensitivity, % | 41.3 | 8.8 | 27.1 | 6.2 | 1.0 | 0.2 | 100.0 | |
| PPV, % | 73.6 | 78.8 | 89.4 | 92.7 | 86.5 | 96.8 | na | |
| Total indels | 1148 | 2688 | 2469 | 1688 | 140 | 52 | 502 | |
| Known indels | 176 | 23 | 82 | 14 | 3 | 1 | 310 | |
| Known indels, % | 15.3 | 0.9 | 3.3 | 0.8 | 2.1 | 1.9 | 61.8 | |
| Common indels with known indels in reference | 116 | 16 | 71 | 11 | 2 | 1 | 310 | |
| Sensitivity, % | 37.4 | 5.2 | 22.9 | 3.6 | 0.7 | 0.3 | 100.0 | |
| PPV, % | 65.9 | 69.6 | 86.6 | 78.6 | 66.7 | 100.0 | na | |
| Spearman correlation coefficient (r) | 0.66 | 0.63 | 0.81 | 0.80 | 0.25 | 0.25 | na | |
| <0.001 | <0.001 | <0.001 | <0.001 | <0.001 | <0.001 | na | ||
Total SNPs/indels – number of all identified SNPs/indels. Known SNP – fraction of SNPs/indels, present in SNP database. Novel SNPs/indels – number of SNPs/indels, not present in SNP database. ADO – allelic dropout. PPV – positive predictive value.
Figure 2Plots of CNA profiles along the whole genome (x axis)
(A) CNA profile of unamplified DNA from unfixed cells. (B–G) plots of CNAs in single SK-BR-3 cells, obtained from EDTA-preserved blood. (H, I) CNA profiles of individual CTCs, obtained from EDTA-preserved blood of the same breast cancer patient. WGA kits: (B, E) Ampli1; (C, F, H, I) PicoPlex; (D, G) REPLI-g.
The counts and statistics of SNP and indel calls in single SK-BR-3 cells, analyzed in duplicates, obtained from CellSave-preserved blood, in comparison to single SK-BR-3 cells, obtained from EDTA-collected blood
| Groups of experiments | WGA kit | Ampli1 | PicoPlex | SK-BR-3 genomic DNA | ||||
|---|---|---|---|---|---|---|---|---|
| NGS | HiSeq2000 | HiSeq2000 | HiSeq2000 | |||||
| Material preservation | EDTA | CellSave | CellSave | EDTA | CellSave | CellSave | na | |
| Number of cells | 1 | 1 | 1 | 1 | 1 | 1 | ∼8×106 | |
| Total SNPs | 9944 | 7826 | 4088 | 9948 | 9738 | 9821 | 17659 | |
| Known SNP | 9676 | 6189 | 2857 | 5237 | 2522 | 4457 | 17251 | |
| Known SNP, % | 97.3 | 79.1 | 69.9 | 52.6 | 25.9 | 45.4 | 97.7 | |
| SNP novel | 265 | 1637 | 1231 | 4711 | 7216 | 5364 | 408 | |
| ADO rate, % | 9.0 | 36.4 | 78.5 | 54.0 | 74.3 | 66.4 | na | |
| Common SNPs with known SNPs in reference | 7125 | 5680 | 2381 | 4680 | 2088 | 2885 | 17251 | |
| Sensitivity, % | 41.3 | 32.9 | 13.8 | 27.1 | 12.1 | 16.7 | na | |
| PPV, % | 73.6 | 91.8 | 83.3 | 89.4 | 82.8 | 64.7 | na | |
| Total indels | 1148 | 723 | 165 | 2469 | 790 | 914 | 502 | |
| Known indels | 176 | 89 | 36 | 82 | 24 | 63 | 310 | |
| Known indels, % | 15.3 | 12.3 | 21.8 | 3.3 | 3.0 | 6.9 | 61.8 | |
| Common indels with known indels in reference | 116 | 76 | 32 | 71 | 19 | 42 | 310 | |
| Sensitivity, % | 37.4 | 24.5 | 10.3 | 22.9 | 6.1 | 13.6 | na | |
| PPV, % | 65.9 | 85.4 | 88.9 | 86.6 | 79.2 | 66.7 | na | |
| Spearman correlation coefficient (r) | 0.64 | 0.84 | 0.25 | 0.81 | 0.69 | 0.09 | na | |
| <0.001 | <0.001 | <0.001 | <0.001 | <0.001 | <0.001 | na | ||
Total SNPs/indels – number of all identified SNPs/indels. Known SNP – fraction of SNPs/indels, present in SNP database. Novel SNPs/indels – number of SNPs/indels, not present in SNP database. ADO – allelic dropout. PPV – positive predictive value.
Figure 3Characteristics of pooled 1, 3, 5, and 10 SK-BR-3 cells, obtained from CellSave-preserved blood, amplified with Ampli1 and PicoPlex WGA kits, and sequenced with HiSeq2000 NGS platform
(A) Total identified SNPs. (B) Known identified SNPs. (C) Concordance of identified SNPs with reference dataset. (D) Sensitivity of the SNP calling analysis. (E) Allelic dropout. (F) Correlation of CNA profiles with CNA profile of unamplified DNA.
The counts and statistics of SNP and indel calls in CTCs
| Groups | Cell | CTC-1 | CTC-2 |
|---|---|---|---|
| WGA kit | PicoPlex | ||
| NGS | HiSeq2000 | ||
| Blood preservative | EDTA | ||
| Total SNPs | 34994 | 14658 | |
| Known SNP | 4304 | 6030 | |
| Known SNP, % | 12.3 | 41.1 | |
| SNP novel | 30690 | 8628 | |
| Common in both datasets known SNPs | 1135 | ||
| Fraction of common known from known identified in dataset, % | 26.4 | 18.8 | |
| Total indels | 4383 | 4103 | |
| Known indels | 42 | 81 | |
| Known indels, % | 1.0 | 2.0 | |
| Common in both datasets known indels | 15 | ||
| Fraction of common known from known identified in dataset, % | 37.7 | 18.5 | |
| Correlation between CTCs, r | 0.10 | ||
| Present in both CTCs | |||
| Present in only one CTC | wt | ||
| wt | |||
Total SNPs/indels – number of all identified SNPs/indels. Known SNP – fraction of SNPs/indels, present in SNP database. Novel SNPs – number of SNPs, not present in SNP database. ADO – allelic dropout. r – Spearman correlation coefficient. wt – wild type.