| Literature DB >> 28438122 |
Rajini R Haraksingh1,2, Alexej Abyzov3, Alexander Eckehart Urban4,5,6.
Abstract
BACKGROUND: High-resolution microarray technology is routinely used in basic research and clinical practice to efficiently detect copy number variants (CNVs) across the entire human genome. A new generation of arrays combining high probe densities with optimized designs will comprise essential tools for genome analysis in the coming years. We systematically compared the genome-wide CNV detection power of all 17 available array designs from the Affymetrix, Agilent, and Illumina platforms by hybridizing the well-characterized genome of 1000 Genomes Project subject NA12878 to all arrays, and performing data analysis using both manufacturer-recommended and platform-independent software. We benchmarked the resulting CNV call sets from each array using a gold standard set of CNVs for this genome derived from 1000 Genomes Project whole genome sequencing data.Entities:
Keywords: Array Comparative Genome Hybridization (aCGH); Copy Number Variation (CNV); SNP array
Mesh:
Year: 2017 PMID: 28438122 PMCID: PMC5402652 DOI: 10.1186/s12864-017-3658-x
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Size and nature of gold standard CNVs from sample NA12878. a. Histogram showing size distribution of NA12878 gold standard CNVs. Bin sizes change by a factor of ten across panels. Peaks in the size range of Alu elements, in the 301–400 bp bin, and in the size range of LINE1 elements, in the 5001–6000 bp bin, are indicated by arrows. b. Distribution of total numbers of gold standard deletions and duplications by size. It can reasonably be expected that most CNVs smaller than 1 kb in size are not detectable by the arrays in this study
Features and experimental details of CNV detection platforms used in this study
| Array | Total Features | Probe Spacing | Data Source | Platform-Specific Analysis Software | Number of Autosomal CNVs Called (replicate 1, replicate 2) | |
|---|---|---|---|---|---|---|
| Platform Specific Analysis | Nexus Analysis | |||||
| Affymetrix Arrays | ||||||
| Affymetrix CytoScan HD | 2,600,000 Copy Number + 750,000 SNP | 0.88 kb mean probe spacing in RefSeq genes | Affymetrix | Affymetrix Chromosome Analysis Suite | 201,241 | 82,93 |
| Affymetrix Genome-Wide Human SNP Array 6.0 | 946,000 Copy Number + 906,000 SNP | 0.7 kb mean probe spacing | Affymetrix & Mc Carroll et al. | Birdsuitea | 151a | 105,112 |
| Agilent Arrays | ||||||
| Agilent SurePrint G3 Human CGH Microarray, 1×1M (Design ID 021529) | 963,029 (60mers) | 2.1 kb overall median probe spacing (1.8 kb in Refseq genes) | Service Provider | Agilent Genomics Workbench 7.0 | 124,133 | 65,53 |
| Agilent SurePrint G3 Human High Resolution Microarray, 1×1M (Design ID 023642) | 963,331 (60mers) | 2.6 kb overall median probe spacing, 3 kb average probe spacing | Service Provider | Agilent Genomics Workbench 7.0 | 155,152 | 177,171 |
| Agilent SurePrint G3 Human CGH Microarray, 2×400K (Design ID 021850) | 411,056 (60mers) | 5.3 kb overall median probe spacing (4.6 kb in Refseq genes) | Service Provider | Agilent Genomics Workbench 7.0 | 58,61 | 50,48 |
| Agilent SurePrint G3 Human CNV Microarray, 2×400K (Design ID 021365) | 442,892 (60mers) | 1 kb overall median probe spatial resolution in CNVs | Service Provider | Agilent Genomics Workbench 7.0 | 365,365 | 433,489 |
| Agilent SurePrint G3 Human CGH Microarray, 4×180K (Design ID 022060) | 170,334 (60mers) | 13 kb overall median probe spacing (11 kb in Refseq genes) | Service Provider | Agilent Genomics Workbench 7.0 | 24,25 | 30,32 |
| Illumina Arrays | ||||||
| Illumina HumanOmni5Exome v1 | 4,641,128 | 0.63 kb mean, 0.32 kb median probe spacing | Illumina | Illumina GenomeStudio 2011.1 and cnvPartition 3.2.0 | 426,397 | 66,62 |
| Illumina HumanOmni5-4v1 | 4,301,331 | 0.68 kb mean, 0.36 kb median probe spacing | Illumina | Illumina GenomeStudio 2011.1 and cnvPartition 3.2.0 | 63,61 | 56,61 |
| Illumina HumanOmni25Exome-8v1 | 2,583,651 | 1.13 kb mean, 0.57 kb median probe spacing | Illumina | Illumina GenomeStudio 2011.1 and cnvPartition 3.2.0 | 58,57 | 33,35 |
| Illumina HumanOmni25-8v1-1 | 2,338,671 | 1.25 kb mean, 0.66 kb median probe spacing | Illumina | Illumina GenomeStudio 2011.1 and cnvPartition 3.2.0 | 31,34 | 28,24 |
| Illumina HumanOmni1Quad-v1b | 1,134,514 | 1.2 kb mean probe spacing | Service Provider | Illumina GenomeStudio 2011.1 and cnvPartition 3.2.0 | 245,263 | 139,143 |
| Illumina HumanOmniExpressExome 1.2 | 964,193 | 3.0 kb mean, 1.4 kb median probe spacing | Illumina | Illumina GenomeStudio 2011.1 and cnvPartition 3.2.0 | 31,34 | 17,17 |
| Illumina HumanOmniExpress-24v1-0 | 730,525 | 4.1 kb mean, 2.2 kb median probe spacing | Illumina | Illumina GenomeStudio 2011.1 and cnvPartition 3.2.0 | 15,12 | 11,10 |
| Illumina HumanCoreExome v1.1 | 547,644 | 5.3 kb mean, 1.9 kb median probe spacing | Illumina | Illumina GenomeStudio 2011.1 and cnvPartition 3.2.0 | 5,4 | 7,6 |
| Illumina CytoSNP-850 K | 850,000 (50mers) | 50 kb mean probe spacing, 10–20 kb in 3,262 cytogenetically relevant genes | Illumina | Illumina GenomeStudio 2011.1 and cnvPartition 3.2.0 | 12,17 | 25,21 |
| Illumina PsychArray | 571,054 | 5.5 kb mean probe spacing | Illumina | Illumina GenomeStudio 2011.1 and cnvPartition 3.2.0 | 5,4 | 10,9 |
a The platform specific software CNV call set for the first replicate of the SNP 6.0 array was obtained from published data [19]. The second replicate of the SNP 6.0 array was only analyzed using Nexus software as per the Affymetrix recommendation for analysis of this array
bThe Illumina HumanOmni1Quad array has been recently discontinued but was included in this study as this array was widely used previously and many investigators may be reanalyzing data from this array for copy number information
Fig. 2CNV detection performance of each array using two different algorithms. a. Overlap of autosomal CNV calls from two different algorithms for each array platform with gold standard CNVs. Data is shown for each of two technical replicates per array. CNV call sets derived from the platform specific algorithm are in green, yellow, and red, and those derived from Nexus are in blue, pink, and purple. The number of array CNV calls overlapping a gold standard CNV by 50% reciprocally in size is in green and blue, by less than 50% reciprocally in size is in yellow and pink, and not overlapping a gold standard CNV is in red and purple. Array calls not overlapping a gold standard CNV at all were further analyzed for sequencing-based confirmation using CNVnator generated CNV calls based on the 1000 Genomes Project sequencing data for NA12878. The number of CNV calls not overlapping a gold standard CNV but with CNVnator support is shown as solid red or purple bars. The number of CNV calls not overlapping a gold standard CNV and with no CNVnator support is shown as hashed red or purple bars. b. Average rate of non-validated CNV calls for each array platform and for each algorithm. The rate of non-validated calls is calculated as the percentage of the total number of CNVs called from an array that do not overlap a gold standard CNV and do no have any supporting evidence from CNVnator (hashed red and purple bars in a.). Average rate of non-validated calls is based on two technical replicates
Fig. 3Detection of a 122 kb gold standard deletion on chromosome 19p by 17 arrays. Horizontal axis shows position along chromosome 19. Vertical axes show log R ratio of fluorescence of NA12878 DNA over fluorescence of reference DNA. Grey dots indicate probes that have not been called as part of a CNV. Red dots indicate probes that have been called as part of a CNV. Horizontal lines indicate Nexus cutoffs for low and high copy deletions (red) and duplications (blue). Gray dashed box indicates CNV region. Genes and segmental duplications (SegDups) are also shown. 1. Affymetrix SNP6.0, 2. Affymetrix CytoScanHD, 3. Agilent 1×1M-CGH, 4. Agilent 1×1M-HR, 5. Agilent 2×400K-CGH, 6. Agilent 2×400K-CNV, 7. Agilent 4×180K-CGH, 8. Illumina HumanOmni5Exome, 9. Illumina HumanOmni5, 10. Illumina HumanOmni2.5Exome, 11. Illumina HumanOmni2.5, 12. Illumina HumanOmni1Quad, 13. Illumina HumanOmniExpressExome, 14. Illumina HumanOmniExpress, 15. Illumina CoreExome, 16. Illumina CytoSNP-850, 17. Illumina Psych Array