| Literature DB >> 18237385 |
Mohua Podder1, Jian Ruan, Ben W Tripp, Zane E Chu, Scott J Tebbutt.
Abstract
BACKGROUND: Arrayed primer extension (APEX) is a microarray-based rapid minisequencing methodology that may have utility in 'personalized medicine' applications that involve genetic diagnostics of single nucleotide polymorphisms (SNPs). However, to date there have been few reports that objectively evaluate the assay completion rate, call rate and accuracy of APEX. We have further developed robust assay design, chemistry and analysis methodologies, and have sought to determine how effective APEX is in comparison to leading 'gold-standard' genotyping platforms. Our methods have been tested against industry-leading technologies in two blinded experiments based on Coriell DNA samples and SNP genotype data from the International HapMap Project.Entities:
Year: 2008 PMID: 18237385 PMCID: PMC2266772 DOI: 10.1186/1755-8794-1-5
Source DB: PubMed Journal: BMC Med Genomics ISSN: 1755-8794 Impact factor: 3.063
Figure 1Multiplexing PCR and subsequent amplicon fragmentation results, prior to APEX reaction on HapMap Chip. (a) Standard multiplex PCR from a single Coriell DNA sample using optimally-designed primers [Additional files 1 &2] within seven unique multiplex groups (lanes 1–7; lane M shows 100 bp DNA ladder markers), showing wide range of amplicon sizes across the 50 SNP loci. (b) Purification, concentration and fragmentation of standard PCR amplicons. Lane 1 represents an aliquot of concentrated mixture of all seven multiplex products shown in Fig. 1a. Lane 2 shows the fragmentation result, generating single-stranded nucleic acid of 30–100 base length. (c) Multiplex PCR amplification of all 50 SNP loci in a single reaction tube using new PCR primer set [Additional file 6], showing 50-plex PCR products (individual SNP loci amplicons are unresolvable by agarose gel electrophoresis) from two Coriell DNA samples (lanes 1 & 2), plus a negative PCR control (lane 3). (d) Fragmentation of 50-plex PCR amplicons from aliquots of lane 1 & lane 2 samples shown in Fig. 1c.
Results summary for 287 HapMap samples and 41 SNPs
| MACGT (0.001 cut-off) + manual calls | 98.90% (9% manual calls) | 99.94% |
| LDA (0.75 threshold) Total cases | 94.91% (10,523 cases vs. 11,087) | 99.91% (10,514 vs. 10,523) |
| LDA (0.75 threshold) Homozygous cases | 95.41% (6,883 vs. 7,214) | 99.96% (6,880 vs. 6,883) |
| LDA (0.75 threshold) Heterozygous cases | 93.98% (3,640 vs. 3,873) | 99.84% (3,634 vs. 3,640) |
Figure 2HapMap Chip four colour microarray images showing successful de-multiplexing of 50-plex PCR from two Coriell DNA samples (a, b), plus a negative control sample (c), prior to image analysis and automated genotyping. The spots on the negative control image represent positive control probes [8, 14].
Results summary for 49 HapMap samples and 50 SNPs
| Manual calling only | 100% | 99.92%1 |
| MACGT (no cut-off) | 100% | 99.84%2 |
| LDA (0 threshold) Total cases | 100% (1,941 cases vs. 1,941) | 99.89%3 (1,939 vs. 1,941) |
| LDA (0 threshold) Homozygous cases | 100% (1,289 vs. 1,289) | 100%4 (1,289 vs. 1,289) |
| LDA (0 threshold) Heterozygous cases | 100% (652 vs. 652) | 99.7%5 (650 vs. 652) |
| MACGT (0.001 cut-off) | 94.04% | 99.94%6 |
| LDA (0.65 threshold) Total cases | 99.18% (1,925 vs. 1,941) | 99.90%3 (1,923 vs. 1,925) |
| LDA (0.65 threshold) Homozygous cases | 98.91%7 (1,275 vs. 1,289) | 100% (1,275 vs. 1,275) |
| LDA (0.65 threshold) Heterozygous cases | 99.7% (650 vs. 652) | 99.7% (648 vs. 650) |
1 Two discrepancies amongst 2,450 genotype cases.
2 Three discrepancies amongst 1,926 genotype cases (524 cases used in training set).
3 Two discrepancies amongst 1,941 genotype cases (509 cases used in training set).
4 No discrepancy amongst 1289 cases (327 cases used in training set)
5 Two discrepancies amongst 652 cases (182 cases used in training set)
6 One discrepancy amongst 1,926 genotype cases (524 cases used in training set).
7 Eleven predictions (all TT and correct) with confidence score less than 0.65 for a single SNP (rs1891403).
Figure 3Simple scatter plots for SNP rs12466929 (A/G) from 50-plex data set (this SNP is representative of the entire set of 50 HapMap SNPs). For each plot the x-axis represents signal values for X allele (A for this SNP) and the y-axis represents signal values for Y allele (G for this SNP). All values are in log scale. Magenta, green, blue and black coloured symbols denote the classes YY (GG), YX (AG), XX (AA) and NN (negative control samples), respectively. Plot (1) combines the two ASO-APEX Left probes (one for each allele); plot (2) combines the two ASO-APEX Right probes (one for each allele); plot (3) is for the APEX Left probe; plot (4) is for the APEX Right probe. All the classifiers except APEX Left (plot 3) give well separated genotype clusters for this SNP. Dynamic variable selection is able to automatically weight these LDA classifiers in such a way that the homozygous AA cluster in plot (3) (blue) is able to contribute to the final call for such genotypes, even though AG (green) and GG (magenta) genotype clusters overlap somewhat for this Left APEX probe. Additional file 11 shows four-panel scatter plots for all 50 SNPs from the 50-plex data set.
Bayesian posterior probabilities for the possible classes from each of the four possible classifiers
| XX | XY | YY | NN | |
| ASO.L | Pxx ( | Pxy ( | Pyy ( | PNN ( |
| ASO.R | Pxx ( | Pxy ( | Pyy ( | PNN ( |
| APEX.L | Pxx ( | Pxy ( | Pyy ( | PNN ( |
| APEX.R | Pxx ( | Pxy ( | Pyy ( | PNN ( |