| Literature DB >> 15061867 |
Ching Yu Austin Huang1, Joel Studebaker, Anton Yuryev, Jianping Huang, Kathryn E Scott, Jennifer Kuebler, Shobha Varde, Steven Alfisi, Craig A Gelfand, Mark Pohl, Michael T Boyce-Jacino.
Abstract
BACKGROUND: SNP genotyping typically incorporates a review step to ensure that the genotype calls for a particular SNP are correct. For high-throughput genotyping, such as that provided by the GenomeLab SNPstream instrument from Beckman Coulter, Inc., the manual review used for low-volume genotyping becomes a major bottleneck. The work reported here describes the application of a neural network to automate the review of results.Entities:
Mesh:
Substances:
Year: 2004 PMID: 15061867 PMCID: PMC406493 DOI: 10.1186/1471-2105-5-36
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1The overview of software flow for image analysis on the UHT instrument. Top – Depiction of 2-color fluorescent readouts analyzed by the UHT Image™ software. Intensities from the two fluorescent channels presented in pseudo-colors are compared to determine genotypes. Three hundred eighty-four replicates of 4 × 4 tag arrays are produced on a single glass plate. Each 4 × 4 tag array has 4 control locations and 12 probe locations for 12 SNPs. The top left location is a positive control for both colors. The top right and bottom left locations are positive controls for the two different alleles, and the bottom right location is a negative control and has a probe that lacks a complementary tag sequence in the reaction. The controls are also used to mark the array boundaries for the image analysis software. Center – The UHT® GetGenos software assigns genotype calls to individual SNP signal from every DNA sample. The results can be displayed as a P-plot (Figure 1) by QCreview™ software for manual review (arrow to the right) or used to measure clustering parameters for auto-validation by the neural network (arrow down). Bottom – Schematic representation of SNP signal call clusters measured on the P-plot. The neural network uses 64 parameters described in Additional file: 1 to auto-classify P-plot as "Pass" or "Fail".
Figure 2P-plot example displayed by the QCreview interface with a grade "pass" suggested by GetGenos. The QCreview display includes: a) the genotype call values made by UHT® GetGenos from single SNP but multiple samples; b) the signal values of positive and negative controls; c) basic statistical information about genotype clusters, such as cluster size, d) the chi-square of the Hardy-Weinberg disequilibrium test [7]; e) the plot review status and the suggested GetGenos grade for entire plot. With the QCreview interface, an authorized user can pass or fail individual points and the plot as a whole and record it into an Oracle database.
Figure 3P-plot example displayed by the QCreview interface with a grade "look" suggested by GetGenos. This grade indicates that GetGenos was uncertain about the quality of the plot. The interface components are described in the legend for Figure 2.
Figure 4P-plot example displayed by the QCreview interface with a grade "fail" suggested by GetGenos. The interface components are described in the legend for Figure 2.
Distributions of the suggested grades of "pass," "look," and "fail" assigned by GetGenos, compared to the P-plot validation made by the trained reviewers.
| GetGenos™ suggested grade | Reviewer validation | Number | % from total with this GetGenos grade |
| Pass | Pass | 14924 | 99.4 |
| Pass | Fail | 88 | 0.6 |
| Look | Pass | 2557 | 99.3 |
| Look | Fail | 18 | 0.7 |
| fail | Pass | 1335 | 14.4 |
| fail | Fail | 7932 | 85.6 |
Distributions of Pass/Fail grades assigned by the neural net and by reviewers
| Net grade | Reviewer grade | Number | % of total for this neural net grade |
| Pass | Pass | 16789 | 99.98 |
| Pass | Fail | 3 | .02 |
| Fail | Pass | 2027 | 20.2 |
| Fail | Fail | 8035 | 79.8 |