| Literature DB >> 22500001 |
T S Shah1, J Z Liu, J A B Floyd, J A Morris, N Wirth, J C Barrett, C A Anderson.
Abstract
MOTIVATION: Existing microarray genotype-calling algorithms adopt either SNP-by-SNP (SNP-wise) or sample-by-sample (sample-wise) approaches to calling. We have developed a novel genotype-calling algorithm for the Illumina platform, optiCall, that uses both SNP-wise and sample-wise calling to more accurately ascertain genotypes at rare, low-frequency and common variants.Entities:
Mesh:
Year: 2012 PMID: 22500001 PMCID: PMC3371828 DOI: 10.1093/bioinformatics/bts180
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Calling a SNP with optiCall. In (a) intensity data is taken from all samples at the SNP. Then, using a data-derived (within and across sample) prior, and adjusting class membership probabilities based on the prior in an EM procedure (b and c), a mixture model of Student's t-distributions is fitted to the data (d)
Summary statistics of calling and QC results on 192 402 Immunochip autosomal SNPs
| Caller | Mean call rate (%) | Number with call rate <98% | Number with HWE | Number of QC fails | Number of unique QC passes | Number of unique QC fails |
|---|---|---|---|---|---|---|
| Illuminus | 99.44 | 6311 | 8096 | 10 263 | 2305 | 2852 |
| GenoSNP | 97.63 | 19 432 | 15 239 | 22 572 | 310 | 8505 |
| GenCall | 96.01 | 13 861 | 9413 | 15 665 | 156 | 1454 |
| optiCall | 97.06 | 7440 | 7210 | 10 006 | 796 | 168 |
Call rate is defined as the proportion of genotype calls for a SNP assigned a genotype other than unknown. The QC threshold is set at a call rate of <98% or <10−5 HWE P-value. A unique QC pass/fail is a SNP that passed/failed QC uniquely to the given caller.
Comparison of QC passes and failures across 1200 manually called SNPs
| Caller | TP | FP | TF | False-fail | Sensitivity/specificity |
|---|---|---|---|---|---|
| Illuminus | 574 | 260 | 134 | 232 | 0.71/0.34 |
| GenoSNP | 196 | 13 | 381 | 610 | 0.24/0.97 |
| GenCall | 519 | 33 | 361 | 287 | 0.64/0.92 |
| optiCall | 650 | 92 | 302 | 156 | 0.81/0.77 |
| Manual | 806 | 0 | 394 | 0 | 1.00/1.00 |
TP, manual pass and algorithm pass. FP, manual fail and algorithm pass. TF, manual fail and algorithm fail. False-fail = manual pass and algorithm fail.
Chromosome 21, comparison to manual calls
| Caller | QC | Monomorphic SNPs (of which rare misses) | Mean | ||||
|---|---|---|---|---|---|---|---|
| TP | FP | TF | FF | Sensitivity/specificity | |||
| Illuminus | 1761 | 25 | 33 | 49 | 0.97/0.57 | 164 (21) | 0.993 |
| GenoSNP | 1668 | 2 | 56 | 142 | 0.92/0.97 | 85 (0) | 0.996 |
| GenCall | 1737 | 7 | 51 | 73 | 0.96/0.88 | 173 (1) | 0.997 |
| Optical | 1785 | 14 | 44 | 25 | 0.99/0.76 | 172 (0) | 0.997 |
| Manual | 1810 | 0 | 58 | 0 | 1.00/1.00 | 188 (0) | 1.000 |
Monomorphic SNPs = the number of SNPs a genotype-calling algorithm calls monomorphic from its TPs, with the subset of missed rare variants (when compared to manual calls) shown in brackets. r2 is as in Section 3.1 and is calculated over the true QC pass SNPs which were polymorphic according to both the caller and the manual calls.