| Literature DB >> 20175893 |
Elizabeth A Tindall1, Desiree C Petersen, Stina Nikolaysen, Webb Miller, Stephan C Schuster, Vanessa M Hayes.
Abstract
BACKGROUND: High-throughput custom designed genotyping arrays are a valuable resource for biologically focused research studies and increasingly for validation of variation predicted by next-generation sequencing (NGS) technologies. We investigate the Illumina GoldenGate chemistry using custom designed VeraCode and sentrix array matrix (SAM) assays for each of these applications, respectively. We highlight applications for interpretation of Illumina generated genotype cluster plots to maximise data inclusion and reduce genotyping errors.Entities:
Year: 2010 PMID: 20175893 PMCID: PMC2848685 DOI: 10.1186/1756-0500-3-39
Source DB: PubMed Journal: BMC Res Notes ISSN: 1756-0500
Figure 1Effect of outliers on cluster formation. Exclusion of three outlier samples represented as heterozygous for rs8081356 (A) and rs10096900 (C), corrected genotype clusters for both SNPs (B and D respectively). Correcting the genotype calls reduced the normalised θ (distance between heterozygous and homozygous clusters).
Figure 2Two cluster calling. Rare variant and two-cluster genotyping was successful for the majority of samples assessed on the VeraCode platform as demonstrated in rs17011642 (A). Incorrect rare variant calling was observed for SNP rs12988908, with the heterozygous samples being called homozygous for the mutant allele (B). Non-autosomal variants were correctly genotyped in most cases as demonstrated in SNP rs5919392 (C). Only one X-linked SNP, rs17217069 was incorrectly genotyped with the mutant allele being called heterozygous.
Figure 3Undefined cluster distribution for verified human SNPs (VeraCode). The GenCall software failed to define genotypes for two verified human SNPs, rs10786712 (A) and rs1632947 (B) genotyped on the VeraCode platform. Samples were poorly separated into three groups along the Y- and X-axis of the cluster plot. These three groups represented the three different genotypes as indicated.
Figure 4Undefined cluster distribution in a . Failure to define separate genotype clusters for newly identified variants TD102 (A) and TD108 (B) was due to incorrect sequence input into the GoldeGate assay design software. The failure of one ASO to bind resulted in the different genotype groups being separated by a difference in fluorescence intensity of one allele along the Y-axis of the cluster plot.