| Literature DB >> 22920755 |
Jessica G Woo1, Lisa J Martin, Lili Ding, W Mark Brown, Timothy D Howard, Carl D Langefeld, Charles J Moomaw, Mary Haverbusch, Guangyun Sun, Subba R Indugula, Hong Cheng, Ranjan Deka, Daniel Woo.
Abstract
BACKGROUND: DNA from buccal brush samples is being used for high-throughput analyses in a variety of applications, but the impact of sample type on genotyping success and downstream statistical analysis remains unclear. The objective of the current study was to determine laboratory predictors of genotyping failure among buccal DNA samples, and to evaluate the successfully genotyped results with respect to analytic quality control metrics. Sample and genotyping characteristics were compared between buccal and blood samples collected in the population-based Genetic and Environmental Risk Factors for Hemorrhagic Stroke (GERFHS) study (https://gerfhs.phs.wfubmc.edu/public/index.cfm).Entities:
Mesh:
Substances:
Year: 2012 PMID: 22920755 PMCID: PMC3447646 DOI: 10.1186/1471-2156-13-75
Source DB: PubMed Journal: BMC Genet ISSN: 1471-2156 Impact factor: 2.797
Patient and quality control characteristics by sample type
| | | | |
| Age (years ± SD) | 66.7 ± 14.8 | 67.2 ± 14.8 | 64.6 ± 14.9 |
| Sex (% male) | 417 (49.1%) | 348 (49.2%) | 69 (48.6%) |
| Stroke status (% cases) | 433 (50.9%) | 353 (49.9%) | 80 (56.3%) |
| Cigarette smoker (% current) | 163 (19.2%) | 136 (19.2%) | 27 (19.0%) |
| | | | |
| QC Failure (%)a | 90 (10.6%) | 80 (11.3%) | 10 (7.0%) |
| cQC <0.4 | 75 (8.8%) | 69 (9.7%) | 6 (4.2%) |
| DM call rate <83% | 65 (7.6%) | 60 (8.5%) | 5 (3.5%) |
| Gender mismatch | 24 (2.8%) | 19 (2.7%) | 5 (3.5%) |
| cQC rate | 2.08 ± 0.94 | 2.00 ± 0.94 | 2.53 ± 0.82*** |
| Among passed samples | 2.33 ± 0.65 | 2.24 ± 0.65 | 2.69 ± 0.55*** |
| Among failed samples | 0.05 ± 0.43 | 0.01 ± 0.37 | 0.36 ± 0.71 |
| DM call rate | 93.1 ± 7.8% | 92.7 ± 8.3% | 95.1 ± 4.0% *** |
| Among passed samples | 95.3 ± 2.8% | 95.1 ± 2.9% | 96.0 ± 1.9% *** |
| Among failed samples | 74.7 ± 11.7% | 73.5 ± 11.7% | 83.9 ± 6.9% ** |
N (%) or mean ± SD presented.
a Failure subsets do not add to total failed samples, as samples frequently fail for more than one reason.
** p < 0.01 blood vs. buccal samples; ***p < 0.0001 blood vs. buccal samples.
Predictors of quality control failure among buccal samples
| Years since sample collection | 7.9 [5.3, 10.0] | 8.0 [5.6, 10.0] | 5.3 [4.2, 9.0] | <0.0001 | <0.0001 |
| Current cigarette smoker (%) | 136 (19.2%) | 116 (18.5%) | 20 (25%) | 0.16 | 0.18 |
| Current cigar smoker (%) | 21 (3%) | 17 (2.7%) | 4 (5%) | 0.26 | 0.28 |
| Average daily coffee intake (cups) | 1.43 (1.28, 1.59) | 1.39 (1.23, 1.54) | 1.79 (1.16, 2.42) | 0.23 | 0.25 |
| Average daily tea intake (cups) | 0.42 (0.33, 0.51) | 0.42 (0.32, 0.51) | 0.42 (0.19, 0.66) | 0.96 | 0.77 |
| Average daily sodas | 0.75 (0.62, 0.87) | 0.74 (0.60, 0.87) | 0.85 (0.54, 1.16) | 0.50 | 0.06 |
| 260/280 Ratio | 1.81 (1.80-1.81) | 1.81 (1.80-1.81) | 1.82(1.80-1.84) | 0.42 | 0.32 |
| 260/230 Ratio | 1.17 (1.15- 1.20) | 1.18 (1.15-1.21) | 1.14 (1.06-1.21) | 0.38 | 0.39 |
| Total DNA (ng/μl)a | 87.4 (82.3-92.8) | 85.6 (80.6-91.8) | 102.5 (87.4-121.5) | 0.06 | 0.05 |
| ds DNA (ng/μl)a | 55.7 (52.5-59.1) | 56.8 (53.5-60.3) | 49.4 (41.3-59.1) | 0.12 | 0.09 |
| ds/total DNA ratio | 0.66 (0.65-0.68) | 0.68 (0.67-0.69) | 0.52 (0.46-0.57) | <0.0001 | <0.0001 |
Mean (95% confidence limits of mean) or n (%) presented; median [interquartile range] presented for years since sample collection.
a Back-transformed from log values (geometric mean).
Figure 1 Probability of sample failure by ds/total DNA ratio. Predicted probability of sample pre-genotyping failure from logistic regression analysis. Line represents predicted probability and shaded area is 95% confidence limits.
Sensitivity and specificity of ds/total DNA ratios to detect buccal sample failure
| <0.34 | 23 (3.3%) | 20.5% | 98.7% | 65.2% | 90.5% |
| <0.58 | 223 (34.0%) | 70.5% | 69.7% | 22.75% | 94.5% |
Figure 2 Call rates by sample type. Histogram and smoothed spline of sample call rates by sample type.
Figure 3 Heterozygosity rates by sample type and sex. Smoothed spline of heterozygosity rates calculated across all chromosomes, by sample type and sex for A) males and B) females. Heterozygosity rates for females are higher on average than males due to inclusion of X chromosome loci
Figure 4 MAF by sample type compared with CEU HapMap. Panel A: Distributions and scatter plot of minor allele frequency (MAF) deviations from CEU HapMap reference by locus and sample type. Box plot represents overall deviation from the HapMap by sample type. Horizontal line represents median deviation, top and bottom box boundaries delineate the 25th and 75th percentiles (IQR) and whiskers extend to 1.5 times the IQR above and below the box. Panel B: MAF deviations by sample type, clustered by HapMap MAF categories. Box plots are constructed as in Panel A.
Figure 5 Cluster plots by MAF and sample type. Three SNPs meeting Hardy-Weinberg expectations (p > 0.24) were randomly selected for visual presentation, representing differing MAF classes. A) rs16933208 (MAF = 0.98%); B) rs12598047 (MAF = 2.88%); C) rs11803463 (MAF = 21.6%). Buccal-1 and Buccal-2 columns present clusters from two different plates of buccal samples, while the Blood column presents clusters from a plate of blood samples.