| Literature DB >> 21663644 |
Josyf C Mychaleckyj1, Emily A Farber, Jessica Chmielewski, Jamie Artale, Laney S Light, Donald W Bowden, Xuanlin Hou, Santica M Marcovina.
Abstract
BACKGROUND: Blood specimen collection at an early study visit is often included in observational studies or clinical trials for analysis of secondary outcome biomarkers. A common protocol is to store buffy coat specimens for future DNA isolation and these may remain in frozen storage for many years. It is uncertain if the DNA remains suitable for modern genome wide association (GWA) genotyping.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21663644 PMCID: PMC3128059 DOI: 10.1186/1479-5876-9-91
Source DB: PubMed Journal: J Transl Med ISSN: 1479-5876 Impact factor: 5.531
Clinical characteristics of the stored buffy coat samples selected for DNA isolation, N = 120 total samples
| Buffy Coat Sample Characteristic | N (% or Std Err) |
|---|---|
| Male | 78 (65%) |
| Female | 42 (35%) |
| White | 77 (64.2%) |
| African-American | 23 (19.2%) |
| Hispanic | 7 (5.8%) |
| Asian | 5 (4.2%) |
| Other | 8 (6.7%) |
| Mean years (Std Err) | 63.5 (0.66) |
| Range years | 43-80 |
| 2001 | 24 (20%) |
| 2003 | 24 (20%) |
| 2004 | 24 (20%) |
| 2005 | 24 (20%) |
| 2006 | 11 (9.2%) |
| 2007 | 10 (8.3%) |
| 2008 | 3 (2.5%) |
| Total CCNs Sampled | 7 |
| Mean Samples per CCN (Range) | 17.1 (9-20) |
| Total Clinics Sampled | 74 |
| Mean Samples per Clinic (Range) | 1.62 (1-3) |
| Mean hours (Std Err) | 29.3 (1.52) |
| Range hours | 12-144 |
Percentage sums may differ from 100% due to rounding. The Laboratory Receipt Time is the time in hours from blood collection to receipt in the Central Laboratory. The value in the parentheses after the count value in the column labeled N is the percentage or standard error except where noted as a range.
SNP panel composition and genotyping results
| dbSNP rs number) | ABI TaqMan Assay Identifier | Genome Location | Samples Genotyped | Missing Genotypes (%) |
|---|---|---|---|---|
| rs7792547 | c_29193799_10 | Chr7: 150754462 | 117 | 4 (3.4%) |
| rs7326634 | c_306605_10 | Chr13: 85638600 | 117 | 1 (0.85%) |
| rs1293288 | C_8339785_10 | Chr8: 11755937 | 117 | 0 (0%) |
| rs6935566 | c_29104855_10 | Chr6: 149608518 | 117 | 1 (0.85%) |
| rs2251110 | c_8793799_20 | Chr15: 31385829 | 117 | 0 (0%) |
| rs2118922 | c_15975275_10 | Chr4: 178404068 | 117 | 2 (1.7%) |
The table lists the dbSNP rs number, ABI identifier, genome location, number of samples assayed, and the missing genotype count and proportion for the 6 SNPs assayed by TaqMan genotyping. (ABI or Applied Biosystems Inc, is now part of Life Technologies).
Figure 1Distribution of total DNA yield from 120 ACCORD buffy coat specimens. The histogram shows the number of samples within each interval of total DNA yield. The interval size is 25 ug, and the maximum and minimum yields were 1.1 ug and 312.2 ug.
Figure 2Total DNA yield as a function of collection year from 120 ACCORD buffy coat specimens. The figure shows a Tukey boxplot of variation in DNA yield for each year of blood collection in the 120 ACCORD samples. The upper and lower edges (hinges) of the boxes are the third and first quartiles and the central line shows the median value of yield for a year. The lines radiating above and below the boxes visually show the range to the maximum and minimum yields. There are no outlier samples in any of the years with unusually large or small yields as defined by the usual robust test of more than 1.5 × interquartile range.
Combined linear regression and analysis of variance results for predictor variables of the total DNA yield (ug)
| Predictor | Number of Observations | Effect Size (Std Err) | t-test p-value | anova df | anova p-value |
|---|---|---|---|---|---|
| 22 | 2 | 0.16 | |||
| | 75 | -7.4 ug (15.8) | 0.21 | ||
| | 20 | 25.7 ug (20.6) | 0.64 | ||
| 117 | -11.6 ug (4.2) | 0.0061 | 1 | 0.0061 | |
| 117 | 6 | 0.28 | |||
| 42 | 0.30 | ||||
| | 75 | 13.8 ug (13.1) | 0.30 | 1 | |
| 117 | -2.1 ug/year (0.84) | 0.015 | 1 | 0.015 | |
| 117 | -0.7 ug/hour (0.5) | 0.16 | 1 | 0.16 | |
The table contains the marginal univariate t-test results are shown for all predictor variables with 1 or 2 degrees of freedom (df). The marginal tests describe the effect of the predictor variable after adjusting for all other variables in the model. The individual t-test results for each CCN are not shown. The anova and t-test p-values are necessarily identical for 1 degree of freedom predictors. The reference category for the categorical predictor variables (Race and Gender) are shown in parentheses beside the variable name.
ACCORD Illumina Omni1-Quad Statistical Genotype QC Analysis Results
| Step | Samples Remaining | SNPs Remaining | Dropped (% of Remaining) |
|---|---|---|---|
| Post-Laboratory QC | 32 | 1,140,419 | |
| Remove CNV probes | 1,048,713 | 91,706 | |
| Drop SNPs with 100% missing data | 32 | 1,015,235 | 33,478 |
| Drop samples with > 1% missing data | 32 | 1,015,235 | 0 |
| Mean per sample Genotype Missing Rate (range; sem) | 32 | 0.20% | For all SNPs |
| 0.21% | SNPs with < 100% missing genotypes | ||
| Median per sample Genotype Missing Rate | 32 | 0.16% | |
| Mean per SNP Genotype Missing Rate | 1,015,235 | 0.21% | 32 Samples |
| Median per SNP Genotype Missing Rate | 1,015,235 | 0% | 32 Samples |
The upper part of the table shows the samples and SNPs that were dropped at each step of the Statistical Genotype QC analysis and the total number remaining. The lower part of the table summarizes values of the key QC statistics in the samples and SNPs remaining after the QC steps.