| Literature DB >> 19087310 |
Bin-Cheng Yin1, Honghua Li, Bang-Ce Ye.
Abstract
BACKGROUND: High throughput genotyping of single nucleotide polymorphisms (SNPs) for genome-wide association requires technologies for generating millions of genotypes with relative ease but also at a reasonable cost and with high accuracy. In this work, we have developed a theoretical approach to estimate allele frequency in pooled DNA samples, based on the physical principles of DNA immobilization and hybridization on solid surface using the Langmuir kinetic model and quantitative analysis of the allelic signals.Entities:
Mesh:
Substances:
Year: 2008 PMID: 19087310 PMCID: PMC2640397 DOI: 10.1186/1471-2164-9-605
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Schematic illustration of the competitive immobilization and hybridization processes on glass surface using the strategy of tag-probe-allele sandwich structure. The single-base variation is indicted by letters. Fluorescent dyes are represented by small red or green spheres.
Figure 2Average background-corrected signal intensity as a function of allele spotting concentration for the six SNPs. All curves depict the non-linear relationship modeled as Langmuir kinetic function for signal intensity from microarray. The spotting concentration of the allelic sequences ranged from 0 to 50 μM for each SNP. The signal intensity is calculated by integrating line profiles for spots containing the perfect match and normalizing the signals. The solid and dashed lines in each plot show best fit to the Langmuir kinetic model for the two alleles.
The constants of Ka and IR for tag-probe-allele hybridization derived from spike-in experiments
| SNP | Allele | ||
| 6.5 | 2.230 | ||
| 6.6 | 2.268 | ||
| 9.9 | 2.092 | ||
| 9.9 | 2.170 | ||
| 7.9 | 2.082 | ||
| 8.0 | 2.083 | ||
| 5.1 | 2.082 | ||
| 5.2 | 2.099 | ||
| 7.7 | 1.979 | ||
| 7.9 | 2.082 | ||
| 7.9 | 1.983 | ||
| 8.0 | 1.999 | ||
Allele frequency estimates for the six SNPs from pooled DNA samples*
| Allele Frequency | ||||||||||||||
| SNP | Allele | Measure | Poo1 1 | Poo1 2 | Poo1 3 | Poo1 4 | Poo1 5 | Poo1 6 | Poo1 7 | Poo1 8 | Poo1 9 | Poo1 10 | Poo1 11 | Pool 12 |
| Actual | 0.05 | 0.10 | 0.20 | 0.30 | 0.40 | 0.50 | 0.60 | 0.70 | 0.80 | 0.90 | 0.95 | 1.00 | ||
| Observed | 0.053 | 0.108 | 0.19 | 0.308 | 0.396 | 0.514 | 0.604 | 0.704 | 0.804 | 0.894 | 0.949 | 0.988 | ||
| SD | 0.014 | 0.017 | 0.039 | 0.035 | 0.025 | 0.029 | 0.023 | 0.031 | 0.03 | 0.025 | 0.016 | 0.014 | ||
| Actual | 0.05 | 0.10 | 0.20 | 0.30 | 0.40 | 0.50 | 0.60 | 0.70 | 0.80 | 0.90 | 0.95 | 1.00 | ||
| Observed | 0.051 | 0.113 | 0.195 | 0.307 | 0.395 | 0.509 | 0.599 | 0.708 | 0.799 | 0.906 | 0.943 | 0.992 | ||
| SD | 0.017 | 0.03 | 0.036 | 0.031 | 0.026 | 0.023 | 0.022 | 0.023 | 0.041 | 0.02 | 0.021 | 0.011 | ||
| Actual | 0.05 | 0.10 | 0.20 | 0.30 | 0.40 | 0.50 | 0.60 | 0.70 | 0.80 | 0.90 | 0.95 | 1.00 | ||
| Observed | 0.057 | 0.107 | 0.2 | 0.307 | 0.405 | 0.503 | 0.603 | 0.706 | 0.804 | 0.893 | 0.951 | 0.99 | ||
| SD | 0.011 | 0.02 | 0.033 | 0.034 | 0.03 | 0.028 | 0.025 | 0.027 | 0.03 | 0.026 | 0.014 | 0.014 | ||
| Actual | 0.05 | 0.10 | 0.20 | 0.30 | 0.40 | 0.50 | 0.60 | 0.70 | 0.80 | 0.90 | 0.95 | 1.00 | ||
| Observed | 0.057 | 0.107 | 0.208 | 0.319 | 0.392 | 0.499 | 0.616 | 0.706 | 0.815 | 0.884 | 0.95 | 0.988 | ||
| SD | 0.011 | 0.021 | 0.031 | 0.034 | 0.031 | 0.027 | 0.031 | 0.027 | 0.032 | 0.031 | 0.017 | 0.013 | ||
| Actual | 0.05 | 0.10 | 0.20 | 0.30 | 0.40 | 0.50 | 0.60 | 0.70 | 0.80 | 0.90 | 0.95 | 1.00 | ||
| Observed | 0.056 | 0.108 | 0.208 | 0.322 | 0.407 | 0.504 | 0.612 | 0.71 | 0.815 | 0.908 | 0.951 | 0.99 | ||
| SD | 0.015 | 0.022 | 0.031 | 0.032 | 0.035 | 0.018 | 0.025 | 0.028 | 0.032 | 0.025 | 0.02 | 0.01 | ||
| Actual | 0.05 | 0.10 | 0.20 | 0.30 | 0.40 | 0.50 | 0.60 | 0.70 | 0.80 | 0.90 | 0.95 | 1.00 | ||
| Observed | 0.06 | 0.103 | 0.211 | 0.309 | 0.401 | 0.508 | 0.609 | 0.71 | 0.814 | 0.908 | 0.95 | 0.987 | ||
| SD | 0.012 | 0.02 | 0.023 | 0.026 | 0.029 | 0.015 | 0.021 | 0.028 | 0.031 | 0.025 | 0.023 | 0.017 | ||
*Each pooled DNA sample was measured five times.
Allele frequency estimates from pooled DNA samples with the minor allele frequency ranging from 2% to 10% for the "C" allele of SNP HBB28
| Measurement No. | Pool 1 | Pool 2 | Pool 3 | Pool 4 | Pool 5 | Pool 6 | Pool 7 | Pool 8 | Pool 9 |
| 1 | 0.014 | 0.035 | 0.047 | 0.048 | 0.058 | 0.078 | 0.091 | 0.079 | 0.110 |
| 2 | 0.015 | 0.028 | 0.038 | 0.049 | 0.059 | 0.051 | 0.085 | 0.091 | 0.095 |
| 3 | 0.030 | 0.033 | 0.043 | 0.054 | 0.059 | 0.072 | 0.088 | 0.087 | 0.099 |
| 4 | 0.018 | 0.040 | 0.042 | 0.049 | 0.067 | 0.070 | 0.077 | 0.092 | 0.118 |
| 5 | 0.031 | 0.032 | 0.043 | 0.048 | 0.063 | 0.065 | 0.083 | 0.082 | 0.095 |
| 6 | 0.022 | 0.035 | 0.037 | 0.052 | 0.054 | 0.053 | 0.077 | 0.096 | 0.097 |
| 7 | 0.024 | 0.034 | 0.039 | 0.052 | 0.072 | 0.073 | 0.087 | 0.085 | 0.096 |
| 8 | 0.032 | 0.037 | 0.041 | 0.059 | 0.067 | 0.067 | 0.089 | 0.091 | 0.125 |
| 9 | 0.032 | 0.029 | 0.046 | 0.048 | 0.070 | 0.078 | 0.078 | 0.079 | 0.119 |
| 10 | 0.019 | 0.028 | 0.036 | 0.057 | 0.059 | 0.081 | 0.061 | 0.087 | 0.120 |
| 11 | 0.018 | 0.038 | 0.044 | 0.034 | 0.056 | 0.074 | 0.084 | 0.087 | 0.114 |
| 12 | 0.031 | 0.030 | 0.041 | 0.059 | 0.059 | 0.069 | 0.084 | 0.084 | 0.098 |
| 13 | 0.022 | 0.029 | 0.048 | 0.067 | 0.054 | 0.067 | 0.079 | 0.088 | 0.097 |
| 14 | 0.024 | 0.031 | 0.032 | 0.063 | 0.065 | 0.067 | 0.087 | 0.087 | 0.095 |
| 15 | 0.032 | 0.022 | 0.042 | 0.054 | 0.067 | 0.065 | 0.080 | 0.089 | 0.114 |
| 16 | 0.023 | 0.024 | 0.039 | 0.054 | 0.062 | 0.057 | 0.077 | 0.093 | 0.111 |
| 17 | 0.024 | 0.032 | 0.048 | 0.067 | 0.062 | 0.069 | 0.080 | 0.094 | 0.094 |
| 18 | 0.030 | 0.032 | 0.040 | 0.052 | 0.058 | 0.067 | 0.082 | 0.093 | 0.097 |
| 19 | 0.024 | 0.037 | 0.032 | 0.054 | 0.058 | 0.067 | 0.077 | 0.091 | 0.096 |
| 20 | 0.029 | 0.039 | 0.035 | 0.049 | 0.072 | 0.074 | 0.082 | 0.091 | 0.100 |
| Actual | 0.020 | 0.030 | 0.040 | 0.050 | 0.060 | 0.070 | 0.080 | 0.090 | 0.100 |
| Observed | 0.025 | 0.032 | 0.041 | 0.053 | 0.062 | 0.068 | 0.081 | 0.088 | 0.105 |
| SD | 0.006 | 0.005 | 0.005 | 0.007 | 0.006 | 0.008 | 0.006 | 0.005 | 0.010 |
| SEM | 0.0060 | 0.0050 | 0.0049 | 0.0077 | 0.0059 | 0.0083 | 0.0064 | 0.0044 | 0.0103 |
P values calculated by the Kolmogorov-Smirnov test to assess the significance levels of the differences in allele frequencies between pooled samples with closest allelic frequencies
| Pools | ||
| Pool 1 | Pool 2 | 0.0078 |
| Pool 2 | Pool 3 | 0.0006 |
| Pool 3 | Pool 4 | < 0.0001 |
| Pool 4 | Pool 5 | 0.0006 |
| Pool 5 | Pool 6 | 0.0078 |
| Pool 6 | Pool 7 | < 0.0001 |
| Pool 7 | Pool 8 | 0.0078 |
| Pool 8 | Pool 9 | < 0.0001 |
Figure 3Scatter plot demonstrating the accuracy of allele frequency estimation using pooled DNA samples. The chart is drawn based on the results for allele frequency listed in Table 2 and Table 3 versus the known frequencies prior to PCR amplification for the six SNPs. Short horizontal bars are error bars. The diagonal line shows complete concordance between known and observed allelic fractions.
Figure 4Comparison of expected sampling errors and experimental errors of the frequencies for the two alleles of SNP . The chart was drawn based on the results in Table 3. (a) The solid line is the expected sampling error for SNP site (HBB28) for the allele frequency of 0.05 for sample size up to 1000. The upper broken line is the estimated combine sampling and measurement error for this method based on Table 3 (see text). The lower broken line is measurement error. (b) The same as (a) for the allele frequency of 0.10 described in Table 3.