| Literature DB >> 22252406 |
Daniel W H Ho1, Maurice K H Yap, Shea Ping Yip.
Abstract
BACKGROUND: Despite being a well-established strategy for cost reduction in disease gene mapping, pooled DNA association study is much less popular than the individual DNA approach. This situation is especially true for pooled DNA genomewide association study (GWAS), for which very few computer resources have been developed for its data analysis. This motivates the development of UPDG (Utilities package for data analysis of Pooled DNA GWAS).Entities:
Mesh:
Year: 2012 PMID: 22252406 PMCID: PMC3293712 DOI: 10.1186/1471-2156-13-1
Source DB: PubMed Journal: BMC Genet ISSN: 1471-2156 Impact factor: 2.797
Summary of various components of UPDG.
| UPDG component | Function |
|---|---|
| merge.pl | Combines genotype and intensity data of individual DNA GWAS from separate files |
| adjustment.pl | Estimates allele frequencies of markers from DNA pools and performs adjustment correcting for allelic preferential amplification with the methods based on Hoogendoorn |
| QC.pl | Removes SNPs with minor allele frequencies and call rates below a user-specified threshold and generates filtered input data file for nested ANOVA |
| nested_ANOVA_[H/M/N/U].r | Carries out nested ANOVA in R environment |
| output_format.pl | Organizes results from nested ANOVA and generates a summary of markers with p values below a user-specified threshold |
| mean_Rx_statistics.pl | Generates summary information on estimated allele frequencies for markers |
Figure 1Workflow of UPDG.
Accuracy of allele frequency estimation for DNA pools.
| Correction methoda | Allele frequency difference | Absolute valuesb | % of SNPs with over-estimated allele frequencyc | |||
|---|---|---|---|---|---|---|
| Sample group | Mean difference | SD | Mean difference | SD | ||
| 1 | Case | 0.0726 | 0.0414 | 0.0728 | 0.0411 | 99.1 |
| 2 | Case | 0.0725 | 0.0419 | 0.0725 | 0.0418 | 99.1 |
| 3 | Case | 0.0477 | 0.0435 | 0.0487 | 0.0423 | 94.3 |
| 4 | Case | 0.0690 | 0.0511 | 0.0740 | 0.0435 | 92.6 |
| 1 | Control | 0.0627 | 0.0467 | 0.0630 | 0.0463 | 99.1 |
| 2 | Control | 0.0625 | 0.0465 | 0.0626 | 0.0463 | 99.1 |
| 3 | Control | 0.0352 | 0.0512 | 0.0389 | 0.0484 | 85.7 |
| 4 | Control | 0.0591 | 0.0500 | 0.0637 | 0.0438 | 93.5 |
| 1 | Case - Control | 0.0099 | 0.0276 | 0.0246 | 0.0158 | 70.4 |
| 2 | Case - Control | 0.0100 | 0.0275 | 0.0247 | 0.0157 | 69.4 |
| 3 | Case - Control | 0.0125 | 0.0328 | 0.0298 | 0.0184 | 68.6 |
| 4 | Case - Control | 0.0099 | 0.0293 | 0.0253 | 0.0176 | 69.4 |
a Correction methods used include those based on (1) Hoogendoorn et al [6], (2) Meaburn et al [7], (3) Craig et al [8] plus Meaburn et al [7], and (4) no adjustment.
b Mean of the absolute values for the differences in allele frequencies between DNA pools and individual samples.
c In total, 108 SNPs were compared for allele frequency differences as estimated for DNA pools and individual DNA samples. However, only 105 SNPs were compared for the correction method (3) (Craig et al [8] plus Meaburn et al [7]) because 3 SNPs were filtered out.