| Literature DB >> 20018000 |
Corina Shtir1, Roger Pique-Regi, Kim Siegmund, John Morrison, Fredrick Schumacher, Paul Marjoram.
Abstract
In this paper we test for association between copy number variation and diabetes in a subset of individuals from the Framingham Heart Study. We used the 500 k SNP data and called copy number variation using two algorithms: the genome alteration detection algorithm of Pique-Regi et al. and the software Golden Helix. We then tested for association between copy number and diabetes using a gene-based analysis. Our results show little evidence of association between copy number and diabetes status. Furthermore, our results indicate a relatively poor level of agreement between copy number calls resulting from the two programs. We then examined potential causes for this difference in results and the implications for future studies.Entities:
Year: 2009 PMID: 20018000 PMCID: PMC2795907 DOI: 10.1186/1753-6561-3-s7-s133
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Summary of gene-based CNV associations for GADA based segmentation (Wilcoxon rank-sum test)
| % Significant genesc | ||||
|---|---|---|---|---|
| Chr | Affymetrix genesa | Genotyped genesb | GADA | CNAM |
| 1 | 1650 | 1500 | 5.12 | 12.60 |
| 2 | 1046 | 974 | 0.72 | 16.32 |
| 3 | 893 | 829 | 0.36 | 13.51 |
| 4 | 661 | 634 | 1.31 | 1.57 |
| 5 | 722 | 680 | 7.45 | 22.35 |
| 6 | 883 | 822 | 1.84 | 18.86 |
| 7 | 709 | 655 | 3.09 | 6.87 |
| 8 | 539 | 505 | 10.18 | 1.78 |
| 9 | 632 | 582 | 0.87 | 4.81 |
| 10 | 636 | 603 | 1.84 | 8.96 |
| 11 | 1028 | 907 | 1.68 | 4.85 |
| 12 | 871 | 799 | 4.80 | 15.14 |
| 13 | 295 | 287 | 8.81 | 10.45 |
| 14 | 527 | 474 | 4.25 | 12.66 |
| 15 | 507 | 474 | 1.92 | 9.92 |
| 16 | 580 | 483 | 5.64 | 3.52 |
| 17 | 851 | 712 | 8.09 | 0.14 |
| 18 | 241 | 241 | 4.17 | 1.66 |
| 19 | 912 | 713 | 1.51 | 0.14 |
| 20 | 470 | 439 | 2.75 | 5.47 |
| 21 | 201 | 187 | 0 | 9.09 |
| 22 | 365 | 326 | 3.12 | 0 |
| Total/Average | 15219 | 13826 | 3.61 | 8.62 |
aOverall number of genes as defined by Affymetrix annotation files
bNumber of those genes for which we had SNP intensity data
cPercentage of those genes in which associated CNV was detected at the p = 0.05 value (uncorrected for multiple comparisons) using GADA and CNAM
Figure 1Q-Q Plot of . The plot shows observed (x-axis) and expected (y-axis) values of -log10 p-values resulting from the gene-based test for association between CNV and diabetes using the GADA and CNAM analyses.
Figure 2Top 20 genes with most significant CNV associations. We show the distribution of potentially significant genes varies along the genome. The upper plot shows results from the GADA gene-based analysis; the lower shows those from the CNAM analysis. In each case, red bars show the 20 genes that have the smallest p-value when testing association with diabetes.
Figure 3Scatter plots. The left plot shows a heat-map of normalized SNP intensities resulting from the CNAM normalization (x-axis) and APT normalization used for the GADA analysis (y-axis) for a randomly chosen individual. There is a striking lack of correlation between the results of the two normalization routines. The right plot is a scatter plot of resulting copy number calls for randomly chosen (but representative) regions along the genome. Again, we note a striking lack of agreement between calls resulting from CNAM and GADA.