| Literature DB >> 25519383 |
Sean Lacey1, Jae Yoon Chung2, Honghuang Lin3.
Abstract
As the cost of DNA sequencing decreases, association studies based on whole genome sequencing are now becoming feasible. It is still unclear, however, how much more we could gain from whole genome sequencing compared to exome sequencing, which has been widely used to study a variety of diseases. In this project, we performed a comparison between whole genome sequencing and exome sequencing for family-based association analysis using data from Genetic Analysis Workshop 18. Whole genome sequencing was able to identify several significant hits within intergenic regions. However, the increased cost of multiple testing counteracted the benefits and resulted in a higher false discovery rate. Our results suggest that exome sequencing is a cost-effective way to identify disease-related variants. With the decreasing sequencing cost and accumulating knowledge of the human genome, whole genome sequencing has the potential to identify important variants in regulatory regions typically inaccessible for exome sequencing.Entities:
Year: 2014 PMID: 25519383 PMCID: PMC4143706 DOI: 10.1186/1753-6561-8-S1-S38
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Number of variants captured by each platform
| Platform | Number of variants | Number of common variants |
|---|---|---|
| Whole genome sequencing | 8,348,674 | 4,152,114 |
| Exome sequencing (Agilent) | 129,204 | 58,091 |
| Exome sequencing (Illumina) | 156,910 | 70,347 |
| Exome sequencing (NimbleGen) | 113,150 | 50,000 |
| GWAS SNPs | 453,285 | 425,734 |
Figure 1Manhattan plot of common variants associated with SBP. Red line is the Bonferroni p value cutoff.
Most significant variants in whole genome sequencing
| Rank | SNP | Function | Rank | SNP | Function | Rank | SNP | Function | |||
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 3:106206487 | 1 | Intergenic | 11 | 5:144654771 | 5.3E-07 | Intergenic | 21 | 7:11022564 | 8.8E-07 | Exonic |
| 2 | 9:15503905 | 1.3E-07 | Intronic | 12 | 17:63733954 | 5.3E-07 | Intronic | 22 | 7:11025635 | 8.8E-07 | Intronic |
| 3 | 9:15500315 | 1.9E-07 | Intronic | 13 | 9:15396745 | 5.5E-07 | Intergenic | 23 | 7:11027754 | 8.8E-07 | Intronic |
| 4 | 9:15501713 | 1.9E-07 | Intronic | 14 | 9:15527348 | 6.7E-07 | Intergenic | 24 | 7:11016614 | 9.4E-07 | Intronic |
| 5 | 9:15501753 | 1.9E-07 | Intronic | 15 | 9:15501422 | 7.1E-07 | Intronic | 25 | 7:11008221 | 1.0E-06 | Intergenic |
| 6 | 9:15500150 | 2.8E-07 | Intronic | 16 | 9:15472139 | 7.5E-07 | Intronic | 26 | 9:15459251 | 1.0E-06 | Intronic |
| 7 | 9:15528010 | 3.1E-07 | Intergenic | 17 | 15:92024157 | 7.5E-07 | Intergenic | 27 | 7:11022230 | 1.0E-06 | Exonic |
| 8 | 9:15482976 | 3.3E-07 | Intronic | 18 | 9:15396695 | 8.4E-07 | Intergenic | 28 | 7:11025638 | 1.1E-06 | Intronic |
| 9 | 9:15498495 | 3.7E-07 | Intronic | 19 | 7:11016690 | 8.8E-07 | Intronic | 29 | 7:11015444 | 1.1E-06 | Intronic |
| 10 | 9:15466033 | 4.0E-07 | Intronic | 20 | 7:11018224 | 8.8E-07 | Intronic | 30 | 9:15397233 | 1.1E-06 | Intergenic |
Most significant variants in exome sequencing and GWAS
| Rank | GWAS SNPs | Agilent | Illumina | NimbleGen | ||||||||
| SNP | Function | SNP | Function | SNP | Function | SNP | Function | |||||
| 1 | 9:15472139 | 7.5E-07 | Intronic | 7:11022564 | 8.8E-07 | Exonic | 7:11022564 | 8.8E-07 | Exonic | 7:11022564 | 8.8E-07 | Exonic |
| 2 | 7:11022564 | 8.8E-07 | Exonic | 7:11022230 | 1.0E-06 | Exonic | 7:11022230 | 1.0E-06 | Exonic | 7:11022230 | 1.0E-06 | Exonic |
| 3 | 7:11008221 | 1.0E-06 | Intergenic | 9:15468480 | 4.4E-06 | Intronic | 7:94927677 | 2.9E-06 | UTR3 | 9:15571630 | 3.2E-05 | Exonic |
| 4 | 7:11022230 | 1.0E-06 | Exonic | 7:41661724 | 2.9E-05 | Intergenic | 7:94921491 | 8.3E-06 | UTR3 | 9:14863863 | 7.2E-05 | Exonic |
| 5 | 7:11015444 | 1.1E-06 | Intronic | 9:15571630 | 3.2E-05 | Exonic | 9:15571630 | 3.2E-05 | Exonic | 3:125859012 | 7.4E-05 | Intronic |
| 6 | 9:15528290 | 1.1E-06 | Intergenic | 3:184766392 | 3.9E-05 | Intronic | 3:184770380 | 3.9E-05 | UTR3 | 17:64210580 | 7.5E-05 | Exonic |
| 7 | 9:15443430 | 1.4E-06 | Intronic | 3:184769911 | 4.5E-05 | UTR3 | 3:184769911 | 4.5E-05 | UTR3 | 7:44608718 | 7.9E-05 | Intronic |
| 8 | 9:15469733 | 1.7E-06 | Intronic | 3:184769941 | 4.5E-05 | UTR3 | 3:184769941 | 4.5E-05 | UTR3 | 1:212798260 | 1.0E-04 | Exonic |
| 9 | 3:106199956 | 2.2E-06 | Intergenic | 9:14863863 | 7.2E-05 | Exonic | 7:94921543 | 6.8E-05 | UTR3 | 1:210761365 | 1.2E-04 | Exonic |
| 10 | 9:15446868 | 2.3E-06 | Intronic | 17:64210580 | 7.5E-05 | Exonic | 9:14863863 | 7.2E-05 | Exonic | 7:2414142 | 1.2E-04 | Intronic |
Number of SNPs passing FDR threshold in each platform
| 5% | 2 | 2 | 2 | 0 | 0 |
| 10% | 3 | 3 | 2 | 10 | 0 |
| 15% | 3 | 4 | 2 | 18 | 40 |
Figure 2Q-Q plots of common variants associated with SBP for whole genome sequencing and exome sequencing