| Literature DB >> 22718036 |
Alexander A Morgan1, Rong Chen, Atul Janardhan Butte.
Abstract
OBJECTIVE: We investigated the common-disease relevant information obtained from sequencing compared with that reported from genotyping arrays.Entities:
Mesh:
Year: 2012 PMID: 22718036 PMCID: PMC3392859 DOI: 10.1136/amiajnl-2011-000737
Source DB: PubMed Journal: J Am Med Inform Assoc ISSN: 1067-5027 Impact factor: 4.497
Figure 1Difference in likelihood ratios (LRs) derived from the genome sequence and genotyping array reportable variants from the Affymetrix 500k platform. Differences in natural logarithm of likelihood ratios derived for 55 diseases for the single nucleotide polymorphisms (SNPs) measured on the Affymetrix 500k platform, compared with the likelihood ratios derived from the SNPs in the genome sequence for the 187 genomes examined. For each disease, the log likelihood ratio reported by the array based SNPs is subtracted from the sequence derived value for each individual. The boxplot shows the median as a dark bar, with the box width showing the center quartiles, and the whiskers showing the outer quartiles; extreme outliers are excluded. The results using the product of likelihood ratios for all associated variants is depicted in the plot on the left, while the results from the maximum likelihood ratio are shown in the plot on the right; note that the horizontal scales are different. To the left of each disease bar are two numbers. The lighter colored, left number indicates the number of SNPs used in the likelihood ratio derived from the full sequence, while the darker number to the right indicates the number of SNPs used in the likelihood ratios derived from the SNPs directly measured by the Affymetrix 500k platform.
Figure 2Difference in likelihood ratios (LRs) derived from the genome sequence and genotyping array reportable variants from the Illumina Omni platform. Differences in log likelihood ratios derived for 55 diseases from the SNPs measured on the Illumina Omni platform, compared with the likelihood ratios derived from the SNPs in the full genome sequence for the 187 genomes examined. The results are presented as in figure 1.
Root mean squared (RMS) difference in log likelihood ratios (LRs) between the those derived from the full sequences and those derivable from genotyping arrays, averaged across diseases
| Simulated platform | SNP calling | Risk model | Averaged RMS log likelihood difference |
| Affymetrix 500k | Measured | Product of LRs | 0.895 |
| Maximum of LRs | 0.384 | ||
| Measured and imputable | Product of LRs | 0.514 | |
| Maximum of LRs | 0.204 | ||
| Illumina Omni | Measured | Product of LRs | 0.519 |
| Maximum of LRs | 0.203 | ||
| Measured and imputable | Product of LRs | 0.292 | |
| Maximum of LRs | 0.102 |
SNP, single nucleotide polymorphism.
Relative difference between clinical profiles from genotyping array single nucleotide polymorphisms (SNPs) and sequencing by disease, using the product of likelihood ratio model
The ratio of the root mean squared (RMS) difference between the log of the product of likelihood ratios derived from genotyping array SNPs and that derived from sequence data, over the RMS likelihood ratios derived from sequence data alone for each disease.
Dark gray corresponds to a greater relative difference; lighter gray corresponds to a lower relative difference; and white (0.00) indicates no difference between the results of sequencing and genotyping.
Relative difference between clinical profiles from genotyping array single nucleotide polymorphisms (SNPs) and sequencing by disease, using the maximum likelihood ratio model
The ratio of the root mean squared (RMS) difference between the log of the maximum likelihood ratios derived from genotyping array SNPs and that derived from sequence data, over the RMS of the likelihood ratios derived from sequence data alone for each disease.
Dark gray corresponds to a greater relative difference; lighter gray corresponds to a lower relative difference; and white (0.00) indicates no difference between sequencing and genotyping.