| Literature DB >> 21660233 |
Nathan E Wineinger1, Nicholas M Pajewski, Hemant K Tiwari.
Abstract
Since the discovery of the ubiquitous contribution of copy number variation to genetic variability, researchers have commonly used metrics such as r (2) to quantify linkage disequilibrium (LD) between copy number variants (CNVs) and single nucleotide polymorphisms (SNPs). However, these reports have been restricted to SNPs outside copy number variable regions (CNVR) as current methods have not been adapted to account for SNPs displaying variable copy number. We show that traditional LD metrics inappropriately quantify SNP/CNV covariance when SNPs lie within CNVR. We derive a new method for measuring LD that solves this issue, and defaults to traditional metrics otherwise. Finally, we present a procedure to estimate CNV-SNP allele frequencies from unphased CNV-SNP genotypes. Our method allows researchers to include all SNPs in SNP/CNV LD measurements, regardless of copy number.Entities:
Keywords: CNV–SNP haplotype; copy number variation; linkage disequilibrium
Year: 2011 PMID: 21660233 PMCID: PMC3109359 DOI: 10.3389/fgene.2011.00017
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Copy number variant–single nucleotide polymorphism (CNV–SNP) alleles based upon a haploid three copy number state model (zero to two copies per haploid).
| CNV–SNP allele | Copy number | Number of A alleles | Frequency |
|---|---|---|---|
| 0 | 0 | 0 | |
| A | 1 | 1 | |
| B | 1 | 0 | |
| AA | 2 | 2 | |
| AB | 2 | 1 | |
| BB | 2 | 0 |
Figure 1Linkage disequilibrium (.
Copy number variant–single nucleotide polymorphism (CNV–SNP) genotypes based upon a haploid three copy number state model, CNV–SNP haploid configurations, and respective frequencies.
| CNV–SNP genotype | Haploid configuration | Frequency |
|---|---|---|
| 0 | 0/0 | |
| A | A/0 | 2 |
| B | B/0 | 2 |
| AA | A/A | |
| AA/0 | 2 | |
| AB | A/B | 2 |
| AB/0 | 2 | |
| BB | B/B | |
| BB/0 | 2 | |
| AAA | AA/A | 2 |
| AAB | AA/B | 2 |
| AB/A | 2 | |
| ABB | BB/A | 2 |
| AB/B | 2 | |
| BBB | BB/B | 2 |
| AAAA | AA/AA | |
| AAAB | AA/AB | 2 |
| AABB | AA/BB | 2 |
| AB/AB | ||
| ABBB | AB/BB | 2 |
| BBBB | BB/BB |
Frequency estimates are based upon haploid configurations falling into their appropriate Hardy–Weinberg equilibrium proportions.
Measurements of linkage disequilibrium (LD) between copy number variants and single nucleotide polymorphisms (SNPs) within the copy number variable region.
| Type | Frequency | |||
|---|---|---|---|---|
| Deletion only (1) | 0.111 | 0.074 | 0* | |
| Deletion only (2) | 0.429 | 0.250 | 0* | |
| Duplication only (1) | 0.667 | 0.910 | 0.818 | |
| Duplication only (2) | 0.146 | 0.146 | 0 | |
| Duplication only (3) | 0.098 | 0.098 | 0 | |
| Multiallelic (1) | 0.014 | 0.656 | 0.758 | |
| Multiallelic (2) | 0.222 | 0.222 | 0 | |
0*: .
Copy number variant–single nucleotide polymorphism (CNV–SNP) allele frequency estimation procedure diagnostics based upon 1,000 simulations of a sample size of 1,000 (2,000 haploids) and various CNV–SNP allele frequencies.
| Type | Simulated frequency | Mean difference |
|---|---|---|
| No CNVs | 0* | |
| Deletion only (1) | 0* | |
| Deletion only (2) | 0* | |
| Duplication only (1) | 0.0019 | |
| 0.0019 | ||
| 0.0019 | ||
| 0.0019 | ||
| Duplication only (2) | 0.0050 | |
| 0.0050 | ||
| 0.0048 | ||
| 0.0082 | ||
| 0.0048 | ||
| Duplication only (3) | 0* | |
| Multiallelic (1) | 0.0040 | |
| 0.0069 | ||
| 0.0108 | ||
| 0.0047 | ||
| 0.0076 | ||
| 0.0100 | ||
| Multiallelic (2) | 0.0028 | |
| 0.0038 | ||
| 0.0102 | ||
| 0.0020 | ||
| 0.0097 |
Mean difference represents the mean difference between the true and estimated CNV–SNP allele frequencies.
0*: Less than 1 × 10.