| Literature DB >> 31001313 |
Sanjeev Sariya1,2, Joseph H Lee1,2,3, Richard Mayeux1,2,3, Badri N Vardarajan1,2, Dolly Reyes-Dumeyer1,2, Jennifer J Manly1,2,3, Adam M Brickman1,2,3, Rafael Lantigua4, Martin Medrano5, Ivonne Z Jimenez-Velazquez6, Giuseppe Tosto1,2,3.
Abstract
BACKGROUND: Imputation has become a standard approach in genome-wide association studies (GWAS) to infer in silico untyped markers. Although feasibility for common variants imputation is well established, we aimed to assess rare and ultra-rare variants' imputation in an admixed Caribbean Hispanic population (CH).Entities:
Keywords: 1000G; GWAS; admixed population; imputation; rare variants
Year: 2019 PMID: 31001313 PMCID: PMC6456789 DOI: 10.3389/fgene.2019.00239
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
SNP counts in HRC and 1000G reference panel.
| Reference Panel | Individuals | Autosomal variants | Bi-allelic SNPs | Multi-allelic SNPs |
|---|---|---|---|---|
| 1000G Phase 3 | 2,504 | 81,706,022 | 77,818,332 | 3,887,690 |
| HRC | 27,165∗ | 39,131,600 | 39,131,600 | NA |
Type of imputed SNPs across reference panels.
| Reference Panel | Multi-allelic SNPs | Bi-allelic SNPs | Total SNPs | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Total SNPs | Info ≥ 0.40 (%) | Info ≥ 0.80 (%) | Total SNPs | Info ≥ 0.40 (%) | Info ≥ 0.80 (%) | Total SNPs | Info ≥ 0.40 (%) | Info ≥ 0.80 (%) | |
| All SNPs | |||||||||
| 1000G | 3,319,815 | 2,586,342 (77.90) | 2,061,295 (62.09) | 77,920,577 | 31,423,926 (40.32) | 23,468,086 (30.11) | 81,240,392 | 31,423,926 (41.86) | 25,529,381 (31.42) |
| HRC | NA | NA | NA | 38,532,090 | 23,436,980 (60.82) | 18,833,790 (48.87) | 38,532,090 | 23,436,980 (60.82) | 18,833,790 (48.79) |
| 1000G | NA | NA | NA | 30,090,251 | 22,631,112 (75.21) | 18,408,585 (61.17) | 30,090,251 | 22,631,112 (75.21) | 18,408,585 (61.17) |
| HRC | NA | NA | NA | 30,090,251 | 22,438,268 (74.56) | 18,395,036 (61.13) | 30,090,251 | 22,438,268 (74.56) | 18,395,036 (61.13) |
SNP Counts for all Bi-allelic uncommon, rare and ultra-rare SNPs.
| MAF | 1000G | HRC | ||||
|---|---|---|---|---|---|---|
| Info ≥ 0 | Info ≥ 0.40 (%) | Info ≥ 0.80 (%) | Info ≥ 0 | Info ≥ 0.40 (%) | Info ≥ 0.80 (%) | |
| (1–5%) | 6,025,281 | 5,989,223 (98.90) | 5,441,982 (90.31) | 5,434,996 | 5,421,257 (99.84) | 5,061,904 (93.13) |
| (0.1–1%) | 20,249,058 | 16,881,286 (83.36) | 10,901,789 (53.83) | 11,780,671 | 10,931,924 (92.79) | 7,404,808 (62.85) |
| (0–0.1%) | 44,562,205 | 1,490,434 (3.34) | 242,717 (0.544) | 15,055,433 | 828,256 (5.50) | 174,673 (1.16) |
| (1–5%) | 5,624,956 | 5,604,308 (99.63) | 5,148,285 (91.52) | 5,396,207 | 5,385,364 (99.79) | 5,037,187 (93.34) |
| (0.1–1%) | 11,875,603 | 10,442,603 (87.93) | 7,027,312 (59.17) | 10,945,899 | 10,268,136 (93.80) | 7,060,908 (64.50) |
| (0–0.1%) | 6,314,479 | 312,967 (4.95) | 47,614 (0.75) | 7,519,807 | 560,043 (7.44) | 127,423 (1.69) |
Figure 1Comparison of average Info quality between HRC and 1000G reference panel for all autosomal chromosomes.
Figure 2Comparison of average Info on CHR14: 70–75 MB (5 MB) vs. 73–74 MB (1 MB) region.
Comparison for mismatch counts and Kappa (K) for HRC and 1000G using WES data on Chromosome 14.
| MAF | 1000G | HRC | ||||||
|---|---|---|---|---|---|---|---|---|
| Info ≥ 0.80 | Info ≥ 0.80 | |||||||
| SNP | Total SNPs in all persons∗ | Mismatch | Kappa | SNP | Total SNPs in all persons∗ | Mismatch | Kappa | |
| (1–5%) | 2,354 | 610,550 | 7,397 (1.22%) | 0.99 | 2,264 | 587,961 | 8,963 (1.52%) | 0.99 |
| (0.1–1%) | 3,542 | 926,109 | 1,245 (0.13%) | 0.99 | 3,759 | 982,734 | 2,439 (0.24%) | 0.99 |
| (0–0.1%) | 35 | 9,163 | 10 | 0.99 | 93 | 24,348 | 32 | 0.99 |
| (0.10%) | (0.13%) | |||||||