| Literature DB >> 35046417 |
Juan L Rodriguez-Flores1,2, Radja Messai-Badji3, Amal Robay4, Asmaa Althani3,5, Ronald G Crystal6, Ramzi Temanni7, Najeeb Syed7, Monika Markovic8, Eiman Al-Khayat8, Fatima Qafoud8, Zafar Nawaz9, Ramin Badii4,9, Yasser Al-Sarraj3, Hamdi Mbarek3, Wadha Al-Muftah3, Muhammad Alvi3, Mahboubeh R Rostami1, Juan Carlos Martinez Cruzado10, Jason G Mezey1,11, Alya Al Shakaki4, Joel A Malek4, Matthew B Greenblatt12, Khalid A Fakhro4,7, Khaled Machaca4, Ajayeb Al-Nabet9, Nahla Afifi8, Andrew Brooks13,14, Said I Ismail3.
Abstract
Risk genes for Mendelian (single-gene) disorders (SGDs) are consistent across populations, but pathogenic risk variants that cause SGDs are typically population-private. The goal was to develop "QChip1," an inexpensive genotyping microarray to comprehensively screen newborns, couples, and patients for SGD risk variants in Qatar, a small nation on the Arabian Peninsula with a high degree of consanguinity. Over 108 variants in 8445 Qatari were identified for inclusion in a genotyping array containing 165,695 probes for 83,542 known and potentially pathogenic variants in 3438 SGDs. QChip1 had a concordance with whole-genome sequencing of 99.1%. Testing of QChip1 with 2707 Qatari genomes identified 32,674 risk variants, an average of 134 pathogenic alleles per Qatari genome. The most common pathogenic variants were those causing homocystinuria (1.12% risk allele frequency), and Stargardt disease (2.07%). The majority (85%) of Qatari SGD pathogenic variants were not present in Western populations such as European American, South Asian American, and African American in New York City and European and Afro-Caribbean in Puerto Rico; and only 50% were observed in a broad collection of data across the Greater Middle East including Kuwait, Iran, and United Arab Emirates. This study demonstrates the feasibility of developing accurate screening tools to identify SGD risk variants in understudied populations, and the need for ancestry-specific SGD screening tools.Entities:
Year: 2022 PMID: 35046417 PMCID: PMC8770564 DOI: 10.1038/s41525-021-00270-0
Source DB: PubMed Journal: NPJ Genom Med ISSN: 2056-7944 Impact factor: 6.083
Step 1: Identification of pathogenic variants and genes in the Qatari Genome.
| Category | Identification of variants/genes | Variants (n) | Genes (n) |
|---|---|---|---|
| All variants/genesa | Qatari Genome Program | 94,852,664 | 19,965 |
| Weill Cornell Medicine exomes | 767,957 | 19,385 | |
| Weill Cornell Medicine genomes | 28,331,826 | 18,499 | |
| Hamad Medical Corporation | 727 | 513 | |
| Comprehensive list | All Qatari variants/genes | 104,473,390 | 20,069 |
| Single nucleotide variants | 87,813,560 | 20,042 | |
| Indels | 16,659,829 | 19,898 | |
| Variants of interest for SGD research and screening | ClinVar variants/genes, including pathogenic and non-pathogenic | 10,490,820 | 3770 |
| SnpEff computationally predicted pathogenic variants/genes for research | 805,649 | 19,770 | |
| Comprehensive listb | Qatari variants/genes of interest for SDG research and screening | 207,370 | 3770 |
| Single nucleotide variants | 196,855 | 3769 | |
| Indels | 10,515 | 1897 |
aA list of all Qatari variants and genes was compiled from all Qatari variants and the genes responsible for these variants identified in datasets described in Supplementary Table 1.
bThe comprehensive list of all Qatari variants of interest for research and screening in single gene (Mendelian) disorder (SGD) was compiled from the subset of the list of all Qatari variants/genes identified in ClinVar and predicted to be of high or moderate impact by SnpEff.
Step 2: Design of QChip1 based on the predicted pathogenic variants in the Qatari Genome.
| Microarraya | Probes ( | Variant sites ( | Genes ( |
|---|---|---|---|
| QChip0 | 184,713 | 91,942 | 3540 |
| SNV | 179,257 | 89,696 | 3529 |
| Indels | 5456 | 2246 | 10,665 |
| QChip1 | 70,715 | 61,592 | 3438 |
| SNV | 69,745 | 60,858 | 3472 |
| Indels | 970 | 734 | 491 |
aBased on the comprehensive list of Qatari variants and genes of interest for SGD research and screening (Table 1), QChip0, the precursor of QChip1, was designed on the Axion platform with 184,713 probes representing 91,942 variants and 3540 genes (see Methods for a description of prioritization of variants/genes and choice of probes). QChip0 was tested with n = 26 Qatari DNA samples for which whole-genome sequencing was available. The poorly performing probes with low-quality genotype sites were eliminated, resulting in the final design of QChip1 with 70,715 probes representing 61,592 variants and 3438 genes.
Step 3: Concordance of QChip1 compared to whole-genome sequencinga.
| Populationb | Indel | SNV | Indel + SNV | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Concordant | Discordant | Total | Concordance rate | Concordant | Discordant | Total | Concordance rate | Concordant | Discordant | Total | Concordance rate | ||
| Qatari | Average | 1.7 × 101 | 1.4 × 100 | 1.8 × 101 | 9.2 × 10−1 | 2.8 × 104 | 2.2 × 102 | 2.8 × 104 | 9.9 × 10−1 | 2.8 × 104 | 2.3 × 102 | 2.8 × 104 | 9.9 × 10−1 |
| Standard deviation | 2.5 × 100 | 1.2 × 100 | 2.7 × 100 | 6.4 × 10−2 | 1.1 × 102 | 1.1 × 102 | 8.9 × 100 | 3.8 × 10−3 | 1.1 × 102 | 1.1 × 102 | 8.3 × 100 | 3.8 × 10−3 | |
| Sample size | 4.7 × 102 | 4.7 × 102 | 4.7 × 102 | 4.7 × 102 | 4.7 × 102 | 4.7 × 102 | 4.7 × 102 | 4.7 × 102 | 4.7 × 102 | 4.7 × 102 | 4.7 × 102 | 4.7 × 102 | |
| Confidence interval | 2.4 × 10−1 | 1.1 × 10−1 | 2.5 × 10−1 | 6.0 × 10−3 | 1.0 × 101 | 9.9 × 100 | 8.4 × 10−1 | 3.6 × 10−4 | 1.0 × 101 | 1.0 × 101 | 7.8 × 10−1 | 3.6 × 10−4 | |
| QGP_PAR | Average | 1.6 × 101 | 1.5 × 100 | 1.9 × 101 | 9.2 × 10−1 | 2.8 × 104 | 2.2 × 102 | 2.8 × 104 | 9.9 × 10−1 | 2.8 × 104 | 2.2 × 102 | 2.8 × 104 | 9.9 × 10−1 |
| Standard deviation | 2.4 × 100 | 1.1 × 100 | 2.3 × 100 | 5.6 × 10−2 | 8.6 × 101 | 8.5 × 101 | 2.3 × 100 | 3.1 × 10−3 | 8.6 × 101 | 8.6 × 101 | 0.0 × 100 | 3.1 × 10−3 | |
| Sample size | 1.4 × 102 | 1.5 × 102 | 1.5 × 102 | 1.5 × 102 | 1.5 × 102 | 1.5 × 102 | 1.5 × 102 | 1.5 × 102 | 1.5 × 102 | 1.5 × 102 | 1.5 × 102 | 1.5 × 102 | |
| Confidence interval | 4.3 × 10−1 | 1.9 × 10−1 | 3.9 × 10−1 | 9.4 × 10−3 | 1.4 × 101 | 1.4 × 101 | 3.9 × 10−1 | 5.1 × 10−4 | 1.4 × 101 | 1.4 × 101 | Undefined[ | 5.2 × 10−4 | |
| QGP_GAR | Average | 1.7 × 101 | 1.5 × 100 | 1.9 × 101 | 9.2 × 10−1 | 2.8 × 104 | 2.2 × 102 | 2.8 × 104 | 9.9 × 10−1 | 2.8 × 104 | 2.2 × 102 | 2.8 × 104 | 9.9 × 10−1 |
| Standard deviation | 2.1 × 100 | 1.1 × 100 | 2.3 × 100 | 5.6 × 10−2 | 8.6 × 101 | 8.5 × 101 | 2.3 × 100 | 3.1 × 10−3 | 8.6 × 101 | 8.6 × 101 | 0.0 × 100 | 3.1 × 10−3 | |
| Sample size | 1.5 × 102 | 1.5 × 102 | 1.5 × 102 | 1.5 × 102 | 1.5 × 102 | 1.5 × 102 | 1.5 × 102 | 1.5 × 102 | 1.5 × 102 | 1.5 × 102 | 1.5 × 102 | 1.5 × 102 | |
| Confidence interval | 3.5 × 10−1 | 1.9 × 10−1 | 3.9 × 10−1 | 9.4 × 10−3 | 1.4 × 101 | 1.4 × 101 | 3.9 × 10−1 | 5.1 × 10−4 | 1.4 × 101 | 1.4 × 101 | Undefined[ | 5.2 × 10−4 | |
| QGP_ADM | Average | 1.7 × 101 | 1.4 × 100 | 1.9 × 101 | 9.3 × 10−1 | 2.8 × 104 | 2.4 × 102 | 2.8 × 104 | 9.9 × 10−1 | 2.8 × 104 | 2.4 × 102 | 2.8 × 104 | 9.9 × 10−1 |
| Standard deviation | 2.6 × 100 | 1.1 × 100 | 2.8 × 100 | 5.8 × 10−2 | 7.5 × 101 | 7.4 × 101 | 2.2 × 101 | 2.7 × 10−3 | 7.5 × 101 | 7.5 × 101 | 2.1 × 101 | 2.7 × 10−3 | |
| Sample size | 7.1 × 101 | 7.1 × 101 | 7.1 × 101 | 7.1 × 101 | 7.1 × 101 | 7.1 × 101 | 7.1 × 101 | 7.1 × 101 | 7.1 × 101 | 7.1 × 101 | 7.1 × 101 | 7.1 × 101 | |
| Confidence interval | 6.3 × 10−1 | 2.0 × 10−1 | 6.9 × 10−1 | 1.4 × 10−2 | 1.8 × 101 | 1.8 × 101 | 5.4 × 100 | 6.5 × 10−4 | 1.8 × 101 | 1.8 × 101 | 5.2 × 100 | 6.6 × 10−4 | |
| QGP_WEP | Average | 1.7 × 101 | 1.3 × 100 | 1.8 × 101 | 9.3 × 10−1 | 2.8 × 104 | 2.1 × 102 | 2.0 × 104 | 9.9 × 10−1 | 2.8 × 104 | 2.1 × 102 | 2.8 × 104 | 9.9 × 10−1 |
| Standard deviation | 2.6 × 100 | 1.2 × 100 | 2.7 × 100 | 6.5 × 10−2 | 6.2 × 101 | 6.1 × 101 | 2.7 × 100 | 2.2 × 10−3 | 6.2 × 101 | 6.2 × 101 | 0.0 × 100 | 2.2 × 10−3 | |
| Sample size | 8.8 × 101 | 8.8 × 101 | 8.8 × 101 | 8.8 × 101 | 8.8 × 101 | 8.8 × 101 | 8.8 × 101 | 8.8 × 101 | 8.8 × 101 | 8.8 × 101 | 8.8 × 101 | 8.8 × 101 | |
| Confidence interval | 5.7 × 10−1 | 2.7 × 10−1 | 6.0 × 10−1 | 1.4 × 10−2 | 1.4 × 101 | 1.3 × 101 | 6.0 × 10−1 | 4.8 × 10−4 | 1.4 × 101 | 1.4 × 101 | Undefined[ | 4.9 × 10−4 | |
| QGP_SAS | Average | 1.8 × 101 | 0.0 × 100 | 1.8 × 101 | 1.0 × 100 | 2.8 × 104 | 1.4 × 102 | 2.8 × 104 | 9.9 × 10−1 | 2.8 × 104 | 1.4 × 102 | 2.8 × 104 | 9.9 × 10−1 |
| Standard deviation | 1.5 × 100 | 0.0 × 100 | 1.5 × 100 | 0.0 × 100 | 7.0 × 100 | 5.5 × 100 | 1.5 × 100 | 2.0 × 10−4 | 5.5 × 100 | 5.5 × 100 | 0.0 × 100 | 2.0 × 10−4 | |
| Sample size | 2.0 × 100 | 2.0 × 100 | 2.0 × 100 | 2.0 × 100 | 2.0 × 100 | 2.0 × 100 | 2.0 × 100 | 2.0 × 100 | 2.0 × 100 | 2.0 × 100 | 2.0 × 100 | 2.0 × 100 | |
| Confidence interval | 2.2 × 100 | Undefinedc | 2.2 × 100 | #NUM! | 1.0 × 101 | 8.0 × 100 | 2.2 × 100 | 2.9 × 10−4 | 8.0 × 100 | 8.0 × 100 | Undefined[ | 2.9 × 10−4 | |
| QGP_AFR | Average | 1.9 × 101 | 1.4 × 100 | 2.1 × 101 | 9.4 × 10−1 | 2.8 × 104 | 3.0 × 102 | 2.8 × 104 | 9.9 × 10−1 | 2.8 × 104 | 3.0 × 102 | 2.8 × 104 | 9.9 × 10−1 |
| Standard deviation | 2.8 × 100 | 1.3 × 100 | 2.9 × 100 | 5.7 × 10−2 | 7.1 × 101 | 7.0 × 101 | 2.9 × 100 | 2.5 × 10−3 | 7.1 × 101 | 7.1 × 101 | 0.0 × 100 | 2.5 × 10−3 | |
| Sample size | 2.7 × 101 | 2.7 × 101 | 2.7 × 101 | 2.7 × 101 | 2.7 × 101 | 2.7 × 101 | 2.7 × 101 | 2.7 × 101 | 2.7 × 101 | 2.7 × 101 | 2.7 × 101 | 2.7 × 101 | |
| Confidence interval | 1.0 × 100 | 4.9 × 10−1 | 1.2 × 100 | 2.2 × 10−2 | 2.8 × 101 | 2.8 × 101 | 1.2 × 100 | 9.9 × 10−4 | 2.8 × 101 | 2.8 × 101 | Undefined[ | 1.0 × 10−3 | |
aIn order to assess the quality of QChip1 data, genotypes were generated for n = 473 Qataris for all QChip1 sites compared to whole-genome sequencing in data. Genotypes were generated for all sites, including both reference and variant genotypes in whole-genome sequencing. The concordance between QChip1 and whole-genome sequencing indels and single nucleotide variants (SNV) genotypes were compared. Shown for all and for each population and for each variant class (indel, SNV, both) the average, standard deviation, sample size, and 95% confidence interval for the number of concordant variants, the number of discordant variants, the total number of variants compared, and the concordance rate.
bPopulations include: Qatari (all Qatari) and subpopulations: QGP_PAR (Peninsular Arabs); QGP_GAR (General Arabs); QGP_ADM (Admixed Arabs); QGP_WEP (Arabs of Wester Eurasia and Persia); QGP_SAS (South Asian Arabs); and QGP_AFR (African Arabs).
cUnable to calculate confidence interval when the standard deviation equals zero.
Step 4: Use QChip1 to assess average number of single nucleotide variants per genome of interest for SGD research and screening in Qataris and other populationsa.
| Genomes assessed ( | SNVs of interest for SGD research and screening identified by QChip1 | Median number of identified SNVs | |||
|---|---|---|---|---|---|
| Homozygous | Heterozygous | Wild-type | Missing | ||
| Qatari ( | 2 | 130 | 32,501 | 37 | 134 |
| QGP_PAR ( | 2 | 107 | 32,530 | 33 | 109 |
| QGP_GAR ( | 2 | 203 | 32,418 | 43 | 205 |
| QGP_WEP ( | 1 | 131 | 32,502 | 38 | 132 |
| QGP_SAS ( | 1 | 136 | 32,494 | 38 | 137 |
| QGP_AFR ( | 1 | 128 | 32,504 | 35 | 129 |
aIn order to compare the precision medicine value of QChip1 for pathogenic variant screening and research across Qatari subpopulations, n = 2708 Qatari genomes were assessed by QChip1 for the number of variants of interest for SGD research and screening in the Qatari genetic subpopulations. After exclusion of common variants (minor allele frequency >0.05), variants in genes not containing ClinVar pathogenic variants, variants with a batch effect, and variants not observed in Qatar, n = 32,674variants of interest were analyzed. Population genetic analysis was conducted as described in Fig. 3. The Qatari individuals genotyped on QChip1 were stratified based on dominant ancestry cluster, without exclusion of admixed individuals. Shown is (left-to-right) each population with sample size, the median number of QChip1 variants per individual (homozygous, heterozygous, wild type, and missing) and median number of genes with one or more variants per individual.
bPopulations include: Qatari (all Qatari) and subpopulations: QGP_PAR (Peninsular Arabs); QGP_GAR (General Arabs); QGP_WEP (Arabs of Wester Eurasia and Persia); QGP_SAS (South Asian Arabs); and QGP_AFR (African Arabs).
cNot included QGP_ADM, Admixed Arabs, see Table 3.
Fig. 3Population structure and principal component analysis of ancestry assessed by QChip1.
Sites and samples that failed QC based on variant batch effects or PC outliers were excluded. After QC, ADMIXTURE analysis was conducted on the remaining n = 37,674 variants and n = 2985 samples of Qataris (n = 2708) and non-Qataris (n = 277) for a range of K from 3 to 12. The lowest cross-validation error was observed for k = 5 for the full dataset. After analysis, the Qatari and non-Qatari samples were plotted separately, the panels here show the Qatari samples from the joint analysis. A Admixture (k = 5) proportions. Shown is a plot of the admixture proportions (% k from 0 to 100%, y axis), with each column representing one genome, sorted from left-to-right by dominant (highest %) k, and decreasing % k1 to k5. Genomes are color-coded by the dominant (largest %) ancestry (QGP_PAR, Peninsular Arabs, red; QGP_GAR, General Arabs, orange; QGP_WEP, Arabs of West Eurasia and Persia, bright green; QGP_SAS, South Asian Arabs, olive green; and QGP_AFR, African Arabs, light blue). Samples from prior studies of Qatar population structure (Qatar Genome public samples from Fakhro et al.[11] and Rodriguez-Flores et al.[12] genotyped on QChip1 were included in the clustering analysis and were used to assign the clusters. B Principal components analysis of Qataris. Shown is a PC1 × PC2 plot of Qatari genomes in squares color-coded by cluster of largest proportion of inferred ancestry. Not shown, QGP_ADM, Admixed Arabs.
Step 3: Known pathogenic variants of interest for Mendelian (single gene) disorder screening in Qatar using QChip1a.
| Phenotype | DbSNP | Pathogenic DBb | Chr:Pos:Ref:Alt (GRCh38) | HGVS annotation assertion from ClinVar | Genotype Counts Hom:Het:WT | Qatar | Kuwait | GME | Iran | NYC | PR |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Amyotrophic lateral sclerosis type 10 | rs80356718 | CV | 1:11022209:A:G | NM_007375.3(TARDBP):c.800A > G (p.Asn267Ser) | 0:6:2701 | 0.0011 | 0.0011 | 0.0012 | 0.0013 | 0.0000 | 0.0000 |
| Homocystinuria due to methylene tetrahydrofolate reductase deficiency | rs776483190 | CV | 1:11802980:C:T | NM_005957.4(MTHFR):c.137G > A (p.Arg46Gln) | 0:1:2705 | 0.0002 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Ehlers-Danlos syndrome, hydroxylysine-deficient | rs121913550 | CV CAGS | 1:11958627:C:T | NM_000302.4(PLOD1):c.955C > T (p.Arg319Ter) | 0:4:2703 | 0.0007 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Glaucoma 1, open angle, A | rs74315339 | CV | 1:171652468:C:A | NM_000261.2(MYOC):c.144G > T (p.Gln48His) | 0:6:2702 | 0.0011 | 0.0000 | 0.0005 | 0.0000 | 0.0000 | 0.0000 |
| Central centrifugal cicatricial alopecia | rs142129409 | CV | 1:17262194:T:A | NM_016233.2(PADI3):c.335T > A (p.Leu112His) | 0:2:2706 | 0.0004 | 0.0000 | 0.0045 | 0.0050 | 0.0000 | 0.0000 |
| Central centrifugal cicatricial alopecia | rs139876092 | CV | 1:17267938:C:T | NM_016233.2(PADI3):c.628C > T (p.Arg210Trp) | 0:2:2702 | 0.0004 | 0.0053 | 0.0010 | 0.0000 | 0.0000 | 0.0000 |
| Central centrifugal cicatricial alopecia | rs139426141 | CV | 1:17270903:A:G | NM_016233.2(PADI3):c.856A > G (p.Thr286Ala) | 0:18:2683 | 0.0033 | 0.0021 | 0.0025 | 0.0000 | 0.0133 | 0.0196 |
| Central centrifugal cicatricial alopecia | rs144080386 | CV | 1:17270928:C:T | NM_016233.2(PADI3):c.881C > T (p.Ala294Val) | 0:7:2697 | 0.0013 | 0.0021 | 0.0030 | 0.0050 | 0.0022 | 0.0000 |
| Central centrifugal cicatricial alopecia | rs140482516 | CV | 1:17280704:C:T | NM_016233.2(PADI3):c.1669C> T (p.Arg557Trp) | 0:3:2703 | 0.0006 | 0.0032 | 0.0015 | 0.0000 | 0.0066 | 0.0098 |
| Central centrifugal cicatricial alopecia | rs34097903 | CV | 1:17280779:G:A | NM_016233.2(PADI3):c.1744G> A (p.Ala582Thr) | 0:15:2692 | 0.0028 | 0.0011 | 0.0045 | 0.0006 | 0.0089 | 0.0000 |
| Usher syndrome, type 2 A | rs777465132 | CV | 1:215758743:G:T | NM_206933.3(USH2A):c.11241C > A (p.Tyr3747Ter) | 0:2:2704 | 0.0004 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Usher syndrome, type 2 A | rs746551311 | CV | 1:216196582:G:A | NM_206933.3(USH2A):c.4222C > T (p.Gln1408Ter) | 0:2:2704 | 0.0004 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Porphyria cutanea tarda | rs121918066 | CV | 1:45015389:G:A | NM_000374.5(UROD):c.995G > A (p.Arg332His) | 0:1:2702 | 0.0002 | 0.0000 | 0.0000 | 0.0013 | 0.0000 | 0.0000 |
| Methylmalonic acidemia with homocystinuria | rs796051995 | CV | 1:45507491:C:T | NM_015506.3(MMACHC):c.217C > T (p.Arg73Ter) | 0:1:2704 | 0.0002 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Leber congenital amaurosis 2 | rs61752871 | CV | 1:68444858:G:A | NM_000329.3(RPE65):c.271C > T (p.Arg91Trp) | 0:2:2706 | 0.0004 | 0.0000 | 0.0000 | 0.0006 | 0.0000 | 0.0000 |
| Advanced sleep phase syndrome, familial, 3 | rs139315125 | CV | 1:7809900:A:G | NM_016831.3(PER3):c.1247A > G (p.His416Arg) | 0:7:2701 | 0.0013 | 0.0021 | 0.0005 | 0.0038 | 0.0000 | 0.0000 |
| Parkinson disease 7 | rs74315352 | CV | 1:7984930:A:C | NM_007262.5(PARK7):c.446A > C (p.Asp149Ala) | 0:1:2704 | 0.0002 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Stargardt disease* | rs1800553 | CV CAGS | 1:94008251:C:T | NM_000350.2(ABCA4):c.[5512C > G;5882 G > A] | 3:106:2594 | 0.0207 | 0.0170 | 0.0211 | 0.0256 | 0.0089 | 0.0098 |
| Stargardt disease | rs61750155 | CV CAGS | 1:94021695:G:T | NM_000350.3(ABCA4):c.4793C > A (p.Ala1598Asp) | 0:3:2698 | 0.0006 | 0.0000 | 0.0010 | 0.0006 | 0.0000 | 0.0000 |
| Leber congenital amaurosis 9 | rs150726175 | CV | 1:9982630:G:A | NM_022787.4(NMNAT1):c.769G > A (p.Glu257Lys) | 0:1:2705 | 0.0002 | 0.0000 | 0.0015 | 0.0013 | 0.0000 | 0.0000 |
| Glycogen storage disease type III | rs775685508 | CV | 1:99916603:G:T | NM_000642.3(AGL):c.4353G > T (p.Trp1451Cys) | 0:4:2703 | 0.0007 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Nemaline myopathy 2 | rs886041851 | CV HMC | 2:151610867:C:T | NM_001271208.2(NEB):c.11806-1G > A | 0:16:2691 | 0.0030 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Hyperphosphatemic familial tumoral calcinosis 1 | rs137853086 | CV | 2:165770217:G:A | NM_004482.4(GALNT3):c.484C > T (p.Arg162Ter) | 0:1:2707 | 0.0002 | 0.0000 | 0.0005 | 0.0000 | 0.0000 | 0.0000 |
| Biotin-thiamine-responsive basal ganglia disease | rs121917884 | CV HMC | 2:227688216:T:C | NM_025243.4(SLC19A3):c.1264A > G (p.Thr422Ala) | 0:6:2698 | 0.0011 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Deafness, autosomal recessive 9 | rs397515591 | CV HMC | 2:26477725:C:A | NM_194248.3(OTOF):c.2239G > T (p.Glu747Ter) | 0:5:2703 | 0.0009 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| 3-Oxo-5 alpha-steroid delta 4-dehydrogenase deficiency | rs9332967 | CV | 2:31526224:C:T | NM_000348.4(SRD5A2):c.737G > A (p.Arg246Gln) | 0:1:2705 | 0.0002 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| 3-Oxo-5 alpha-steroid delta 4-dehydrogenase deficiency | rs763296857 | CV | 2:31529427:T:C | NM_000348.4(SRD5A2):c.578A > G (p.Asn193Ser) | 0:1:2704 | 0.0002 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Glaucoma 3, primary congenital, A | rs28936700 | CV CAGS | 2:38075207:C:T | NM_000104.3(CYP1B1):c.182G > A (p.Gly61Glu) | 0:1:2705 | 0.0002 | 0.0000 | 0.0031 | 0.0044 | 0.0000 | 0.0098 |
| Sitosterolemia | rs137852988 | CV | 2:43875377:G:A | NM_022437.3(ABCG8):c.1720G> A (p.Gly574Arg) | 0:5:2699 | 0.0009 | 0.0042 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Achromatopsia | rs141386891 | CV | 2:98396449:C:T | NM_001298.3(CNGA3):c.1279C > T (p.Arg427Cys) | 0:3:2704 | 0.0006 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Fanconi anemia, complementation group D2 | rs112832879 | CV | 3:10043483:G:A | NM_033084.5(FANCD2):c.990-1G > A | 0:1:2706 | 0.0002 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Macular dystrophy, vitelliform, 5 | rs199867882 | CV CAGS | 3:101231117:G:A | NM_016247.4(IMPG2):c.3262C > T (p.Arg1088Ter) | 0:2:2705 | 0.0004 | 0.0011 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Retinitis pigmentosa 61 | rs775098953 | CV | 3:150928174:A:C | NM_174878.3(CLRN1):c.461T > G (p.Leu154Trp) | 0:1:2705 | 0.0002 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Biotinidase deficiency | rs397514369 | CV | 3:15644413:G:A | NM_001370658.1(BTD):c.497G > A (p.Cys166Tyr) | 0:5:2702 | 0.0009 | 0.0011 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Biotinidase deficiency | rs13078881 | CV | 3:15645186:G:C | NM_000060.2(BTD):c.[470G > A;1330 G > C] | 4:135:2557 | 0.0265 | 0.0000 | 0.0272 | 0.0294 | 0.0360 | 0.0392 |
| Biotinidase deficiency | rs138818907 | CV CAGS | 3:15645345:C:T | NM_001370658.1(BTD):c.1429C > T (p.Pro477Ser) | 0:5:2701 | 0.0009 | 0.0000 | 0.0010 | 0.0000 | 0.0000 | 0.0000 |
| Brugada syndrome | rs199473101 | CV | 3:38606682:C:T | NM_198056.2(SCN5A):c.1127G > A (p.Arg376His) | 0:1:2707 | 0.0002 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Chanarin-Dorfman Syndrome | rs104893676 | CV | 3:43691011:G:A | NM_016006.6(ABHD5):c.19G > A (p.Glu7Lys) | 0:2:2704 | 0.0004 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Dystrophic epidermolysis bullosa | rs756217590 | CV HMC | 3:48583161:C:T | NM_000094.3(COL7A1):c.4448G > A (p.Gly1483Asp) | 0:1:2706 | 0.0002 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Spastic paraplegia 56, autosomal recessive | rs397514513 | CV HMC | 4:107945426:A:T | NM_183075.3(CYP2U1):c.947A > T (p.Asp316Val) | 0:1:2702 | 0.0002 | 0.0032 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Hypofibrinogenemia | rs121909607 | CV | 4:154589513:C:T | NM_021871.4(FGA):c.104G > A (p.Arg35His) | 0:3:2703 | 0.0006 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Bietti crystalline corneoretinal dystrophy | rs199476187 | CV | 4:186194568:G:A | NM_207352.4(CYP4V2):c.283G > A (p.Gly95Arg) | 0:2:2704 | 0.0004 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Hereditary factor XI deficiency disease | rs121965063 | CV CAGS | 4:186274193:G:T | NM_000128.3(F11):c.403G > T (p.Glu135Ter) | 0:7:2700 | 0.0013 | 0.0011 | 0.0010 | 0.0006 | 0.0000 | 0.0000 |
| Hereditary factor XI deficiency disease | rs542967227 | CV | 4:186285765:G:A | NM_000128.3(F11):c.1432G > A (p.Gly478Arg) | 0:1:2706 | 0.0002 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Hypogonadotropic hypogonadism 7 with or without anosmia | rs104893836 | CV | 4:67754019:T:C | NM_000406.3(GNRHR):c.317A > G (p.Gln106Arg) | 0:4:2703 | 0.0007 | 0.0011 | 0.0055 | 0.0019 | 0.0089 | 0.0000 |
| Hyaline fibromatosis syndrome | rs886041401 | CV CAGS | 4:80072427:A:G | NM_058172.6(ANTXR2):c.134T > C (p.Leu45Pro) | 0:4:2701 | 0.0007 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Hartnup disorder | rs121434347 | CV | 5:1213517:C:T | NM_001003841.3(SLC6A19):c.718 C > T (p.Arg240Ter) | 0:4:2703 | 0.0007 | 0.0011 | 0.0005 | 0.0000 | 0.0000 | 0.0000 |
| Renal carnitine transport defect | rs72552724 | CV HMC | 5:132370055:G:T | NM_003060.4(SLC22A5):c.83G > T (p.Ser28Ile) | 0:5:2701 | 0.0009 | 0.0032 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Primary systemic carnitine deficiency | rs886041277 | CV HMC | 5:132378379:G:A | NM_003060.4(SLC22A5):c.395G > A (p.Trp132Ter) | 0:1:2706 | 0.0002 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Nijmegen breakage syndrome-like disorder | rs772468452 | CV | 5:132595719:C:T | NM_005732.4(RAD50):c.2116C > T (p.Arg706Ter) | 0:1:2707 | 0.0002 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Primary ciliary dyskinesia | rs761622153 | CV HMC | 5:13841112:G:A | NM_001277115.2(DNAH11):c.5924 + 1 G > C | 0:71:2634 | 0.0131 | 0.0110 | 0.0000 | 0.0019 | 0.0000 | 0.0000 |
| Seizures, cortical blindness, and microcephaly syndrome | rs863225243 | CV HMC | 5:141528456:G:A | NM_005219.5(DIAPH1):c.3145C > T (p.Arg1049Ter) | 0:2:2704 | 0.0004 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Ehlers-Danlos syndrome, spondylodysplastic type, 1 | rs28937869 | CV | 5:177608994:C:T | NM_007255.3(B4GALT7):c.808C > T (p.Arg270Cys) | 0:1:2703 | 0.0002 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Mucopolysaccharidosis type 6 | rs771296632 | CV | 5:78839361:G:A | NM_000046.5(ARSB):c.1208C > G (p.Ser403Ter) | 0:1:2705 | 0.0002 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Hypophosphatemic rickets, autosomal recessive, 2 | rs373044722 | CV | 6:131872926:C:T | NM_006208.3(ENPP1):c.1441C > T (p.Arg481Trp) | 0:1:2705 | 0.0002 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| non-classical congenital adrenal hyperplasia | rs776989258 | CV | 6:32041093:C:T | NM_000500.9(CYP21A2):c.1447C > T (p.Pro483Ser) | 0:4:2701 | 0.0007 | 0.0000 | 0.0000 | 0.0057 | 0.0000 | 0.0000 |
| Leukodystrophy, hypomyelinating, 11 | rs141156009 | CV | 6:43520961:C:T | NM_203290.4(POLR1C):c.835C > T (p.Arg279Trp) | 0:4:2704 | 0.0007 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Autosomal recessive polycystic kidney disease | rs794727566 | CV | 6:52024750:A:G | NM_138694.4(PKHD1):c.5060T > C (p.Ile1687Thr) | 0:2:2702 | 0.0004 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Autosomal recessive polycystic kidney disease | rs773136605 | CV | 6:52043102:C:T | NM_138694.4(PKHD1):c.2854G > A (p.Gly952Arg) | 0:3:2704 | 0.0006 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Autosomal recessive polycystic kidney disease | rs398124478 | CV | 6:52048558:G:A | NM_138694.4(PKHD1):c. | 0:1:2705 | 0.0002 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Retinitis pigmentosa | rs930421180 | CV | 6:64591822:G:A | NM_001142800.2(EYS):c.4045C > T (p.Arg1349Ter) | 0:1:2703 | 0.0002 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Pendred syndrome | rs111033348 | CV | 7:107674326:C:T | NM_000441.1(SLC26A4):c.578C > T | 0:3:2704 | 0.0006 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Pendred syndrome | rs111033256 | CV CAGS | 7:107675060:T:A | NM_000441.2(SLC26A4):c.716T > A (p.Val239Asp) | 0:7:2699 | 0.0013 | 0.0000 | 0.0010 | 0.0000 | 0.0000 | 0.0000 |
| Congenital secretory diarrhea, chloride type | rs121913032 | CV | 7:107791059:C:A | NM_000111.2(SLC26A3):c.559G > T (p.Gly187Ter) | 0:6:2701 | 0.0011 | 0.0053 | 0.0000 | 0.0006 | 0.0022 | 0.0000 |
| Maple syrup urine disease, type 3 | rs121964990 | CV HMC CAGS | 7:107915506:G:T | NM_000108.5(DLD):c.685G > T (p.Gly229Cys) | 0:14:2689 | 0.0026 | 0.0000 | 0.0010 | 0.0000 | 0.0022 | 0.0000 |
| Cystic fibrosis | rs121909005 | CV CAGS | 7:117587801:T:G | NM_000492.4(CFTR):c.1647T> G (p.Ser549Arg) | 0:1:2706 | 0.0002 | 0.0000 | 0.0005 | 0.0006 | 0.0000 | 0.0000 |
| Cystic fibrosis | rs75096551 | CV CAGS | 7:117606754:G:A | NM_000492.3(CFTR):c.2988 + 1 G > A | 0:2:2701 | 0.0004 | 0.0000 | 0.0000 | 0.0006 | 0.0000 | 0.0000 |
| Cystic fibrosis | rs121909043 | CV | 7:117667029:C:G | NM_000492.3(CFTR):c.4364C > G (p.Ser1455Ter) | 0:1:2705 | 0.0002 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Primary ciliary dyskinesia | rs886039340 | CV HMC | 7:21687528:G:C | NM_001277115.2(DNAH11):c.5924 + 1 G > C | 0:15:2693 | 0.0028 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Deficiency of aromatic-L-amino-acid decarboxylase | rs201951824 | CV | 7:50476625:C:T | NM_001082971.2(DDC):c.1040G > A (p.Arg347Gln) | 0:2:2702 | 0.0004 | 0.0000 | 0.0010 | 0.0000 | 0.0000 | 0.0000 |
| Argininosuccinate lyase deficiency | rs367543005 | CV CAGS | 7:66089693:C:T | NM_000048.4(ASL):c.1060C > T (p.Gln354Ter) | 0:1:2707 | 0.0002 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Cohen syndrome | rs140353201 | CV | 8:99467566:C:T | NM_017890.4(VPS13B):c.3598C > T (p.Arg1200Ter) | 0:1:2706 | 0.0002 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Rare genetic deafness | rs779760634 | CV | 9:114423524:C:T | NM_015404.4(WHRN):c.1417-1G > A | 0:1:2701 | 0.0002 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Walker-Warburg congenital muscular dystrophy | rs776061161 | CV | 9:131510063:G:A | NM_001077365.2(POMT1):c.699 + 67 G > A | 0:2:2704 | 0.0004 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Deficiency of UDPglucose-hexose-1-phosphate uridylyltransferase | rs111033735 | CV | 9:34648371:G:A | NM_000155.4(GALT):c.602G > A (p.Arg201His) | 0:1:2706 | 0.0002 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Primary hyperoxaluria, type II | rs180177314 | CV | 9:37429732:G:A | NM_012203.2(GRHPR):c.494G > A (p.Gly165Asp) | 0:3:2703 | 0.0006 | 0.0000 | 0.0000 | 0.0006 | 0.0000 | 0.0000 |
| Pontocerebellar hypoplasia, type 1b | rs387907196 | CV | 9:37784953:C:G | NM_016042.4(EXOSC3):c.92G > C (p.Gly31Ala) | 0:1:2703 | 0.0002 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Testosterone 17-beta-dehydrogenase deficiency | rs119481077 | CV CAGS | 9:96254907:G:A | NM_000197.2(HSD17B3):c.238C > T (p.Arg80Trp) | 0:2:2703 | 0.0004 | 0.0000 | 0.0015 | 0.0000 | 0.0000 | 0.0000 |
| Histiocytosis-lymphadenopathy plus syndrome | rs397515429 | CV | 10:71362337:G:A | NM_018344.6(SLC29A3):c.1157G > A (p.Arg386Gln) | 0:1:2702 | 0.0002 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Glucose-6-phosphate transport defect | rs121908979 | CV | 11:119024957:G:A | NM_001164278.2(SLC37A4):c.1309 C > T (p.Arg437Ter) | 0:1:2707 | 0.0002 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Beta-thalassemia | rs34716011 | CV HMC | 11:5226974:C:T | NM_000518.5(HBB):c.48G > A (p.Trp16Ter) | 0:1:2705 | 0.0002 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Joubert syndrome 16 | rs387907133 | CV CAGS | 11:61368600:C:T | NM_016464.5(TMEM138):c.380C > T (p.Ala127Val) | 0:2:2699 | 0.0004 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Niemann-Pick disease, type B | rs120074126 | CV HMC | 11:6393620:C:T | NM_000543.5(SMPD1):c.1267C > T (p.His423Tyr) | 0:1:2705 | 0.0002 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Mucolipidosis | rs34940801 | CV | 12:101757571:C:T | NM_024312.5(GNPTAB):c.3335+ 1 G > A | 0:4:2702 | 0.0007 | 0.0000 | 0.0000 | 0.0000 | 0.0022 | 0.0000 |
| Phenylketonuria | rs5030857 | CV CAGS | 12:102840507:G:A | NM_000277.3(PAH):c.1208C > T (p.Ala403Val) | 0:2:2702 | 0.0004 | 0.0000 | 0.0010 | 0.0000 | 0.0022 | 0.0000 |
| Phenylketonuria | rs5030853 | CV | 12:102851701:C:A | NM_000277.3(PAH):c.898G > T (p.Ala300Ser) | 0:7:2698 | 0.0013 | 0.0000 | 0.0015 | 0.0025 | 0.0000 | 0.0000 |
| Vitamin B12-responsive methylmalonic acidemia type cblB | rs763935916 | CV CAGS | 12:109568864:C:A | NM_052845.4(MMAB):c.197-1G > T | 0:6:2700 | 0.0011 | 0.0000 | 0.0000 | 0.0006 | 0.0000 | 0.0000 |
| Glycogen storage disease due to hepatic glycogen synthase deficiency | rs121918419 | CV | 12:21568952:G:A | NM_021957.4(GYS2):c.736C > T (p.Arg246Ter) | 0:1:2705 | 0.0002 | 0.0011 | 0.0015 | 0.0006 | 0.0000 | 0.0000 |
| Glycogen storage disease due to hepatic glycogen synthase deficiency | rs201157731 | CV | 12:21574275:G:A | NM_021957.4(GYS2):c.547C > T (p.Gln183Ter) | 0:4:2703 | 0.0007 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Myopathy, lactic acidosis, and sideroblastic anemia 2 | rs587777214 | CV | 12:32750744:G:A | NM_001040436.3(YARS2):c.1078 C > T (p.Arg360Ter) | 0:1:2707 | 0.0002 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Parkinson disease 8, autosomal dominant | rs34637584 | CV | 12:40340400:G:A | NM_198578.4(LRRK2):c.6055G > A (p.Gly2019Ser) | 0:4:2704 | 0.0007 | 0.0021 | 0.0050 | 0.0000 | 0.0000 | 0.0000 |
| Bailey-Bloch congenital myopathy | rs140291094 | CV | 12:57244322:C:G | NM_145064.3(STAC3):c.851G > C (p.Trp284Ser) | 0:1:2703 | 0.0002 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| von Willebrand disease, type 2a | rs41276738 | CV | 12:6034812:C:T | NM_000552.4(VWF):c.2561G > A (p.Arg854Gln) | 0:3:2704 | 0.0006 | 0.0000 | 0.0015 | 0.0006 | 0.0022 | 0.0098 |
| Temtamy syndrome | rs587776954 | CV HMC | 12:6944122:A:G | NM_138425.4(C12orf57):c.1A > G (p.Met1Val) | 0:6:2700 | 0.0011 | 0.0074 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Peroxisome biogenesis disorder 2A (Zellweger) | rs61752138 | CV | 12:7209700:T:G | NM_001131025.1(PEX5):c.1578T > G (p.Asn526Lys) | 0:2:2704 | 0.0004 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Factor VII deficiency | rs121964926 | CV | 13:113118698:G:A | NM_019616.4(F7):c.1025G > A (p.Arg342Gln) | 0:1:2707 | 0.0002 | 0.0000 | 0.0005 | 0.0000 | 0.0000 | 0.0000 |
| Deafness, autosomal recessive 1A | rs774518779 | CV HMC CAGS | 13:20189076:C:T | NM_004004.6(GJB2):c.506G > A (p.Cys169Tyr) | 0:4:2702 | 0.0007 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Deafness, autosomal recessive 1A | rs104894396 | CV | 13:20189511:C:T | NM_004004.6(GJB2):c.71G > A (p.Trp24Ter) | 0:5:2702 | 0.0009 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Deafness, autosomal recessive 1A | rs80338940 | CV CAGS | 13:20192782:C:T | NM_004004.6(GJB2):c.-23 + 1 G > A | 0:2:2705 | 0.0004 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Deafness, autosomal dominant 3b | rs104894414 | CV | 13:20223467:G:A | NM_001110219.3(GJB6):c.14C > T (p.Thr5Met) | 0:6:2699 | 0.0011 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Aicardi Goutieres syndrome 2 | rs75184679 | CV HMC CAGS | 13:50945445:G:A | NM_024570.4(RNASEH2B):c.529G > A (p.Ala177Thr) | 0:3:2704 | 0.0006 | 0.0011 | 0.0015 | 0.0006 | 0.0000 | 0.0000 |
| Leber congenital amaurosis 6 | rs554396590 | CV | 14:21303542:C:T | NM_020366.3(RPGRIP1):c.799C > T (p.Arg267Ter) | 0:1:2703 | 0.0002 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Galactosylceramide beta-galactosidase deficiency | rs199847983 | CV HMC | 14:87968386:C:T | NM_000153.4(GALC):c.857G > A (p.Gly286Asp) | 0:14:2692 | 0.0026 | 0.0011 | 0.0000 | 0.0013 | 0.0000 | 0.0000 |
| Alpha-1-antitrypsin deficiency | rs28931569 | CV | 14:94383044:A:G | NM_001127701.1(SERPINA1):c.194 T > C (p.Leu65Pro) | 0:2:2702 | 0.0004 | 0.0000 | 0.0010 | 0.0000 | 0.0000 | 0.0000 |
| Mosaic variegated aneuploidy syndrome 1 | rs28989186 | CV | 15:40176672:C:T | NM_001211.5(BUB1B):c.580C > T (p.Arg194Ter) | 0:3:2704 | 0.0006 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Limb-girdle muscular dystrophy, type 2A | rs147764579 | CV | 15:42401752:G:A | NM_000070.3(CAPN3):c.1466G > A (p.Arg489Gln) | 0:1:2706 | 0.0002 | 0.0021 | 0.0015 | 0.0000 | 0.0044 | 0.0000 |
| Peeling skin syndrome 2 | rs112292549 | CV | 15:43260151:C:A | NM_201631.4(TGM5):c.337G > T (p.Gly113Cys) | 0:3:2704 | 0.0006 | 0.0011 | 0.0000 | 0.0000 | 0.0044 | 0.0098 |
| Tay-Sachs disease | rs786204721 | CV CAGS | 15:72375971:A:G | NM_000520.6(HEXA):c.2T > C (p.Met1Thr) | 0:2:2702 | 0.0004 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Amyotrophic lateral sclerosis type 6 | rs387906628 | CV | 16:31185031:G:A | NM_004960.3(FUS):c.616G > A (p.Gly206Ser) | 0:2:2701 | 0.0004 | 0.0000 | 0.0000 | 0.0006 | 0.0000 | 0.0000 |
| Meier-Gorlin syndrome 3 | rs146795505 | CV | 16:46689707:T:C | NM_014321.4(ORC6):c.2T > C (p.Met1Thr) | 0:1:2705 | 0.0002 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0098 |
| Polymicrogyria, bilateral frontoparietal | rs121908462 | CV | 16:57651247:C:T | NM_201525.4(ADGRG1):c.112C > T (p.Arg38Trp) | 0:2:2706 | 0.0004 | 0.0000 | 0.0000 | 0.0000 | 0.0022 | 0.0000 |
| Polymicrogyria, bilateral frontoparietal | rs121908465 | CV | 16:57651407:G:C | NM_201525.4(ADGRG1):c.272G > C (p.Cys91Ser) | 0:1:2707 | 0.0002 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Spermatogenic failure 31 | rs140352254 | CV | 16:72122957:G:A | NM_031293.3(PMFBP1):c.2725C > T (p.Arg909Ter) | 0:2:2704 | 0.0004 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Congenital disorder of glycosylation | rs28936415 | CV CAGS | 16:8811153:G:A | NM_000303.3(PMM2):c.422G > A (p.Arg141His) | 0:8:2688 | 0.0015 | 0.0000 | 0.0046 | 0.0019 | 0.0000 | 0.0000 |
| Fanconi anemia, complementation group A | rs769479800 | CV | 16:89816614:A:G | NM_000135.4(FANCA):c.2T > C (p.Met1Thr) | 0:1:2702 | 0.0002 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Neurofibromatosis, type 1 | rs137854562 | CV HMC | 17:31235623:C:T | NM_000267.3(NF1):c.3721C > T (p.Arg1241Ter) | 0:1:2707 | 0.0002 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Canavan Disease, Familial Form | rs766328537 | CV | 17:3476238:G:A | NM_000049.3(ASPA):c.79G > A (p.Gly27Arg) | 0:3:2704 | 0.0006 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Mucopolysaccharidosis, MPS-III-B | rs104894595 | CV | 17:42543568:C:T | NM_000263.4(NAGLU):c.1562C > T (p.Pro521Leu) | 0:1:2700 | 0.0002 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Spherocytosis type 4, due to band 3, Cape Town | rs28929480 | CV | 17:44260716:C:T | NM_000342.3(SLC4A1):c.268G > A (p.Glu90Lys) | 0:1:2702 | 0.0002 | 0.0011 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Pyridoxal phosphate-responsive seizures | rs773450573 | CV HMC | 17:47946682:G:A | NM_018129.4(PNPO):c.686G > A (p.Arg229Gln) | 0:1:2705 | 0.0002 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Sarcoglycanopathy | rs371675217 | CV HMC | 17:50167431:G:A | NM_000023.4(SGCA):c.101G > A (p.Arg34His) | 0:2:2704 | 0.0004 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Sarcoglycanopathy | rs143570936 | CV | 17:50169246:G:A | NM_000023.4(SGCA):c.739G > A (p.Val247Met) | 0:4:2704 | 0.0007 | 0.0000 | 0.0000 | 0.0006 | 0.0000 | 0.0000 |
| Meckel-Gruber syndrome | rs786205508 | CV HMC CAGS | 17:58208542:G:A | NM_001165927.1(MKS1):c.1036 C > T (p.Gln346Ter) | 0:1:2704 | 0.0002 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Glycogen storage disease, type II | rs778418246 | CV | 17:80113002:G:A | NM_000152.5(GAA):c.2015G> A (p.Arg672Gln) | 0:1:2705 | 0.0002 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Niemann-Pick disease type C1 | rs759826138 | CV | 18:23539394:G:A | NM_000271.5(NPC1):c.2872C > T (p.Arg958Ter) | 0:4:2703 | 0.0007 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Junctional epidermolysis bullosa | rs886039412 | CV HMC | 18:23928766:G:A | NM_000227.5(LAMA3):c.3609 + 1 G > A | 0:1:2705 | 0.0002 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Amyloid Cardiomyopathy, Transthyretin-related | rs76992529 | CV | 18:31598655:G:A | NM_000371.4(TTR):c.424G > A (p.Val142Ile) | 0:8:2695 | 0.0015 | 0.0000 | 0.0005 | 0.0000 | 0.0022 | 0.0000 |
| Vici syndrome | rs767638289 | CV | 18:45954507:G:A | NM_020964.3(EPG5):c.895C > T (p.Arg299Ter) | 0:2:2703 | 0.0004 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Obesity, autosomal dominant | rs121913560 | CV | 18:60371842:T:C | NM_005912.3(MC4R):c.508 A > G (p.Ile170Val) | 0:14:2693 | 0.0026 | 0.0011 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Familial hypercholesterolemia 1 | rs148698650 | CV | 19:11107403:G:A | NM_000527.4(LDLR):c.829G > T (p.Glu277Ter) | 0:2:2704 | 0.0004 | 0.0011 | 0.0010 | 0.0031 | 0.0000 | 0.0000 |
| Mental retardation, autosomal recessive 3 | rs876657679 | CV | 19:13913639:G:T | NM_017721.5(CC2D1A):c.748 + 1 G > T | 0:3:2702 | 0.0006 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Autosomal recessive congenital ichthyosis 5 | rs118203937 | CV HMC CAGS | 19:15540506:G:A | NM_173483.4(CYP4F22):c.728G > A (p.Arg243His) | 2:1:2697 | 0.0009 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Autosomal recessive congenital ichthyosis 5 | rs118203935 | CV CAGS | 19:15549170:C:T | NM_173483.4(CYP4F22):c.1303C > T (p.His435Tyr) | 0:1:2706 | 0.0002 | 0.0000 | 0.0005 | 0.0000 | 0.0000 | 0.0000 |
| Nephrotic syndrome, type 9 | rs398122978 | CV CAGS | 19:40705140:G:A | NM_024876.4(COQ8B):c.532C > T (p.Arg178Trp) | 0:1:2697 | 0.0002 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Graves disease | rs775644973 | CV | 20:1002085:C:T | NM_001029871.4(RSPO4):c.79 + 1 G > A | 0:3:2703 | 0.0006 | 0.0000 | 0.0000 | 0.0006 | 0.0000 | 0.0000 |
| Homocystinuria | rs121964972 | CV | 21:43060528:G:A | NM_000071.2(CBS):c.1058C > T (p.Thr353Met) | 0:1:2704 | 0.0002 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Homocystinuria* | rs398123151 | CV HMC CAGS | 21:43062344:G:A | NM_000071.2(CBS):c.1006C > T (p.Arg336Cys) | 1:31:2674 | 0.0061 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| microcephalic osteodysplastic primordial dwarfism type 2 | rs777830265 | CV HMC | 21:46355533:C:T | NM_006031.6(PCNT):c.1843C > T (p.Gln615Ter) | 0:1:2706 | 0.0002 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Deficiency of beta-ureidopropionase | rs747539101 | CV | 22:24520469:G:A | NM_016327.3(UPB1):c.873 + 1 G > A | 0:8:2695 | 0.0015 | 0.0000 | 0.0000 | 0.0013 | 0.0000 | 0.0000 |
| Acute infantile liver failure due to synthesis defect of mtDNA-encoded proteins | rs387907022 | CV | 22:46353829:G:A | NM_018006.5(TRMU):c.835G > A (p.Val279Met) | 0:1:2705 | 0.0002 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
aAs examples of the use of QChip1, the 2,708 Qatari genomes were assessed for 140 pathogenic variants known to be present in the Qatari genome from the Hamad Medical Corporation genetic screening database. All of these variants were predicted by ClinVar to be pathogenic and by and SnpEff to have moderate or high impact on protein function. From left-to-right is the disease name, DbSNP rsID, list of databases where variant is found, genomic coordinates (GRCh38 reference), and the alternate allele frequency in Qatar, Kuwait, NYC, and Puerto Rico. From this analysis the Qatar, USA and Puerto Rico genetic subgroups are combined for each location. Table 5 is a subset of the n = 32,674 variants genotyped based on computationally predicted value for SGD research or screening; the complete dataset is in Supplementary Table 3. Variants at unusually elevated allele frequency in Qatar have an “*” next to the disease name.
bEach variant rsID was queried in three databases, including the ClinVar website (CV) (accessed May 2021), the Center for Arab Genetics Disorders (CAGS), and the Hamad Medical Corporation database of SGD disease case reports. For each record, the disease name and HGVS annotation were taken from ClinVar, and variants that were not described as “pathogenic” in ClinVar or did not have a phenotype described were excluded.
QChip1 pathogenic variants in genomics knowledgebasesa.
| Knowledgebase | Sample size for allele frequency | QChip1 pathogenic variants | |
|---|---|---|---|
| % | |||
| QChip1 | 2708 | 140 | 100 |
| ClinVar | 140 | 100 | |
| Hamad Medical Corporation (HMC) | 25 | 18 | |
| Center for Arab Genetic Studies (CAGS) | 27 | 19 | |
| Dasman Diabetes Institute (DDI, Kuwait) | 540 | 28 | 20 |
| GME Variome | 886 | 37 | 26 |
| Iranome (Iran) | 800 | 32 | 23 |
| New York City (NYC) | 226 | 16 | 11 |
| Puerto Ricans (PR) | 51 | 8 | 6 |
| Anywhere | 87 | 62 | |
| Middle East (CAGS, Kuwait, GME, Iran) | 70 | 50 | |
aIn order to quantify the utility of QChip1 for single gene (Mendelian) disorder screening outside of Qatar, the presence and (when available) allele frequency of each variant in Table 5 was checked in seven datasets, including three produced by this research team (HMC, NYC, PR) and four externally obtained [CAGS (http://cags.org.ae/), Dasman Diabetes Institute, GME Variome (http://igm.ucsd.edu/gme/data-browser.php), Iranome (http://www.iranome.ir/)]. Only the DDI, GME, and Iranome datasets had allele frequency data. Shown is the name of the knowledgebase, the sample size when available, and the QChip1 pathogenic variants found in the knowledgebase, including number and percentage of 140 total on QChip1 (Table 5).
2For datasets where allele frequency is available, the variant is counted as “present” if the frequency was great than zero. For datasets where allele frequency is not available, the variant is counted as “present” if a query of the dataset found the variant. The bottom two rows show aggregate data, where the “anywhere” row indicates variants present in any of the seven datasets (HMC, CAGS, Kuwait, GME, Iran, NYC, PR), and the “Middle East” row indicates variants present in the Middle Eastern datasets (CAGS, DDI, GME, Iran).
Fig. 1Population distribution of QChip1 variants observed in Qatar.
In order to demonstrate the population-specific value of QChip1, the risk alleles that were discovered by genome/exome sequencing, prioritized in the knowledgebase, included in the array design, successfully genotyped, and observed in array data for at least one of n = 2,708 Qataris are provided for download in Supplementary Table 1 and online at the Qatar Genome Browser (http://qchip.biohpc.cornell.edu). Shown is a summary of the population enrichment of these variants. A Enrichment of potentially pathogenic variants on QChip1 in Qatari subpopulations. In order to determine if Mendelian disease risk alleles were enriched in single Qatari subpopulations, a cross-population allele frequency comparison was conducted for five ancestries observed in Qatar (k1, QGP_PAR, Peninsular Arabs; k2, QGP_GAR, General Arabs; k4, QGP_WEP, Arabs of Western Eurasia and Persia; k5, QGP_SAS, South Asian Arabs, and k3, QGP_AFR, African Arabs). Not shown, QGP_ADM, Admixed Arabs. For each subpopulation, the risk allele frequency was compared to the maximum of the other four subpopulations. Shown is the proportion that was highest in the subpopulation for (left-to-right) QGP_PAR, QGP_GAR, QGP_WEP, QGP_SAS, and QGP_AFR. B Enrichment of potentially pathogenic variants on QChip1 in the Qatari genome relative to non-Qatari. The non-Qatari genomes were residents of New York City (total n = 226) and Puerto Rico (n = 51). The ancestry proportions of these 226 non-Qatari genomes in 5 clusters (k1 to k5) were calculated as described in Fig. 2 (combined analysis of non-Qataris and Qataris using ADMIXTURE[68]), the lowest cross-validation error was for k = 5, with the non-Qataris falling in 3 clusters (African-Americans from NYC, n = 60, k3; European-Americans from NYC, n = 153, k4; South Asian-Americans from NYC, n = 13, k5; Puerto Ricans of European Ancestry, k4; and Puerto Ricans of Afro-Caribbean Ancestry, k3). More details of the population structure were made available in Fig. 2 (Qataris) and Supplementary Fig. 1 (non-Qataris). Shown is the percentage of n = 32,674 potentially pathogenic variants in Mendelian (single gene) disorder genes that were observed in at least one Qatari and have a risk (minor) allele frequency in Qatar higher than in non-Qatari populations. The proportion of variants was calculated that were at elevated minor allele frequency (enriched) in the Qatari genome relative to the genomes of the 5 non-Qatari population clusters tested: USA African-American (k3), USA European-American (k4), USA South-Asian American (k5), PR Afro-Caribbean (k3), PR European (k4). Shown from left-to-right is the proportion that are enriched in Qatar relative to the maximum of all 5 populations, followed the proportion enriched relative to each individual population.
Fig. 2Strategy to design and assess QChip1.
Step 1. Qatari Genome Knowledgebase. Identification of the single gene (Mendelian) pathogenic variants and genes in protein coding regions of the Qatari genome was generated using whole-genome sequencing, exome sequencing and clinical reports (see Table 1). After cataloging all variants and respective genes, the pathogenic variants and genes were identified using ClinVar and SnpEff. Step 2. Using this list, Qchip0 (the precursor of QChip1) was designed on the Axiom platform which was then tested with 25 Qatari DNA samples for which whole-genome sequencing was available. Step 3. Elimination of poor performance probes and variants led to the final design of QChip1, which was tested for concordance with genome sequencing using DNA samples from Qataris. Step 4. Use of QChip1 to assess the prevalence of pathogenic variants and genes among Qataris, New York City residents and Puerto Ricans.