| Literature DB >> 31482689 |
Yury A Barbitoff1,2, Rostislav K Skitchenko1,3, Olga I Poleshchuk1, Anton E Shikov1,4, Elena A Serebryakova5, Yulia A Nasykhova5,6, Dmitrii E Polev7, Anna R Shuvalova7, Irina V Shcherbakova6, Mikhail A Fedyakov4, Oleg S Glotov4,5, Andrey S Glotov4,5,6,8, Alexander V Predeus1.
Abstract
BACKGROUND: Allele frequency data from large exome and genome aggregation projects such as the Genome Aggregation Database (gnomAD) are of ultimate importance to the interpretation of medical resequencing data. However, allele frequencies might significantly differ in poorly studied populations that are underrepresented in large-scale projects, such as the Russian population.Entities:
Keywords: Mendelian disease; Russia; allele frequency; whole-exome sequencing
Mesh:
Substances:
Year: 2019 PMID: 31482689 PMCID: PMC6825859 DOI: 10.1002/mgg3.964
Source DB: PubMed Journal: Mol Genet Genomic Med ISSN: 2324-9269 Impact factor: 2.183
Figure 1The spectrum of genetic variation in exomes of Northwest Russia. (a) Distribution of the identified variant sites by functional impact on protein structure as given by SnpEff. (b) Alternative allele frequency distribution for known (dbSNP) and novel (non‐dbSNP) variants. (c) A scatterplot of alternative allele frequencies in the dataset versus gnomAD‐based frequencies; gnomAD‐only sites, multiallelic entries, and poorly covered regions are excluded. Red circle highlights sites not present in gnomAD. (d) Principal component analysis of allele frequencies for 121,171 exome variants present in Northwest Russia and all of the gnomAD populations. AFR, African; AMR, Ad Mixed American; EAS, East Asian; NFE, non‐Finnish European; NWR, Northwest Russia; SAS, South Asian. (e) Root mean square difference in allele frequencies between NWR and different gnomAD populations for the same variant sites as in (d). Error bars represent the 95% confidence interval margins for the mean
Prevalent disease alleles in the Northwest Russia dataset
| Location | rsID | Gene | gnomAD AF | gnomAD NFE AF | Allele count | Estimated AF (lower/upper CI) |
| Disease/condition |
|---|---|---|---|---|---|---|---|---|
| 12:103234271 | rs5030858 |
| 7.6 × 10−4 | 0.0015 | 6 (11) |
0.0081 | 7.9 × 10−4 (1.1 × 10−5) | Phenylketonuria |
| 13:113772982 | rs36209567 |
| 5.6 × 10−4 | 0.0010 | 5 (7) |
0.0067 |
0.0010 | Factor VII deficiency |
| 7:30058726 | rs542489955 |
| 5.5 × 10−4 | 0.0010 | 4 (8) |
0.0054 |
0.0061 | Ehlers‐Danlos syndrome, kyphoscoliotic type, 2 |
| 11:88911771 | rs61754365 |
| 2.9 × 10−4 | 3.2 × 10−4 | 4 (8) |
0.0054 | 1.1 × 10−4 (2.4 × 10−8) | Tyrosinase‐negative oculocutaneous albinism |
| 13:52518281 | rs76151636 |
| 9.2 × 10−4 | 0.0013 | 4 (6) |
0.0053 | 0.0159 (0.0095) | WIlson's disease |
| 22:50893287 | rs200488568 |
| 3.3 × 10−4 | 1.4 × 10−4 | 3 (4) |
0.0045 | 1.2 × 10−4 (5.0 × 10−5) | Charcot‐Marit‐Tooth disease, type 4B3 |
| 2:152357937 | rs549794342 |
| 2.7 × 10−4 | 4.7 × 10−4 | 3 (6) |
0.0040 |
0.0050 | Nemaline myopathy |
| 6:74191932 | rs201544686 |
| 1.7 × 10−4 | 2.0 × 10−4 | 3 (5) |
0.0040 | 4.9 × 10−4 (5.4 × 10−6) | Combined ox. phos. deficiency 10 |
| 6:80910740 | rs386834233 |
| 5.5 × 10−4 | 3.9 × 10−4 | 3 (4) |
0.0040 |
3.0 × 10−3
| Maple syrup urine disease |
| 5:54527618 | rs775051461 |
| 9.8 × 10−5 | 4.7 × 10−5 | 2 (2) |
0.0030 |
4.6 × 10−4
| Ciliary dyskinesia |
| 17:8015495 | rs121434233 |
| 1.5 × 10−4 | 2.8 × 10−4 | 2 (5) |
0.0027 |
0.018 | Autosomal recessive congenital ichthyosis 3 |
| 18:21119369 | rs543206298 |
| 7.6 × 10−5 | 1.1 × 10−4 | 2 (3) |
0.0027 |
0.0033 | Niemann‐Pick disease |
| 8:75276240 | rs104894080 |
| 3.2 × 10−5 | 7 × 10−5 | 2 (2) |
0.0027 |
0.0013 | Polyneuropathy, Charcot‐Marie‐Tooth intermediate A |
| 2:73677806 | rs1307458231 |
| 2.0 × 10−5 | 4.4 × 10−5 | 2 (2) |
0.0027 |
5.0 × 10−4
| Alstrom syndrome |
Values in brackets are derived from larger uncurated sample (see Materials and Methods). p‐values are calculated using allele frequencies in the gnomAD non‐Finnish European (NFE) population.
95% exact confidence interval margins for binomial proportions are given.
Disease prevalence estimated from known pathogenic variants’ frequencies
| Disease/condition | Gene(s) | Allele count | Carrier frequency (lower/upper CI) | Disease frequency (lower/upper CI) |
Known |
Comments/ |
|---|---|---|---|---|---|---|
| Retinal dystrophy, Stargardt disease |
| 13 (23) |
0.0350 |
3.1 × 10−4
| 1 in 10,000 |
Zol’nikova, |
| Cystic fibrosis |
| 11 (19) |
0.0296 |
2.2 × 10−4
| 1 in 10,000 | Reported carrier frequency of 0.032 (Abramov et al., |
| Phenylketonuria |
| 11 (18) |
0.0296 |
2.2 × 10−4
| 1 in 10,000 | Reported carrier frequency of 0.029 (Abramov et al., |
| Afibrinogenemia, congenital |
| 7 (10) |
0.0190 |
9.0 × 10−5
| n.a. | Reported lobal frequency of 1 in 1,000,000 (Mannucci et al., |
| Hepatic lipase deficiency |
| 6 (14) |
0.0162 |
6.6 × 10−5
| n.a. | — |
| Tyrosinase‐negative oculocutaneous albinism |
| 6 (12) |
0.0162 |
6.6 × 10−5
| 1 in 39,000 | — |
| Peeling skin syndrome |
| 5 (8) |
0.0135 |
4.5 × 10−5
| n.a. | — |
| Factor VII deficiency |
| 5 (7) |
0.0135 |
4.6 × 10−5
| n.a. |
Reported global frequency of |
| Wilson's disease |
| 4 (6) |
0.0108 |
2.9 × 10−5
| 1 in 30,000. | Similar global incidence reported (Ala, Walker, Ashkan, Dooley, & Schilsky, |
| Ehlers‐Danlos syndrome, kyphoscoliotic type, 2 |
| 4 (8) |
0.0108 |
2.9 × 10−5
| n.a. | — |
| Fructose intolerance, hereditary |
| 4 (7) |
0.0108 |
2.9 × 10−5
| n.a. | — |
| Galactosemia |
| 4 (5) |
0.0108 |
2.9 × 10−5
| 1 in 20,000 |
Reported carrier frequency of 0.006 |
Values in brackets are derived from larger uncurated sample (see Materials and Methods).
Frequency data for Russian population were obtained from literature and/or clinical genetics laboratories. n.a., no data available.