| Literature DB >> 35604078 |
Abstract
MOTIVATION: Measuring genetic diversity is an important problem because increasing genetic diversity is key to making new genetic discoveries, while also being a major source of confounding to be aware of in genetics studies.Entities:
Year: 2022 PMID: 35604078 PMCID: PMC9237724 DOI: 10.1093/bioinformatics/btac348
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.931
Reference populations with their size (N), and corresponding ancestry proportions (in %) inferred from the proposed snp_ancestry_summary method, for several GWAS summary statistics
| Ancestry group |
| BBJ | FinnGen | Perú | Qatar | Africa | GERA | PAGE | BrCa | PrCa | CAD | Body fat | COVID | Eczema | Epilepsy | Urate |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Africa (West) | 735 | 30 | 1.9 | 27.7 | 0.3 | 0.3 | 2.2 | 0.7 | 4 | 0.2 | 0.7 | 2.2 | ||||
| Africa (South) | 449 | 70 | 0.9 | 5.9 | 0.2 | 1.2 | 0.3 | 1.2 | 0.5 | 0.3 | 3.5 | |||||
| Africa (East) | 276 | 13 | 0.1 | 0.3 | 1.9 | |||||||||||
| Africa (North) | 268 | 22 | 0.5 | 0.1 | ||||||||||||
|
| ||||||||||||||||
| Middle East | 523 | 64.6 | 0.2 | 1.3 | ||||||||||||
|
| ||||||||||||||||
| Ashkenazi | 1975 | 4.4 | 0.5 | 0.2 | 1.8 | 0.4 | 0.8 | 0.4 | 0.6 | 1.8 | ||||||
| Italy | 345 | 4.6 | 3.1 | 1.2 | 9.7 | 5.5 | 3.4 | 0.8 | ||||||||
| Europe (East) | 667 | 10.5 | 6.9 | 11.3 | 10.5 | 11.4 | 13.2 | 11.7 | 13.9 | 10.8 | ||||||
| Finland | 143 (+ 99) | 100 | 2.4 | 0.7 | 9.7 | 13 | 5.9 | 8.8 | 14.8 | 12.8 | 6.5 | 2 | ||||
| Europe (North West) | 4416 | 59.9 | 5.6 | 68.5 | 64.5 | 51.8 | 59.8 | 61.4 | 70.9 | 68 | 46 | |||||
| Europe (South West) | 603 | 3.5 | 15.8 | 4.7 | 4.5 | 2.1 | 2.1 | |||||||||
|
| ||||||||||||||||
| South America | 473 (+ 84) | 100 | 4.6 | 25.4 | 1.5 | 0.8 | 1.6 | 0.4 | 1.8 | 0.5 | ||||||
|
| ||||||||||||||||
| Sri Lanka | 372 | 0.4 | 0.4 | 3.4 | 1.7 | 4.7 | 4.2 | 1.8 | 1.9 | |||||||
| Pakistan | 400 | 1.1 | 7 | 4 | ||||||||||||
| Bangladesh | 223 (+ 86) | 1.6 | ||||||||||||||
|
| ||||||||||||||||
| Asia (East) | 961 | 3.5 | 1.2 | 1.2 | 0.7 | 2.5 | 1.2 | 0.1 | 0.1 | 3.1 | ||||||
| Japan | 240 (+ 104) | 100 | 2.2 | 9.4 | 0.4 | 2.4 | 2.8 | 0.7 | 0.7 | 0.3 | 29.8 | |||||
| Philippines | 295 | 1.5 | 4.6 | 0.2 | ||||||||||||
Note: Note that, because they are very close ancestry groups, we merge a posteriori the ancestry coefficients α from ‘Ireland’, ‘United Kingdom’ and ‘Scandinavia’ into a single ‘Europe (North West)’ group, and similarly for ‘Europe (North East)’ and ‘Europe (South East)’ into a single ‘Europe (East)’ group. Citations for the allele frequencies used: the BBJ (Sakaue ), FinnGen (Kurki ), GWAS in Peruvians (Asgari ), GWAS in Qataris (Thareja ), GWAS in Sub-Saharan Africans (Africa; Chen ), GERA (Hoffmann ), PAGE (Wojcik ), breast cancer (BrCa; Michailidou ), prostate cancer (PrCa; Schumacher ), coronary artery disease (CAD; Nikpay ), body fat percentage (Lu ), COVID-19 (The COVID-19 Host Genetics Initiative, 2021), eczema (Paternoster ), epilepsy (The International League Against Epilepsy Consortium on Complex Epilepsies, 2018) and serum urate (Tin ). Several of these GWAS summary statistics have been downloaded through the NHGRI-EBI GWAS Catalog (MacArthur ).