| Literature DB >> 28070634 |
Andrew J Pakstis1, Longli Kang2,3, Lijun Liu2,3, Zhiying Zhang2,3, Tianbo Jin2,3, Elena L Grigorenko4,5, Frank R Wendt6, Bruce Budowle7,8, Sibte Hadi9, Mariam Salam Al Qahtani9, Niels Morling10, Helle Smidt Mogensen10, Goncalo E Themudo10, Usha Soundararajan11, Haseena Rajeevan11, Judith R Kidd11, Kenneth K Kidd11.
Abstract
Ancestry inference for an individual can only be as good as the reference populations with allele frequency data on the SNPs being used. If the most relevant ancestral population(s) does not have data available for the SNPs studied, then analyses based on DNA evidence may indicate a quite distantly related population, albeit one among the more closely related of the existing reference populations. We have added reference population allele frequencies for 14 additional population samples (with >1100 individuals studied) to the 125 population samples previously published for the Kidd Lab 55 AISNP panel. Allele frequencies are now publicly available for all 55 SNPs in ALFRED and FROG-kb for a total of 139 population samples. This Kidd Lab panel of 55 ancestry informative SNPs has been incorporated in commercial kits by both ThermoFisher Scientific and Illumina for massively parallel sequencing. Researchers employing those kits will find the enhanced set of reference populations useful.Entities:
Keywords: Alfred; Ancestry; FROG-kb; Reference database; SNP
Mesh:
Year: 2017 PMID: 28070634 PMCID: PMC5491587 DOI: 10.1007/s00414-016-1524-z
Source DB: PubMed Journal: Int J Legal Med ISSN: 0937-9827 Impact factor: 2.686
The 14 new reference populations for the 55 AISNP panel
| Geographical region and population sample description | Sample size ( | Sample unique identifier: ALFRED database | Data source and typing method footnote |
|---|---|---|---|
| Southwest Asia | |||
| Saudi, Saudi Arabia | 91 | SA004393T | 1 |
| Arabs, Abu Dhabi, United Arab Emirates | 69 | SA004394U | 2 |
| East Asia | |||
| Uygur, Xinjiang, China | 100 | SA004301I | 3 |
| Mongols, Inner Mongolia, China | 100 | SA004303K | 3 |
| Hui, Ningxia, China | 100 | SA004304L | 3 |
| Han—Northwest, Shaanxi, China | 100 | SA004305M | 3 |
| Han—Southwest, Yunnan, China | 100 | SA004307O | 3 |
| Tibetans, Southwest Tibet, China | 100 | SA004302J | 3 |
| Miao = Hmong, Guizhou, China | 100 | SA004306N | 3 |
| Li = Hlai, Hainan, China | 100 | SA004308P | 3 |
| Greenland | |||
| Native Greenlanders, Greenland | 104 | SA004396W | 4 |
| North America | |||
| Yavapai, Arizona, USA | 62 | SA004395V | 5 |
| Plains AmerIndians | 56 | SA000023F | 1 |
| Southwest AmerIndians | 51 | SA000025H | 1 |
1. Genotypes generated at Kidd Lab used the standard TaqMan assays employed previously for the 55 AISNP panel [4, 5]; Saudi DNA supplied by Elena Grigorenko; samples are from healthy individuals who partially overlap with normal controls in Ercan-Sencicek et al., 2015 [6]
2. Allele frequencies for SNPs contributed by Sibte Hadi and colleagues, University of Central Lancashire. Abu Dhabi sample genotypes were identified using the HID Ancestry panel (ThermoFisher Scientific) on the Ion Torrent PGM
3. Genotypes supplied by Longli Kang and colleagues, Xizang Minzu University. SNP genotyping was done primarily with the Sequenom MassARRAY RS1000 following manufacturer’s standard protocol [7]. For four SNPs, genotyping was based on standard TaqMan assays as described for the 55 AISNP panel [4, 5]
4. Genotypes provided by Niels Morling and colleagues, University of Copenhagen. Typing method described in [8]
5. Genotypes contributed by Frank Wendt, Bruce Budowle, and colleagues, University of North Texas Health Science Center. See [9] for typing method details
Fig. 1STRUCTURE results for estimated cluster membership values at K = 9 in 139 reference populations. Population abbreviations are explained in Table S1