| Literature DB >> 31285530 |
Andrew J Pakstis1, Cemal Gurkan2,3, Mustafa Dogan4, Hasan Emin Balkaya4, Serkan Dogan4, Pavlos I Neophytou5, Lotfi Cherni6,7, Sami Boussetta6, Houssein Khodjet-El-Khil8, Amel Ben Ammar ElGaaied6, Nina Mjølsnes Salvo9, Kirstin Janssen9, Gunn-Hege Olsen9, Sibte Hadi10, Eida Khalaf Almohammed10,11, Vania Pereira12, Ditte Mikkelsen Truelsen12, Ozlem Bulbul13, Usha Soundararajan1, Haseena Rajeevan14, Judith R Kidd1, Kenneth K Kidd15.
Abstract
The set of 55 ancestry informative SNPs (AISNPs) originally developed by the Kidd Lab has been studied on a large number of populations and continues to be applied to new population samples. The existing reference database of population samples allows the relationships of new population samples to be inferred on a global level. Analyses show that these autosomal markers constitute one of the better panels of AISNPs. Continuing to build this reference database enhances its value. Because more than half of the 25 ethnic groups recently studied with these AISNPs are from Southwest Asia and the Mediterranean region, we present here various analyses focused on populations from these regions along with selected reference populations from nearby regions where genotype data are available. Many of these ethnic groups have not been previously studied for forensic markers. Data on populations from other world regions have also been added to the database but are not included in these focused analyses. The new population samples added to ALFRED and FROG-kb increase the total to 164 population samples that have been studied for all 55 AISNPs.Entities:
Mesh:
Year: 2019 PMID: 31285530 PMCID: PMC6871633 DOI: 10.1038/s41431-019-0466-6
Source DB: PubMed Journal: Eur J Hum Genet ISSN: 1018-4813 Impact factor: 4.246
The 25 new reference populations in FROG-kb for the 55 AISNP panel
| Geographical region and population sample | Sample size ( | Sample UID in ALFRED | Data sources and typing methods {footnote#} | Genotypes, frequencies available |
|---|---|---|---|---|
| Africa | ||||
| Southern Tunisians | 96 | SA004637U | {1}a | Both |
| Somalis | 98 | SA004636T | [6] {4}b | Both |
| Europe | ||||
| Norwegians | 200 | SA004650P | {2}a | Both |
| Danes | 142 | SA004635S | [6] {4}b | Both |
| Basques, Spain | 108 | SA004454R | [7]c | Both |
| Greek Cypriots | 96 | SA004645T | {1}a | Both |
| Southwest Asia | ||||
| Qatari | 158 | SA004651Q | {3}a | Both |
| Arabs, N. Iraq | 130 | SA004641P | {1}d | Both |
| Chaldeans, N. Iraq | 22 | SA004638V | {1}a | Both |
| Kurds, N. Iraq | 148 | SA004640O | {1}d | Both |
| Shabaks, N. Iraq | 9 | SA004643R | {1}a | Both |
| Syriacs, N. Iraq | 125 | SA004644S | {1}d | Both |
| Turkmen, N. Iraq | 129 | SA004642Q | {1}d | Both |
| Yazidis, N. Iraq | 148 | SA004639W | {1}d | Both |
| Turkish | 88 | SA004633Q | [8] {4}b | Both |
| Iranians | 93 | SA004634R | [8] {4}b | Both |
| East Asia | ||||
| Chengdu Tibetans | 63 | SA004624Q | [9]c | Both |
| Liangshan Tibetans | 33 | SA004616R | [9]c | Both |
| Qinghai Tibetans | 25 | SA004625R | [9]c | Both |
| Yi (Liangshan, Sichuan) | 48 | SA004626S | [9]c | Both |
| Japanese (Honshu) | 49 | SA004525Q | [10]c | Frequencies |
| Okinawa Japanese | 47 | SA004526R | [10]c | Frequencies |
| South America | ||||
| Afro-Ecuadorian | 29 | SA004509S | [11]c | Frequencies |
| Ecuadorian mestizo | 67 | SA004519T | [11]c | Frequencies |
| Kichwa (Ecuador) | 66 | SA004510K | [11]c | Frequencies |
1. Genotypes generated at Kidd Lab employing standard TaqMan assays used previously for 55 AISNP panel [2, 3]. The seven population samples from Northern Iraq were collected by Mustafa Dogan, International Burch University; DNA extracted from buccal swab samples by Hasan Emin Balkaya at the Turkish Cypriot DNA laboratory. The Greek Cypriot samples were collected and DNA was extracted by Pavlos I. Neophytou, Mendel Center for Biomedical Sciences Nicosia. Southern Tunisians were collected via buccal swabs by Lotfi Cherni and colleagues at the University of Tunis el Manar
2. Genotypes provided by and the individual samples were collected by Nina Mjølsnes Salvo and colleagues, UiT—The Arctic University of Norway. Typing method: Illumina/Verogen ForenSeq DNA Signature Prep Kit on the MiSeq FGx Forensic Genomics System [12]
3. Genotypes provided by and the individual samples were collected by Sibte Hadi and colleagues, University of Central Lancashire. Typing method: Illumina ForenSeq DNA signature panel [12]
4. Genotypes for Danes, Somalis, Turkish, and Iranians provided by and the individual samples were collected in Denmark by Vania Pereira, Ditte M. Truelsen, Helle S. Mogensen, Maryam S. Farzad, Torben Tvedebrink, Claus Børsting, and Niels Morling, University of Copenhagen. All four samples consist of unrelated individuals collected in Denmark; the individuals from Somalia, Turkey, and Iran are immigrants to Denmark. Typing method: ThermoFisher Precision ID Ancestry panel
aIndicates population samples and SNP data reported for first time in this study
bIndicates population samples reported initially in a previous publication as cited here in the data source column of this table. SNP data in this study was supplied by co-authors; see footnote #4
cIndicates SNP data and population samples employed in this study that derive from publications as cited here in data source column of this table
dIndicates population samples initially reported in a previous publication [13] for Y-chromosome data only. The autosomal SNP data in this study has not been reported previously
Fig. 1PCA results based on the 55 AISNP allele frequencies for 76 reference populations consisting of 43 groups from SW Asia–Europe and 33 selected populations in adjacent world regions. The 16 out of 25 new reference populations listed alphabetically in the inset box are from SW Asia, Europe, North Africa, East Africa
Fig. 2Estimated cluster membership bar plot via STRUCTURE for 43 reference populations in Southwest Asia—Europe and 33 selected populations in adjacent world regions—Africa, South Central Asia, Central Asia, Siberia. Displaying the highest likelihood run out of 20 runs at K = 6
Fig. 4Random match probability and most common genotype frequency for each of 76 populations based on the 55 AISNP panel
Fig. 3The 30 highest likelihoods calculated by FROG-kb for two Kurdish individuals based on the 55 AISNP panel and 164 current reference populations. The dot next to the value for the probability of genotype in each population identify results within one order of magnitude of the highest likelihood and therefore not significantly different