| Literature DB >> 21542921 |
Hiba Ma Babiker1, Carina M Schlebusch, Hisham Y Hassan, Mattias Jakobsson.
Abstract
BACKGROUND: There is substantial ethnic, cultural and linguistic diversity among the people living in east Africa, Sudan and the Nile Valley. The region around the Nile Valley has a long history of succession of different groups, coupled with demographic and migration events, potentially leading to genetic structure among humans in the region. RESULT: We report the genotypes of the 15 Identifiler microsatellite markers for 498 individuals from 18 Sudanese populations representing different ethnic and linguistic groups. The combined power of exclusion (PE) was 0.9999981, and the combined match probability was 1 in 7.4 × 1017. The genotype data from the Sudanese populations was combined with previously published genotype data from Egypt, Somalia and the Karamoja population from Uganda. The Somali population was found to be genetically distinct from the other northeast African populations. Individuals from northern Sudan clustered together with those from Egypt, and individuals from southern Sudan clustered with those from the Karamoja population. The similarity of the Nubian and Egyptian populations suggest that migration, potentially bidirectional, occurred along the Nile river Valley, which is consistent with the historical evidence for long-term interactions between Egypt and Nubia.Entities:
Year: 2011 PMID: 21542921 PMCID: PMC3118356 DOI: 10.1186/2041-2223-2-12
Source DB: PubMed Journal: Investig Genet ISSN: 2041-2223
Figure 1Map of the Sudan representing the geographic locations of the 18 populations in the present study. Geographic classification (used in the partitioning of variation analysis) of each population is also indicated on the map: northern Sudan (triangle), central Sudan (open square), eastern Sudan (filled circle), western Sudan (open circle) and southern Sudan (filled square). The dotted circle represents the suggested region occupied by the Messiria population.
Sample sizes (n) and linguistic affiliations of the populations in the study
| Ethnic group | Population | Linguistic family | Linguistic subgroup | Geographic group | |
|---|---|---|---|---|---|
| Arab | Bataheen | 29 | Afro-Asiatic | Semetic | Central |
| Arab | Gaalien | 57 | Afro-Asiatic | Semetic | Central |
| Arab | Shaigia | 17 | Afro-Asiatic | Semetic | Northern |
| Arab | Messiria | 8 | Afro-Asiatic | Semetic | Western |
| Copt | Copt | 31 | Afro-Asiatic | Ancient Egyptian | Central |
| Hausa | Hausa | 10 | Afro-Asiatic | Chadic | Eastern |
| Beja | Beni-Amer | 35 | Afro-Asiatic | Cushitic | Eastern |
| Beja | Hadendowa | 29 | Afro-Asiatic | Cushitic | Eastern |
| Nubian | Danagla | 40 | Nilo-Saharan | Eastern Sudanic | Northern |
| Nubian | Mahas | 31 | Nilo-Saharan | Eastern Sudanic | Northern |
| Nubian | Halfawieen | 24 | Nilo-Saharan | Eastern Sudanic | Northern |
| Nilotic | Dinka | 30 | Nilo-Saharan | Eastern Sudanic | Southern |
| Nilotic | Nuer | 19 | Nilo-Saharan | Eastern Sudanic | Southern |
| Nilotic | Shilluk | 19 | Nilo-Saharan | Eastern Sudanic | Southern |
| Nilotic | Baria | 6 | Nilo-Saharan | Eastern Sudanic | Southern |
| Nuba | Nuba | 34 | Nilo-Saharan and Niger-Congo | Eastern Sudanic and Kordofanian | Western |
| Zagawa | Zagawa | 29 | Nilo-Saharan | Saharan | Western |
| Gemar | Gemar | 6 | Nilo-Saharan | Eastern Sudanic | Western |
Allele frequencies for each of the 15 Identifiler STR loci found in the Sudanese sample.
| Allele | CSF1PO | D13S317 | D16S539 | D18S51 | D19S433 | D21S11 | D2S1338 | D3S1358 | D5S818 | D7S820 | D8S1179 | FGA | TH01 | TPOX | vWA |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 4 | 0.001 | ||||||||||||||
| 6 | 0.228 | 0.006 | |||||||||||||
| 7 | 0.007 | 0.001 | 0.004 | 0.001 | 0.397 | 0.005 | |||||||||
| 8 | 0.048 | 0.069 | 0.039 | 0.002 | 0.084 | 0.228 | 0.001 | 0.091 | 0.390 | ||||||
| 9 | 0.037 | 0.048 | 0.189 | 0.001 | 0.002 | 0.037 | 0.106 | 0.003 | 0.167 | 0.292 | |||||
| 9.3 | 0.083 | ||||||||||||||
| 10 | 0.316 | 0.027 | 0.069 | 0.004 | 0.014 | 0.098 | 0.343 | 0.027 | 0.034 | 0.100 | |||||
| 10.2 | 0.001 | ||||||||||||||
| 11 | 0.242 | 0.311 | 0.320 | 0.010 | 0.019 | 0.202 | 0.212 | 0.053 | 0.189 | ||||||
| 11.2 | 0.001 | 0.004 | |||||||||||||
| 12 | 0.292 | 0.384 | 0.205 | 0.101 | 0.075 | 0.001 | 0.366 | 0.091 | 0.113 | 0.017 | 0.002 | ||||
| 12.2 | 0.010 | ||||||||||||||
| 13 | 0.048 | 0.128 | 0.147 | 0.087 | 0.225 | 0.003 | 0.203 | 0.015 | 0.241 | 0.001 | 0.003 | ||||
| 13.2 | 0.002 | 0.055 | |||||||||||||
| 14 | 0.008 | 0.031 | 0.031 | 0.096 | 0.261 | 0.065 | 0.008 | 0.298 | 0.077 | ||||||
| 14.2 | 0.001 | 0.078 | |||||||||||||
| 15 | 0.003 | 0.001 | 0.111 | 0.092 | 0.001 | 0.305 | 0.001 | 0.188 | 0.156 | ||||||
| 15.2 | 0.082 | 0.001 | |||||||||||||
| 16 | 0.136 | 0.033 | 0.051 | 0.263 | 0.060 | 0.259 | |||||||||
| 16.2 | 0.001 | 0.033 | 0.001 | ||||||||||||
| 17 | 0.140 | 0.008 | 0.139 | 0.271 | 0.010 | 0.008 | 0.245 | ||||||||
| 17.2 | 0.002 | 0.007 | |||||||||||||
| 18 | 0.114 | 0.001 | 0.091 | 0.082 | 0.003 | 0.009 | 0.155 | ||||||||
| 18.2 | 0.001 | 0.001 | |||||||||||||
| 19 | 0.078 | 0.209 | 0.005 | 0.001 | 0.047 | 0.075 | |||||||||
| 19.2 | 0.001 | 0.001 | |||||||||||||
| 20 | 0.059 | 0.128 | 0.002 | 0.065 | 0.024 | ||||||||||
| 21 | 0.034 | 0.060 | 0.108 | 0.003 | |||||||||||
| 21.2 | 0.005 | ||||||||||||||
| 22 | 0.013 | 0.128 | 0.183 | 0.001 | |||||||||||
| 22.2 | 0.002 | ||||||||||||||
| 23 | 0.003 | 0.091 | 0.166 | ||||||||||||
| 24 | 0.056 | 0.143 | |||||||||||||
| 24.2 | 0.003 | 0.002 | |||||||||||||
| 25 | 0.035 | 0.091 | |||||||||||||
| 25.2 | 0.001 | ||||||||||||||
| 26 | 0.002 | 0.009 | 0.039 | ||||||||||||
| 27 | 0.030 | 0.001 | 0.026 | ||||||||||||
| 28 | 0.123 | 0.001 | 0.063 | ||||||||||||
| 29 | 0.286 | 0.035 | |||||||||||||
| 30 | 0.226 | ||||||||||||||
| 30.2 | 0.005 | 0.006 | |||||||||||||
| 31 | 0.082 | ||||||||||||||
| 31.2 | 0.039 | ||||||||||||||
| 32 | 0.017 | ||||||||||||||
| 32.2 | 0.080 | ||||||||||||||
| 33 | 0.007 | ||||||||||||||
| 33.2 | 0.034 | ||||||||||||||
| 34 | 0.009 | ||||||||||||||
| 34.2 | 0.005 | ||||||||||||||
| 35 | 0.026 | ||||||||||||||
| 36 | 0.017 | ||||||||||||||
| 37 | 0.005 | ||||||||||||||
| 38 | 0.002 |
See table 3 for exact sample size (n) for each locus
Forensic summary statistics, observed and expected heterozygosity, deviation from Hardy-Weinberg-equilibrium (HWE), minimum allele frequency cut-off and sample size for each locus (n)
| Locus | CSF1PO | D13S317 | D16S539 | D18S51 | D19S433 | D21S11 | D2S1338 | D3S1358 | D5S818 | D7S820 | D8S1179 | FGA | TH01 | TPOX | vWA |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| n | 485 | 489 | 491 | 485 | 491 | 491 | 489 | 490 | 490 | 485 | 491 | 486 | 490 | 491 | 491 |
| MPa | 0.103 | 0.114 | 0.073 | 0.021 | 0.043 | 0.047 | 0.028 | 0.106 | 0.090 | 0.093 | 0.069 | 0.026 | 0.100 | 0.126 | 0.066 |
| PDb | 0.897 | 0.886 | 0.927 | 0.979 | 0.957 | 0.953 | 0.972 | 0.894 | 0.91 | 0.907 | 0.931 | 0.974 | 0.9 | 0.874 | 0.934 |
| PEc | 0.504 | 0.409 | 0.552 | 0.785 | 0.665 | 0.611 | 0.68 | 0.533 | 0.519 | 0.469 | 0.503 | 0.748 | 0.431 | 0.406 | 0.681 |
| PICd | 0.708 | 0.689 | 0.762 | 0.889 | 0.833 | 0.816 | 0.865 | 0.711 | 0.732 | 0.729 | 0.769 | 0.874 | 0.711 | 0.67 | 0.79 |
| Hobse | 0.746 | 0.687 | 0.774 | 0.895 | 0.833 | 0.804 | 0.843 | 0.763 | 0.755 | 0.726 | 0.745 | 0.877 | 0.702 | 0.684 | 0.841 |
| Hexpf | 0.751 | 0.731 | 0.792 | 0.899 | 0.850 | 0.834 | 0.878 | 0.754 | 0.767 | 0.766 | 0.798 | 0.886 | 0.747 | 0.718 | 0.813 |
| Pg | 0.955 | 0.021 | 0.335 | 0.357 | 0.195 | 0.211 | 0.022 | 0.335 | 0.258 | 0.015 | 0.005 | 0.749 | 0.311 | 0.400 | 0.018 |
| MAFh | 0.0052 | 0.0051 | 0.005 | 0.0052 | 0.005 | 0.005 | 0.0051 | 0.0051 | 0.0051 | 0.0052 | 0.005 | 0.0052 | 0.0051 | 0.005 | 0.005 |
aMatching probability.
bPower of discrimination.
cPower of exclusion.
dPolymorphism information content.
eObserved heterozygosity.
fExpected heterozygosity.
gHWE, Fisher's exact test P-value executed with 100,000 steps in the Markov chain and 1,000 dememorization steps.
hMinimum allele frequency cut-off (5/(2N)).
Figure 2The mean number of .
Figure 3Mean number of alleles private to a pair of Sudanese populations as a function of the sample size .
Figure 4Mean number of alleles private to a pair of populations as a function of the sample size . Comparisons between each Sudanese population and the Egyptian population is shown in blue; between each Sudanese population and the Somali population in orange; and between each Sudanese population and the Karamoja population from Uganda is shown in green.
Figure 5Clustering of 454 people from 18 Sudanese populations, 218 people from the Karamoja population (Uganda), 265 people from five Egyptian populations and 230 people from one Somali population assuming two or three clusters (.The mean (across replicate runs) log likelihood was -64611 for K = 2 and -64853 for K = 3. Each individual is represented by a column divided into K colors with each color representing a cluster. Different populations are separated by a black line and are labeled below the figure by self-reported ethnicity and above the figure by geographic region.
Figure 6Principal components based on genetic distance of the Sudanese populations and the Egyptian, the Somali and the Karamoja populations. The populations that have been genotyped in this study are indicated by filled circles. A) The first and the second principal components. B) The first and the third principal components.
Partition of genetic variation between the Sudanese populations (using analysis of molecular variance) based on linguistic or geographic classification.
| Grouping criteria ( | Groups | Variance (%) | ||
|---|---|---|---|---|
| Within populations | Between populations, within groups | Between groups | ||
| Linguistic groups,a (2) | Afro-Asiatic, Nilo-Saharan | 99.07 | 0.73 | 0.21 |
| Geographic groups, (5) | Central, Northern, Eastern, Western and Southernb | 98.99 | 0.49 | 0.52 |
For all F-statistics, p < 0.01
aSee Figure 1 and Table 1 for the geographic and linguistic classification of populations.
bThe Nuba population was excluded because the group included individuals speaking different languages belonging to the Niger-Congo and Nilo-Saharan linguistic families.