Literature DB >> 32490095

Population dataset for 21 simple tandem repeat loci in the Akan population of Ghana.

Abban Edward Kofi1,2, Hashom Mohd Hakim3, Hussein Omar Khan3, Siti Afifah Ismail3, Anita Ghansah4, Abd Rashid Nur Haslindawaty1, Shaharum Shamsuddin1, Mohd Yusmaidie Aziz5, Geoffrey Keith Chambers6, Hisham Atan Edinur1,7,8.   

Abstract

Short tandem repeat (STR) loci are widely used as genetic marker for ancestral and forensic analyses. The latter application includes for paternity testing and DNA profiling of samples collected from scenes of crime and suspects. This survey provides the first dataset for 21 STR loci across the Akan population in Ghana by genotyping of 109 unrelated healthy individuals using Investigator 24plex kit. None of the STR loci screened deviated from Hardy-Weinberg equilibrium after applying Bonferroni correction. Overall, 224 unique alleles were observed with allele frequencies ranging from 0.005 to 0.518. The combined match probability, combined power of exclusion and combined power discrimination were 1 in 4.07 × 10-25, 0.999999999 and 1, respectively. Principal coordinate analysis carried out using 21 STR allele frequency data mapped the Akans with Nigerian subpopulation groups (Hausa, Igbo and Yoruba), but separated from Thais of Thailand, Chechen of Jordan and Tijuana of Mexico.
© 2020 The Author(s).

Entities:  

Keywords:  Akan; Ghana; Investigator 24plex; STR

Year:  2020        PMID: 32490095      PMCID: PMC7262416          DOI: 10.1016/j.dib.2020.105746

Source DB:  PubMed          Journal:  Data Brief        ISSN: 2352-3409


Specifications Table Value of the data In the Sub-Saharan region of Africa, population data for these 21 STR loci are only available for Nigerian subpopulations. The 21 STR dataset for the Akans of Ghana reported in this article is thus the first from a different country. Our 21 locus STR dataset provides an important source of information to estimate their statistical value as DNA evidence (match probability, power of exclusion power etc.) for this population group Data from this survey also supports the general value of STR loci for population studies across the region; allele frequencies of the 21 STR loci can be used to examine the past history of population events in Akans including gene flow, natural selection and migration patterns. The population genetic data and forensic statistics reported for the Akans can be used as a reference standard in future studies of other sub-population groups in Ghana; Ewe, Mole-Dagbon, Ga-Dangbe and Guang.

Data description

The allelic scores for the 21 STR loci examined across 109 unrelated Akan individuals are shown in Supplementary Table 1. Their allele frequency data and forensic parameters are shown in Table 1. A total of 224 unique alleles were observed with corresponding allele frequency ranging from 0.005 to 0.518. The observed heterozygosity (Ho) ranged from 0.725 (THO1, DS5818 and D7S820 loci) to 0.917 (SE33 locus) while the expected heterozygosity (He) ranged from 0.669 (TH01 locus) to 0.927 (SEE33 locus). After applying Bonferroni correction (p = 0.05/21, at 95% significance level), no deviation from Hardy-Weinberg equilibrium (HWE)were observed. The highest power of discrimination (PD) and polymorphic information content (PIC) were 0.982 and 0.917 (respectively), recorded for the SE33 locus. Locus TH01 showed the lowest PIC (0.628) and PD (0.824) values. The combined matching probability (CMP), combined probability of exclusion (CPE) and combined discriminating power (CPD) were 1 in 4.07 × 10−25, 0.999999999 and 1, respectively.
Table 1

Allele frequency data and forensic parameters for Akans of Ghana (n=109)

AlleleTH01D3S1358VWAD21S11TPOXD1S1656D12S391SE33D10S1248D22S1045D19S433D8S1179D2S1338D2S441D18S51FGAD16S539CSF1POD13S317D5S818D7S820
50.009
60.0870.087
70.5180.0230.0960.009
80.2110.2710.0050.0090.0140.0780.0230.0550.220
90.1010.2570.0050.0090.2200.0280.0090.0050.119
9.30.055
100.0180.1100.0140.0050.0320.0090.0090.0230.1240.2520.0090.0600.358
110.2200.0370.0500.0960.1100.0280.3620.0090.3580.2390.2520.2340.239
11.30.128
120.0050.0230.0690.0050.1380.0640.1010.1330.1240.0830.1380.2390.4720.3530.055
12.20.041
130.0280.0050.1330.0140.1740.3030.1470.1060.0090.1470.1740.257
13.20.050
140.0640.0500.0050.2390.0050.0460.3120.1740.1790.3390.2250.0410.0690.0500.032
14.20.064
150.2980.2800.1880.0870.0500.2200.1970.0370.2570.0320.2110.009
15.20.069
15.30.023
160.4270.2200.1150.0690.0500.0780.1790.0460.0640.1560.005
16.20.032
16.30.087
170.1610.2200.0280.1330.0920.0180.2060.0280.0730.1700.005
17.30.023
180.0410.1100.0050.2570.1240.0370.0050.0230.1150.028
18.20.009
18.30.0320.005
190.0050.0550.0090.1830.1380.0050.1510.1060.055
19.20.014
200.0320.1380.0960.0500.0600.018
20.20.014
210.0050.0500.0600.1740.0230.083
21.20.009
220.0500.0280.1330.0050.206
22.20.0090.009
230.0050.1010.0090.142
23.20.0180.014
240.0050.0180.1190.0050.165
24.20.0180.005
250.0870.087
25.20.0550.005
260.0050.0140.073
26.20.060
270.0180.0090.037
27.20.069
280.3030.018
28.20.032
290.2020.005
29.20.023
300.165
30.20.0140.014
310.083
31.20.028
320.028
32.20.073
330.014
33.20.014
340.009
350.037
360.005
N7791691412209101210127142167886
MP0.1760.1490.0660.0520.0890.0390.0500.0180.0910.0520.0550.0870.0290.0990.0500.0330.0930.0820.1490.1090.112
PD0.8240.8510.9340.9480.9110.9610.9500.9820.9090.9480.9450.9130.9710.9010.9500.9670.9070.9180.8510.8910.888
PIC0.6280.6480.7780.8040.7600.8460.8280.9170.7670.8200.8220.7440.8730.7430.8510.8730.7330.7730.6310.7060.709
PE0.4680.5140.5300.5300.6650.6650.7560.8310.6300.6300.7010.5300.7010.6650.8120.7560.5790.6830.3700.4680.468
TPI1.8172.0192.0962.0963.0283.0284.1926.0562.7252.7253.4062.0963.4063.0285.4504.1922.3703.2061.4731.8171.817
Ho0.7250.7520.7610.7610.8350.8350.8810.9170.8170.8170.8530.7610.8530.8350.9080.8810.7890.8440.6610.7250.725
He0.6690.7010.8090.8280.7950.8640.8500.9270.8000.8440.8420.7790.8880.7770.8690.8870.7710.8050.6830.7500.753
p-HWE0.0080.8760.2520.7560.6340.2960.5960.8180.1270.4700.4460.1830.7740.0060.0340.1620.4270.3480.5210.2490.036

N, number of loci; MP, matching probability; PD, power of discrimination; PIC, polymorphic information content; PE, power of exclusion; TPI, typical paternity index; Ho, observed heterozygosity; He, expected heterozygosity; p-HWE, p-value for Hardy-Weinberg equilibrium.

Allele frequency data and forensic parameters for Akans of Ghana (n=109) N, number of loci; MP, matching probability; PD, power of discrimination; PIC, polymorphic information content; PE, power of exclusion; TPI, typical paternity index; Ho, observed heterozygosity; He, expected heterozygosity; p-HWE, p-value for Hardy-Weinberg equilibrium. Principal coordinate (PCO) mapping performed using the allele frequency data across all 21 STR loci in the Akans and 7 other previously reported STR datasets (Supplementary Table 2) is shown in Fig. 1. The first and second axes accounted for 59.43% and 12.71% of genetic variability between datasets. The populations from Asia and North America are clustered in the upper left-hand quadrant. In contrast, the populations of Middle Eastern origin are plotted on the lower left quadrant. The Akans plotted closely with the Nigerian subpopulations in the lower right quadrant.
Fig. 1

PCO plot of 21 STR loci allele frequency data obtained from present survey of Akans and other reference populations.

PCO plot of 21 STR loci allele frequency data obtained from present survey of Akans and other reference populations.

Experimental design, materials, and methods

Ethical clearance

This research was reviewed and approved by the Institutional Review Board of Nugochi Memorial Institute of Medical Research (NMIMR), University of Ghana (permit no: NMIMR-IRB CPN 118/15-16 revd. 2019) and the Human Ethics Committee of University Sains Malaysia (USMKK), Health Campus, Kelantan, Malaysia (permit no: USM/JEPeM/16050188).

Sample collection

A total of 109 healthy unrelated individuals of Akan ethnicity aged between 18 to 45 years were recruited for this research. Each individual provided written informed consent and have at least three generations of un-admixed history. The sampling locations included Accra, Koforidua and Kumasi of Ghana.

DNA isolation and STR amplification

Cheek cells were collected using buccal swab sticks and genomic DNA was extracted from them using Invisorb® Spin Forensic kit (STRATEC Molecular GmbH, Germany). Total genomic DNA was quantified using Investigator Quantiplex Hyres kit according to manufacturer's recommendation (Qiagen, Hilden, Germany). The Investigator 24plex QS amplification kits (Qiagen, Germany) was utilized to amplify the sex-determining marker Amelogenin, 1 Y-STR locus, 2 quality sensors and 21 autosomal STR loci namely CSF1PO, D10S1248, D12S391, D13S317, D16S539, D18S51, TPOX, D19S433, D1S1656, D21S11, D22S1045, D2S1338, D2S441, D3S1358, D5S818, D7S820, FGA, D8S1179, SE33, TH01 and vWA on the GeneAmp PCR System 9700 thermal cycler (Applied Biosytems, USA) in total of 12.5 ul reaction volume. PCR products were separated by multi-capillary electrophoresis in 3500XL Genetic Analyzer (Applied Biosystems, Foster City, CA, USA). The STR alleles were systematically called using the GeneMapper IDx v4.1 software (Applied Biosystems, USA). Allele designations were determined by comparison of the sample fragments with those of allelic ladders provided in the kit.

Statistical analysis

The HWE and the expected heterozygosity (He) values were calculated using Arlequin v3.5.2.2 [1,2]. The significance level for deviation from HWE (<0.05) was adjusted to p > 0.00238 after Bonferroni correction (p = 0.05/21 = 0.00238, where 21 is the number of loci and 0.05 is the standard HWE significance value). Allele frequencies for the 21 STR loci, PD, power of exclusion (PE), MP, typical paternity index (TPI), PIC and Ho values were computed using the Powerstats software version 1.2 [3]. PCO data mapping was used to compare and visualize genetic relatedness between Akans (Ghana), Thais [4], Tijuanans [5], Chechens living in Jordan [6], Saudis [7], and Nigerians [8]). The PCO analysis was performed using the MVSP software version 3.22 [9] and STR datasets for PCO analysis are provided as Supplementary table 2.

Declaration of Competing Interest

The authors declare that they have no known competing interests or personal relationships that could have appeared to influence the work reported in this paper.
SubjectGenetics
Specific subject areaDNA profiling
Type of dataTables and figure
How data were acquiredCapillary electrophoresis of STR polymerase chain reaction amplified products on 3500XL Genetic Analyser (Applied Biosystems, USA)
Data formatRaw and analyzed
Parameters for data collectionGenomic DNA samples extracted from cheek cells were used as templates for amplification of 21 STR loci using Investigator 24plex QS kits (Qiagen, Germany)
Description of data collectionComparison of separated STR fragments with standard allelic ladder included in the Investigator 24plex QS kit using the GeneMapper IDx v4.1 software (Applied Biosystems, USA).
Data source locationForensic Science Program, School of Health Sciences, Health Campus, 16150 Kubang Kerian, Kelantan, Malaysia
Data accessibilityData available in this article
  6 in total

1.  Performing the exact test of Hardy-Weinberg proportion for multiple alleles.

Authors:  S W Guo; E A Thompson
Journal:  Biometrics       Date:  1992-06       Impact factor: 2.571

2.  Arlequin suite ver 3.5: a new series of programs to perform population genetics analyses under Linux and Windows.

Authors:  Laurent Excoffier; Heidi E L Lischer
Journal:  Mol Ecol Resour       Date:  2010-03-01       Impact factor: 7.090

3.  Population data for 21 autosomal STR loci (GlobalFiler kit) in two Mexican-Mestizo population from the northwest, Mexico.

Authors:  G Martínez-Cortés; F Zuñiga-Chiquette; A S Celorio-Sánchez; E Ruiz García; A B Antelo-Figueroa; V Dalpozzo-Valenzuela; A Valenzuela-Coronado; H Rangel-Villalobos
Journal:  Int J Legal Med       Date:  2018-10-17       Impact factor: 2.686

4.  Analysis of 21 autosomal STRs in Saudi Arabia reveals population structure and the influence of consanguinity.

Authors:  Yahya M Khubrani; Jon H Wetton; Mark A Jobling
Journal:  Forensic Sci Int Genet       Date:  2018-12-21       Impact factor: 4.882

5.  Population data of 21 autosomal STR loci in the Hausa, Igbo and Yoruba people of Nigeria.

Authors:  Victoria O Okolie; Selena Cisana; Moses S Schanfield; Khalid O Adekoya; Olufemi A Oyedeji; Daniele Podini
Journal:  Int J Legal Med       Date:  2017-11-04       Impact factor: 2.686

6.  Assessing the forensic efficiency of the GlobalFiler STR loci among the genetically isolated Chechen subpopulation in Jordan.

Authors:  Laith N Al-Eitan; Nizar N Darwish; Nancy M Hakooz; Rana B Dajani
Journal:  Gene       Date:  2019-08-29       Impact factor: 3.688

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.