Literature DB >> 32195283

Evaluation of ten SNP Markers for Human Identification and Paternity Analysis in Persian Population.

Sajad Habibi1, Amirhossein Ahmadi2, Mehrdad Behmanesh3, Ali Miri1, Mahmood Tavallaie1.   

Abstract

BACKGROUND: DNA markers are inevitable tools of human identification in forensic science. Single Nucleotide Polymorphisms (SNPs) are one category of these markers which is concerned to use especially in the case of degraded DNA because of their short amplicons.
OBJECTIVES: Detection of highly informative SNPs by the criteria is the essential step to develop a useful panel of SNP markers. The purpose of this work is to get high informative SNPs for human identification in Persian ethnic of the Iranian population.
MATERIAL AND METHODS: Genotype and allele frequencies of 10 SNPs from the SNPforID browser were determined by a PCR-RFLP method on 100 samples that was taken from 100 unrelated Persian people.
RESULTS: These ten SNPs were in Hardy-Weinberg equilibrium (P value > 0.1) except rs1355366 (P value = 0.02) and Heterozygosity of seven SNPs is greater than 0.45 but minor allele frequency of only four SNPs is more than 0.45. According to criteria only three SNPs rs1454361, rs2111980 and rs2107612 can pass all standards and are highly informative in population for forensic uses.
CONCLUSIONS: Our data showed that the CPI (Combined probability of Identity) and CPE (Combined Power of Exclusion) for ten SNPs are 1.13 E-04 and 0.809 respectively. It was also showed based on the criteria only three SNPs (rs2107612, rs1454361 and rs2111980) are highly informative in Persian population. If we can find 39 SNPs with PE and PI close to PE and PI of these three SNPs (rs2107612, rs1454361 and rs2111980), we will be able to use of these 39 SNPs in human identification with sufficient power of discrimination. Copyright:
© 2019 The Author(s); Published by National Institute of Genetic Engineering and Biotechnology.

Entities:  

Keywords:  Forensic Anthropology; Forensic Sciences; Polymorphism, Single Nucleotide

Year:  2019        PMID: 32195283      PMCID: PMC7080969          DOI: 10.29252/ijb.2148

Source DB:  PubMed          Journal:  Iran J Biotechnol        ISSN: 1728-3043            Impact factor:   1.671


1. Background

Currently, DNA markers are common tools in paternity tests and forensic genetics ( 1 ). Short tandem repeats (STRs) as a class of well-known markers have been widely applied in forensic laboratories due to their high variability and high power of discrimination ( 2 ). However, degraded samples which recovered from disaster victims or specimens from fires, explosions and airplane crashes make STRs unsuitable for human identification due to their long amplicons ( 3 ). Single nucleotide polymorphisms (SNPs) as the most frequent DNA sequence variations can be genotyped via short amplicons ( 4 ). However, bi-allelic SNPs are less informative than STRs, and then, more SNPs needs to achieve the same level of discrimination afforded with the STR loci used in forensic science laboratories. Evidence showed that to obtain the same power of discrimination as 13 STRs, a panel of at least 50-100 autosomal SNP loci would be required ( 5 ). Using of highly informative SNPs with maximum power of discrimination provide a possible way to decrease the number of SNPs and overcome these limitations ( 6 ). Until now, several SNP panels have been reported to be useful for forensic tests. SNPforID browser is a database which introduces 52 SNPs with low fixation index (Fst) in different population for forensic application ( 7 ). Since SNP informativeness may vary significantly between populations, it is necessary to determine allele frequencies and forensic characteristics of these SNPs in each population ( 8 ).

2. Objectives

The aim of this study is to find characteristics of 10 SNPs of SNPforID database for paternity tests and find out which of them are highly informative for forensic purposes in Persian population.

3. Materials and Methods

3.1. Sample Collection

100 unrelated Persian blood samples comprised of 50 males and 50 females were collected. Written informed consent was obtained from all subjects in accordance with the declaration of Helsinki.

3.2. Genomic DNA Extraction

5 mL of peripheral blood were collected from each volunteer in test tubes containing 0.5M EDTA and DNA was extracted using DNPTM Kit (Cinnagen, Iran). Briefly, lysis solution was used to lyse blood cells and then genomic DNA from white cells selectively precipitated with isopropanol. The precipitated DNA was washed and desalted with ethanol and dissolved in TE buffer and stored in -20°C. The quantity and quality of extracted DNA were examined spectrophotometrically or visually by electrophoresis on 1% agarose gel.

3.3. Selection of Highly Informative SNP Candidates

SNP data in the SNPforID browser http://spsmart.cesga.es/snpforid.php were employed in the present study. In order to select 10 highly informative SNP candidates, the allelic frequencies were used for screening. As markers with even allelic distributions have high observed heterozygosity and are more informative, three of the SNPs were selected with a common 45:55– 50:50 allelic distributions in the Middle East population where Iran is located there. All of the selected markers are located on autosomal chromosomes (Table 1).
Table 1

Primers and characteristics of amplified human genomic DNA segments containing selected SNP markers

NCBI SNP cluster IDPCR product (bp)Annealing temperaturePrimersChromosomal locationChromosome
rs210761237060 ◦C5’- TCAGGGAGGAATAAACATTACAGG-3’5’- GTCACAGCAACATAACCATATTAC-3’888,32012
rs145436141260 ◦C5’-ACTCTAGTGACATAGCCTCCAGTG-3’5’-TTGGAAGGACAATGAAGTTGCACG-3’25,850,832 14
rs211198030560 ◦C5’-CTCTTAACCTCCCTCCCTTGCCTG-3’5’-CTCTTCCTTCCGCCCCACTCCAAC-3’106,328,25412
rs135536640060 ◦C5’-CCGTCTCTTCAGGAACAGTAGAAC-3’5’-CAGGTGTGACAGCCTAGTTCTG-3’190,806,1083
rs25193423060 ◦C5’-ACTGACCCTTGCAGAGAACTGACC-3’5’-CCCAAGGTCCATCATTGGCTGT-3’174,778,6785
rs102852861458 ◦C5’-TCCTCAGCAAGAATCCCATTAGG-3’5’-TCAGTACAACCCTGCAAGATAGATG-3’48,362,29022
rs133587347660 ◦C5’-GGAATGGGTCAGGTCGAAGGTC-3’5’-AAGAACAGGAGCGTCAGAAACAG-3’20,901,72413
rs197925566660 ◦C5’-AAGAGAAGATACAGAGGCATTTCAG-3’5’-CTCCTTGACATCCCAAAAGCATACC-3’190,318,0804
rs91711871160 ◦C5’-CTGATTTGTTCTAGTGGCAGCGTTC-3’5’-GTGTCCAGCAAGAGAGAGATTTTCC-3’4,457,0037
rs141321265558 ◦C5’-AGTGTTAAGTGATTTGCCCTATGCC-3’5’-CACAACACCTAAGACTTGCTTTCAG-3’24,806,7971
Primers and characteristics of amplified human genomic DNA segments containing selected SNP markers

3.4. SNP Genotyping

Ten SNPs was determined by Polymerase Chain Reaction-restriction Fragment Length Polymorphism (PCR-RFLP). Briefly, the fragment which contains SNP were amplified to form PCR products as below: initial denaturation at 95℃ for 5min, 35 cycles of 30s at 95℃, 30 s at 58 or 60℃, 30 Sec at 72℃ and a final extension step of 5min at 72℃. PCR products were digested with appropriate enzymes at 37℃ for overnight, according to manufacturer’s instruction and then analyzed on 2% agarose gel (Table 2).
Table 2

Restriction enzymes which were used for genotyping by RFLP-PCR method

SNPEnzymeRestriction siteCut AlleleFragment size after digestion (bp)
rs2107612 (A/G)XmiI GTMKACA205 (165)
rs1454361 (A/T)XceI RCATGYA212 (200)
rs2111980 (A/G)SacI GAGCTCG200 (105)
rs1355366 (A/G)FaqI GGGAC(10/14N)G216(184)
rs251934 (C/T)XmiI GTMKACC144(90)
rs1028528 (A/G)DraIII CACNNNGTGA340(274)
rs1335873 (A/T)XmiI GTMKACT256(220)
rs1979255 (C/G)AlwI GGATCC406(260)
rs917118 (C/T)HgaI GACGCC431(280)
rs1413212 (T/C)BsmFI GTCCCT437(218)
Restriction enzymes which were used for genotyping by RFLP-PCR method

3.5. Statistical Analyses

The gene count method was used to calculate observed allele frequencies and the observed genotypes at each of the 10 SNP loci. probability of Identity (PI), Power of Exclusion (PE), Power of Discrimination (PD) and were calculated by PowerStatsV12.xls software, which is freely available on the net ( 9 ). Fisher’s exact tests using the OEGE program were performed to evaluate compliance with the Hardy– Weinberg equilibrium ( 10 ). The following SNP selection criteria were used to choose highly informative SNPs: ( 1 ) (MAF) > 0.45 in all ethnic groups, ( 2 ) heterozygosity > 0.45, ( 3 ) Hardy–Weinberg equilibrium (HWE) P value > 0.1, and ( 4 ) physical distance between SNP markers > 50 Mb ( 11 ).

4. Results

4.1 Genotype and Allele Frequencies of SNPs

Genotype and allele frequencies of SNP candidates in Persian population were calculated based on the gene count method (Table 3). All SNPs except for rs1355366 are in the Hardy-Weinberg Equilibrium (HWE) with the p-value greater than 0.1 so one marker (rs1355366) is not consistent with HWE in population under study. Heterozygosity of seven SNPs is greater than 0.45 but minor allele frequency of only four SNPs is more than 0.45 (Of course, rs1454361is very close to (MAF) > 0.45). Altogether according to four criteria only three SNPs of rs1454361, rs2111980 and rs2107612 can pass all standards and are highly informative in population for forensic uses.
Table 3

Paternity index, probability of exclusion, genotype and allele frequencies of ten SNPs in Persian population

Allele FrequencyGenotype Frequency
Middle EastPersian
NCBI SNP cluster IDPIPEPDAllele 1Allele 2Allele 1Allele 2Allele 1Allele 2Homo allele 1Het allele 1,2Homo allele 2HWE p. value
rs21076120.3910.2150.609GA0.330.670.4950.5050.230.530.240.54
rs14543610.3990.2160.601TA0.540.460.4430.5570.180.530.290.46
rs21119800.3620.1580.638AG0.480.510.5350.4650.30.460.240.44
rs13553660.3420.1020.658AG0.620.380.550.450.360.380.260.02
rs2519340.4390.1400.561TC0.580.4229710.070.440.490.49
rs10285280.4650.1130.535GA0.640.3625750.05400.550.5
rs13358730.4220.1470.578AT0.380.620.3150.6850.090.450.460.66
rs19792550.430.1540.57GC0.290.710.310.690.080.460.460.45
rs9171180.3650.140.635TC0.590.410.4250.5750.20.450.350.42
rs14132120.4330.1330.567GA0.310.690.6450.3550.380.530.090.11
Total PI1.13 E-04
Total PE0.809000
Paternity index, probability of exclusion, genotype and allele frequencies of ten SNPs in Persian population

4.2 Calculation of Combined PI and PE of Ten SNPs in Persian Population

PI and PE were calculated for each of candidate SNPs. With 10 autosomal SNPs combined PI (CPI) and combined PE (CPE) in Persian population was calculated as 1.13×E-04 and 0.80900 respectively (Table 3).

5. Discussion

Since geneticists discovered polymorphic structures in human genomes, several genetic tests have been developed to utilize this feature in paternity tests and very soon became profitable tools for forensic applications ( 12 ). Although STRs are widely employed in genetic tests as suitable tools; SNPs have some advantages in comparison to STRs. Short amplicon size, low mutation rates and genotyping via high-throughput technologies are some of their advantages ( 13 , 14 ). A panel of 52 SNPs which are presented in SNPforID browser is suggested to be enough informative and has the same power of discrimination as 15 STRs ( 15 , 16 ). However, the SNP informativeness mayvary significantly between populations, so, genotype and allele frequencies of 10 SNPs of this database were calculated in Persian population. As shown in (Table 3) some of these SNPs have different distributions in the Persian population in comparison with those reported for Middle East population in SNPforID browser ( 7 ). These differences could alter forensic parameters of SNPs for forensic application ( 8 ). Based on the allele frequencies of 10 SNPs, CPI was calculated as 1.13E-04. This is very more than 2.2E-17 which are supplied with 15 STR loci in the Persian population, so we should use of vey more SNPs for equal CPI with 15 STR but if selected SNPs are high informative by criteria we can use of fewer number of SNPs. This is same abut CPE that for equal power of exclusion with 15 STRs we should use very more SNP markers ( 17 ). These data mentioned that addition of more appropriate SNPs must be necessary to increase the power of discrimination. Gill reported that 50–80 loci are taken to reach the same discrimination level of 16 STRs loci ( 18 ). The addition of more SNPs, however, is associated with some restrictions. Production and commercialization of robust assays containing large numbers of oligonucleotide PCR primers will not be trivial ( 3 ). Furthermore, examining more loci will be more expensive and time-consuming; therefore, introducing highly informative SNPs is the unavoidable direction in order to decrease the number of SNPs for forensic uses ( 11 ). Since most SNPs are biallelic, the maximum power of discrimination is taken by a combination of SNPs with a nearly the same distribution between two alleles ( 19 ). Based on this procedure, Lee et al introduced 24 highly informative SNPs in Korean population with a PI and PE of 1.9E-10 and 0.989 respectively which is corresponding to nine STR loci ( 8 ). Kim et al suggested that a highly polymorphic set of about 40 SNP markers with MAF> 0.45 and the even allelic frequency will have nearly the same discrimination power for human identification as would a set of 16 STR markers in Korean population ( 11 ).

6. Conclusions

Here in Persian population, it is shown that three out of ten SNPs (rs1454361, rs2111980 and rs2107612) are highly informative based on criteria. With the average PI (0.387) and PE (0.197) of these three SNPs, it is estimated that 39 SNPs have CPI and CPE of 8.33E-17 and 0.9998 respectively, which is comparable to 15 STRs in Persian population with a CPI of 2.2E-17 and a CPE of 0.9999 ( 17 ). These numbers of SNPs is less than that 50-80 reported to be necessary for human identification. Altogether, although it is a preliminary study with a small number of SNPs, the outcomes of this study help forensic labs to define a list of common highly informative SNPs which are applicable in all populations.
  18 in total

1.  An assessment of the utility of single nucleotide polymorphisms (SNPs) for forensic purposes.

Authors:  P Gill
Journal:  Int J Legal Med       Date:  2001       Impact factor: 2.686

2.  Selection of twenty-four highly informative SNP markers for human identification and paternity analysis in Koreans.

Authors:  Hwan Young Lee; Myung Jin Park; Ji-Eun Yoo; Ukhee Chung; Gil-Ro Han; Kyoung-Jin Shin
Journal:  Forensic Sci Int       Date:  2005-03-10       Impact factor: 2.395

3.  Developing a SNP panel for forensic identification of individuals.

Authors:  Kenneth K Kidd; Andrew J Pakstis; William C Speed; Elena L Grigorenko; Sylvester L B Kajuna; Nganyirwa J Karoma; Selemani Kungulilo; Jong-Jin Kim; Ru-Band Lu; Adekunle Odunsi; Friday Okonofua; Josef Parnas; Leslie O Schulz; Olga V Zhukova; Judith R Kidd
Journal:  Forensic Sci Int       Date:  2005-12-19       Impact factor: 2.395

4.  Resolving relationship tests that show ambiguous STR results using autosomal SNPs as supplementary markers.

Authors:  C Phillips; M Fondevila; M García-Magariños; A Rodriguez; A Salas; A Carracedo; M V Lareu
Journal:  Forensic Sci Int Genet       Date:  2008-04-18       Impact factor: 4.882

5.  The SNPforID browser: an online tool for query and display of frequency data from the SNPforID project.

Authors:  Jorge Amigo; Christopher Phillips; Maviky Lareu; Angel Carracedo
Journal:  Int J Legal Med       Date:  2008-05-20       Impact factor: 2.686

6.  Development of SNP-based human identification system.

Authors:  Jae-Jung Kim; Bok-Ghee Han; Hae-In Lee; Han-Wook Yoo; Jong-Keuk Lee
Journal:  Int J Legal Med       Date:  2010-03       Impact factor: 2.686

7.  SNP variation with latitude: Analysis of the SNPforID 52-plex markers in north, mid-region and south Chilean populations.

Authors:  F Moreno; A Freire-Aradas; C Phillips; M Fondevila; Á Carracedo; M V Lareu
Journal:  Forensic Sci Int Genet       Date:  2014-01-07       Impact factor: 4.882

8.  Development of a SNP set for human identification: A set with high powers of discrimination which yields high genetic information from naturally degraded DNA samples in the Thai population.

Authors:  Hathaichanoke Boonyarit; Surakameth Mahasirimongkol; Nuttama Chavalvechakul; Masayuki Aoki; Hanae Amitani; Naoya Hosono; Naoyuki Kamatani; Michiaki Kubo; Patcharee Lertrit
Journal:  Forensic Sci Int Genet       Date:  2014-03-27       Impact factor: 4.882

9.  STRs vs. SNPs: thoughts on the future of forensic DNA testing.

Authors:  John M Butler; Michael D Coble; Peter M Vallone
Journal:  Forensic Sci Med Pathol       Date:  2007-09-12       Impact factor: 2.007

10.  Hardy-Weinberg equilibrium testing of biological ascertainment for Mendelian randomization studies.

Authors:  Santiago Rodriguez; Tom R Gaunt; Ian N M Day
Journal:  Am J Epidemiol       Date:  2009-01-06       Impact factor: 4.897

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.