| Literature DB >> 35176104 |
Sobiah Rauf1, Jeremy J Austin2, Denice Higgins2,3, Muhammad Ramzan Khan1.
Abstract
Massively parallel sequencing following hybridisation enrichment provides new opportunities to obtain genetic data for various types of forensic testing and has proven successful on modern as well as degraded and ancient DNA. A customisable forensic intelligence panel that targeted 124 SNP markers (67 ancestry informative markers, 23 phenotype markers from the HIrisplex panel, and 35 Y-chromosome SNPs) was used to examine biogeographic ancestry, phenotype and sex and Y-lineage in samples from different ethnic populations of Pakistan including Pothwari, Gilgit, Baloach, Pathan, Kashmiri and Siraiki. Targeted sequencing and computational data analysis pipeline allowed filtering of variants across the targeted loci. Study samples showed an admixture between East Asian and European ancestry. Eye colour was predicted accurately based on the highest p-value giving overall prediction accuracy of 92.8%. Predictions were consistent with reported hair colour for all samples, using the combined highest p-value approach and step-wise model incorporating probability thresholds for light or dark shade. Y-SNPs were successfully recovered only from male samples which indicates the ability of this method to identify biological sex and allow inference of Y-haplogroup. Our results demonstrate practicality of using hybridisation enrichment and MPS to aid in human intelligence gathering and will open many insights into forensic research in South Asia.Entities:
Mesh:
Year: 2022 PMID: 35176104 PMCID: PMC8853543 DOI: 10.1371/journal.pone.0264125
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Summary of steps in experimental lab workflow.
Detailed information of the 67 biogeographic ancestry SNPs included in the hybridisation enrichment panel.
| rs Number | Chr. No. | Position (GRCh37/hg19) | Ancestry Group | Reference |
|---|---|---|---|---|
| rs2139931 | 1 | 84590527 | OCE | [ |
| rs2814778 | 1 | 159174683 | AFR | [ |
| rs4657449 | 1 | 165465281 | EAS | [ |
| rs12142199 | 1 | 1249187 | EUR | [ |
| rs12402499 | 1 | 101528954 | AMR | [ |
| rs647325 | 1 | 18170886 | AMR | [ |
| rs2184030 | 1 | 206667441 | Tri-allelic | [ |
| rs16830500 | 2 | 152814129 | OCE | [ |
| rs3827760 | 2 | 109513601 | EAS | [ |
| rs10183022 | 2 | 237481969 | OCE | [ |
| rs820371 | 3 | 123404711 | EUR | [ |
| rs6437783 | 3 | 108172817 | EAS | [ |
| rs9809818 | 3 | 71480566 | OCE | [ |
| rs12498138 | 3 | 121459589 | AMR | [ |
| rs4683510 | 3 | 140285115 | EAS | [ |
| rs7623065 | 3 | 22385375 | OCE | [ |
| rs10012227 | 4 | 18637315 | AMR | [ |
| rs1229984 | 4 | 100239319 | EAS | [ |
| rs4540055 | 4 | 38803255 | Tri-allelic | [ |
| rs1509524 | 4 | 125455038 | OCE | [ |
| rs6875659 | 5 | 175158653 | AFR | [ |
| rs16891982 | 5 | 33951693 | EUR | [ |
| rs4704322 | 5 | 75822474 | EAS | [ |
| rs6886019 | 5 | 170245846 | OCE | [ |
| rs10455681 | 6 | 69802502 | OCE | [ |
| rs2080161 | 7 | 13331150 | AMR | [ |
| rs798949 | 7 | 120765954 | OCE | [ |
| rs1871534 | 8 | 145639681 | AFR | [ |
| rs2409722 | 8 | 11039816 | OCE | [ |
| rs7832008 | 8 | 98358246 | OCE | [ |
| rs2789823 | 9 | 136769888 | AFR | [ |
| rs10811102 | 9 | 1911291 | OCE | [ |
| rs10970986 | 9 | 32453278 | OCE | [ |
| rs16913918 | 9 | 3074359 | EUR | [ |
| rs7084970 | 10 | 119750413 | EUR | [ |
| rs4749305 | 10 | 28391596 | EUR | [ |
| rs2274636 | 10 | 27443012 | OCE | [ |
| rs174570 | 11 | 61597212 | AMR | [ |
| rs3751050 | 11 | 9091244 | OCE | [ |
| rs5030240 | 11 | 32424389 | Tri-allelic | [ |
| rs1924381 | 13 | 72321856 | EUR | [ |
| rs9522149 | 13 | 111827167 | EUR | [ |
| rs721367 | 13 | 95546650 | EAS | [ |
| rs730570 | 14 | 101142890 | EUR | [ |
| rs7151991 | 14 | 32635572 | AMR | [ |
| rs10483251 | 14 | 21671277 | AMR | [ |
| rs12434466 | 14 | 97324289 | EAS | [ |
| rs1834640 | 15 | 48392165 | EUR | [ |
| rs12594144 | 15 | 64161351 | EAS | [ |
| rs1426654 | 15 | 48426484 | EUR | [ |
| rs3784651 | 15 | 94925273 | OCE | [ |
| rs6494411 | 15 | 63835861 | EAS | [ |
| rs881929 | 16 | 31079371 | EAS | [ |
| rs17822931 | 16 | 48258198 | EAS | [ |
| rs16946159 | 16 | 48459558 | OCE | [ |
| rs4792928 | 17 | 42105174 | AMR | [ |
| rs8072587 | 17 | 19211073 | EUR | [ |
| rs9908046 | 17 | 53563782 | OCE | [ |
| rs1369290 | 18 | 67691520 | AFR | [ |
| rs310644 | 20 | 62159504 | AFR | [ |
| rs2069945 | 20 | 33761837 | Tri-allelic | [ |
| rs6054465 | 20 | 6673018 | OCE | [ |
| rs715605 | 22 | 30640308 | OCE | [ |
| rs1557553 | 22 | 44760984 | AMR | [ |
| rs8137373 | 22 | 41729216 | AMR | [ |
| rs4892491 | X | 73422412 | EAS | [ |
| rs11156577 | X | 153660041 | OCE | [ |
EUR informative SNP at rs16891982 is also included in the phenotype SNPs. Ancestry groups are: East Asian (EAS), African (AFR), EUR-European, Native American (AMR) and Oceanian (OCE). Tri-allelic SNPs are ancestry informative but also serve to monitor for contamination from more than 1 DNA donor.
Details of the 23 phenotype (hair and eye colour) SNPs included in the hybridisation enrichment panel.
SNP rs16891982 is also included in the ancestry SNPs.
| rs Number | Chr. No. | Position (GRCh37/hg19) | Reference |
|---|---|---|---|
| rs16891982 | 5 | 33951693 | [ |
| rs28777 | 5 | 33958959 | [ |
| rs4959270 | 6 | 457748 | [ |
| rs12203592 | 6 | 396321 | [ |
| rs683 | 9 | 12709305 | [ |
| rs1042602 | 11 | 88911696 | [ |
| rs1393350 | 11 | 89011046 | [ |
| rs12821256 | 12 | 89328335 | [ |
| rs2402130 | 14 | 92801203 | [ |
| rs12896399 | 14 | 92773663 | [ |
| rs12913832 | 15 | 28365618 | [ |
| rs1800407 | 15 | 28230318 | [ |
| rs1805005 | 16 | 89985844 | [ |
| rs1805006 | 16 | 89985918 | [ |
| rs2228479 | 16 | 89985940 | [ |
| rs11547464 | 16 | 89986091 | [ |
| rs1805007 | 16 | 89986117 | [ |
| rs201326893 | 16 | 89986122 | [ |
| rs1110400 | 16 | 89986130 | [ |
| rs1805008 | 16 | 89986144 | [ |
| rs885479 | 16 | 89986154 | [ |
| rs1805009 | 16 | 89986546 | [ |
| rs2378249 | 20 | 33218090 | [ |
Details of the 35 Y-chromosome SNPs included in the hybridisation enrichment panel.
| rs Number (mutation name) | Position (GRCh37/hg19) | Y-chr haplogroup | Reference |
|---|---|---|---|
| rs2032595 (M168) | 14813991 | CDEF | [ |
| rs3848982 (M145) | 21717208 | DE | [ |
| rs2032602 (M174) | 14954280 | D | [ |
| rs371443469 (V36) | 6814246 | E1b1b1a1b1a | [ |
| rs9306841 (M96) | 21778998 | E | [ |
| rs9786025 (P170) | 15021522 | E | [ |
| rs2032666 (M216) | 15437564 | C | [ |
| rs35284970 (M130) | 2734854 | C | [ |
| rs2032668 (M217) | 15437333 | C2 | [ |
| rs868363758 (M347) | 2877479 | C1b3b | [ |
| rs9786706 (U13) | 14698928 | G2a2b2a1a1a1 | [ |
| rs2032636 (M201) | 15027529 | G | [ |
| rs13447371 (M282) | 21764431 | H2 | [ |
| rs2032673 (M69) | 21894058 | H1a | [ |
| rs17250163 (P126) | 21225770 | IJ | [ |
| rs9341301 (M258) | 15023364 | I | [ |
| rs13447352 (M304) | 22749853 | J | [ |
| rs9341313 (M267) | 22741818 | J1 | [ |
| rs3900 (M9) | 21730257 | KLT | [ |
| rs3902 (M11) | 21730647 | L | [ |
| rs9341308 (M272) | 22738775 | T | [ |
| rs2033003 (M526) | 23550924 | K | [ |
| n/a (P308) | 15409573 | S | [ |
| n/a (P256) | 8685230 | M | [ |
| rs2032631 (M45) | 21867787 | QR | [ |
| rs8179021 (M242) | 15018582 | Q | [ |
| rs2032658 (M207) | 15581983 | R | [ |
| rs17250535 (M420) | 23473201 | R1a | [ |
| rs9786184 (M343) | 2887824 | R1b | [ |
| rs9786153 (M269) | 22739367 | R1b1a1b | [ |
| rs9786140 (M412) | 8502236 | R1b1a1b1a | [ |
| rs9341278 (M231) | 15469724 | N | [ |
| rs13447361 (M324) | 2821786 | O2a | [ |
| rs11575897 (M176) | 2655180 | O1b2 | [ |
| rs13447354 (M307) | 22750951 | O1a1a | [ |
*sub-haplogroup nomenclature taken from ISOGG 2018 version 13.256.
Fig 2Workflow highlighting main steps of sequencing data analysis pipeline.
Fig 3Y-chromosome haplogroup assignment on PhyloTree for sample R1.
Haplogroup assigned is R-M420. Derived SNPs: M168(C>T) →M9(C>G) →M526(A>C) →M45(G>A) →M207(A>G) → M420(T>A). Purple and green colour circles show ancestral and derived SNPs, respectively. Names on branches and leaves of tree represents SNP identifiers and haplogroup names, respectively.
Summary of Y-chromosome haplogroup results from twenty-one male samples.
| Sample | Sex | Self-declared ancestry | Region | Inferred Y-chr haplogroup | Continental Affiliation |
|---|---|---|---|---|---|
|
| Male | Asian | Asia | J-M267 | West Asia, North Africa |
|
| Male | Asian | Asia | H-M69 | South Asia |
|
| Male | Asian | Asia | R-M420 | C & E Europe, S & C-Asia |
|
| Male | Asian | Asia | J-M267 | West Asia, North Africa |
|
| Male | Asian | Asia | H-M69 | South Asia |
|
| Male | Asian | Asia | Q | N Asia, America |
|
| Male | Asian | Asia | H-M69 | South Asia |
|
| Male | Asian | Asia | J and J-M267 | Asia, Europe and West Asia |
|
| Male | Asian | Asia | L | Asia, Middle East, Europe |
|
| Male | Asian | Asia | R-M420 | C & E Europe, S & C-Asia |
|
| Male | Asian | Asia | R-M420 | C & E Europe, S & C-Asia |
|
| Male | Asian | Asia | R-M420 | C & E Europe, S & C-Asia |
|
| Male | Asian | Asia | R | C & E Europe, S & C-Asia |
|
| Male | Asian | Asia | L | Asia, Middle East, Europe |
|
| Male | Asian | Asia | R-M420 | C & E Europe, S & C-Asia |
|
| Male | Asian | Asia | R-M420 | C & E Europe, S & C-Asia |
|
| Male | Asian | Asia | J | Asia |
|
| Male | Asian | Asia | R-M420 | C & E Europe, S & C-Asia |
|
| Male | Asian | Asia | R | C & E Europe, S & C-Asia |
|
| Male | Asian | Asia | R-M420 | C & E Europe, S & C-Asia |
|
| Male | Asian | Asia | Q | N-Asia, America |
Inferred eye colour and most probable hair colour associated probabilities in terms of P-value for twenty-eight samples with known hair and eye colour using the HIrisplex SNPs in the custom enrichment panel.
| Sample | SELF-DECLARED | INFERRED PREDICTIONS | ||||
|---|---|---|---|---|---|---|
| Eye Colour | Hair Colour | Eye Colour (P-Value) | Hair Colour (P-Value) | Hair Shade (P-Value) | Most Probable Hair Colour | |
| B2 | Brown / Hazel | Brown | Brown (0.547) | Brown (0.629) | Dark (0.826) | Dark Brown |
| B4 | Blue | Brown | Blue (0.510) | Brown (0.720) | Dark (0.720) | Dark Brown |
| B5 | Blue/ Grey | Brown | Blue (0.783) | Brown (0.545) | Light (0.693) | Light Brown |
| B6 | Blue | Brown | Blue (0.649) | Brown (0.560) | Light (0.625) | Light Brown |
| G9 | Blue | Brown | Blue (0.510) | Brown (0.639) | Dark (0.77) | Dark Brown |
| Gil7 | Brown/ Hazel | Brown | Brown (0.547) | Brown (0.741) | Light (0.630) | Light Brown |
| Gil8 | Blue/ Grey | Brown | Blue (0.783) | Brown (0.545) | Light (0.693) | Light Brown |
| Gil9 | Blue | Brown | Blue (0.510) | Brown (0.639) | Dark (0.778) | Dark Brown |
| Gil11 | Blue | Brown | Blue (0.649) | Brown (0.545) | Light (0.693) | Light Brown |
| K1 | Blue | Brown | Blue (0.458) | Brown (0.566) | Dark (0.908) | Dark Brown |
| K3 | Brown | Brown | Brown (0.547) | Brown (0.639) | Dark (0.827) | Dark Brown |
| K4 | Brown | Brown | Brown (0.547) | Brown (0.741) | Light (0.630) | Light Brown |
| K7 | Blue/Grey | Brown | Blue (0.510) | Brown (0.629) | Dark (0.826) | Dark Brown |
| K8 | Blue | Brown | Blue (0.510) | Brown (0.629) | Dark (0.826) | Dark Brown |
| P9 | Blue | Brown | Blue (0.783) | Brown (0.545) | Light (0.693) | Light Brown |
| P11 | Blue/Grey | Brown | Blue (0.783) | Brown (0.483) | Light (0.665) | Light Brown |
| P12 | Blue | Brown | Blue (0.510) | Brown (0.639) | Dark (0.778) | Dark Brown |
| P14 | Blue | Brown | Blue (0.649) | Brown (0.545) | Light (0.693) | Light Brown |
| PT32 | Brown/ Black | Brown | Blue (0.510) | Brown (0.569) | Dark (0.683) | Dark Brown |
| PT45 | Brown | Brown | Brown (0.547) | Brown (0.639) | Dark (0.778) | Dark Brown |
| PT34 | Brown | Brown | Brown (0.547) | Brown (0.569) | Dark (0.683) | Dark Brown |
| PT39 | Brown/ Black | Brown | Brown (0.547) | Brown (0.547) | Dark (0.635) | Dark Brown |
| PT50 | Brown | Brown | Brown (0.547) | Brown (0.629) | Dark (0.826) | Dark Brown |
| R1 | Brown | Brown | Brown (0.547) | Brown (0.629) | Dark (0.826) | Dark Brown |
| R2 | Brown | Brown | Brown (0.547) | Brown (0.629) | Dark (0.826) | Dark Brown |
| R3 | Brown/ Hazel | Brown | Brown (0.547) | Brown (0.629) | Dark (0.826) | Dark Brown |
| R5 | Brown | Brown | Brown (0.547) | Brown (0.629) | Dark (0.826) | Dark Brown |
| R7 | Brown | Brown | Blue (0.510) | Brown (0.660) | Dark (0.613) | Dark Brown |
Inferred biogeographic ancestry using snipper for male and female samples under study.
| Sample | Self-declared ancestry | Region | Lowest and Highest Likelihood from Snipper |
|---|---|---|---|
|
| Asian | Asia | 10E+9 times more likely to be EUR than EAS and 10E+9 times more likely to be EUR than AMR |
|
| Asian | Asia | 10E+9 times more likely to be EUR than EAS and 10E+9 times more likely to be EUR than AFR |
|
| Asian | Asia | 10E+9 times more likely to be EUR than AFR and 10E+9 times more likely to be EUR than EAS |
|
| Asian | Asia | 10E+9 times more likely to be EUR than EAS and 10E+9 times more likely to be EUR than AMR |
|
| Asian | Asia | 10E+9 times more likely to be EUR than AMR and 10E+9 times more likely to be EUR than EAS |
|
| Asian | Asia | 10E+9 times more likely to be EUR than AMR and 10E+9 times more likely to be EUR than EAS |
|
| Asian | Asia | 10E+9 times more likely to be EUR than AMR and 10E+9 times more likely to be EUR than EAS |
|
| Asian | Asia | 10E+9 times more likely to be EUR than AMR and 10E+9 times more likely to be EUR than EAS |
|
| Asian | Asia | 10E+9 times more likely to be EUR than AFR and 10E+9 times more likely to be EUR than EAS |
|
| Asian | Asia | 10E+9 times more likely to be EUR than EAS and 10E+9 times more likely to be EUR than AFR |
|
| Asian | Asia | 2,236 times more likely to be EUR than EAS and 10E+9 times more likely to be EUR than AMR |
|
| Asian | Asia | 10E+9 times more likely to be EUR than EAS and 10E+9 times more likely to be EUR than AMR |
|
| Asian | Asia | 10E+9 times more likely to be EUR than EAS and 10E+9 times more likely to be EUR than AMR |
|
| Asian | Asia | 10E+9 times more likely to be EUR than EAS and 10E+9 times more likely to be EUR than AMR |
|
| Asian | Asia | 10E+9 times more likely to be EUR than EAS and 10E+9 times more likely to be EUR than AMR |
|
| Asian | Asia | 10E+9 times more likely to be EUR than AMR and 10E+9 times more likely to be EUR than EAS |
|
| Asian | Asia | 831 times more likely to be EUR than AMR, and 70,915,529 times more likely to be EUR than EAS |
|
| Asian | Asia | 10E+9 times more likely to be EUR than AMR and 10E+9 times more likely to be EUR than EAS |
|
| Asian | Asia | 10E+9 times more likely to be EUR than EAS and 10E+9 times more likely to be EUR than AMR |
|
| Asian | Asia | 10E+9 times more likely to be EUR than AFR and 10E+9 times more likely to be EUR than EAS |
|
| Asian | Asia | 10E+9 times more likely to be EUR than AMR and 10E+9 times more likely to be EUR than EAS |
|
| Asian | Asia | 10E+9 times more likely to be EUR than EAS and 10E+9 times more likely to be EUR than AMR |
|
| Asian | Asia | 10E+9 times more likely to be EUR than EAS and 10E+9 times more likely to be EUR than AMR |
|
| Asian | Asia | 10E+9 times more likely to be EUR than EAS and 10E+9 times more likely to be EUR than AMR |
|
| Asian | Asia | 10E+9 times more likely to be EUR than EAS and 10E+9 times more likely to be EUR than AMR |
|
| Asian | Asia | 10E+9 times more likely to be EUR than EAS and 10E+9 times more likely to be EUR than AMR |
|
| Asian | Asia | 10E+9 times more likely to be EUR than EAS and 10E+9 times more likely to be EUR than AMR |
|
| Asian | Asia | 10E+9 times more likely to be EUR than EAS and 10E+9 times more likely to be EUR than AMR |
Fig 4PCA plot for 67 biogeographic ancestry SNPs from 28 Pakistani samples and four global reference populations.
Black points represent study samples, also indicated using sample names. Continental reference population samples are shown in yellow (AFR), blue (EAS), green (EUR), pink (AMR), red (OCE) and green (EUR).