| Literature DB >> 31113963 |
Fanny Chu1,2, Katelyn E Mason1, Deon S Anex3, A Daniel Jones2,4, Bradley R Hart1.
Abstract
Human hair contains minimal intact nuclear DNA for human identification in forensic and archaeological applications. In contrast, proteins offer a pathway to exploit hair evidence for human identification owing to their persistence, abundance, and derivation from DNA. Individualizing single nucleotide polymorphisms (SNPs) are often conserved as single amino acid polymorphisms in genetically variant peptides (GVPs). Detection of GVP markers in the hair proteome via high-resolution tandem mass spectrometry permits inference of SNPs with known statistical probabilities. To adopt this approach for forensic investigations, hair proteomic variation and its effects on GVP identification must first be characterized. This research aimed to assess variation in single-inch head, arm, and pubic hair, and discover body location-invariant GVP markers to distinguish individuals. Comparison of protein profiles revealed greater body location-specific variation in keratin-associated proteins and intracellular proteins, allowing body location differentiation. However, robust GVP markers derive primarily from keratins that do not exhibit body location-specific differential expression, supporting GVP identification independence from hair proteomic variation at the various body locations. Further, pairwise comparisons of GVP profiles with 8 SNPs demonstrated greatest interindividual variation and high intraindividual consistency, enabling similar differentiative potential of individuals using single hairs irrespective of body location origin.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31113963 PMCID: PMC6529471 DOI: 10.1038/s41598-019-44007-7
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Comparison of numbers of identified (a) proteins, (b) unique peptides, (c) amino acids, and missense SNPs inferred from (d) major and (e) minor GVPs at different body locations. Black lines represent statistically significant comparisons and significance levels are represented as: p ≤ 0.05 (*), p ≤ 0.01 (**), and p ≤ 0.001 (***). Pubic hair samples yield statistically greater numbers of proteins, peptides, amino acids, and inferred SNPs (two-way ANOVA and Tukey HSD; n = 36).
Figure 2Average abundances for a subset of differentially expressed hair proteins at different body locations (two-way ANOVA and Tukey HSD; n = 36). Error bars represent standard deviation from 4 replicate measurements of each of three individuals. Black lines represent statistically significant comparisons and significance levels are represented as: p ≤ 0.05 (*), p ≤ 0.01 (**), and p ≤ 0.001 (***).
Figure 3Comparison and distribution of exome-proteome consistent SNPs across different body locations. (a) Distribution of inferred consistent SNPs across the three body locations for major and minor GVPs, respectively. (b) Summary of the number of consistent SNPs at each body location. (c) Comparison of differentially expressed proteins to proteins of 11 SNPs with unreliable identifications at one or two body locations (i.e., not identified at all body locations). The majority of exome-proteome consistent SNPs identified at each body location are identified in all samples. Unreliably identified SNPs at either one or two body locations originate from a set of proteins that are not differentially expressed; there is no overlap between these sets of proteins. Therefore, SNPs are not body location-specific.
SNP and GVP candidates for GVP panel.
| Gene | SNP Identifier | Amino Acid Polymorphism | GVP Type | Peptide† | PTM | Average Abundance | Observation Frequency |
|---|---|---|---|---|---|---|---|
| FAM83H | rs9969600 | Q/H | Minor | R.VNL | 1.10 × 106 | 2 | |
| KRT32 | rs2071563 | T/M | Major | R.ARLEGEIN | A1:Formylation | 5.72 × 107 | 27 |
| KRT32 | rs2071563 | T/M | Major | R.LEGEIN | 1.71 × 108 | 28 | |
| KRT32 | rs2071563 | T/M | Minor | R.ARLEGEIN | M9:Oxidation (M) | 4.79 × 107 | 10 |
| KRT32 | rs2071563 | T/M | Minor | R.LEGEIN | M7:Oxidation (M) | 1.69 × 107 | 7 |
| KRT33A | rs148752041 | D/H | Minor | R. | N2:Deamidation (NQ) | 4.38 × 107 | 8 |
| KRT33A | rs148752041 | D/H | Minor | R. | 1.11 × 108 | 7 | |
| KRT33B | 17:g.41366553 G > T * | L/M | Major | R.ILDE | C8:Carbamidomethylation;N29:Deamidation (NQ) | 9.11 × 107 | 12 |
| KRT33B | 17:g.41366553 G > T * | L/M | Major | R.ILDE | C8:Carbamidomethylation | 1.49 × 107 | 11 |
| KRT33B | 17:g.41366553 G > T * | L/M | Minor | R.ILDE | C8:Carbamidomethylation | 7.10 × 107 | 10 |
| KRT33B | 17:g.41366553 G > T * | L/M | Minor | R.RILDE | C9:Carbamidomethylation | 2.20 × 107 | 7 |
| KRT81 | rs2071588 | G/R | Minor | R.GLTGGFGSHSVC | C12:Carbamidomethylation | 3.35 × 108 | 19 |
| KRT81 | rs2071588 | G/R | Minor | L.TGGFGSHSVC | C10:Carbamidomethylation | 1.18 × 107 | 13 |
| KRT83 | rs2852464 | I/M | Major | R.DLNMDC | C6:Carbamidomethylation | 1.08 × 108 | 32 |
| KRT83 | rs2852464 | I/M | Major | R.DLNMDC | K12:Carbamylation | 1.83 × 108 | 23 |
| KRT83 | rs2852464 | I/M | Minor | R.DLNMDC | C6:Carbamidomethylation | 2.97 × 107 | 19 |
| KRT83 | rs2852464 | I/M | Minor | R.DLNMDC | C6:Carbamidomethylation;M7:Oxidation (M) | 4.29 × 106 | 16 |
| KRTAP10-3 | rs233252 | C/Y | Minor | R.ST | C4:Carbamidomethylation;C10:Carbamidomethylation | 2.97 × 106 | 4 |
| KRTAP10-3 | rs233252 | C/Y | Minor | R.ST | C4:Carbamidomethylation | 1.65 × 106 | 2 |
| KRTAP10-9 | rs9980129 | R/C | Minor | C.CAPTSS | C1:Carbamidomethylation;C7:Carbamidomethylation; C12:Carbamidomethylation | 6.99 × 106 | 10 |
| KRTAP4-11 | rs760092771 | S/C | Major | R.TTYCRPSCCVS | C4:Carbamidomethylation;C8:Carbamidomethylation; C9:Carbamidomethylation | 1.75 × 108 | 5 |
| KRTAP4-11 | rs760092771 | S/C | Minor | R.TTYCRPSYSVS | C4:Carbamidomethylation;C12:Carbamidomethylation; C13:Carbamidomethylation | 1.32 × 108 | 12 |
| KRTAP4-11 | rs760092771 | S/C | Minor | R.TTYCRPSYSVS | C4:Carbamidomethylation;C12:Carbamidomethylation | 8.10 × 107 | 11 |
| KRTAP4-11 | rs763737606 | C/S | Major | R.TTYCRPSC | C4:Carbamidomethylation;C8:Carbamidomethylation; C9:Carbamidomethylation | 1.75 × 108 | 5 |
| KRTAP4-11 | rs763737606 | C/S | Minor | R.TTYCRPSY | C4:Carbamidomethylation;C12:Carbamidomethylation; C13:Carbamidomethylation | 1.32 × 108 | 12 |
| KRTAP4-11 | rs763737606 | C/S | Minor | R.TTYCRPSY | C4:Carbamidomethylation;C12:Carbamidomethylation | 8.10 × 107 | 11 |
| KRTAP4-11 | rs774046661 | C/Y | Major | R.TTYCRPS | C4:Carbamidomethylation;C8:Carbamidomethylation; C9:Carbamidomethylation | 1.75 × 108 | 5 |
| KRTAP4-11 | rs774046661 | C/Y | Minor | R.TTYCRPS | C4:Carbamidomethylation;C12:Carbamidomethylation; C13:Carbamidomethylation | 1.32 × 108 | 12 |
| KRTAP4-11 | rs774046661 | C/Y | Minor | R.TTYCRPS | C4:Carbamidomethylation;C12:Carbamidomethylation | 8.10 × 107 | 11 |
| VSIG8 | rs62624468 | V/I | Major | R.LGCPY | C3:Carbamidomethylation;N15:Deamidation (NQ) | 1.36 × 107 | 18 |
| VSIG8 | rs62624468 | V/I | Major | R.LGCPY | C3:Carbamidomethylation;N15:Deamidation (NQ);M22:Oxidation (M) | 6.88 × 106 | 14 |
| VSIG8 | rs62624468 | V/I | Minor | R.LGCPY | C3:Carbamidomethylation;N15:Deamidation (NQ);M22:Oxidation (M) | 4.92 × 106 | 5 |
| VSIG8 | rs62624468 | V/I | Minor | R.LGCPY | C3:Carbamidomethylation;N15:Deamidation (NQ);M22:Oxidation (M) | 2.90 × 106 | 3 |
*No SNP identifier associated with SNP (HGVS notation used); Bold text denotes location of amino acid variant in genetically variant peptide; †Preceding amino acid in peptide sequence denoted by “X.”
Figure 4GVP profiles of 36 samples using observed phenotype frequency to represent the presence or absence of major and minor GVPs at 8 SNP loci. Profiles within an individual are similar, indicating consistent identification of SNPs with robust GVPs.
Figure 5(a) Average number of GVP profile differences from different pairwise comparison categories compared to (b) expected number of GVP profile differences. Error bars represent the standard deviation. All but two comparisons, denoted by dotted line, are statistically significant (Kruskal-Wallis and Dunn tests; n = 630; p ≤ 3.80 × 10−6). The numbers of observed profile differences approximate expected GVP profile differences. Between Individual profile differences are statistically greater than Replicate and Within Individual profile differences.
Figure 6Experimentally observed random match probabilities (m ± 95% CI) compared to expected RMP values for each individual. Expected RMPs are theoretically-derived values based on the detection of all GVPs consistent with an individual’s genotype for the same 8 SNPs. RMP values of different body location samples from the same individual are not different; the extent to which individuals are distinguished from one another is not affected by hair origin. Observed RMP values from a robust set of SNPs approximate expected values within an order of magnitude.