| Literature DB >> 19668368 |
Christopher Phillips1, Lourdes Prieto, Manuel Fondevila, Antonio Salas, Antonio Gómez-Tato, José Alvarez-Dios, Antonio Alonso, Alejandro Blanco-Verea, María Brión, Marta Montesino, Angel Carracedo, María Victoria Lareu.
Abstract
The 11-M Madrid commuter train bombings of 2004 constituted the second biggest terrorist attack to occur in Europe after Lockerbie, while the subsequent investigation became the most complex and wide-ranging forensic case in Spain. Standard short tandem repeat (STR) profiling of 600 exhibits left certain key incriminatory samples unmatched to any of the apprehended suspects. A judicial order to perform analyses of unmatched samples to differentiate European and North African ancestry became a critical part of the investigation and was instigated to help refine the search for further suspects. Although mitochondrial DNA (mtDNA) and Y-chromosome markers routinely demonstrate informative geographic differentiation, the populations compared in this analysis were known to show a proportion of shared mtDNA and Y haplotypes as a result of recent gene-flow across the western Mediterranean, while any two loci can be unrepresentative of the ancestry of an individual as a whole. We based our principal analysis on a validated 34plex autosomal ancestry-informative-marker single nucleotide polymorphism (AIM-SNP) assay to make an assignment of ancestry for DNA from seven unmatched case samples including a handprint from a bag containing undetonated explosives together with personal items recovered from various locations in Madrid associated with the suspects. To assess marker informativeness before genotyping, we predicted the probable classification success for the 34plex assay with standard error estimators for a naïve Bayesian classifier using Moroccan and Spanish training sets (each n = 48). Once misclassification error was found to be sufficiently low, genotyping yielded seven near-complete profiles (33 of 34 AIM-SNPs) that in four cases gave probabilities providing a clear assignment of ancestry. One of the suspects predicted to be North African by AIM-SNP analysis of DNA from a toothbrush was identified late in the investigation as Algerian in origin. The results achieved illustrate the benefit of adding specialized marker sets to provide enhanced scope and power to an already highly effective system of DNA analysis for forensic identification.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19668368 PMCID: PMC2719087 DOI: 10.1371/journal.pone.0006583
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Plot of ranked assignment probabilities obtained from cross validation of training sets.
Probabilities of case samples are shown on the right as black diamonds with reported ancestry assignments shown next to the values. (case sample numbers as listed in Materials and Methods). LR is a log scale of the likelihood ratio of Moroccan classification probability/Spanish classification probability, so values below the mid-line of 1 (i.e. balanced odds) are reciprocal values and the distance from the midline denotes higher probability of ancestry from one of the two populations. Red diamonds show four Moroccans misclassified as Spanish and two with balanced odds of 1, green diamonds six Moroccans exhibiting low probabilities compared to the population sample as a whole. The STRUCTURE analysis plot (K = 2) is aligned matching each component sample below.
Reclassification and cross validation analysis of training sets.
|
| European | North African |
| Spanish training set classified as: | 100% | 0% |
| Moroccan training set classified as: | 0% | 100% |
|
| European | North African |
| Spanish training set classified as; | 100% | 0% |
| Moroccan training set classified as: | 12% | 88% |
Cumulative divergence values for pairwise comparisons between nine European populations and two North African populations.
|
|
| |
| Orkney Islands | 1.380 | 1.532 |
| Spain | 1.078 | 1.428 |
| France | 1.056 | 1.421 |
| French Basque | 0.987 | 1.318 |
| Bergamo, Italy | 0.911 | 1.145 |
| Russia | 0.800 | 1.159 |
| Tuscany, Italy | 0.759 | 0.925 |
| Adygei | 0.614 | 0.773 |
| Sardinia | 0.538 | 0.863 |
|
| 0.056 | |
|
| 0.317 | |
Values below show within-population group divergence estimates.
Sardinian and Adygei CEPH-HGDP populations excluded.
Ancestry assignment probabilities from AIM-SNP analysis using a Bayes classifier (output: -log likelihoods), derived likelihood ratios (LR) and ancestries reported to the investigation.
| case sample | -log likelihood: North African | -log likelihood: European | LR North African | LR European | Ancestry assignment AIM-SNPs | Assessment of ancestry Y/mt loci | Ancestry assignment reported |
| Razor | 9.6948E-15 | 2.6870E-16 | 36.08 | - | - | - | |
| Handprint on bag | 2.4805E-18 | 4.259E-15 | 1,717 | European | European | European | |
| Toothbrush | 1.6811E-14 | 2.9613E-23 | 567,680,319 | N African | European | N African | |
| Blanket | 9.0947E-17 | 1.509E-18 | 60.27 | - | N African | - | |
| Hat | 2.4254E-15 | 5.2772E-23 | 45,959,585 | N African | N African | N African | |
| Scarf profile 1 | 2.2546E-20 | 2.4632E-21 | 9.15 | - | NT | - | |
| Scarf profile 2 | 1.3096E-19 | 1.0111E-27 | 129,513,282 | N African | NT | N African |
No assignments were made for the three samples with AIM-SNP assignment probabilities lower than 100× more likely to be North African
NT: not tested.
Figure 2Principal component analysis of case samples and the extended Spanish (black points, n = 127) and North African (red points, n = 240) training sets.
PC1 and PC2 % contributions shown in brackets.