| Literature DB >> 24453568 |
Sophie Verscheure1, Thierry Backeljau2, Stijn Desmyter3.
Abstract
The identification of dog hair through mtDNA analysis has become increasingly important in the last 15 years, as it can provide associative evidence connecting victims and suspects. The evidential value of an mtDNA match between dog hair and its potential donor is determined by the random match probability of the haplotype. This probability is based on the haplotype's population frequency estimate. Consequently, implementing a population study representative of the population relevant to the forensic case is vital to the correct evaluation of the evidence. This paper reviews numerous published dog mtDNA studies and shows that many of these studies vary widely in sampling strategies and data quality. Therefore, several features influencing the representativeness of a population sample are discussed. Moreover, recommendations are provided on how to set up a dog mtDNA population study and how to decide whether or not to include published data. This review emphasizes the need for improved dog mtDNA population data for forensic purposes, including targeting the entire mitochondrial genome. In particular, the creation of a publicly available database of qualitative dog mtDNA population studies would improve the genetic analysis of dog traces in forensic casework.Entities:
Keywords: Dog; Forensics; Mitochondrial DNA; Population study; Random match probability; Sampling strategy
Year: 2013 PMID: 24453568 PMCID: PMC3890688 DOI: 10.3897/zookeys.365.5859
Source DB: PubMed Journal: Zookeys ISSN: 1313-2970 Impact factor: 1.546
Figure 1.Position of the control region and its subregions within the Kim et al. (1998) reference dog mitochondrial genome.
Overview of the characteristics of sampling and sequence analysis in 58 canine mtDNA studies. Number of dogs sampled and, when specified in the publication, the number of dog breeds and mixed-breed, feral or village dogs in the sample; Origin of sample: new or extracted from previous studies as a comparison or to supplement the population sample (see reference numbers, except unpublished data by van Ash et al. (59); Koop et al. (60) and Shahid et al. (61)); Sampling region (or the geographic region of origin of included dog breeds if unclear from publication); Intention to avoid the inclusion of maternal relatives; GenBank accession numbers of new data are stated when applicable; un, unknown; s, skeletal remains of various age; ±, when variable, all sequences extracted from GenBank or the publication have this region in common; <, selected from this number of dogs from the same publication; * Larger region is mentioned in the publication, but only this part is available; Characteristics can differ from what is stated in publication if potential clerical errors were adapted, e.g. **publication states 246 instead of 233 as the sequences from reference 22 were included twice.
| Publication | Sample | Sequence analysis | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Reference | Aim of study | # Dogs | # Breeds | # Mixed- breed | Origin of sample | Sampling region | Avoid relatives | mtDNA region | Availability of new sequence data | |
| 1 | mtDNA variability study | 11 | 11 | 0 | new | the Netherlands | un | Repeat region* | Publication | |
| 2 | Phylogenetic breed study | 94 | 24 | 0 | new | Japan | un | 15458-16129, 16420-16727 | ||
| 3 | Forensic, Phylogenetic | 102 | 52 | 0 | new | Sweden | YES | 15431-15687 | Publication | |
| Population study | ||||||||||
| 4 | Phylogenetic population study | 34 | 24 | 0 | new | Japan, Korea, Mongolia, Indonesia | un | 15458-16130* | ||
| 5 | Phylogenetic population study | 140 | 67 | 5 | new | un | un | 15431-15687 | ||
| 15 | un | un | < 140 new | ± 15393-16076, ± 16508-49 | ||||||
| 6 | First complete dog mtGenome | 1 | 1 | 0 | new | Korea | YES | 1-16727 | ||
| 7 | Phylogenetic breed study | 74s | un | un | new | Japan | un | 15483-15679 | ||
| 84 | un | un | new | |||||||
| 94 | 24 | 0 | 2 | 15458-16129, 16420-16727 | ||||||
| 8 | Phylogenetic breed study | 19 | 1 | 0 | new | US, Mexico | YES | 15431-15687* | Publication | |
| 140 | 67 | 5 | 5 | un | 15431-15687 | |||||
| 9 | Phylogenetic population study | 41 | 30 | 0 | new | Switzerland | un | 15458-16000 | ||
| 9 | 0 | 9 | new | Italy | ||||||
| 10 | Phylogenetic breed study | 25 | 11 | 0 | new | Korea | NO | 15622-16030 | ||
| 11 | Forensic population study | 12 | 11 | 1 | new | Poland | un | 15431-15687 | ||
| 12 | Phylogenetic population study | 526 | un | un | new | Europe, Asia, Africa, Arctic America | un | ± 15458-16039 | ||
| 128 | 2, 4 | |||||||||
| 13 | Inheritable disorder study | 365 | 49 | un | new | Japan | un | 15458-16055 | ||
| 14 | Phylogenetic population study | 50 | un | un | new | France, Switzerland | un | ± 15519-15746 | ||
| 15 | un | un | new | France, Portugal | ||||||
| 15 | Forensic population study | 105 | un | un | new | UK | un | 15431-16030 | ||
| 246 | 2, 3, 9 | Japan, Switzerland, Italy, Sweden | ± 15458-15687 | |||||||
| 16 | Pereira et al. 2004 | Catalogue of published datasets | 58 | 1 | 0 | 59 | Portugal | un | 15458-16039 | Publication |
| 1089 | un | un | 2, 4, 10, 12, 13, 15 | Europe, Asia, Africa, Arctic America | ± 15622-16030 | |||||
| 17 | Phylogenetic population study | 22 | un | un | new | SE-Asia, India | un | 15458-16039 | ||
| 19s | un | un | new | Polynesia | 15458-15720 | Publication | ||||
| 654 | un | un | 2, 4, 12 | Europe, Asia, Africa, Arctic America | ± 15458-16039 | |||||
| 18 | Phylogenetic population study | 24 | 0 | 24 | new | India | un | 15443-15783 | ||
| 19 | Forensic, Phylogenetic | 35 | 19 | 9 | new | Germany | YES | 15458-16039 | ||
| 74 | 52 | 2 | new | Europe | ||||||
| Population study | 758 | un | un | 2, 4, 12, 15 | Europe, Asia, Africa, Arctic America | ± 15458-16030 | ||||
| 20 | Forensic population study | 348 | 88 | 45 | new | US | un | 15431-16085 | Not published | |
| 21 | Phylogenetic breed study | 143 | 4 | 0 | new | Portugal | YES | 15372-16083 | Publication | |
| 144 | 9 | 0 | 2, 4, 12, 13 | Europe, Asia, Africa, Arctic America | ± 15458-16030 | |||||
| 22 | Phylogenetic population study | 88 | 53 | 0 | new | Sweden | un | part of HV-I | Not published | |
| 14 | 13 | 0 | < 88 new | 1-16727 | ||||||
| 23 | Phylogenetic breed study | 143 | 11 | 0 | new | Portugal, Spain, Morocco | YES | 15211-16096 | ||
| 21 | 0 | 21 | new | Portugal, Azores, Tunisia | ||||||
| 24 | Phylogenetic breed study | 84 | 3 | 0 | new | Russia | un | 15458-15778 | ||
| 20 | 2 | 0 | 12 | Turkey | ± 15458-16039 | |||||
| 25 | Phylogenetic breed study | 100 | 20 | 0 | new | Sweden | un | 15431-15687 | Publication | |
| 26 | Forensic population study | 133 | 46 | 38 | new | Austria | un | 15458-16727 | Publication | |
| 27 | Forensic population study | 61 | 41 | 0 | new | US | un | 15455-16727 | ||
| Forensic breed study | 64 | 2 | 0 | new | (excluding | |||||
| 28 | Forensic population study | 83 | 30 | 0 | new | US | un | 15595-15654 | Publication | |
| 159 | 27, 30 | |||||||||
| 29 | Forensic population study | 96 | 79 | 0 | new | UK | un | 15458-16039 | Not published | |
| Forensic breed study | 15 | 1 | 0 | new | 15458-16131, 16428-16727 | |||||
| 30 | Forensic population study | 36 | 11 | 20 | new | US (California) | un | 15456-16063 | ||
| 22 | un | un | 60 | un | 15433-16139 | |||||
| 179 | 2, 4, 5, 6, 10, 27 | Europe, Asia, North-America | ± 15622-16030 | |||||||
| 31 | Phylogenetic breed study | 52 | 5 | 0 | new | Spain | un | 15458-16105 | ||
| 32 | Inheritable disorder study | 7 | 1 | 0 | new | Sweden | NO | 1-16727 | ||
| 33 | Phylogenetic population study | 309 | 0 | 309 | new | Egypt, Uganda, Namibia | YES | ± 15454-16075 | ||
| 17 | 0 | 17 | new | US (mostly Puerto Rico) | ||||||
| un | un | un | 12, 23 | East-Asia, Africa | ± 15458-16039 | |||||
| 34 | Forensic population study | 117 | 60 | 24 | new | Belgium | YES | 15458-16130, 16431-16727 | Not published | |
| 35 | Phylogenetic breed study | 114 | 2 | 0 | new | Turkey | YES | 15458-16039 | ||
| un | un | un | 12 | Europe, Asia, Africa | ||||||
| 36 | Phylogenetic population study | 907 | un | un | new, 61 | Old World, Arctic America | un | ± 15458-16039 | ||
| 669 | un | un | 2, 4, 6, 12, 22 | |||||||
| 135 | un | un | < 907 + 669 | 1-15511, 15535-16039, 16551-16727 | ||||||
| 34 | un | un | 6, 22, 61 | 1-16727 | ||||||
| 37 | Forensic population study | 427 | 139 | 118 | new | US | YES | ± 15458-16114, ± 16484-16727 | ||
| 125 | 27 | 15455-16727 | ||||||||
| 38 | Forensic population study | 64 | 43 | 11 | 37 | US | YES | ± 1-16129, ± 16434-16727 | ||
| 15 | 14 | 0 | 6, 22 | Korea, Sweden | 1-16727 | |||||
| 39 | Phylogenetic population study | 29 | un | un | new | Canada | un | 15361-15785 | ||
| 40 | Forensic population study | 220 | 0 | 220 | new | US | YES | 15456-16063 | ||
| 429 | 30, 37 | ± 15458-16063 | ||||||||
| 41 | Phylogenetic population study | 325 | un | un | new | Europe, SW-Asia | un | 15458-16039 | ||
| 1576 | un | un | 2, 4, 6, 12, 22, 36, 61 | Old World, Arctic America | ± 15458-16039 | |||||
| 42 | Phylogenetic population study | 200 | 0 | 200 | new | Middle East/SW-Asia | un | 15482-15867 | ||
| 231 | 0 | 231 | new | SE-Asia | ||||||
| 1576 | un | un | 2, 4, 6, 12, 22, 36, 61 | Old World, Arctic America | ± 15458-16039 | |||||
| 43 | Phylogenetic population study | 371 | 0 | 371 | new | the Americas | un | ± 15491-15755 | ||
| 29 | un | un | 39 | |||||||
| 44 | Phylogenetic population study | 280 | 33 | 0 | new | Europe, Arctic America, East-Asia | YES | 15458-16039 | ||
| 234 | 36 | ± 15458-16039 | ||||||||
| 45 | Point heteroplasmy pedigree study | 180 | 18 | 0 | new | Europe, Arctic America, East-Asia | NO | 15458-16039 | Publication | |
| 131 | 2 | 0 | new | |||||||
| 46 | Phylogenetic breed study | 77 | 26 | 0 | new | Germany | NO | 15458-16124 | Publication | |
| 34 | 1 | 0 | new | |||||||
| 47 | Phylogenetic breed study | 1 | 1 | 0 | new | China | YES | 1-16727 | ||
| 33 | un | un | 22, 32, 38, 61 | Sweden, US | ||||||
| 48 | Forensic species ID, Phylogenetic population study | 20 | 0 | 20 | new | Croatia | un | 15465-15744 | ||
| 49 | Validation of forensic analysis method | 41 | 29 | 3 | new | Belgium | un | ± 15458-16092, ± 16474-16703 | ||
| 550 | 27, 37 | US | ± 15458-16114, ± 16484-16727 | |||||||
| 50 | Phylogenetic breed study | 78 | 3 | 0 | new | Romania | YES | ± 15251-16068 | ||
| 51 | Forensic population study | 208 | 60 | 68 | new, 34 | Belgium | YES | 15458-16129, 16430-16727 | ||
| 778 | 15, 26, 27, 37 | UK, Austria, US | ± 15458-16030 | |||||||
| Forensic breed study | 107 | 6 | 0 | new | Belgium | 15458-16129, 16430-16727 | ||||
| 337 | 6 | 0 | < 208 new, 13, 19, 26, 27, 37 | Worldwide | ± 15458-16039 | |||||
| 52 | Phylogenetic breed study | 34 | 2 | 0 | new | Poland | NO | 15426-16085 | ||
| 53 | , | un | un | un | GenBank | Worldwide | ||||
| 54 | Forensic population study | 100 | 98 | 0 | new | US, Australia, Canada, Columbia, Uruguay | YES | ± 1-16129, ± 16430-16727 | ||
| 233** | un | un | 6, 22, 36, 38, 61 | Worldwide | ± 1-15511; ± 15535-16039; ± 16551-16727 | |||||
| 55 | Phylogenetic breed study | 47 | 1 | 0 | new | Tibet, surrounding areas | YES | ± 582 bp of control region | Not published | |
| 439 | un | un | GenBank | Worldwide | ||||||
| 56 | Phylogenetic population study | 305 | un | un | new | SE-Asia, E-Asia | YES | 15458-16039 | ||
| 350 | un | un | 4, 12, 36 | ± 15458-16039 | ||||||
| 1224 | un | un | 2, 4, 6, 12, 22, 36, 61 | Old World, Arctic America | ||||||
| 19s | un | un | 17 | Polynesia | 15458-15720 | |||||
| 57 | Phylogenetic population study | 20s | un | un | new | Alaska, Greenland | un | ± 367 bp of HV-I | ||
| 51 | 1 | 0 | new | Arctic America | ± 15580-16016 | |||||
| 78 | 2 | 0 | 2, 12, 36, 44 | ± 15458-16039 | ||||||
| 58 | Phylogenetic breed study | 324 | 5 | 0 | new | Canary Islands | YES | 15361-16086 | Publication | |
| 986 | un | un | 15, 26, 27, 34, 37, 51 | UK, Austria, Belgium, US | ± 15458-16030 | |||||
Comparison of haplotype number, PE and haplotypes with the 10 highest frequencies in selected dog mtDNA studies. Exclusion probability (PE) is based on the part of the control region studied in the publication (further details on exact region in Table 1) excluding the repeat region; characteristics can differ from publication if potential clerical errors were adapted; the 3 universally most frequent haplotypes are in bold (A11, B1 and A17); (×) US and a minority from Australia, Canada, Uruguay and Columbia; (××) Haplotype names are analogous to Savolainen et al. 2002, Savolainen et al. 2004, Angleby and Savolainen 2005, Pang et al. 2009, Klütsch et al. 2011a, Ardalan et al. 2011 and Oskarsson et al. 2012, or are in italic when unavailable and publication name was used; (×××) Haplotypes are based on 15458-16039 except for Wetton et al. 2003 (15458-16030). Only the Kim et al. (1998) reference nucleotide was considered in case of a heteroplasmic site.
| Population studies for forensic purposes | Breed studies for forensic or phylogenetic purposes | |||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Europe | US | |||||||||||||||||||||||
| Sampling region | UK | Germany | Austria | Belgium | US | US | US | US | US (×) | Japan | US | Canary Islands | ||||||||||||
| Studied part of control region (CR) | HV-I | HV-I | entire CR | HV-I+II | entire CR | HV-I | HV-I+II | HV-I | HV-I | HV-I+II | entire CR | HV-I | ||||||||||||
| # Dogs (# breeds/# mixed-breed) | 105(un/un) | 35(19/9) | 133 (46/38) | 208 (60/68) | 61(41/0) | 36(11/20) | 552 (139/118) | 649 | 100(98/0) | 94(24/0) | 64(2/0) | 324(5/0) | ||||||||||||
| # Haplotypes | 31 | 13 | 40 | 58 | 32 | 16 | 104 | 71 | 34 | 38 | 13 | 16 | ||||||||||||
| Exclusion probability | 0.93 | 0.86 | 0.93 | 0.92 | 0.93 | 0.89 | 0.96 | 0.92 | 0.91 | 0.93 | 0.8 | 0.86 | ||||||||||||
| Haplotypes with 10 highest frequency estimates (%) (××) (×××) | 14.2 | 28.6 | 18.0 | 16.8 | 14.8 | 16.7 | 15.6 | 18.0 | 17.0 | A18 | 13.8 | A33 | 29.7 | 25.9 | ||||||||||
| A18 | 11.4 | 14.3 | 12.0 | 15.4 | 13.1 | 13.9 | 10.9 | 12.6 | 15.0 | A68 | 10.6 | A16 | 28.1 | 14.5 | ||||||||||
| 10.5 | 14.3 | 12.0 | 15.4 | A18 | 9.8 | A16 | 13.9 | 10.3 | 11.7 | A18 | 14.0 | C3 | 10.6 | 26.6 | A20 | 14.2 | ||||||||
| A2 | 8.6 | C3 | 8.6 | A19 | 8.3 | A19 | 6.7 | 6.6 | 13.9 | A18 | 9.2 | A18 | 11.1 | 8.0 | 9.6 | A5 | 6.3 | B6 | 14.2 | |||||
| 8.6 | A2 | 5.7 | A2 | 6.8 | A18 | 5.8 | C3 | 4.9 | A18 | 11.1 | A16 | 6.7 | A16 | 6.2 | A2 | 7.0 | B14 | 8.5 | 6.3 | A19 | 8.3 | |||
| Haplotypes with 10 highest frequency estimates (%) (××) (×××) | A19 | 5.7 | A19 | 5.7 | A18 | 6.8 | A16 | 4.8 | C5 | 4.9 | A1 | 2.8 | A33 | 3.4 | A19 | 3.1 | A22 | 4.0 | A19 | 4.3 | 1.6 | 6.5 | ||
| A16 | 4.8 | B6 | 5.7 | A16 | 3.8 | A22 | 3.8 | A1 | 3.3 | A15 | 2.8 | C3 | 3.1 | C3 | 3.1 | A5 | 3.0 | A2 | 3.2 | 1.6 | A22 | 4.9 | ||
| A20 | 3.8 | A5 | 2.9 | A1 | 3.0 | A2 | 2.4 | A2 | 3.3 | A22 | 2.8 | A2 | 2.5 | A2 | 2.5 | A16 | 3.0 | 3.2 | 2.8 | |||||
| A26 | 2.9 | A20 | 2.9 | A153 | 3.0 | C1 | 2.4 | A64 | 3.3 | A26 | 2.8 | A19 | 2.4 | A5 | 1.7 | 3.0 | 3.2 | A18 | 2.8 | |||||
| A1 | 2.9 | A33 | 2.9 | A22 | 2.3 | C2 | 2.4 | C2 | 3.3 | A28 | 2.8 | A5 | 2.0 | A22 | 1.7 | A19 | 2.0 | A29 | 2.1 | A33 | 2.5 | |||
| C1 | 2.9 | A44 | 2.9 | A26 | 2.3 | 3.3 | A29 | 2.8 | C2 | 1.7 | A24 | 2.0 | A70 | 2.1 | ||||||||||
| A70 | 2.9 | A33 | 2.3 | A64 | 2.8 | A72 | 2.1 | |||||||||||||||||
| A82 | 2.9 | C1 | 2.3 | A140 | 2.8 | B12 | 2.1 | |||||||||||||||||
| A156 | 2.8 | C1 | 2.1 | |||||||||||||||||||||
| B3 | 2.8 | |||||||||||||||||||||||
| C3 | 2.8 | |||||||||||||||||||||||
Figure 2.Impact of including maternally related dogs in population samples of 200 versus 1000 dogs on the estimation of the frequencies of rare haplotypes.
Illustration of different annotations for the HV-II polyC-polyT-polyC haplotype with 6 C’s, 8 T’s and 2 C’s. Annotation (1) was used by Gundry et al. (2007), while Eichmann and Parson (2007) and Desmyter and Gijsbers (2012) applied annotation (2) because of different alignments to the Kim et al. (1998) reference sequence of 3C8T3C.
| - | - | - | |||||||||||||||
| C | C | C | C | C | - | C | T | T | T | T | T | T | T | T | C | C | |
| C | C | C | C | C | C | T | T | T | T | T | T | T | T | C | C | - |