| Literature DB >> 29494531 |
Odile Loreille1, Shashikala Ratnayake2, Adam L Bazinet3, Timothy B Stockwell4, Daniel D Sommer5, Nadin Rohland6, Swapan Mallick7, Philip L F Johnson8, Pontus Skoglund9, Anthony J Onorato10, Nicholas H Bergman11, David Reich12,13, Jodi A Irwin14.
Abstract
High throughput sequencing (HTS) has been used for a number of years in the field of paleogenomics to facilitate the recovery of small DNA fragments from ancient specimens. Recently, these techniques have also been applied in forensics, where they have been used for the recovery of mitochondrial DNA sequences from samples where traditional PCR-based assays fail because of the very short length of endogenous DNA molecules. Here, we describe the biological sexing of a ~4000-year-old Egyptian mummy using shotgun sequencing and two established methods of biological sex determination (RX and RY), by way of mitochondrial genome analysis as a means of sequence data authentication. This particular case of historical interest increases the potential utility of HTS techniques for forensic purposes by demonstrating that data from the more discriminatory nuclear genome can be recovered from the most damaged specimens, even in cases where mitochondrial DNA cannot be recovered with current PCR-based forensic technologies. Although additional work remains to be done before nuclear DNA recovered via these methods can be used routinely in operational casework for individual identification purposes, these results indicate substantial promise for the retrieval of probative individually identifying DNA data from the most limited and degraded forensic specimens.Entities:
Keywords: Egypt; ancient DNA; high throughput sequencing; hybridization capture; mitochondrial genome; mtGenome; mummy; sexing
Year: 2018 PMID: 29494531 PMCID: PMC5867856 DOI: 10.3390/genes9030135
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1Head of the Djehutynakht mummy (2010–1961 BC). Photograph © 2018 Museum of Fine Arts, Boston, USA.
Figure 2Sequencing strategy in this study. Molecular work performed at the Federal Bureau of Investigation (FBI) Laboratory is in blue, while work performed at Harvard Medical School (HMS) is in pink. Cap1-Lib2 has both colors because the extraction and library preparation of Lib2 was done at the HMS, while the hybridization capture took place at the FBI. CAP-LIB1 is the result of merging data from runs I and II, while CAP-LIB2 is the result of merging runs III and IV. Lib1 and Lib2 were shotgun-sequenced together on the HiSeq at National Bioforensic Analysis Center (NBFAC, run V).
Figure 3(a) Size distribution of the reads that aligned to the mtGenome in CAP-LIB1. Reads ≤35 bp or >70 bp (shown in orange) were removed from the final data to avoid any impact from nonspecific [34] or contaminating reads. The length with the greatest number of reads was 47 bp. (b) Size distribution of the reads that aligned to the mtGenome in CAP-LIB2. Only reads >35bp and ≤70 bp (shown in blue) were retained in the final data. The length with the greatest number of reads was 38 bp.
Figure 4Distribution of 40,910 reads over the entire mtGenome. Sequence coverage at each position ranged from 5× to 247× (average 108×).
Sequencing statistics for the extraction reagent control (RB) and high throughput sequencing (HTS) reagent negative controls (NC).
| Total # Reads | # Unique Reads Mapped to hg19 | # Unique Reads Mapped to the mtGenome | ||
|---|---|---|---|---|
| Cap-Lib1 | RB | 428,192 | 560 | 0 |
| NC | 47,414 | 121 | 0 | |
| Cap2-Lib2 | RB | 744,228 | 570 | 10 |
| NC | 250,282 | 128 | 7 |
Shotgun sequencing mapping statistics.
| Sequence Statistics | FBI Shotgun-Lib1 | HMS Shotgun-Lib2 | |
|---|---|---|---|
| a | Number of raw paired reads | 164,451,485 | 266,162,607 |
| b | Number of reads mapped to the human genome hg19 and rCRS | 3,692,504 | 19,485,309 |
| c | Percentage of endogenous human DNA | 2.24% | 6.57% |
| d | Number of unique human reads with Q >30 | 1,595,239 | 7,691,326 |
| e | Average coverage hg19 | 0.02× | 0.09× |
| f | Number of unique mapped human reads with signs of damage | 518,381 | 344,995 |
RX and RY determination using unique mapped reads with Q > 30.
| Samples | Lib1 | Sex | Lib2 | Sex |
|---|---|---|---|---|
| Mapped reads | 1,595,239 | 7,691,326 | ||
| Mapped to X | 37,605 | 176,181 | ||
| Mapped to Y | 3732 | 16,469 | ||
| RY | 0.090 | 0.0855 | ||
| 95% CI | ♂ | ♂ | ||
| RX | 0.45 | 0.433 | ||
| 95% CI | 0.429– | ♂ | 0.4– | ♂ |
Bold symbolizes the CI used to determine sex.
RX and RY determination using unique mapped reads that showed signs of DNA damage.
| Samples | Lib1 | Sex | Lib2 | Sex |
|---|---|---|---|---|
| Mapped reads | 518,381 | 344,995 | ||
| Mapped to X | 11,512 | 6688 | ||
| Mapped to Y | 1133 | 637 | ||
| RY | 0.0896 | 0.087 | ||
| 95% CI | ♂ | ♂ | ||
| RX | 0.4216 | 0.3638 | ||
| 95% CI | 0.3987– | ♂ | 0.3261– | ♂ |
Bold symbolizes the CI used to determine sex.