| Literature DB >> 34424926 |
Claus M Zacho1, Martina A Bager2, Ashot Margaryan2,3, Peter Gravlund4, Anders Galatius5, Arne R Rasmussen6, Morten E Allentoft1,7.
Abstract
Natural history museum collections worldwide represent a tremendous resource of information on past and present biodiversity. Fish, reptiles, amphibians and many invertebrate collections have often been preserved in ethanol for decades or centuries and our knowledge on the genomic and metagenomic research potential of such material is limited. Here, we use ancient DNA protocols, combined with shotgun sequencing to test the molecular preservation in liver, skin and bone tissue from five old (1842 to 1964) museum specimens of the common garter snake (Thamnophis sirtalis). When mapping reads to a T. sirtalis reference genome, we find that the DNA molecules are highly damaged with short average sequence lengths (38-64 bp) and high C-T deamination, ranging from 9% to 21% at the first position. Despite this, the samples displayed relatively high endogenous DNA content, ranging from 26% to 56%, revealing that genome-scale analyses are indeed possible from all specimens and tissues included here. Of the three tested types of tissue, bone shows marginally but significantly higher DNA quality in these metrics. Though at least one of the snakes had been exposed to formalin, neither the concentration nor the quality of the obtained DNA was affected. Lastly, we demonstrate that these specimens display a diverse and tissue-specific microbial genetic profile, thus offering authentic metagenomic data despite being submerged in ethanol for many years. Our results emphasize that historical museum collections continue to offer an invaluable source of information in the era of genomics.Entities:
Mesh:
Substances:
Year: 2021 PMID: 34424926 PMCID: PMC8382189 DOI: 10.1371/journal.pone.0256353
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Five specimens of Thamnophis sirtalis included in this study.
| Museum # | Collection year | Location | Country |
|---|---|---|---|
| ZMUC R602799 | 1842 | Pennsylvania, West | USA |
| ZMUC R602815 | 1880 | Shoalwater Bay | USA |
| ZMUC R602826 | 1920 | Zoo, Copenhagen | Denmark |
| ZMUC R602827 | 1923 | Zoo, Copenhagen | Denmark |
| ZMUC R603729 | 1964 | Bornholm | Denmark |
The Danish island Bornholm is not part of the distribution range for this species, so similar to the zoo animals from Copenhagen, we assume that this must be an individual that died in captivity.
DNA preservation.
| Museum # | Collection year | Formalin | Tissue | Conc. (ng/μL) | Endo % | Length, bp | C-T % | Human % |
|---|---|---|---|---|---|---|---|---|
| ZMUC R602799 | 1842 | ? | Liver | 5.7 | 35.7 | 46.8 | 19.8 | 1.1 |
| Bone | 2.6 | 42.2 | 51.5 | 14.5 | 0.5 | |||
| Skin | 0.3 | 37.1 | 47.1 | 14.1 | 1.2 | |||
| ZMUC R602815 | 1880 | ? | Liver | 38.1 | 33.7 | 49.2 | 20.5 | 1.7 |
| Bone | 10.5 | 47.2 | 51.5 | 16.0 | 0.6 | |||
| Skin | 0.6 | 33.7 | 40.4 | 18.8 | 1.3 | |||
| ZMUC R602826 | 1920 | No | Liver | 92.4 | 47.4 | 59.4 | 11.1 | 0.3 |
| Bone | 18.6 | 50.5 | 63.4 | 10.9 | 0.2 | |||
| Skin | 25.9 | 46.8 | 59.3 | 9.1 | 0.2 | |||
| ZMUC R602827 | 1923 | Yes | Liver | 54.7 | 54.7 | 46.8 | 13.7 | 0.3 |
| Bone | 1.8 | 55.6 | 49.7 | 16.8 | 0.3 | |||
| Skin | 2.1 | 55.1 | 45.8 | 16.4 | 0.3 | |||
| ZMUC R603729 | 1964 | ? | Liver | 22.7 | 29.0 | 48.8 | 15.7 | 0.9 |
| Bone | 83.3 | 48.5 | 62.1 | 8.8 | 0.2 | |||
| Skin | 12.1 | 45.4 | 45.2 | 12.6 | 0.4 |
Results of formalin test, DNA concentration measurement and bioinformatic analyses. Endo %, endogenous DNA content; Length, average length (in base pairs) of mapped reads; C-T %, C-to-T deamination damage fraction recorded at first position in the 5’end; Human %, fraction of sequences that could be mapped to the human reference genome. Each value represents the average of two replicate DNA extracts (see S2 Table for all observations).
Fig 1DNA preservation.
DNA concentration measurement and DNA preservation metrics listed according to specimen age. Endo %, endogenous DNA content; Length, average length (in base pairs) of mapped reads; C-T %, C-to-T deamination damage fraction recorded at first position in the 5’end. Bars represent average values of two replicate DNA extracts with the range shown.
Linear mixed effects model.
| Endogenous DNA, % | |||||
|---|---|---|---|---|---|
| Source | Effect | SE | df | t | p |
| Intercept | 48.79 | 3.54 | 23 | 13.79 | <0.0001 |
| Liver | -8.69 | 2.24 | 23 | -3.88 | 0.0008 |
| Skin | -5.19 | 2.24 | 23 | -2.34 | 0.0298 |
|
| |||||
| Source | Effect | SE | df | t | p |
| Intercept | 55.64 | 2.73 | 23 | 20.42 | <0.0001 |
| Liver | -5.46 | 1.66 | 23 | -3.28 | 0.0033 |
| Skin | -8.11 | 1.66 | 23 | -4.87 | 0.0001 |
|
| |||||
| Source | Effect | SE | df | t | p |
| Intercept | 13.37 | 1.52 | 23 | 8.78 | <0.0001 |
| Liver | 2.76 | 0.91 | 23 | 3.05 | 0.0057 |
| Skin | 0.81 | 0.91 | 23 | 0.89 | 0.3804 |
|
| |||||
| Source | Effect | SE | df | t | p |
| Intercept | -67% | 139 | 23 | -3.32 | <0.0001 |
| Liver | 89% | 113 | 23 | 5.15 | <0.0001 |
| Skin | 50% | 113 | 23 | 3.28 | 0.0033 |
|
| |||||
| Source | Effect | SE | df | t | p |
| Intercept | 837% | 185 | 23 | 3.63 | <0.0001 |
| Liver | 221% | 152 | 23 | 2,79 | 0.0103 |
| Skin | -73% | 152 | 23 | -3.16 | 0.0043 |
Results of the linear mixed effects models conducted on endogenous DNA content, mapped length, CT damage, human DNA content and DNA concentration. Bone was used as the reference matrix against which values for liver and skin were compared. The estimated effects derived from the log-transformed variables (human DNA content and DNA concentration) are given in percentage deviations from bone, while other effects are given in the relevant units (percentage points or base pair length). SE: standard error, df: degrees of freedom, t: test value, p: p value.
Fig 2Human DNA contamination.
Fraction of sequences that could be mapped to the human reference genome listed according to specimen age. Bars represent average values of two replicate DNA extracts and the range is shown.
Fig 3Metagenomic profiling.
Bacterial metagenomic diversity visualized in Krona and Venn (lower right) plots for the three tissue types. The diversity is presented at species-level and includes only NCBI refseq taxa with >5 sequence matches. Colors denote taxonomic groups found within the various bacterial phyla as according to Krona visualization. Blanks denote negative extraction control.