| Literature DB >> 22163018 |
Martijn Staats1, Argelia Cuenca, James E Richardson, Ria Vrielink-van Ginkel, Gitte Petersen, Ole Seberg, Freek T Bakker.
Abstract
Dried plant herbarium specimens are potentially a valuable source of DNA. Efforts to obtain genetic information from this source are often hindered by an inability to obtain amplifiable DNA as herbarium DNA is typically highly degraded. DNA post-mortem damage may not only reduce the number of amplifiable template molecules, but may also lead to the generation of erroneous sequence information. A qualitative and quantitative assessment of DNA post-mortem damage is essential to determine the accuracy of molecular data from herbarium specimens. In this study we present an assessment of DNA damage as miscoding lesions in herbarium specimens using 454-sequencing of amplicons derived from plastid, mitochondrial, and nuclear DNA. In addition, we assess DNA degradation as a result of strand breaks and other types of polymerase non-bypassable damage by quantitative real-time PCR. Comparing four pairs of fresh and herbarium specimens of the same individuals we quantitatively assess post-mortem DNA damage, directly after specimen preparation, as well as after long-term herbarium storage. After specimen preparation we estimate the proportion of gene copy numbers of plastid, mitochondrial, and nuclear DNA to be 2.4-3.8% of fresh control DNA and 1.0-1.3% after long-term herbarium storage, indicating that nearly all DNA damage occurs on specimen preparation. In addition, there is no evidence of preferential degradation of organelle versus nuclear genomes. Increased levels of C→T/G→A transitions were observed in old herbarium plastid DNA, representing 21.8% of observed miscoding lesions. We interpret this type of post-mortem DNA damage-derived modification to have arisen from the hydrolytic deamination of cytosine during long-term herbarium storage. Our results suggest that reliable sequence data can be obtained from herbarium specimens.Entities:
Mesh:
Substances:
Year: 2011 PMID: 22163018 PMCID: PMC3230621 DOI: 10.1371/journal.pone.0028448
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Copy numbers for herbarium specimens and fresh tissues for:(A) plastid genes (B) mitochondrial genes (C) nuclear genes.
Values statistically different at 5% significance level in post-hoc tests are indicated by different letters (A or B).
Figure 2Copy number fold-reduction of plastid, mitochondrial and nuclear genes in herbarium specimens (A) after specimen preparation (fresh tissue/young herbarium ratio), and (B) after long-term herbarium storage (young/old herbarium ratio).
Values were not statistically different at 5% significance level.
Mean DNA yield, mean gene copy numbers, and DNA yield loss-corrected copy numbers for plastid, mitochondrial and nuclear DNA relative to fresh control.
| % mean gene copy numbers | ||||
| Sample type | % mean DNA yield | Plastid | Mitochondrial | Nuclear |
|
| 100 | 100 | 100 | 100 |
|
| 22.6 | 10.5 (2.4) | 12.2 (2.8) | 16.8 (3.8) |
|
| 18.7 | 7.2 (1.3) | 5.6 (1.0) | 5.1 (1.0) |
Percentages were calculated relative to fresh controls, which were set at 100%. Mean DNA yields and mean gene copy numbers were taken from Table S6.
Percentages of mean gene copy numbers corrected for DNA yield losses relative to fresh control (in brackets) are calculated as: (% mean DNA yield) × (% mean gene copy numbers).
Chi-square (χ2) tests of independence (at p≥0.05 level) on number of nucleotide substitutions observed in fresh and herbarium DNA.
| Fresh tissue | Young herbarium | Old herbarium | ||||||
| DNA type | Substitution type | Observed (%) | Observed (%) | Observed, corrected | ÷2 contribution ( | Observed (%) | Observed, corrected | ÷2 contribution ( |
|
|
| 10746437 | 14193327 | 10746437 | 14101498 | 10746437 | ||
|
| 11298 | 16047 | 12149.94 | 22527 | 17167.32 | |||
|
| 8504 (0.0791) | 11634 (0.0817) | 8808.65 | 10.91 | 15040 (0.1066) | 11461.65 | 1028.66 | |
|
| 264 (0.0024) | 427 (0.0030) | 323.30 | 13.32 | 619 (0.0044) | 471.72 | 163.45 | |
|
| 521 (0.0048) | 585 (0.0041) | 442.93 | 11.69 | 937 (0.0066) | 714.07 | 71.54 | |
|
| 272 (0.0025) | 386 (0.0027) | 292.26 | 1.51 | 1021 (0.0072) | 778.08 | 941.61 | |
|
| 49 (0.0046) | 79 (0.0006) | 59.82 | 2.39 | 98 (0.0007) | 74.68 | 13.46 | |
|
| 1688 (0.0157) | 2936 (0.0207) | 2222.98 | 169.55 | 4812 (0.0341) | 3667.12 | 2320.44 | |
| 209.37 (<0.001) | 4539.16 (<0.001) | |||||||
|
|
| 8566629 | 9769969 | 8566629 | 8738228 | 8566629 | ||
|
| 14050 | 16174 | 14181.89 | 17277 | 16937.72 | |||
|
| 7580 (0.0885) | 8828 (0.090) | 7740.68 | 3.41 | 8113 (0.0928) | 7953.68 | 18.42 | |
|
| 380 (0.0044) | 408 (0.0042) | 357.75 | 1.30 | 525 (0.0060) | 514.69 | 47.74 | |
|
| 317 (0.0037) | 403 (0.0041) | 353.36 | 4.17 | 328 (0.0038) | 321.56 | 0.07 | |
|
| 569 (0.0066) | 634 (0.0065) | 555.91 | 0.30 | 834 (0.0095) | 817.62 | 108.63 | |
|
| 198 (0.0023) | 247 (0.0025) | 216.58 | 1.74 | 281 (0.0032) | 275.48 | 30.32 | |
|
| 5006 (0.0584) | 5654 (0.0579) | 4957.61 | 0.47 | 7196 (0.0823) | 7054.69 | 838.42 | |
| 11.39 (0.044) | 1043.60 (<0.001) | |||||||
|
|
| 12620519 | 13953891 | 12620519 | 8608206 | 12620519 | ||
|
| 18779 | 20334 | 18390.97 | 21269 | 31182.55 | |||
|
| 13084 (0.1036) | 13923 (0.0997) | 12592.58 | 18.46 | 11726 (0.1362) | 17191.53 | 1289.49 | |
|
| 547 (0.0043) | 595 (0.0043) | 538.14 | 0.14 | 826 (0.0096) | 1211.00 | 806.03 | |
|
| 682 (0.0054) | 702 (0.0050) | 634.92 | 3.25 | 1380 (0.0160) | 2023.22 | 2637.65 | |
|
| 473 (0.0037) | 498 (0.0036) | 450.41 | 1.08 | 681 (0.0079) | 998.42 | 583.64 | |
|
| 243 (0.0019) | 348 (0.0025) | 314.75 | 21.18 | 417 (0.0048) | 611.37 | 558.41 | |
|
| 3750 (0.0297) | 4268 (0.0306) | 3860.17 | 3.24 | 6239 (0.0725) | 9147.02 | 7767.41 | |
| 47.35 (<0.001) | 31642.64 (<0.001) | |||||||
Total numbers of adenine, thymine, cytosine and guanine in the dataset.
Numbers of observed substitutions and their percentages of total nucleotides observed (in brackets).
The numbers of observed substitutions were corrected for differences in total sequence data (nucleotides) between fresh and herbarium data.
Separate analyses were performed for plastid, mitochondrial, and nuclear DNA. Observed numbers in the fresh tissue data were used as expected numbers in χ2-tests.
Estimated (C→T/G→A) rate in herbarium plastid DNA during herbarium storage.
| Number of substitutions per 106 nucleotides | ||||
| Species | Young herbarium | Old herbarium | Old herbarium age (yrs.) | (C→T/G→A) per nucleotide per year ± SD |
|
| 197.55 | 330.02 | 65 | 2.04*10−6 |
|
| 229.09 | 384.02 | 107 | 1.45*10−6 |
|
| 186.14 | 313.52 | 114 | 1.12*10−6 |
| Average | 1.53*10−6±4.66*10−7 | |||