| Literature DB >> 20441577 |
Kay Prüfer1, Udo Stenzel, Michael Hofreiter, Svante Pääbo, Janet Kelso, Richard E Green.
Abstract
High-throughput sequencing technologies have opened up a new avenue for studying extinct organisms. Here we identify and quantify biases introduced by particular characteristics of ancient DNA samples. These analyses demonstrate the importance of closely related genomic sequence for correctly identifying and classifying bona fide endogenous DNA fragments. We show that more accurate genome divergence estimates from ancient DNA sequence can be attained using at least two outgroup genomes and appropriate filtering.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20441577 PMCID: PMC2898072 DOI: 10.1186/gb-2010-11-5-r47
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Figure 1Number of aligned ancient DNA fragments and average sequence length. Properties of Mega BLAST alignments of ancient DNA sequences from a Neandertal fossil to genome sequences of increasing divergence. Left panel: number of reads with a best hit to the genome sequence and not to the GenBank nonredundant and environmental databases (yellow). Subset of reads with one unique best hit to the reference genome (light green). Subset of reads with one unique best hit to the reference genome that can be fully aligned with a positive alignment score (dark green). Right panel: Average length of best local alignments (yellow), average length of fragments with a unique best local alignment (red), average length of fragments with a positive score when fully aligned to reference genome (brown).
Figure 2Differences per site in alignments of ancient DNA fragments. All nucleotide differences (top) and transversion differences (bottom) in different alignments to reference genomes of increasing divergence. Each read is required to have one uniquely best Mega BLAST alignment to the reference genome (estimate shown as the black line). The semiglobal alignment forces the full sequence to align to the genomic region identified by the local alignment (estimate shown as red line). These full alignments are further filtered for having a positive alignment score (blue line). The green crosses show the differences between human and the reference species in the ENCODE multiple sequence alignments. The divergence times on the x-axis are from [52] and [35], except for human for which we choose an arbitrary divergence time of 1 million years to Neandertal.
Figure 3Schematic description of divergence triangulation. (a) A phylogenetic tree depicting the necessary topology for the application of the divergence triangulation method. (b) The ancient DNA sequences are used like an outgroup to the two genomic sequences in an unrooted tree. (c) Alignments between genomic sequences and ancient DNA fragments are used to assign changes to the lineages (numbers on the right-hand side). In this process, coinciding changes often caused by ancient DNA damage (shown in red in the alignments) can lead to misassignments of differences (in red in the summary of tables) (d) The assigned differences can be used to calculate a divergence relative to the divergence between the two genome sequences.
Figure 4Divergence estimates by triangulation on simulated datasets. (a) 3DP divergence estimates in comparison to the expected values. Four bars are drawn for different filters: raw estimate without filtering on all unique alignments (brown); filtered alignments with verified human and chimpanzee genomic location using a whole genome alignment and a distance of at least 6 points between best and second best local alignments' bitscores (red); alignments of fragments with a size >35 bp (orange); and all filters applied (yellow). (b) Estimates are derived solely from transversion differences, otherwise identical to (a).
Evaluation of potential ancient DNA shotgun sequencing targets
| Species | DNA preservation | Biological relevance | Closely related genome available |
|---|---|---|---|
| Neandertal | Yes, reasonable | Recent human evolution | Human, chimpanzee |
| Mammoth | Yes, very good. Draft genome published in 2008 [ | Limited; possibly adaptation to arctic environments | Elephant |
| Mastodon | Yes, good | Limited; in combination with mammoth parallel adaptation to arctic environments | No close living relatives |
| Dwarf elephant | Maybe possible; young enough, but poor preservation conditions | Rapid decrease in body size due to island adaptation | Elephant |
| Cave bear | Yes, good | Limited; probably interesting in combination with genomes from modern bear species; long hibernation without muscle atrophy may be medically interesting | Bear (not sequenced) |
| Ground sloth | Yes, reasonable | Size difference to modern species; parallel evolution in different lineages | Tree sloth (sequencing in progress) |
| Saber tooth cat | Probably possible | Limited; unique morphological adaptations | No close living relatives |
| Aurochs ( | Marginal; young enough, but poor preservation conditions in region of domestication | Understanding of domestication process | Cattle [ |
| No, young enough, but too poor preservation conditions | Relationship to modern humans; recent human evolution; island adaptation in a hominid | Human, chimpanzee | |
| Australopithecus | No, too old | Human evolution: potentially medical insights | Human, chimpanzee |
| Dinosaurs | No, far too old | Unique evolutionary lineage | No close living relatives |