| Literature DB >> 22649615 |
A P Grigorenko1, S A Borinskaya, N K Yankovsky, E I Rogaev.
Abstract
Studies of ancient DNA specimens started 25 years ago. At that time short mitochondrial DNA (mtDNA) fragments were the main targets in ancient DNA studies. The last three years were especially productive in the development of new methods of DNA purification and analysis. Complete mtDNA molecules and relatively large fragments of nuclear DNA are the targets of ancient DNA studies today. Ancient DNA studies allowed us to study organisms that went extinct more than ten thousand years ago, to reconstruct their phenotypic traits and evolution. Ancient DNA analyses can help understand the development of ancient human populations and how they migrated. A new evolutionary hypothesis and reconstruction of the biota history have been re-created from recent ancient DNA data. Some peculiarities and problems specific to the study of ancient DNA were revealed, such as very limited amounts of DNA available for study, the short length of the DNA fragments, breaks and chemical modifications in DNA molecules that result in "postmortem" mutations or complete blockage of DNA replication in vitro. The same specific features of DNA analysis were revealed for specimens from complicated forensic cases that result in the lack of experimental data or interpretation problems.. Here, we list the specific features of ancient DNA methodology and describe some achievements in fundamental and applied research of ancient DNA, including our own work in the field.Entities:
Year: 2009 PMID: 22649615 PMCID: PMC3347533
Source DB: PubMed Journal: Acta Naturae ISSN: 2075-8251 Impact factor: 1.845
Fig. 1.Unusually well-preserved mammoth M. primigenius DNA specimen obtained from permafrost-preserved remains found in 1986 in the valley of the Enmynveem River (Chukotka):
a - Fluorescence of nuclei in muscle cells of M. primigenius (~33, 000 years old) after DAPI staining indicates a relatively high degree of DNA preservation; b - electrophoresis of total genomic DNA extracted from mammoth muscle tissue (lane 1 is 1/10 dilution of the DNA on lane 2), and control DNA isolated from fresh human blood samples (lanes 3 and 4). The right lane is a DNA molecular weight marker (fragments sizes are indicated in kb) [13]
Authenticity criteria for ancient DNA
| Criteria | Importance for authenticity |
| Reagents and plastic to be used for work with ancient DNA must be checked for the possible presence of amplifiable templates. Since the templates can be present in trace amounts, and thus be amplified in only one out of several samples, multiple checks have to be performed. | Prevents contamination through reagents and disposable materials. |
| All the manipulations that are used to extract DNA are performed on solutions with no templates using the same solutions. PCR is performed with a double negative control; the normal one (reaction mix with no template) and the reaction mix with the "empty" extract. | Helps detect contamination which could have happened during extraction or during the PCR mix preparation. |
| Positive controls are usually not used, since they carry the risk of potential contamination. | Prevents contamination. |
| When possible, several independent extractions of DNA are performed from different areas of the sample. | Helps identify local contamination of the sample itself. |
| Repeated amplifications of material obtained from the same and from different extractions. | Helps identify sporadic contamination and facilitates the identification of erroneous nucleotides which were included into products amplified from degraded DNA extracts with a small amount of template molecules. |
| Cloning of amplification products and/or sequencing of multiple clones. | Identifies heterogeneity in the amplified products, which stems from contamination or amplification of degraded DNA with modified nucleotides. |
| Determination of the number of amplified DNA template molecules (must be determined for each pair of primers, since the number of amplified molecules can vary noticeably depending on the length and the nucleotide content of the amplified fragment, and also on the sensitivity of the specific pair of primers). | Determines the possibility of insertions of nucleotides not present in the original sequence. Extracts which contain only a few or even a single molecule are very much prone to yield erroneous insertions, so it is necessary to perform several amplifications. Extracts which contain at least 1,000 molecules require only one amplification reaction. |
| Peculiar "molecular behavior," a reverse correlation between the efficiency of amplification and the length of the amplified fragment . | If the sample does not exhibit more intensive amplification of shorter fragments than that of longer fragments, as compared to modern DNA, this indicates that the source of the amplified DNA is contaminated by modern templates. |
| Biochemical analysis of the level of preservation of macromolecules. | A high level of biochemical preservation of macromolecules indicates a high probability of preserved DNA molecules being found in the sample. This DNA can be analyzed. Thus, the test would support the authenticity of the sequencing results. |
| Avoid introducing nuclear sequences into mtDNA. | Nuclear DNA has regions homologous to mtDNA, so this fact must be taken into account if mtDNA amplification is used. |
| Independent confirmation of results in a different laboratory. | This helps identify laboratory contamination of samples or reagents, but it does not rule out contamination that was present in the sample itself (contaminants which were a part of the sample before it arrived at the laboratory, for instance during archeological excavation). This requirement used to be mandatory. Now it has been dropped. |
Various types of ancient DNA damage (from [4, 17] with modifications).
| Type of damage | Cause of damage | Effect on DNA | Possible solution |
| Nucleobases and deoxyribose degradation | Postmortem destruction by intracellular nucleases, degradation by microorganisms and other chemical processes | Apurinization of DNA, strand breaks, decrease of DNA fragment size, decrease of the overall amount of DNA | Amplification of short (<100-200 bp) overlapping fragments |
| Cross-links which block PCR | Alkylation, Maillard reaction (chemical reaction between a sugar molecule and an amino group of a nucleobase or an amino acid) | Cross-links between DNA strands in a single molecule; cross-links between DNA strands of different molecules; or cross-links between DNA and proteins | Treating the sample with reagents that destroy cross-links |
| Deamination and other types of oxidative or hydrolytic DNA base modifications |
Adenine → hypoxanthine | Insertion during amplification of nucleotides that were not present in the original nonmodified template | Treatment by DNA uracil-N-glycosylase, which removes cytosine deamination products. Determination of a consensus sequence based on multiple sequencing of the analyzed regions: Multiple independent PCR, cloning of the original template or PCR products, and sequencing of several clones |
Fig. 2.A typical result of PCR amplification of mammoth mitochondrial genome fragments:
a - PCR products of relatively short amplification fragments (300-600 bp); b - Successful amplification of long PCR fragments harboring entire mitochondrial gene sequences (1317 bp for the CytB gene and 1613 bp for the ATP6 gene) and no amplification products are obtained for larger size PCR fragments (3054 bp for the ND5 gene). M - marker fragment sizes are indicated in kb, K - negative controls [13]
Fig. 3.The size distribution of genomic DNA fragments extracted from the ancient remains sequenced on the 454 platform. Unpublished data obtained in collaboration of E.I. Rogaev et al. with M. Blow and E. Rubin
Fig. 4.Mitochondrial genome of the woolly mammoth M. primigenius. Determination of the nucleotide sequence of the complete mitochondrial genome was performed in two laboratories. The overlapping PCR-amplification products used for sequencing are shown in the inner circle [13]
Methodical approaches specific to analyses of degraded DNA from historical specimens [45]
| Stage of analysis | Special procedures | Reagents and methods |
| Preparation of historic samples | Independent analysis in specialized laboratories in IOGene (Moscow) and University of Massachusetts Medical School (Worchester, USA). | Physical and chemical cleaning of small bone fragments, crushing or drilling to obtain bone powder. |
| Extraction of DNA from bone remains | All the experimental procedures were performed in sterile PCR-hoods, in accordance with standards for ancient DNA research, keeping to all the safety precautions so as not to contaminate the samples by modern DNA. | DNA was extracted from ~170≤700 mg of cleaned bone material treated by 0.5 M EDTA and proteinase K and was then purified by a QIAquick PCR purification kit (Qiagen) in accordance with the manufacturer's protocol with slight modifications. |
| Extraction of DNA from archive spots of blood | The biological material was obtained from 4 different blood stains. At least 3 swabs were taken from each spot. In order to minimize contamination, DNA was extracted only from the 2nd and 3rd swabs of each spot. | DNA was extracted with the QIAamp DNA Mini Kit (Qiagen) in accordance with the manufacturer's protocol ("DNA Purification from Dried Blood Spots") with several modifications. |
| Quantative DNA analysis | The total DNA was quantified by the Quant-iT™ PicoGreen® Assay kit (Invitrogen), human specific DNA was quantified by the Plexor® HY assay kit (Promega) and the 7500 Real-Time PCR System (Applied Biosystems). | |
| Sequencing HVR1 and HVR2 of mtDNA from historic samples | Possible contamination by foreign DNA was monitored by using negative controls (amplification of "empty" extracts and PCR without addition of the template). | mtDNA fragments were amplified as short overlapping fragments. The PCR products were then extracted from the agarose gel using a QIAquick Gel Extraction kit or a MinElute Gel Extraction kit. For additional studies, the PCR products of samples from the second burial site were cloned. |
| Sequence analysis of the complete mitochondrial genome, extracted from bone remains. | Since the DNA was highly degraded, short overlapping fragments sized 164-383 b.p. were obtained, covering the whole mitochondrial genome. | Because the amount of DNA was initially so small, multiplex amplification was performed using 88 pairs of specially developed primers grouped into 3 kits, and then the products of this PCR were amplified with individual primer pairs. The PCR products were then sequenced using three different strategies |
| Analysis of the mtDNA extracted from the blood stains on Nicholas the Second's shirt. | Up to 5 or 7 repeated PCR reactions were conducted for some of the SNPs. | Since the quality of preservation in the blood stains was unknown, a kit of primers was developed for the amplification of short (64≤109 b.p.) DNA fragments, which would include very rare SNPs identified in the previous analysis of Skeleton №4 (the putative skeleton of Nicholas the Second). |
| Extraction and analysis of DNA from modern samples. | All the procedures involved in analyzing modern DNA were performed in other buildings, which were located some distance away from the ancient DNA laboratories. All the living relatives who took part in the study gave their written consent. | DNA obtained from buccal swabs or drops of blood was extracted using standard protocols. PCR was performed using a kit of primers for amplifying longer fragments. |
| Assembly of fragments into a continuous sequence of nucleotides (contigs). | The sequences were assembled using Seqman software, DNASTAR, and the revised Cambridge reference sequence (rCRS, accession number AC_000021) as a standard. | |
| Sex identification. | Sex was identified using the standard method, amplification of a fragment of the amelogenin gene using several commercial kits: AmpF≤STR® MiniFiler™ (Applied Biosystems) and PowerPlex S5 (Promega). Specially developed primers for the amplification of short fragments specific to the X- and Y-chromosomes were also used. | |
| Analysis of nuclear STR markers. | During the initial study, mtDNA or nuclear DNA extracts that consisted of a mix of individual profiles were discarded from further analysis. Each sample from various extracts was serially amplified. Homozygous loci were considered authentic if multiple independent amplifications confirmed a certain allele for the autosome STR-marker. | The following kits were used in order to obtain autosomal STR profiles from bone samples of the first and second burial sites: AmpF≤STR® MiniFiler™ PCR Amplification Kit (Applied Biosystems) and PowerPlex S5 System (Promega), developed especially for analyzing degraded DNA. |
| STR-profiles of the Y chromosome. | The AmpF≤STR® Yfiler™ (Applied Biosystems) kit was used, according to the manufacturer's protocol with slight modifications for work with degraded DNA. | |
| Electrophoresis analysis | In order to increase the signal intensity and lower "noise" in the STR-profiles, the products of multiplex amplification were sometimes purified according to the method for genotyping low-copy DNA templates using Qiagen MiniElute PCR purification kit. | Electrophoretic analysis was performed with a 96-capillary sequencer 3730xl DNA Analyzer (Applied Biosystems). The results were analyzed using GeneMapper ® ID software v3.2 (Applied Biosystems). |
Fig. 5.Maternal lineages of Empress Alexandra Fedorovna. The family members whose mitochondrial DNA was determined are indicated in grey
Databases used for analyzing the population frequencies of the determined genetic profiles [45-49]
| Query sequence | Number of samples in the database | Name of the database |
| Complete mitochondrial genome |
71,664 |
Mitosearch (www.mitosearch.org) contains sequences of the hypervariable region; |
| STR-haplotype of the Y-chromosome |
4,163 |
U.S.Consolidated Y-STR Database, which has genotypes for 15-17 STR loci (http://www.usystrdatabase.org/); |
| Autosome STRs | > 50,000 | For this study we combined the nonoverlapping data for European populations from two large databases: ALFRED (http://alfred.med.yale.edu/alfred/) and The Distribution of the Human DNA-PCR Polymorphisms database (http://www.uniduesseldorf.de/WWW/MedFak/Serology/database.html) and also added the recently published data on the Russian population. |
| SNP in the IVS3-3 position of the F9 gene |
Population SNP database (NCBI, HapMap Project); |
Fig. 6.Heteroplasmy at the 16169C/T locus in the mitochondrial (maternal) lineages of Emperor Nicholas II
Fig. 7.The paternal lineages of the Romanov family. The family members whose DNA was studied are indicated in grey
STR-haplotype analysis of the Y-chromosome [45]
| Markers | № 4 | № 146 | Members of the Romanov family | Archive blood stain from a shirt | Control DNA ABI, 007 |
| DYS456 | 16 | 16 | 16 | 16 | 15 |
| DYS389I | 13 | 13 | 13 | 13 | 13 |
| DYS390 | 24 | 24 | 24 | 24 | 24 |
| DYS389II | 29 | 29 | 29 | 29 | 29 |
| DYS458 | 17 | 17 | 17 | 17 | 17 |
| DYS19 | 14 | 14 | 14 | 14 | 15 |
| DYS385 | 11, 14 | 11, ND | 11, 14 | 11, 14 | 11, 14 |
| DYS393 | 13 | 13 | 13 | 13 | 13 |
| DYS391 | 10 | 10 | 10 | 10 | 11 |
| DYS439 | 11 | 11 | 11 | 11 | 12 |
| DYS635 | 24 | 24 | 24 | 24 | 24 |
| DYS392 | 13 | 13 | 13 | 13 | 13 |
| Y-GATA-H4 | 12 | 12 | 12 | 12 | 13 |
| DYS437 | 15 | 15 | 15 | 15 | 15 |
| DYS438 | 12 | 12 | 12 | 12 | 12 |
| DYS448 | 19 | 19 | 19 | 19 | 19 |
Fig. 8.Analysis of the sex chromosome and autosomal STR-markers in DNA extracted from bone specimens [45]
Fig. 9.Hemophilia in the royal families of Europe: a - the point mutation in the F9 gene of the bloodclotting factor, that creates an alternative splicing site [49]; b - the pedigree demonstrating the inheritance of hemophilia from Queen Victoria to her descendants; c - sequencing chromatograms of the F9 gene sequence obtained during the analysis of DNA extracted from bone remains
Methods used in screening mutations of the F8 and F9 genes [49]
| Stage of analysis | Special procedures | Reagents and methods |
| Multiplex PCR-amplification of the Factor VIII gene, F8 (26 exons) and Factor IX, F9 (8 exons), both of which are located on the X-chromosome | Extracts which were shown to be free of contamination by foreign samples based on the analyses of mtDNA and STR-markers were used for analyzing the nuclear genes. Just as in the previous analyses, negative controls were used: PCR-amplification of "empty" extracts (obtained by performing all of the extraction procedures, but without adding the bone sample) and PCR amplification without the addition of any DNA. | ~ 210 pairs of primers were designed for multiplex PCR amplification of short overlapping sequences (< 200 bp), that would cover all the exons and the intron-exons boundaries of the F8 and F9 genes. The primers were grouped into 14 sets for the F8 gene and 3 sets for the F9 gene, with each set consisting of 7 to 30 pairs of primers. About 100 pg of human DNA (~ 16-17 diploid genomes) were used for the initial multiplex PCR. |
| Sequencing | Sequencing of the blood clotting factor genes F8 and F9 was done in parallel with the mitochondrial genome to have a control for contamination and unequivocal identification of the sample. | Individual PCR-fragments were excised and purified from a 2.5 % agarose gel and then sequenced using two strategies. One involved mixing the PCR products in equimolar amounts and using them for sequencing (Illumina GA). The other approach involved the direct sequencing of individual PCR-products on a 96-capillary sequencer 3730xl DNA analyzer (Applied Biosystems). |
| Genotyping of the identified F9 gene mutation | 8 independent amplifications of the N7 sample (Empress Alexandra) were performed. For other bone samples, 2 to 7 independent extracts were analyzed from each sample. | The mutation which was initially found during the DNA analysis of skeleton N7 was verified by sequencing ultra-short amplicons (63 b.p. and 83 b.p.), which were obtained with specially developed primers. The same primers were used to amplify the DNA from bone fragments N146 (Prince Alexei) and N3, N5, N6, N147 (Nicolas the Second's and Alexandra's daughters). |
| Analysis of splicing products | The amplified fragment which bore the mutated region of the F9 gene was cloned into the pET01 Exontrap vector (MoBiTec). After verifying the structure of the recombinant molecule by sequencing, it was used to transfect a cell culture. cDNA was obtained from the spliced DNA by using RT-PCR. The cDNA was then amplified, and the PCR product library was sequenced (Illumina GA).The spliced sequences were identified in 812,114 reads. 99.982 % of the transcripts were spliced at the mutant site, and only 0.018 % were spliced at the wild-type site. |