| Literature DB >> 27604881 |
Simon Y W Ho1, Amanda X Y Chen2, Luana S F Lins2, David A Duchêne2, Nathan Lo2.
Abstract
The molecular clock is a valuable and widely used tool for estimating evolutionary rates and timescales in biological research. There has been considerable progress in the theory and practice of molecular clocks over the past five decades. Although the idea of a molecular clock was originally put forward in the context of protein evolution and advanced using various biochemical techniques, it is now primarily applied to analyses of DNA sequences. An interesting but very underappreciated aspect of molecular clocks is that they can be based on genetic data other than DNA or protein sequences. For example, evolutionary timescales can be estimated using microsatellites, protein folds, and even the extent of recombination. These genome features hold great potential for molecular dating, particularly in cases where nucleotide sequences might be uninformative or unreliable. Here we present an outline of the different genetic data types that have been used for molecular dating, and we describe the features that good molecular clocks should possess. We hope that our article inspires further work on the genome as an evolutionary timepiece.Entities:
Keywords: evolutionary rate; genomic data; molecular clock; molecular dating; phylogenetic analysis
Mesh:
Year: 2016 PMID: 27604881 PMCID: PMC5630892 DOI: 10.1093/gbe/evw220
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
. 1.—Methods used to estimate evolutionary timescales from genomic data. (a) Linear regression of pairwise genetic distances against time since divergence. Each data point in this plot represents a pair of taxa, with their divergence time inferred from the fossil record or from the age of a geological event that is presumed to be associated with the evolutionary divergence. Fitting a line through these points involves the assumption that genetic change accumulates at a constant rate through time, with the slope presenting an estimate of this rate. The line of best fit can be used to infer the timing of evolutionary divergence events, provided that a measure of genetic distance is available for the taxa in question. Molecular clocks based on linear regression have a number of weaknesses, including nonindependence of the data points and sensitivity to rate variation across lineages. (b) Phylogenetic analysis using a clock model. The tree is a chronogram with branch lengths measured in units of time. These methods usually involve models that explicitly describe the evolution of characters along the branches of the tree. Phylogenetic molecular clocks are calibrated by constraining the age of one or more nodes in the tree, such as the node indicated with a green circle, allowing the remaining node times to be inferred from the genetic data. (c) Root-to-tip distances computed from a phylogram, plotted against the ages of the sequences. A regression line is fitted through these data points, with the slope of the line giving an estimate of the evolutionary rate. This method is often used in analyses of time-structured sequence data, such as those from rapidly evolving viruses.
. 2.—Timeline showing the use of different genetic data types for molecular-clock analyses. The development and analysis of different data types is illustrated by a range of studies over the past five decades: amino acid sequences (Zuckerkandl and Pauling 1962), microcomplement fixation (Sarich and Wilson 1967b), protein electrophoresis (Nei 1971), nucleotide sequences (Miyata and Yasunaga 1980; Cohn et al. 1984), DNA–DNA hybridization (Wayne et al. 1991), microsatellites (Goldstein et al. 1995), randomly amplified polymorphic DNA (Espinasa and Borowsky 1998), genome complexity (Sharov 2006), amplified fragment length polymorphisms (Kropf et al. 2009), protein folds (Wang et al. 2011), and recombination events (Moorjani et al. 2016).