Literature DB >> 22016336

Time dependency of molecular evolutionary rates? Yes and no.

Sankar Subramanian, David M Lambert.   

Abstract

Some previous studies have suggested that rates of evolution inferred using molecular sequences vary substantially depending on the time frame over which they are measured, whereas a number of other studies have argued against this proposition. We examined this issue by separating positions of primate mitochondrial genomes that are under different levels of selection constraints. Our results revealed an order of magnitude variation in the evolutionary rates at constrained sites (including nonsynonymous sites, D-loop, and RNA) and virtually an identical rate of evolution at synonymous sites, independent of the timescales over which they were estimated. Although the evolutionary rate at nonsynonymous sites obtained using the European (H1 haplogroup) mitogenomes is 9-15 times higher than that estimated using the human-chimpanzee pair, in contrast, the rates at synonymous sites are similar between these comparisons. We also show that the ratio of divergence at nonsynonymous to synonymous sites estimated using intra- and interspecific comparisons vary up to nine times, which corroborates our results independent of calibration times.

Entities:  

Mesh:

Substances:

Year:  2011        PMID: 22016336      PMCID: PMC3236606          DOI: 10.1093/gbe/evr108

Source DB:  PubMed          Journal:  Genome Biol Evol        ISSN: 1759-6653            Impact factor:   3.416


Introduction

Rates of molecular evolution are central to understanding the genetics and molecular biology of species. In the past, evolutionary rates have typically been estimated by calibrating the levels of evolutionary divergence (usually from extant species) with likely divergence times between species/populations. Although the evolutionary rates obtained using different calibration times should in principle be similar, an earlier study first noted much higher rates based on recent calibration times, compared with the rates estimated using older calibrations (Garcia-Moreno 2004). A later study using mitochondrial sequence data from primates and birds suggested that this pattern is more universal, and the authors proposed that rates of evolution are generally time dependent (Ho et al. 2005). Several studies of populations and closely related species reported higher evolutionary rates compared with those obtained from distantly related species and thus provided support for this concept (Burridge et al. 2008; Gratton et al. 2008; Howell et al. 2008; Henn et al. 2009; Soares et al. 2009). On the other hand, a number of studies refuted the idea of time dependency based on the methodological artifacts associated with the Bayesian Markov Chain Monte Carlo (MCMC) approach used to estimate evolutionary rates (Emerson 2007; Bandelt 2008; Debruyne and Poinar 2009; Navascues and Emerson 2009). These studies showed an upward bias in the rates estimated by the complex Bayesian MCMC methods using the data from populations or closely related species (Navascues and Emerson 2009), which was attributed to the poor or low signal in the data (Debruyne and Poinar 2009). Note that the rate of evolution mentioned above (and throughout this article) refers only to the inferred number of substitutions (or mutations) per site divided by the coalescence/divergence times. Hence, this does not suggest that the rate of mutation itself varies with time. Population genetic theories predict a time-dependent rate at constrained sites (but not at neutral sites) due to the removal of slightly deleterious mutations over time. Therefore, the concept of time dependency needs to be reexamined by separating neutral and other constrained sites that are under different magnitudes of selective constraints. Furthermore, a simple method of rate estimation that uses a minimal number of parameters and assumptions is needed to avoid methodological biases reported previously. Hence, to examine this, we assembled data sets consisting of mitochondrial genome sequences belonging to human populations, Neanderthal, and chimpanzee.

Materials and Methods

Mitochondrial genome sequences of chimpanzee, Neanderthal, and human populations were obtained from GenBank. We obtained 83 sequences for which the names of the haplogroup (H1) were explicitly mentioned. Similarly, there were 33 sequences with explicit references to native Australian (Aborigine) in the source fields. To estimate the rate of evolution within humans, 100 mitogenomic sequences were used by collecting five representative sequences from 20 major haplogroups A, B, C, D, G, H, I, J, K, L0, L1, L2, L3, T, U, V, W, X, Y, and Z. Hence, 216 human genomes plus the genomes of Neanderthal and chimpanzee were used in this study. Individual orthologous protein sequences from 218 genomes were aligned, and this was used to align cDNA. Similarly, individual tRNA, rRNA genes, and D-loop regions were aligned. Note that the D-loop regions were available only for 20 and 31 genomes belonging to Australians and Europeans (H1), respectively. All positions with alignment gaps were excluded. To estimate rates at synonymous positions, 4-fold and 2-fold sites from all protein-coding genes were concatenated. Likewise, the 0-fold sites were used to estimate rates at nonsynonymous sites. All tRNA and rRNAs were concatenated into a single alignment. To estimate the rate of evolution for the population data from European (H1), Australian, and within humans, the program “MCMCcoal” was employed using the option of “data analysis from one species” (Rannala and Yang 2003). All the default prior settings in the control file MCMCcoalYu2001.ctl (supplied with the program) were used. The summary statistics were extracted from the output file (mcmc.out) using the program “ds,” which is provided along with the MCMCcoal software. This program estimates the coalescence distances or root heights (μtMRCA) of the most recent common ancestor for a given set of sequences from a species (supplementary table S1, Supplementary Material online). The rates of evolution were determined by dividing μtMRCA by the respective population coalescence times given in table 1.
Table 1

Estimates of Rates of Molecular Evolution

Rate of Evolution (×10−8 s/s/year)
Group (Number of Genomes)Calibration Times (Intervals), kyrSynonymous SitesNonsynonymous SitesRNAD-LoopdN/dS(SE)
European—H1 (83)18 (11–25)5.1 (8.4–3.7)2.2 (3.5–1.6)2.8 (4.6–2.0)12.0 (19.0–8.6)0.423 (0.173)
Australian (33)45 (40–65)6.9 (7.8–4.8)1.8 (2.1–1.3)2.1 (2.4–1.5)18.0 (20.0–12.0)0.264 (0.067)
Humans (100)a150 (100–200)3.7 (5.5–2.8)0.8 (1.2–0.6)1.2 (1.9–0.9)7.8 (12.0–5.9)0.210 (0.041)
Human–Neanderthal500 (400–600)3.4 (5.7–2.5)0.5 (0.8–0.3)0.6 (1.0–0.4)4.4 (7.3–3.1)0.136 (0.006)
Human–chimpanzee6,000 (5,000–7,000)4.3 (5.2–3.7)0.2 (0.2–0.2)0.7 (0.9–0.6)4.3 (5.2–3.7)0.047 (0.0005)
Ratio H1/human–chimpanzee1.2 (1.0–1.6)10.8 (9.1–14.8)3.8 (3.2–5.2)2.8 (2.3–3.8)9.1 (5.4–12.5)

Note.—SE, standard error.

Five mitogenomes each were taken from 20 major haplogroups.

Estimates of Rates of Molecular Evolution Note.—SE, standard error. Five mitogenomes each were taken from 20 major haplogroups. To estimate rates of evolution between species, pairwise distances (and standard errors) between humanNeanderthal and humanchimpanzee were estimated (supplementary table S1, Supplementary Material online). In order to reduce estimation errors and to avoid any bias, the mean distance estimates between the 100 human genomes and chimpanzee or Neanderthal were obtained. For protein-coding genes, the “codeml” program of “PAML” (Yang 2007) was used to estimate dN and dS between species. For RNA and D-loop, the software “PAUP” (Swofford 2003) was used to estimate pairwise distances using the Hasegawa-Kishino-Yano plus Gamma plus invariant sites model. The above model was determined using the software “Modeltest” (Posada and Crandall 1998) using Bayesian Information criterion. Finally, the mean pairwise distances were divided by the species divergence times to obtain the rates of evolution. The program MCMCcoal was not used to obtain divergence between species. This is because this program assumes equal rate of evolution among sites and uses the simple Jukes–Cantor model to correct multiple substitution. These assumptions are sufficient only when the divergence is small (<0.2). Because the divergence between human and chimpanzee is >0.5 (at synonymous sites and D-loop), simple models will underestimate evolutionary distances (Nei and Kumar 2000). Evolutionary distance estimates and number of sites used are given in supplementary table S1, Supplementary Material online. Methods related to the rate estimations based on Bayesian MCMC analysis are given in the supplementary methods (supplementary tables S2 and S3, Supplementary Material online).

Results and Discussion

The data set was partitioned into synonymous sites, nonsynonymous sites, RNAs, and D-loop. The evolutionary distances between species pairs were estimated by PAUP (Swofford 2003) using the parameter estimates obtained from Modeltest (Posada and Crandall 1998). For the population data, the coalescence distance of the most recent common ancestor (μtMRCA) or the root height was estimated using the software MCMCcoal (Rannala and Yang 2003), which uses the Jukes–Cantor model for multiple-hit corrections. The pairwise divergences and coalescence distances were obtained for all the four types of sites (supplementary table S1, Supplementary Material online), and rates of evolution were estimated by simply dividing these distances by the respective divergence/coalescence times. We used five calibration time points spanning 18, 45, 150, 500, and 6,000 kyr, which adequately captures the rate of evolution in a wide range of timescales (table 1). The coalescence time of Europeans (H1 haplogroup) was based on the last glacial maximum (Achilli et al. 2004; Endicott and Ho 2008) and that for Australians was based on the oldest fossils of humans in Australia (Bowler et al. 2003). Other time points are based on the divergence time estimated using the nuclear data from human populations (Green et al. 2006; Gutenkunst et al. 2009) and from human/Neanderthal (Green et al. 2006) comparisons, respectively. We used the widely accepted 6 (5–7) million years for human/chimpanzee divergence. The rate estimates using nonsynonymous sites, RNA, and D-loop showed a negative relationship with the calibration times (fig. 1). In contrast, the neutral evolutionary rates using synonymous sites were similar across all the timescales. The magnitude of the rate variation is highest for the nonsynonymous sites and lowest for the D-loop region. For instance, the evolutionary rate at the amino acid replacement sites estimated for the H1 European haplogroup with a coalescence age of 18 kyr was found to be 10 times (8–14 times) higher than that obtained using the humanchimpanzee pair (fig. 1; table 1). The magnitude of the difference in the evolutionary rates between these time points are 7.5 (6–10) and 2.5 (2–3) times for the RNA and D-loop, respectively (fig. 1). The extent of rate differences suggests the intensity of selective constraints on these regions. Although nonsynonymous sites and RNAs are well known to be under selective constraints, the present study reveals selection on the D-loop region of mitochondrial genomes. In order to examine the robustness of these results, we reanalyzed the data using Bayesian MCMC methods (supplementary table S2, Supplementary Material online) and obtained similar results (supplementary fig. S1 and table S3, Supplementary Material online).
F

Relationships between rates of evolution and calibration times. (A) Rates of evolution were estimated for the nonsynonymous (red) and synonymous positions (blue) of mitochondrial protein-coding genes and for (B) RNAs (tRNA + rRNA, brown) and D-loop (green). Error bars are based on the lower and upper limits of the divergence/coalescence times. Both x and y axes are on a logarithmic scale. The best fitting regression lines are shown.

Relationships between rates of evolution and calibration times. (A) Rates of evolution were estimated for the nonsynonymous (red) and synonymous positions (blue) of mitochondrial protein-coding genes and for (B) RNAs (tRNA + rRNA, brown) and D-loop (green). Error bars are based on the lower and upper limits of the divergence/coalescence times. Both x and y axes are on a logarithmic scale. The best fitting regression lines are shown. Population genetic theories predict that the slightly deleterious mutations contribute diversity to the population for a short while, but that they are selected against over long timescales and are prevented from becoming fixed (Kimura 1983). Evidence for this prediction is very clear from figure 1. Over short timescales, evolutionary rates for nonsynonymous sequences are high probably due to the presence of such slightly deleterious mutations. Because these mutations are gradually eliminated over time, this is reflected in a steady decline in rates. In contrast, the mutations at synonymous sites are largely neutral and thus the accumulation of such mutations is relatively constant over time. This results in the similarity of neutral evolutionary rates across various timescales. Although the temporally declining rate pattern observed for RNAs is similar to that for amino acid replacement sites, the magnitude of the decline is comparatively small. This suggests relatively weak selective constraints on the former, in contrast to the latter. Interestingly, this study also reveals selection constraints in the D-loop region. Because the region is responsible for DNA replication, it is likely to harbor replication origins and other regulatory motifs associated with replication. However, the selection pressure on D-loop appears to be much weaker than that observed for the other constrained sites as the rate obtained for Europeans is only 2–3 times higher than that estimated for the humanchimpanzee pair (fig. 1; table 1). This could be due to the fact that only a small fraction of sites in this region are likely to be under selection. The results of this investigation are supported by earlier studies on birds and primates. A previous study revealed higher rate of constrained site evolution within a penguin lineage compared with that estimated between the lineages (Subramanian et al. 2009). However, the neutral rates of within and between lineage comparisons were found to be similar. Furthermore, a study on hominids also found significantly higher rates of nonsynonymous site evolution using intraspecific comparisons than that obtained for interspecific comparison (Endicott and Ho 2008). In contrast, evolutionary rates at synonymous sites were not significantly different between the two comparisons. Based on the similarity of neutral evolutionary rates, Soares et al. (2009) devised a method to correct the time-dependent effect in estimating evolutionary rates using human mitochondrial genomes. Despite using a very wide range of divergence/coalescence times, the rates obtained in this study are still influenced by the accuracy of the time estimates used. Therefore, we reexamined the temporal rate patterns without using any divergence/coalescence times. Our results showed a huge difference in the nonsynonymous rates (rN) and a broad similarity in the synonymous rates (rS) across all timescales. Therefore, the ratio of the former to the later will reveal the temporal pattern of nonsynonymous evolution. This ratio can be simplified as rN/rS = (dN/T)/(dS/T) = dN/dS, where T is divergence/coalescence time, dN and dS are divergences at nonsynonymous and synonymous sites. Now, after eliminating the time component (T), we estimated the dN/dS ratio for the five data sets. Figure 2 reveals a negative correlation between the dN/dS ratios and calibration times. The pattern observed in this result is possible only if the rate of nonsynonymous evolution declines with calibration times and the synonymous rate is constant across different timescales. Therefore, independent of the use of calibration times, this result provides substantial support for the results shown in figure 1.
F

(A) Ratio of nonsynonymous to synonymous divergence (dN/dS) estimated for intra- and interspecific comparisons using the mitochondrial genes. Error bars are standard error of the mean. (B) Comparison of NI estimates obtained through two different methods. Red columns are the NI computed using equation (1) (without calibration times), in which the intraspecific dN/dS of different populations were divided by that obtained for the human–chimpanzee comparison. Blue columns are the NI estimated using equation (3), where the intraspecific nonsynonymous rates of evolution obtained for different populations were divided by the interspecific (human–chimpanzee) rate.

(A) Ratio of nonsynonymous to synonymous divergence (dN/dS) estimated for intra- and interspecific comparisons using the mitochondrial genes. Error bars are standard error of the mean. (B) Comparison of NI estimates obtained through two different methods. Red columns are the NI computed using equation (1) (without calibration times), in which the intraspecific dN/dS of different populations were divided by that obtained for the humanchimpanzee comparison. Blue columns are the NI estimated using equation (3), where the intraspecific nonsynonymous rates of evolution obtained for different populations were divided by the interspecific (humanchimpanzee) rate. McDonald and Kreitman (1991) introduced a neutrality test, which compares the ratio of nonsynonymous to synonymous diversity (pN/pS) within a population and the ratio of nonsynonymous to synonymous divergence (dN/dS) obtained from interspecies comparison. Higher pN/pS compared with dN/dS is suggestive of the presence of deleterious nonsynonymous polymorphisms and the reverse mean adaptive amino acid substitutions. Later, Rand and Kann (1996) proposed a measure called neutrality index (NI), which is the ratio of these two ratios, This equation can be written aswhere rN1 and rS1 are intraspecific rates of evolution at synonymous and nonsynonymous sites, respectively, and rN2 and rS2 are interspecific rates of evolution at these corresponding sites. T1 and T2 are the population coalescence/divergence times within population and between species, respectively. Assuming that the rate of synonymous site evolution is similar between different timescales (rS1 = rS2), equation (2) can be simplified to This suggests that NI is simply the ratio of the rates of nonsynonymous site evolution estimated within a species and between species. Therefore, the time-dependent variation in the rate of evolution at nonsynonymous sites could be determined using the divergences at synonymous and amino acid replacement sites alone, without using any calibration times. We computed NI by comparing the dN/dS ratios obtained for Europeans (H1), Australians, humans, and human/Neanderthal with the ratio estimated for the humanchimpanzee comparison using equation (1). We also estimated NIs by comparing the nonsynonymous rates (table 1 and fig. 1) obtained for the human populations with that of humanchimpanzee using equation (3). Figure 2 shows that the NIs estimated using calibration times (2.3–10.8) are largely similar to those (2.9–9.1) estimated without using these times, and both clearly show time-dependent patterns of nonsynonymous rates of evolution. These estimates are very similar to the NIs obtained using the intra- and interspecies data from primates (Hasegawa et al. 1998), rodents (Rand and Kann 1996), and fruit flies (Rand et al. 1994). Therefore, the present and the previous studies provide solid evidence for the time dependency of molecular rates at amino acid replacement positions independent of calibration times.

Supplementary Material

Supplementary figure S1 and tables S1–S3 are available at Genome Biology and Evolution online (http://www.gbe.oxfordjournals.org/). Alignments and input files for the programs MCMCcoal and BEAST can be obtained from http://www.mediafire.com/?udqdd25a9734jyi.
  23 in total

1.  Clock debate: when times are a-changin': time dependency of molecular rate estimates: tempest in a teacup.

Authors:  H-J Bandelt
Journal:  Heredity (Edinb)       Date:  2007-10-17       Impact factor: 3.821

2.  Geological dates and molecular rates: fish DNA sheds light on time dependency.

Authors:  Christopher P Burridge; Dave Craw; David Fletcher; Jonathan M Waters
Journal:  Mol Biol Evol       Date:  2008-02-14       Impact factor: 16.240

3.  Alarm bells for the molecular clock? No support for Ho et al.'s model of time-dependent molecular rate estimates.

Authors:  Brent C Emerson
Journal:  Syst Biol       Date:  2007-04       Impact factor: 15.683

4.  PAML 4: phylogenetic analysis by maximum likelihood.

Authors:  Ziheng Yang
Journal:  Mol Biol Evol       Date:  2007-05-04       Impact factor: 16.240

5.  Time dependency of molecular rate estimates for mtDNA: this is not the time for wishful thinking.

Authors:  N Howell; C Howell; J L Elson
Journal:  Heredity (Edinb)       Date:  2008-06-04       Impact factor: 3.821

6.  Time dependency of molecular rate estimates and systematic overestimation of recent divergence times.

Authors:  Simon Y W Ho; Matthew J Phillips; Alan Cooper; Alexei J Drummond
Journal:  Mol Biol Evol       Date:  2005-04-06       Impact factor: 16.240

7.  Excess amino acid polymorphism in mitochondrial DNA: contrasts among genes from Drosophila, mice, and humans.

Authors:  D M Rand; L M Kann
Journal:  Mol Biol Evol       Date:  1996-07       Impact factor: 16.240

8.  Analysis of one million base pairs of Neanderthal DNA.

Authors:  Richard E Green; Johannes Krause; Susan E Ptak; Adrian W Briggs; Michael T Ronan; Jan F Simons; Lei Du; Michael Egholm; Jonathan M Rothberg; Maja Paunovic; Svante Pääbo
Journal:  Nature       Date:  2006-11-16       Impact factor: 49.962

9.  Neutral and non-neutral evolution of Drosophila mitochondrial DNA.

Authors:  D M Rand; M Dorfsman; L M Kann
Journal:  Genetics       Date:  1994-11       Impact factor: 4.562

10.  A Bayesian evaluation of human mitochondrial substitution rates.

Authors:  Phillip Endicott; Simon Y W Ho
Journal:  Am J Hum Genet       Date:  2008-03-27       Impact factor: 11.025

View more
  9 in total

1.  Selective constraints determine the time dependency of molecular rates for human nuclear genomes.

Authors:  Sankar Subramanian; David M Lambert
Journal:  Genome Biol Evol       Date:  2012       Impact factor: 3.416

2.  Complete mitochondrial DNA sequences of the threadfin cichlid (Petrochromis trewavasae) and the blunthead cichlid (Tropheus moorii) and patterns of mitochondrial genome evolution in cichlid fishes.

Authors:  Christoph Fischer; Stephan Koblmüller; Christian Gülly; Christian Schlötterer; Christian Sturmbauer; Gerhard G Thallinger
Journal:  PLoS One       Date:  2013-06-24       Impact factor: 3.240

3.  Time dependency of foamy virus evolutionary rate estimates.

Authors:  Pakorn Aiewsakun; Aris Katzourakis
Journal:  BMC Evol Biol       Date:  2015-06-26       Impact factor: 3.260

4.  Effects of mitochondrial DNA rate variation on reconstruction of Pleistocene demographic history in a social avian species, Pomatostomus superciliosus.

Authors:  Janette A Norman; Caroline J Blackmore; Meaghan Rourke; Les Christidis
Journal:  PLoS One       Date:  2014-09-02       Impact factor: 3.240

5.  Mitochondrial DNA hyperdiversity and its potential causes in the marine periwinkle Melarhaphe neritoides (Mollusca: Gastropoda).

Authors:  Séverine Fourdrilis; Patrick Mardulyn; Olivier J Hardy; Kurt Jordaens; António Manuel de Frias Martins; Thierry Backeljau
Journal:  PeerJ       Date:  2016-10-05       Impact factor: 2.984

6.  Highly regional population structure of Spondyliosoma cantharus depicted by nuclear and mitochondrial DNA data.

Authors:  Ana Neves; Ana Rita Vieira; Vera Sequeira; Rafaela Barros Paiva; Leonel Serrano Gordo; Octávio S Paulo
Journal:  Sci Rep       Date:  2020-03-04       Impact factor: 4.379

7.  Phylogeography using mitogenomes: A rare Dipodidae, Sicista betulina, in North-western Europe.

Authors:  Liselotte Wesley Andersen; Magnus W Jacobsen; Jane Frydenberg; Julie Dahl Møller; Thomas Secher Jensen
Journal:  Ecol Evol       Date:  2022-04-21       Impact factor: 3.167

8.  Evaluating purifying selection in the mitochondrial DNA of various mammalian species.

Authors:  Pedro Soares; Diogo Abrantes; Teresa Rito; Noel Thomson; Predrag Radivojac; Biao Li; Vincent Macaulay; David C Samuels; Luísa Pereira
Journal:  PLoS One       Date:  2013-03-22       Impact factor: 3.240

9.  A universal trend among proteomes indicates an oily last common ancestor.

Authors:  Ranjan V Mannige; Charles L Brooks; Eugene I Shakhnovich
Journal:  PLoS Comput Biol       Date:  2012-12-27       Impact factor: 4.475

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.