| Literature DB >> 24273314 |
Abstract
Genomic DNA is used as the template for both replication and transcription, whose machineries may collide and result in mutagenesis, among other damages. Because head-on collisions are more deleterious than codirectional collisions, genes should be preferentially encoded on the leading strand to avoid head-on collisions, as is observed in most bacterial genomes examined. However, why are there still lagging strand encoded genes? Paul et al. recently proposed that these genes take advantage of the increased mutagenesis resulting from head-on collisions and are thus adaptively encoded on the lagging strand. We show that the evidence they provided is invalid and that the existence of lagging strand encoded genes is explainable by a balance between deleterious mutations that bring genes from the leading to the lagging strand and purifying selection purging such mutants. Therefore, the adaptive hypothesis is neither theoretically needed nor empirically supported.Entities:
Keywords: convergence; evolution; mutation-selection balance
Mesh:
Substances:
Year: 2013 PMID: 24273314 PMCID: PMC3879979 DOI: 10.1093/gbe/evt193
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
FSchematic drawing of a collision between a working DNA polymerase and a working RNA polymerase in bacteria. (a) Codirectional collision when a gene is encoded on the leading strand. (b) Head-on collision when a gene is encoded on the lagging strand. Arrows show the directions of movement.
FLagging strand encoded genes do not contain more multihit sites than expected by chance. (a) The observed ratio (arrow) between the number of multihit sites in lagging strand encoded genes and that in leading strand encoded genes is not significantly different from the random expectation (bars). The random expectation is determined by 10,000 simulations using variable sites determined from five B. subtilis strains and ten additional Bacillus species. P value is the two-tail probability that a randomly generated ratio is more extreme than the observed one. (b) The observed ratio (arrow) between the number of multihit-site-containing genes on the lagging strand and that on the leading strand is not significantly different from the random expectation (bars). (c) Same as (a) except that all sites are assumed variable. (d) Same as (b) except that all sites are assumed variable.