
Highly conserved regimes of neighbor-base-dependent mutation generated the background primary-structural heterogeneities along vertebrate chromosomes.

Marcos A Antezana, I King Jordan.

Abstract

The guanine+cytosine (GC) content varies markedly along the chromosomes of homeotherms, and great effort has been devoted to studying this heterogeneity and its biological implications. Even before the DNA-sequencing era, however, it was established that the dinucleotides in the DNA of mammals in particular, and of most organisms in general, show striking over- and under-representations that cannot be explained by the base composition. Here we show that in the coding regions of vertebrates both GC content and codon occurrences are strongly correlated with such "motif preferences", even though we quantify the latter using an index that is not affected by base composition, codon usage, or protein-sequence encoding. These correlations are likely to be the result of the long-term shaping of the primary structure of genic and non-genic DNA by a mutation regime whose central features have been maintained by natural selection. Indeed, we find that these preferences are conserved in vertebrates even more rigidly than codon occurrences, and we show that the occurrence-preference correlations are stronger in intronic and non-genic DNA, with R² values reaching 99% when GC content is approximately 0.5. The mutation regime appears to be characterized by rates that depend markedly on the bases present at the sites preceding and following each mutating site: when we estimate such rates of neighbor-base-dependent mutation (NBDM) from substitutions retrieved from alignments of coding, intronic, and non-genic mammalian DNA sorted and grouped by GC content, they suffice to simulate DNA sequences in which motif occurrences and preferences, as well as the correlations of motif preferences with GC content and with motif occurrences, are very similar to the mammalian ones.
The best fit, however, is obtained with NBDM regimes lacking strand effects, which indicates that over the long term NBDM switches strands in the germline, as one would expect for effects due to loosely contained background transcription. Finally, we show that human coding regions are less mutable under the estimated NBDM regimes than under matched context-independent mutation, and that this entails marked differences between the spectra of amino-acid mutations that the two mutation regimes should generate. In the Discussion we examine the mechanisms likely to underlie NBDM heterogeneity along chromosomes and propose that it reflects how the diversity and activity of lesion-bypass polymerases (LBPs) track the landscapes of scheduled and non-scheduled genome repair, replication, and transcription during the cell cycle. We conclude that the primary structure of vertebrate genic DNA at and below the trinucleotide level has been governed over the long term by highly conserved regimes of NBDM, which should be under direct natural selection because they drastically alter missense-mutation rates and hence the somatic and germline mutational loads. Therefore, the non-coding DNA of vertebrates may have been shaped by NBDM only epiphenomenally, with non-genic DNA being affected mainly when found in the proximity of genes.
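
Illustrative sketch (not from the paper): the abstract describes two ideas that a small program can make concrete, namely dinucleotide over- and under-representation relative to base composition, and simulation of sequences under mutation rates that depend on the neighboring bases. The Python below is a toy under stated assumptions. The observed/expected odds ratio is a common textbook measure and is not the authors' composition-independent index; the function names, the per-visit probabilities, and the single elevated CpG context are all hypothetical, whereas the paper estimates its rates from mammalian alignments. The toy rates are deliberately strand-symmetric (C to T before G, and the complementary G to A after C).

```python
import random
from collections import Counter

BASES = "ACGT"

def dinucleotide_odds_ratios(seq):
    """Observed/expected ratio f(XY) / (f(X) * f(Y)) for each dinucleotide XY."""
    n = len(seq)
    base_freq = {b: seq.count(b) / n for b in BASES}
    pair_counts = Counter(seq[i:i + 2] for i in range(n - 1))
    total_pairs = n - 1
    ratios = {}
    for x in BASES:
        for y in BASES:
            expected = base_freq[x] * base_freq[y]
            if expected > 0:
                ratios[x + y] = (pair_counts[x + y] / total_pairs) / expected
    return ratios

def simulate_nbdm(seq, rates, default_rate, steps, rng):
    """Toy neighbor-base-dependent mutation.

    rates maps a triplet (left neighbor, site, right neighbor) to a dict
    {new_base: probability of that change each time the site is visited}.
    Triplets absent from rates mutate to a random other base with
    probability default_rate (a context-independent background).
    """
    seq = list(seq)
    n = len(seq)
    for _ in range(steps):
        i = rng.randrange(1, n - 1)          # interior sites only
        context = seq[i - 1] + seq[i] + seq[i + 1]
        targets = rates.get(context)
        if targets is None:
            if rng.random() < default_rate:
                seq[i] = rng.choice([b for b in BASES if b != seq[i]])
        else:
            r = rng.random()
            cumulative = 0.0
            for new_base, p in targets.items():
                cumulative += p
                if r < cumulative:
                    seq[i] = new_base
                    break
    return "".join(seq)

def run_demo(seed=0, length=2000, steps=100000):
    """Return odds ratios before and after simulation under hypothetical rates."""
    rng = random.Random(seed)
    start = "".join(rng.choice(BASES) for _ in range(length))
    rates = {}
    for x in BASES:
        rates[x + "CG"] = {"T": 0.5}   # C -> T when followed by G (assumed rate)
        rates["CG" + x] = {"A": 0.5}   # G -> A when preceded by C (complementary strand)
    end = simulate_nbdm(start, rates, default_rate=0.01, steps=steps, rng=rng)
    return dinucleotide_odds_ratios(start), dinucleotide_odds_ratios(end)

if __name__ == "__main__":
    before, after = run_demo()
    print("CpG odds ratio before:", round(before["CG"], 2))
    print("CpG odds ratio after: ", round(after["CG"], 2))
```

In this toy, raising the mutation probability in a single neighbor context is enough to push the CpG odds ratio well below 1 even though the starting sequence is random, which is the qualitative kind of effect the abstract attributes to NBDM.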

Year:  2008        PMID: 18478116      PMCID: PMC2366069          DOI: 10.1371/journal.pone.0002145

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


