Literature DB >> 12954770

Patterns of nucleotide substitution, insertion and deletion in the human genome inferred from pseudogenes.

Zhaolei Zhang1, Mark Gerstein.   

Abstract

Nucleotide substitution, insertion and deletion (indel) events are the major driving forces that have shaped genomes. Using the recently identified human ribosomal protein (RP) pseudogene sequences, we have thoroughly studied DNA mutation patterns in the human genome. We analyzed a total of 1726 processed RP pseudogene sequences, comprising more than 700 000 bases. To be sure to differentiate the sequence changes occurring in the functional genes during evolution from those occurring in pseudogenes after they were fixed in the genome, we used only pseudogene sequences originating from parts of RP genes that are identical in human and mouse. Overall, we found that nucleotide transitions are more common than transversions, by roughly a factor of two. Moreover, the substitution rates amongst the 12 possible nucleotide pairs are not homogeneous as they are affected by the type of immediately neighboring nucleotides and the overall local G+C content. Finally, our dataset is large enough that it has many indels, thus allowing for the first time statistically robust analysis of these events. Overall, we found that deletions are about three times more common than insertions (3740 versus 1291). The frequencies of both these events follow characteristic power-law behavior associated with the size of the indel. However, unexpectedly, the frequency of 3 bp deletions (in contrast to 3 bp insertions) violates this trend, being considerably higher than that of 2 bp deletions. The possible biological implications of such a 3 bp bias are discussed.

Entities:  

Mesh:

Substances:

Year:  2003        PMID: 12954770      PMCID: PMC203328          DOI: 10.1093/nar/gkg745

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  27 in total

1.  Comparison of DNA sequences with protein sequences.

Authors:  W R Pearson; T Wood; Z Zhang; W Miller
Journal:  Genomics       Date:  1997-11-15       Impact factor: 5.736

2.  Compositional differences within and between eukaryotic genomes.

Authors:  S Karlin; J Mrázek
Journal:  Proc Natl Acad Sci U S A       Date:  1997-09-16       Impact factor: 11.205

Review 3.  Structure and evolution of mammalian ribosomal proteins.

Authors:  I G Wool; Y L Chan; A Glück
Journal:  Biochem Cell Biol       Date:  1995 Nov-Dec       Impact factor: 3.626

4.  Patterns and rates of indel evolution in processed pseudogenes from humans and murids.

Authors:  R Ophir; D Graur
Journal:  Gene       Date:  1997-12-31       Impact factor: 3.688

5.  Neighboring base effects on substitution rates in pseudogenes.

Authors:  M Bulmer
Journal:  Mol Biol Evol       Date:  1986-07       Impact factor: 16.240

6.  The size distribution of insertions and deletions in human and rodent pseudogenes suggests the logarithmic gap penalty for sequence alignment.

Authors:  X Gu; W H Li
Journal:  J Mol Evol       Date:  1995-04       Impact factor: 2.395

7.  Evolutionary rates of insertion and deletion in noncoding nucleotide sequences of primates.

Authors:  N Saitou; S Ueda
Journal:  Mol Biol Evol       Date:  1994-05       Impact factor: 16.240

8.  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice.

Authors:  J D Thompson; D G Higgins; T J Gibson
Journal:  Nucleic Acids Res       Date:  1994-11-11       Impact factor: 16.971

Review 9.  Slipped-strand mispairing: a major mechanism for DNA sequence evolution.

Authors:  G Levinson; G A Gutman
Journal:  Mol Biol Evol       Date:  1987-05       Impact factor: 16.240

Review 10.  Trinucleotide repeat DNA structures: dynamic mutations from dynamic DNA.

Authors:  C E Pearson; R R Sinden
Journal:  Curr Opin Struct Biol       Date:  1998-06       Impact factor: 6.809

View more
  101 in total

1.  Discovery of mutations in Saccharomyces cerevisiae by pooled linkage analysis and whole-genome sequencing.

Authors:  Shanda R Birkeland; Natsuko Jin; Alev Cagla Ozdemir; Robert H Lyons; Lois S Weisman; Thomas E Wilson
Journal:  Genetics       Date:  2010-10-05       Impact factor: 4.562

2.  Millions of years of evolution preserved: a comprehensive catalog of the processed pseudogenes in the human genome.

Authors:  Zhaolei Zhang; Paul M Harrison; Yin Liu; Mark Gerstein
Journal:  Genome Res       Date:  2003-12       Impact factor: 9.043

3.  The functional genomic distribution of protein divergence in two animal phyla: coevolution, genomic conflict, and constraint.

Authors:  Cristian I Castillo-Davis; Fyodor A Kondrashov; Daniel L Hartl; Rob J Kulathinal
Journal:  Genome Res       Date:  2004-05       Impact factor: 9.043

4.  Frequency of gaps observed in a structurally aligned protein pair database suggests a simple gap penalty function.

Authors:  Nalin C W Goonesekere; Byungkook Lee
Journal:  Nucleic Acids Res       Date:  2004-05-20       Impact factor: 16.971

5.  Transcription-coupled and splicing-coupled strand asymmetries in eukaryotic genomes.

Authors:  Marie Touchon; Alain Arneodo; Yves d'Aubenton-Carafa; Claude Thermes
Journal:  Nucleic Acids Res       Date:  2004-09-23       Impact factor: 16.971

6.  Mutation biases and mutation rate variation around very short human microsatellites revealed by human-chimpanzee-orangutan genomic sequence alignments.

Authors:  William Amos
Journal:  J Mol Evol       Date:  2010-08-11       Impact factor: 2.395

7.  GC content evolution of the human and mouse genomes: insights from the study of processed pseudogenes in regions of different recombination rates.

Authors:  Adel Khelifi; Julien Meunier; Laurent Duret; Dominique Mouchiroud
Journal:  J Mol Evol       Date:  2006-04-28       Impact factor: 2.395

8.  Why are young and old repetitive elements distributed differently in the human genome?

Authors:  Elise M S Belle; Matthew T Webster; Adam Eyre-Walker
Journal:  J Mol Evol       Date:  2005-03       Impact factor: 2.395

9.  An unusual pattern of spontaneous mutations recovered in the halophilic archaeon Haloferax volcanii.

Authors:  Reena R Mackwan; Geraldine T Carver; John W Drake; Dennis W Grogan
Journal:  Genetics       Date:  2006-12-28       Impact factor: 4.562

10.  Highly heterogeneous mutation rates in the hepatitis C virus genome.

Authors:  Ron Geller; Úrsula Estada; Joan B Peris; Iván Andreu; Juan-Vicente Bou; Raquel Garijo; José M Cuevas; Rosario Sabariegos; Antonio Mas; Rafael Sanjuán
Journal:  Nat Microbiol       Date:  2016-04-18       Impact factor: 17.745

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.