Literature DB >> 12034841

Comprehensive analysis of amino acid and nucleotide composition in eukaryotic genomes, comparing genes and pseudogenes.

Nathaniel Echols1, Paul Harrison, Suganthi Balasubramanian, Nicholas M Luscombe, Paul Bertone, Zhaolei Zhang, Mark Gerstein.   

Abstract

Based on searches for disabled homologs to known proteins, we have identified a large population of pseudogenes in four sequenced eukaryotic genomes-the worm, yeast, fly and human (chromosomes 21 and 22 only). Each of our nearly 2500 pseudogenes is characterized by one or more disablements mid-domain, such as premature stops and frameshifts. Here, we perform a comprehensive survey of the amino acid and nucleotide composition of these pseudogenes in comparison to that of functional genes and intergenic DNA. We show that pseudogenes invariably have an amino acid composition intermediate between genes and translated intergenic DNA. Although the degree of intermediacy varies among the four organisms, in all cases, it is most evident for amino acid types that differ most in occurrence between genes and intergenic regions. The same intermediacy also applies to codon frequencies, especially in the worm and human. Moreover, the intermediate composition of pseudogenes applies even though the composition of the genes in the four organisms is markedly different, showing a strong correlation with the overall A/T content of the genomic sequence. Pseudogenes can be divided into 'ancient' and 'modern' subsets, based on the level of sequence identity with their closest matching homolog (within the same genome). Modern pseudogenes usually have a much closer sequence composition to genes than ancient pseudogenes. Collectively, our results indicate that the composition of pseudogenes that are under no selective constraints progressively drifts from that of coding DNA towards non-coding DNA. Therefore, we propose that the degree to which pseudogenes approach a random sequence composition may be useful in dating different sets of pseudogenes, as well as to assess the rate at which intergenic DNA accumulates mutations. Our compositional analyses with the interactive viewer are available over the web at http://genecensus.org/pseudogene.

Entities:  

Mesh:

Substances:

Year:  2002        PMID: 12034841      PMCID: PMC117176          DOI: 10.1093/nar/30.11.2515

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  34 in total

1.  The DNA sequence of human chromosome 22.

Authors:  I Dunham; N Shimizu; B A Roe; S Chissoe; A R Hunt; J E Collins; R Bruskiewich; D M Beare; M Clamp; L J Smink; R Ainscough; J P Almeida; A Babbage; C Bagguley; J Bailey; K Barlow; K N Bates; O Beasley; C P Bird; S Blakey; A M Bridgeman; D Buck; J Burgess; W D Burrill; K P O'Brien
Journal:  Nature       Date:  1999-12-02       Impact factor: 49.962

2.  PartsList: a web-based system for dynamically ranking protein folds based on disparate attributes, including whole-genome expression and interaction information.

Authors:  J Qian; B Stenger; C A Wilson; J Lin; R Jansen; S A Teichmann; J Park; W G Krebs; H Yu; V Alexandrov; N Echols; M Gerstein
Journal:  Nucleic Acids Res       Date:  2001-04-15       Impact factor: 16.971

3.  Genome-scale compositional comparisons in eukaryotes.

Authors:  A J Gentles; S Karlin
Journal:  Genome Res       Date:  2001-04       Impact factor: 9.043

4.  A question of size: the eukaryotic proteome and the problems in defining it.

Authors:  Paul M Harrison; Anuj Kumar; Ning Lang; Michael Snyder; Mark Gerstein
Journal:  Nucleic Acids Res       Date:  2002-03-01       Impact factor: 16.971

5.  Molecular fossils in the human genome: identification and analysis of the pseudogenes in chromosomes 21 and 22.

Authors:  Paul M Harrison; Hedi Hegyi; Suganthi Balasubramanian; Nicholas M Luscombe; Paul Bertone; Nathaniel Echols; Ted Johnson; Mark Gerstein
Journal:  Genome Res       Date:  2002-02       Impact factor: 9.043

6.  Computational inference of homologous gene structures in the human genome.

Authors:  R F Yeh; L P Lim; C B Burge
Journal:  Genome Res       Date:  2001-05       Impact factor: 9.043

Review 7.  Studying genomes through the aeons: protein families, pseudogenes and proteome evolution.

Authors:  Paul M Harrison; Mark Gerstein
Journal:  J Mol Biol       Date:  2002-05-17       Impact factor: 5.469

8.  The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000.

Authors:  A Bairoch; R Apweiler
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

9.  A small reservoir of disabled ORFs in the yeast genome and its implications for the dynamics of proteome evolution.

Authors:  Paul Harrison; Anuj Kumar; Ning Lan; Nathaniel Echols; Michael Snyder; Mark Gerstein
Journal:  J Mol Biol       Date:  2002-02-22       Impact factor: 5.469

Review 10.  Interspersed repeats and other mementos of transposable elements in mammalian genomes.

Authors:  A F Smit
Journal:  Curr Opin Genet Dev       Date:  1999-12       Impact factor: 5.578

View more
  51 in total

1.  Evolution of prokaryotic genes by shift of stop codons.

Authors:  Anna A Vakhrusheva; Marat D Kazanov; Andrey A Mironov; Georgii A Bazykin
Journal:  J Mol Evol       Date:  2010-11-17       Impact factor: 2.395

2.  On slippage-like mutation dynamics within genes: a study of pseudogenes and 3'UTRs.

Authors:  Manami Nishizawa; Kazuhisa Nishizawa
Journal:  J Mol Evol       Date:  2005-02       Impact factor: 2.395

3.  Forbidden penta-peptides.

Authors:  Tamir Tuller; Benny Chor; Nathan Nelson
Journal:  Protein Sci       Date:  2007-10       Impact factor: 6.725

4.  Comparative analysis of pseudogenes across three phyla.

Authors:  Cristina Sisu; Baikang Pei; Jing Leng; Adam Frankish; Yan Zhang; Suganthi Balasubramanian; Rachel Harte; Daifeng Wang; Michael Rutenberg-Schoenberg; Wyatt Clark; Mark Diekhans; Joel Rozowsky; Tim Hubbard; Jennifer Harrow; Mark B Gerstein
Journal:  Proc Natl Acad Sci U S A       Date:  2014-08-25       Impact factor: 11.205

5.  PseudoFuN: Deriving functional potentials of pseudogenes from integrative relationships with genes and microRNAs across 32 cancers.

Authors:  Travis S Johnson; Sihong Li; Eric Franz; Zhi Huang; Shuyu Dan Li; Moray J Campbell; Kun Huang; Yan Zhang
Journal:  Gigascience       Date:  2019-05-01       Impact factor: 6.524

Review 6.  Filling the Void: Proximity-Based Labeling of Proteins in Living Cells.

Authors:  Dae In Kim; Kyle J Roux
Journal:  Trends Cell Biol       Date:  2016-09-22       Impact factor: 20.808

7.  Site-selective tyrosine bioconjugation via photoredox catalysis for native-to-bioorthogonal protein transformation.

Authors:  Beryl X Li; Daniel K Kim; Steven Bloom; Richard Y-C Huang; Jennifer X Qiao; William R Ewing; Daniel G Oblinsky; Gregory D Scholes; David W C MacMillan
Journal:  Nat Chem       Date:  2021-06-28       Impact factor: 24.427

8.  Multiple actions of phi-LITX-Lw1a on ryanodine receptors reveal a functional link between scorpion DDH and ICK toxins.

Authors:  Jennifer J Smith; Irina Vetter; Richard J Lewis; Steve Peigneur; Jan Tytgat; Alexander Lam; Esther M Gallant; Nicole A Beard; Paul F Alewood; Angela F Dulhunty
Journal:  Proc Natl Acad Sci U S A       Date:  2013-05-13       Impact factor: 11.205

9.  Linear combinations of docking affinities explain quantitative differences in RTK signaling.

Authors:  Andrew Gordus; Jordan A Krall; Elsa M Beyer; Alexis Kaushansky; Alejandro Wolf-Yadlin; Mark Sevecka; Bryan H Chang; John Rush; Gavin MacBeath
Journal:  Mol Syst Biol       Date:  2009-01-20       Impact factor: 11.429

10.  Assessing the genomic evidence for conserved transcribed pseudogenes under selection.

Authors:  Amit N Khachane; Paul M Harrison
Journal:  BMC Genomics       Date:  2009-09-15       Impact factor: 3.969

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.