Literature DB >> 15059995

Comparative analysis of amino acid repeats in rodents and humans.

M Mar Albà1, Roderic Guigó.   

Abstract

Amino acid tandem repeats, also called homopolymeric tracts, are extremely abundant in eukaryotic proteins. To gain insight into the genome-wide evolution of these regions in mammals, we analyzed the repeat content in a large data set of rat-mouse-human orthologs. Our results show that human proteins contain more amino acid repeats than rodent proteins and that trinucleotide repeats are also more abundant in human coding sequences. Using the human species as an outgroup, we were able to address differences in repeat loss and repeat gain in the rat and mouse lineages. In this data set, mouse proteins contain substantially more repeats than rat proteins, which can be at least partly attributed to a higher repeat loss in the rat lineage. The data are consistent with a role for trinucleotide slippage in the generation of novel amino acid repeats. We confirm the previously observed functional bias of proteins with repeats, with overrepresentation of transcription factors and DNA-binding proteins. We show that genes encoding amino acid repeats tend to have an unusually high GC content, and that differences in coding GC content among orthologs are directly related to the presence/absence of repeats. We propose that the different GC content isochore structure in rodents and humans may result in an increased amino acid repeat prevalence in the human lineage.

Entities:  

Mesh:

Substances:

Year:  2004        PMID: 15059995      PMCID: PMC383298          DOI: 10.1101/gr.1925704

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


  36 in total

1.  Nucleotide compositional constraints on genomes generate alanine-, glycine-, and proline-rich structures in transcription factors.

Authors:  Y Nakachi; T Hayakawa; H Oota; K Sumiyama; L Wang; S Ueda
Journal:  Mol Biol Evol       Date:  1997-10       Impact factor: 16.240

2.  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice.

Authors:  J D Thompson; D G Higgins; T J Gibson
Journal:  Nucleic Acids Res       Date:  1994-11-11       Impact factor: 16.971

Review 3.  The complex pathology of trinucleotide repeats.

Authors:  P S Reddy; D E Housman
Journal:  Curr Opin Cell Biol       Date:  1997-06       Impact factor: 8.382

4.  Glutamine-rich domains activate transcription in yeast Saccharomyces cerevisiae.

Authors:  H Xiao; K T Jeang
Journal:  J Biol Chem       Date:  1998-09-04       Impact factor: 5.157

5.  Transcriptional activation modulated by homopolymeric glutamine and proline stretches.

Authors:  H P Gerber; K Seipel; O Georgiev; M Höfferer; M Hug; S Rusconi; W Schaffner
Journal:  Science       Date:  1994-02-11       Impact factor: 47.728

6.  Trinucleotide repeats and long homopeptides in genes and proteins associated with nervous system disease and development.

Authors:  S Karlin; C Burge
Journal:  Proc Natl Acad Sci U S A       Date:  1996-02-20       Impact factor: 11.205

7.  Distribution of trinucleotide microsatellites in different categories of mammalian genomic sequence: implications for human genetic diseases.

Authors:  R L Stallings
Journal:  Genomics       Date:  1994-05-01       Impact factor: 5.736

Review 8.  Polar zippers: their role in human disease.

Authors:  M Perutz
Journal:  Protein Sci       Date:  1994-10       Impact factor: 6.725

9.  A transcriptional repressor obtained by alternative translation of a trinucleotide repeat.

Authors:  R B Lanz; S Wieland; M Hug; S Rusconi
Journal:  Nucleic Acids Res       Date:  1995-01-11       Impact factor: 16.971

10.  Species-specific interaction of the glutamine-rich activation domains of Sp1 with the TATA box-binding protein.

Authors:  A Emili; J Greenblatt; C J Ingles
Journal:  Mol Cell Biol       Date:  1994-03       Impact factor: 4.272

View more
  76 in total

1.  Variable numbers of tandem repeats in Plasmodium falciparum genes.

Authors:  John C Tan; Asako Tan; Lisa Checkley; Caroline M Honsa; Michael T Ferdig
Journal:  J Mol Evol       Date:  2010-08-22       Impact factor: 2.395

2.  Natural selection drives the accumulation of amino acid tandem repeats in human proteins.

Authors:  Loris Mularoni; Alice Ledda; Macarena Toll-Riera; M Mar Albà
Journal:  Genome Res       Date:  2010-03-24       Impact factor: 9.043

3.  Role of everlasting triplet expansions in protein evolution.

Authors:  Zohar Koren; Edward N Trifonov
Journal:  J Mol Evol       Date:  2010-12-16       Impact factor: 2.395

4.  RCPdb: An evolutionary classification and codon usage database for repeat-containing proteins.

Authors:  Noel G Faux; Gavin A Huttley; Khalid Mahmood; Geoffrey I Webb; Maria Garcia de la Banda; James C Whisstock
Journal:  Genome Res       Date:  2007-06-13       Impact factor: 9.043

5.  Characterization of EST-SSR markers in durum wheat EST library and functional analysis of SSR-containing EST fragments.

Authors:  Ali Akbar Asadi; Sajad Rashidi Monfared
Journal:  Mol Genet Genomics       Date:  2014-03-21       Impact factor: 3.291

6.  Genome-wide evidence for selection acting on single amino acid repeats.

Authors:  Wilfried Haerty; G Brian Golding
Journal:  Genome Res       Date:  2010-01-07       Impact factor: 9.043

Review 7.  Non-B DNA structure-induced genetic instability and evolution.

Authors:  Junhua Zhao; Albino Bacolla; Guliang Wang; Karen M Vasquez
Journal:  Cell Mol Life Sci       Date:  2009-09-01       Impact factor: 9.261

8.  Interactions between homopolymeric amino acids (HPAAs).

Authors:  Yoko Oma; Yoshihiro Kino; Kazuya Toriumi; Noboru Sasagawa; Shoichi Ishiura
Journal:  Protein Sci       Date:  2007-08-31       Impact factor: 6.725

9.  Genome-wide analysis of histidine repeats reveals their role in the localization of human proteins to the nuclear speckles compartment.

Authors:  Eulàlia Salichs; Alice Ledda; Loris Mularoni; M Mar Albà; Susana de la Luna
Journal:  PLoS Genet       Date:  2009-03-06       Impact factor: 5.917

10.  Tandem and cryptic amino acid repeats accumulate in disordered regions of proteins.

Authors:  Michelle Simon; John M Hancock
Journal:  Genome Biol       Date:  2009-06-01       Impact factor: 13.583

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.