
RCPdb: An evolutionary classification and codon usage database for repeat-containing proteins.

Noel G Faux, Gavin A Huttley, Khalid Mahmood, Geoffrey I Webb, Maria Garcia de la Banda, James C Whisstock.

Abstract

Over 3% of human proteins contain single amino acid repeats (repeat-containing proteins, RCPs). Many repeats (homopeptides) localize to important proteins involved in transcription, and the expansion of certain repeats, in particular poly-Q and poly-A tracts, can also lead to the development of neurological diseases. Previous studies have suggested that the homopeptide makeup is a result of the presence of G+C-rich tracts in the encoding genes and that expansion occurs via replication slippage. Here, we have performed a large-scale genomic analysis of the variation of the genes encoding RCPs in 13 species and present these data in an online database (http://repeats.med.monash.edu.au/genetic_analysis/). This resource allows rapid comparison and analysis of RCPs, homopeptides, and their underlying genetic tracts across the eukaryotic species considered. We report three major findings. First, there is a bias for a small subset of codons being reiterated within homopeptides, and there is no G+C or A+T bias relative to the organism's transcriptome. Second, single base pair transversions from the homocodon are unusually common and may represent a mechanism of reducing the rate of homopeptide mutations. Third, homopeptides that are conserved across different species lie within regions that are under stronger purifying selection in contrast to nonconserved homopeptides.
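The kind of analysis summarized above can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the function names are hypothetical, the minimum tract length of 5 residues is an assumed threshold chosen only for illustration, and the "homocodon" is taken here to be simply the most frequent codon in the tract. The sketch translates a coding sequence with the standard genetic code, locates homopeptides, tallies the codons that encode each tract, and labels single-base departures from the homocodon as transitions or transversions.

```python
import re
from collections import Counter

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
PURINES = {"A", "G"}


def translate(cds):
    """Translate an in-frame coding sequence; a trailing partial codon is ignored."""
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


def find_homopeptides(protein, min_len=5):
    """Return (start, end, residue) for runs of one amino acid.

    min_len=5 is an assumed, illustrative cutoff.
    """
    pattern = re.compile(r"([A-Z])\1{%d,}" % (min_len - 1))
    return [(m.start(), m.end(), m.group(1)) for m in pattern.finditer(protein)]


def tract_codons(cds, start, end):
    """Codons encoding protein positions start..end-1."""
    return [cds[3 * i:3 * i + 3] for i in range(start, end)]


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    return sum(base in "GC" for base in seq) / len(seq)


def substitution_type(a, b):
    """Purine<->purine or pyrimidine<->pyrimidine is a transition; otherwise a transversion."""
    return "transition" if (a in PURINES) == (b in PURINES) else "transversion"


def classify_against_homocodon(codons):
    """Take the most frequent codon as the homocodon and classify
    codons that differ from it at exactly one base."""
    homocodon = Counter(codons).most_common(1)[0][0]
    variants = []
    for codon in codons:
        diffs = [(x, y) for x, y in zip(homocodon, codon) if x != y]
        if len(diffs) == 1:
            variants.append((codon, substitution_type(*diffs[0])))
    return homocodon, variants


# Toy poly-Q tract with one synonymous interruption (CAA among CAG codons).
cds = "ATG" + "CAG" * 4 + "CAA" + "CAG" * 2 + "TAA"
protein = translate(cds)
for start, end, residue in find_homopeptides(protein):
    codons = tract_codons(cds, start, end)
    print(residue, Counter(codons), gc_fraction("".join(codons)))
    print(classify_against_homocodon(codons))
```

Comparing `gc_fraction` for the tracts against the same quantity computed over a whole transcriptome would be one way to look for the G+C or A+T bias the abstract refers to; that comparison is left out of the sketch.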

Year: 2007; PMID: 17567984; PMCID: PMC1899123; DOI: 10.1101/gr.6255407

Source DB: PubMed; Journal: Genome Res; ISSN: 1088-9051; Impact factor: 9.043


