Literature DB >> 12543934

Abundance and distributions of eukaryote protein simple sequences.

Kim Lan Sim1, Trevor P Creamer.   

Abstract

Protein simple sequences are a subclass of low complexity regions of sequence that are highly enriched in one or a few residue types. Such sequences are common in transcription regulatory proteins, in structural proteins, in proteins involved in nucleic acid interactions, and in mediating protein-protein interactions. Simple sequences of 10 or more residues, containing >/=50% of a single residue type are surveyed in this work. Both eukaryote and prokaryote proteomes are investigated with emphasis on the eukaryotes. Very large numbers of such sequences are found in all organisms surveyed. It is found that eukaryotes possess far more simple sequences per protein than do the prokaryotes. Prokaryotes display a linear relationship between number of proteins containing simple sequences and proteome size, whereas it is not clear that such a relationship holds for eukaryotes. Strikingly, it is found that each eukaryote possesses its own unique distribution of simple sequences. Within those distributions it is found that simple sequences enriched in certain residue types are clearly favored, whereas others are just as clearly discriminated against. The preferences observed are not correlated with residue occurrence. An analysis of classes of proteins of known function suggests that simple sequence occurrence and distribution may be related to protein function. Based upon this analysis, the large number of simple sequences found above that would be expected from a simple statistical model, plus the known functional importance of numerous such sequences, it is postulated that eukaryotes have evolved to not only tolerate large numbers of simple sequences but also to require them.

Mesh:

Substances:

Year:  2002        PMID: 12543934     DOI: 10.1074/mcp.m200032-mcp200

Source DB:  PubMed          Journal:  Mol Cell Proteomics        ISSN: 1535-9476            Impact factor:   5.911


  22 in total

1.  Effect of low-complexity regions on protein structure determination.

Authors:  Ryan M Bannen; Craig A Bingman; George N Phillips
Journal:  J Struct Funct Genomics       Date:  2008-02-27

2.  TCP transcription factors predate the emergence of land plants.

Authors:  Olivier Navaud; Patrick Dabos; Elodie Carnus; Dominique Tremousaygue; Christine Hervé
Journal:  J Mol Evol       Date:  2007-06-12       Impact factor: 2.395

3.  ATP-dependent proteases differ substantially in their ability to unfold globular proteins.

Authors:  Prakash Koodathingal; Neil E Jaffe; Daniel A Kraut; Sumit Prakash; Susan Fishbain; Christophe Herman; Andreas Matouschek
Journal:  J Biol Chem       Date:  2009-04-21       Impact factor: 5.157

4.  LCD-Composer: an intuitive, composition-centric method enabling the identification and detailed functional mapping of low-complexity domains.

Authors:  Sean M Cascarina; David C King; Erin Osborne Nishimura; Eric D Ross
Journal:  NAR Genom Bioinform       Date:  2021-05-26

5.  Diverse single-amino-acid repeat profiles in the genus Cryptosporidium.

Authors:  Giovanni Widmer
Journal:  Parasitology       Date:  2018-02-12       Impact factor: 3.234

6.  DR1769, a protein with N-terminal beta propeller repeats and a low-complexity hydrophilic tail, plays a role in desiccation tolerance of Deinococcus radiodurans.

Authors:  Yogendra S Rajpurohit; Hari S Misra
Journal:  J Bacteriol       Date:  2013-06-21       Impact factor: 3.490

7.  Comparative genomics reveals long, evolutionarily conserved, low-complexity islands in yeast proteins.

Authors:  Philip A Romov; Fubin Li; Peter N Lipke; Susan L Epstein; Wei-Gang Qiu
Journal:  J Mol Evol       Date:  2006-08-21       Impact factor: 2.395

8.  Cloning and expression analysis of Fgf5, 6 and 7 during early chick development.

Authors:  Megha Kumar; Susan C Chapman
Journal:  Gene Expr Patterns       Date:  2012-05-24       Impact factor: 1.224

9.  Composition-modified matrices improve identification of homologs of saccharomyces cerevisiae low-complexity glycoproteins.

Authors:  Juan E Coronado; Oliver Attie; Susan L Epstein; Wei-Gang Qiu; Peter N Lipke
Journal:  Eukaryot Cell       Date:  2006-04

10.  Minimal plus-end tracking unit of the cytoplasmic linker protein CLIP-170.

Authors:  Kamlesh K Gupta; Benjamin A Paulson; Eric S Folker; Blake Charlebois; Alan J Hunt; Holly V Goodson
Journal:  J Biol Chem       Date:  2008-12-13       Impact factor: 5.157

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.