| Literature DB >> 28232902 |
Zhenling Peng1, Vladimir N Uversky2, Lukasz Kurgan3.
Abstract
We analyze a correlation between the GC content in genes of 12 eukaryotic species and the level of intrinsic disorder in their corresponding proteins. Comprehensive computational analysis has revealed that the disordered regions in eukaryotes are encoded by the GC-enriched gene regions and that this enrichment is correlated with the amount of disorder and is present across proteins and species characterized by varying amounts of disorder. The GC enrichment is a result of higher rate of amino acid coded by GC-rich codons in the disordered regions. Individual amino acids have the same GC-content profile between different species. Eukaryotic proteins with the disordered regions encoded by the GC-enriched gene segments carry out important biological functions including interactions with RNAs, DNAs, nucleotides, binding of calcium and metal ions, are involved in transcription, transport, cell division and certain signaling pathways, and are localized primarily in nucleus, cytosol and cytoplasm. We also investigate a possible relationship between GC content, intrinsic disorder and protein evolution. Analysis of a devised "age" of amino acids, their disorder-promoting capacity and the GC-enrichment of their codons suggests that the early amino acids are mostly disorder-promoting and their codons are GC-rich while most of late amino acids are mostly order-promoting.Entities:
Keywords: DNA-binding protein; GC content; RNA-binding protein; disorder prediction; protein evolution
Year: 2016 PMID: 28232902 PMCID: PMC5314932 DOI: 10.1080/21690707.2016.1262225
Source DB: PubMed Journal: Intrinsically Disord Proteins ISSN: 2169-0707