Evolution of Sequence-Diverse Disordered Regions in a Protein Family: Order within the Chaos.

Thomas Shafee, Antony Bacic, Kim Johnson.

Abstract

Approaches for studying the evolution of globular proteins are now well established yet are unsuitable for disordered sequences. Our understanding of the evolution of proteins containing disordered regions therefore lags that of globular proteins, limiting our capacity to estimate their evolutionary history, classify paralogs, and identify potential sequence-function relationships. Here, we overcome these limitations by using new analytical approaches that project representations of sequence space to dissect the evolution of proteins with both ordered and disordered regions, and the correlated changes between these. We use the fasciclin-like arabinogalactan proteins (FLAs) as a model family, since they contain a variable number of globular fasciclin domains as well as several distinct types of disordered regions: proline (Pro)-rich arabinogalactan (AG) regions and longer Pro-depleted regions. Sequence space projections of fasciclin domains from 2,019 FLAs from 78 species identified distinct clusters corresponding to different types of fasciclin domains. Clusters can be similarly identified in the seemingly random Pro-rich AG and Pro-depleted disordered regions. Sequence features of the globular and disordered regions clearly correlate with one another, implying coevolution of these distinct regions, as well as with the N-linked and O-linked glycosylation motifs. We reconstruct the overall evolutionary history of the FLAs, annotated with the changing domain architectures, glycosylation motifs, number and length of AG regions, and disordered region sequence features. Mapping these features onto the functionally characterized FLAs therefore enables their sequence-function relationships to be interrogated. These findings will inform research on the abundant disordered regions in protein families from all kingdoms of life.
© The Author(s) 2020. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
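
The abstract does not spell out how the sequence space projections are computed. As a rough illustration of the general idea only, and not the authors' pipeline, the sketch below one-hot encodes a handful of aligned sequences and projects them onto their first two principal components. The toy sequences, the `one_hot` and `project` helpers, and the choice of PCA via SVD are all assumptions made for this example.

```python
import numpy as np

# Assumed alphabet: the 20 standard amino acids plus a gap character.
ALPHABET = "ACDEFGHIKLMNPQRSTVWY-"


def one_hot(aligned_seqs):
    """Encode equal-length aligned sequences as a flat one-hot matrix."""
    index = {aa: i for i, aa in enumerate(ALPHABET)}
    length = len(aligned_seqs[0])
    X = np.zeros((len(aligned_seqs), length * len(ALPHABET)))
    for row, seq in enumerate(aligned_seqs):
        if len(seq) != length:
            raise ValueError("sequences must be aligned to equal length")
        for pos, aa in enumerate(seq):
            X[row, pos * len(ALPHABET) + index[aa]] = 1.0
    return X


def project(aligned_seqs, n_components=2):
    """Project sequences onto their top principal components via SVD."""
    X = one_hot(aligned_seqs)
    X -= X.mean(axis=0)  # center each one-hot column
    U, S, _ = np.linalg.svd(X, full_matrices=False)
    return U[:, :n_components] * S[:n_components]


# Hypothetical toy data: three Pro-rich and three Pro-depleted fragments.
toy = ["PAPSPAPS", "PAPTPAPS", "PSPSPAPA",
       "GKLDEVLK", "GKIDEVLR", "GRLDEVLK"]
coords = project(toy)
for seq, (pc1, pc2) in zip(toy, coords):
    print(f"{seq}  PC1={pc1:+.2f}  PC2={pc2:+.2f}")
```

In this toy case the two groups fall on opposite sides of the first component, which is the kind of clustering the abstract describes. Real disordered regions are hard to align, so an alignment-free encoding would likely be needed in practice.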

Keywords:  disordered protein regions; fasciclin-like arabinogalactan proteins; sequence analysis; hydroxyproline-rich glycoproteins; sequence space

Year:  2020        PMID: 32359163     DOI: 10.1093/molbev/msaa096

Source DB:  PubMed          Journal:  Mol Biol Evol        ISSN: 0737-4038            Impact factor:   16.240


  3 in total

1.  Principal Component Analysis Applications in COVID-19 Genome Sequence Studies.

Authors:  Bo Wang; Lin Jiang
Journal:  Cognit Comput       Date:  2021-01-13       Impact factor: 4.890

2.  Fasciclin-Like Arabinogalactan-Protein 16 (FLA16) Is Required for Stem Development in Arabidopsis.

Authors:  Edgar Liu; Colleen P MacMillan; Thomas Shafee; Yingxuan Ma; Julian Ratcliffe; Allison van de Meene; Antony Bacic; John Humphries; Kim L Johnson
Journal:  Front Plant Sci       Date:  2020-12-11       Impact factor: 5.753

3.  FLA11 and FLA12 glycoproteins fine-tune stem secondary wall properties in response to mechanical stresses.

Authors:  Yingxuan Ma; Colleen P MacMillan; Lisanne de Vries; Shawn D Mansfield; Pengfei Hao; Julian Ratcliffe; Antony Bacic; Kim L Johnson
Journal:  New Phytol       Date:  2022-01-04       Impact factor: 10.323
