LCD-Composer: an intuitive, composition-centric method enabling the identification and detailed functional mapping of low-complexity domains.

Sean M Cascarina, David C King, Erin Osborne Nishimura, Eric D Ross.

Abstract

Low-complexity domains (LCDs) in proteins are regions predominantly composed of a small subset of the possible amino acids. LCDs are involved in a variety of normal and pathological processes across all domains of life. Existing methods define LCDs using information-theoretical complexity thresholds, sequence alignment with repetitive regions, or statistical overrepresentation of amino acids relative to whole-proteome frequencies. While these methods have proven valuable, they all quantify amino acid composition only indirectly, even though composition is the fundamental and biologically relevant feature related to protein sequence complexity. Here, we present a new computational tool, LCD-Composer, that directly identifies LCDs based on amino acid composition and linear amino acid dispersion. Using LCD-Composer's default parameters, we identified simple LCDs across all organisms available through UniProt and provide the resulting data in an accessible form as a resource. Furthermore, we describe large-scale differences between organisms from different domains of life and explore organisms with extreme LCD content for different LCD classes. Finally, we illustrate the versatility and specificity achievable with LCD-Composer by identifying diverse classes of LCDs using both simple and multifaceted composition criteria. We demonstrate that the ability to dissect LCDs based on these multifaceted criteria enhances the functional mapping and classification of LCDs.
© The Author(s) 2021. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics.
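
To make the idea of a composition-centric search concrete, the sketch below scans a protein sequence with a sliding window and reports regions where a chosen set of amino acids reaches a minimum fraction of the window. It is not LCD-Composer's implementation: the function name `composition_windows`, the window size of 20, and the 40% threshold are assumptions chosen for illustration, and the linear dispersion criterion mentioned in the abstract is deliberately left out.

```python
def composition_windows(seq, residues, window=20, threshold=0.4):
    """Return merged (start, end) spans, 0-based with end exclusive, covering
    every window in which `residues` make up at least `threshold` of the window.

    Hypothetical helper for illustration only; window and threshold values
    are assumptions, and no dispersion filter is applied.
    """
    targets = set(residues)
    spans = []
    if len(seq) < window:
        return spans

    # Count target residues in the first window, then update incrementally.
    count = sum(1 for aa in seq[:window] if aa in targets)
    for start in range(len(seq) - window + 1):
        if start > 0:
            count -= seq[start - 1] in targets           # residue leaving the window
            count += seq[start + window - 1] in targets  # residue entering the window
        if count / window >= threshold:
            end = start + window
            if spans and start <= spans[-1][1]:
                # Overlaps or touches the previous span: extend it.
                spans[-1] = (spans[-1][0], end)
            else:
                spans.append((start, end))
    return spans


# A toy sequence with a 20-residue glutamine stretch flanked by alanines.
toy = "A" * 30 + "Q" * 20 + "A" * 30
print(composition_windows(toy, "Q"))  # [(18, 62)]
```

Passing several residues at once (for example `"QN"`) treats them as a single group, which is one simple way to approximate a multi-residue composition criterion. The reported span is wider than the glutamine stretch itself because every qualifying window is merged in full, so a real tool would typically trim or refine the boundaries.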

Year: 2021    PMID: 34056598    PMCID: PMC8153834    DOI: 10.1093/nargab/lqab048

Source DB: PubMed    Journal: NAR Genom Bioinform    ISSN: 2631-9268


  2 in total

1.  Expansion and functional analysis of the SR-related protein family across the domains of life.

Authors:  Sean M Cascarina; Eric D Ross
Journal:  RNA       Date:  2022-07-21       Impact factor: 5.636

2.  Regions with two amino acids in protein sequences: A step forward from homorepeats into the low complexity landscape.

Authors:  Pablo Mier; Miguel A Andrade-Navarro
Journal:  Comput Struct Biotechnol J       Date:  2022-09-18       Impact factor: 6.155

