| Literature DB >> 29426365 |
Daniel Brüne1, Miguel A Andrade-Navarro2, Pablo Mier3.
Abstract
OBJECTIVE: Amino acid composition is a sequence feature that has been extensively used to characterize proteomes of many species and protein families. Yet the analysis of amino acid composition of protein domains and the linkers connecting them has received less attention. Here, we perform both a comprehensive full-proteome amino acid composition analysis and a similar analysis focusing on domains and linkers, to uncover domain- or linker-specific differential amino acid usage patterns.Entities:
Keywords: Amino acid composition; Domains; Linkers
Mesh:
Substances:
Year: 2018 PMID: 29426365 PMCID: PMC5807739 DOI: 10.1186/s13104-018-3221-0
Source DB: PubMed Journal: BMC Res Notes ISSN: 1756-0500
Fig. 1Amino acid composition of 38 reference proteomes. The phylogenetic relationship between the species can be seen in the tree beneath the species’ name abbreviations. The number of protein sequences extracted from each proteome is shown at the base of the bars
Fig. 2Differential amino acid usage in domains vs. linkers in each proteome. Each dot represents the percentage of use of an amino acid in one of the proteomes in linkers versus domains. The black dashed line is the bisect, while the grey dashed lines mark the twofold increase, so that the amino acids on these lines are either twice as abundant in linkers than in domains (upper dashed line) or twice as abundant in domains (lower dashed line)
Fig. 3Composition of DNA-binding domains and linkers surrounding them, compared to the average in domains/linkers. a Results obtained in RACCOON when selecting the amino acid Arg in domains (see details in main text). b Direction of enrichment of average over-representation in the selected domains and surrounding linkers versus all domains and linkers in all the proteomes, mean fold deviation of amino acid usage over all proteomes, average percentage of amino acid usage over all the proteomes, and dispersion of the values (sum of squared Euclidean distances to centroid)