| Literature DB >> 29897971 |
Abstract
Amino acid mutations in proteins are random and those mutations which are beneficial or neutral survive during the course of evolution. Conservation or co-evolution analyses are performed on the multiple sequence alignment of homologous proteins to understand how important different amino acids or groups of them are. However, these traditional analyses do not explore the directed influence of amino acid mutations, such as compensatory effects. In this work we develop a method to capture the directed evolutionary impact of one amino acid on all other amino acids, and provide a visual network representation for it. The method developed for these directed networks of inter- and intra-protein evolutionary interactions can also be used for noting the differences in amino acid evolution between the control and experimental groups. The analysis is illustrated with a few examples, where the method identifies several directed interactions of functionally critical amino acids. The impact of an amino acid is quantified as the number of amino acids that are influenced as a consequence of its mutation, and it is intended to summarize the compensatory mutations in large evolutionary sequence data sets as well as to rationally identify targets for mutagenesis when their functional significance can not be assessed using structure or conservation.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29897971 PMCID: PMC5999116 DOI: 10.1371/journal.pone.0198645
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1The work flow of creating directed networks.
A: Schematic of the Multiple Sequence Alignment and impact calculation B: Example of the impact analysis of one of the amino acids of DHFR performed on 2303 sequences obtained from Pfam database [24] (Pfam Id: PF00186) using PDB id 3QL3 as a reference. The green and blue lines drawn at 0.7 and 0.8 represent the two cut-offs. Amino acid 27 impacts no amino acids with γ = 0.8 and 3 at γ = 0.7. The data point at (1,1) is the identity relation showing the dependence of 27 on itself. It is not used in the analyses. C: Partial network that was constructed for the impact of amino acid 27 and γ = 0.7.
Fig 2Amino acid residues with non-zero impact factor represented on the three dimensional structures of proteins.
A: DHFR. B: serine protease. Impact factor (amino acids) for DHFR is: 3 (27), 2 (3, 57, 146), 1 (13, 14, 22, 31, 32, 55, 58, 90, 95, 135, 138, 149) and for serine protease is 8 (196), 3 (140, 194), 2 (19, 34, 102, 142, 182, 183, 184, 216, 228), 1 (29, 32, 40, 42, 57, 58, 100, 122, 136, 168, 189, 191, 201, 211, 226, 237). The coloring convention for PDBs is: Impact 0—gray, Impact 1—blue, Impact 2—cyan, Impact 3—green, Impact 8—red.
Fig 3Directed networks and their functional relevance.
Residue networks for A: DHFR (PDB Id:3QL3) and B: Serine protease (PDB Id:3TGI). The direction of the arrow shows is in the direction of impact. The thickness of the arrows is proportional to 1/r where r is the distance between pair of amino acids in the crystal structure. The functional annotation of the amino acids inferred from literature is shown as well.
Fig 4Comparisons of impact with other measures.
A: Impact vs. conservation shows that the impact does not trivially repeat the same information contained in conservation. B: Impact vs. dependency shows again in addition to the expected negative correlation between the two, there are several deviations from it. Impact was calculated with γ = 0.7. (Size of the marker shows the density of points at that position.)