| Literature DB >> 29905875 |
Eloy A Colell1, Javier A Iserte1, Franco L Simonetti1, Cristina Marino-Buslje1.
Abstract
Correlated mutations between residue pairs in evolutionarily related proteins arise from constraints needed to maintain a functional and stable protein. Identifying these inter-related positions narrows down the search for structurally or functionally important sites. MISTIC is a server designed to assist users to calculate covariation in protein families and provide them with an interactive tool to visualize the results. Here, we present MISTIC2, an update to the previous server, that allows to calculate four covariation methods (MIp, mfDCA, plmDCA and gaussianDCA). The results visualization framework has been reworked for improved performance, compatibility and user experience. It includes a circos representation of the information contained in the alignment, an interactive covariation network, a 3D structure viewer and a sequence logo. Others components provide additional information such as residue annotations, a roc curve for assessing contact prediction, data tables and different ways of filtering the data and exporting figures. Comparison of different methods is easily done and scores combination is also possible. A newly implemented web service allows users to access MISTIC2 programmatically using an API to calculate covariation and retrieve results. MISTIC2 is available at: https://mistic2.leloir.org.ar.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29905875 PMCID: PMC6030873 DOI: 10.1093/nar/gky419
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.MISTIC2 workspace: upper left: circo visualization of the MSA information. Amino acid names and positions are in the outer ring. Conservation (second ring) from light blue (lower) to red (higher); cScore (third ring) from yellow (lower) to violet (higher); pScore (inner ring) from green (lower) to red (higher). Inner lines are the top 5% covariation scores. Bottom left: covariation network colored by conservation. Right: covariation network's selected edges mapped onto the 3D structure (ribbon representation, pdb code: 4LPK_B).
Figure 2.AUC for residue contacts prediction. It can be observed that in this case (Pfam family: PF00071, pdb: 4LPK_B), there are methods that outperformed others. plmDCA method has the best predictive performance: plmDCA > all (plmDCA + Gaussian DCA + mfDCA + MI) > Gaussian DCA > mfDCA > MI.
Figure 3.(A) MIp covariation network with nodes colored by conservation, from light blue (lower) to red (higher). (B) Ribbon representation of the overall structure (PDB 4LPK_B) with surface representation of the GDP/GTP interaction site, colored as in the network. It is worth noting the high conservation in the binding site. (C) A section of the sequence logo which translates to the network conservation coloring schema.
Figure 4.Left panel:network; right panel: structure. Colored by pMI from green to red (lower to higher scores).