| Literature DB >> 24073209 |
Gandhimathi Arumugam1, Anu G Nair, Sridhar Hariharaputran, Sowdhamini Ramanathan.
Abstract
Analysis of structural variation in domain superfamilies can reveal constraints in protein evolution which aids protein structure prediction and classification. Structure-based sequence alignment of distantly related proteins, organized in PASS2 database, provides clues about structurally conserved regions among different functional families. Some superfamily members show large structural differences which are functionally relevant. This paper analyses the impact of structural divergence on function for multi-member superfamilies, selected from the PASS2 superfamily alignment database. Functional annotations within superfamilies, with structural outliers or 'rebels', are discussed in the context of structural variations. Overall, these data reinforce the idea that functional similarities cannot be extrapolated from mere structural conservation. The implication for fold-function prediction is that the functional annotations can only be inherited with very careful consideration, especially at low sequence identities.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24073209 PMCID: PMC3779223 DOI: 10.1371/journal.pone.0074416
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Total number of multi-member superfamilies and superfamilies having structurally deviant domains according to structural class.
(Please see Methods for definition of ‘structural deviants’).
Details of all family-specific outliers in PASS2a multi-member superfamilies.
| S.NO | SCOPCode |
| Number ofPASS2 | Outliers | Outliers_family | Reasons |
| 1 | 55103 | FAD-linked oxidases, C-terminaldomain | 5 | d2i0ka1 | Cholesterol oxidase | Structural and Conformational difference |
| 2 | 81593 | Nucleotidyl transferase substratebinding subunit/domain | 5 | d1v4aa1 | HEPN domain | Structural and Conformational difference |
| 3 | 49695 | gamma-Crystallin-like | 12 | d1bhua_ | Streptomyces metalloproteinaseinhibitor, SMPI | Conformational difference of secondary structures |
| 4 | 57262 | Leech antihemostatic proteins | 5 | d1bx7a_ | Huristasin-like | Structural and Conformational difference |
| 5 | 50447 | Translation proteins | 20 | d1vqob1, d2gycb1 | Ribosomal protein L3 | Distinct topology and architecture |
| 6 | 55909 | Pentein | 12 | d1g61a_, d1g62a_ | Ribosome anti-association factoreIF6 | Distinct topology and architecture |
| 7 | 56059 | Glutathione synthetaseATP-binding domain-like | 22 | d1eucb2, d2nu7b2 | Succinyl-CoA synthetase, beta-chain,N-terminal domain | Distinct architecture and topology |
| 8 | 143437 | THUMP domain-like | 4 | d1rkia1 | PAE0736-like | Distinct architecture and topology |
| 9 | 51219 | TRAP-like | 3 | d1gtfa_ | Trp RNA-binding attenuationprotein (TRAP) | Distinct architecture and topology |
| 10 | 51246 | Rudiment single hybrid motif | 7 | d1e2w2 | Cytochrome f, small domain | Distinct architecture and topology |
| 11 | 52799 | (Phosphotyrosine protein)phosphatases II | 20 | d1ywfa1 | Mycobacterial PtpB-like | Distinct architecture and topology |
| 12 | 56747 | Prim-pol domain | 4 | d1ro0a_ | Bifunctional DNA primase/polymerase N-terminal domain | Distinct architecture and topology |
| 13 | 56935 | Porins | 11 | d1t16a_ | Outer membrane proteintransport protein | Distinct architecture and topology |
| 14 | 50692 | ADC-like* | 16 | d1ppya_, d1uhez1 | Pyruvoyl dependent aspartate decarboxylase, ADC | Distinct architecture and topology |
| 15 | 52016 | LeuD/IlvD-like | 6 | 2gp4a1 | IlvD/EDD C-terminal domain-like | Distinct topology and architecture |
| 16 | 55088 | Methyl-coenzyme M reductasesubunits | 3 | d1hbnc_ | Methyl-coenzyme M reductasegamma chain | Distinct topology and architecture |
| 17 | 48113 | Heme-dependent peroxidases* | 8 | d1cxpz1, d1q4ga1 | Myeloperoxidase-like | Deviation concentrated in specific taxa to different mode of substrate binding |
| 18 | 51730 | FAD-linked oxidoreductase* | 3 | d1tj1a2 | Proline dehydrohenase domainof bifunctional PutA protein | Circular permutation and different EC number |
| 19 | 56655 | Carbohydrate phosphatase | 8 | d1ni9a_ | GlpX-like bacterial fructose-1,6-bisphosphatase | Permuted topology |
| 20 | 46548 | Alpha-helical ferredoxin* | 4 | d1gtea1 | Dihydropyrimidinedehydrogenase,N-terminal domain | Different EC number and structural elaborations with added secondary structural elements |
| 21 | 110849 | ParB/Sulfiredoxin | 4 | d1vz0a2 | ParB-like nuclease domain | Difference in topology and secondary structure content |
| 22 | 46934 | UBA-like | 30 | d1mn3a_ | CUE domain | Difference in topology and secondary structures |
| 23 | 55144 | LigT-like | 4 | d2ilxa1 | 2′,3′-cyclic nucleotide 3′-phosphodiesterase, catalytic domain | Difference in the structural topology |
| 24 | 81631 | PAP/OAS1 substrate-binding domain* | 5 | d2pbea1 | AadK C-terminal domain-like | Difference in the architecture of the secondary structures. |
| 25 | 48295 | TrpR-like | 4 | d1jhga_ | Trp repressor, TrpR | Difference in the topology of secondary structures |
| 26 | 82615 | Polo-box domain * | 3 | d1mbya_ | Swapped Polo-box domain | Domain swapping |
| 27 | 50715 | Ribosomal protein L25-like | 4 | d1gtra1 | Gln-tRNA synthetase (GlnRS),C-terminal (anticodon-binding)domain | Duplication, consists of two barrel domains with the swapping of N-terminal strands |
| 28 | 56436 | C-type lectin-like | 36 | d1t61a1, d1t61a2 | Noncollagenous (NC1) domain ofcollagen IV | Segment swapping with subdomains of domain |
| 29 | 82057 | Prokaryotic SH3-related domain | 5 | d1xova1 | Ply C-terminal domain-like | tandem repeat of two SH3-like domains swapped with the N-terminal strands |
| 30 | 52266 | SGNH hydrolase * | 13 | d1flca2 | Esterase domain of haemagglutinin-esterase-fusion glycoprotein HEF1 | Extension of N and C-terminal part |
| 31 | 53649 | Alkaline phosphatase-like | 9 | d1ei6a_ | Phosphonoacetate hydrolase | Insertion of alpha+beta subdomain near C-terminus |
| 32 | 55003 | PAP/Archaeal CCA-adding enzyme, C-terminal domain | 3 | d1r89a3 | Archaeal tRNA CCA-adding enzyme | Insertion of large part of secondary structures |
| 33 | 55781 | GAF domain-like | 12 | d1stza2 | HrcA C-terminal domain-like | Insertion of a secondary structures |
| 34 | 55154 | CYTH-like phosphatases | 5 | d1d8ia_ | mRNA triphosphatase CET1 | Extra elaborations to the core |
| 35 | 50939 | Sialidases | 11 | d1v0ea1 | Endo-alpha-sialidase | Outlier has extra N-terminal domain and insert domain |
| 36 | 89095 | GatB/YqeY motif | 4 | d1ng6a_ | GatB/YqeY domain | Difference in secondary structure content due to insertion |
| 37 | 56634 | Heme-dependent catalase-like | 4 | d1u5ua_ | Allene oxide synthase | Difference in secondary structural content and conformation |
| 38 | 51679 | Bacterial luciferase-like | 7 | d1nfpa_ | Non-fluorescent flavoprotein (luxF, FP390) | Incomplete core structure of beta/alpha barrel with mixed beta-sheet of 7 strands |
| 39 | 48317 | Acid phosphatase/Vanadium-dependent haloperoxidase | 4 | d1vnsa_ | Chloroperoxidase | Duplication of secondary structures |
| 40 | 89957 | MTH1187/YkoF-like* | 6 | d1s99a_ | Putative thiamin/HMP-binding protein YkoF | Internal repeat or duplication |
| 41 | 55205 | EPT/RTPC-like | 4 | d1qmha2 | RNA 3′-terminal phosphate cyclase, RPTC | Non-duplicated fold of |
Please see http://caps.ncbs.res.in/pass2/for full alignments.
@The superfamilies marked with ‘*’ symbol are further considered for illustration. Also see http://caps.ncbs.res.in/download/pass2_outliers/FIR_map/for mapping of functionally important residues for each of these superfamilies.
Figure 2Mean RMSD plot for 8973 members of 731 multi-membered superfamilies.
Outlier protein domains are with RMSD greater than 5.5 Å. (Please see Methods definition of structure deviants).
Figure 3Topological differences seen in ADC-like superfamily.
(a) A representative structure of the ADC like superfamily that has ferredoxin reductase-like topology (b) Double psi-β-barrel fold observed in Pyruvoyl dependent aspartate decarboxylase (ADC) family. (c) Secondary structure arrangement and topological connections observed in ferredoxin reductase fold. (d) Arrangements as seen in double psi-β-barrel fold.
Figure 4Representative structures of mammalian ((a) MPO (b) PGHS) and (c) non-mammalian peroxidases (1gwu).
Both MPO and PGHS are outliers belonging to the mammalian myeloperoxidase family (d)–(f).The peroxidase active site residues and interaction with Heme is shown. In all the cases, the proximal His (H336, H388 and H163) is involved in coordination. Helix H2 and Helix B interact with heme group and are highlighted in red color in all the three structures.
Figure 5The structural difference between the MTHFR and PutA PRODH families.
1tj1 is the structurally deviant member of the FAD-linked oxidoreductase superfamily. The topology of 1tj1 structure is slightly different from the classic β8α8 barrel topology and also functionally diverse. The Helix number is also shown in the figure. The pdb ID and the chain ID along with their EC number and enzymatic activity are mentioned. The figures are made in pymol with spectrum coloring which shows N-terminus(blue) to C-terminus(red).(d) The superposed pose of 1b5t and 1v93 (f) Superimposed view of all the three domains (1b5t,1v93 and 1tj1).The N-terminus helix in 1tj1 is aligned with C-terminal helix of the other two domains (shown with an arrow). All other helices are not aligned properly.
Figure 6Structural view of the four domains of alpha-helical ferredoxin superfamily.
1gte:A1 is the structurally deviant member of the superfamily with different E.C. number. It has slightly elongated N-terminal tail part. All the other three domains superimpose well with less than 3Å RMSD.
Figure 7Structural view of all the domains PAP/OAS1 substrate-binding domain superfamily.
Among the five domains, 2pbe:A1 is structurally and functionally different member. The architecture of the domain is different from all the other domains.
Figure 8The structural view of members of the polo-box domain superfamily.
(a) and (b) are polo-box domain from the family “polo-box duplicated region” (c) 1mby:A is a structurally deviant member of the superfamily which belongs to swapped polo-box domain family.(d) superimposed view of all the three domains shows the alignment is not good. (e) Dimeric form of Polo-box domain in swapped conformation (PDB ID: 1mby). The swapped part is highlighted in red.
Figure 9Structural view of all the distantly related domains of SGNH hydrolase superfamily.
Conformations of N- and C-terminal part are conserved in all domains except 1FLC:A2. This domain has extra elongated N and C-terminal part which is involved in fusion.
Figure 10Structural view of members of MTH1187/YkoF-like superfamily.
The domain1S99:A has internal repeat of βαβ fold highlighted in grey colour.
Figure 11Comparison of mean structural deviation (rmsd) of the members (shown in X-axis) and mean GO semantics scores for ADC-like superfamily (shown in Y-axis).
Higher RMSD reflects higher structural deviation and lower mean GO semantics shows lower functional correspondence of a member with other members of that superfamily. The points corresponding to outliers are shown in red colour and non-outlier members of a superfamily are marked with blue colour.