| Literature DB >> 34944504 |
Luciana Esposito1, Nicole Balasco1, Giovanni Smaldone2, Rita Berisio1, Alessia Ruggiero1, Luigi Vitagliano1.
Abstract
One of the most striking features of KCTD proteins is their involvement in apparently unrelated yet fundamental physio-pathological processes. Unfortunately, comprehensive structure-function relationships for this protein family have been hampered by the scarcity of the structural data available. This scenario is rapidly changing due to the release of the protein three-dimensional models predicted by AlphaFold (AF). Here, we exploited the structural information contained in the AF database to gain insights into the relationships among the members of the KCTD family with the aim of facilitating the definition of the structural and molecular basis of key roles that these proteins play in many biological processes. The most important finding that emerged from this investigation is the discovery that, in addition to the BTB domain, the vast majority of these proteins also share a structurally similar domain in the C-terminal region despite the absence of general sequence similarities detectable in this region. Using this domain as reference, we generated a novel and comprehensive structure-based pseudo-phylogenetic tree that unraveled previously undetected similarities among the protein family. In particular, we generated a new clustering of the KCTD proteins that will represent a solid ground for interpreting their many functions.Entities:
Keywords: protein structure function; protein structure predictions; structure-based evolutionary trees
Mesh:
Substances:
Year: 2021 PMID: 34944504 PMCID: PMC8699099 DOI: 10.3390/biom11121862
Source DB: PubMed Journal: Biomolecules ISSN: 2218-273X
Structures of KCTD proteins in the PDB and their similarity with the corresponding AlphaFold models. If multiple crystal structures of the same protein/domain were present, the one determined at the highest resolution was used for calculation of RMSD values.
| Protein | Domain/Complex | PDB Code(s)/References | Resolution(s) (Å) | RMSD (Å) (#) 1 |
|---|---|---|---|---|
| KCTD1 | BTB | 5BXD/5BXB [ | 1.8/2.2 | 0.6 (103)/0.4 (103) |
| Full-length | 6S4L | 2.4 | 0.8 (205) | |
| KCTD5 | BTB | 3DRZ [ | 1.9 | 0.8 (102) |
| Full-length | 3DRX/3DRY [ | 3.1/3.3 | 2.1 (163)/2.4(133) | |
| KCTD8 | CTD | 6G57 | 2.8 | 1.8 (88) |
| KCTD9 | BTB | 5BXH [ | 2.8 | 0.7 (97) |
| KCTD10 | BTB | 5FTA [ | 2.6 | 0.4 (96) |
| KCTD12 | CTD | 6QZL | 2.0 | 0.7 (100) |
| CTD-Gβ1γ2 | 6M8S [ | 3.7 | 0.8 (103) | |
| KCTD13 | BTB | 4UIJ [ | 2.7 | 0.3 (102) |
| KCTD16 | BTB | 5A15/6OCR/6I0Q/6OCT [ | 2.8/2.3/2.3 | 0.5 (91)/0.4 (90)/0.7 (95)/0.5 (91) |
| CTD | 6QB7 | 2.2 | 0.7 (110) | |
| BTB-GABAB2 pept | 6OCP/6M8R [ | 2.3/3.2 | 0.5 (91)/0.5 (93) | |
| BTB | 5A6R [ | 2.8 | 0.7 (101) | |
| BTB | 4CRH [ | 1.7 | 1.0 (90) |
1 This represents the number of the superimposed residues.
Figure 1Representative AF-predicted models of the following KCTD proteins: KCTD1 (a), KCTD5 (b), KCTD3 (c), KCTD9 (d), and KCTD19 (e). The color code of the cartoons follows that used by AlphaFold to report the reliability of the models. Blue, cyan, orange, and yellow protein regions correspond to very high (LDDT > 90), high (90 > LDDT > 70), low (70 > LDDT > 50), and very low (LDDT < 50) model confidence, respectively.
Details of the KCTD AlphaFold models.
| Protein | AlphaFold | Low Structured and/or Low Confident Regions 1 | BTB Domain | CTD Domain |
|---|---|---|---|---|
| KCTD1 | AF-Q719H9 | 1–17/242–257 | A30-T133 | P139-L239 |
| KCTD2 | AF-Q14681 | 1–71/239–263 | R72-T178 | V183-N239 |
| KCTD3 | AF-Q9Y597 | 1–17/139–181/598–815 | E18-L115 | - 4 |
| KCTD4 | AF-Q8WVF5 | 1–32/198–217 | T33-L135 | T140-K259 |
| KCTD5 | AF-Q9NXV2 | 1–41/211–234 | V42-T149 | V154-N211 |
| KCTD6 | AF-Q8NC69 | - | D12-D107 | M114-K234 |
| KCTD7 | AF-Q96MP8 | 1–46/196–224 | P50-G158 | Y162-W289 |
| KCTD8 | AF-Q6ZWB6 | 1–41/148–203/325–434 | E44-L145 | R205-P322 |
| KCTD9 | AF-Q7L273 | 73–88 | D89-S191 | - 4 |
| KCTD10 | AF-Q9H3F6 | 1–18/274–295 | Y33-Q129 | P141-E260 |
| KCTD11 2 | AF-Q693B1 | - | G14-A123 | A126-H271 |
| KCTD12 | AF-Q96CX2 | 1–29/132–204/ | P34-A131 | R206-E325 |
| KCTD13 | AF-Q8WZ19 | 1–40/272–303/311–329 | K41-E142 | I150-T271 |
| KCTD14 | AF-Q9BQ13 | 1–28 | T33-D124 | M125-W255 |
| KCTD15 | AF-Q96SI1 | 1–43/266–283 | A56-R162 | A165-E265 |
| KCTD16 | AF-Q68DU8 | 1–21/124–161/281–389 | E25-T122 | K162-P280 |
| KCTD17 | AF-Q8N5Z5 | 1–29/197–260/265–321 | G30-V135 | P139-H196 |
| KCTD18 | AF-Q6PI47 | 242–426 | D12-S118 | P140-L242 |
| KCTD19 | AF-Q17RG1 | 108–171/259–282/ | D13-E107/ | - 4 |
| KCD20 | AF-Q7Z5Y7 | 1–115/359–419 | E117-C216 | D218-E356 |
| KCTD21 | AF-Q4G0X4 | - | P4-K107 | N109-R260 |
| KCNRG | AF-Q8N5I3 | 246–272 | E5- Q104 | P107-I245 |
| SH3KBP1 | AF-Q8TBC3 | 139–192/604–707 | E19-R118 | - 4 |
| TNFAIP1 | AF-Q13829 | 1–27/258–290/299–316 | K28-S130 | I137-E257 |
| BTBD10 | AF-Q9BSF8 | 1–148/409–475 | M149-C266 | D268-W405 |
1 Confident regions correspond to those with AF parameter LDDT > 70. Unstructured regions correspond to long (>15 residues) fragments with a marginal content of secondary structure and a low confidence prediction (LDDT < 70). 2 The sequence numbering refers to the long variant. The structural data were derived from the model obtained with the Colab server (see Methods for details). 3 The three regions correspond to the three BTB domains of the protein. 4 The KCTD-CTD domain has been not clearly identified in the AF structure of the protein (see text).
Figure 2Schematic representation of protein domain organization. For the KCTD10, KCTD13, and TNFAIP1 proteins, AF predicts that a potential β-strand of the flexible C-terminal region could join the β-sheet of the CTD domain. The Dom domain of KCTD19 presents some similarity to the TAFH domain.
Dali Z-score similarity matrix derived from the structural alignment of protein C-terminal domains. The clade definition is the one used in Figure S1. Dashes are reported for pairs for which Dali does not detect any significant similarity. The names of the proteins are abbreviated due to space limitations.
| K1 | K15 | K8 | K12 | K16 | K6 | K11 | K21 | K4 | K20 | B10 | K18 | K19 | KCN | K7 | K14 | K10 | K13 | TNF | K2 | K5 | K17 | K3 | SHK | K9 | ||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Clade |
| 23.3 | 19.6 | 10.1 | 9.6 | 9.8 | 8.1 | 5.9 | 2.7 | 2.2 | - | - | 2.9 | - | 9.5 | 2.3 | 2.5 | - | - | - | 4.1 | 3.9 | 3.7 | - | - | - |
|
| 23.3 | 10.4 | 9.9 | 10.1 | 7.9 | 5.1 | 2.1 | 2.0 | - | - | 2.7 | - | 9.8 | 2.6 | 3.6 | - | - | - | 4.9 | 4.8 | 5.0 | - | - | - | ||
| Clade |
| 22.8 | 19.5 | 20.4 | 7.8 | 5.8 | - | 2.0 | - | - | - | - | 8.1 | 2.3 | - | - | - | - | 2.7 | 2.8 | 2.8 | - | - | - | ||
|
| 22.8 | 19.2 | 7.6 | 5.7 | - | - | - | - | - | - | 8.9 | - | - | - | - | - | 2.5 | 2.5 | 2.6 | - | - | - | ||||
|
| 22.7 | 7.5 | 5.0 | - | 2.1 | - | - | 2.2 | - | 8.0 | - | 2.3 | - | - | - | 3.2 | 3.0 | 3.3 | - | - | - | |||||
| Clade |
| 24.4 | 7.8 | 5.8 | 3.3 | - | - | 3.1 | 2.2 | 11.6 | 2.2 | 2.3 | - | - | - | 3.7 | 3.8 | 3.5 | - | - | - | |||||
|
| 26.9 | 14.2 | 5.3 | - | - | - | - | 8.7 | - | - | - | - | - | - | - | - | - | - | - | |||||||
|
| 25.2 | 2.1 | - | - | - | - | 9.2 | - | - | - | - | - | - | - | - | - | 2.7 | - | ||||||||
|
| 27.3 | - | - | - | 2.3 | 3.3 | - | - | - | - | - | - | - | - | - | - | - | |||||||||
| Clade |
| 25.3 | 22.9 | - | - | - | 5.6 | 6.6 | - | - | - | - | - | - | - | - | - | |||||||||
|
| 25.0 | - | 2.2 | 2.1 | 5.7 | 6.6 | - | - | - | - | - | - | - | - | - | |||||||||||
|
| 21.8 | - | 2.5 | - | - | - | - | - | - | - | - | - | - | - | ||||||||||||
|
| 54.7 | - | - | 2.0 | - | - | - | - | - | - | - | - | - | |||||||||||||
|
| 27.0 | 3.8 | 3.4 | - | - | - | 4.8 | 4.5 | 4.8 | - | - | - | ||||||||||||||
|
| 24.9 | 13.4 | - | - | - | 3.0 | 3.0 | - | - | - | - | |||||||||||||||
|
| 27.1 | 2.4 | 2.2 | 2.4 | 3.0 | 3.0 | 3.0 | - | - | - | ||||||||||||||||
| Clade |
| 25.6 | 21.4 | 23.5 | - | - | - | - | - | - | ||||||||||||||||
|
| 27.0 | 22.4 | - | - | - | - | - | - | ||||||||||||||||||
|
| 26.7 | - | - | - | - | - | - | |||||||||||||||||||
| Clade |
| 12.1 | 10.7 | 10.8 | - | - | - | |||||||||||||||||||
|
| 12.0 | 10.2 | - | - | - | |||||||||||||||||||||
|
| 11.9 | - | - | - | ||||||||||||||||||||||
| Clade |
| 62.7 | 50.3 | - | ||||||||||||||||||||||
|
| 63.6 | - | ||||||||||||||||||||||||
|
| 39.6 |
Figure 3Dali dendrogram generated using the structure of the KCTD-CTD domain. Representative three-dimensional models of the folded domains present in the C-terminal region are also shown. Helices and β-sheets are shown in cyan and magenta, respectively. The cluster numbering is reported in red.