| Literature DB >> 20193073 |
Krishanu Mukherjee1, Everly Conway de Macario, Alberto J L Macario, Luciano Brocchieri.
Abstract
BACKGROUND: Chaperonin proteins are well known for the critical role they play in protein folding and in disease. However, the recent identification of three diverged chaperonin paralogs associated with the human Bardet-Biedl and McKusick-Kaufman Syndromes (BBS and MKKS, respectively) indicates that the eukaryotic chaperonin-gene family is larger and more differentiated than previously thought. The availability of complete genome sequences makes possible a definitive characterization of the complete set of chaperonin sequences in human and other species.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20193073 PMCID: PMC2846930 DOI: 10.1186/1471-2148-10-64
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
The human hsp60 genes
| Name1 | Alternative names | Start2 | End3 | Str4 | Chr5 | Loc6 | IF7 | Exons8 | aa9 |
|---|---|---|---|---|---|---|---|---|---|
| TCP1, CCTa, CCTα TCP-1-α | 160,119,520 | 160,130,731 | - | 6 | q25.3 | 2 | 12, 7 | 556, 401 | |
| CCT β, TCP-1-β | 68,266,317 | 68,280,052 | + | 12 | q15 | 1 | 14 | 535 | |
| CCT γ, TCP-1-γ | 154,545,617 | 154,572,307 | - | 1 | q23.1 | 3 | 13, 13, 12 | 545,544, 507 | |
| CCT δ, TCPD, TCP-1-δ | 61,950,076 | 61,969,146 | - | 2 | p15 | 1 | 13 | 539 | |
| CCT ε, TCP1E, TCP-1-ε | 10,303,453 | 10,317,892 | + | 5 | p15.2 | 1 | 11 | 541 | |
| CCT ζ, CCT ζ-1, TCP-1-ζ, CCT6, Cctz, HTR3, TCP20, TCPZ, TTCP20 | 56,087,036 | 56,098,269 | + | 7 | p11.2 | 2 | 14, 13 | 531,486 | |
| CCT ζ-2, TCP-1-ζ-2, Cctz2, TSA303, Tcp20 | 30,279,183 | 30,312,525 | - | 17 | q12 | 1 | 14 | 530 | |
| CCT η, TCP-1-η, Ccth, NIP7-1 | 73,320,279 | 73,333,494 | + | 2 | p13.2 | 2 | 12, 7 | 543,339 | |
| CCT θ, TCP-1-θ, Cctq | 29,350,670 | 29,367,782 | - | 21 | q21.3 | 1 | 15 | 548 | |
| LOC155100 | 151,773,495 | 151,775,165 | + | 7 | q36.1 | 1 | 1 | 557 | |
| GROL, CESK1 | 15,451,770 | 15,453,440 | - | 22 | q11.1 | 1 | 1 | 557 | |
| BBS6 | 10,333,898 | 10,342,162 | - | 20 | p12.2 | 2 | 4, 4 | 570,570 | |
| C12orf58, FLJ23560 | 75,263,727 | 75,266,269 | - | 12 | q21.2 | 1 | 2 | 723 | |
| C4orf24, FLJ35630, FLJ41559 | 123,882,498 | 123,884,627 | + | 4 | q27 | 1 | 1 | 710 | |
| GROEL, HSP60, SPG13, CPN60, HuCHA60 | 198,060,018 | 198,071,817 | - | 2 | q33.1 | 2 | 11, 11 | 573,573 | |
| CFD, FAB1, PIP5K, PIP5K3 | 209,182,591 | 209,190,094 | + | 2 | q34 | 1 | 5 | 224 |
1Official NCBI Entrez gene database name; 2Start and 3End of coding region; 4Strand "+" indicates sequenced strand. "-" indicates complementary strand; 5Chromosome; 6Chromosome location; 7Number of isoforms; 8Number of exons. Multiple numbers indicate the number of exons in each isoform; 9Total amino acids; 10The official Entrez name is TCP1. CCT1 improves consistency with other subunit gene names. 11Fab1_TCP sequence domain of PIKFYVE kinase, most similar to the apical domain of CCT3. Features refer to the domain portion of the gene/protein.
The human hsp60 pseudogenes
| Name1 | Start2 | End3 | Str4 | Chr5 | Loc6 | Ex7 | P/D8 | Ka/Ks9 | LRT10 | FS11 | SC12 | aa13 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| CCT1-1P | 19,986,638 | 19,987,216 | + | 12 | p12.2 | 1 | P | 0.75 | 0.16 | 5 | 2 | 190 |
| CCT1-2P | 41,621,756 | 41,623,646 | - | 5 | p13.1 | 2? | D | 1.21 | 0.15 | 7 | 5 | 512 |
| CCT1-3P14 | 42,801,030 | 42,802,033 | + | 7 | p14.1 | 3 | D | 0.68 | 2.10 | 1 | 1 | 367 |
| CCT3-1P | 16,177,578 | 16,178,178 | + | 8 | p22 | 1 | P | 1.02 | 0.0 | 2 | 2 | 159 |
| CCT4-1P | 64,177,578 | 64,409,590 | + | X | q12 | 3 | D | 0.65 | 1.76 | 0 | 3 | 512 |
| CCT4-2P | 140,344,301 | 140,345,787 | - | 7 | q34 | 4 | D | 0.82 | 1.24 | 2 | 10 | 278 |
| CCT5-1P14,15 | 78,382,086 | 78,382,680 | + | 13 | q31.1 | 1 | P | 0.81 | 0.20 | 3 | 4 | 549 |
| CCT5-2P15 | 78,382,866 | 78,382,967 | - | 13 | q31.1 | 1 | ? | - | - | - | 1 | 34 |
| CCT5-3P | 114,876,388 | 114,877,290 | + | 5 | q22.3 | 1 | P | 0.25 | 2.12 | 6 | 3 | 201 |
| CCT6-1P | 14,692,965 | 14,693,954 | - | 5 | p15.2 | 1 | P | 1.00 | 0.0 | 2 | 2 | 330 |
| CCT6-2P | 109,013,584 | 109,014,117 | - | 11 | q22.3 | 1 | P | 0.90 | 0.0 | 0 | 2 | 178 |
| CCT6-3P16 | 64,162,812 | 64,171,325 | + | 7 | q11.21 | 8 | D | 0.43 | 3.06 | 5 | 4 | 289 |
| CCT6-4P | 191,915,332 | 191,916,879 | + | 3 | q28 | 1 | P | 0.57 | 9 | 4 | 292 | |
| CCT6-5P14,16 | 64,853,564 | 64,865,440 | + | 7 | q11.21 | 10 | D | 0.84 | 0.34 | 12 | 6 | 399 |
| CCT7-1P14 | 92,251,627 | 92,307,366 | - | 5 | q15 | 1 | P | 0.45 | 1.88 | 3 | 5 | 145 |
| CCT7-2P | 150,242,815 | 150,243,240 | + | 6 | q25.1 | 1 | P | 0.87 | 0.10 | 3 | 4 | 552 |
| CCT8-1P14 | 145,141,482 | 145,143,137 | - | 1 | q21.1 | 1 | P | 1.14 | 0.10 | 2 | 3 | 561 |
| HSPD1-1P14 | 135,744,902 | 135,745,039 | - | 5 | q31.1 | 1 | P | 1.46 | 0.27 | 0 | 0 | 48 |
| HSPD1-2P14,17 | 21,919,402 | 21,920,175 | - | 5 | p14.3 | 1 | P | 0.90 | 0.10 | 1 | 1 | 264 |
| HSPD1-3P | 43,602,029 | 43,602,280 | - | 20 | q13.12 | 4 | D | - | - | 0 | 1 | 84 |
| HSPD1-4P | 88,065,673 | 88,066,269 | + | 6 | q15 | 1 | P | 0.55 | 1.08 | 4 | 5 | 199 |
| HSPD1-5P14 | 55,191,053 | 55,192,769 | + | 12 | q13.2 | 1 | P | 0.56 | 3.08 | 2 | 1 | 499 |
| HSPD1-6P14 | 36,783,612 | 36,785,195 | - | 3 | P22.3 | 1 | P | 0.59 | 2.46 | 2 | 2 | 443 |
| HSPD1-7P18 | 7,263,938 | 7,265,475 | + | 8 | p23.1 | 1 | P | 1.12 | 0.18 | 5 | 4 | 396 |
| HSPD1-8P | 145,986,418 | 145,987,946 | + | 4 | q31.21 | 1 | P | 0.63 | 2.32 | 4 | 3 | 458 |
| HSPD1-9P18 | 7,785,932 | 7,787,502 | - | 8 | p23.1 | 1 | P | 0.91 | 0.08 | 5 | 3 | 416 |
| HSPD1-10P | 8,058,884 | 8,082,857 | + | 12 | p13.31 | 2? | D | 0.78 | 0.94 | 1 | 1 | 307 |
| HSPD1-11P | 95,130,459 | 95,132,169 | + | 5 | q15 | 5 | D | 0.74 | 2.44 | 6 | 6 | 375 |
| HSPD1-12P | 78,321,372 | 78,323,341 | + | 13 | q31.1 | 1 | P | 0.62 | 5 | 4 | 410 | |
| HSPD1-13P | 153,068,626 | 153,068,943 | + | 6 | q25.2 | 1 | P | 0.54 | 2.84 | 1 | 2 | 108 |
| HSPD1-14P14 | 37,465,288 | 37,466,827 | - | 13 | q13.3 | 4 | D | 0.68 | 3.3 | 6 | 3 | 361 |
| HSPD1-15P | 19,269,394 | 19,270,353 | + | 5 | p14.3 | 4 | D | 0.74 | 1.24 | 4 | 4 | 241 |
| HSPD1-16P | 105,082,802 | 105,083,755 | + | 11 | q22.3 | 2? | D | 0.51 | 5 | 4 | 199 | |
| HSPD1-17P | 34,077,070 | 34,078,293 | + | 1 | p35.1 | 3 | D | 0.74 | 1.48 | 2 | 2 | 217 |
| HSPD1-18P | 56,105,684 | 56,108,736 | + | 20 | q13.32 | 5 | D | 0.48 | 3 | 2 | 299 | |
| HSPD1-19P | 50,318,868 | 50,319,008 | + | 10 | q11.23 | 1 | ? | 2.42 | 0.72 | 0 | 0 | 47 |
| HSPD1-20P | 78,924,341 | 78,924,478 | - | 12 | q21.31 | 1 | ? | 0.40 | 3.08 | 0 | 0 | 46 |
| HSPD1-21P | 60,994,430 | 60,994,876 | - | 5 | q12.1 | 6 | D | - | - | 0 | 6 | 155 |
| HSPD1-22P | 29,181,851 | 29,183,334 | - | 21 | q21.3 | 2? | D | 0.69 | 1.4 | 3 | 4 | 344 |
1Pseudogene names follow the HUGO nomenclature. They are composed of the name of the parental gene followed by a unique number identifier and the suffix "P" (Pseudogene); 2Start and 3 End positions of the pseudogene on the chromosome; 4Strand; 5Chromosome; 6Location on the chromosome; 7Number of exons. A question mark indicates gene fragments with uncertain numbers of exons; 8Processed (P), duplicated (D) or undetermined (?); 9Ratio of non-synonymous vs. synonymous substitution rates; 10Likelihood Ratio Test (LRT) values. Values different from 1.0 with probability p < 0.01 (**) or p < 0.05 (*) are shown in bold-face; 11Number of Frame-Shifts recognized in the coding region of the pseudogene; 12Number of in-frame Stop Codons recognized in the coding region of the pseudogene; 13Length in amino acids of pseudo-translation of the recognized pseudogene sequence; 14Ten pseudogenes previously reported in the Ensembl (roman), Pseudogene.org (italics) or NCBI (bold) databases: CCT1-3P = OTTHUMG00000033751; CCT5-1P = Human.chr13.mb78; CCT6-5P = ENSP00000275603, Human.chr7.mb64; CCT7-1P = ENST00000399032; CCT8-1P = Human.chr1.mb145; HSPD1-1P = ENSG00000162241, Human.chr5.mb135; HSPD1-2P = ENSP00000328369; HSPD1-5P = LOC644745; HSPD1-6P = LOC645548; HSPD1-14P = OTTHUMG00000016753; 15,16,18Tandemly duplicated;17Previously identified as Hsp60s2 (Hsp60 short form 2).
Figure 1Evolutionary trees of CCT proteins. (a) Maximum-likelihood evolutionary tree of all human chaperonin-like proteins, including CCT monomers, MKKS, BBS10, BBS12 and the two members, CCT8L1 and CCT8L2, of the newly defined CCT8L class. Numbers associated with each branch indicate bootstrap support from 100 replicates. Tree rooted by the archaeal thermosome alpha subunit of Sulfolobus solfataricus (Ss_ThsA). (b) Bayesian evolutionary tree of the same sequences shown in (a). The numbers assigned to each branch indicate posterior probabilities. Tree rooted by the thermosome alpha subunit of Thermoplasma acidophilum (Ta_ThsA). The scale bars represent the indicated number of substitutions per position for a unit branch length.
Figure 2Evolutionary tree of CCT8L sequences. ML tree of CCT8L sequences from various mammal genomes. The homolog of human CCT8L1 in chimp (Ptr) is characterized as pseudogene and is shown in bold-italics font. Species abbreviations: Bt, Bos taurus (cow); Cf, Canis lupus familiaris (dog); Dn, Dasypus novemcinctus (nine-banded armadillo); Dr, Danio rerio (zebrafish); Ec, Equus caballus (horse); Ga, Gasterosteus aculeatus (stickleback, fish); Gg, Gallus gallus domesticus (chicken); Hs, Homo sapiens (human); La, Loxodonta africana (african bush elephant); Md, Monodelphis domestica (south american gray short-tailed opossum, marsupial); Mm, Mus musculus (mouse); Mmu, Macaca mulatta (rhesus monkey); Mmur, Microcebus murinus (gray mouse lemur); Oa, Ornithorhynchus anatinus (platypus); Ol, Oryzias latipes (the medaka or japanese killifish); Pp, Pongo pygmaeus (northwest bornean orangutan); Ptr, Pan troglodytes (chimpanzee); Rn, Rattus norvegicus (rat); Tn, Tetraodon nigroviridis (spotted green pufferfish); Tr, Takifugu rubripes (japanese pufferfish); Xl, Xenopus laevis (african clawed frog, amphibian); Xt, Xenopus tropicalis (western clawed frog, amphibian). The scale bar represents the indicated number of substitutions per position for a unit branch length.
Divergence of BBS and CCT8L proteins relative to CCT8 proteins
| MKKS | BBS10 | BBS12 | CCT8L1 | |
|---|---|---|---|---|
| Size ( | 5.7770 | 4.6020 | 5.8949 | 3.3859 |
| B-index ( | 0.6976 | 1.1079 | 1.0284 | 0.3196 |
| Unbiased pair-wise distance ( | 1.3952 | 2.2159 | 2.0568 | 0.6393 |
| 4.0300 | 5.0987 | 6.0623 | 1.0822 | |
| Average | 2.0951 | 2.9660 | 3.2858 | 0.8017 |
| Size ( | 5.5202 | 4.5503 | 4.3647 | 3.1100 |
| B-index ( | 0.1146 | 0.1373 | 0.0992 | 0.0227 |
| Unbiased pair-wise distance ( | 0.2291 | 0.2747 | 0.1983 | 0.0454 |
| 0.6324 | 0.6250 | 0.4328 | 0.0706 | |
| Average | 0.3394 | 0.3687 | 0.2709 | 0.0545 |
| 6.0873 | 8.0692 | 10.3669 | 14.0793 | |
| 6.3725 | 8.1579 | 14.0072 | 15.3286 | |
| 6.1730 | 8.0445 | 12.1292 | 14.7101 | |
| 1.0465 | 1.0114 | 1.3506 | 1.0887 | |
1Only the human CCT8L2 branch was included in the tree. The CCT8L1 branch had equivalent length; 2Chaperonin-BBS sequences used in the trees were from the following species (see the legend for Figure 2 for a complete list of abbreviations and species names). MKKS: Bt, Cf, Dr, Ec, Ga, Gg, Hs, Md, Mm, Mmu, Ol, Rn, Tr, Xt; BBS10: Bt, Cf, Dr, Ec, Ga, Hs, Md, Mm, Ol, Rn, Tr; BBS12: Bt, Cf, Dr, Ec, Ga, Gg, Hs, Mm, Ol, Rn, Tr, Xl; CCT8L: Bt, Cf, Hs, Mm, Rn. 3Size is the average number of sequences contained in a cluster over evolutionary time (see Methods); 4The B-index measures the average substitutions per site (evolutionary distance) of the sequences within a cluster from their common ancestor; 5L is the length of the tree (sum of the lengths of all branches); 6Average Dis the average pair-wise evolutionary distance of the sequences; 7Estimates for CCT8 were computed over corresponding species represented by the sets of MKKS, BBS10, BBS12 or CCT8L proteins (see footnote 2, above).
Ka/Ks substitution ratios in CCT8L genes evolution
| Foreground genes1 | Background genes1 | Foreground Ka/Ks2 | LRT ( | Foreground branches4 |
|---|---|---|---|---|
| All CCT8L1/2 | Human CCT8, Human CCT7 | 0.29 | 205.06 | 1 to 25 |
| Human CCT8L1 | Chimp CCT8L1, Human CCT8L2 | 0.58 | 0.88 | 1 |
| Chimp CCT8L1 | Human CCT8L1, Human CCT8L2 | 1.02 | 0.00 | 2 |
| Human CCT8L2 | Chimp CCT8L2, Human CCT8L1 | 0.48 | 1.2 | 4 |
| Chimp CCT8L2 | Human CCT8L1, Human CCT8L2 | 0.39 | 1.8 | 5 |
| Human CCT8L2 | Human CCT8L1, Rhesus CCT8L | 0.42 | 4.02 | 4+6 |
| Human CCT8L1 | Human CCT8L2, Rhesus CCT8L | 0.29 | 5.72 | 1+3 |
| Mouse and Rat CCT8L | Cow CCT8L, Human CCT8L2 | 0.38 | 31.14 | 12+13+14 |
| Mouse CCT8L | Rat CCT8L, Human CCT8L2 | 0.64 | 1.21 | 12 |
| Rat CCT8L | Mouse CCT8L, Human CCT8L2 | 0.49 | 5.91 | 13 |
| Rhesus CCT8L | Lemur CCT8L, Human CCT8L2 | 0.73 (0.55)5 | 1.91 (1.22)5 | 8 |
| Lemur CCT8L | Human CCT8L2, Mouse CCT8L | 0.29 | 36.82 | 10 |
| Dog CCT8L | Cow CCT8L, Human CCT8L2 | 0.31 | 12.07 | 16 |
| Cow CCT8L | Dog CCT8L, Human CCT8L2 | 0.13 | 113.78 | 17 |
| Armadillo CCT8L | Elephant CCT8L, Human CCT8L2 | 0.36 | 0.57 | 20 |
| Elephant CCT8L | Marsupial CCT8L, Human CCT8L2 | 0.29 | 14.96 | 21 |
| Marsupial CCT8L | Elephant CCT8L, Human CCT8L2 | 0.31 | 62.63 | 23+24 |
1See text for the definition and meaning of Foreground and Background species; 2Ka/Ks is the estimated ratio of non-synonymous and synonymous substitution rates; 3LRT, Likelihood Ratio Test results for estimated Ka/Ks vs. Ka/Ks = 1.0 (see Methods). Probabilities (p) not shown signify p > 0.05; 4Foreground-branch numbers correspond to the numbering in the schematic tree shown in Figure 3. 5Values in parenthesis were obtained after removing an unusually diverged region from rhesus CCT8L (see text).
Figure 3Evolutionary relations of CCT8L genes. Schematic representation of evolutionary relations of CCT8L genes from different eukaryotic species rooted by CCT8 and CCT7 sequences. The numbers associated with each branch identify the branches for which branch-specific Ka/Ks values are evaluated (Table 4).
Figure 4Evolutionary tree of vertebrate CCT6 proteins. ML tree of CCT6 proteins from mammals, chicken, and frog (in roman font) and translated sequences of the related pseudogenes from human, mouse, and rat (in bold-italics font). Only one copy of CCT6 was found in chicken and frog. Two copies, CCT6A and CCT6B, were found in all mammals examined, including marsupial (Md) and platypus (Oa). The CCT6 sequences from chicken (Gg) and from the two amphibians Xenopus laevis (Xl) and Xenopus tropicalis (Xt) were used to root the tree. All human, mouse, and rat pseudogenes clustered with the CCT6A sequences. Numbers next to branches indicate percent bootstrap values. Only bootstrap values > 30% are shown. For all species abbreviations see the legend for Figure 2. The scale bar represents the indicated number of substitutions per position for a unit branch length.
Figure 5Evolutionary tree of vertebrate mitochondrial Cpn60. ML tree of mitochondrial Cpn60 proteins from mammals, chicken, and frog (in roman font) and translated sequences of the related pseudogenes from human, mouse, and rat (in bold-italics font). Highly degraded pseudogenes for which only fragments could be detected were not considered. Human pseudogenes clustered with primate Cpn60 sequences whereas mouse and rat pseudogenes clustered with rodent counterparts, indicating independent evolution of these pseudogenes in these species. For all species abbreviations see legend for Figure 2. The scale bar represents the indicated number of substitutions per position for a unit branch length.
Figure 6Secondary structure predictions of chaperonin proteins. (a) Secondary structure predictions of Thermoplasma acidophilum thermosome alpha subunit ThsA (line Ta_ThsA), human CCTs, mammal CCT8Ls and vertebrate BBSs (lines MKKS, BBS10 and BBS12) compared to the secondary structure description of ThsA (top line 1a6d) determined from its crystal structure (PDB code 1a6d, chain A). Helices are represented as red boxes, beta-strands as yellow boxes and loops as black lines. Secondary structure elements in 1a6d are labeled in succession with numbers (strands) or letters (helices). The first 16 N-terminal residues of ThsA, predicted to contain a strand, are not included in the 1a6d crystal structure (top line). Secondary structure elements in all proteins recognized as homologous to the thermosome chain elements by sequence similarity and positional equivalence are vertically aligned. Blue circles indicate the position of sequence insertions in CCT8L and BBS sequences. (b) The three-dimensional fold of the secondary structure elements in the thermosome structure 1a6d chain A. Red cylinders represent helices and yellow arrows represent strands. Labels (i.e., letters and numbers) correspond to those in panel "a". Elements not predicted in some of the BBS and CCT8L sequences are labeled in gray. The positions of the ATP binding and hydrolysis sites are highlighted in green.
Conservation of monomer-monomer contact residues relative to CCT subunits1
| Protein | MM | CMM | RR | Global2 |
|---|---|---|---|---|
| ThsA | 78 (83.9) | 16 (94.1) | 13 (86.7) | 62.0-66.4 |
| BBS12 | 37 (39.8) | 7 (41.2) | 6 (40.0) | 35.5-38.0 |
| BBS10 | 45 (48.4) | 8 (47.1) | 7 (46.7) | 34.3-35.6 |
| MKKS | 42 (45.2) | 8 (47.1) | 7 (46.7) | 48.8-51.6 |
| CCT8L | 54 (58.1) | 8 (47.1) | 7 (46.7) | 53.4-61.1 |
1Conservation of archaeal ThsA and human BBS and CCT8L sequences relative to human CCT monomers. Sequence-positions are considered conserved if they are occupied by residue-types appearing in the homologous position in any of the human CCT sequences. Ninety-three intra-ring contact positions and 15 inter-ring contact positions were identified from the thermosome structure (1ad6). Contact positions were defined by a distance of their side-chain heavy atoms of at most 4.0Å from any heavy atom of the nearby monomer in the thermosome structure. For each protein family, the table indicates the number and percentage (in parenthesis) of positions conserved among: all 93 intra-ring contact positions (MM); seventeen intra-ring contact-positions conserved among human CCT monomers (CMM); all 15 inter-ring contact positions, none of which were conserved among CCT monomers (RR). 2Global indicates the range of similarities (percent values) of each sequence to human CCT-subunit proteins within all aligned positions.
Figure 7Profile logos of ATP-binding and ATP-hydrolysis sites in chaperonin proteins. Sequence profiles of ATP/ADP-binding and ATP-hydrolysis sites for CCTs, CCT8L and BBS (MMKS, BBS10 and BBS12) proteins from the multiple sequence alignments of sequences obtained from the species listed in the legend for Figure 2. Letters indicate the amino acid types observed at each position. The height of each stack of symbols in each position is proportional to the information content at that position and the height of each letter within the stack is proportional to the frequency of the corresponding residue at that position. Residues involved in direct contacts with base, ribose or phosphate groups, as determined by homology to the known thermosome structures, are indicated.
Conservation of potential substrate-binding residue positions1
| Family | Description | |||
|---|---|---|---|---|
| CCT1 | K | Y | DE | Lys/Tyr/Acidic |
| CCT2 | Q | L | A (GQ)3 | Gln/Leu/Ala |
| CCT3 | H | Y | KR | His/Tyr/Basic |
| CCT4 | H | F | K | His/Phe/Lys |
| CCT5 | H | L | Q | His/Leu/Gln |
| CCT6A | D | A | K | Asp/Ala/Lys |
| CCT6B | DE | AILMSV | K (R) | Acidic/Medium-Small/Lys |
| CCT7 | Q | Y | D (Y) | Gln/Tyr/Asp |
| CCT8 | H | Y | K | His/Tyr/Lys |
| CCT8L | DILPT | HLQR | KNRY | Variable/Variable/Polar-Basic |
| MKKS | Q (H) | FY (H) | DEMQST | Gln/Aromatic/Medium-Small |
| BBS10 | Y (AFQS) | CLY (W) | LMQV | Tyr/Variable/Variable |
| BBS12 | E (KLQ) | KR (HQ) | HNR (ASD) | Glu/Basic/Polar-Basic |
| Fab1_TCP4 | D (EN) | I (LMV) | Q | Asp/Ile/Gln |
1 Conservation evaluated among sequences in vertebrate genomes. 2 Potential substrate binding positions, corresponding to yeast CCT1 positions 308, 309 and 312 (i = 308) [37]. 3Rare substitutions are listed in parenthesis. 4Fab1_TCP domain of vertebrate PIKFYVE orthologs.