| Literature DB >> 16780600 |
Bin Chen1, Daibin Zhong, Antónia Monteiro.
Abstract
BACKGROUND: HSP90 proteins are essential molecular chaperones involved in signal transduction, cell cycle control, stress management, and folding, degradation, and transport of proteins. HSP90 proteins have been found in a variety of organisms suggesting that they are ancient and conserved. In this study we investigate the nuclear genomes of 32 species across all kingdoms of organisms, and all sequences available in GenBank, and address the diversity, evolution, gene structure, conservation and nomenclature of the HSP90 family of genes across all organisms.Entities:
Mesh:
Substances:
Year: 2006 PMID: 16780600 PMCID: PMC1525184 DOI: 10.1186/1471-2164-7-156
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Summary of the HSP90 family of genes present on the genomes of 32 species, showing the number of functional and total (total in parenthesis) genes, exon number on genomes corresponding to in coding regions and mRNAs, and length and molecular mass of mature and precursor (in parenthesis) polypeptide in each subfamily member. Only the functional and full-length sequences are included in the calculation of exon number, length and molecular mass of polypeptide. The details are in Table 2.
| 1 (1) | |||||||||
| 1 (1) | 1 (1) | ||||||||
| 2 (2) | 2 (2) | ||||||||
| 1 (1) | 1 (1) | ||||||||
| 1 (1) | 1 (1) | ||||||||
| 1 (1) | 1 (1) | ||||||||
| 1 (1) | 1 (1) | ||||||||
| 1 (1) | 1 (1) | ||||||||
| 1 (1) | 1 (1) | ||||||||
| 2 (2) | 2 (2) | ||||||||
| 1 (1) | 1 (1) | ||||||||
| 2 (2) | 1 (1) | 1 (1) | |||||||
| 4 (4) | 1 (1) | 3 (3) | |||||||
| 2 (2) | 1 (1) | 1 (1) | |||||||
| 4 (4) | 1 (1) | 3 (3) | |||||||
| 7 (7) | 2 (2) | 1 (1) | 4 (4) | ||||||
| 3 (3) | 1 (1) | 1 (1) | 1 (1) | ||||||
| 7 (8) | 2 (2) | 1 (1) | 4 (5) | ||||||
| 2 (2) | 1 (1) | ||||||||
| 1 (1) | 1 (1) | 1 (1) | |||||||
| 1 (1) | 1 (1) | ||||||||
| 2 (2) | 2 (2) | ||||||||
| 4 (5) | 1 (1) | 1 (1) | 2 (3) | ||||||
| 4 (4) | 1 (1) | 1 (1) | 2 (2) | ||||||
| 3 (3) | 1 (1) | 1 (1) | 1 (1) | ||||||
| 3 (4) | 1 (1) | 1 (2) | 1 (1) | ||||||
| 6 (8) | 1 (1) | 2 (2) | 2 (4) | 1 (1) | |||||
| 3 (3) | 1 (1) | 1 (1) | 1 (1) | ||||||
| 3 (3) | 1 (1) | 1 (1) | 1 (1) | ||||||
| 4 (4) | 1 (1) | 1 (1) | 1 (1) | 1 (1) | |||||
| 5 (16) | 1 (1) | 1 (3) | 2 (6) | 1 (6) | |||||
| 4 (4) | 1 (1) | 1 (1) | 2 (2) | ||||||
| 1 (1) – 1(1) | 3 (3) – 19 (19) | 10 (?) – 21 (21) | 1 (?) – 18 (18) | 1 (1) – 11 (12) | 8 (?) – 12 (12) | 7 (7) – 11 (12) | |||
| 588 (588) – 681 (681) | 644 (688) – 687 (719) | 756 (810) – 785 (811) | 695 (711) – 800 (823) | 689 (689) – 745 (745) | 728 (728) – 854 (854) | 724 (724) – 737 (737) | |||
| 66.7 (66.7) – 78.0 (78.0) | 74.8 (78.1) – 77.9 (81.5) | 84.2 (89.3) – 89.0 (91.5) | 79.5 (81.2) – 91.5 (94.2) | 78.3 (78.3) – 86.2 (86.2) | 84.1 (84.1) – 98.1 (98.1) | 83.3 (83.3) – 84.8 (84.8) | |||
HSP90 family of genes present on the genomes of 32 species, and their chromosomal location, intron/exon structure, molecular mass, signal peptide and prediction of genes that are functional.
| Gene namea | Accession numberb | Chrom. locationc | Exon nod | a.a.lengthe | Mass (kDa)f | Functiong | |
| mRNA | a.a. | ||||||
| (MMA)NP_634445.1 | NC_003901.1 | NP_634445.1 | N/A+ | 1 | 982 | 114.6 | F |
| (SCO)HTPG1 | NC_003888.3 | NP_631561.1 | N/A+ | 1 | 638 | 71.6 | F |
| (SCO)HTPG2 | NC_003888.3 | NP_625688.1 | N/A- | 1 | 615 | 66.7 | F |
| (BFR)HTPG1 | NC_006347.1 | YP_099692.1 | N/A- | 1 | 681 | 78.0 | F |
| (BFR)HTPG2 | NC_006347.1 | YP_097520.1 | N/A+ | 1 | 588 | 67.3 | F |
| (GVI)HTPG1 | NC_005125.1 | NP_924760.1 | N/A+ | 1 | 614 | 70.3 | F |
| (BSU)HTPG1 | NC_000964.2 | NP_391861.1 | N/A- | 1 | 626 | 72.3 | F |
| (WEN)HTPG1 | NC_002978.6 | NP_966987.1 | N/A+ | 1 | 635 | 72.7 | F |
| (BPE)HTPG1 | NC_002929.2 | NP_878979.1 | N/A+ | 1 | 635 | 71.2 | F |
| (GSU)HTPG1 | NC_002939.4 | NP_953436.1 | N/A+ | 1 | 650 | 73.7 | F |
| (HPY)HTPG1 | NC_000921.1 | NP_222917.1 | N/A+ | 1 | 621 | 71.1 | F |
| (ECO)HTPG1 | NC_000913.2 | NP_415006.1 | N/A+ | 1 | 624 | 71.4 | F |
| (BBU)HTPG1 | NC_001318.1 | NP_212694.1 | N/A- | 1 | 650 | 75.4 | F |
| (SCE)HSP90A1 | K01387.1 | AAA02743.1 | 16- | 1 (1) | 709 | 81.4 | F |
| (SCE)HSP90A2 | M26044.1 | AAA02813.1 | 13+ | 1 (1) | 705 | 80.9 | F |
| (CNE)HSP90A1 | XM_568451.1 | XP_568451.1 | 13- | 6 | 700 | 79.3 | F |
| (CNE)HSP90B1 | XM_571124.1 | XP_571124.1 | 5+ | 11 | 758 (780, H) | 85.5 (87.8) | F |
| (ECU)HSP90A1 | NC_003229.1 | NP_584635.1 | 2+ | 1 | 690 | 79.0 | F |
| (NCR)HSP90A1 | AABX01000271.1 | EAA32062.1 | 5+ | 2 | 705 | 80.2 | F |
| (PFA)HSP90A1 | NC_004328.1 | NP_704028.1 | 7+ | 2 | 745 | 86.2 | F |
| (PFA)HSP90B1 | NC_004316.1 | NP_701576.1 | 12- | 1 | 793 (821, H) | 91.6 (95.0) | F |
| (DDI)HSP90A1 | AAFI01000009.1 | EAL73152.1 | 1- | 3 | 700 | 79.9 | F |
| (DDI)HSP90B1 | AAFI01000066.1 | EAL67255.1 | 3- | 2 | 739 (767, H) | 84.1 (87.1) | F |
| (EHI)HSP90A1 | AAFB01000333.1 | EAL47778.1 | (Sca 00092+) | 1 | 718 | 83.0 | F |
| (EHI)HSP90A2 | AAFB01000334.1 | EAL47746.1 | (Sca 00092+) | 1 | 718 | 83.0 | F |
| (EHI)HSP90A3 | AAFB01000781.1 | EAL44230.1 | (Sca 00284+) | 2 | 702 | 81.1 | F |
| (EHI)HSP90B1 | AAFB01000726.1 | EAL44584.1 | (Sca 00256-) | 1 | 695 (711, H) | 79.5 (81.2) | F |
| (TCR)HSP90A1 | AAHK01000382.1 | EAN93041.1 | N/A | 1 | 704 | 80.7 | F |
| (TCR)HSP90A2 | AAHK01000690.1 | EAN89940.1 | N/A | 1 | 704 | 80.7 | F |
| (TCR)HSP90A3 | AAHK01002401.1 | EAN82629.1 | N/A | 1 | 704 | 80.7 | F |
| (TCR)HSP90B1 | AAHK01000131.1 | EAN96800.1 | N/A | 1 | 740 (762, H) | 84.7 (87.0) | F |
| (ATH)HSP90A1 | NM_124642.2 | NP_200076.1 | 5+ | 4 (4) | 705 | 81.2 | F |
| (ATH)HSP90A2 | NM_124985.3 | NP_200414.1 | 5- | 3 (3) | 699 | 80.1 | F |
| (ATH)HSP90A3 | NM_124983.3 | NP_200412.1 | 5+ | 3 (3) | 699 | 80.0 | F |
| (ATH)HSP90A4 | NM_124982.2 | NP_200411.1 | 5+ | 3 (3) | 699 | 80.1 | F |
| (ATH)HSP90B1 | NM_118552.2 | NP_194150.1 | 4- | 15 (15) | 800 (823, H) | 91.5 (94.2) | F |
| (ATH)HSP90C1 | NM_126439.2 | NP_178487.1 | 2+ | 19 (19) | 756 (780, H) | 86.2 (88.7) | F |
| (ATH)HSP90C2 | NM_111656.2 | NP_187434.1 | 3+ | 21 (21) | 778 (803, H) | 88.2 (91.0) | F |
| (OSA)HSP90A1P | XM_483191.1 | XP_483191.1 | 8- | 3 | 699 | 80.2 | F |
| (OSA)HSP90A2P | XM_470993.1 | XP_470993.1 | 4+ | 3 | 703 | 80.2 | F |
| (OSA)HSP90A3P | AP005392.3 | BAD33406.1 | 9- | 1 | 699 | 80.2 | F |
| (OSA)HSP90A4P | AP005392.3 | BAD33409.1 | 9- | 1 | 699 | 80.2 | F |
| (OSA)HSP90A5PNP | (OSA)HSP90A5NN* | (OSA)HSP90A5PN* | 9- | 1 | 437* | P | |
| (OSA)HSP90B1 | AB037681.1 | BAA90487.1 | 6- | 15 (15) | 791 (810, H) | 90.6 (92.8) | F |
| (OSA)HSP90C1 | XM_483065.1 | XP_483065.1 | 8- | 19 (19) | 763 (785, H) | 86.3 (88.3) | F |
| (OSA)HSP90C2 | AK073817.1 | (OSA)HSP90C2PN | 9- | 14 (17) | 785 (811, H) | 89.0 (91.5) | F |
| (CRE)HSP90A1 | Schroda (2004) | (Sca 2397) | 8 | 705 | 80.7 | F | |
| (CRE)HSP90B1 | Schroda (2004)* | (Sca 2084, 1758) | ≥11 | 736* | F | ||
| (CRE)HSP90C1 | AY705371.1 | AAU10511.1 | (Sca 6) | 10 | 756 (810, H) | 84.2 (89.3) | F |
| (DME)HSP90A1 | NM_079175.2 | NP_523899.1 | 3L+ 63B11 | 1 (2) | 717 | 81.9 | F |
| (DME)HSP90B1 | NM_143344.2 | NP_651601.1 | 3R+ 98B6 | 5 (5) | 767 (787, H) | 88.1 (90.2) | F |
| (DME)TRAP1 | NM_058091.3 | NP_477439.2 | 2R- 42C1 | 3 (3) | 673 (691, N) | 74.7 (78.0) | F |
| (DPS)HSP90A1P | NM_079175.2 | NP_523899.1 | XR- | 1 | 717 | 81.8 | F |
| (DPS)HSP90B1P | CM000070.1 | EAL27390.1 | 2- | 5 | 772 (792, H) | 88.4 (90.5) | F |
| (DPS)TRAP1P | CM000071.1 | EAL26542.1 | 3- | 3 | 644 (688, N) | 74.8 (78.1) | F |
| (AGA)HSP90A1P | XM_308800.2 | XP_308800.2 | 2L- | 2 | 689 | 78.3 | F |
| (AGA)HSP90A2P | XM_308799.1 | XP_308799.1 | 2L+ | 3 | 720 | 82.1 | F |
| (AGA)HSP90A3PNP | (AGA)HSP90A3N N* | (AGA)HSP90A3PN* | 2L+ | 2 | 380* | P | |
| (AGA)HSP90B1NP | (AGA)HSP90B1N N | (AGA)HSP90B1PN | 2R- | 2 | 778 (800, H) | 88.9 (91.3) | F |
| (AGA)TRAP1NP | (AGA)TRAP1N N | (AGA)TRAP1PN | 3L+ | 3 | 677 (713, H) | 76.8 (80.6) | F |
| (AME)HSP90A1P | XM_392456.2 | XP_392456.2 | (Group 7-) | 3 (3) | 700 | 80.7 | F |
| (AME)HSP90A2P | XM_395168.2 | XP_395168.2 | (Group 1-) | 3 (3) | 699 | 80.5 | F |
| (AME)HSP90B1P | XM_395614.2 | XP_395614.2 | (Group 8-) | 5 (5) | 766 (786, H) | 88.5 (90.8) | F |
| (AME)TRAP1P | XM_623363.1 | XP_623366.1 | (Group 6-) | 13 (13) | 668 (693, N) | 76.7 (79.6) | F |
| (CEL)HSP90A1 | NM_074225.2 | NP_506626.1 | 5+ | 4 (5) | 702 | 80.2 | F |
| (CEL)HSP90B1 | NM_069679.2 | NP_502080.1 | 4+ | 5 (5) | 737 (760, H) | 84.6 (87.1) | F |
| (CEL)TRAP1 | NM_171188.2* | NP_741219.1* | 3- | 9 (9) | 464* | F | |
| (SPU)HSP90A1 | XM_777937.1* | XP_783030.1* | N/A- | ≥ 8 (9) | 510* | F | |
| (SPU)HSP90A2 | XM_782798.1* | XP_787891.1* | N/A+ | ≥ 5 | 319* | F | |
| (SPU)HSP90B1 | NM_214643.1 | NP_999808.1 | N/A+ | 17 | 784 (806, H) | 90.0 (92.3) | F |
| (SPU)TRAP1 | XM_778412.1 | XP_783505.1 | N/A+ | 15 | 665 (691, N) | 75.5 (78.6) | F |
| Chen et al.A (2005) | |||||||
| (GGA)HSP90AA1 | X07265.1 | CAA30251.1 | 5- | 8 | 728 | 84.1 | F |
| (GGA)HSP90AB1 | XM_444655.1 | XP_444655.1 | 3- | 11 | 725 | 83.5 | F |
| (GGA)HSP90B1 | NW_060209.1 | NP_989620.1 | 1- | 18 | 774 (795, H) | 89.5 (91.6) | F |
| (GGA)TRAP1 | NW_060530.1 | XP_414963.1 | 14- | 18 | 663 (699, N) | 75.5 (79.5) | F |
| (DRE)HSP90AA1 | AF068773.1 | AAC21567.1 | 20- | 10 (10) | 726 | 83.6 | F |
| (DRE)HSP90AA2NP | (DRE)HSP90AA2NN | (DRE)HSP90AA2PN | 20- | 10 | 734 | 84.7 | F |
| (DRE)HSP90AA3PNP | (DRE)HSP90AA3NN* | (DRE)HSP90AA3PN* | 7+ | 12 | 838* | P | |
| (DRE)HSP90AA4PNP | (DRE)HSP90AA4NN* | (DRE)HSP90AA4PN* | 7+ | 3 | 417* | P | |
| (DRE)HSP90AB1 | AF042108.1 | AAB96969.1 | 7- | 7 (7) | 725 | 83.4 | F |
| (DRE)HSP90B1 | BC063951.1 | AAH63951.1 | 4+ | 18 (18) | 772 (793, H) | 88.9 (91.3) | F |
| (DRE)HSP90B2NP | (DRE)HSP90B2NN | (DRE)HSP90B2PN | 4+ | 18 | 773 (794, H) | 89.1 (91.4) | F |
| (DRE)TRAP1NP | (DRE)TRAP1N* | (DRE)TRAP1N* | 23+ | ≥8 | 414* | F | |
| (CIN)HSP90A1 | AK115284.1 | AK115284.1PN | (Sca 118+) | 11 (12) | 726 | 83.3 | F |
| (CIN)HSP90B1NP | (CIN)HSP90B1NN* | (CIN)HSP90B1PN* | (Sca 963-) | ≥15 | 732* | F | |
| (CIN)HSP90B2PNP | (CIN)HSP90B2NN* | (CIN)HSP90B2PN* | (Sca 462-) | ≥2 | 119* | P | |
| (CIN)TRAP1NP | (CIN)TRAP1N | (CIN)TRAP1N | (Sca 143+) | 13 | 687 (719, N) | 77.9 (81.5) | F |
a P predicted by genome annotation; NP predicted in this study. The abbreviations of species names in parenthesis are in Table 3. The genes with their nomenclatures ended with "P" are putative pseudogenes.
bN new sequences identified/deduced in this study and available at Additional File 1. The sequences marked with "*" are not full-length.
c Chromosomal location. + forward strand; – reverse strand. The numbers/letters are the chromosome numbers/letters, and N/A indicates unavailability of the chromosome number and Sca indicates Scaffold.
d Total number of exons in coding regions. The numbers in parenthesis include exons in untranslated regions.
e length of mature polypeptide with that of precursor and prediction method (Hidden Markov models, H; Neural networks, N) in parenthesis. The sequences marked "*" are incomplete, or from putative pseudogenes, and their mature polypeptides are not predicted.
f molecular mass of mature polypeptide with that of precursor in parenthesis.
g F – functional genes predicted by transcript and domain identification. P – putative pseudogenes.
The name abbreviation of 146 species containing HSP90 family members that were analyzed in the phylogenetic study.
Figure 1Exon splicing patterns of representative genes of the HSP90 family, identified with the genomic study and drawn to scale relative to the number of nucleotides present in each region. The black-filled rectangles depict protein-coding sequences, gray-filled rectangles represent untranslated regions, and unfilled rectangles represent introns.
Figure 2Alignment of representative amino acid sequences of the HSP90 family of proteins, showing their conserved/variable regions, functionally important residues, and functional domains. Amino acid residues completely conserved throughout the HSP90 family are shaded in red, and those conserved throughout each subfamily are in green. The regions that distinguish HTPG Groups A, B and C are shaded in yellow. Gaps are marked with"-", the last residue in each line is assigned a number and subfamily names are indicated at the end of each line. The conserved/variable regions are separated by "||" with the names above the alignment, and the cleavage sites located just before the underlined residues. "▼" stands for the functionally important residues experimentally identified: E47 (refers to HSP90AA1) for ATP hydrolysis; D93 for ATP binding [33]; G95, G132, G135, G137, and G183 for both GA and p23 binding; K112 for GA binding [72]; R400 and Q404 for ATPase activity; F369 for interdomain interaction [63]; S231 and S263 for phosphorylation by casein kinase II [74]. The conserved and functional domains are indicated by: "≈" for HSP90 protein family signature; "=" for HSP90 protein; "~" for histidine kinase-like ATPases; "∞" for four-helical cytokine; and "..." for Glutamic acid-rich region. The number of each sequence represents: 1. (HSA)HSP90AA1, 2. (GGA)HSP90AA1, 3. (HSA)HSP90AB1, 4. (GGA)HSP90AB1, 5. (HSA)HSP90B1, 6. (ATH)HSP90B1, 7. (ATH)HSP90C1, 8. (ATH)HSP90C2, 9. (HSA)TRAP1, 10. (DME)TRAP1, 11. (ECO)HTPG1, 12. (BFR)HTPG1.
Signature sequences detected in each subfamily or in particular subfamily combinations. The numbers in parenthesis following the signature sequence indicate the residue numbers in the corresponding reference sequence. HSP90B has no unique signature sequence but the combination of these conserved motifs can distinguish it from other family members. The alignment is in Figure 1.
| Subfamily | Signature sequence | Reference |
| HSP90AA | LIP (80–82), GDKKKK (272–277), TSASG (467–471), VSLKDYCT (475–482), QEEKK (561–565), EDD (701–703), MPP (717–719) | (HSA)HSP90AA1 |
| HSP90AB | LKID (residues 71–74) | (HSA)HSP90AB1 |
| HSP90A | SLIINT (31–36), NNLGTIA (105–111), SMIGQFGVGFYS (129–140), AGG (166–168), RGT (182–184), KHFSVEGQLEF (327–337), VKK (417–419), HED (450–452), TGE (495–497), IDEY (525–528), QALRD (617–621) | (HSA)HSP90AA1 |
| HSP90B | FLREL (residues 100–104), IGQFGVGFYS (191–201), LPLNVSRE (442–449) | (HSA)HSP90B1 |
| HSP90C | YQAEV (81–85), EVFLREL (101–107), LIGQFGVGFYSAFLV (193–207), DPLA (361–364), LYVKRVFISD (403–412), PRYL (420–423), DLPLNVSREILQESRIVR (434–451), APF (548–560) | (ATH)HSP90C1 |
| TRAP | LLDIVA (97–102), NLGTIARSGS (171–180), IIGQFGVGFY (197–206), PLNLSRE (397–403), SPY (511–513) | (HSA)TRAP1 |
| HTPG | No signature sequence | |
| HSP90A+B | VRRV (365–368), GMT (97–99) | (HSA)HSP90AA1 |
| HSP90B+C | LPLNVSRE (442–449) | (HSA)HSP90B1 |
| HSP90A+B+C | FLREL (44–48), VDS (389–391), LPLN (394–397) | (HSA)HSP90AA1 |
| HSP90A+B+C+TRAP+Group A | IGQFGVGFY (131–139), PLN (395–397) | (HSA)HSP90AA1 |
| HSP90A+B+C+TRAP+HTPG | Y (38), REL (46–48), N (51), D (54), G (95), G (97), GQFG (132–135), G (183), P (379), SRE (399–401) | (HSA)HSP90AA1 |
Figure 4Most parsimonious tree inferred from HSP90A+B+C proteins, rooted with bacteria HTPG proteins. Bootstrap percentages of 5000 replicates are shown above the branches where they exceed 50%. Branch lengths are proportional to number of character changes.
Figure 3Most parsimonious tree inferred from HTPG and TRAP proteins with representative HSP90A, HSP90B and HSP90C members, rooted with HSP90-like proteins existing in Archaea and Bacteria. Bootstrap percentages of 5000 replicates are shown above the branches where they exceed 50%. Branch lengths are proportional to the number of character changes.
Figure 5Schematic diagram representing the evolution of proeukaryotic (A) and eukaryotic (B) HSP90 family of genes. The filled bars indicate gene duplication events. A) We propose that HSP90-like genes in Bacteria and Archaea are basal to HTPG genes that only exist in Bacteria. HTPG genes underwent two gene duplication events that gave rise to three groups (Group A, B and C). B) Eukaryotic members were derived from HTPG Group A, and evolved into four subfamilies TRAP (mitochondrial), HSP90A (cytosolic), HSP90B (ER) and HSP90C (chloroplast) throughout three gene duplication events. One additional gene duplication event led to the division of HSP90A into HSP90AA and HSP90AB in vertebrates. The organismal groups in parenthesis indicate where these HSP90 gene copies are found.