| Literature DB >> 27297221 |
Emmanuel Jaspard1,2,3, Gilles Hunault4.
Abstract
BACKGROUND: small Heat Shock Proteins (sHSP) is a wide proteins family. SHSP are found in all kingdoms and they play critical roles in plant stress tolerance mechanisms (as well as in pathogenic microorganisms and are implicated in human diseases).Entities:
Keywords: Amino acids sequence; Physico-chemical properties; Small heat shock proteins
Mesh:
Substances:
Year: 2016 PMID: 27297221 PMCID: PMC4906601 DOI: 10.1186/s12870-016-0820-6
Source DB: PubMed Journal: BMC Plant Biol ISSN: 1471-2229 Impact factor: 4.215
Main taxonomy of the organisms in sHSPdb. Only groups containing more than 10 sequences are indicated
| 129 Archaea | 2355 Bacteria | 2259 Eukaryota | |
|---|---|---|---|
| 21 Crenarchaeota | 198 Actinobacteria | 1180 Opisthokonta | |
| Opisthokonta | Viridiplantae | ||
| 203 Fungi | 12 Funariacae | ||
Fig. 1Browse interface for accessing the whole sHSPdb content. The « Search » mode provides an advanced search with multiple parameters and permits retrieval of very fine subsets of data. The user can also retrieve sequences by entering amino acids motif either exact or degenerated using regular expressions with sophisticated syntax. Among other information, the « Summary » option provides links to general databases, the name of the sHSP and its putative function. The « Details » option provides more information. A series of physicochemical properties are given by selecting the « Physicochemical properties » option. The « AA comp » option displays the amino acid composition of sHSP. Finally, the « Fasta » option provides all sequences in FASTA format
sHSP classification in sHSPdb
| sHSP class | Localization | Examples of representant and/or annotation | Motif allowing to classify sequencesa | Motif position in sequences | Number of sHSP classifiedb | Main |
|---|---|---|---|---|---|---|
| Class 1 | Cytosol-nucleus | TaHsp16.9 | [AGPSTV][ADEGHMNQTV][ILPSVY][KR][AFGNPQST][IV][ADEHKNQ][IFV][ACGNSTY][ADEGV]?(.{1,6})?$ | C-terminal | 362 | Viridiplantae |
| Class 2 | Cytosol-nucleus | 17–17.9 kDa HSP | [ILNV].{0,1}[KNR].{0,1}[PT]P[APQ].{1,2}[AKR].{0,6}[IV].{0,5}$ | C-terminal | 111 | Viridiplantae |
| Class 3 | Endoplasmic reticulum | WH.{8,10}[FS][IW]R[QR][FL].{2,2}P.{3,3}[^K] | Central | 68 | Viridiplantae | |
| Class 4 | [^E]A[RS]AA$ | C-terminal | 46 | Viridiplantae | ||
| Class 5 | 21.7–22.3 kDa | D.{2,3}D.{1,2}PLW | N-terminal | 64 | Viridiplantae | |
| Class 6 | Mitochondria | HSP23 | [LM].{10,10}[KR].{1,2}[DEKQ][DEKNQ][DEGHPQ].{2,2}[DEN].{3,3}[IV].[^H][DENQ]$ | C-terminal | 118 | Viridiplantae |
| Class 7 | Plastids | HSP26 to HSP26.8 | [DEFHNQSTY][AGKPRST][KT][IV][FHILMTV][DEG][IRV][DEHKNQS][ILV][EKNQR]G?$ | C-terminal | 102 | Viridiplantae |
| Class 8 | Peroxysome | [IV].{3,3}[^D][KR]L$ | C-terminal | 52 | Viridiplantae | |
| Class 9 | Ribosome | sHSP15 | P[^P]K.{2,2}R[QR].{7,27}$ | C-terminal | 247 | Bacteria |
| Class 10 | Hsp20 - IbpA/B - HspA/B - HspB6 | P[GKP][FHMY][DN][ILV] | N-terminal | 545 | Bacteria | |
| Class 11 | [FILMV][^P].?[DEQ][FILMPV][ADEKNQS][^S][FILMV][FLVW][ADEGHIMNQS] | N-terminal | 679 | Bacteria | ||
| Class 12 | HspB1 | [AGHNPQRSTY][DHKR][AFILMPV].[DES][DQ].F[AG] | N-terminal | 534 | Metazoa | |
| Class 13 | [LM][LP][SV].{2,2}L.[ADEN].{11,14}F[KQ] | N-terminal | 56 | Metazoa | ||
| Class 14 | PVRY[EKQ] | N-terminal | 84 | Metazoa | ||
| Class 15 | EIKI$ | C-terminal | 26 | Metazoa | ||
| Class 16 | HSP family B | [ADE]TFW | N-terminal | 76 | Metazoa | |
| Class 17 | K[FY][AFG][IL]G | N-terminal | 88 | Bacteria | ||
| Class 18 | Hsp42p | DEEL.{1,16}$ | C-terminal | 16 | Fungi | |
| Class 19 | [PST][AEIV][HLR][PST][LH]W[PT][AEHT] | N-terminal | 24 | Metazoa | ||
| Class 20 | HchA « Holdases » | TG.{4,4}E.{11,11}G | N-terminal | 147 | Bacteria | |
| Class 21 | 33 kDa chaperonin « Holdases » | C.[FY]C.{3,3}Y | C-terminal | 135 | Bacteria |
aSequences classified via the motif of the class. Regular expression syntax for amino acid motifs: [XYZ] means X or Y or Z; X? means X is optional; [^XY] means not X nor Y; $ means C-terminal position;.means any amino acid; X{n,m} means n to m times X. bNumber of sequences retrieved in sHSPdb using the motif indicated
Fig. 2The statistical analysis tool. One original feature of sHSPdb is the possibility to dynamically generate plots of physico-chemical properties or amino acids usage of sHSP using different criteria. Therefore, one can immediately make multiple comparisons of self datasets. The user can access to statistical data through the menu « Statistical analysis ». This option displays « Class description » (examination of all physico-chemical properties or all amino acids usage or all combinations of amino acids for a given sHSP class) or « Class comparison » (comparison of a given physico-chemical property for the sHSP classes). Plots are automatically generated and are clickable to be resized
Some length characteristics of sHSP and « alpha-crystalin-unrelated sHSPs »
| sHSP class | Sequences length range (min - max) | Motif positiona | Mean limit values of domains length | ||
|---|---|---|---|---|---|
| N-terminal | ACDb | C-terminal | |||
| Class 1 | 130–282 | 132–273 | 1–50 | 51–141 | 142–157 |
| Class 2 | 129–175 | 131–151 | 1–49 | 50–139 | 140–156 |
| Class 3 | 165–328 | 90–255 | 1–72 | 73–164 | 165–198 |
| Class 4 | 119–211 | 115–207 | 1–42 | 43–123 | 124–148 |
| Class 5 | 172–254 | 41–118 | 1–86 | 87–182 | 183–194 |
| Class 6 | 101–279 | 185–212 | 1–120 | 121–202 | 203–218 |
| Class 7 | 157–266 | 215–229 | 1–129 | 130–222 | 223–235 |
| Class 8 | 115–183 | 109–177 | 1–28 | 29–127 | 128–144 |
| Class 9 | 100–175 | 98–127 |
|
|
|
| Class 10 | 120–498 | 8–200 | 1–40 | 41–137 | 138–160 |
| Class 11 | 111–356 | 6–109 | 1–51 | 52–139 | 140–156 |
| Class 12 | 108–374 | 14–214 | 1–59 | 60–151 | 152–176 |
| Class 13 | 146–277 | 26–152 | 1–53 | 54–142 | 143–167 |
| Class 14 | 141–187 | 13–54 | 1–63 | 64–145 | 146–152 |
| Class 15 | 108–262 | 105–259 | 1–79 | 80–160 | 161–178 |
| Class 16 | 114–230 | 34–80 |
|
|
|
| Class 17 | 102–131 | 5–6 |
|
|
|
| Class 18 | 332–453 | 317–440 | 1–256 | 257–349 | 350–384 |
| Class 19 | 191–220 | 6–26 | 1–81 | 82–165 | 166–211 |
| Class 20 | 133–298 | 54–163 | 1–42 | 43–133 | 134–156 |
| Class 21 | 161–344 | 136–320 |
|
|
|
aRange of starting position of the motif among all sequences of the class. bAlpha Crystallin Domain. cNot applicable since these HSP do not contain ACD and are defined as « alpha-crystallin-unrelated sHSPs »
Fig. 3Comparison of the mean value of the size of the N-terminal domain, ACD and the C-terminal domain. Only HSP classes containing an ACD are represented (see Table 3). The upper scale indicates the size of the sequences in amino acids. ER: endoplasmic reticulum
Fig. 4Phylogenetic tree of plant mitochondrial sHSP. 91 sequences of mitochondrial sHSP from 47 plant organisms were analyzed. The characteristics of the sequences are: a length from 187 to 248 amino acids; an isoelectric point from 4.620 to 10.15; a net charge at pH 7 from − 0.041 to +0.031; a FoldIndex from − 0.044 to 0.138; a hydropathy (GRAVY) from − 0.908 to − 0.274. The alignment was made using Clustal-W and the figure was drawn using Dendroscope [36]
Fig. 5Alignment of three mitochondrial sHSP from Arabidopsis thaliana. The alignment was made using Multalin [37] and the figure was drawn using ESPript [38]