| Literature DB >> 18318901 |
Michaela Hundertmark1, Dirk K Hincha.
Abstract
BACKGROUND: LEA (late embryogenesis abundant) proteins have first been described about 25 years ago as accumulating late in plant seed development. They were later found in vegetative plant tissues following environmental stress and also in desiccation tolerant bacteria and invertebrates. Although they are widely assumed to play crucial roles in cellular dehydration tolerance, their physiological and biochemical functions are largely unknown.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18318901 PMCID: PMC2292704 DOI: 10.1186/1471-2164-9-118
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Characteristics of genes encoding LEA proteins in Arabidopsis thaliana
| At1g01470 | LEA14 | LEA_2 | LEA_2 | 0.056 | other | Everywhere | |
| At1g02820 | LEA3 family protein | LEA_3 | LEA_3 | -0.491 | Chloroplast | Stress | |
| At1g03120 | seed maturation family protein * | SMP | SMP | -0.564 | other | Seed | |
| At1g20440 | Dehydrin, COR47 | dehydrin | dehydrin | -1.257 | other | Non-seed + stress | |
| At1g20450 | dehydrin ERD10, LTI45 | dehydrin | dehydrin | -1.34 | other | Non-seed + stress | |
| At1g32560 | group 1 domain-containing protein | LEA_1 | LEA_1 | -1.042 | other | Seed | |
| At1g52690 | similar to LEA protein from | LEA_4 | LEA_4 | -1.317 | other | Bud, seed + stress | |
| At1g54410 | dehydrin family protein | dehydrin | dehydrin | -1.868 | other | Non-seed + stress | |
| At1g72100 | LEA domain-containing protein | LEA_4 | LEA_4 | -0.46 | secreted | Seed | |
| At1g76180 | dehydrin ERD14 | dehydrin | dehydrin | -1.265 | other | Non-seed | |
| At2g03740 | LEA domain-containing protein | LEA_4 | LEA_4 | -0.703 | Chloroplast | Bud | |
| At2g03850 | LEA domain-containing protein | LEA_4 | LEA_4 | -0.496 | Chloroplast | Bud | |
| At2g18340 | LEA domain-containing protein | LEA_4 | LEA_4 | -0.93 | secreted | Seed | |
| At2g21490 | dehydrin family protein | dehydrin | dehydrin | -1.032 | other | Seed | |
| At2g23110 | similar to LEA proteins | - | PvLEA18 | -1.059 | other | Seed | |
| At2g23120 | unknown protein | - | PvLEA18 | -1.001 | other | Everywhere | |
| At2g33690 | similar to PvLEA18 | - | PvLEA18 | -1.311 | other | Bud | |
| At2g35300 | LEA_1 domain containing protein | LEA_1 | LEA_1 | -1.156 | other | Salt | |
| At2g36640 | LEA protein AtECP63 | LEA_4 | LEA_4 | -1.023 | other | Seed | |
| At2g40170 | Em-like protein GEA6/EM6 | LEA_5 | LEA_5 | -1.407 | other | Everywhere | |
| At2g41260 | Late embryogenesis abundant protein M17 | - | AtM | -0.704 | secreted | Seed | |
| At2g41280 | Late embryogenesis abundant protein M10 | - | AtM | -0.011 | secreted | Seed | |
| At2g42530 | cold-regulated protein COR15b | - | LEA_4 | -0.542 | Chloroplast | Non-seed + stress | |
| At2g42540 | cold-regulated protein COR15a | - | LEA_4 | -0.554 | Chloroplast | Non-seed + stress | |
| At2g42560 | LEA domain-containing protein | LEA_4 | LEA_4 | -0.978 | other | Seed + salt | |
| At2g44060 | LEA domain-containing protein | LEA_2 | LEA_2 | -0.314 | other | Non-seed + stress | |
| At2g46140 | LEA domain-containing protein | LEA_2 | LEA_2 | 0.123 | other | Seed + root | |
| At3g02480 | ABA-responsive protein-related | LEA_4 | LEA_4 | -1.213 | other | Reproductive, seed + salt | |
| At3g15670 | LEA domain-containing protein | LEA_4 | LEA_4 | -1.369 | other | Seed | |
| At3g17520 | LEA domain-containing protein | LEA_4 | LEA_4 | -1.047 | secreted | Seed | |
| At3g22490 | RAB28 | SMP | SMP | -0.193 | other | Seed | |
| At3g22500 | Seed maturation protein AtECP31 | SMP | SMP | -0.341 | other | salt | |
| At3g50970 | dehydrin Xero2/LTI30 | dehydrin | dehydrin | -1.173 | other | Everywhere | |
| At3g50980 | dehydrin Xero1 | dehydrin | dehydrin | -1.053 | other | Seed | |
| At3g51810 | putative embryonic abundant protein AtEM1 | LEA_5 | LEA_5 | -1.468 | other | Seed | |
| At3g53040 | LEA domain-containing protein | LEA_4 | LEA_4 | -1.194 | other | Seed | |
| At3g53770 | LEA protein-related | LEA_3 | LEA_3 | -0.79 | Mitochondrion | Seed | |
| At4g02380 | LEA_3 family protein SAG21 | LEA_3 | LEA_3 | -0.36 | Chloroplast | Everywhere | |
| At4g13230 | LEA domain-containing protein | LEA_4 | LEA_4 | -0.831 | Mitochondrion | Bud | |
| At4g13560 | LEA domain-containing protein | LEA_4 | LEA_4 | -1.181 | other | Reproductive | |
| At4g15910 | drought-responsive protein AtDI21 | LEA_3 | LEA_3 | -0.526 | Chloroplast | Everywhere | |
| At4g21020 | LEA domain-containing protein | LEA_4 | LEA_4 | -1.291 | Mitochondrion | Seed | |
| At4g36600 | LEA domain-containing protein | LEA_4 | LEA_4 | -1.072 | Mitochondrion | Seed | |
| At4g38410 | putative dehydrin | dehydrin | dehydrin | -1.629 | other | Root | |
| At4g39130 | dehydrin family protein | dehydrin | dehydrin | -0.774 | other | Seed + bud | |
| At5g06760 | LEA group 1 domain-containing protein | LEA_1 | LEA_1 | -0.815 | other | Seed + salt | |
| At5g27980 | seed maturation family protein | SMP | SMP | -0.373 | other | Bud | |
| At5g44310 | LEA domain-containing protein | LEA_4 | LEA_4 | -1.409 | Chloroplast | Seed | |
| At5g53260 | seed maturation family protein | SMP | SMP | -0.273 | Chloroplast | Seed | |
| At5g53270 | seed maturation family protein | SMP | SMP | -0.103 | other | Seed | |
| At5g66400 | dehydrin RAB18 | dehydrin | dehydrin | -1.182 | other | Seed + stress |
The annotation and description in the NCBI database, and the protein family domains according to the Pfam database. The numbers in the first column are used throughout the paper as a shortcut to unambigously identify the different genes and proteins. GRAVY (grand average of hydropathy) quantitates the hydrophilicity of the proteins based on amino acid composition. Subcellular localization was predicted from protein sequence analysis using the targetP algorithm. The expression information is based on the quantitative RT-PCR experiments reported in Figure 4, Table 4 and 5 and Additional file 3.
The nomenclature of the different LEA protein groups in the Pfam database and according to Bray [4] and Dure [6].
| dehydrin | group 2 | D-11 |
| LEA_1 | group 4 | D-113 |
| LEA_2 | LEA14; D-95 | |
| LEA_3 | LEA5; D-73 | |
| LEA_4 | group 3; group 5 | D-7; D-29 |
| LEA_5 | group 1 | D-19 |
| SMP | group 6 | D-34 |
Figure 1Unrooted dendrogram of all Arabidopsis . Sequence alignments were performed unsing the ClustalW algorithm and an unrooted dendrogram was drawn subsequently. The different LEA groups are indicated by different colors, COR15A and COR15B are highlighted in the LEA_4 group.
Figure 2Comparison of the sizes of the different LEA gene groups in Arabidopsis, rice (Oryza), poplar (Populus), grapevine (Vitis) and Chlamydomonas.
Group-specific values for the different calculated traits
| AtM | 2 | -0.704 | -0.011 | -0.358 | 11432 | 29559 | 20496 | secreted | |
| dehydrin | 10 | -1.868 | -0.774 | -1.220 | 10796 | 29928 | 18881 | other | [KR]-[1]-K-[DE]-K-[1]-P-G |
| S(5)-[DE]-x-[DE]-[GV]-x(1,4)-[GE]-x(0,1)-[KR](4) | |||||||||
| LEA_1 | 3 | -0.815 | -1.156 | -1.042 | 10481 | 16179 | 13850 | other | |
| LEA_2 | 3 | -0.314 | 0.123 | -0.045 | 16563 | 36036 | 17846 | other | G-L-x(2)-[2]-[AILV]-x-[IV]-x-[GV]-x(2)-[PT]-x-[PS]-[ILV]-[NPST]-x(2)-[GI] |
| LEA_3 | 4 | -0.790 | -0.360 | -0.509 | 9298 | 14418 | 10959 | chloroplast and mitochondrion | W-x(2)-D-P-x-T-G-x-[WY]-x-P-x-[DGNST] |
| LEA_4 | 18 | -1.409 | -0.460 | -1.035 | 7145 | 67195 | 26804 | in all cellular compartments | - |
| LEA_5 | 2 | -1.468 | -1.407 | -1.438 | 9934 | 16612 | 13273 | other | G-[EQ]-T-V-V-P-G-G-T |
| PvLEA18 | 3 | -1.311 | -1.001 | -1.059 | 7515 | 9713 | 8482 | other | E-D-Y-K-x(2)-[AG]-Y-G-[AT]-[EQRS]-G-H |
| SMP | 6 | -0.564 | -0.103 | -0.307 | 16661 | 26826 | 19229 | mostly other | - |
The range and the median values for the GRAVY and molecular weight. The cellular localization based on the prediction with targetP represents the majority of the group proteins. The motifs specific for the groups were partly taken from Prosite (for dehydrins), the other motifs were build with the PRATT tool on the basis of well-defined LEA proteins belonging to these groups.
Figure 3Alignment of the dehydrin protein sequences of . Amino acid sequences were aligned using the ClustalW algorithm. Dashes indicate gaps introduced for optimal alignment. The typical dehydrin sequence elements are highlighted: K segment – red; Y segment – yellow; S segment – green; Lys-rich segment – grey. The genes forming homeologous pairs and tandem repeats in the genome (compare Fig. 6, Table 6 and 7) are indicated by arrows on the right and left side of the gene identifier, respectively. The complete sequences can also be found in Additional file 5.
Figure 4Expression analysis of all 51 . Expression was measured by quantitative RT-PCR in different organs (A), in mature leaves under different stress conditions (B), in axenic cultures under hormone induction (C) and in mature seeds (D). The color coding represents relative gene expression from 0 (yellow) to 100% (red), with 100% representing the highest expression within a given panel (compare e.g. the same leaf data as represented in A and D). See Additional file 3 for the complete data set. The numbers on the sides refer to the different LEA genes that are listed in Table 1.
Stress induced expression of LEA genes. Highly expressed LEA genes that are induced at least 3-fold in stress-treated leaf tissue. The numbering is according to Table 1. The relative expression in the different samples is shown under Expression. The first three columns show the unstressed controls ("hydroponics" for salt stress, "heat control" for heat stress, "leaf" for the other conditions). The gene expression after stress treatment compared to the appropriate controls is shown under Induction. Bold-face numbers highlight induction of genes by more than 3-fold.
| 1 | 32.1 | 426.0 | 32.1 | 340.7 | 90.5 | 53.5 | 875.1 | 44.6 | 81.9 | 2.8 | 1.7 | 2.1 | 1.4 | 2.5 | |
| 4 | 22.8 | 427.9 | 34.4 | 313.4 | 34.2 | 51.6 | 619.4 | 41.3 | 60.5 | 1.5 | 2.3 | 1.4 | 1.2 | 2.7 | |
| 5 | 47.4 | 675.9 | 34.4 | 392.3 | 83.0 | 82.2 | 1085.2 | 50.3 | 91.7 | 1.8 | 1.7 | 1.6 | 1.5 | 1.9 | |
| 7 | 0.2 | 0.5 | 0.1 | 0.3 | 4.2 | 0.9 | 142.7 | 0.3 | 1.3 | 2.0 | 2.5 | ||||
| 8 | 90.9 | 1236.8 | 111.3 | 496.3 | 106.6 | 185.1 | 1769.2 | 113.2 | 172.8 | 1.2 | 2.0 | 1.4 | 1.0 | 1.9 | |
| 16 | 182.1 | 33.2 | 166.4 | 104.7 | 53.8 | 78.0 | 385.1 | 218.5 | 49.9 | 0.6 | 0.3 | 0.4 | 1.3 | 0.3 | |
| 20 | 97.9 | 210.2 | 129.7 | 51.5 | 46.1 | 50.5 | 681.2 | 295.1 | 72.1 | 0.5 | 0.5 | 0.5 | 2.3 | 0.7 | |
| 23 | 4.9 | 724.9 | 9.5 | 411.2 | 2.1 | 30.6 | 500.0 | 14.4 | 22.7 | 0.4 | 0.7 | 1.5 | |||
| 24 | 15.9 | 1345.2 | 37.7 | 1912.1 | 39.1 | 57.4 | 2862.4 | 31.0 | 53.7 | 2.5 | 2.1 | 0.8 | |||
| 33 | 3.7 | 31.0 | 11.2 | 411.1 | 9.7 | 2.2 | 295.8 | 5.9 | 22.0 | 2.6 | 0.6 | 0.5 | |||
| 38 | 37.1 | 97.8 | 54.7 | 268.8 | 35.0 | 22.4 | 496.3 | 103.2 | 343.0 | 0.9 | 0.6 | 1.9 | |||
| 41 | 0.9 | 4.8 | 0.4 | 18.8 | 13.0 | 28.5 | 126.1 | 1.0 | 16.0 | 2.6 | |||||
| 46 | 0.4 | 0.0 | 0.3 | 1.9 | 18.8 | 0.7 | 58.4 | 0.7 | 2.0 | 1.7 | 2.7 | ||||
| 51 | 1.3 | 51.9 | 0.7 | 17.3 | 70.0 | 4.8 | 480.4 | 2.7 | 11.4 | ||||||
ABA-induced LEA gene expression
| 1 | 12.94 | 319.63 | |
| 4 | 57.38 | 1538.84 | |
| 5 | 19.47 | 1476.52 | |
| 6 | 0.28 | 12.02 | |
| 7 | 0.07 | 817.98 | |
| 8 | 181.55 | 93.35 | 0.5 |
| 10 | 41.46 | 407.82 | |
| 13 | 0.02 | 11.07 | |
| 15 | 0.17 | 11.99 | |
| 16 | 58.20 | 433.25 | |
| 18 | 0.25 | 129.79 | |
| 20 | 0 | 56.48 | |
| 23 | 0.71 | 21.49 | |
| 24 | 0.19 | 190.69 | |
| 25 | 0.01 | 13.44 | |
| 26 | 39.10 | 113.81 | 2.9 |
| 27 | 24.90 | 25.34 | 1.0 |
| 28 | 1.59 | 1739.22 | |
| 29 | 0.20 | 320.85 | |
| 33 | 9.30 | 1107.58 | |
| 36 | 0.06 | 195.57 | |
| 38 | 40.78 | 249.63 | |
| 41 | 53.15 | 117.89 | 2.2 |
| 42 | 0.06 | 17.46 | |
| 43 | 0.10 | 22.63 | |
| 46 | 0.31 | 726.86 | |
| 51 | 0 | 213.74 | |
Expression and induction of genes by abscisic acid treatment. The numbering of the genes is according to Table 1. Bold-face numbers highlight induction of more than 3-fold.
Figure 5Plot of mean net charge versus mean hydrophobicity of LEA and selected other proteins. "No LEA" refers to proteins originally annotated as LEA proteins but re-annotated in our study (Additional file 1). Arabidopsis seed storage proteins were included in the analysis, because they are a group of seed proteins that have clearly no sequence similarities to LEA proteins. The line marks the border between natively unstructured (left) and folded (right) proteins [91]. The inset documents the shift of five proteins when the putative targeting sequence is removed.
Figure 6Localization of the 51 identified . Genes related by endo-reduplication events during genome evolution (homeologous genes) are connected by lines and highlighted. Genes present as tandem repeats in the genome are boxed in.
Tandem repeats of LEA genes in the Arabidopsis genome
| At1g20440 | At1g20450 | 4 | 5 | Cytosol | Cytosol | 392 | 155 | 2.00E-36 | 0.723 |
| At1g52680 | At1g52690 | no LEA | 7 | Cytosol | Cytosol | 102 | 43.9 | 0.002 | 0 |
| At2g03740 | At2g03850 | 11 | 12 | Chloroplast | Chloroplast | 542 | 213 | 5.00E-54 | 0.893 |
| At2g23110 | At2g23120 | 15 | 16 | Cytosol | Cytosol | 194 | 79.3 | 4.00E-14 | 0.033 |
| At2g41260 | At2g41280 | 21 | 22 | secreted | secreted | 154 | 63.9 | 9.00E-09 | 0.45 |
| At2g42530 | At2g42540 | 23 | 24 | Chloroplast | Chloroplast | 464 | 183 | 2.00E-45 | 0.695 |
| At3g22490 | At3g22500 | 31 | 32 | Cytosol | Cytosol | 856 | 334 | 4.00E-90 | 0.857 |
| At3g50970 | At3g50980 | 33 | 34 | Cytosol | Cytosol | 144 | 60.1 | 7.00E-08 | 0.192 |
| At5g53260 | At5g53270 | 49 | 50 | Chloroplast | Cytosol | 646 | 253 | 4.00E-66 | n/a* |
Comparison of predicted subcellular localization (targetP) and expression (gene correlator on Genevestigator website) of tandem repeats of LEA genes. The coding sequences of the genes were aligned using BLAST 2 SEQUENCES to quantitate the sequence differences between the genes in a pair (gene 1 and gene 2). To test for diversification in expression patterns, the gene correlator function on the Genevestigator website was used to compare expression patterns from publicly available expression profiling data.
*this gene pair is not included on the Affymetrix ATH1 array and therefore no correlation value could be calculated.
Duplications of LEA genes in the Arabidopsis genome
| At1g01470 | At2g46140 | 1 | 27 | Cytosol | Cytosol | 484 | 191 | 1.00E-47 | 0.018 |
| At1g02820 | At4g02380 | 2 | 38 | Chloroplast | Chloroplast | 251 | 101 | 1.00E-20 | 0.014 |
| At1g20450 | At1g76180 | 5 | 10 | Cytosol | secreted | 386 | 153 | 1.00E-35 | 0.429 |
| At1g32560 | At2g35300 | 6 | 18 | Cytosol | Cytosol | 216 | 87.8 | 1.00E-16 | 0.819 |
| At1g52690 | At3g15670 | 7 | 29 | Cytosol | Cytosol | 453 | 179 | 7.00E-44 | 0.82 |
| At1g72100 | At1g22600 | 9 | no LEA | secreted | secreted | 317 | 126 | 2.00E-27 | 0.908 |
| At2g18340 | At4g36600 | 13 | 43 | secreted | Mitochondrion | 483 | 190 | 7.00E-47 | 0.966 |
| At2g21490 | At4g39130 | 14 | 45 | Cytosol | Cytosol | 146 | 60.8 | 4.00E-08 | 0.772 |
| At2g36640 | At3g53040 | 19 | 36 | Cytosol | Cytosol | 446 | 176 | 1.00E-42 | 0.945 |
| At3g02480 | At5g15960 | 28 | no LEA | Cytosol | Cytosol | 97 | 42 | 8.00E-03 | 0.043 |
| At3g50970 | At5g66400 | 33 | 51 | Cytosol | Cytosol | 124 | 52.4 | 2.00E-05 | 0.239 |
| At4g21020 | At5g44310 | 42 | 48 | Mitochondrion | Chloroplast | 503 | 198 | 3.00E-49 | 0.872 |
Comparison of predicted subcellular localization (targetP) and expression (gene correlator on Genevestigator website) of homeologous pairs of LEA genes. The coding sequences of the genes were aligned using BLAST 2 SEQUENCES to quantitate the sequence differences between the genes in a pair (gene 1 and gene 2). To test for diversification in expression patterns, the gene correlator function on the Genevestigator website was used to compare expression patterns from publicly available expression profiling data.