| Literature DB >> 24720629 |
Yan Zhu, Ningning Wu, Wanlu Song, Guangjun Yin, Yajuan Qin, Yueming Yan, Yingkao Hu1.
Abstract
BACKGROUND: Expansins are plant cell wall loosening proteins that are involved in cell enlargement and a variety of other developmental processes. The expansin superfamily contains four subfamilies; namely, α-expansin (EXPA), β-expansin (EXPB), expansin-like A (EXLA), and expansin-like B (EXLB). Although the genome sequencing of soybeans is complete, our knowledge about the pattern of expansion and evolutionary history of soybean expansin genes remains limited.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24720629 PMCID: PMC4021193 DOI: 10.1186/1471-2229-14-93
Source DB: PubMed Journal: BMC Plant Biol ISSN: 1471-2229 Impact factor: 4.215
Figure 1Chromosomal distribution of soybean () expansin genes. Chromosome size is indicated by its relative length. Chromosomes bearing no expansin genes (Chromosome 8 and 16) are not showed in this figure. Tandemly duplicated genes are represented by boxes with blue outlines. Segmental duplicated genes are indicated by red dots on the leftside. The figure was produced using the Map Inspector program.
Sizes of the four expansin subfamilies in different plants
| * | 26 | 6 | 3 | 1 |
| *Rice | 34 | 19 | 4 | 1 |
| Soybean | 49 | 9 | 2 | 15 |
| 25 | 6 | 0 | 5 | |
| 16 | 1 | 0 | 1 |
Note: *Datas collected from the review [7].
Figure 2An analytical view of the soybean expansin gene superfamily. The following parts are shown from left to right. Protein neighbor-joining tree: The unrooted tree was constructed using MEGA v5.0. The expansin proteins are named from their gene name (see Table 1). Gene structure: The gene structure is presented by green boxes that correspond to exons, and linking black lines that correspond to introns, while the blue line refers to the 5′-UTR and 3′-UTR. Motif compositions: The colored boxes represent the motifs in the protein, a total of 10 types of motifs were found in these 75 expansin genes, as indicated in the table on the right-hand side. The scale at the top of the image may be used to estimate motif length. aa, amino acids. A detailed motif introduction is shown in Additional file 9.
Genes involved in tandem duplication and their 4DTv values
| 6 | 0.1209 | |
| 10 | 0.2139 | |
| 5 | 0.1747 | |
| 5 | 0.1925 | |
| 5 | 0.2218 | |
| 17 | 0.2180 | |
| 17 | 0.3312 | |
| 17 | 0.2881 | |
| 17 | 0.2881 | |
| 17 | 0.2881 | |
| 17 | 0.2411 |
Estimates of the dates for the segmental duplication events of expanin gene superfamily in soybean
| 6 | 0.100 ± 0.012 | 8 | |
| 15 | 0.119 ± 0.054 | 10 | |
| 10 | 0.145 ± 0.024 | 12 | |
| 20 | 0.149 ± 0.054 | 12 | |
| 18 | 0.157 ± 0.118 | 13 | |
| 15 | 0.175 ± 0.141 | 14 | |
| 13 | 0.177 ± 0.213 | 14 | |
| 19 | 0.188 ± 0.172 | 15 | |
| 17 | 0.202 ± 0.166 | 17 | |
| 16 | 0.205 ± 0.096 | 17 | |
| 20 | 0.239 ± 0.253 | 20 | |
| 14 | 0.254 ± 0.219 | 21 | |
| 5 | 0.270 ± 0.132 | 22 | |
| 5 | 0.296 ± 0.266 | 24 | |
| 4 | 0.300 ± 0.179 | 25 | |
| 4 | 0.453 ± 0.085 | 37 | |
| 5 | 0.477 ± 0.346 | 39 | |
| 4 | 0.515 ± 0.139 | 42 | |
| 8 | 0.531 ± 0.118 | 44 | |
| 8 | 0.539 ± 0.132 | 44 | |
| 9 | 0.598 ± 0.189 | 49 | |
| 9 | 0.613 ± 0.180 | 50 | |
| 11 | 0.617 ± 0.158 | 51 | |
| 11 | 0.626 ± 0.155 | 51 | |
| 4 | 0.633 ± 0.257 | 52 | |
| 6 | 0.650 ± 0.158 | 53 | |
| 4 | 0.650 ± 0.177 | 53 | |
| 4 | 0.680 ± 0.163 | 56 | |
| 5 | 0.685 ± 0.135 | 56 | |
| 7 | 0.708 ± 0.099 | 58 | |
| 7 | 0.710 ± 0.102 | 58 | |
| 7 | 0.763 ± 0.112 | 63 | |
| 6 | 0.768 ± 0.078 | 63 | |
| 3 | 0.790 ± 0.207 | 65 | |
| 3 | 0.817 ± 0.266 | 67 | |
| 16 | 0.169 ± 0.077 | 14 | |
| 6 | 0.397 ± 0.277 | 33 | |
| 23 | 0.167 ± 0.072 | 14 | |
| 19 | 0.176 ± 0.117 | 14 | |
| 8 | 0.191 ± 0.165 | 16 | |
| 7 | 0.202 ± 0.149 | 17 | |
| 8 | 0.211 ± 0.179 | 17 | |
| 15 | 0.236 ± 0.216 | 19 | |
| 3 | 0.447 ± 0.029 | 37 | |
| 3 | 0.503 ± 0.068 | 41 | |
| 3 | 0.513 ± 0.093 | 42 | |
| 6 | 0.630 ± 0.117 | 52 | |
| 4 | 0.685 ± 0.227 | 56 |
Figure 3Expression profiles of the 75 soybean expansin genes. The hierarchical cluster color code: the largest values are displayed as the reddest (hot), the smallest values are displayed as the bluest (cool), and the intermediate values are a lighter color of either blue or red. Raw data were normalized by the following equation: reads/kilobase/million. Pearson correlation clustering was used to group the developmentally regulated genes. Six genes were excluded from the analysis because they were not expressed in an organ or a period. The red outlined boxes represent the small phylogenetic clades that had a largely similar transcript abundance profile.
Functional divergence between subfamilies of the expansin gene superfamily in soybean
| EXPA | EXPB | 0.498 ± 0.079 | 39.742 | 3 | 84C, | -0.023 ± 0.259 | 15 | 62 T,65 L,103 F, |
| | | | | | | | | 143 V,160 F,176 V,177G,190*S, |
| EXPA | EXLB | 0.783 ± 0.082 | 91.136 | 17 | 0.136 ± 0.278 | 53 | 18A, | |
| | | | | | 161 T, | | | 72 N,75S,76C,82I, |
| | | | | | | | 120P,125 F,126D,127 L,133*L,137Q,138Y, | |
| | | | | | 202G | | | |
| | | | | | | | | 168 F,170 L,175 N, |
| | | | | | | | | 187I,189G, |
| | | | | | | | | 204 N,205 W,207S,208 N,209 N,210Y,213G |
| EXPB | EXLB | 0.572 ± 0.141 | 14.448 | 0 | none | -0.081 ± 0.298 | 13 | |
| 134R,137R,140A,175 N |
Note: θI and θII, the coefficients of Type-I and Type-II functional divergence;
LRT, Likelihood Ratio Statistic;
Qk, posterior probability;
*Sites also responsible for the positive selection;
Sites in bold means they are responsible for both type-I and type-II functional divergence.
Tests for positive selection among codons of expansin genes using site models
| M0 | 1 | ω = 0.133 | -14554.8 | | None |
| (one-ratio) | | | | | |
| M3 | 5 | p0 = 0.22607 p1 = 0.55054 p2 = 0.22339 | -14274.8 | 560(M3 | None |
| (discrete) | | ω1 = 0.02570 ω2 = 0.11359 ω3 = 0.33505 | | | |
| M7 | 2 | p = 0.99176 q = 5.71801 | -14266.9 | | Not allowed |
| (beta) | | | | | |
| M8 | 4 | p0 = 0.99999 p = 0.61117 q = 1.88462 | -16630.6 | 4727.4(M8 | |
| (beta&ɯ) | (p1 = 0.00001) ω = 2.02644 |
Note: aNumber of parameters in the ω distribution.
bPositive-selection sites are inferred at posterior probabilities > 95% with those reaching 99% shown in bold.
*Sites were also found to be implicated in the functional divergence.
Parameters estimation and likelihood ratio tests for the branch-site models
| EXPA | 0 | 0.69757 | 0.12227 | 0.12227 | |
| | 1 | 0.07472 | 1 | 1 | |
| | 2a | 0.20568 | 0.12227 | 1.32036 | |
| | 2b | 0.02203 | 1 | 1.32036 | |
| EXPB | 0 | 0.70168 | 0.12941 | 0.12941 | none |
| | 1 | 0.08281 | 1 | 1 | |
| | 2a | 0.19276 | 0.12941 | 999 | |
| | 2b | 0.02275 | 1 | 999 | |
| EXLA | 0 | 0.43399 | 0.12875 | 0.12875 | none |
| | 1 | 0.05093 | 1 | 1 | |
| | 2a | 0.46097 | 0.12875 | 999 | |
| | 2b | 0.0541 | 1 | 999 | |
| EXLB | 0 | 0.49513 | 0.12363 | 0.12363 | 192 T* |
| | 1 | 0.05939 | 1 | 1 | |
| | 2a | 0.39777 | 0.12363 | 999 | |
| 2b | 0.04771 | 1 | 999 |
Note: aPositive-selection sites are inferred at posterior probabilities > 95% with those reaching 99% shown in bold.
*Sites were also found to be implicated in the functional divergence.
Duplication events of the four expansin subfamilies in Soybean, andrice
| | ||||||||
|---|---|---|---|---|---|---|---|---|
| Soybean | 67.3% (33 of 49) | 44.4% (4 of 9) | 100% (2 of 2) | 73.3% (11 of 15) | 4.1% (2 of 49) | 22.2% (2 of 9) | 0% (0 of 2) | 46.7% (7 of 15) |
| 61.5% (16 of 26) | 33.3% (2 of 6) | 0% (0 of 3) | 0% (0 of 1) | 23.1% (6 of 26) | 33.3% (2 of 6) | 66.7% (2 of 3) | 0% (0 of 1) | |
| Rice | 14.7% (5 of 34) | 47.4% (9 of 19) | 50% (2 of 4) | 0% (0 of 1) | 52.9% (18 of 34) | 73.7% (14 of 19) | 0% (0 of 4) | 0% (0 of 1) |
Figure 4Model building of the 3D structure of the soybean expansin protein (GmEXPA1) based on its similarity to the Zea m 1 (Protein Data Bank [PDB] code: 2HCZ). Seven critical amino acid sites responsible for both functional divergence and positive selection are shown to varying degrees in (A), (B), and (C). The figure was produced using the Swiss-model and pyMOL programs. (A) The overall view of the seven critical amino acid sites on the 3D structure. The seven sites are labeled in red. (B) View of critical amino acid sites on the surface of GmEXPA1. Four amino acid sites responsible for both functional divergence and positive selection located on the surface of the molecule are colored red. Of these amino acid sites, 190S and 192 T may be critical for polysaccharide binding of domain II. The gray area represents the N and C termini. (C) Critical amino acid sites related to the contact of the two domains. Only the two domains are shown to better exhibit the two residues, 172 L and 196P, which may be related to the contact of the two domains.
Numbers of CAASs for functional divergence and positive selection in specific region of the protein structure
| N terminus | 3 | 6 | 2 | 1 | 1 |
| Domain I | 4 | 29 | 2 | 1 | 1 |
| Linker | 0 | 2 | 1 | 0 | 0 |
| Domain II | 11 | 26 | 9 | 9 | 5 |
| C terminus | 0 | 0 | 0 | 0 | 0 |