| Literature DB >> 20542917 |
Meiping Zhang1, Yen-Hsuan Wu, Mi-Kyung Lee, Yun-Hua Liu, Ying Rong, Teofila S Santos, Chengcang Wu, Fangming Xie, Randall L Nelson, Hong-Bin Zhang.
Abstract
Many genes exist in the form of families; however, little is known about their size variation, evolution and biology. Here, we present the size variation and evolution of the nucleotide-binding site (NBS)-encoding gene family and receptor-like kinase (RLK) gene family in Oryza, Glycine and Gossypium. The sizes of both families vary by numeral fold, not only among species, surprisingly, also within a species. The size variations of the gene families are shown to correlate with each other, indicating their interactions, and driven by natural selection, artificial selection and genome size variation, but likely not by polyploidization. The numbers of genes in the families in a polyploid species are similar to those of one of its diploid donors, suggesting that polyploidization plays little roles in the expansion of the gene families and that organisms tend not to maintain their 'surplus' genes in the course of evolution. Furthermore, it is found that the size variations of both gene families are associated with organisms' phylogeny, suggesting their roles in speciation and evolution. Since both selection and speciation act on organism's morphological, physiological and biological variation, our results indicate that the variation of gene family size provides a source of genetic variation and evolution.Entities:
Mesh:
Year: 2010 PMID: 20542917 PMCID: PMC2965241 DOI: 10.1093/nar/gkq524
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
ANOVA in log10-transformed number of genes in the NBS and RLK families within a cultivar, within a subspecies, within a species, among subspecies and among species in Oryza, Glycine and Gossipium
| Genome | No. of lines | df | NBS | RLK | |||
|---|---|---|---|---|---|---|---|
| Different plants within a cultivar | |||||||
| cv. Teqing | AA | 23 | 0.715 | 0.554 | 0.587 | 0.650 | |
| cv | AA | 29 | 1.085 | 0.385 | 1.297 | 0.305 | |
| Different cultivars within a subspecies | |||||||
| ssp. | AA | 6 | 53 | 7.872*** | 0.000 | 4.000** | 0.009 |
| ssp. | AA | 6 | 59 | 10.718*** | 0.000 | 1.459 | 0.280 |
| ssp | AA | 6 | 35 | 12.366*** | 0.000 | 2.013 | 0.162 |
| Among subspecies of a species | |||||||
| | AA | 18 | 149 | 12.840*** | 0.000 | 9.477*** | 0.000 |
| Different cultivars or accessions within a species | |||||||
| | AA | 18 | 149 | 11.731*** | 0.000 | 3.676*** | 0.000 |
| | AA | 3 | 17 | 0.193 | 0.826 | 1.053 | 0.429 |
| | AA | 7 | 41 | 4.986*** | 0.001 | 0.869 | 0.549 |
| | AA | 4 | 23 | 15.611*** | 0.000 | 8.984** | 0.003 |
| | BB | 5 | 29 | 3.059* | 0.035 | 0.671 | 0.629 |
| | CC | 5 | 29 | 1.277 | 0.306 | 1.595 | 0.257 |
| | CC | 4 | 23 | 4.794* | 0.011 | 14.323*** | 0.001 |
| | CC | 3 | 17 | 3.176 | 0.071 | 4.531 | 0.075 |
| | BBCC | 5 | 29 | 5.679** | 0.002 | 6.363* | 0.017 |
| | BBCC | 5 | 29 | 1.926 | 0.137 | 6.446** | 0.008 |
| | CCDD | 2 | 11 | 12.910** | 0.005 | 37.584*** | 0.001 |
| Among different species | |||||||
| Diploids | 49 | 305 | 97.356*** | 0.000 | 29.909*** | 0.000 | |
| Polyploids | BBCC | 10 | 59 | 5.663* | 0.021 | 5.379* | 0.029 |
| Different cultivars or accessions within a species | |||||||
| | GG | 11 | 87 | 25.007*** | 0.000 | 5.534*** | 0.000 |
| | GG | 9 | 71 | 20.867*** | 0.000 | 4.920*** | 0.000 |
| Among different species | |||||||
| Diploids | 10 | 223 | 18.667*** | 0.000 | 11.408*** | 0.000 | |
| Different cultivars or accessions within a species | |||||||
| | A1A1 | 10 | 59 | 19.395*** | 0.000 | ||
| | A2A2 | 4 | 23 | 9.326*** | 0.000 | ||
| | D2-1D2-1 | 2 | 10 | 0.245 | 0.632 | ||
| | D3-kD3-k | 3 | 14 | 0.168 | 0.847 | ||
| | D5D5 | 3 | 17 | 0.302 | 0.744 | ||
| | D6D6 | 2 | 11 | 0.429 | 0.527 | ||
| | D8D8 | 3 | 16 | 0.026 | 0.974 | ||
| | (AADD)1 | 9 | 53 | 6.114*** | 0.000 | ||
| | (AADD)2 | 5 | 29 | 3.018* | 0.037 | ||
| | (AADD)3 | 12 | 77 | 1.404 | 0.192 | ||
| | (AADD)4 | 3 | 17 | 2.312 | 0.133 | ||
| | (AADD)5 | 2 | 17 | 2.038 | 0.173 | ||
| Among different species | |||||||
| Diploids | 65 | 372 | 4.737*** | 0.000 | |||
| Polyploids | 31 | 197 | 5.204*** | 0.000 | |||
aThe variation of the RLK family size was not studied in Gossypium.
The variation is significant in two-tailed test at *P ≤ 0.05, **P ≤ 0.01 and ***P ≤ 0.001.
Figure 1.Intra-specific variation of the NBS and RLK family sizes within (A) O. sativa (2x), (B) G. max (2x), (C) G. herbaceum (2x) and (D) G. hirsutum (4x). The mean number of genes in each accession or cultivar was calculated from four to eight replicates for each NBS or RLK family and the vertical bars indicate SDs (±SE). Up to 5.4-fold variation in number of genes in the gene families was detected among the lines of the species. The intra-specific variations were verified by ANOVA (Table 1).
Figure 2.Correlation of family size variation between NBS and RLK families in (A) Oryza and (B) Glycine.
Figure 3.Roles of polyploidization on the variation and evolution of the NBS and RLK family sizes in the (A) Oryza BBCC complex and the (B) Gossypium AADD complex. The mean number of genes in each species (genome) was calculated from 4 to 6 technical replicates per line and 2 to 12 line biological replicates for each NBS or RLK family, and the vertical bars indicate SDs (±SE). Note that the polyploid species have the numbers of NBS or RLK genes similar to that of one of their donor diploid species (P > 0.05). Ce, O. eichinggeri; Co, O. officinalis; Cr, O. rhizomatis; (BC)p, O. punctata (4x); (BC)m, O. minuta; Bp, O. punctata (2x); D5, G. raimondii; D6, G. gossypioides; (AD)1, G. hirsutum; (AD)2, G. barbadense; (AD)3, G. tomentosum; (AD)4, G. mustelinum; (AD)5, G. darwinii; A1, G. herbaceum; and A2, G. arboreum.
Correlation between genome size (pg/1C) and log10-transformed number of genes in the NBS and RLK families (Supplementary Table S1)
| Species analyzed | NBS | RLK | |||
|---|---|---|---|---|---|
| Coefficient ( | Coefficient ( | ||||
| 8 | −0.851** | 0.007 | −0.859** | 0.006 | |
| 10 | −0.372 | 0.290 | −0.260 | 0.468 | |
| 26 | 0.198 | 0.333 | Not studied | ||
| 18 | 0.312 | 0.208 | 0.038 | 0.881 | |
| 44 | 0.062 | 0.691 | Not studied | ||
The variation is significant in two-tailed test at **P ≤ 0.01.
Comparison by t-test in the log10-transformed number of genes in the NBS and RLK families between species of different geographical origins and ecotypes in Oryza and Glycine
| Species | NBS | RLK | |||||
|---|---|---|---|---|---|---|---|
| Mean ± SD | Mean difference | Mean ± SD | Mean difference | ||||
| | 24 | 2.7401 ± 0.20969 | −0.03599 | 0.532 | 3.0626 ± 0.19297 | 0.12677 | 0.189 |
| 18 | 2.7761 ± 0.13964 | 2.9358 ± 0.2178 | |||||
| | 54 | 2.9032 ± 0.17977 | 0.19380*** | 0.000 | 3.1268 ± 0.13808 | 0.16683*** | 0.000 |
| | 30 | 2.7094 ± 0.17976 | 2.9599 ± 0.13638 | ||||
| | 8 | 3.1492 ± 0.07208 | 0.47036*** | 0.000 | 3.2030 ± 0.17654 | 0.43739*** | 0.000 |
| 8 | 2.6789 ± 0.14579 | 2.7656 ± 0.10607 | |||||
The variation is significant in two-tailed test at ***P ≤ 0.001.
Comparison by t-test in the log10-tranformated number of genes in the NBS and RLK families between cultivated and wild species of Oryza and Glycine
| Species | NBS | RLK | |||||
|---|---|---|---|---|---|---|---|
| Mean ± SD | Mean difference | Mean ± SD | Mean difference | ||||
| | 54 | 2.9032 ± 0.17977 | 0.12703** | 0.008 | 3.1268 ± 0.13808 | 0.19095** | 0.006 |
| | 18 | 2.7761 ± 0.13964 | 2.9358 ± 0.21780 | ||||
| | 30 | 2.7094 ± 0.17976 | −0.06677 | 0.184 | 2.9599 ± 0.13638 | 0.02412 | 0.748 |
| | 18 | 2.7761 ± 0.13964 | 2.9358 ± 0.21780 | ||||
| | 36 | 2.7564 ± 0.19568 | −0.01971 | 0.705 | 2.9956 ± 0.13904 | 0.05978 | 0.401 |
| | 18 | 2.7761 ± 0.13964 | 2.9358 ± 0.21780 | ||||
| | 120 | 2.8107 ± 0.20221 | 0.03456 | 0.486 | 3.0446 ± 0.15494 | 0.10877 | 0.094 |
| 18 | 2.7761 ± 0.13964 | 2.9358 ± 0.21780 | |||||
| | 42 | 2.6399 ± 0.17400 | −0.10026* | 0.041 | 3.0017 ± 0.18582 | −0.06084 | 0.380 |
| | 24 | 2.7401 ± 0.20969 | 3.0626 ± 0.19297 | ||||
| | 88 | 2.9777 ± 0.19760 | −0.11342*** | 0.000 | 2.9935 ± 0.21865 | −0.07224* | 0.037 |
| | 72 | 3.0911 ± 0.16170 | 3.0657 ± 0.21465 | ||||
The variation is significant in two-tailed test at *P ≤ 0.05, **P ≤ 0.01 and ***P ≤ 0.001.
Figure 4.Association of gene family size variation with organismal phylogentic relationships indicated by phylogenetic distance. The phylogenetic trees and corresponding phylogenetic distances of (A) Oryza, (B) Glyine and (C) Gossypium species were from previous studies (52,55–57). The pairwised difference in number of genes in the families between two species was calculated by subtracting the mean number of the genes in one species from that of the other (Supplementary Table S1), and the phylogenetic distance between them was retrieved from the genetic distance matrix of their phylogenetic tree (Supplementary Table S7). The numbers near each branch of the trees indicate its percentage confidence estimated by bootstrap replications. Note that the size variations of the NBS and RLK families in the Oryza and Gossypium species positively (r = 0.692–0.792, P ≤ 0.004; r = 0.241, P = 0.000) and those in the Glycine species negatively (r=−0.526, P = 0.044) correlate with the species phylogenetic distances.