| Literature DB >> 30151141 |
Lihui Wang1, Ningning Wu1, Yan Zhu1, Wanlu Song1, Xin Zhao1, Yaxuan Li1, Yingkao Hu1.
Abstract
BURP domain-containing proteins belong to a plant-specific protein family and have diverse roles in plant development and stress responses. However, our understanding about the genetic divergence patterns and evolutionary rates of these proteins remain inadequate. In this study, 15 plant genomes were explored to elucidate the genetic origins, divergence, and functions of these proteins. One hundred and twenty-five BURP protein-encoding genes were identified from four main plant lineages, including 13 higher plant species. The absence of BURP family genes in unicellular and multicellular algae suggests that this family (1) appeared when plants shifted from relatively stable aquatic environments to land, where conditions are more variable and stressful, and (2) is critical in the adaptation of plants to adverse environments. Promoter analysis revealed that several responsive elements to plant hormones and external environment stresses are concentrated in the promoter region of BURP protein-encoding genes. This finding confirms that these genes influence plant stress responses. Several segmentally and tandem-duplicated gene pairs were identified from eight plant species. Thus, in general, BURP domain-containing genes have been subject to strong positive selection, even though these genes have conformed to different expansion models in different species. Our study also detected certain critical amino acid sites that may have contributed to functional divergence among groups or subgroups. Unexpectedly, all of the critical amino acid residues of functional divergence and positive selection were exclusively located in the C-terminal region of the BURP domain. In conclusion, our results contribute novel insights into the genetic divergence patterns and evolutionary rates of BURP proteins.Entities:
Keywords: BURP proteins; functional divergence; hormone response; phylogenetic tree; positive selection; promoter region; segmental duplication; stress response; tandem duplication
Year: 2015 PMID: 30151141 PMCID: PMC6102523 DOI: 10.1002/ece3.1792
Source DB: PubMed Journal: Ecol Evol ISSN: 2045-7758 Impact factor: 2.912
Figure 1Phylogenetic relationships, exon/intron structure, and motif structures of BURP domain‐containing genes. (A) The neighbor‐joining phylogenetic tree was constructed based on a complete protein sequence alignment of 125 BURP domain‐containing genes, which were identified using MUSCLE and MEGA6. Numbers at the nodes represent bootstrap support (1000 iterations). The color of subclades indicates the four corresponding gene subfamilies. Genes with similar functions clustered together are indicated by filled blue circles; (B) Exon/intron structures of the BURP domain‐containing genes. Yellow boxes: exons; blue boxes: 3ʹ untranslated region; lines: introns. Box and line lengths are scaled based on gene length; (C) MEME motif search results. Conserved motifs are indicated in numbered, colored boxes.
Promoter analysis of BURP domain‐containing gene family
| Hormone‐responsive elements | Environmental stress‐related elements | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| TCA‐element | TGA‐element | GARE‐motif | ABRE | TGACG‐motif | ARE | MBS | HSE | TC‐rich elements | |
| PG1 | 25/33 | 8/33 | 22/33 | 31/33 | 18/33 | 43/33 | 33/33 | 29/33 | 42/33 |
| BNM2 | 11/15 | 2/15 | 9/15 | 10/15 | 12/15 | 22/15 | 24/15 | 22/15 | 22/15 |
| BURPIII | 45/69 | 18/69 | 40/69 | 119/69 | 102/69 | 88/69 | 117/69 | 51/69 | 69/69 |
| BURPIV | 3/8 | 4/8 | 4/8 | 6/8 | 8/8 | 19/8 | 11/8 | 4/8 | 4/8 |
| Total | 84/125 | 32/125 | 75/125 | 166/125 | 140/125 | 172/125 | 185/125 | 106/125 | 137/125 |
Total number of cis‐acting elements/members of BURP domain‐containing subfamilies;
Total number of cis‐acting elements/members of BURP domain‐containing family.
Estimates of the dates of the segmental duplication events of the BURP domain‐containing gene family
| Gene pairs | KS (mean ± SD) | Estimated time (mya) | GWD (mya) | |
|---|---|---|---|---|
| AT1G70370 | AT1G23760 | 0.795 ± 0.122 | 26.5 | 28–48 |
| Brara.I00536 | Brara.B03626 | 0.480 ± 0.203 | 17.1 | 13–17 |
| Brara.F02754 | Brara.I00536 | 0.451 ± 0.123 | 16.1 | |
| Brara.F02754 | Brara.B03626 | 0.441 ± 0.191 | 15.8 | |
| Brara.G02440 | Brara.G02959 | 0.380 ± 0.074 | 13.6 | |
| Glyma.06G013400 | Glyma.11G119100 | 0.553 ± 0.102 | 45.3 | 13,59 |
| Glyma.06G013400 | Glyma.12G044600 | 0.548 ± 0.163 | 44.9 | |
| Glyma.12G217300 | Glyma.13G283900 | 0.145 ± 0.083 | 11.9 | |
| Glyma.02G034700 | Glyma.01G030900 | 0.151 ± 0.051 | 12.3 | |
| Glyma.06G081100 | Glyma.04G079600 | 0.132 ± 0.042 | 10.8 | |
Genes involved in tandem duplication and their 4DTv values
| Species | Gene pairs | 4DTv value | |
|---|---|---|---|
| Setaria italica | Si014261m.g | Si014260m.g | 0.0000 |
| Si014261m.g | Si014258m.g | 0.0270 | |
| Si014260m.g | Si014258m.g | 0.0268 | |
| Si006838m.g | Si006050m.g | 1.6000 | |
| Sorghum bicolor | Sobic.008G157800 | Sobic.008G157700 | 0.8424 |
| Sobic.008G157800 | Sobic.008G157500 | 0.4927 | |
| Sobic.008G157800 | Sobic.008G157600 | 0.5364 | |
| Sobic.008G157700 | Sobic.008G157500 | 0.6103 | |
| Sobic.008G157700 | Sobic.008G157600 | 0.5243 | |
| Sobic.008G157500 | Sobic.008G157600 | 0.1370 | |
| Citrus sinensis | Orange1.1g044596m | Orange1.1g036311m | 0.6667 |
| Glycine max | Glyma.14G140900 | Glyma.14G141000 | 0.1259 |
| Glyma.11G119100 | Glyma.11G119900 | 0.7027 | |
| Glyma.12G217300 | Glyma.12G217400 | 0.0504 | |
| Glyma.18G138000 | Glyma.18G138200 | 0.2310 | |
| Populus trichocarpa | Potri.006G243600 | Potri.006G243700 | 0.0697 |
| Potri.T012700 | Potri.T012500 | 0.0258 | |
| Potri.004G136900 | Potri.004G136500 | 0.0198 | |
| Potri.004G136900 | Potri.004G136700 | 0.0001 | |
| Potri.004G136900 | Potri.004G136000 | 0.0380 | |
| Potri.004G136900 | Potri.004G136600 | 0.0060 | |
| Potri.004G136500 | Potri.004G136700 | 0.0202 | |
| Potri.004G136500 | Potri.004G136000 | 0.0234 | |
| Potri.004G136500 | Potri.004G136600 | 0.0225 | |
| Potri.004G136700 | Potri.004G136000 | 0.0029 | |
| Potri.004G136700 | Potri.004G136600 | 0.0057 | |
| Potri.004G136000 | Potri.004G136600 | 0.0057 | |
| Potri.009G114300 | Potri.009G114600 | 0.5909 | |
| Potri.010G040800 | Potri.010G040900 | 0.0000 | |
| Cucumis sativus | Cucsa.109710 | Cucsa.109700 | 0.9248 |
Functional divergence between subfamilies of the BURP domain‐containing gene family
| Group I | Group II | Type II | LRT | Qk >0.95 | Critical amino acid sites | Type II | Qk >0.95 | Critical amino acid sites |
|---|---|---|---|---|---|---|---|---|
| PG1 | BNM2‐like | 0.618 ± 0.080 | 59.287 | 7 | 181D,184R,273L,279 | −0.113 ± 0.238 | 13 | 178 |
| PG1 | BURP III | 0.591 ± 0.071 | 69.307 | 7 | 184 | −0.824 ± 0.520 | None | |
| BNM2‐like | BURP III | 0.230 ± 0.058 | 15.684 | 1 | 273L | −2.010 ± 0.905 | None |
θ I and θ II, the coefficients of type I and type II functional divergence; LRT, likelihood ratio statistic; Qk, posterior probability;
Sites also responsible for the positive selection;
Sites in bold font indicate those responsible for both type I and type II functional divergence.
Figure 2Multiple sequence alignment of several BURP domain‐containing protein sequences. The position of the BURP domain is indicated at the top of each sequence. The sites of two conserved cysteine (C) residues, and four repeats of the cysteine‐histidine motif, which are located in the C‐terminus BURP domain, are indicated by red frames. The critical amino acid sites of adaptive selection and functional divergence are marked by the red stars and blue circles, respectively.
Tests for positive selection among codons of BURP domain‐containing genes using site models
| Models | p | Estimates of parameters | InL | 2Δl | Positively selected sites |
|---|---|---|---|---|---|
| M0 (one‐ratio) | 1 |
| −20918.0 | None | |
| M3 (discrete) | 5 |
p0 = 0.12267 p1 = 0.40583 p2 = 0.47150 | −20403.3 | 1029.4 (M3 vs. M0) | None |
| M7 (beta) | 2 | p = 0.79669 q = 4.17489 | −20362.5 | Not allowed | |
| M8 (beta&ɯ) | 4 |
p0 = 0.99999 p = 1.03218 q = 1.97229 | −24885.1 | 9045.2 (M8 vs. M7) |
|
Number of parameters in the ω distribution.
Positive selection sites are inferred at posterior probabilities >95% with those reaching 99% shown in bold.
Sites also found to be involved in the functional divergence.
Parameter estimation and likelihood ratio tests for the branch‐site models
| Cluster | Site class | Proportion | Background ɯ | Foreground ɯ | InL | 2Δl | Positive selection sites |
|---|---|---|---|---|---|---|---|
| PG1 | 0 | 0.86612 | 0.16585 | 0.16585 | −20839.9 | 156.2 (M2 vs. M0) | None |
| 1 | 0.09642 | 1.00000 | 1.00000 | ||||
| 2a | 0.03370 | 0.16585 | 998.99217 | ||||
| 2b | 0.00375 | 1.00000 | 998.99217 | ||||
| BNM2‐like | 0 | 0.79685 | 0.16490 | 0.16490 | −20836.0 | 164 (M2 vs. M0) | 179 |
| 1 | 0.08773 | 1.00000 | 1.00000 | ||||
| 2a | 0.10438 | 0.16490 | 62.88724 | ||||
| 2b | 0.01144 | 1.00000 | 62.88724 | ||||
| BURP III | 0 | 0.82963 | 0.16564 | 0.16564 | −20839.3 | 157.4 (M2 vs. M0) | None |
| 1 | 0.09127 | 1.00000 | 1.00000 | ||||
| 2a | 0.07125 | 0.16564 | 999.00000 | ||||
| 2b | 0.00784 | 1.00000 | 999.00000 | ||||
| BURP IV | 0 | 0.81364 | 0.16582 | 0.16582 | −20838.4 | 159.2 (M2 vs. M0) | None |
| 1 | 0.09005 | 1.00000 | 1.00000 | ||||
| 2a | 0.08671 | 0.16582 | 5.66019 | ||||
| 2b | 0.00960 | 1.00000 | 5.66109 |
Positive selection sites are inferred at posterior probabilities >95%.
Sites also found to be involved in the functional divergence.