| Literature DB >> 35295554 |
Jia Li1, Litian Zhang2,3, Rania G Elbaiomy4, Lilan Chen1, Zhenrong Wang1, Jie Jiao1, Jiliang Zhu5, Wanhai Zhou1, Bo Chen1, Salma A Soaud6, Manzar Abbas1, Na Lin1, Ahmed H El-Sappah1,6.
Abstract
FRIZZY PANICLE (FZP), an essential gene that controls spikelet differentiation and development in the grass family (Poaceae), prevents the formation of axillary bud meristems and is closely associated with crop yields. It is unclear whether the FZP gene or its orthologs were selected during the evolutionary process of grass species, which possess diverse spike morphologies. In the present study, we adopted bioinformatics methods for the evolutionary analysis of FZP orthologs in species of the grass family. Thirty-five orthologs with protein sequences identical to that of the FZP gene were identified from 29 grass species. Analysis of conserved domains revealed that the AP2/ERF domains were highly conserved with almost no amino acid mutations. However, species of the tribe Triticeae, genus Oryza, and C4 plants exhibited more significant amino acid mutations in the acidic C-terminus region. Results of the phylogenetic analysis showed that the 29 grass species could be classified into three groups, namely, Triticeae, Oryza, and C4 plants. Within the Triticeae group, the FZP genes originating from the same genome were classified into the same sub-group. When selection pressure analysis was performed, significant positive selection sites were detected in species of the Triticeae and Oryza groups. Our results show that the FZP gene was selected during the grass family's evolutionary process, and functional divergence may have already occurred among the various species. Therefore, researchers investigating the FZP gene's functions should take note of the possible presence of various roles in other grass species. ©2022 Li et al.Entities:
Keywords: Evolution analysis; FRIZZY PANICLE (FZP) orthologs gene; Poaceae; Selective pressure
Year: 2022 PMID: 35295554 PMCID: PMC8919851 DOI: 10.7717/peerj.12880
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
FZP orthologs in 29 Poaceae species.
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|
|
| XP_020192385.1 | 2D | 314 | 6e−109 | 71.6 | NCBI |
|
| Bradi1g18580.1 ( | Bd1 | 319 | 4e−86 | 57 | Phytozome |
|
| OEL14197.1 | – | 305 | 3e−123 | 69.4 | NCBI |
|
| TVU37303.1 | 4 | 285 | 1e−114 | 70.3 | NCBI |
|
| KAE8778899.1 | 2H | 290 | 2e−113 | 71.6 | NCBI |
|
| LPERR07G22580.1 | 7 | 328 | 1e−117 | 66 | EnsemblPlants |
|
| OBART07G25780.1 | 7 | 213 | 7e−87 | 99 | EnsemblPlants |
| Os07g0669500 ( | 7 | 319 | 0 | 100 | EnsemblPlants | |
| EAZ05082.1 | 7 | 299 | 0 | 100 | NCBI | |
|
| ORGLA07G0201100.1 | 7 | 321 | 0 | 98 | EnsemblPlants |
|
| OGLUM07G25800.1 | 7 | 320 | 0 | 99 | EnsemblPlants |
|
| ORUFI07G26720.1 | 7 | 319 | 0 | 100 | EnsemblPlants |
|
| AAX83534.1 | 7 | 279 | 0 | 99.7 | NCBI |
|
| OMERI07G22420.1 | 7 | 319 | 0 | 98 | EnsemblPlants |
|
| KAF0907722.1 | 7 | 292 | 4e−147 | 82.4 | NCBI |
|
| ONIVA07G25390.1 | 7 | 319 | 0 | 100 | EnsemblPlants |
|
| OPUNC07G24180.1 | 7 | 321 | 6e−163 | 90 | EnsemblPlants |
|
| RLN33328.1 | 3 | 304 | 2e−124 | 70.5 | NCBI |
|
| Pavir.Ba00192.1 | 02a | 302 | 4e−126 | 72 | Phytozome |
|
| Pahal.2G484900.1 | 2 | 310 | 2e−128 | 69 | Phytozome |
|
| Sspon.02G0001330-3D | 2D | 377 | 1e−115 | 74 | EnsemblPlants |
| Sspon.02G0001330-2C | 2C | 378 | 3e−116 | 74 | EnsemblPlants | |
|
| Seita.2G427300.1 | scaffold_2 | 294 | 3e−125 | 71 | Phytozome |
|
| Sevir.2G437800.1 | 2 | 294 | 3e−125 | 71 | Phytozome |
|
| Sobic.002G411000.1 | 2 | 327 | 1e−129 | 71.6 | Phytozome |
|
| TraesCS2A02G116900.1( | 2A | 300 | 2e−109 | 69 | EnsemblPlants |
| TraesCS2B02G136100.1( | 2B | 308 | 3e−113 | 71 | EnsemblPlants | |
| TraesCS2D02G118200.1( | 2D | 296 | 2e−109 | 72 | EnsemblPlants | |
|
| TRIDC2AG014040.1 | 2A | 300 | 2e−109 | 69 | EnsemblPlants |
| TRIDC2BG016990.1 | 2B | 305 | 6e−115 | 68 | EnsemblPlants | |
|
| QBI22216.1 | 2A | 299 | 9e−108 | 68.4 | NCBI |
|
| VAH42493.1 | 2B | 307 | 2e−113 | 71.5 | NCBI |
| QBI22219.1 | 2A | 299 | 9e−108 | 68.4 | NCBI | |
|
| GRMZM2G458437 ( | 2 | 296 | 1e−104 | 62.5 | MaizeGDP |
| GRMZM2G307119 ( | 2 | 315 | 6e−108 | 66.5 | MaizeGDP |
Notes.
Use the rice FZP gene (Os07g0669500) protein sequence as the query sequence, and other species protein sequences as the subject sequence using the Blastp program of the NCBI (https://blast.ncbi.nlm.nih.gov/Blast.cgi?PROGRAM=blastp&PAGE_TYPE=BlastSearch&LINK_LOC=blasthome) calculate E-value and identity (%).
Figure 1Conserved amino acid motifs of FZP orthologs gene in Poaceae.
(A) The AP2/ERF domain of FZP orthologs gene. Amino acids residues with asterisks indicate that confer specific GCC box binding. The amino acid residues with brown, blue, green and red arrows indicate mutation sites that can cause supernumerary spikelets (SSs) phenotype in maize, Brachypodiumdistachyon, bread wheat and rice, respectively. An amino acid substitution is shown at position 13 (Y13C, Y at position 13 is changed to C) and 39 (L39P, L at position 39 is changed to P) of the QBI22216.1 and QBI22219.1 genes, respectively. (B) The C-terminus acidic domain of FZP orthologs gene.
Figure 2Phylogenetic analysis of the FZP orthologs gene in Poaceae (ML tree).
The numbers above the branches indicate the bootstrap values. The branch labels are tagged with gene ID numbers and Latin names of species (inside brackets). The red asterisks indicate the branch with sites of significant positive selection.
Selection pressure analysis of the FZP orthologous gene in Poaceae using the branch-site model.
|
|
|
|
|
|
|
| ||||
|---|---|---|---|---|---|---|---|---|---|---|
| Triticeae Branch | Model A | 73 | −7889.229233 | Site class | 0 | 1 | 2a | 2b | 0.000017894** | |
| f | 0.72446 | 0.23826 | 0.02805 | 0.00923 | ||||||
| 0.07414 | 1.00000 | 0.07414 | 1.00000 | |||||||
| 0.07414 | 1.00000 | 675.59953 | 675.59953 | |||||||
| Model A null | 72 | −7898.429883 | 1 | Not Allowed | ||||||
| Oryza Branch | Model A | 73 | −7897.344045 | f | 0.72693 | 0.25530 | 0.01315 | 0.00462 | 0.030827376* | 129 A 0.510, |
| 0.07428 | 1.00000 | 0.07428 | 1.00000 | |||||||
| 0.07428 | 1.00000 | 999.00000 | 999.00000 | |||||||
| Model A null | 72 | −7899.675320 | 1 | Not Allowed | ||||||
| C4 Plant Branch | Model A | 73 | −7896.076393 | f | 0.71012 | 0.25956 | 0.02221 | 0.00812 | 0.127622034 | |
| 0.07223 | 1.00000 | 0.07223 | 1.00000 | |||||||
| 0.07223 | 1.00000 | 26.74694 | 26.74694 | |||||||
| Model A null | 72 | −7897.236990 | 1 | Not Allowed | ||||||
Notes.
Number of parameters including branch lengths.
Likelihood score.
Positive selected sites (posterior probability >0.95).
Predict the impact of amino acid substitutions or indels on the biological functions of FZP gene proteins.
|
|
|
|
|
|---|---|---|---|
| A29I | A61 | −0.219 | Neutral |
| S45A | S129 | −0.686 | Neutral |
| A52T | A140 | 0.05 | Neutral |
|
| Y157 | −8.925 |
|
|
| L183 | −6.242 |
|
| T116A | T204 | −0.733 | Neutral |
| S197A | S327 | 0.747 | Neutral |
| S198D | S328 | −0.628 | Neutral |
| S252_D253insVS | −409, −410 | 1.097 | Neutral |
| M259_M260insG | −418 | 0.258 | Neutral |
| M259_M260delinsL | M417–M420 | −0.342 | Neutral |
| S308_S309insA | −483 | 1.028 | Neutral |
| S308_S309insN | −483 | 0.158 | Neutral |
| S309G | S484 | 0.219 | Neutral |
| S309A | S484 | −0.217 | Neutral |
|
| Y485-H493 | −4.189 |
|
Notes.
The variant position number is based on the amino acid sequence of the FZP gene (Os07g0669500) without gap as a reference. A29I, A at position 29 is changed to I, other similar types have the same meaning; S252_D253insVS, VS is inserted between positions 252 and 253, other similar types have the same meaning; M259_M260delinsL, MM is replaced by L; Y310_H318del, A deletion of nine amino acids, from Y at position 310 to H at position 318.
Variant with an asterisk indicates a significant positive selection site for selection pressure calculation. Bold variant indicates the “Deleterious” mutation site after amino acid substitution or indel.
The variant position number is based on multiple sequence alignment of FZP gene and its orthologous genes (Fig. S1).
Default threshold is −2.5, variants with a score equal to or below −2.5 are considered “deleterious”, if above −2.5 are considered “neutral”.
Figure 3The 3D structure of Arabidopsis AtERF96 gene and FZP orthologous gene protein.
The red and orange area in 3D structure indicate N-terminus AP2/ERF domain and C-terminus acidic domain of FZP genes, respectively. The C-terminus, α-helix, β-sheet and DNA were point out with black, purple, blue and orange arrows, respectively. The light blue amino acids with labels in the 3D structure indicate that they specifically bind to the DNA GCC box. The red amino acids with labels (P96 and C70) indicate that an amino acid substitution in the AP2/ERF domain are considered “deleterious” (PROVEAN predicted results). (A) Crystal structure of AtERF96 with GCC-box. (B–F) The 3D structure of Triticum monococcum QBI22216.1 gene, Zea mays BD1 gene (GRMZM2G307119), Triticum turgidum QBI22219.1 gene, Triticum aestivum WFZP-A gene (TraesCS2A02G116900.1) and Oryza Sativa L. spp. japonica FZP gene (Os07g0669500). The 3D structure of B–F graphics is visualized using model 1 by PyMOL software.