| Literature DB >> 29467740 |
Songli Yuan1,2, Xiangyong Li3, Rong Li1,2, Lei Wang3, Chanjuan Zhang1,2, Limiao Chen1,2, Qingnan Hao1,2, Xiaojuan Zhang1,2, Haifeng Chen1,2, Zhihui Shan1,2, Zhonglu Yang1,2, Shuilian Chen1,2, Dezhen Qiu1,2, Danxia Ke3, Xinan Zhou1,2.
Abstract
Root nodule symbiosis (RNS) is one of the most productive and economical systems for nitrogen fixation, and previous studies have shown that several nodule-specific C2H2-zinc finger proteins (ZFPs) play important roles in symbiosis establishment and nodule function. However, C2H2-ZFPs are the most widespread ZFPs in eukaryotes, and a great variation of structure and function exist among the family members. It remains largely unclear whether or not special types of C2H2-ZF genes participate in symbiosis, especially in soybean. In the present study, we performed a genome-wide survey of soybean C2H2-ZF genes, and 321 soybean C2H2-ZF genes were identified and classified into 11 clearly distinguishable subsets (Gm-t1-SF, Gm-t2-SF, Gm-1i-Q-SF, Gm-1i-M-SF, Gm-1i-Z-SF, Gm-1i-D-SF, Gm-2i-Q-SF, Gm-2i-M-SF, Gm-2i-Mix-SF, Gm-3i-SF, and Gm-4i-SF) based on the arrangements, numbers, and types of C2H2-ZF domains. Phylogenetic and gene ontology analyses were carried out to assess the conserved sequence and GO function among these subsets, and the results showed that the classification of soybean C2H2-ZFPs was reasonable. The expression profile of soybean C2H2-ZFPs in multiple tissues showed that nearly half of soybean C2H2-ZFPs within different subsets had expressions in nodules, including a clustering branch consisting of 11 Gm-1i-Q-SF genes specifically expressed in symbiotic-relative tissues. RNA-Seq was used to identify symbiosis-related soybean C2H2-ZFPs, and the expression pattern of the soybean C2H2-ZFPs in roots and nodules at different development stages showed that soybean C2H2-ZFPs mainly played roles in nodule development or nodule function rather than nodulation signal transduction, and nearly half of these genes had high expressions and/or different expression patterns during soybean nodule development, especially for the six clustering branches of genes consisting of different subsets of C2H2-ZFPs. Furthermore, the selected symbiosis-related soybean C2H2-ZFPs might function in legume-rhizobium symbiosis through regulating or interacting with other key proteins. Taken together, our findings provided useful information for the study on classification and conservative function of C2H2-ZFPs, and offered solid evidence for investigation of rhizobium symbiosis-related C2H2-ZFPs in soybean or other legumes.Entities:
Keywords: C2H2 zinc finger proteins; classification; gene expression; root nodule symbiosis; soybean
Year: 2018 PMID: 29467740 PMCID: PMC5807899 DOI: 10.3389/fmicb.2018.00126
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
Figure 1Chromosomal location of the soybean C2H2-ZF genes. The illustrated genome-wide chromosome organization caused by whole genome duplication events is accomplished using the Circos software based on the duplication coordinates defined in the soybean genome assembly v2.0. Segmental duplicated blocks are gray coded. Paralogous pairs of soybean C2H2-ZF genes are connected with orange lines.
Figure 2Subset classification of C2H2-ZF genes in the soybean genome. (A) Numbers of C2H2-ZFPs in the 11 different subsets. (B) Numbers of C2H2-ZFPs containing 1~5 and 9 C2H2-ZF domains. For the detailed information of the classified C2H2-ZFPs in soybean, see Supplementary Tables S3, S4.
Figure 3Phylogenetic analysis of the classified soybean C2H2-ZF genes. Firstly, neighbor-joining phylogenetic tree of the 321 soybean C2H2-ZFPs was constructed using MEGA6 by the neighbor-joining method with 1,000 bootstrap replicates; then classified the subset of each C2H2-ZF gene and compressed the subtree of the neighbor-joining phylogenetic tree to construct a phylogenetic tree of the 11 classified soybean C2H2-ZFP subsets. The low bootstrap supports of some main branches indicated that a high degree of complexity was existed in different soybean C2H2-ZFPs subsets. Most of the same subset C2H2-ZFPs was clustered together and had more than 50% bootstrap support. RGB background indicated 18 small branches with different subsets had significant bootstrap support (>50%).
Figure 4Gene Ontology (GO) results for the 11 soybean C2H2-ZFP subsets. Go analysis of 321 soybean C2H2-ZFPs (11 subsets) showed their involvement in biological process, cellular components, and molecular function, and the numbers of genes in each term are shown in histograms. Seventeen GO function terms are indicated: 1, Signal Transduction; 2, Flower Development; 3, Photosynthesis; 4, Cell Differentiation; 5, Extracellular Region; 6, Cell Wall; 7, Intracellular; 8, Nucleus; 9, Nucleolus; 10, Cytoplasm; 11, Mitochondrion; 12, Cytosol; 13, Plasma Membrane; 14, Nucleic Acid Binding; 15, DNA Binding; 16, Sequence-specific DNA Binding Transcription Factor Activity; 17, Protein Binding. (A) Gm-t1-SF, Gm-t2-SF, Gm-1i-Q-SF, Gm-1i-M-SF and Gm-1i-Z-SF. (B) Gm-1i-D-SF, Gm-2i-Q-SF, Gm-2i-M-SF, Gm-2i-Mix-SF, Gm-3i-SF, and Gm-4i-SF.
GO slim terms in soybean C2H2-ZFPs compared with all the annotated genes of genome.
| GO:0006355 | 218 | Overrepresented | 1.30929E-42 | Regulation of transcription, DNA-dependent |
| GO:0051973 | 3 (Gm-1i-Q-SF) | Overrepresented | 0.021678694 | Positive regulation of telomerase activity |
| GO:0006417 | 12 (Gm-1i-Q-SF, Gm-t1-SF and Gm-1i-M-SF) | Overrepresented | 1.20782E-06 | Regulation of translation |
| GO:1900109 | 4 (Gm-t1-SF) | Overrepresented | 2.1153E-05 | Regulation of histone H3-K9 dimethylation |
| GO:0035067 | 4 (Gm-t1-SF) | Overrepresented | 0.000307881 | Negative regulation of histone acetylation |
| GO:0033169 | 6 (Gm-t1-SF) | Overrepresented | 5.08703E-08 | Histone H3-K9 demethylation |
| GO:0006457 | 2 (Gm-2i-M-SF) | Underrepresented | 0.015078646 | Protein folding |
| GO:0007165 | 5 (Gm-2i-Q-SF) | Underrepresented | 0.000326855 | Signal transduction |
| GO:0009965 | 27 (Gm-1i-Q-SF, Gm-t1-SF and Gm-1i-M-SF) | Overrepresented | 0.000149664 | Leaf morphogenesis |
| GO:0030154 | 25 (Gm-1i-Q-SF, Gm-t1-SF and Gm-1i-M-SF) | Overrepresented | 6.47367E-07 | Cell differentiation |
| GO:2000011 | 3 (Gm-1i-M-SF) | Overrepresented | 0.001130818 | Regulation of adaxial/abaxial pattern formation |
| GO:0010500 | 10 (Gm-2i-Mix-SF and Gm-t1-SF) | Overrepresented | 1.54005E-11 | Transmitting tissue development |
| GO:0010026 | 8 (Gm-1i-Q-SF) | Overrepresented | 0.000356462 | Trichome differentiation |
| GO:0010093 | 8 (Gm-1i-Q-SF) | Overrepresented | 0.01569553 | Specification of floral organ identity |
| GO:2000904 | 6 (Gm-t1-SF) | Overrepresented | 1.13217E-05 | Regulation of starch metabolic process |
| GO:0010160 | 6 (Gm-1i-Q-SF) | Overrepresented | 1.94968E-05 | Formation of organ boundary |
| GO:0035264 | 5 (Gm-2i-Q-SF) | Overrepresented | 3.95396E-07 | Multicellular organism growth |
| GO:0009793 | 4 (Gm-1i-Q-SF and Gm-1i-Z-SF) | Underrepresented | 0.000676858 | Embryo development ending in seed dormancy |
| GO:0045604 | 4 (Gm-t1-SF) | Overrepresented | 0.000307881 | Regulation of epidermal cell differentiation |
| GO:0010117 | 5 (Gm-2i-Q-SF) | Overrepresented | 0.000448948 | Photoprotection |
| GO:0010031 | 5 (Gm-t1-SF) | Overrepresented | 0.002027841 | Circumnutation |
| GO:0009959 | 5 (Gm-t1-SF) | Overrepresented | 0.01248965 | Negative gravitropism |
| GO:0009611 | 5 (Gm-2i-Q-SF) | Underrepresented | 0.031241483 | Response to wounding |
| GO:0009733 | 5 (Gm-2i-Q-SF) | Underrepresented | 0.043936403 | Response to auxin stimulus |
| GO:0002679 | 21 (Gm-2i-Q-SF, Gm-1i-Q-SF and Gm-1i-M-SF) | Overrepresented | 0.009990945 | Respiratory burst involved in defense response |
| GO:0048579 | 4 (Gm-1i-Q-SF and Gm-t1-SF) | Overrepresented | 0.002472123 | Negative regulation of long-day photoperiodism, flowering |
| GO:0010106 | 17 (Gm-t1-SF,Gm-2i-Mix-SF and Gm-1i-Q-SF) | Overrepresented | 0.0005255 | Cellular response to iron ion starvation |
| GO:0006826 | 17 (Gm-t1-SF,Gm-2i-Mix-SF and Gm-1i-Q-SF) | Overrepresented | 0.000657982 | Iron ion transport |
| GO:0009651 | 9 (Gm-1i-M-SF and Gm-2i-Q-SF) | Underrepresented | 5.40743E-05 | response to salt stress |
| GO:0009937 | 6 (Gm-t1-SF) | Overrepresented | 2.00227E-07 | Regulation of gibberellic acid mediated signaling pathway |
| GO:0010447 | 5 (Gm-t1-SF) | Overrepresented | 3.95396E-07 | Response to acidity |
| GO:0009590 | 5 (Gm-t1-SF) | Overrepresented | 0.00028063 | Detection of gravity |
Figure 5Expression analysis of 321 soybean C2H2-ZFPs in different tissues in the Phytozome database. Heatmaps were produced based on the expression values of soybean C2H2-ZFPs in the Phytozome database using the pheatmap packages in R. These tissues include flower, leaves, nodules, pod, root, root hairs, seed, and stem. The red frame indicates the symbiotic-relative clustering branch. (A) Heatmap showing Gm-t1-SF and Gm-t2-SF genes in eight selected tissues. (B) Heatmap showing Gm-1i-Q-SF, Gm-1i-M-SF, Gm-1i-Z-SF, and Gm-1i-D-SF genes in eight selected tissues. (C) Heatmap showing Gm-2i-Q-SF, Gm-2i-M-SF, Gm-2i-Mix-SF, Gm-3i-SF, and Gm-4i-SF in eight selected tissues.
Figure 6RNA-seq analysis of 321 soybean C2H2-ZFPs in soybean nodules at five important developmental stages. Heatmaps were produced based on our RNA-Seq results using the pheatmap packages in R. The description of these five important developmental stages (branching stage, flowering stage, fruiting stage, pod stage, and harvest stage) were shown in our previous research (Yuan et al., 2017). “⋆⋆”indicates the clustering branches of genes were expressed most abundantly in all or most of above-mentioned nodule samples. “⋆”indicates the clustering branches of genes were expressed high in nodule samples. (A) Heatmap for Gm-t1-SF and Gm-t2-SF genes. (B) Heatmap for Gm-1i-Q-SF, Gm-1i-M-SF, Gm-1i-Z-SF and Gm-1i-D-SF genes. (C) Heatmap for Gm-2i-Q-SF, Gm-2i-M-SF, Gm-2i-Mix-SF, Gm-3i-SF, and Gm-4i-SF genes.
Figure 7Comparison of expression determined by RNA-Seq and qPCR on 12 genes in soybean roots and nodule development. Half of the roots and nodules were inoculated with B. japonicum strain 113-2 and harvested at different time intervals post inoculation, and the other roots treated with water served as the uninoculation control. The relative expression levels of roots were calculated with reference to the value of the control roots harvested 0.5 h after mock treatment with water, and the relative expression levels of nodules were calculated with reference to the value of the branching_stage N. All qPCR reactions were repeated three times and the data are presented as the mean ± SD.
List of symbiosis-related C2H2 genes in Medicago truncatula and common soybean by searching for homologs of soybean.
| Glyma.01G056900 | Nodule development | Medtr5g043930 | Nodule development | Phvul.002G082200 | Nodule formation and nodule function |
| Glyma.01G079900 | Nodule development | Medtr0160s0030 | High expressed in nodules.symbiotic condition | Phvul.008G143900 | Nodule development and nodule function |
| Glyma.01G134200 | Nodule development | 0 | Phvul.010G071300 | Nodule development and nodule function | |
| Glyma.02G099100 | Nodule development | Medtr1g093095 | Nodule development | Phvul.003G027900 | No expression in roots and nodules |
| Glyma.02G144400 | Nodule development | 0 | Phvul.007G174900 | Nodule development | |
| Glyma.03G173100 | Nodule development | 0 | Phvul.005G046200 | Expressed in roots and nodules | |
| Glyma.03G228100 | Nodule development | Medtr7g111520 | Nodule development | Phvul.001G222000 | Nodule development |
| Glyma.04G044900,Glyma.06G045400 | Nodule senescence | Medtr3g102980 | High expressed in different nodules and roots | Phvul.009G070800 | Nodule formation |
| Glyma.04G130300,Glyma.06G316400 | High expressed in different nodules and roots | Medtr3g007710 | Nodule development | Phvul.005G011500 | Nodule formation |
| Glyma.04G191900 | Nodule development | Medtr3g075210 | High expressed in different nodules | Phvul.009G169700 | Nodule formation |
| Glyma.05G241800,Glyma.08G049300 | High expressed in different nodules and roots | Medtr8g106220 | Nodule development | Phvul.002G325100 | Nodule formation |
| Glyma.06G314900 | High expressed in different nodules and roots | Medtr3g006760 | High expressed in different nodules | Phvul.005G010800 | Nodule development or formation |
| Glyma.07G135800,Glyma.18G185900 | High expressed in different nodules | 0 | Phvul.008G117400 | Nodule development and nodule function | |
| Glyma.07G158200 | Nodule development | 0 | Phvul.003G008300 | Nodule formation | |
| Glyma.09G250600,Glyma.18G241900 | High expressed in different nodules and roots | 0 | Phvul.008G054200 | Nodule development or formation | |
| Glyma.09G267800 | Nodule development | Medtr7g075170 | Nodule development | Phvul.008G075100 | Nodule development or formation |
| Glyma.10G034700,Glyma.13G121000 | High expressed in different nodules | Medtr1g070250 | Nodule development | Phvul.007G195500 | Nodule development and nodule function |
| Glyma.10G055200 | High expressed in different nodules and roots | Medtr1g063190 | Expressed in nodules/roots.symbiotic condition | Phvul.007G219000 | Nodule development or formation |
| Glyma.10G153200,Glyma.20G235100 | Nodule development | 0 | Phvul.007G196100 | Nodule formation | |
| Glyma.10G184500,Glyma.20G206000 | Nodule development | 0 | Phvul.007G125400 | Nodule development and nodule function | |
| Glyma.10G215200,Glyma.20G176500 | High expressed in different nodules and roots | 0 | Phvul.007G090500 | Nodule formation | |
| Glyma.10G257900,Glyma.20G133200 | Nodule senescence | Medtr1g106730 | High expressed in different nodules and roots | Phvul.007G044500 | Nodule development or formation |
| Glyma.11G135800,Glyma.12G059500 | Nodule development | 0 | Phvul.011G061500 | Nodule development and nodule function | |
| Glyma.12G171700,Glyma.13G327500 | High expressed in different nodules | 0 | Phvul.005G136300 | Nodule formation | |
| Glyma.12G183900,Glyma.13G317200 | Nodule development | 0 | Phvul.005G124800 | Nodule formation | |
| Glyma.13G250800,Glyma.15G063600 | Nodule development | Medtr2g010860 | High expressed in different roots | Phvul.006G196000 | Mainly expressed in nodules |
| Glyma.14G088300 | Nodule senescence | Medtr1g018420 | High expressed in different nodules | Phvul.001G026700 | High expressed in nodules and roots |
| Glyma.16G156400 | High expressed in different nodules and roots | 0 | Phvul.004G084300 | Nodule formation | |
| Glyma.18G186000 | Nodule development | 0 | Phvul.008G117300 | Nodule development or formation | |
| Glyma.U019400 | High expressed in different nodules and roots | Medtr4g059870 | High expressed in different nodules | Phvul.011G074100 | Nodule formation |
0″ indicates no match to the genome. The expression of soybean C2H2 in symbiosis was referred to Figure .
The interacting protein prediction of symbiosis-related soybean C2H2-ZFPs.
| Glyma.02G144400 | Gm-t1-SF | Glyma.05g189400, Glyma.08g147000, | Ran GTPase-activating protein | |
| Glyma.10G153200 | Gm-t1-SF | Glyma.01g146600, Glyma.09g194200 | cGMP-dependent protein kinase | |
| Glyma.12G171700 | Gm-t1-SF | Glyma.07g249700, Glyma.09g046800, Glyma.15g154300 | 5′-AMP-activated protein kinase | |
| Glyma.20G235100 | Gm-t1-SF | Glyma.17g024700, Glyma.19g234600, Glyma.20g234600 | 5′-AMP-activated protein kinase | |
| Glyma.U019400 | Gm-t1-SF | |||
| Glyma.16G156400 | Gm-t1-SF | Glyma.05g189400, Glyma.08g147000 | Ran GTPase-activating protein | |
| Glyma.10G215200 (STOP1) | Gm-t1-SF | Glyma.01g146600, Glyma.09g194200 | cGMP-dependent protein kinase | |
| Glyma.01g199400,Glyma.11g042500, Glyma.17g154200 | Predicted membrane protein | |||
| Glyma.10g047100 | Putative ABC transport system permease protein | |||
| Glyma.20g234600 | 5′-AMP-activated protein kinase | |||
| Glyma.13G327500 | Gm-t1-SF | Glyma.05g189400, Glyma.08g147000 | Ran GTPase-activating protein | |
| Glyma.01g146600, Glyma.09g194200 | cGMP-dependent protein kinase | |||
| Glyma.10g082800 | CCAAT-binding transcription factor | |||
| Glyma.01g008700 | RNA polymerase | |||
| Glyma.14g110900 | C2H2-Zinc finger | |||
| Glyma.17g065800 | Myb-like transcription factor | |||
| Glyma.14G050100 | ERF5 | |||
| Glyma.20g234600 | 5′-AMP-activated protein kinase | |||
| Glyma.05G241800 | Gm-t1-SF | Glyma.02g297300, Glyma.08g325700,Glyma.18g081400, | Frigida-like protein | |
| Glyma.08G049300 | Gm-t1-SF | Glyma.04g203400, Glyma.06g162100 | Frigida-like protein | |
| Glyma.10g258300, Glyma.20g132800,Glyma.20g133400, | mRNA export factor and BUB3 | |||
| Glyma.02g112500, Glyma.10g257700 | mRNA export factor and BUB3 | |||
| Glyma.18G241900 | Gm-1i-M-SF | Glyma.02g187100, Glyma.10g106100 | Translation initiation factor 3 subunit E | |
| Glyma.09G250600 | Gm-1i-M-SF | Glyma.09g227200, Glyma.12g009600 | Large subunit ribosomal protein L1 | |
| Glyma.10g055200,Glyma.13g142300 | C2H2-Zinc finger (Gm-1i-M-SF) | |||
| Glyma.13G333200 | MYB139 | |||
| Glyma.15g101700 | F-box protein | |||
| Glyma.18g012400 | TCP-1/cpn60 chaperonin family | |||
| Glyma.04G044900 | Gm-2i-Q-SF | Glyma.07g023300,Glyma.08g218600, Glyma.13g370100, Glyma.15g003300 | WRKY56, WRKY78, WRKY transcription factor 40- related | |
| Glyma.06G045400 (SCTF-1) | Gm-2i-Q-SF | Glyma.02g138800, Glyma.07g206200,Glyma.12g073000, Glyma.U021800 | MAPK2, MAPK1, MAPK | |
| Glyma.14G088300 | Gm-2i-Q-SF | Glyma.01g146600,Glyma.09g194200, | cGMP-dependent protein kinase | |
| Glyma.17G236200 (SCOF-1) | Gm-2i-Q-SF | Glyma.07g023300,Glyma.08g218600, Glyma.13g370100, Glyma.15g003300 | WRKY56, WRKY78, WRKY transcription factor 40- related | |
| Glyma.12g073000, Glyma.07g206200, Glyma.U021800 | MAPK1, MAPK | |||
| Glyma.01g146600, Glyma.09g194200 | cGMP-dependent protein kinase | |||
| Glyma.10g013300 | BZIP59 | |||
| Glyma.10G257900 | Gm-2i-Q-SF | Glyma.01g146600, Glyma.09g194200 | cGMP-dependent protein kinase | |
| Glyma.20G133200 | Gm-2i-Q-SF | Glyma.15g154300, Glyma.17g024700, Glyma.19g234600, Glyma.20g234600 | 5′-AMP-activated protein kinase | |
| Glyma.05g189400, Glyma.08g147000 | Ran GTPase-activating protein | |||
| Glyma.12g179000 | C2H2-Zinc finger (Gm-t1-SF) | |||
| Glyma.12G059500 | Gm-1i-Q-SF | Glyma.18G124800 | Whirly transcription factor | |
| Glyma.11G135800 | Gm-1i-Q-SF | |||
| Glyma.10G184500 | Gm-1i-Q-SF | No | ||
| Glyma.07G135800 (GmRSD2) | Gm-1i-Q-SF | No | ||
| Glyma.20G206000 | Gm-1i-Q-SF | No | ||
| Glyma.18G185900 (GmRSD1) | Gm-1i-Q-SF | No | ||
| Glyma.10G034700 | Gm-1i-Q-SF | No | ||
| Glyma.13G121000 | Gm-1i-Q-SF | No |
Figure 8The transcriptional regulatory networks of 12 symbiosis-related soybean C2H2-TFs. These transcriptional regulatory networks were constructed by Cytoscape software. The red rhombuses represent C2H2-TFs, the green circles pointed by arrows represent the regulated downstream genes, blue triangle pointed by arrows represent the upstream genes or bridge genes, the red circle represents the co-regulated genes, and the gray ring arrow represents the self-regulation of C2H2-TFs. Besides, the solid arrows indicate the up-regulation genes, and the dotted arrows indicate the down-regulation genes.