| Literature DB >> 28523007 |
Yongqun Zhu1,2, Xia Wang1, Linkai Huang1, Chaowen Lin2, Xinquan Zhang1, Wenzhi Xu2, Jianhua Peng3, Zhou Li1, Haidong Yan1, Fuxiang Luo2, Xie Wang2, Li Yao2, Dandan Peng2.
Abstract
Sudan grass (Sorghum sudanense) is an annual warm-season gramineous forage grass that is widely used as pasture, hay, and silage. However, drought stress severely impacts its yield, and there is limited information about the mechanisms of drought tolerance in Sudan grass. In this study, we used next-generation sequencing to identify differentially expressed genes (DEGs) in the Sudan grass variety Wulate No.1, and we developed simple sequence repeat (SSR) markers associated with drought stress. From 852,543,826 raw reads, nearly 816,854,366 clean reads were identified and used for analysis. A total of 80,686 unigenes were obtained via de novo assembly of the clean reads including 45,065 unigenes (55.9%) that were identified as coding sequences (CDSs). According to Gene Ontology analysis, 31,444 unigenes were annotated, 11,778 unigenes were identified to 25 categories in the clusters of orthologous groups of proteins (KOG) classification, and 11,223 unigenes were assigned to 280 Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways. Additionally, there were 2,329 DEGs under a short-term of 25% polyethylene glycol (PEG) treatment, while 5,101 DEGs were identified under the long-term of 25% PEG treatment. DEGs were enriched in pathways of carbon fixation in photosynthetic organisms and plant hormone signal transduction which played a leading role in short-term of drought stress. However, DEGs were mainly enriched in pathway of plant hormone signal transduction that played an important role under long-term of drought stress. To increase accuracy, we excluded all the DEGs of all controls, specifically, five DEGs that were associated with high PEG concentrations were found through RNA-Seq. All five genes were up-regulated under drought stress, but the functions of the genes remain unclear. In addition, we identified 17,548 SSRs obtained from 80,686 unigenes. The newly identified drought tolerance DEGs will contribute to transgenic breeding efforts, while SSRs developed from high-throughput transcriptome data will facilitate marker-assisted selection for all traits in Sudan grass.Entities:
Keywords: PEG; Sudan grass; differentially expressed genes; next-generation sequencing; simple sequence repeat markers
Year: 2017 PMID: 28523007 PMCID: PMC5415614 DOI: 10.3389/fpls.2017.00687
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Figure 1RNA validation of Sudan grass samples by 1% agarose gel electrophoresis. M represents the marker, lanes 1–3 correspond to the three replicates of Wulate No.1 at 0 day in 0% PEG, lanes 4–6 correspond to the three replicates of Wulate No.1 at 3 day in 0% PEG, lanes 7–9 correspond to the three replicates of Wulate No.1 at 3 day in 25% PEG, lanes 10–12 correspond to the three replicates of Wulate No.1 at 6 day in 0% PEG, and lanes 13–15 correspond to the three replicates of Wulate No.1 at 6 day in 25% PEG.
Figure 2Sudan grass RNA sample quality. The labeling scheme for each of the Wulate No.1 RNA samples is as follows: L-0d-1 through L-0d-3, the three replicates at day 0 in 0% PEG; L-3d(ck)-1 through L-3d(ck)-3, the three replicates at day 3 in 0% PEG; L-3d(T)-1 through L-3d(T)-3, the three replicates at day 3 in 25% PEG; L-6d(ck)-1 through L-6d(ck)-3, the three replicates at day 6 in 0% PEG; and L-6d(T)-1 through L-6d(T)-3, the three replicates at day 6 in 25% PEG.
Summary statistics of the Sudan grass transcriptome assemblies.
| L_0_1 | 168,623,898 | 165,220,518 |
| L_3_1 | 173,980,764 | 169,869,984 |
| L_3_2 | 171,496,394 | 163,559,358 |
| L_6_1 | 183,220,324 | 173,103,162 |
| L_6_2 | 155,222,446 | 145,101,344 |
| Total | 852,543,826 | 816,854,366 |
L_0_1, L_3_1, and L_6_1 were non-stressed controls that were collected at 0, 3, and 6 day, respectively. L_3_2 and L_6_2 were drought-stressed treatments that were collected at 3 and 6 day, respectively (Each treatment had three replicates). The same below.
Figure 3Transcriptome and unigene length distribution of Sudan grass.
Unigene information annotated in different databases.
| Annotated in NR | 38,919 | 48.23 |
| Annotated in NT | 46,064 | 57.09 |
| Annotated in KO | 11,223 | 13.9 |
| Annotated in Swiss-Prot | 24,436 | 30.28 |
| Annotated in PFAM | 26,786 | 33.19 |
| Annotated in GO | 31,444 | 38.97 |
| Annotated in KOG | 11,778 | 14.59 |
| Annotated in all Databases | 5,892 | 7.3 |
| Annotated in at least one Database | 52,315 | 64.83 |
| Total Unigenes | 80,686 | 100 |
Figure 4Species distribution of all the unigene sequences.
Figure 5The length distribution of CDSs. BLAST results are on the left, while ESTScan results are on the right.
Figure 6Gene Ontology classification of assembled unigenes.
Figure 7Histogram presentation of clusters of orthologous groups.
Figure 8KEGG classification results: A, Cellular Processes; B, Environmental Information Processing; C, Genetic Information Processing; D, Metabolism; and E, Organismal Systems.
Differentially expressed genes across two categories of Sudan grass.
| L_0_1 vs. L_3_2 | 3,319 | 1,584 | 4,903 |
| L_3_1 vs. L_3_2 | 2,355 | 1,111 | 3,466 |
| L_0_1 vs. L_6_2 | 5,482 | 4,030 | 9,512 |
| L_6_1 vs. L_6_2 | 3,746 | 2,448 | 6,194 |
| L_3_2 vs. L_6_2 | 5 | 125 | 130 |
| L_0_1 vs. L_3_1 | 1,962 | 1,009 | 2,971 |
| L_0_1 vs. L_6_1 | 482 | 212 | 694 |
| L_3_1 vs. L_6_1 | 250 | 814 | 1,064 |
Differentially expressed genes across four categories of Sudan grass.
| L_0_1 vs. L_3_2 vs. L_3_1 vs. L_3_2 | 1,531 | 785 | 2,329 |
| L_0_1 vs. L_6_2 vs. L_6_1 vs. L_6_2 | 3,031 | 2,084 | 5,101 |
Figure 9KEGG pathways enriched for DEGs by the 3rd and 6th days. KEGG pathways enriched for DEGs by the 3rd day are on the top, while those enriched by the 6th day are in the bottom.
Figure 10Analysis of DEGs with drought stress in Sudan grass. The left one is the up-regulation of DEGs, the right one is the down-regulation of DEGs.
Functions of DEGs according to various databases.
| Gene Length | 612 bp | 1583 bp | 3697 bp | 2181 bp | 1110 bp |
| NR Description | Hypothetical protein | Hypothetical protein | Hypothetical protein | Phosphosulfolactate synthase-related protein | Hypothetical protein |
| NT Description | Hypothetical protein, mRNA | Zea mays phosphosulfolactate synthase-related protein | |||
| Swiss-Prot Description | Defensin-like protein | – | Uncharacterized AAA domain-containing protein | – | L-ascorbate oxidase |
| PFAM description | Gamma-thionin Family/Scorpion toxin-like domain | – | ABC transporter/Sigma-54 interaction domain/Magnesium chelatase, subunit ChlI/AAA domain (dynein-related subfamily)/Uncharacterized P-loop hydrolase UPF0079/Parvovirus non-structural protein NS1/ATPase family associated with various cellular activities (AAA)/K-Cl Co-transporter type 1 (KCC1)/Holliday junction DNA helicase ruvB N-terminus/TIP49 C-terminus/Guanylate kinase/AAA domain (Cdc48 subfamily)/Zeta toxin/ATPase family associated with various cellular activities (AAA)/PhoH-like protein/RNA helicase/IstB-like ATP binding protein | (2R)-phospho-3-sulfolactate synthase (ComA)/Small cytokines (intecrine/chemokine), interleukin-8 like | Multicopper oxidase/Multicopper oxidase/Multicopper oxidase |
| BP Description | Transport/defense response | Protein folding | DNA repair/DNA recombination/regulation of transcription, DNA-templated/threonylcarbamoyladenosine biosynthetic process/transport/photosynthesis/chlorophyll metabolic process/viral genome replication/ion transport/chlorophyll biosynthetic process | Signal transduction/chemotaxis/heat acclimation/coenzyme M biosynthetic process/immune response | Oxidation-reduction process |
| MF Description | Ion channel inhibitor activity | Unfolded protein binding/heat shock protein binding | ATP binding/transcription factor binding/nucleoside-triphosphatase activity/ATPase activity/RNA binding/magnesium chelatase activity/protein binding/four-way junction helicase activity/transporter activity/RNA helicase activity/DNA helicase activity/kinase activity | Catalytic activity/chemokine activity | Oxidoreductase activity/copper ion binding |
| CC Description | Plasma membrane/extracellular region/plant-type cell wall | – | Replication fork/membrane/magnesium chelatase complex/transcription factor complex/Holliday junction helicase complex | Extracellular region | Extracellular region/plant-type cell wall |
| KOG Description | – | – | AAA+-type ATPase | – | Multicopper oxidases |
SSR motifs.
| A/T | 6,166 | |
| C/G | 605 | |
| Total | 6,771 | 38.59 |
| AC/GT | 806 | |
| AG/CT | 1,571 | |
| AT/AT | 696 | |
| CG/CG | 293 | |
| Total | 3,366 | 19.18 |
| AAC/GTT | 140 | |
| AAG/CTT | 356 | |
| AAT/ATT | 124 | |
| ACC/GGT | 506 | |
| ACG/CGT | 741 | |
| ACT/AGT | 142 | |
| AGC/CTG | 1,015 | |
| AGG/CCT | 942 | |
| ATC/ATG | 204 | |
| CCG/CGG | 2,690 | |
| Total | 6,860 | 39.09 |
| Total | 474 | 2.7 |
| Total | 49 | 0.28 |
| Total | 28 | 0.16 |
Figure 11Venn diagrams of DEGs among 5 combinations. Up-regulated DEGs are on the left, while down-regulated DEGs are on the right.
Figure 12The distribution of SSR motifs in Sudan grass.
Characterization of 23 novel microsatellite loci for Sudan grass.
| 1 | GGTAGGAGACGGAGGTGAGT | GTCCTCCGATCTGCTAAGCC | 60 | 140 | (GCG)6 | 0.7608333 | 0.4577 | 0.3537 | 100.00 | |
| 2 | GTTTGCTTGTCCGCTAAGGC | TTGTCGTCGTCCGTTGTAGG | 60 | 197 | (ACC)6 | 0.6325 | 0.6567 | 0.4902 | 100.00 | |
| 3 | GCAACGGCAAGCAGAGTATG | TTTGCATGCCACAGACTTGC | 59 | 255 | (GCA)7 | 0.4908333 | 0.5555 | 0.4925 | 66.67 | |
| 4 | ACAGCTAGTTTGGGTGTCCG | CATGGCCGAGGTCAACTCTT | 59 | 140 | (GCT)6 | 0.6357143 | 0.5622 | 0.4607 | 100.00 | |
| 5 | TCACGCACTGCAGGGTTAAT | ACACGCGTATCAGTGTTCGT | 59 | 134 | (A)10 | 0.7336364 | 0.6645 | 0.1948 | 100.00 | |
| 6 | CCGCCAAAGAAGACAAAGGC | GTCCAGCTCCATCCTGAACC | 60 | 218 | (CGA)6 | 0.7140625 | 0.4848 | 0.4030 | 100.00 | |
| 7 | GTGACACGGATCTCACGGAG | GGTGTGGTGAAGAGGTCTGG | 60 | 192 | (TTG)5 | 0.3033333 | 0.6109 | 0.5281 | 33.33 | |
| 8 | GCGGGTGTTGATGATGCTTG | GCCAAAGCAAGAACGACAGG | 60 | 104 | (GCC)5 | 0.3442857 | 0.5156 | 0.5132 | 57.14 | |
| 9 | TGCTGAAACTTTGGAATGGCA | CCCCTTGCATGTTCAAGTGC | 58 | 154 | (T)11 | 0.5883333 | 0.6851 | 0.4850 | 83.33 | |
| 10 | CAATTTTGCCCTCCTTGCCC | CTTCCTCATCTCCTCCGCAC | 60 | 242 | (T)11 | 0.39075 | 0.5207 | 0.5045 | 50.00 | |
| 11 | AAGATCAAGAAGCCCCTGCC | ATCTGGTGGTACCTCGGGAA | 60 | 143 | (GCC)5 | 0.7382143 | 0.5887 | 0.3875 | 85.71 | |
| 12 | TGTGACTTTGAGCTCCGTCA | TGCTACCTGAACTCCCCCTT | 59 | 229 | (T)11 | 0.3525 | 0.5943 | 0.4448 | 66.67 | |
| 13 | TCCGAAATCCTTGTCCCTCT | TCGCCATCGATCAACTCTCG | 58 | 121 | (CAC)5 | 0.453 | 0.6227 | 0.5120 | 80.00 | |
| 14 | GAGCAGAGGAACCATGGCTT | TCGTCACGCAACAACTGTTC | 60 | 200 | (A)10 | 0.41875 | 0.5554 | 0.5169 | 50.00 | |
| 15 | AGGTCAGACGCCAAACTGTC | ATCAGCACGTACTTGGTCGG | 60 | 234 | (TTC)5 | 0.1642857 | 0.4127 | 0.4668 | 28.57 | |
| 16 | CCACATCCGTTCACCCTTCA | GGGTTTGGTTGCTCCGTTTC | 59 | 206 | (CT)6 | 0.57 | 0.5004 | 0.4693 | 100.00 | |
| 17 | CTCTCTCCCACTCTCTGCCT | AGGGAAGTGCATGCAAGCTA | 60 | 163 | (TCG)5 | 0.715 | 0.3902 | 0.3791 | 100.00 | |
| 18 | TCCTATGCGCACCACCAATT | GTGACATGATCCCTGGCCC | 60 | 248 | (GC)6 | 0.6275 | 0.5088 | 0.4187 | 66.67 | |
| 19 | GAGCATCGAGGAGCGTCTC | CTTCTGCGCTCGAAATGGAG | 60 | 183 | (C)11 | 0.23625 | 0.3183 | 0.4424 | 100.00 | |
| 20 | GGGGCAAAACTGCTGTTGTT | CCCACTTCCTGCTCAACCAG | 59 | 157 | (CTG)5 | 0.66375 | 0.6276 | 0.3963 | 75.00 | |
| 21 | CGTAATTTGGGCCGCCAATC | CGAACGAATCGCTGAGGTTT | 60 | 275 | (A)10 | 0.626875 | 0.6272 | 0.4561 | 75.00 | |
| 22 | CATAGTGCCGTAGCCCGTAG | CGGACGACTCTTGGACCAAA | 60 | 280 | (GGC)5 | 0.2615 | 0.5579 | 0.4745 | 80.00 | |
| 23 | GCAAGATGATATATGGAGTACGGC | TCCCCTTCCTAGAGCTGCTT | 59 | 140 | (A)10 | 0.8304167 | 0.6704 | 0.3114 | 100.00 |
Tm, optimal annealing temperature (°C); PIC, average polymorphism information content; I, Shannon's information index of diversity; H, Nei's gene diversity; and PPB, percentage of polymorphic bands.