| Literature DB >> 30674899 |
Muhammad Bilal Sarwar1,2, Zarnab Ahmad1, Bushra Rashid3, Sameera Hassan1, Per L Gregersen2, Maria De la O Leyva2, Istvan Nagy2, Torben Asp2, Tayyab Husnain1.
Abstract
Agave, monocotyledonous succulent plants, is endemic to arid regions of North America, exhibiting exceptional tolerance to their xeric environments. They employ various strategies to overcome environmental constraints, such as crassulacean acid metabolism, wax depositions, and protective leaf morphology. Genomic resources of Agave species have received little attention irrespective of their cultural, economic and ecological importance, which so far prevented the understanding of the molecular bases underlying their adaptations to the arid environment. In this study, we aimed to elucidate molecular mechanism(s) using transcriptome sequencing of A. sisalana. A de novo approach was applied to assemble paired-end reads. The expression study unveiled 3,095 differentially expressed unigenes between well-irrigated and drought-stressed leaf samples. Gene ontology and KEGG analysis specified a significant number of abiotic stress responsive genes and pathways involved in processes like hormonal responses, antioxidant activity, response to stress stimuli, wax biosynthesis, and ROS metabolism. We also identified transcripts belonging to several families harboring important drought-responsive genes. Our study provides the first insight into the genomic structure of A. sisalana underlying adaptations to drought stress, thus providing diverse genetic resources for drought tolerance breeding research.Entities:
Mesh:
Year: 2019 PMID: 30674899 PMCID: PMC6344536 DOI: 10.1038/s41598-018-35891-6
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Numerical Summary of the Illumina generated raw reads and denovo assembly statistics.
| Contents | Control Library (C) | Drought Library (T) | Total |
|---|---|---|---|
|
| |||
| Number of clean reads | 152553060 | 124292730 | 276845790 |
| Total base pairs (bp) | 15407859060 | 12553565730 | 27961424790 |
| Q20 percentage (%) | 97.8% | 95.9% | 96.85% |
| N Percentage | 0 | 0 | 0 |
| GC percentage | 48.29% | 47.7% | 48.1% |
|
| |||
|
|
| ||
| Total number of sequences | 93141 | 67328 | |
| average length | 731 (bp) | 582 (bp) | |
| N50 | 1164 (bp) | 834 (bp) | |
| Min length | 201 (bp) | 201 (bp) | |
| Max length | 9304 (bp) | 9304 (bp) | |
Figure 1(A) Sequence length distribution of the transcripts and unigenes of the trinity generated de novo assembly driven out of the raw reads from the control and drought stress samples. (B) Graphical representation of the statistics of cleaned raw reads mapping back to the de novo assembled transcripts (RMBT). (C) Benchmarking Universal Single-Copy Orthologs (BUSCO) scores for assembly quality assessment. (D) Homology analysis of the non-redundant unigenes against the publically available databases.
Figure 2Clusters of orthologous group-based classification of all unigenes. The unigenes were aligned to the COG database (1e−5) to understand their possible protein function. In total 9307 unigenes were annotated and grouped into the 26 categories. The capital letter indicates the COG categories listed on the right while numeric represents the total number of unigenes in each category.
Figure 3Pathways classification into metabolism, organismal system, environmental information processing, cellular process and genetic processing major groups based on the KEGG analysis.
Figure 4Total genes occupied a proportion of the (A) transcription factors and (B) heat shock proteins families in the A. sisalana de novo assembled transcriptome.
Figure 5GO terms enrichment analysis of all the differentially expressed genes was performed by the AgriGO online tool. Percentage of genes that were associated with specific GO terms are shown on left side of the graph.
Figure 6(A) Total number of up and down-regulated transcription factors and their response to drought stress. Within the red bar and blue colors indicating the up-regulated and down-regulated genes respectively. (B) Heat Shock Protein families response to the drought stress.
Statistics of SSRs identified in Agave sisalana.
| SSR mining | |
|---|---|
| Total number of sequences examined | 67,328 |
| Total size of examined sequences (bp) | 39203184 |
| Total number of identified SSRs | 13375 |
| Number of SSR containing sequences | 10729 |
| Number of sequences containing more than one SSR | 2108 |
| Number of SSRs present in compound formation | 1096 |
|
| |
| Mono-nucleotide | 5318 (39.7%) |
| Di-nucleotide | 4347 (32.5%) |
| Tri-nucleotide | 3544 (26.4%) |
| Tetra-nucleotide | 97 (0.72%) |
| Penta-nucleotide | 37 (0.28%) |
| Hexa-nucleotide | 32 (0.24%) |
Statistics of identified SNPs.
| Number of SNP | |
|---|---|
| Transition | |
| A<−>G | 10143 (28.9%) |
| C<−>T | 9962 (28.4%) |
| Total | 20105 (57.3%) |
| Transversion | |
| T<−>G | 3817 (10.88%) |
| C<−>G | 3236 (9.23%) |
| A<−>T | 4239 (12.09%) |
| A<−>C | 3662 (10.4%) |
| Total | 14954 (42.6%) |
| Total | 35059 (100%) |
Figure 7RNA-Seq differentially expressed genes data validation by quantitative real-time PCR (qRT-PCR). (Supplementary File 6 for additional information).