| Literature DB >> 29162860 |
Rubab Zahra Naqvi1,2,3, Syed Shan-E-Ali Zaidi1,2,3,4, Khalid Pervaiz Akhtar5, Susan Strickler3, Melkamu Woldemariam3, Bharat Mishra6, M Shahid Mukhtar6, Brian E Scheffler7, Jodi A Scheffler8, Georg Jander3, Lukas A Mueller3, Muhammad Asif1, Shahid Mansoor9.
Abstract
Cotton leaf curl disease (CLCuD), caused by cotton leaf curl viruses (CLCuVs), is among the most devastating diseases in cotton. While the widely cultivated cotton species Gossypium hirsutum is generally susceptible, the diploid species G. arboreum is a natural source for resistance against CLCuD. However, the influence of CLCuD on the G. arboreum transcriptome and the interaction of CLCuD with G. arboreum remains to be elucidated. Here we have used an RNA-Seq based study to analyze differential gene expression in G. arboreum under CLCuD infestation. G. arboreum plants were infested by graft inoculation using a CLCuD infected scion of G. hirsutum. CLCuD infested asymptomatic and symptomatic plants were analyzed with RNA-seq using an Illumina HiSeq. 2500. Data analysis revealed 1062 differentially expressed genes (DEGs) in G. arboreum. We selected 17 genes for qPCR to validate RNA-Seq data. We identified several genes involved in disease resistance and pathogen defense. Furthermore, a weighted gene co-expression network was constructed from the RNA-Seq dataset that indicated 50 hub genes, most of which are involved in transport processes and might have a role in the defense response of G. arboreum against CLCuD. This fundamental study will improve the understanding of virus-host interaction and identification of important genes involved in G. arboreum tolerance against CLCuD.Entities:
Mesh:
Substances:
Year: 2017 PMID: 29162860 PMCID: PMC5698292 DOI: 10.1038/s41598-017-15963-9
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Experimental design, methodology and RNA-Seq pipeline used in study. Panel A and B show the graphical representation of grafting experiments (A) scion of cotton leaf curl disease (CLCuD) infected Gossypium hirsutum was used on a rootstock of G arboreum; asymptomatic leaves of CLCuD were collected for RNA extraction and sequencing. (B) Leaves showing symptoms of CLCuD were collected for RNA extraction and sequencing. (C) Shows an asymptomatic CLCuD free leaf of G. arboreum. (D) Shows a leaf with very mild symptoms of CLCuD including vein swelling and darkening, highlighted with black arrows. (E) Shows the workflow of the RNA-Seq experiment and the tools used at each step.
Summary of RNA-Seq runs used in this study.
| Sample | Condition | No. of Reads | % Mapping |
|---|---|---|---|
| RZN7 | Grafted - CLCuD symptomatic | 11304210 | 76.25 |
| RZN8 | Grafted - CLCuD symptomatic | 9833891 | 66.97 |
| RZN9 | Grafted - CLCuD symptomatic | 10201621 | 62.17 |
| RZN10 | Grafted - asymptomatic | 10049099 | 69.77 |
| RZN11 | Grafted - asymptomatic | 10941016 | 68.2 |
| RZN12 | Grafted - asymptomatic | 9667217 | 77.69 |
Figure 2RNA-Seq data quality check and efficiency of mapping with G. arboreum genome. Panel A shows the raw sequencing reads compared the mapped reads where the y axis indicates the number of reads and x axis indicates the samples used in study. Panel B, C, D and E show the amount of data among replicates in terms of log10_(FPKM), gene dispersion, density and differentially expressed transcripts in the dataset. q1 and q2 represent the two conditions of symptomatic and asymptomatic G. arboreum respectively.
Expression levels of housekeeping control genes in G. arboretum.
| No | Function | Gene_Id | log2FC | significant |
|---|---|---|---|---|
| 1 | Clathrin adaptor complexes medium subunit family protein | Cotton_A_09164_BGI-A2_v1.0 | −0.363142 | no |
| 2 | Catalytic subunit of protein phosphatase 2 A | Cotton_A_09192_BGI-A2_v1.0 | 0.619919 | no |
| 3 | F-box family | Cotton_A_12507_BGI-A2_v1.0 | 0.510816 | no |
| 4 | Betatubulin | Cotton_A_14308_BGI-A2_v1.0 | −0.7817 | no |
| 5 | Elongation factor | Cotton_A_23419_BGI-A2_v1.0 | −0.452078 | no |
| 6 | Glyceraldehyde-3-phosphate dehydrogenase C-2 | Cotton_A_31637_BGI-A2_v1.0 | −0.539749 | no |
| 7 | Ubiquitin | Cotton_A_32873_BGI-A2_v1.0 | 0.0623818 | no |
| 8 | Actin | Cotton_A_38366_BGI-A2_v1.0 | −0.395614 | no |
Figure 3Hierarchical clustering of differentially expressed genes. A heat map of G. arboreum differentially expressed genes in response to cotton leaf curl disease with respect to hierarchical clustering. Log10 expression values were used for the analysis and negative values were set to zero. Clustering and the heat map were performed using heatmap 2.0 package in R.
Figure 4Validation of gene expression with quantitative RT-PCR. (A) Heat map of 17 selected DEGs for qRT-PCR in G. arboreum in response to cotton leaf curl disease with respect to hierarchical clustering. Log10 expression values were used for the analysis and negative values were set to zero. Clustering and the heat map were performed using heatmap 2.0 package in R. (B) Quantitative RT-PCR was used to measure the relative expression levels of seventeen pathogen resistance related genes with 18 S as an internal reference. Values were expressed as fold changes of transcript levels in the CLCuD infested symptomatic leaf samples with respect to the transcript levels in CLCuD infested asymptomatic leaf samples. Error bars represented standard error (SE) of three biological replicates.
Selected differentially expressed genes for qRT-PCR and their probable functions in plant pathogen defense.
| Gene | Probable role in defense | References |
|---|---|---|
| Probable zinc metallopeptidase EGY3 | Development and stress response |
[ |
| Defensin-like protein 1 | Bacterial and fungal pathogens as well as herbivorous insects |
[ |
| Phytosulfokines 3 | Pattern-triggered immunity against pathogens, Leucine Rich Repeat family (LRR) |
[ |
| Chaperone protein DnaJ | Pathogen defense, antiviral defense |
[ |
| Transcription factor UNE10 | Antiviral defense |
[ |
| Protein REVEILLE 7 | Plant growth, stress, pathogen response |
[ |
| Small heat shock protein C2 | Antiviral and antibacterial stress response |
[ |
| Transcription factor RADIALIS | Myb encoding genes, plant defense response |
[ |
| Putative disease resistance protein RGA3 | R-gene mediated pathogen and disease response |
[ |
| Bidirectional sugar transporter SWEET17 | Abiotic stress tolerance, pathogenesis related protein |
[ |
| Protein ECERIFERUM 1 | Biotic and abiotic stresses |
[ |
| Probable aquaporin PIP2-2 | Biotic and abiotic stresses, plant immunity |
[ |
| Potassium channel AKT2/3 | Plant development, stress responses, antiviral defense |
[ |
| 60 S ribosomal protein L39-3 | Pathogen and disease resistance, antiviral defense |
[ |
| Boron transporter 4 | R-gene mediated viral defense |
[ |
| Vignain | Plant immunity, pathogenesis and plant defense |
[ |
| Nitrate reductase [NADH] | Pathogen signal-induced NO production |
[ |
Figure 5Weighted gene co-expression network construction and analysis for CLCuD-responsive cotton genes in CLCuD symptomatic and asymptomatic leaves. (A) Transcriptomic data from symptomatic and asymptomatic leaves was analyzed by WGCNA using 468 DEGs. Genes were clustered as per expression arrangements characterized by the dendrogram and topological overlap mapping metric (TOM) heat map. Each line of the dendrogram corresponds to a gene. Clusters of similarly-regulated genes are grouped as modules by corresponding color (blue and turquoise) with a threshold minimum module size of 70 genes. The intensity of pink coloring in heatmap specifies high strength and green as no strength of correlation between pairs of genes on a direct scale and features hierarchical clustering dendrograms possessing a range of weighted correlations. (B) Weighted network illustrates correlations (edges) among the nodes (genes) with a weighted correlation threshold of ≥0.85. The network is composed of 468 connections and 348 genes organized in two different modules. The node color corresponds to modules identified via WGCNA. Nodes with high connectivity ( ≥5 connections, hubs) among different modules are indicated with increased node size. (C) Highly connected nodes (hubs) within the co-expression network are shown. Average degree of turquoise and blue modules is not changed in the entire co-expression network (2.689655172). (D) Clustering coefficient (degree to which a node is connected in a neighborhood) for genes within each node is illustrated in a box plot. Turquoise module displays significantly a higher average clustering coefficient than whole co-expression network (0.443746579). (E) The information centrality (the flow of information between any two nodes in a connected network) for the largest component of co-expression network (92 nodes) is presented. Turquoise module displays significantly increased information centrality compared to the entire network (0.005299235). (F) Interactive graph of GO terms associated with cotton leaf curl disease responsive G. arboreum hub genes identified by co-expression network analysis. Analysis performed with online tool agriGO (bioinfo.cau.edu.cn/agriGO/) where a key indicates significance levels of GO terms.