| Literature DB >> 27821048 |
Taylor P Frazier1, Nathan A Palmer2, Fuliang Xie3, Christian M Tobias4, Teresa J Donze-Reiner5, Aureliano Bombarely1, Kevin L Childs6, Shengqiang Shu7, Jerry W Jenkins8, Jeremy Schmutz7,8, Baohong Zhang3, Gautam Sarath2, Bingyu Zhao9,10.
Abstract
BACKGROUND: Switchgrass (Panicum virgatum L.) is a warm-season perennial grass that can be used as a second generation bioenergy crop. However, foliar fungal pathogens, like switchgrass rust, have the potential to significantly reduce switchgrass biomass yield. Despite its importance as a prominent bioenergy crop, a genome-wide comprehensive analysis of NB-LRR disease resistance genes has yet to be performed in switchgrass.Entities:
Keywords: Biofuel; Disease resistance; Gene expression; NB-LRR; Panicum virgatum (switchgrass); RNA-seq; SNP
Mesh:
Year: 2016 PMID: 27821048 PMCID: PMC5100175 DOI: 10.1186/s12864-016-3201-5
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
RNA-seq analysis of ‘Alamo’ and ‘Dacotah’ used for variant detection and gene expression analysis. One sample of ‘Alamo’ and one sample of ‘Dacotah’ were sequenced at a deep coverage (36X and 20X, respectively) and were used for variant detection. Three biological replicates of ‘Alamo’ and ‘Dacotah’ were used for gene expression analysis and were sequenced between 3.7X and 5.2X coverage
| Sample | Sequencing | Initial reads | Trimmed reads | Mapped reads | % Reads mapped |
|---|---|---|---|---|---|
| Alamo | 101 bp-PE | 221,647,550 | 216,244,584 | 180,996,166 | 84.35 |
| Dacotah | 101 bp-PE | 125,257,298 | 121,958,884 | 99,283,683 | 82.03 |
| Alamo- 1 | 50 bp-PE | 56,036,894 | 54,211,026 | 52,494,171 | 96 |
| Alamo- 2 | 50 bp-PE | 45,931,626 | 44,044,084 | 42,657,586 | 96 |
| Alamo- 3 | 50 bp-PE | 58,312,466 | 55,472,698 | 53,590,453 | 96 |
| Dacotah- 1 | 50 bp-PE | 59,434,186 | 56,774,376 | 54,489,840 | 96 |
| Dacotah- 2 | 50 bp-PE | 48,408,374 | 46,266,102 | 44,525,787 | 96 |
| Dacotah- 3 | 50 bp-PE | 63,304,978 | 61,060,688 | 58,815,617 | 96 |
Fig. 1Numerical and structural representation of the 1011 switchgrass RGHs identified in this study. Four different protein structures were found for the 1011 switchgrass RGHs identified in this study. The coiled-coil (CC) domain is depicted in purple, the nucleotide binding (NB) domain in green, and the leucine rich repeat (LRR) region in yellow. The placement of the domains below does not reflect accurate molecular distances
Fig. 2Chromosomal distribution of the 1011 switchgrass RGHs identified in this study. The switchgrass A and B sub-genomes are marked as black and white, respectively. An additional 500 genes were found on unanchored contigs (data not shown)
Unique domains that were identified in 40 switchgrass NB or NB-LRR proteins
| Category | Unique domain | Protein ID | E-value | Structure |
|---|---|---|---|---|
| Protein modification | Glutaredoxin | Pavir.Bb01048.1.p | 9.80E-08 | NB-LRR-Glutaredoxin |
| Thioredoxin | Pavir.Fa01782.1.p | 3.70E-22 | NB-LRR-Thioredoxin | |
| Pavir.Hb00484.1.p | 8.50E-21 | NB-LRR-Thioredoxin | ||
| Protein tyrosine kinase | Pavir.Ba02898.1.p | 1.50E-26 | NB-LRR-PK | |
| Serine/threonine phosphatases, family 2C, catalytic domain | Pavir.J24356.1.p | 2.90E-14 | NB-LRR-PP2C | |
| Serine/Threonine protein kinases, catalytic domain | Pavir.Aa03320.1.p | 2.30E-13 | PK-NB-LRR | |
| Pavir.Bb00833.1.p | 3.20E-30 | PK-NB | ||
| Pavir.Ha00561.1.p | 4.10E-26 | PK-NB-LRR | ||
| Pavir.Ha00691.1.p | 2.10E-97 | NB-LRR-PK | ||
| Pavir.Ha01101.1.p | 2.40E-23 | PK-NB-LRR | ||
| Pavir.Ha01108.1.p | 1.70E-25 | PK-NB-LRR | ||
| Pavir.Ha01246.1.p | 4.00E-21 | PK-NB-LRR | ||
| Pavir.Hb00345.1.p | 7.80E-16 | PK-NB-LRR | ||
| Pavir.Hb00951.1.p | 6.20E-28 | PK-NB | ||
| Pavir.Hb01026.1.p | 3.10E-22 | PK-NB | ||
| Pavir.Hb01049.1.p | 8.40E-13 | PK-NB | ||
| Pavir.Hb01188.1.p | 1.40E-28 | PK-NB | ||
| Pavir.J00375.1.p | 2.40E-25 | PK-NB-LRR | ||
| Pavir.J01180.1.p | 4.60E-27 | PK-NB-LRR | ||
| Pavir.J16540.1.p | 1.60E-23 | PK-NB-LRR | ||
| Pavir.J23132.1.p | 7.30E-19 | PK-NB | ||
| Pavir.J36755.1.p | 3.30E-12 | PK-NB | ||
| Pavir.J37596.1.p | 1.30E-14 | PK-NB | ||
| Pavir.J39176.1.p | 4.00E-14 | PK-NB | ||
| DNA binding, transcription | B3 DNA binding domain | Pavir.Ba02315.1.p | 2.20E-10 | B3-NB-LRR |
| B3 DNA binding domain | Pavir.J20878.1.p | 3.10E-07 | B3-NB-LRR-WRKY | |
| WRKY DNA binding domain | 1.00E-14 | |||
| BED zinc finger | Pavir.Fa02339.1.p | 5.40E-09 | ZF-NB-LRR | |
| Pavir.Ga00028.1.p | 5.40E-09 | ZF-NB-LRR | ||
| Pavir.Gb00931.1.p | 1.70E-08 | ZF-NB-LRR | ||
| Pavir.J19380.1.p | 2.50E-12 | ZF-NB-LRR | ||
| Pavir.J40131.1.p | 7.80E-08 | ZF-NB | ||
| WRKY DNA-binding domain | Pavir.J33941.1.p | 1.80E-14 | NB-LRR-WRKY | |
| protein trafficking and vesicle movement | MSP (Major sperm protein) domain | Pavir.Ib02384.1.p | 1.10E-09 | MSP-NB-LRR |
| Pavir.Ib02433.1.p | 1.50E-08 | MSP-NB-LRR | ||
| Pavir.J18369.1.p | 5.10E-13 | MSP-NB | ||
| Protein-protein interaction | WD domain, G-beta repeat | Pavir.J03445.1.p | 5.00E-06 | NB-LRR-WD |
| hAT family C-terminal dimerisation region | Pavir.Fb01504.1.p | 3.50E-08 | NB-LRR-hAT | |
| Sugar binding | Jacalin-like lectin domain | Pavir.Hb01174.1.p | 5.60E-11 | NB-LRR-Jacalin |
| Signal transduction | Calmodulin binding protein-like | Pavir.Hb00190.1.p | 5.10E-79 | NB-LRR-Calmodulin |
| Transposon element | gag-polypeptide of LTR copia-type | Pavir.Aa01444.1.p | 8.90E-07 | NB-LRR-LTR |
Fig. 3Skylign output of the NB region for the 600 switchgrass sequences that contained a full-length NB domain. Four highly conserved motifs were identified in these sequences: the Walker-A/P-loop motif, the Kinase 2 motif, the Kinase 3/RNBS-B motif, and the GLPL motif. The height of each letter within a stack corresponds to the frequency and conservation of that letter at that position
Fig. 4Phylogenetic analysis of 578 switchgrass RGHs and 116 Brachypodium distachyon RGHs. The phylogenetic tree was built using MEGA 6 with default settings. Branches for the Brachypodium RGHs are highlighted in blue and branches for switchgrass RGHs located on chromosome 8 are highlighted in red. The green dots represent the phylogenetic location for the 10 protein kinase-containing NB-LRR genes that are found on chromosome 8. Numbers on the tree indicate branches with bootstrap values greater than 50
Fig. 5Validation of SNPs identified by RNA-sequencing using allele-specific primers. Pictured are the PCR products using ‘Dacotah’-specific primers where a = ‘Alamo’ DNA and d = ‘Dacotah’ DNA. 1) 1 kb ladder, 2) Pavir.Ba03659, 3) Pavir.Ba02315, 4) Pavir.Hb01688, 5) Pavir.Hb00487, 6) 1 kb ladder, 7) ELF1α
Significant Differentially Expressed RGHs between ‘Alamo’ and ‘Dacotah’
| Gene ID | Chromosome | Gene start location | Gene end location | Alamo RPKMameans | Dacotah RPKMameans | Fold changeb | FDR corrected |
|---|---|---|---|---|---|---|---|
| Pavir.Ha01577 | Chr08a | 45819137 | 45820543 | 0.074 | 11.565 | 54.837 | 6.03E-08 |
| Pavir.Hb00191 | Chr08b | 3683280 | 3690800 | 22.965 | 4.363 | −5.724 | 1.02E-07 |
| Pavir.J41099 | contig99714 | 31 | 3755 | 10.721 | 0.387 | −23.222 | 6.52E-07 |
| Pavir.Ha00254 | Chr08a | 5275668 | 5283177 | 8.387 | 0.088 | −43.065 | 2.74E-06 |
| Pavir.Ea02999 | Chr05a | 52240105 | 52245554 | 7.763 | 0.145 | −31.66 | 9.51E-06 |
| Pavir.J18698 | contig20439 | 2509 | 7946 | 0.019 | 10.273 | 68.028 | 1.57E-05 |
| Pavir.J36183 | contig58243 | 960 | 5090 | 7.474 | 0.014 | −57.923 | 1.77E-05 |
| Pavir.Ha00258 | Chr08a | 5331659 | 5334691 | 12.753 | 0.617 | −18.44 | 3.17E-05 |
| Pavir.Ba03626 | Chr02a | 73291071 | 73293182 | 0.139 | 8.63 | 30.634 | 5.28E-05 |
| Pavir.J38310 | contig74920 | 865 | 4419 | 0.377 | 8.784 | 16.208 | 5.28E-05 |
| Pavir.J13560 | contig156657 | 131 | 2675 | 0.565 | 8.893 | 11.899 | 1.71E-04 |
| Pavir.J35611 | contig54273 | 2061 | 5449 | 0.195 | 5.96 | 17.525 | 4.86E-03 |
| Pavir.J01215 | contig01325 | 2932 | 5562 | 7.052 | 1.095 | −6.492 | 5.22E-03 |
| Pavir.J03054 | contig03762 | 12948 | 18535 | 4.087 | 0.096 | −20.509 | 7.55E-03 |
| Pavir.Aa01110 | Chr01a | 14217597 | 14230708 | 3.764 | 0.03 | −26.75 | 0.013 |
| Pavir.Ha01146 | Chr08a | 31223571 | 31227884 | 0.703 | 5.777 | 6.478 | 0.019 |
| Pavir.Cb00441 | Chr03b | 7318929 | 7322714 | 11.648 | 26.964 | 2.068 | 0.034 |
| Pavir.Bb00035 | Chr02b | 557471 | 559270 | 0.054 | 3.333 | 18.041 | 0.041 |
| Pavir.Ha00828 | Chr08a | 23240515 | 23246499 | 3.354 | 0.02 | −25.367 | 0.041 |
| Pavir.J29227 | contig33532 | 301 | 4214 | 3.316 | 0.035 | −22.844 | 0.041 |
| Pavir.J02637 | contig03134 | 4596 | 14505 | 4.903 | 0.772 | −6.169 | 0.048 |
a RPKM reads per kilobase per million of reads mapped
bFold Change = expression level difference between the two samples when using ‘Dacotah’ as the reference sample
3FDR corrected p-value = p-value that the Fold Change is significant based on the corrected False Discovery Rate
Fig. 6WGCNA analysis of developmental gene expression for 755 RGHs in the switchgrass cv. Summer. Shown are eight modules (a-h) that correspond to the expression patterns of 755 RGHs identified in this study over five sampling time points. The line represents the overall expression pattern of each co-expression module. The number of genes included in each module is represented by n