Literature DB >> 26883051

Deep RNA-Seq profile reveals biodiversity, plant-microbe interactions and a large family of NBS-LRR resistance genes in walnut (Juglans regia) tissues.

Sandeep Chakraborty1, Monica Britton2, P J Martínez-García1, Abhaya M Dandekar3.   

Abstract

Deep RNA-Seq profiling, a revolutionary method used for quantifying transcriptional levels, often includes non-specific transcripts from other co-existing organisms in spite of stringent protocols. Using the recently published walnut genome sequence as a filter, we present a broad analysis of the RNA-Seq derived transcriptome profiles obtained from twenty different tissues to extract the biodiversity and possible plant-microbe interactions in the walnut ecosystem in California. Since the residual nature of the transcripts being analyzed does not provide sufficient information to identify the exact strain, inferences made are constrained to the genus level. The presence of the pathogenic oomycete Phytophthora was detected in the root through the presence of a glyceraldehyde-3-phosphate dehydrogenase. Cryptococcus, the causal agent of cryptococcosis, was found in the catkins and vegetative buds, corroborating previous work indicating that the plant surface supported the sexual cycle of this human pathogen. The RNA-Seq profile revealed several species of the endophytic nitrogen fixing Actinobacteria. Another bacterial species implicated in aerobic biodegradation of methyl tert-butyl ether (Methylibium petroleiphilum) is also found in the root. RNA encoding proteins from the pea aphid were found in the leaves and vegetative buds, while a serine protease from mosquito with significant homology to a female reproductive tract protease from Drosophila mojavensis in the vegetative bud suggests egg-laying activities. The comprehensive analysis of RNA-seq data present also unraveled detailed, tissue-specific information of ~400 transcripts encoded by the largest family of resistance (R) genes (NBS-LRR), which possibly rationalizes the resistance of the specific walnut plant to the pathogens detected. Thus, we elucidate the biodiversity and possible plant-microbe interactions in several walnut (Juglans regia) tissues in California using deep RNA-Seq profiling.

Entities:  

Keywords:  Aedes aegypti; Cryptococcus; Juglans regia; Phytophthora; RNA-Seq

Year:  2016        PMID: 26883051      PMCID: PMC4755957          DOI: 10.1186/s13568-016-0182-3

Source DB:  PubMed          Journal:  AMB Express        ISSN: 2191-0855            Impact factor:   3.298


Introduction

Rapid detection of pathogens in plants is becoming increasingly necessary to prevent loss of productivity and quality (Dandekar et al. 2010; Fletcher et al. 2006). The wide variety of diseases and pathogens necessitates a broad detection system (Asiatic citrus canker: Xanthomonas axonopodis, sudden oak death: Phytophthora ramorum, Pierce's disease of grapevine: Xylella fastidiosa, etc.). Traditionally, real-time PCR has been used extensively for plant disease diagnostics (Schaad and Frederick 2002). However, these diagnostic tools are biased, and can only detect pathogens with a known nucleic acid template. RNA-Seq, a high-throughput DNA sequencing method, has revolutionized the field of gene discovery (Wang et al. 2009; Flintoft 2008). RNA-Seq can detect transcripts with very low expression levels, in contrast to other traditional methods like RNA:DNA hybridization (Clark et al. 2002) and short sequence-based approaches (Kodzius et al. 2006). The RNA-Seq derived transcriptome with a selection protocol for polyadenylated mRNA from an organism with known genome enables detection of mRNA from extraneous eukaryotes like fungi and pests. Certain RNA-Seq protocols ensure that only polyadenylated mRNA is being analyzed, yet some bacterial mRNA does leak through in the analysis. Thus, this presents an unbiased method of diagnosing the presence of wide range of prokaryotic and eukaryotic organisms (Moretti et al. 2007; Janse 2010). Such a study can also guide downstream PCR diagnostics to determine the exact species/strain of a pathogen. We have recently used a RNA-Seq methodology to derive the transcriptome of walnut (Juglans regia) from twenty different tissues types with selection for polyadenylated mRNA in the course of obtaining the walnut genome sequence (WGS) (manuscript submitted). Firstly, we excluded transcripts that aligned to WGS and the E. coli genome. Expression counts enabled the determination of the localization, although the residual nature of the transcripts being analyzed did not provide sufficient information to identify the exact species/strain. Thus, inferences made were constrained to the genus. These counts were not normalized, since there were no comparisons of absolute or relative expression levels. Some non-polyadenylated bacterial mRNA leaked through the RNA-Seq analysis. We detected several well-known pathogens, fungi, endophytic bacteria, and pests. The detection of these pathogenic agents in an otherwise healthy plant can be ascribed to the presence and activity of resistance (R) genes that specifically recognize pathogens, which contain complementary avirulence genes (Staskawicz 2001). The oomycete Phytophthora, a pathogen responsible for destructive diseases in a wide variety of crop plants, was found localized in the root (Fletcher et al. 2006; Belisario et al. 2012; Nowicki et al. 2012). Although sequence homology indicated the presence of several species of Phytophthora (nicotianae, infestans, parasitica), the similarity among these strains did not allow for an exact enumeration of the individual species. For example, a glyceraldehyde-3-phosphate dehydrogenase with 97 % identity to a GAPDH from P. parasitica and P. infestans is an enzyme detected from this pathogen. Cryptococcosis in human and animals is caused by Cryptococcus neoformans and C. gattii, which has been exacerbated in recent times in immuno-compromised individuals (Mitchell and Perfect 1995). The plant surface is a conducive environment for the sexual cycle of Cryptococcus (Xue et al. 2007). Here, we detect prolyl-isomerases and ADP/ATP translocases from Cryptococcus present in catkins and in vegetative buds, corroborating these findings. Endophytic Actinobacteria are present extensively in the inner tissues of living plants, and are a source of important secondary metabolites related to the defense response, growth and environmental stress (Ventura et al. 2007). Based on the top BLAST score, we detected several species in the Actinobacteria phyla spread out across all tissues. Methylibium petroleiphilum, which is capable of using methyl tert-butyl ether as a sole source of carbon, was also found in the root (Nakatsu et al. 2006). The ribosomal L37 protein from the pest pea aphid was found in the leaves and vegetative buds. Interestingly, a serine protease from the mosquito (Kelleher and Markow 2009) with significant homology to a female reproductive tract protease from Drosophila mojavensis (Isoe et al. 2009) in the vegetative bud suggested egg-laying activities by these pests.

Materials and methods

RNA-Seq

Fifteen samples of walnut tissue (Table 1) were gathered from Chandler trees in the UC Davis field facilities located in Davis, California. Three additional samples were taken from Chandler plant material maintained in tissue culture. The root sample was taken from potted Chandler trees in the greenhouse/lath house. Several grams of leaf and root tissue from each plant were frozen in liquid nitrogen immediately after harvest and then transferred to a −80 °C freezer. RNA was isolated from each sample using the hot borate method (Wilkins and Smart 1996) followed by purification and DNAse treatment using an RNA/DNA Mini Kit (Qiagen, Valencia, CA) per the manufacturer's protocol. High quality RNA was confirmed by running an aliquot of each sample on an Experion Automated Electrophoresis System (Bio-Rad Laboratories, Hercules, CA). The cDNA libraries were constructed following the Illumina mRNA-sequencing sample preparation protocol (Illumina Inc., San Diego, CA). Final elution was performed with 16 µL RNase-free water. The quality of each library was determined using a BioRad Experion (BioRad, Hercules, CA). Each library was run as an independent lane on a Genome Analyzer II (Illumina, San Diego, CA) to generate paired-end sequences of 85 bp in length from each cDNA library. In total, over a billion reads were obtained. Prior to assembly, all reads underwent quality control for paired-end reads and trimming using Sickle (Joshi and Fass 2011). The minimum read length was 45 bp with a minimum Sanger quality score of 35. The quality controlled reads were de novo assembled with Trinity v2.0.6 (Grabherr et al. 2011). Standard parameters were used and the minimum contig length was 300 bp. Individual assemblies for each library and a combined assembly of all tissues were performed (Chakraborty et al. 2015).
Table 1

Walnut tissue sources used for RNAseq analysis

CodeTissue sourceSequence read archiveWalnut genotypeDevelopmental stageSourceNo of reads x106
VBVegetative budSRS523592ChandlerVegetativeOrchard39.9
LYLeaf—youngSRS523594ChandlerVegetativeOrchard63.3
RTRootSRS523799ChandlerVegetativeGreenhouse38.8
CICallus interiorSRS523805ChandlerVegetativeIn Vitro59.3
CECallus exteriorSRS523808ChandlerVegetativeIn Vitro29.8
FLPistillate flowerSRS523810ChandlerVegetativeOrchard69.8
CKCatkinsSRS523917ChandlerImmatureOrchard56.4
SESomatic embryoSRS523919ChandlerImmatureIn Vitro27.8
LMLeaf—matureSRS523921ChandlerVegetativeOrchard50.4
LELeavesSRS523922ChandlerVegetativeOrchard60.1
IFFruit immatureSRS523923MixedImmatureOrchard57.0
HLHull immatureSRS523924ChandlerImmatureOrchard115.8
PTPacking tissueSRS523925ChandlerImmatureOrchard62.8
HPHull peelSRS523926ChandlerMatureOrchard43.3
HCHull cortexSRS523927ChandlerMatureOrchard62.8
PKPacking tissueSRS523928ChandlerMatureOrchard56.7
PLPellicleSRS523929ChandlerMatureOrchard42.7
EMEmbryoSRS523930MixedMatureOrchard35.5
HUHull—dehiscingSRS523931ChandlerSenescentOrchard59.5
TZTransition woodSRS523933 J.nigra Transition zoneOrchard48.4
Total number of reads:1080.0
Walnut tissue sources used for RNAseq analysis

In silico analysis

The NCBI database (http://www.ncbi.nlm.nih.gov/taxonomy) provides several resources for the ‘curated classification and nomenclature of all of organisms in the public sequence databases. This currently represents about 10 % of the described species of life on the planet.’ There were ~111 k transcripts. ~5 k did not align to the walnut genome, and were removed (Chakraborty et al. 2015). Of these, ~4 k transcripts had significant homology to E. coli genomes. The remaining ~1 k transcripts were the subject of analysis in the current manuscript, under the assumption that they were derived from extraneous organisms, pathogens or commensal, inhabiting the twenty different tissues. The species names were derived from the best BLAST match to the ‘nt’ database. A bitscore (BLASTSCORE) cutoff of 150 was used (~E-value = 1E − 33). The numerical identifier was obtained from the species name using the site http://www.ncbi.nlm.nih.gov/Taxonomy/TaxIdentifier/tax identifier.cgi. For example, Arthrobacter has the tax ID 1663. These numerical IDs were then used to obtain the complete lineage. The first classification of all organisms was into Eukaryota or Bacteria. We used the second classification field to cluster the organisms discussed here. The expression counts are not normalized since we do not make any inferences on the absolute or relative abundance of the transcripts. The iterative gene finding method described in YeATS (Chakraborty et al. 2015) was used to identify the homologous set of nucleotide-binding site (NBS)-leucine-rich repeat (LRR) class of genes. A BLAST bitscore of 100 (E value ~1E − 20), with an increment of 20 for each iteration, was used as the homology threshold. The increment in each of the iterations ensures that the resultant proteins do not diverge far from the initially chosen protein. We have used 600 as a lower threshold for the length of NBS-LRR proteins. Additionally, we exclude transcripts with  % of leucine less than the 10 % frequency of leucine residues seen in plant proteomes. These transcripts are probably fragments which have not been assembled by Trinity (Chakraborty et al. 2015).

Results

As a result of the analysis of ~1 k transcripts obtained from 20 different tissues of walnut, different extraneous organisms, pathogens or commensal, were detected (Fig. 1). Several transcripts (N = 260) were associated with Phytophthora, mostly localized in the root (Fig. 2a). A sample of transcripts, and the putative proteins they encode, identified C43181_G1_I1 encoding a 293 nt long ORF with a predicted protein (molecular weight = 30 kDa) homologous to glyceraldehyde-3-phosphate dehydrogenase (GAPDH) (Tables 2, 3a). This GAPDH has 97 % identity to a GAPDH from P. parasitica and P. infestans, and a 96 % identity to the GAPDH from P. sojae (Fig. 3a). The 3D structure of PhyGAPDH1 was modeled using SWISSMODEL (Arnold et al. 2006). The structural superimposition of the PhyGAPDH1 to the structure of the human placental GAPDH (PDBid:1U8F, chain O) reveals the structural conservation of this gene across different species (Fig. 3b). In this study, the presence of Cryptococcus was also confirmed in the catkins and the vegetative buds (Table 3b). We identified a cyclophilin A (peptidyl-prolyl cis–trans isomerase) (Table 4a) associated with Cryptococcus. In addition, we detected several transcripts associated with the endophytic Actinobacteria (EndAct) in several tissues of walnut (Fig. 2b, Table 5). Most putative proteins from these transcripts have significant homologs in the BLAST’nr’ database, although most of them are uncharacterized. Other interesting results are the presence of Methylibium petroleiphilum in the roots (3442 cumulative counts, Fig. 2c), Acyrthosiphon pisum (or the pea aphid) with 1256 cumulative counts in the leaves (Fig. 2d) and Aedes aegypti (yellow fewer mosquito) in the vegetative bud of walnut, with a total of 428 cumulative counts of transcripts (Fig. 2e).
Fig. 1

Diseases, pests and pathogens affecting walnut: Images obtained from https://commons.wikimedia.org/wiki and used under the Creative Commons Attribution 2.5 Generic license

Fig. 2

Localization of transcripts. These shows the cumulative counts (not normalized) of transcripts assigned to each genus. a Phytophthora—localized in the root. b Actinobacteria—present in all tissues c Methylibium—localized in the root. d Acyrthosiphon—localized in early leaves and the vegetative bud. e Aedes aegypti—localized in the vegetative bud

Table 2

Proteins from Phytophthora

TranscriptDescriptionE-value
C43181_G1_I1XM_ 008893048.1 P. parasitica INRA-310 glyceraldehyde-3-phosphate dehydrogenase0
C30378_G1_I1XM_ 008895058.1 P. parasitica INRA-310 guanine nucleotide-binding protein0
C7548_G1_I1DQ057354.1 P. nicotianae manganese superoxide dismutase (MnSOD1a)0
C26754_G1_I1FJ493002.1 P. cinnamomi clone PC02 Ric1 protein mRNA, complete1.00E − 78
C49955_G2_I1XM_ 002997461.1 P. infestans T30-4 14-3-3 protein epsilon (PITG 19017)0
C62634_G1_I1XM_ 002903199.1 P. infestans T30-4 transcription factor BTF3-like protein0

Transcripts from the pathogenic oomycete Phytophthora with ORFs that have significant matches in the BLAST ‘nr’ database. There are several strains of P. (nicotianae, cinnamomi, infestans) that have the best matches to these transcripts. It is difficult to ascertain the exact species from these transcripts since some of these proteins have high conservation across many species

Table 3

Expression counts of selected transcripts

TranscriptCECICKEMFLHCHLHPHUIFLELMLYPKPLPTRTSETZVB
a
 C30378 _G1_I1104
 C7548 _G1_I14972
 C26754 _G1_I128
 C43181 _G1_I1326
 C49955_G2_I11211321214185
 C62634 _G1_I1123
 C442 G2 I124
 C03 _G1_I128
 C28542 _G1_I118214
b
 C9244 _G1_I18
 C87 _G1_I123
 C61284 _G1_I11214
 C498 _G1_I1212
 C6553 _G1_I18
 C35196 _G1_I12214210
 C50331 G1 I4212111237
 C34393 _G1_I14141163942245126842211215
c
 C50331_G1_I52131482226
 C58663 _G1_I12210
 C29801 _G1_I14216
 C30295 _G1_I1829
 C16207 _G1_I154
 C50331 G1 I221432214

These are raw counts, and are not normalized

a Phytophthora is the pathogen responsible for potato blight, which caused the Great Irish Famine (1845–1849). C30378_G1_I1 encodes a glyceraldehyde 3-phosphate dehydrogenase, an enzyme involved in glycolysis. Most transcripts from this oomycete are found localized in the root (Fig. 2a). b Cryptococcus is the causal agent of cryptococcosis, affecting immuno-compromised individuals. These fungi are mostly localized in the catkins and the vegetative buds. It was previously shown that the plant surface provides a conducive environment for the sexual cycle of these pathogens. c Pyrenophora: These fungi, responsible for the `tan spot’ disease in barley, are spread out throughout different tissues. d Acyrthosiphon: The pea aphid pest. C58762_G1_I1 encodes a 91 long ORF that has a 99 % identity match to the ribosomal protein L37 (Accession: NP 001129424.1). The low counts of this transcript are testimony to the ability of the RNAseq technology to accurately determine the sequence and localization of transcripts

Fig. 3

Glyceraldehyde-3-phosphate dehydrogenase (GAPDH) from Phytophthora: C43181_G1_I1 encodes a 293 bp long ORF with a predicted molecular weight of 30 kDa and has 97 % identity GAPDH from P. parasitica and P. infestans, and a 96 % identity to the GAPDH from P. sojae. Thus, although the presence of a pathogen from the Phytophthora genus is almost certain, it is not possible to determine the exact strain of this pathogen. The Phytophthora GAPDH also shares a 70 % identity with the GAPDH in human placenta. a Multiple sequence alignment of the GAPDHs obtained using ENDscript 2.x (Robert and Gouet 2014). b Structural superimposition of the C43181_G1_I1 GAPDH to the structure of the human placental GAPDH (PDBid:1U8F, chain O). The structure of C43181_G1_I1 GAPDH was modelled using SWISSMODEL

Table 4

Transcripts from fungi

TranscriptDescriptionE-value
a
 C44240_G2_I1KGB78973.1 ADP/ATP translocase [Cryptococcus gattii R265]3e-91
 C8003_G1_I1XM_571019.1 C. neoformans var. neoformans JEC21 60 s ribosomal protein3e-98
 C28542_G1_I1XP 003191240.1 40 s ribosomal protein s3ae-a (s1-a) [Cryptococcus gattii WM276]3e-91
 C9244_G1_I1XM_569612.1 C. neoformans var. neoformans JEC21 40S ribosomal protein3e-119
 C8207_G1_I1KGB74187.1 cyclophilin A, peptidyl-prolyl isomerase [Cryptococcus gattii R265]5e-82
 C61284_G1_I1KGB74302.1 60S ribosomal protein L10 [Cryptococcus gattii R265]1e-76
 C60498_G1_I1AFR92589.1 glutamine synthetase [Cryptococcus neoformans var. grubii H99]9e-51
b
 C6553_G1_I1XM_001932117.1 P. tritici-repentis Pt-1C-BFP subtilase-type proteinase6e-130
 C35196_G1_I1XM_001932926.1 P. tritici-repentis Pt-1C-BFP 60S ribosomal protein0.0
 C50331_G1_I4XM_001930633.1 P. tritici-repentis Pt-1C-BFP hypothetical protein9e-125
 C34393 _G1_I1XM_003296940.1 P. teres f. teres 0-1 ubiquitin-40S ribosomal protein1e-101
 C50331_G1_I5XM_001930633.1 P. tritici-repentis Pt-1C-BFP hypothetical protein1e-133
 C58663_G1_I1XM_001937272.1 P. tritici-repentis Pt-1C-BFP opsin-1, mRNA3e-69
 C29801_G1_I1XM_001936103.1 P. tritici-repentis Pt-1C-BFP conserved hypothetical1e-57
 C30295_G1_I1XM_003306030.1 P. teres f. teres 0-1 hypothetical protein, mRNA2e-70
 C16207_G1_I1XM_001935967.1 P. tritici-repentis Pt-1C-BFP conserved hypothetical7e-97
 C50331_G1_I2XM_001930633.1 P. tritici-repentis Pt-1C-BFP hypothetical protein1e-122

a Cryptococcus is the causal agent of the human/animal respiratory disease cryptococcosis. These fungi are mostly localized in the catkins and the vegetative bud in walnut, corroborating previous results indicating that the plant surface provides a conducive environment for their sexual cycle. Here, we see two different species: C. neoformans var. neoformans and C. gattii. b The ‘tan spot’ causing pathogenic fungi Pyrenophora had two different strains present: P. tritici-repentis and P. teres f. teres

Table 5

Species from Actinobacteria

Taxonomy nameIDLineage
Arthrobacter 1663Bacteria; Actinobacteria; Actinobacteria; Micrococcales; Micrococcaceae
Modestobacter 88138Bacteria; Actinobacteria; Actinobacteria; Geodermatophilales; Geodermatophilaceae
Microlunatus 29404Bacteria; Actinobacteria; Actinobacteria; Propionibacteriales; Propionibacteriaceae
Actinoplanes 1865Bacteria; Actinobacteria; Actinobacteria; Micromonosporales; Micromonosporaceae
Nakamurella 53460Bacteria; Actinobacteria; Actinobacteria; Nakamurellales; Nakamurellaceae
Streptomyces 1883Bacteria; Actinobacteria; Actinobacteria; Streptomycetales; Streptomycetaceae
Propionibacterium 1743Bacteria; Actinobacteria; Actinobacteria; Propionibacteriales; Propionibacteriaceae
Corynebacterium 1716Bacteria; Actinobacteria; Actinobacteria; Corynebacteriales; Corynebacteriaceae
Micrococcus 1269Bacteria; Actinobacteria; Actinobacteria; Micrococcales; Micrococcaceae

We obtain the taxonomy ID from http://www.ncbi.nlm.nih.gov/Taxonomy/TaxIdentier/tax_identifer.cgi using the taxonomy name, which is then used to get the lineage. The taxonomy names were obtained from best entry in a BLAST search to the ‘nt’ database

Diseases, pests and pathogens affecting walnut: Images obtained from https://commons.wikimedia.org/wiki and used under the Creative Commons Attribution 2.5 Generic license Localization of transcripts. These shows the cumulative counts (not normalized) of transcripts assigned to each genus. a Phytophthora—localized in the root. b Actinobacteria—present in all tissues c Methylibium—localized in the root. d Acyrthosiphon—localized in early leaves and the vegetative bud. e Aedes aegypti—localized in the vegetative bud Proteins from Phytophthora Transcripts from the pathogenic oomycete Phytophthora with ORFs that have significant matches in the BLAST ‘nr’ database. There are several strains of P. (nicotianae, cinnamomi, infestans) that have the best matches to these transcripts. It is difficult to ascertain the exact species from these transcripts since some of these proteins have high conservation across many species Expression counts of selected transcripts These are raw counts, and are not normalized a Phytophthora is the pathogen responsible for potato blight, which caused the Great Irish Famine (1845–1849). C30378_G1_I1 encodes a glyceraldehyde 3-phosphate dehydrogenase, an enzyme involved in glycolysis. Most transcripts from this oomycete are found localized in the root (Fig. 2a). b Cryptococcus is the causal agent of cryptococcosis, affecting immuno-compromised individuals. These fungi are mostly localized in the catkins and the vegetative buds. It was previously shown that the plant surface provides a conducive environment for the sexual cycle of these pathogens. c Pyrenophora: These fungi, responsible for the `tan spot’ disease in barley, are spread out throughout different tissues. d Acyrthosiphon: The pea aphid pest. C58762_G1_I1 encodes a 91 long ORF that has a 99 % identity match to the ribosomal protein L37 (Accession: NP 001129424.1). The low counts of this transcript are testimony to the ability of the RNAseq technology to accurately determine the sequence and localization of transcripts Glyceraldehyde-3-phosphate dehydrogenase (GAPDH) from Phytophthora: C43181_G1_I1 encodes a 293 bp long ORF with a predicted molecular weight of 30 kDa and has 97 % identity GAPDH from P. parasitica and P. infestans, and a 96 % identity to the GAPDH from P. sojae. Thus, although the presence of a pathogen from the Phytophthora genus is almost certain, it is not possible to determine the exact strain of this pathogen. The Phytophthora GAPDH also shares a 70 % identity with the GAPDH in human placenta. a Multiple sequence alignment of the GAPDHs obtained using ENDscript 2.x (Robert and Gouet 2014). b Structural superimposition of the C43181_G1_I1 GAPDH to the structure of the human placental GAPDH (PDBid:1U8F, chain O). The structure of C43181_G1_I1 GAPDH was modelled using SWISSMODEL Transcripts from fungi a Cryptococcus is the causal agent of the human/animal respiratory disease cryptococcosis. These fungi are mostly localized in the catkins and the vegetative bud in walnut, corroborating previous results indicating that the plant surface provides a conducive environment for their sexual cycle. Here, we see two different species: C. neoformans var. neoformans and C. gattii. b The ‘tan spot’ causing pathogenic fungi Pyrenophora had two different strains present: P. tritici-repentis and P. teres f. teres Species from Actinobacteria We obtain the taxonomy ID from http://www.ncbi.nlm.nih.gov/Taxonomy/TaxIdentier/tax_identifer.cgi using the taxonomy name, which is then used to get the lineage. The taxonomy names were obtained from best entry in a BLAST search to the ‘nt’ database

Discussion

High-throughput mRNA sequencing (RNA-Seq) has revolutionized the view of the profile of the transcriptome, enhancing gene discovery. Although some protocols are designed to view exclusively polyadenylated eukaryotic mRNA, prokaryotic mRNA can be surreptitiously included in the analysis, especially highly abundant transcripts like ribosomal proteins. This presents an opportunity to identify extraneous transcripts residing in various tissues, provided the genome of the organism is known. Expression counts are low due to the residual nature of the analysis. Yet, as we observed for pea aphid, very low counts were able to accurately identify the L37 ribosomal protein, which shares 88 % identity with the L37 protein from DrosophilaMelanogaster (Anger et al. 2013).

Phytophthora: causal agent of potato blight

The oomycete Phytophthora is a pathogen responsible for destructive diseases in a wide variety of crop plants, including tomato, potato (Nowicki et al. 2012) and walnut (Belisario et al. 2012) (Fig. 1). Although the presence of a pathogen from the Phytophthora genus is almost certain, it is not possible to determine the exact strain of this pathogen. GAPDH is involved in gycolysis, and other non-metabolic processes (Tarze et al. 2007), and is a well-known housekeeping gene (Eisenberg and Levanon 2013). The Phytophthora GAPDH also shares a 70 % identity with the GAPDH in human placenta (Jenkins and Tanner 2006).

Fungi

Cryptococcus: Causal agent of cryptococcosis in human

These fungi are mostly localized in the catkins and the vegetative bud in walnut, corroborating previous results about their sexual cycle (Xue et al. 2007). Cryptococcosis is a disease of the respiratory system in human and animals, caused by Cryptococcus neoformans and C. gattii, and exacerbated in patients infected with the AIDS virus (Mitchell and Perfect 1995). Plants are known to host a large number of commensal fungi (Schmit and Mueller 2007). An interesting ecological experiment demonstrated that the plant surface is a conducive environment, stimulating the sexual cycle of Cryptococcus (Xue et al. 2007). Myo-inositol and the plant growth hormone IAA synergistically were proved as ‘strong aphrodisiacs’ (Xue et al. 2007). Two homologous cyclophilin A genes (KGB74193 and KGB74187) have been shown to influence cell growth, mating and virulence (Wang et al. 2001). A peptidyl-prolyl cis–trans isomerase from Cryptococcus (Ess1), non-homologous to the above two genes, is required only for virulence (Ren et al. 2005). Another pathogenic fungus, Pyrenophora (teres/triticirepentis), and the causal agent of the disease ‘tan spot’ (Liu et al. 2011) have been identified in several tissues (Tables 3c, 4b).

Actinobacteria: nitrogen fixing bacterial diazotrophs

Bacterial mRNA is non-polyadenylated, and most should be excluded by the RNA-Seq library preparation method, but some mRNA invariably leaks through. EndAct are present extensively in the inner tissues of living plants, and are a source of important secondary metabolites related to the defense response, growth and environmental stress (Qin et al. 2011; Palaniyandi et al. 2013). The significant homology of the putative proteins from these transcripts after BLAST results highlights the ability of the current methodology to detect a genus with fair precision. A clear example is the transcript C54818_G4_I1 that encodes a 68 bp long ORF, and matches to a ‘MULTISPECIES: hypothetical protein’ from Streptomyces with 82 % identity. The nitrogen fixing diazotroph, Paenibacilluspolymyxa P2b-2R, was found to enhance the growth of the important oilseed crop canola (Puri et al. 2016). EndAct obtained from healthy wheat tissue was shown to ‘prime’ the systemic acquired resistance (SAR) and the jasmonate/ethylene (JA/ET) pathways in Arabidopsisthaliana JA/ET pathways when infected with bacterial pathogen Erwiniacarotovora subsp. carotovora or the fungal pathogen Fusariumoxysporum, respectively (Conn et al. 2008). The importance of EndAct in biodiversity was established in a tropical rainforest native plant, which identified a total of 312 Actinobacteria associated with the order Actinomycetales (Qin et al. 2012). Recently, the genome sequence of Arthrobacterkoreensis 5J12A, a desiccation-tolerant strain, was obtained (Manzanera et al. 2015). Teicoplanin, an antibiotic working against Gram positive bacteria like methicillin-resistant Staphylococcusaureus and Enterococcusfaecalis, was obtained from the fermentation broth of a strain of Actinoplanesteichomyceticus (Jung et al. 2008). Another EndAct (Streptomyces) was shown to produce lipase, β-1-3-glucanase and chitinase (defence related enzymes), and aid plant growth (Gopalakrishnan et al. 2013). Micrococcus sp NII-0909 isolated from the Western ghat forest soil in India had demonstrable ability to enhance soil fertility and promote plant growth (Dastager et al. 2010). EndAct, for example Corynebacteria, can be associated with plant pathogenicity (Vidaver 1982). Also, juglone has been found to have inhibitory effects on some of these nitrogen fixing bacteria (Dawson et al. Dawson and Seymour 1983).

Methylibiumpetroleiphilum: involved in aerobic biodegradation of methyl tert-butyl ether

Methylibium petroleiphilum is capable of using methyl tert-butyl ether as a sole source of carbon in the root (Fig. 2c) (Nakatsu et al. 2006). Unlike Phytophthora, there are only two transcripts for Methylibium. One transcript is 461 nt long and has a 92 % identity to the M.petroleiphilum PM1 genome (Accession: CP000555.1). The ORF from this transcript has a 79 % identity to part of a protein (Accession: ABM53545.1) from uncultured beta proteobacterium CBNPD1 BAC clone 578, which was obtained in a metagenomic analysis of a freshwater toxic cyanobacteria bloom (Pope and Patel 2008). Although the PCR-generated 75-clone, 16S rRNA gene library had confirmed the presence of Proteobacteria, data here associates this protein with M.petroleiphilum. Interestingly, this transcript or ORF has no match in the new draft sequence of the Methylibium sp. strain T29, a fuel oxygenate-degrading bacterial isolate from Hungary (Szabó et al. 2015). This is explained by the differences observed: ‘unlike M. petroleiphilum PM1 our isolate does not harbor the mega plasmid which carries the genes for MTBE-degradation’ (Szabó et al. 2015).

Acyrthosiphon pisum: the pea aphid pest

While both Methylibium and Phytophthora are mostly localized in the root, thepeaaphid A. pisum, which is a pest of importance in agriculture (Van Emden and Harrington 2007), was found in the leaves (Fig. 2d). One transcript of Acyrthosiphon (C58762 _G1_I1) encodes a 91 amino acid long ORF having a 99 % match to the ribosomal L37 protein (Accession: NP 001129424.1). The low count of this transcript demonstrates the accuracy of the RNA-Seq technology (Table 3d).

Aedes aegypti: yellow fever mosquito

The presence of yellow fever mosquito in the vegetative bud was not expected (Fig. 2e) and was not previously reported in Northern California. The proteins found there include proteases (both serine and metallo), ribosomal RNA and an elongation factor (Table 6). Among the serine proteases, C40984_G1_I1 encodes a trypsin that has a significant similarity to a female reproductive tract protease from Drosophila mojavensis (Uniprot id: C5IB51) (Kelleher and Markow 2009), suggesting that the mosquito had been using the vegetative buds for reproductive purposes. The importance of the serine proteases in egg-formation abilities of mosquitoes were established using a RNAi knockdown method (Isoe et al. 2009). Another interesting development has been the recent monitoring of Aedes aegypti and Aedes albopictus by the California Department of Public Health.
Table 6

Transcripts from Aedes aegypti, the yellow fever mosquito

TranscriptDescriptionE-value
C40984 _G1_I1XM_ 001652893.1 Aedes aegypti trypsin partial mRNA0.0
C56263 _G1_I1AY432478.1 Aedes aegypti ASAP ID: 35053 metalloendopeptidase mRNA sequence3e-152
C58453 _G1_I1AY432463.1 Aedes aegypti ASAP ID: 35049 serine protease mRNA sequence0.0
C62133 _G1_I1U65375.1 AAU65375 Aedes aegypti 18S ribosomal RNA gene0.0
C24216 _G1_I1U65375.1 AAU65375 Aedes aegypti 18S ribosomal RNA gene8e-148
C5389 _G1_I1L22060.1 Aedes albopictus 8S, 5.8S, and 28S ribosomal RNA genes0.0
C58751 _G1_I1AY433205.1 Aedes aegypti solate A20 28S ribosomal RNA gene9e-163
C59860 _G1_I1AY736001.1 Aedes aegypti elongation factor 1 alpha mRNA, partial cds4e-152

These match with high significance to the yellow fever mosquito BLAST ‘nt’ database, and are mostly found in the vegetative bud (Fig. 2e). C40984_G1_I1 encodes an ORF with a 99 % identity to the Aedes aegypti trypsin. Interestingly, this protein also has a significant similarity to a female reproductive tract protease from Drosophila mojavensis, suggesting that the walnut vegetative bud has been used for egg-laying purposes by the female mosquito

Transcripts from Aedes aegypti, the yellow fever mosquito These match with high significance to the yellow fever mosquito BLAST ‘nt’ database, and are mostly found in the vegetative bud (Fig. 2e). C40984_G1_I1 encodes an ORF with a 99 % identity to the Aedes aegypti trypsin. Interestingly, this protein also has a significant similarity to a female reproductive tract protease from Drosophila mojavensis, suggesting that the walnut vegetative bud has been used for egg-laying purposes by the female mosquito Their detection sites are updated regularly. (https://www.cdph.ca.gov/HEALTHINFO/DISCOND/Pages/Aedes-albopictus-and-Aedes-aegypti-Mosquitoes.aspx). However, their detection method is not known to us.

Nucleotide-binding site (NBS) and leucine-rich repeats (LRR) in walnut

The detection of several well-characterized plant pathogens in the current study raises the question of what innate resistance mechanism in walnut could provide resistance to these virulent agents. While it is possible that these strains of these pathogens are non-virulent, it is equally likely that this plant encodes and transcribes the desired resistance genes and transcripts needed to combat a virulent response from any one of these pathogens. Plants possess two distinct kinds of defence mechanisms—the pathogen-associated molecular patterns (PAMP) mediated immunity (PTI) and effector-triggered immunity encoded by resistance (R) genes (ETI). PTI is analogous to the first line of defense (innate immunity) in vertebrates, which is bypassed or disrupted by pathogen effector molecules that are used to downregulate PTI, making the cell vulnerable to pathogen attack (Nicaise et al. 2009). R genes have evolved in this ensuing “evolutionary warfare” in plants, akin to the mammalian adaptive immunity, to recognize pathogens which contain complementary avirulence genes (DeYoung and Innes 2006). However, unlike the mammalian adaptive immune system which is enforced through specialized cells, R genes are active in all plant cells. The majority of R genes encode proteins comprise of a nucleotide-binding site (NBS) and leucine-rich repeats (LRRs). NBS-LRR proteins recognize and neutralize specialized pathogen avirulence (Avr) proteins, leading to the upregulation of PTI, thus, providing plants with resistance to the attack (Hayashi et al. 2010; Ernst et al. 2002; Borhan et al. 2004; Zhang et al. 2010). It has been hypothesized that the distribution and diversity of NBS-LRR sequences is a direct consequence of extensive duplication and random rearrangements, endowing plants with the ability to recognize diverse molecules arising from dynamically changing biotic challenges (Meyers et al. 2003). Here, we briefly describe the transcripts and expression levels of the NBS-LRRs genes in walnut. Two specific examples of such NBS-LRRs genes conferring resistance to plants are the blast-resistance gene Pb1 NBS-LRR from rice (Uniprot: E3WF10) (Hayashi et al. 2010) and the cyst nematode resistance gene from tomato (Uniprot: Q8GT46) (Ernst et al. 2002). In addition to being leucine rich, these two proteins are also abundant with the negatively charged glutamic acid (Fig. 4a). Although these specific NBS-LRRs are ~1300 amino acid long, the typical length of NBS-LRR varies from a few hundred to <2 K (McHale et al. 2006). Using these proteins as initial search entities, we have identified ~400 NBS-LRR transcripts (excluding splice variants denoted by transcripts having the same prefix) in walnut through the ‘findgene’ algorithm described earlier by us (Chakraborty et al. 2015) (Fig. 4b). This is in excellent agreement with the 374 NBS-LRR genes that were identified in a genome wide study of Chinese chestnut (Castanea mollissima) resistant to Chestnut Blight Disease (Zhong et al. 2015). The tissue-specific expression pattern allows the discrimination of the truly critical genes in this large family (Table 7). The tissue-specific nature of certain genes is exemplified by C54426_G7_I1, which has significantly higher expression in the catkins and hull, and shares 78 % identity with a Strubbelig-receptor family (SRF) from Malus domestica. SRFs are receptor-like kinases (Eyüboglu et al. 2007), and are involved in tissue morphogenesis (Vaddepalli et al. 2011) and immune response (Alcázar et al. 2010). As corroboration, we chose one NBS-LRR protein from each of the two major domains of NBS-LRR (McHale et al. 2006)—TIR-NBS-LRR (Uniprot:Q6QX58 (Borhan et al. 2004)) and CC-NBS-LRR Uniprot:Q56YM8 (Meyers et al. 2003), and obtained the same number of transcripts encoded by NBS-LRR genes. Thus, we demonstrate that transcriptomic data that has revealed the biodiversity in different tissues of walnut simultaneously provides insights into the ability of the plant to negate the threat posed by some of these potentially destructive pathogens.
Fig. 4

Nucleotide-binding site (NBS) and leucine-rich repeats (LRR) in walnut. a Amino acid frequency for two NBS-LRRs. The blast-resistance gene Pb1 NBS-LRR from rice (Uniprot: E3WF10) is in red, while the cyst nematode resistance gene from tomato (Uniprot: Q8GT46) is in green. While, expectedly, both these proteins are leucine rich, we also observe a large proportion of negatively charged glumatic acid. b Phylogenetic tree for the ~400 identified NBS-LRR genes in walnut obtained by Neighbor Joining/UPGMA phylogeny implemented in MAFFT (Katoh et al. 2002) and drawn with FigTree v1.4.2

Table 7

Expression counts of ten highly expressed transcripts from the NBS-LRR family

TranscriptCECICKEMFLHCHLHPHUIFLELMLYPKPLPTRTSETZVBL
C54426_G7_I19.919.71.202.826.23.5230.21.20.90.73.22.80.51.44.50.21.21745
C50016_G1_I30.92.22.61.382.318.91.51.598.10.47.60.60.78.62.41.90.23.31026
C52820_G1_I11.74.63.82.414.32.51 02.80.67.92.60.43.12.41.412.31.44.51.13.11010
C8180_G1_I11.32.80.80.48.1325.21.5513.92.403.80.10.317.21.60.502668
C55004_G7_I224.42.22.59.23.911.93.33.15.67.33.18.33.21.56.23.621.73.9767
C49942_G2_I12.76.72.231.314.874.87.42.91.32.71.36.55.14.23.22.85.50.8669
C47067_G1_I21.12.61.10.38.227.51.31.45.96.4194.4711.22.41.31.43.6677
C44186_G1_I31.72.54.50.55.7215.21.266.47.92.17.210.79.35.90.90.14.31034
C44186_G1_I11.92.64.60.65.42.114.81.56.86.68.12.27.31.20.89.15.40.90.14.1659
C51189_G5_I1510.34.13.516.53.619.1321.310.69.51.510.86.52.714.95.96.52.761017

These are raw counts in K (103), and are not normalized. Some NBS-LRR’s (like C54426_G7 _I1, a Strubbelig receptor kinase) are significantly overexpressed in specific tissues

Nucleotide-binding site (NBS) and leucine-rich repeats (LRR) in walnut. a Amino acid frequency for two NBS-LRRs. The blast-resistance gene Pb1 NBS-LRR from rice (Uniprot: E3WF10) is in red, while the cyst nematode resistance gene from tomato (Uniprot: Q8GT46) is in green. While, expectedly, both these proteins are leucine rich, we also observe a large proportion of negatively charged glumatic acid. b Phylogenetic tree for the ~400 identified NBS-LRR genes in walnut obtained by Neighbor Joining/UPGMA phylogeny implemented in MAFFT (Katoh et al. 2002) and drawn with FigTree v1.4.2 Expression counts of ten highly expressed transcripts from the NBS-LRR family These are raw counts in K (103), and are not normalized. Some NBS-LRR’s (like C54426_G7 _I1, a Strubbelig receptor kinase) are significantly overexpressed in specific tissues In summary, high conservation of some proteins within a genus does not allow the proper characterization of the species. Thus, although we can state with a great degree of certainty the presence of the genus Phytophthora, it is not possible to identify the exact species/strain. No viruses have been detected using the current methodology. Also, since the root samples were derived from a sterile sample, we did not detect root lesion nematodes (Pratylenchusvulnus), a major source of concern for the California walnut industry (Walawage et al. 2013). The detection of specific proteins from pathogens can serve as a target for therapeutics. The methodology described here presents an unbiased rapid tool to extract the metagenome from an RNA-Seq profile that can be used to develop diagnostics. In this study, the profile represented twenty different tissues from walnut, and the extracted metagenome from all of these tissue types presents a vivid picture of the biodiversity in its surroundings in California.
  49 in total

1.  The SWISS-MODEL workspace: a web-based environment for protein structure homology modelling.

Authors:  Konstantin Arnold; Lorenza Bordoli; Jürgen Kopp; Torsten Schwede
Journal:  Bioinformatics       Date:  2005-11-13       Impact factor: 6.937

Review 2.  Plant NBS-LRR proteins in pathogen sensing and host defense.

Authors:  Brody J DeYoung; Roger W Innes
Journal:  Nat Immunol       Date:  2006-12       Impact factor: 25.606

3.  CAGE: cap analysis of gene expression.

Authors:  Rimantas Kodzius; Miki Kojima; Hiromi Nishiyori; Mari Nakamura; Shiro Fukuda; Michihira Tagami; Daisuke Sasaki; Kengo Imamura; Chikatoshi Kai; Matthias Harbers; Yoshihide Hayashizaki; Piero Carninci
Journal:  Nat Methods       Date:  2006-03       Impact factor: 28.547

4.  Metagenomic analysis of a freshwater toxic cyanobacteria bloom.

Authors:  Phillip B Pope; Bharat K C Patel
Journal:  FEMS Microbiol Ecol       Date:  2008-03-04       Impact factor: 4.194

5.  Durable panicle blast-resistance gene Pb1 encodes an atypical CC-NBS-LRR protein and was generated by acquiring a promoter through local genome duplication.

Authors:  Nagao Hayashi; Haruhiko Inoue; Takahiro Kato; Taketo Funao; Masaki Shirota; Takehiko Shimizu; Hiroyuki Kanamori; Hiroko Yamane; Yuriko Hayano-Saito; Takashi Matsumoto; Masahiro Yano; Hiroshi Takatsuji
Journal:  Plant J       Date:  2010-10-05       Impact factor: 6.417

6.  The Ess1 prolyl isomerase is dispensable for growth but required for virulence in Cryptococcus neoformans.

Authors:  Ping Ren; Anne Rossettini; Vishnu Chaturvedi; Steven D Hanes
Journal:  Microbiology       Date:  2005-05       Impact factor: 2.777

7.  Structures of the human and Drosophila 80S ribosome.

Authors:  Andreas M Anger; Jean-Paul Armache; Otto Berninghausen; Michael Habeck; Marion Subklewe; Daniel N Wilson; Roland Beckmann
Journal:  Nature       Date:  2013-05-02       Impact factor: 49.962

8.  Molecular characterisation of the STRUBBELIG-RECEPTOR FAMILY of genes encoding putative leucine-rich repeat receptor-like kinases in Arabidopsis thaliana.

Authors:  Banu Eyüboglu; Karen Pfister; Georg Haberer; David Chevalier; Angelika Fuchs; Klaus F X Mayer; Kay Schneitz
Journal:  BMC Plant Biol       Date:  2007-03-30       Impact factor: 4.215

9.  Plant growth-promoting activities of Streptomyces spp. in sorghum and rice.

Authors:  Subramaniam Gopalakrishnan; Vadlamudi Srinivas; Meesala Sree Vidya; Abhishek Rathore
Journal:  Springerplus       Date:  2013-10-29

10.  Draft genome sequence of Methylibium sp. strain T29, a novel fuel oxygenate-degrading bacterial isolate from Hungary.

Authors:  Zsolt Szabó; Péter Gyula; Hermina Robotka; Emese Bató; Bence Gálik; Péter Pach; Péter Pekker; Ildikó Papp; Zoltán Bihari
Journal:  Stand Genomic Sci       Date:  2015-07-19
View more
  10 in total

1.  YeATSAM analysis of the walnut and chickpea transcriptome reveals key genes undetected by current annotation tools.

Authors:  Sandeep Chakraborty; Pedro J Martínez-García; Abhaya M Dandekar
Journal:  F1000Res       Date:  2016-11-17

2.  Genomic Variation Among and Within Six Juglans Species.

Authors:  Kristian A Stevens; Keith Woeste; Sandeep Chakraborty; Marc W Crepeau; Charles A Leslie; Pedro J Martínez-García; Daniela Puiu; Jeanne Romero-Severson; Mark Coggeshall; Abhaya M Dandekar; Daniel Kluepfel; David B Neale; Steven L Salzberg; Charles H Langley
Journal:  G3 (Bethesda)       Date:  2018-07-02       Impact factor: 3.154

3.  Selection of reference genes for flowering pathway analysis in the masting plants, Celmisia lyallii and Chionochloa pallens, under variable environmental conditions.

Authors:  Paula E Jameson
Journal:  Sci Rep       Date:  2019-07-05       Impact factor: 4.379

4.  Combining independent de novo assemblies to optimize leaf transcriptome of Persian walnut.

Authors:  Mohammad Sadat-Hosseini; Mohammad Reza Bakhtiarizadeh; Naser Boroomand; Masoud Tohidfar; Kourosh Vahdati
Journal:  PLoS One       Date:  2020-04-28       Impact factor: 3.240

5.  Two UGT84A Family Glycosyltransferases Regulate Phenol, Flavonoid, and Tannin Metabolism in Juglans regia (English Walnut).

Authors:  Houston J Saxe; Takanori Horibe; Bipin Balan; Timothy S Butterfield; Noah G Feinberg; Christopher M Zabaneh; Aaron E Jacobson; Abhaya M Dandekar
Journal:  Front Plant Sci       Date:  2021-02-24       Impact factor: 5.753

6.  Genome-wide association analysis of 101 accessions dissects the genetic basis of shell thickness for genetic improvement in Persian walnut (Juglans regia L.).

Authors:  Jiangtao Wang; Hang Ye; Pengpeng Chen; Huijuan Zhou; Hengzhao Liu; Ruimin Xi; Gang Wang; Na Hou; Peng Zhao
Journal:  BMC Plant Biol       Date:  2022-09-13       Impact factor: 5.260

7.  Sequence/structural analysis of xylem proteome emphasizes pathogenesis-related proteins, chitinases and β-1, 3-glucanases as key players in grapevine defense against Xylella fastidiosa.

Authors:  Sandeep Chakraborty; Rafael Nascimento; Paulo A Zaini; Hossein Gouran; Basuthkar J Rao; Luiz R Goulart; Abhaya M Dandekar
Journal:  PeerJ       Date:  2016-05-24       Impact factor: 2.984

8.  De novo transcriptome assembly, functional annotation, and expression profiling of rye (Secale cereale L.) hybrids inoculated with ergot (Claviceps purpurea).

Authors:  Khalid Mahmood; Jihad Orabi; Peter Skov Kristensen; Pernille Sarup; Lise Nistrup Jørgensen; Ahmed Jahoor
Journal:  Sci Rep       Date:  2020-08-10       Impact factor: 4.379

Review 9.  Regulation and Evolution of NLR Genes: A Close Interconnection for Plant Immunity.

Authors:  Grazia M Borrelli; Elisabetta Mazzucotelli; Daniela Marone; Cristina Crosatti; Vania Michelotti; Giampiero Valè; Anna M Mastrangelo
Journal:  Int J Mol Sci       Date:  2018-06-04       Impact factor: 5.923

10.  Decoding Wheat Endosphere-Rhizosphere Microbiomes in Rhizoctonia solani-Infested Soils Challenged by Streptomyces Biocontrol Agents.

Authors:  Ricardo Araujo; Christopher Dunlap; Steve Barnett; Christopher M M Franco
Journal:  Front Plant Sci       Date:  2019-08-26       Impact factor: 5.753

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.