| Literature DB >> 26413905 |
Nicolas Delhomme1, Görel Sundström2, Neda Zamani2, Henrik Lantz3, Yao-Cheng Lin4, Torgeir R Hvidsten5, Marc P Höppner3, Patric Jern6, Yves Van de Peer7, Joakim Lundeberg8, Manfred G Grabherr3, Nathaniel R Street1.
Abstract
After performing de novo transcript assembly of >1 billion RNA-Sequencing reads obtained from 22 samples of different Norway spruce (Picea abies) tissues that were not surface sterilized, we found that assembled sequences captured a mix of plant, lichen, and fungal transcripts. The latter were likely expressed by endophytic and epiphytic symbionts, indicating that these organisms were present, alive, and metabolically active. Here, we show that these serendipitously sequenced transcripts need not be considered merely as contamination, as is common, but that they provide insight into the plant's phyllosphere. Notably, we could classify these transcripts as originating predominantly from Dothideomycetes and Leotiomycetes species, with functional annotation of gene families indicating active growth and metabolism, with particular regards to glucose intake and processing, as well as gene regulation.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26413905 PMCID: PMC4586145 DOI: 10.1371/journal.pone.0139080
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Two-dimensional distribution of expression and GC content differentiates fungal and plant transcripts.
(a) Percentage GC frequency of transcripts annotated as Embryophyta (brown) or fungi (purple) on the basis of maximum sequence similarity to the UniRef90 database. (b) Frequency distribution of tau score calculated using FPKM expression values for plant (brown) and fungal (blue) transcripts in all 22 RNA sequencing samples. The tau score ranges from 1 for complete specificity to 0 for equal expression in all samples. The Embryophyta tau score distribution is significantly different from that of the fungi (Welch t-test p-value << 0.001).
Fig 2Heatmap representation of fungal expression patterns.
Expression clustering based on FPKM values pooled across gene clusters, i.e. including both paralogs of gene families as well as species orthologs. The heatmap represents gene clusters with the highest numbers of sequences (>40).