| Literature DB >> 32328037 |
Nikhil Ram-Mohan1, Michelle M Meyer1.
Abstract
Periodontitis is an inflammatory disease that deteriorates bone supporting teeth afflicting ∼743 million people worldwide. Bacterial communities associated with disease have been classified into red, orange, purple, blue, green, and yellow complexes based on their roles in the periodontal pocket. Previous metagenomic and metatranscriptomics analyses suggest a common shift in metabolic signatures in disease vs. healthy communities with up-regulated processes including pyruvate fermentation, histidine degradation, amino acid metabolism, TonB-dependent receptors. In this work, we examine existing metatranscriptome datasets to identify the commonly differentially expressed transcripts and potential underlying RNA regulatory mechanisms behind the metabolic shifts. Raw RNA-seq reads from three studies (including 49 healthy and 48 periodontitis samples) were assembled into transcripts de novo. Analyses revealed 859 differentially expressed (DE) transcripts, 675 more- and 174 less-expressed. Only ∼20% of the DE transcripts originate from the pathogenic red/orange complexes, and ∼50% originate from organisms unaffiliated with a complex. Comparison of expression profiles revealed variations among disease samples; while specific metabolic processes are commonly up-regulated, the underlying organisms are diverse both within and across disease associated communities. Surveying DE transcripts for known ncRNAs from the Rfam database identified a large number of tRNAs and tmRNAs as well as riboswitches (FMN, glycine, lysine, and SAM) in more prevalent transcripts and the cobalamin riboswitch in both more and less prevalent transcripts. In silico discovery identified many putative ncRNAs in DE transcripts. We report 15 such putative ncRNAs having promising covariation in the predicted secondary structure and interesting genomic context. Seven of these are antisense of ribosomal proteins that are novel and may involve maintaining ribosomal protein stoichiometry during the disease associated metabolic shift. Our findings describe the role of organisms previously unaffiliated with disease and identify the commonality in progression of disease across three metatranscriptomic studies. We find that although the communities are diverse between individuals, the switch in metabolic signatures characteristic of disease is typically achieved through the contributions of several community members. Furthermore, we identify many ncRNAs (both known and putative) which may facilitate the metabolic shifts associated with periodontitis.Entities:
Keywords: antisense; metatranscriptomics; non-coding RNA; oral microbiome; periodontitis; riboswitch
Year: 2020 PMID: 32328037 PMCID: PMC7160235 DOI: 10.3389/fmicb.2020.00482
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
FIGURE 1Distribution of the number and magnitude of differentially expressed (DE) transcripts in the genera represented in the oral microbiome. (A) The number of DE transcripts originating from different genera. Juxtaposed lines represent different species within a genus that have DE transcripts. Bars are colored based on the classification of the organism into periodontitis associated complexes (Socransky et al., 1998). Actinomyces and Streptococcus are the genera with the largest number of different species that undergo differential expression during disease. (B) Magnitude of up- or down-regulation (Log2 fold-change) of the transcripts during disease. Each point represents a DE transcript and points are colored by the complex in which the organism is classified (Socransky et al., 1998). A large fraction of the differentially transcripts originate from organisms that are unaffiliated to a specific disease associated complex. Transcripts from members of the red and orange complexes undergo higher magnitude differential expression.
Species with the largest number of DE transcripts.
| Organism | Complex | % of total up-regulated transcripts |
| Blue | ∼11% | |
| Unaffiliated | ∼7% | |
| Orange | ∼7% | |
| Unaffiliated | ∼6% | |
| Unaffiliated | ∼5% | |
| Purple | ∼5% | |
| Unaffiliated | ∼3% | |
| Orange | ∼3% | |
| Orange | ∼2% | |
| Unaffiliated | ∼2% |
FIGURE 2Correlation in expression of differentially expressed (DE) transcripts among individual samples. Expression of the DE transcripts (from the pooled dataset) was calculated for in each individual sample and the pair-wise correlation between samples based on these transcripts computed and a heatmap generated to represent the correlation. Red is high correlation (PCC = 1) while blue is a negative correlation (PCC = -1). Clustering of samples based on the similarity in expression is represented on top (since this is a pairwise comparison, only the column dendrogram is represented). Samples are further annotated by state – disease or healthy, and by origination study. Clustering of disease samples across different origination studies based on expression is not observed. Possible clustering of samples based on the origination study suggests inherent differences between studies and may be attributed to many variables such as inclusion criteria, disease progression, or sampling and sequencing methodologies.
FIGURE 3Prevalence of functionally active taxa across disease samples. Screening for the functionally active microorganisms in the disease samples. (A) Heatmap of the 50 most functionally active genera from the overall dataset identified from the metatranscriptomes using HUMAnN2 (Abubucker et al., 2012). Gradient of color represents the abundance of the genus in a disease sample. Dendrograms represent hierarchical clustering of samples (column) and genera (rows) representing the similarity in the taxonomic composition between the samples and relative abundance of various genera, respectively. (B) Frequency distribution of the genera identified in at least 24 of the 48 disease samples. Streptococcus, Rothia, and Veillonella were found active in 40 out of the 48 disease samples. Porphyromonas, Tannerella, and Fusobacterium were found active in 30 samples or fewer.
FIGURE 4Pyruvate metabolism during disease. Rendering of the KEGG pathway of pyruvate metabolism annotated with the DE transcripts and the microbial complexes from which they originate. Differential expression is represented by the black rectangle around the KEGG Orthology term. Background color for the KEGG orthology term represents the microbial complex of the organism that showed differential expression. The pathway as a whole is up-regulated, but the individual genes are up-regulated in diverse species supporting polymicrobial synergy during periodontitis.
FIGURE 5Contributors to pyruvate metabolism during disease. Relative contributions of different bacteria in the two pyruvate metabolism pathways in different samples. (A) Relative contributions of various organisms in the fermentation of pyruvate to acetate and lactate in each disease sample. Top 15 contributing organisms are listed by name and the rest are grouped as others. Columns are annotated by the study they come from. Different organisms perform similar functions in different disease samples maintaining the functional signature despite the community composition. (B) Relative contribution of various organisms in the fermentation of pyruvate to isobutanol. Again, functional signature is maintained despite differences in the organisms carrying out fermentation in the different samples.
FIGURE 6Known ncRNAs identified in differentially expressed transcripts. Known ncRNA identified in DE transcripts, and the genus from which the ncRNA originates. Triangles colors correspond to the organism’s microbial complex, orientation depicts whether the transcript was up- or down-regulated, and the size indicates the magnitude of log2 fold change. tRNA and tmRNA are the most abundantly identified ncRNAs. No known ncRNAs were identified in a large number of DE transcripts.
FIGURE 7de novo discovered ncRNAs in differentially expressed transcripts. Transcripts that showed no known ncRNAs were subjected to de novo RNA structure discovery using Graphclust (Heyne et al., 2012). Potential ncRNAs were manually curated and a subset of ncRNA were chosen based on predicted secondary structure and evidence for covariation in the base pairing and/or conserved genomic context. Significance of the covariation in predicted base pairs was further tested using R-scape (Rivas et al., 2016a, b). Red * indicate base pairs with E ≤ 0.05 and yellow ** indicate base pairs with E ≤ 1. (A) Novel putative ncRNAs discovered in the up-regulated transcripts. Consensus sequence and secondary structure of each ncRNA after scanning against Refseq77 and removing hits that did not share at least 60% of the secondary structure. Covarying base pairs are shaded. (B) Novel putative ncRNAs discovered in down-regulated transcripts produced as described above for the up-regulated transcripts. (C) Phylogenetic distribution of novel putative ncRNAs in the Bacterial domain. The points represent the percentage of genomes in which the ncRNA was found compared to the genomes in Refseq77 within a phylum. 13 of the 16 novel ncRNAs were found in a fraction of all the Firmicutes genomes. ncRNA-26 (antisense of ribosomal protein S19) and ncRNA-94 (antisense of ribosomal protein S13) are the most widely distributed across the Bacterial domain.
Of the 224-predicted putative ncRNAs in the up-regulated transcripts, 9 sense and antisense putative ncRNAs were chosen (Figure 7A) based on their secondary structure and genomic context (Supplementary File S1). Three of these ncRNAs appear to act as 5′-UTR cis regulators. The first, ncRNA-161, was identified in a transcript from Streptococcus anginosus CCUG 39159 (logFC of ∼2.9). This ncRNA is in the beginning of dihydroxyacetone kinase and is found only in the Firmicutes (Figures 7A,C), almost exclusively in Streptococcus with the exception of Bacillus sp. 1NLA3E. Our second candidate ncRNA-116, was found upstream of the rnfA gene in Fusobacterium nucleatum ATCC 25586. This nitrogen fixation gene involved in electron transport to nitrogenase (Schmehl et al., 1993) was found to have a fold change of ∼4.3 between healthy and disease samples. Furthermore, the putative riboregulator is more highly distributed, appearing in a diverse group of phyla including Firmicutes, Proteobacteria, Bacteroidetes, Spirochaetes, Thermotogae, and Fusobacteria (Figure 7C). The third candidate, ncRNA-66, was identified upstream of a hypothetical protein in Actinomyces oral taxon 175 F0384 that had a logFC of ∼1.42 and upstream of a hypothetical protein in Rothia dentocariosa ATCC 17931 with a logFC of ∼1.41. Survey of the phylogenetic distribution revealed that ncRNA-66 was found distributed upstream of NAD-dependent dehydrolase, UDP-glucose-4-epimerase, and hypothetical proteins across the Actinobacteria and Spirochaetes (Figure 7C).
In addition to potential 5′UTR ncRNAs discussed above, our analyses also found four up-regulated putative cis-antisense regulators associated with ribosomal protein genes. Our first candidate, ncRNA-196, was identified in three up-regulated transcripts, located downstream and antisense of the ribosomal protein S15 coding region. Two of these transcripts belonged to strains of Rothia dentocariosa – M567 and ATCC 17931 with logFC of ∼2.35 and 1.7, respectively, while the third transcript belonged to Streptococcus oral taxon 071 73H25AP displayed a logFC of 3.15. This putative ncRNA was found to be distributed through Firmicutes and Actinobacteria – mostly in the genus Streptococcus and in Bacillus megaterium WSH-002 within the Firmicutes; and Rothia dentocariosa in the Actinobacteria. Our second candidate, ncRNA-20 is another example of a putative cis antisense regulator of a ribosomal protein, S14. It was found in four up-regulated transcripts, three of which were from the genus Actinomyces and one from Rothia; Actinomyces oral taxon 175 F0384 displayed a logFC of ∼2.21; Actinomyces oris K20: ∼1.5; and Actinomyces naeslundii MG1: ∼1.5; Rothia dentocariosa M567: ∼1.8. Surveying Refseq77 revealed that this ncRNA is distributed across the Actinobacteria, Deinococcus-Thermus, and Firmicutes. A third candidate, ncRNA-3, is found cis-antisense of ribosomal protein L20 and represented in two transcripts, both from the genus Streptococcus – Streptococcus peroris ATCC 700780 and Streptococcus cristatus ATCC 51100 with a fold change of ∼1.69 and ∼1.66, respectively. Phylogenetic distribution of this ncRNA spans the Firmicutes, Proteobacteria, and Bacteroidetes. Finally, we identified a putative ncRNA antisense of ribosomal protein S19 (ncRNA-26). This ncRNA was found in three highly up-regulated transcripts from Tannerella forsythia (logFC ∼7.5), Fusobacterium nucleatum (logFC ∼4.0), and Veillonella parvula (logFC ∼2.3). It is widely distributed in the Bacterial domain. Of these antisense ncRNAs for ribosomal proteins, we find additional evidence for expression of ncRNA-196 and ncRNA-3 in our recent study of the Streptococcus pneumoniae TIGR4 transcriptional profile (Warrier et al., 2018).
We also find up-regulated putative antisense ncRNAs associated with a variety of other processes. Our first candidate, ncRNA-9 is associated with the elongation factor G gene, fusA, that is differentially expressed in Streptococcus oligofermentans and Streptococcus anginosus. It is narrowly distributed and is identified only in ∼6% of the Firmicutes genomes in Refseq77. A second example is ncRNA-8, which is antisense and downstream of the cell cycle protein gpsB. We find ncRNA-8 in Streptococcus infantarius ATCC BAA-102 with the transcript having a logFC of ∼2.5. ncRNA-8 is narrowly distributed to only Firmicutes and Tenericutes.
We also identified six promising antisense ncRNAs in the down-regulated transcripts (Figures 7B,C). Many of these also putatively regulate ribosomal proteins. ncRNA-118 was identified antisense of ribosomal protein S9 in Streptococcus mutans UA 159 (down-regulated by ∼−1.48 fold). This putative ncRNA is narrowly distributed across Refseq77 and is identified only in Firmicutes and Tenericutes (Figure 7C). A second example is ncRNA-73, which is antisense to the beginning of the ribosomal protein S1 coding region in Actinomyces oral taxon 180 F0310 (down-regulated ∼−1.6 fold). Surveying the bacterial genomes in Refseq77 revealed that this ncRNA is unique to the Actinobacteria. A third example is the putative ncRNA antisense of ribosomal protein S13, ncRNA-94, which was identified in Granulicatella adiacens ATCC 49175 (down-regulated ∼−1.6 fold). ncRNA-94 is widely distributed across the Bacterial domain and is absent only in certain phyla such as Acidobacteria, Claocimonetes, and Deferribacteres.
In addition to ribosomal proteins, we also find down-regulated transcripts antisense to genes involved in sugar metabolism. ncRNA-56 was identified in the Klebsiella pneumoniae located antisense to the beginning of the beta-galactosidase gene, and is extremely down-regulated (-6.8 fold). Surveying its distribution, ncRNA-56 is found in the genomes of other Actinobacteria, Firmicutes, Proteobacteria, and Tenericutes. We also find two antisense ncRNAs putatively regulating different steps of glycolysis. ncRNA-38 was discovered antisense and overlapping the beginning of the down-regulated phosphoglycerate mutase gene. It was found in a fraction of the Firmicutes, Fusobacteria, and Proteobacteria (Figure 7C). The other putative regulator of glycolysis is ncRNA-68, which is found antisense and overlapping the translational start of glyceraldehyde-3-phosphate dehydrogenase. However, ncRNA-68 was found only in the Firmicutes. By applying de novo discovery pipelines on the DE transcripts, we have identified several promising sense and antisense putative regulators of bacterial ribosomal proteins and other metabolic genes that are associated with periodontitis as reflected in their expression in the healthy vs. disease samples.