| Literature DB >> 34213555 |
Margaret W Thairu1,2, Venkata Rama Sravani Meduri1, Patrick H Degnan3, Allison K Hansen1.
Abstract
Historically it has been difficult to study the evolution of bacterial small RNAs (sRNAs) across distantly related species. For example, identifying homologs of sRNAs is often difficult in genomes that have undergone multiple structural rearrangements. Also, some types of regulatory sRNAs evolve at rapid rates. The high degree of genomic synteny among divergent host-restricted bacterial lineages, including intracellular symbionts, is conducive to sRNA maintenance and homolog identification. In turn, symbiont genomes can provide us with novel insights into sRNA evolution. Here, we examine the sRNA expression profile of the obligate symbiont of psyllids, Carsonella ruddii, which has one of the smallest cellular genomes described. Using RNA-seq, we identified 36 and 32 antisense sRNAs (asRNAs) expressed by Carsonella from the psyllids Bactericera cockerelli (Carsonella-BC) and Diaphorina citri (Carsonella-DC), respectively. The majority of these asRNAs were associated with genes that are involved in essential amino acid biosynthetic pathways. Eleven of the asRNAs were conserved in both Carsonella lineages and the majority were maintained by selection. Notably, five of the corresponding coding sequences are also the targets of conserved asRNAs in a distantly related insect symbiont, Buchnera. We detected differential expression of two asRNAs for genes involved in arginine and leucine biosynthesis occurring between two distinct Carsonella-BC life stages. Using asRNAs identified in Carsonella, Buchnera, and Profftella which are all endosymbionts, and Escherichia coli, we determined that regions upstream of these asRNAs encode unique conserved patterns of AT/GC richness, GC skew, and sequence motifs which may be involved in asRNA regulation.Entities:
Keywords: zzm321990 Bactericera cockerellizzm321990 ; zzm321990 Carsonellazzm321990 ; zzm321990 Diaphorina citrizzm321990 ; antisense small RNA; small RNA; small genomes
Mesh:
Substances:
Year: 2021 PMID: 34213555 PMCID: PMC8557413 DOI: 10.1093/molbev/msab202
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 16.240
Summary of RNA-Seq Data from Bactericera cockerelli and Diaphorina citri Samples.
| Samplesa | Total Number of Reads | Reads after Quality Screen and Adapter Trimming | Reads Aligning to Genome | Average Genome Coverage |
|---|---|---|---|---|
|
| ||||
| BC-A1 | 3.81 × 107 | 3.30 × 107 | 2.24 × 106 | 974 |
| BC-A2 | 4.32 × 107 | 3.59 × 107 | 1.80 × 106 | 782 |
| BC-A3 | 4.42 × 107 | 3.63 × 107 | 1.91 × 106 | 829 |
| BC-N1 | 4.49 × 107 | 2.96 × 107 | 4.00 × 104 | 17 |
| BC-N2 | 5.92 × 107 | 2.97 × 107 | 2.61 × 104 | 11 |
| BC-N3 | 5.49 × 107 | 3.78 × 107 | 2.56 × 104 | 11 |
| BC-All | 5.75 × 107 | 3.54 × 107 | 7.41 × 104 | 32 |
|
| ||||
| D-All | 4.64 × 107 | 2.92 × 107 | 1.67 × 105 | 73 |
|
| ||||
| 2.25 × 106 | 368 | |||
BC-A1-3 and BC-N1-3 represent B. cockerelli adult and fifth instar nymph life-stage samples, respectively. BC-All and D-All are pooled samples of all B. cockerelli and D. citri life stages.
GO PANTHER Pathways of the Predicted CDSs for sRNAs Found in Carsonella-BC and Carsonella-DC.
| Predicted CDS of Expressed sRNA | ||
|---|---|---|
| Pathway Associated with Predicted sRNA Target |
|
|
| 5-Hydroxytryptamine degradation |
| — |
| Alanine biosynthesis |
| — |
|
| — |
|
|
|
| |
|
|
| |
| ATP synthesis |
|
|
|
|
| — |
|
|
| |
| De novo purine biosynthesis |
| — |
| De novo pyrimidine ribonucleotides biosynthesis |
|
|
|
|
| |
|
|
| — |
|
|
|
|
|
|
|
|
|
|
| |
|
|
|
|
|
| — | |
|
| — | |
| Pentose phosphate pathway |
|
|
|
| — |
|
|
|
|
|
Note. “—” not targeted in Carsonella taxa. Underlined pathways are related to essential amino acid biosynthesis.
Sequence and Structural Analysis of Conserved asRNAs.
| ML Distance of AA Sequences | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Alignment Lengtha | sRNA Regionb | Non-sRNA Region of Protein | ΔGc (BC/DC)d | No. of Compensatory Changes (BC/DC) | No. of Aligned | ΔG (ALL)e | No. of Compensatory Changes (ALL) | No. of Shared Pairs | % of Shared Pairs | |
|
| 89 |
| 0.208 | −29.4 | 2 | 11 | −7.6 | 1 | 0 | 0% |
|
| 111 |
| 0.229 | −42.6 | 5 | 11 | −22.8 | 4 | 18 | 53% |
|
| 178 |
| 0.238 | −63.4 | 5 | 11 | −30.1 | 5 | 19 | 36% |
|
| 240 |
| 0.107 | −90.2 | 6 | 11 | −55.4 | 7 | 39 | 48% |
|
| 329 | 0.487 | 0.458 | −126.8 | 9 | 9 | −51.3 | 10 | 14 | 13% |
|
| 107 | 0.437 | 0.312 | −26.8 | 2 | 9 | −6.3 | 5 | 5 | 15% |
|
| 138 |
| 0.220 | −42.2 | 6 | 11 | −13.2 | 2 | 5 | 11% |
|
| 150 | 0.373 | 0.231 | −45.0 | 5 | 11 | −13.7 | 4 | 0 | 0% |
|
| 145 |
| 0.569 | −44.1 | 6 | 11 | −10.0 | 6 | 5 | 11% |
|
| 193 |
| 0.265 | −65.4 | 7 | 11 | −30.3 | 7 | 0 | 0% |
|
| 143 |
| 0.435 | −37.3 | 2 | 11 | −18.1 | 2 | 16 | 39% |
Length of overlapping sRNA region with additional 15 nt upstream and downstream in Carsonella-BC and Carsonella-DC.
Values in italics are significantly more conserved than surrounding protein coding region in one-tailed t-test, P < 0.01.
Thermodynamic ensemble prediction (kcal/mol) from RNAalifold.
BC/DC = Carsonella-BC and Carsonella-DC.
ALL = all aligned Carsonella strains.
GO PANTHER Pathways of the Predicted CDSs for sRNAs of Carsonella-BC That Are Differentially Expressed between the Adult (BC-A1-3) and Nymph (BC-N1-3) Samples.
| Predicted CDS of Expressed sRNA | ||
|---|---|---|
| Pathway Associated with Predicted sRNA Target | Predicted CDS of Differentially Expressed sRNA | Life Stage That sRNA Is Upregulated |
|
|
| Adult (BC-A1-3) |
| De novo pyrimidine ribonucleotides biosynthesis |
| Adult (BC-A1-3) |
|
|
| Adult (BC-A1-3) |
Note.—Underlined pathways are related to essential amino acid biosynthesis.
Fig. 1.Sliding window analysis of the nucleotide content found in the 61 nt upstream region of expressed asRNAs in symbiont lineages and Escherichia coli. The shaded region around each line represents the standard error. Windows that have a significant nucleotide enrichment (P < 0.05) compared with a randomized sequence are marked by a *. The red line under the * indicates how many windows are significant within that region. See supplementary table 6, Supplementary Material online for sample size for each.
Fig. 2.Sliding window analysis of the GC skew found in the 61 nt upstream region of expressed asRNAs and coding sequences in symbiont lineages and Escherichia coli. The shaded region around each line represents the standard error. Sample sizes: Buchnera 5A: CDS = 558, asRNAs = 90; Buchnera-AK: CDS = 569, asRNA = 70; Buchnera-SG: CDS = 545, asRNAs = 73; Buchnera-UA CDS = 539, asRNAs = 76; Carsonella-BC: CDS = 196, asRNAs = 36; Carsonella-DC: CDS = 207, asRNAs = 33; Profftella: CDS = 367, asRNAs = 181; E. coli: CDS = 4,140, asRNAs = 16.
Significantly Enriched Motifs Identified in the Upstream Regions of Expressed sRNAs Identified in Carsonella, Profftella, and Buchnera Genomes.
2 Carsonella-BC
| Genome(s) | No. of sRNA Upstream Regions Searched | Motifa |
| % of Sequences w/Motif |
|---|---|---|---|---|
|
| ||||
| | 34 | GATWWWKTAAHAAWWKBAGSW | 3.7E-05 | 62% |
| | 159 | YCASSWWBWHSWGMAATTGCWSMAGC | 2.0E-05 | 8% |
| | 89 | ADMWDCAKYWWTWKYWDYTTTTTYT | 8.3E-06 | 37% |
| | 67 | TATYTTCARWATTWBTWWYTTTTTTTTTD | 2.4E-03 | 28% |
| | 67 | GCAWYARNTHCTGCT | 1.1E-05 | 30% |
|
| ||||
| | ||||
| | 65 | TGVTRATTYARKAAWAVMWGCWTKAKCWD | 3.6E-06 | 45% |
| | 294 | WAAWGCWAHTRHTTYTTTTTY | 2.1E-37 | 24% |
| | ||||
| | CTGMTTTTAHTMTTSATVSWMMWATWYDRTWWGRTNYTGC | 5.7E-46 | 5% | |
| | AAAAAAAWMWACAAAWAWTRRADATAWWT | 4.2E-24 | 15% | |
|
| ||||
| | ||||
| | 518 | HAGCWDYWGVTRYAKCWSCW | 9.0E-33 | 9% |
| | ||||
| | ||||
| | RCTGMTWTWAHTATTGMTVBWVVWATWMDRTWTGATGTTSC | 3.9E-49 | 8% | |
| | ||||
| | YTAAAAAAAWWAAARAAAAAR | 2.2E-17 | 18% | |
N = A/C/G/T, V = A/C/G, H = A/C/G, D = A/G/T, B = C/G/T, M = A/C, R = A/G, W = A/T, W = A/T, S = C/G, Y = C/T, K = G/T.