| Literature DB >> 32481751 |
Martin Raden1, Fabio Gutmann1, Michael Uhl1, Rolf Backofen1,2.
Abstract
In silico RNA-RNA interaction prediction is widely applied to identify putative interaction partners and to assess interaction details in base pair resolution. To verify specific interactions, in vitro evidence can be obtained via compensatory mutation experiments. Unfortunately, the selection of compensatory mutations is non-trivial and typically based on subjective ad hoc decisions. To support the decision process, we introduce our COmPensatOry MUtation Selector CopomuS. CopomuS evaluates the effects of mutations on RNA-RNA interaction formation using a set of objective criteria, and outputs a reliable ranking of compensatory mutation candidates. For RNA-RNA interaction assessment, the state-of-the-art IntaRNA prediction tool is applied. We investigate characteristics of successfully verified RNA-RNA interactions from the literature, which guided the design of CopomuS. Finally, we evaluate its performance based on experimentally validated compensatory mutations of prokaryotic sRNAs and their target mRNAs. CopomuS predictions highly agree with known results, making it a valuable tool to support the design of verification experiments for RNA-RNA interactions. It is part of the IntaRNA package and available as stand-alone webserver for ad hoc application.Entities:
Keywords: RNA-RNA interaction; compensatory mutation; design; mutation; sRNA
Mesh:
Substances:
Year: 2020 PMID: 32481751 PMCID: PMC7311995 DOI: 10.3390/ijms21113852
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1Depiction of an RNA-RNA interaction verification experiment based on compensatory mutations of two RNA sequences A and B. The mutated nucleotides are highlighted by red circles. The lost and regained base pair is given as red line. Black solid lines depict likely formed RRI base pairs, while unlikely base pairs (instable due to reduced RRI) are represented in dotted lines.
Single-nucleotide CoMs from literature used for this study.
| RNA-1 | RNA-2 | Mutation | Source |
|---|---|---|---|
| OxyS_NC_000913 | b2731_NC_000913_-200+100 | G102C&C-13G | [ |
| CyaR_NC_000913 | b2687_NC_000913_-200+100 | A44U&U-7A | [ |
| MicA_NC_000913 | b0411_NC_000913_-200+100 | C11G&G-46C | [ |
| MicA_NC_000913 | b0814_NC_000913_-200+100 | C7G&G17C | [ |
| RybB_NC_000913 | b0805_NC_000913_-200+100 | C2G&G-71C | [ |
| RybB_NC_000913 | b2594_NC_000913_-200+100 | C2G&G4C | [ |
| MicF_NC_000913 | b0889_NC_000913_-200+100 | C2G&G11C | [ |
| SgrS_NC_000913 | b1101_NC_000913_-200+100 | G178C&C-19G | [ |
| DsrA_NC_000913 | b2741_NC_000913_-200+100 | C16G&G-103C | [ |
| RprA_NC_000913 | b2741_NC_000913_-200+100 | C42G&G-103C | [ |
| ArcZ_NC_000913 | b2741_NC_000913_-200+100 | C70G&G-103C | [ |
| ArcZ_NC_000913 | b3546_NC_000913_-200+100 | C69G&G-10C | [ |
| CyaR_NC_000913 | b1824_NC_000913_-200+100 | G32C&C-11G | [ |
| FnrS_NC_000913 | b2531_NC_000913_-200+100 | C47G&G-3C | [ |
| RyhB_NC_000913 | b3365_NC_000913_-200+100 | C45G&G5C | [ |
| RybB_NC_003197 | STM1473_NC_003197_-200+100 | C2G&G19C | [ |
| MicF_NC_003197 | STM0366_NC_003197_-200+100 | C6G&G-31C | [ |
| MicF_NC_003197 | STM0959_NC_003197_-200+100 | C6G&G7C | [ |
| RybB_NC_003197 | STM0413_NC_003197_-200+100 | C2G&G-8C | [ |
| RybB_NC_003197 | STM0999_NC_003197_-200+100 | C2G&G-39C | [ |
| RybB_NC_003197 | STM1070_NC_003197_-200+100 | C2G&G31C | [ |
| RybB_NC_003197 | STM1572_NC_003197_-200+100 | C2G&G25C | [ |
| RybB_NC_003197 | STM1732_NC_003197_-200+100 | C2G&G19C | [ |
| RybB_NC_003197 | STM1995_NC_003197_-200+100 | C2G&G19C | [ |
| RybB_NC_003197 | STM2267_NC_003197_-200+100 | C2G&G-42C | [ |
| RybB_NC_003197 | STM2391_NC_003197_-200+100 | C2G&G55C | [ |
| CyaR_NC_003197 | STM0833_NC_003197_-200+100 | A43U&U-5A | [ |
| SgrS_NC_003197 | STM2945_NC_003197_-200+100 | G176C&C5G | [ |
| ArcZ_NC_003197 | STM1682_NC_003197_-200+100 | G70C&C22G | [ |
| ArcZ_NC_003197 | STM2970_NC_003197_-200+100 | G70C&C-12G | [ |
| MicC_NC_003197 | STM1572_NC_003197_-200+100 | C9G&G69C | [ |
Multi-nucleotide CoMs from literature used for this study.
| RNA-1 | RNA-2 | Mutation | Source |
|---|---|---|---|
| Spot42_NC_000913 | b2702_NC_000913_-200+100 | U23A&A-4U,C24G&G-5C,U25A&A-6U | [ |
| Spot42_NC_000913 | b3962_NC_000913_-200+100 | G49C&C21G,U50A&A20U,A51U&U19A | [ |
| Spot42_NC_000913 | b4311_NC_000913_-200+100 | G5A&C-20U,G6C&U-21G,U7G&A-22C | [ |
| Spot42_NC_000913 | b1302_NC_000913_-200+100 | G55C&C-33G,G56A&C-34U,A57C&U-35G | [ |
| Spot42_NC_000913 | b2715_NC_000913_-200+100 | G5A&C-25U,G6C&C-26G,U7G&A-27C | [ |
| Spot42_NC_000913 | b2801_NC_000913_-200+100 | G49C&C-32G,U50A&A-33U,A51U&U-34A | [ |
| Spot42_NC_000913 | b3224_NC_000913_-200+100 | G5A&C-54U,G6C&C-55G,U7G&A-56C | [ |
| Spot42_NC_000913 | b1901_NC_000913_-200+100 | G5A&C-34U,G6C&C-35G,U7G&G-36U | [ |
| MicA_NC_000913 | b1130_NC_000913_-200+100 | C7G&G7C,G8C&C6G,C9G&G5C,G10C&C4G | [ |
| GcvB_NC_000913 | b1130_NC_000913_-200+100 | C158G&G-13C,U157A&A-14U,G156C&C-15G ,U155A&A-16U,C154G&G-17C | [ |
| CyaR_NC_000913 | b1740_NC_000913_-200+100 | A40U&U-3A,G39A&C-2U | [ |
| CyaR_NC_000913 | b2666_NC_000913_-200+100 | A40U&U7A,G39A&U8U,G38C&C9G | [ |
| ArcZ_NC_000913 | b1892_NC_000913_-200+100 | U78A&G-60U,U77A&A-59U,G76G&C-58C,U75C&A-57G,G74C&U-56G,G73U&C-55A | [ |
| OxyS_NC_000913 | b1892_NC_000913_-200+100 | A69U&U-18A,A68A&U-17U,U67C&A-16G ,A66C&U-15G,A65U&U-14A | [ |
| RybB_NC_000913 | b0721_NC_000913_-200+100 | U12A&A-21U,C13G&G-22C | [ |
| RyhB_NC_000913 | b0721_NC_000913_-200+100 | U51A&A-15U,A50U&U-14A | [ |
| Spot42_NC_000913 | b0721_NC_000913_-200+100 | G13C&C-53G,G14C&C-54G | [ |
| FnrS_NC_000913 | b0755_NC_000913_-200+100 | C47A&G-4U,U48A&A-5U,U49G&A-6C | [ |
| FnrS_NC_000913 | b1479_NC_000913_-200+100 | U57A&A-13U,U58G&A-14C,U59A&A-15U | [ |
| FnrS_NC_000913 | b2153_NC_000913_-200+100 | G5U&C-18A,G4C&C-19G | [ |
| FnrS_NC_000913 | b2303_NC_000913_-200+100 | G5U&C-6A,G4C&C-5G | [ |
| RyhB_NC_000913 | b1656_NC_000913_-200+100 | C49G&G-6C,C45G&G-3C,G44C&C-2G | [ |
| RyhB_NC_000913 | b0592_NC_000913_-200+100 | G53U&C-7A,C54U&G-8A,U55C&A-9G | [ |
| RyhB_NC_000913 | b2155_NC_000913_-200+100 | C47A&G-47U,A48U&U-48A,C49A&G-49U ,A50C&U-50G | [ |
| Spot42_NC_000913 | b1761_NC_000913_-200+100 | C46G&G86C,C48G&G88C | [ |
| RybB_NC_003197 | STM0687_NC_003197_-200+100 | C5G&G14C,A4U&U15A | [ |
| MicA_NC_003197 | STM4231_NC_003197_-200+100 | A22C&U5G,A12G&U14C | [ |
| GcvB_NC_003197 | STM3930_NC_003197_-200+100 | U84G&A-15C,G85A&C-16U,U86A&A-17U ,U87C&A-18G,U88A&A-19U | [ |
sRNAs used within this study.
| sRNA_Genome | Sequence |
|---|---|
| ArcZ_NC_000913 | GTGCGGCCTGAAAAACAGTGCTGTGCCCTTGTAACTCATCATAATAATTTACGGCGCAGCCAAGATTTCCCTGGTGTTGGCGCAGTATTCGCGCACCCCGGTCTAGCCGGGGTCATTTTTT |
| ArcZ_NC_003197 | GTGCGGCCTGAAAACAGGACTGCGCCTTTGACATCATCATAATAAGCACGGCGCAGCCACGATTTCCCTGGTGTTGGCGCAGTATTCGCGCACCCCGGTCAAACCGGGGTCATTTTTT |
| CyaR_NC_000913 | GCTGAAAAACATAACCCATAAAATGCTAGCTGTACCAGGAACCACCTCCTTAGCCTGTGTAATCTCCCTTACACGGGCTTATTT |
| CyaR_NC_003197 | GCTGAAAAACATAACCCATAAATGCTAGCTGTACCAGGAACCACCTCCTTGGCCTGCGTAATCTCCCTTACGCAGGCTTATTT |
| DsrA_NC_000913 | AACACATCAGATTTCCTGGTGTAACGAATTTTTTAAGTGCTTCTTGCTTAAGCAAGTTTCATCCCGACCCCCTCAGGGTCGGGATTTTT |
| FnrS_NC_000913 | GCAGGTGAATGCAACGTCAAGCGATGGGCGTTGCGCTCCATATTGTCTTACTTCCTTTTTTGAATTACTGCATAGCACAATTGATTCGTACGACGCCGACTTTGATGAGTCGGCTTTTTTTT |
| GcvB_NC_000913 | ACTTCCTGAGCCGGAACGAAAAGTTTTATCGGAATGCGTGTTCTGGTGAACTTTTGGCTTACGGTTGTGATGTTGTGTTGTTGTGTTTGCAATTGGTCTGCGATTCAGACCATGGTAGCAAAGCTACCTTTTTTCACTTCCTGTACATTTACCCTGTCTGTCCATAGTGATTAATGTAGCACCGCCTAATTGCGGTGCTTT |
| GcvB_NC_003197 | ACTTCCTGAGCCGGAACGAAAAGTTTTATCGGAATGCGTGTTCTGATGGGCTTTTGGCTTACGGTTGTGATGTTGTGTTGTTGTGTTTGCAATTGGTCTGCGATTCAGACCACGGTAGCGAGACTACCCTTTTTCACTTCCTGTACATTTACCCTGTCTGTCCATAGTGATTAATGTAGCACCGCCATATTGCGGTGCTTT |
| MicA_NC_000913 | GAAAGACGCGCATTTGTTATCATCATCCCTGAATTCAGAGATGAAATTTTGGCCACTCACGAGTGGCCTTTT |
| MicA_NC_003197 | GAAAGACGCGCATTTGTTATCATCATCCCTGTTTTCAGCGATGAAATTTTGGCCACTCCGTGAGTGGCCTTTT |
| MicC_NC_003197 | GTTATATGCCTTTATTGTCACATATTCATTTTGTCGCTGGGCCATTGCGTTAACCTTTGCTTTCCAGCGTATAAATTGACAAGCCCGAACGGATGTTCGGGCTTTTTTT |
| MicF_NC_000913 | GCTATCATCATTAACTTTATTTATTACCGTCATTCATTTCTGAATGTCTGTTTACCCCTATTTCAACCGGATGCCTCGCATTCGGTTTTTTTT |
| MicF_NC_003197 | GCTATCATCATTAACTTTATTTATTACCGTCATTCACTTCTGAATGTCTGTTTACCCCTATTTCAACCGGATGCTTCGCATTCGGTTTTTTTT |
| OxyS_NC_000913 | GAAACGGAGCGGCACCTCTTTTAACCCTTGAAGTCACTGCCCGTTTCGAGAGTTTCTCAACTCGAATAACTAAAGCCAACGTGAACTTTTGCGGATCTCCAGGATCCGC |
| RprA_NC_000913 | ACGGTTATAAATCAACATATTGATTTATAAGCATGGAAATCCCCTGAGTGAAACAACGAATTGCTGTGTGTAGTCTTTGCCCATCTCCCACGATGGGCTTTTTTT |
| RybB_NC_000913 | GCCACTGCTTTTCTTTGATGTCCCCATTTTGTGGAGCCCATCAACCCCGCCATTTCGGTTCAAGGTTGATGGGTTTTTT |
| RybB_NC_003197 | GCCACTGCTTTTCTTTGATGTCCCCATTTTGTGGAGCCCATCAACCCCGCCATTTCGGTTCAAGGTTGGTGGGTTTTTT |
| RyhB_NC_000913 | GCGATCAGGAAGACCCTCGCGGAGAACCTGAAAGCACGACATTGCTCACATTGCTTCCAGTATTACTTAGCCAGCCGGGTGCTGGCTTTT |
| SgrS_NC_000913 | GATGAAGCAAGGGGGTGCCCCATGCGTCAGTTTTATCAGCACTATTTTACCGCGACAGCGAAGTTGTGCTGGTTGCGTTGGTTAAGCGTCCCACAACGATTAACCATGCTTGAAGGACTGATGCAGTGGGATGACCGCAATTCTGAAAGTTGACTTGCCTGCATCATGTGTGACTGAGTATTGGTGTAAAATCACCCGCCAGCAGATTATACCTGCTGGTTTTTTTT |
| SgrS_NC_003197 | GATGAAGCAAGAGGAAGAGGTCACTATGCGCCAGTTCTGGTTGAGATATTTTGCCGCGACGGAAAAAACGTCCTGGCTGGCTTGCCTGAGCGCACCGCAGCGCTTAAAAATGCTCGCGGAACTGATGCAGTGGGAGGCGACCGATTGAAGCCAATTGCAGACATCATGTGTGACTGAGTATTGGTGTAGGCGATAGCCTAAAATCACCCGCCAGCAGATAATATCTGCTGGCTTTTTTT |
| Spot42_NC_000913 | GTAGGGTACAGAGGTAAGATGTTCTATCTTTCAGACCTTTTACTTCACGTAATCGGATTTGGCTGAATATTTTAGCCGCCCCAGTCAGTAATGACTGGGGCGTTTTTTA |
genomic subsequences of NC_003197 -200+100 around start codon.
| Locus_Genome_Range | Sequence | Gene |
|---|---|---|
| STM0366_NC_003197_-200+100 | CGTTCATCTTATTAATAGTCAAACCAGATGATTGCGAGTGAGATCACAAAGCAGGGGCGTTTTAATCCGCGTTGTTACGCCGACAGAGCGGGGGCTGACTGGATTTTTCCAGTAATCTACACTACTTATTTAATCAGTCCGAACGGCCTTTTTGTTCTGATAAAGCGATGATGGCGTAATAATAAAACGAGGGTTTTGCTATGAAAACTGGCTACAAGGTTATGCTTGGCGCATTAGCGTTTGTCGTGACAAACGTTTATGCCGCAGAAATCATGAAAAAAACGGACTTTGATAAAGTCG | yahO |
| STM0413_NC_003197_-200+100 | TCATGAAAGATAGTACTGTCGCCGCGTCTAAAATGCGCAAACGTGAACGCAATCGATTACGTAAATGATAGATATGTGAAACAAGACATATTTTTGTGAGCAATGATTTTTATAATAGGCTCCGCAGAAACACGAAATATTTAGAAACGCAAATTGCGTTCTTTTCACTCCCGCAAGGGATTTCAAACAGTGGCATACATATGAAAAAAACTTTACTCGCAGTCAGCGCAGCGCTGGCGCTCACCTCATCTTTTACTGCTAACGCAGCAGAAAATGATCAGCCGCAGTATTTGTCCGACT | tsx |
| STM0687_NC_003197_-200+100 | AGCAGTCGAATGTAACAGAAAGCAATTAAATATGTGCGGTTGCTCATATTATTACATACTGGTTACAGAAAGAGATTGATAATTCGCATCGCGAAAAATAGTCTATTTAACGTAGTAAATGAGGTTTCTCAGCGCTACTTTTTATTTTTTCGCTGTTCGCTTTTGTCGGCAGCAATTTATACGTCAAAGAGGATTAACTTATGCGTACGTTTAGTGGCAAACGTAGTACGCTGGCTCTGGCTATCGCCGGTATCACAGCAATGTCGGGGTGGATCGTTGTTCCGCAGGCGCAAGCCTCCG | ybfM/chiP |
| STM0833_NC_003197_-200+100 | CTGGATGAATGTATCGCGCCGCACGCGCATTATTGGTGCAATAAGCCGGAAAAGTGATGTTAATTGAATAAGATAGCGCGATATGGAAACGTTCTGTTACATGAAAGGCGCCCTTAGACACCGTGAATCGCAAAGAGTTTCCCATTAATTTTTGATATATTTAAAACTTAGGACTTACTTGAAGCACATTTGAGGTGGTTATGAAAAAAATTGCATGTCTTTCAGCACTGGCCGCTGTTCTGGCTTTTTCCGCAGGTACTGCAGTAGCTGCTACTTCTACCGTTACCGGTGGTTACGCTC | ompX |
| STM0959_NC_003197_-200+100 | TGAAATCTACGCATGGCGTGGACAGACGCCATTCGTGATGTCGATAGCTGCCGCGAGGCAACGGTCTTCTCACCATAGACCAGGCATTGCGCGCCGTTAATCCCTCTGGGTTTCGGTCTATCGTGATGGGCAGCGACTCTGAACAGTGATGTGAGTAGAGTCAGGCAGGAGTAGGGAAGGAATACAGAGAGACAATAATAATGGTAGATAGCAAGAAGCGCCCTGGCAAAGATCTCGACCGTATCGATCGTAACATTCTTAATGAACTGCAAAAGGATGGGCGTATTTCCAACGTCGAGC | lrp |
| STM0999_NC_003197_-200+100 | ACATATTATTTCCTTTTGAAACCAAATCTTTATCTTTGTAGCACTTTCACGGTAGCGAAACGTTAGTTTGAATGGAAAGATGCCTGTCAGACACATAAAGACACCAAACTCTCATCAATATTTCTGTAAAGTTTTATTGACGGAATTTATTGACGGCAGTGGCAGGTGTCATATAAAAAAACCAATGAGGGTAATAAATAATGATGAAGCGCAAAATCCTGGCAGCGGTGATCCCTGCCCTGCTGGCTGCTGCAACCGCAAACGCAGCAGAAATTTATAATAAAGATGGTAATAAGCTGG | ompF |
| STM1070_NC_003197_-200+100 | TGTTTTTTTCACATGTCTGACGGAGTTCACACTTGTAAGTTTCCAACTACGTTGTAGACTTTACATCGCCAGGGGTGCTCAGCATAAGCCGTAGATATCGGTAGAGTAACTATTGAGCAGATCCCCCGGTGAAGGATTTAACCGTGTTATCTCGTTGGAGATATTCATGGCGTATTTTGGATGATAACGAGGCGCAAAAAATGAAAAAGACAGCTATCGCGATTGCAGTGGCACTGGCTGGTTTCGCTACCGTAGCGCAGGCCGCTCCGAAAGATAACACCTGGTACGCTGGTGCTAAAC | ompA |
| STM1473_NC_003197_-200+100 | CACATACATGAATACATAATAACAAATATATTCACCATAAATATATGCGTTTCCGATAGTAACTTTTGTATTAATTAATAACATATAAGAAAAGTTAGCATTTGCTGAAATAATATTATTCAGATTAGGATGCCTTTGATTCAACGAATCTGTAGAAGTTCAATCTTTTGCAAATAAGTTAAGTTTTTAAGGATAAAAAAATGAAAAGAAAAGTATTGGCACTTGTCATCCCGGCTCTGCTGGCTGCTGGCGCAGCACACGCCGCTGAAATTTATAACAAAGACGGCAACAAACTGGACC | ompN |
| STM1572_NC_003197_-200+100 | TTAGACAGTCCCTATTTGAATTAATACTCTCAAATGTATTAAGGAGATCTCGATCACACAAATTAAAATAATTTGTAATCTTATGAAACTTATTATTGAACTTATGCCACTCCGTCATTTAAAAATAGTCTGCCATTGACAAACGCCTCGTTTAACAATGGTTGAGGAAACACGCTAAGAAAATTATAAGGATTATTAAAATGAAACTTAAGTTAGTGGCAGTGGCAGTGACTTCCCTGTTGGCAGCAGGCGTTGTAAATGCAGCCGAGGTATATAACAAAGACGGCAATAAACTGGATC | ompD |
| STM1682_NC_003197_-200+100 | CATGCGCTTCGCTTCGGCTACCACGCGCAATAACAAAAGGCGTATGTAACGGCCAGGTTTCCTCATATACCTTAACAGATCTCATCTCTTCCCCTCTGATAGCGCCGGACGCGGCTTGCTACAAATAGTATTTGCCGTGTTAATGAATAATGACTTAAACTGGATTTCGACGTTAACTATAAGTAAATAGGAACATAATTATGTCACAGACTGTACATTTCCAGGGTAACCCGGTCACCGTTGCCAACGTTATTCCGCAGGCTGGTAGCAAAGCACAGGCTTTTACTCTTGTCGCAAAAG | tpx |
| STM1732_NC_003197_-200+100 | GAGCAGACAAATATTTGCATAGCGTGAATATGTCAAAATTGATCTGAATTCCTATAACCAGGATTTTCAATACAAGTTCTAAATTAATCTGGATCAATAAATGTTAAATTATAAGAACAAATGTGATCTGTATTAGATCACTTATTACTTCATTGTGGGTATATTCATCACGCTTTTATAACCATAACGATGGAGCGGGTATGAAAAAATTTACAGTGGCGGCACTGGCGTTAACAACTCTTCTCTCAGGCAGCGCGTTCGCGCACGAAGCCGGAGAATTCTTTATGCGTGCAGGTCCGG | ompW |
| STM1995_NC_003197_-200+100 | GATAATTATAGAATATATATTCTTAGTTACTTATATAGTCTGTATTATAAAAAACCAAACAGAAACAAATTGAAATATTTTAAATACCTTTGTTACATGTTATTTTTTAAATTCCATGAACTTCATAGAATAGTATCAATTTGTAGTTTTGTTGAAGTGGCTACATATTCATATAAATTATTATCATAAGGGAATACATAATGAACAGAAAAGTTCTGGCACTGCTTGTCCCGGCGTTATTAGTGGCAGGCGCAGCAAATGCGGCTGAAGTTTATAATAAAAATGGCAACAAACTCGACC | ompS |
| STM2267_NC_003197_-200+100 | GCTTTAAAAAAGTTCCGTAAAATTCATATTTTGAAACATCTATGTAGATAACTGTAACATCTTAAAAGTTTTAGTATCATATTCGTGTTGGATTATTCTGTATTTTTGCGGAGAATGGACTTGCCGACTGGTTAATGAGGGTTAACCAGTAAGCAGTGGCATAAAAAAGCAATAAAGGCATATAACAGAGGGTTAATAACATGAAAGTTAAAGTACTGTCCCTCCTGGTACCAGCTCTGCTGGTGGCGGGCGCAGCGAATGCGGCTGAAATTTATAATAAAGACGGCAACAAATTAGACC | ompC |
| STM2391_NC_003197_-200+100 | TTAAGCCCGCCGATTTTGCCAGCCAGATCTCGTTTCTAAGATCACAATTGAAAAAACTTATAAACATACTTGCAACATTCTAGCTGGTCAGACCTATACTCTCGCCACTGGTCTGATTTCTAAGTCGTACCGCAGACCCTACACTTCGCGCTCCTGTTACAGCATGTAACATAGTTTGTATAAAAATAATCAATGAGGTTATGGTCATGAGCCAGAAAACCCTGTTTACAAAGTCTGCTCTCGCAGTCGCAGTGGCAATCATCTCCACCCAGGCCTGGTCTGCAGGCTTTCAGTTAAACG | fadL |
| STM2945_NC_003197_-200+100 | AACGACCATTTGCGGCGAATCATCTACCTTTTGTCTGAATTATCGTCACCACAAAGGATTACCAACCATAAATGTGCTGTATTAATAATGTCGTTCAAATTCTCTCCTGTAGTAAACTTTATCTGTTTAATAAAAAAGAGAGAATTGAACGATATATTTTACTCCGGATATTGAATAATATAAATTTGAAGGAAAATATTATGCCAGTCACTTTAAGCTTCGGTAATCATCAAAATTATACGCTTAATGAAAGTCGGCTTGCTCATCTGTTAAGCGCAGATAAAGAAAAAGCAATCCATA | sopD |
| STM2970_NC_003197_-200+100 | TCCATGTAAGAAGCGGATTATTGCATTTGAGATCGGGATCACTGATAGATTCATCACTTAAATGTATCTTTCCGCCCGAAAATTATTACGGCGAAAAATTATATAAAAAGCGTCCCTAAGCAGATTTCATTTTACGATCAGGTCTTTTTTCATTGGATTAGACCAGCAACCTGATTTTTAGCATCCTCCAGGAGAAATAGATGGAAACCACTCAGACCAGCACTATTGCTTCGATTGACTCTCGAAGCGCATGGCGCAAAACGGATACCATGTGGATGCTGGGCCTTTACGGCACGGCTA | sdaC |
| STM3930_NC_003197_-200+100 | CTTCTGATGACTTGAGCAGCGGATTGTGCTTATGGTGCTGCTCATTTACAACATAATCGATGATTTCTTACACAATAAGTGCATTTTTTTAATGCTCCATTTGCCATTTGTCCAAATTTAAGAAAATATTCGCAACAATCGATGTACCCATAACAATAACCGGTACTACCGGAACCGTTGCAAACACGACATGAGGATTTATGGCAGAGAAAAAACCGGAGCTACAGCGTGGGCTGGAAGCTCGTCATATTGAATTGATTGCCCTCGGGGGCACCATCGGCGTCGGACTCTTTATGGGCG | yifK |
| STM4231_NC_003197_-200+100 | GTTGGTAGAAGAGGGCGCCACATTCGCTATCGGCCTGCCGCCAGAGCGCTGTCATCTGTTCCGCGAGGATGGCAGCGCATGTCGTCGTCTGCATCAAGAGCCGGGTGTTTAAGGCCTCCATAAAAAAACGAAACGCAAAACCATTCGCAGTTTTAGAAGGTGGCAGCGTTTAAAGAAAAGCAATGATCTCAGGAGATAGAATGATGATTACTCTGCGCAAACTCCCACTGGCGGTTGCTGTCGCAGCGGGCGTAATGTCCGCTCAGGCAATGGCTGTCGATTTCCACGGTTACGCCCGTT | lamB |
genomic subsequences of NC_000913 -200+100 around start codon (continued in Table A6).
| Locus_Genome_Range | Sequence | Gene |
|---|---|---|
| b0411_NC_000913_-200+100 | AGGGCGAAAGTCAGTACAATCCCCGCCCGAATGTGTGTAAACGTGAACGCAATCGATTACGTAAATGATAGAACTGTGAAACGAAACATATTTTTGTGAGCAATGATTTTTATAATAGGCTCCTCTGTATACGAAATATTTAGAAACGCAATTTGCGCCTTTTTCACTCCCGCAAGGGATTTTCAAACAGTGGCATACATATGAAAAAAACATTACTGGCAGCCGGTGCGGTACTGGCGCTCTCTTCGTCTTTTACTGTCAACGCAGCTGAAAACGACAAACCGCAGTATCTTTCCGACT | tsx |
| b0592_NC_000913_-200+100 | ATAAGCGCAATGTGATGTCCTGCGCCGTTCTGCCCCCTCTCCCTTCCAGGGTGAGGGCTGGGGTGAGGGTTAATGTTCGCACCAGTGCTGGCTGTTCCCCTCACCCTAACCCTCTCCCCAAAGGGGCGAGGGGACGGATTGTGCGCTTTGTCGAATTTGTCATTACGCCCTTAACCTTATTAATAACAGGAAGCTGATTTGTGAGACTCGCCCCGCTCTACCGCAACGCCCTTCTATTAACAGGACTTTTGCTTTCAGGAATAGCCGCAGTTCAGGCCGCTGACTGGCCGCGTCAGATTA | fepB |
| b0721_NC_000913_-200+100 | TCCCGAGCCACCCAGCGTTGTAACGTGTCGTTTTCGCATCTGGAAGCAGTGTTTTGCATGACGCGCAGTTATAGAAAGGACGCTGTCTGACCCGCAAGCAGACCGGAGGAAGGAAATCCCGACGTCTCCAGGTAACAGAAAGTTAACCTCTGTGCCCGTAGTCCCCAGGGAATAATAAGAACAGCATGTGGGCGTTATTCATGATAAGAAATGTGAAAAAACAAAGACCTGTTAATCTGGACCTACAGACCATCCGGTTCCCCATCACGGCGATAGCGTCCATTCTCCATCGCGTTTCCG | sdhC |
| b0755_NC_000913_-200+100 | GTTACGCCCTCGTCATGAGGGCTTTATCTCATATTGTTCAAATCACCAGCAAACACCGACATATTTGCAACTCAATATTCACAACAACCTTACACTGCGCCACTATTTTCGCTATGGTTATGCGTAAGCATTGCTGTTGCTTCGTCGCGGCAATATAATGAGAATTATTATCATTAAAAGATGATTTGAGGAGTAAGTATATGGCTGTAACTAAGCTGGTTCTGGTTCGTCATGGCGAAAGTCAGTGGAACAAAGAAAACCGTTTCACCGGTTGGTACGACGTGGATCTGTCTGAGAAAG | gpmA |
| b0805_NC_000913_-200+100 | TCAAATATGAACTCAATGTAAATAAATGTATTTCTTTTTCGCGCAATGGGTGATAGAAAATCGCTCCAAGTGATAATGCTTATCAAAATTATTATCACTTTCACGAGCACTATCACGGGATTAACAGTGGCATCGCATCCGCAGAGAGGCTTTCTCGTGGCAGTGAAAATTTCAACATATAAGAAAAAGTCACCTGCAAAATGGAAAACAATCGCAATTTCCCTGCCAGACAATTTCATTCGCTCACGTTCTTTGCCGGTCTTTGTATTGGCATCACGCCTGTGGCTCAGGCACTCGCCG | fiu |
| b0814_NC_000913_-200+100 | CTGGATGAATGACAGGGAAAACATGCGTAATACTTACGCAGTTCTCTGAAAAAGTGATTTAAATTTAGATGGATAGCGGTGTATGGAAACGTTCTGTTACATGAAATGGCCCGTTAGACATCACAAATCGCGAAGAGTTTCCCATTAATTTTTGATATATTTAAAACTTAGGACTTATTTGAATCACATTTGAGGTGGTTATGAAAAAAATTGCATGTCTTTCAGCACTGGCCGCAGTTCTGGCTTTCACCGCAGGTACTTCCGTAGCTGCGACTTCTACTGTAACTGGCGGTTACGCAC | ompX |
| b0889_NC_000913_-200+100 | GTGAAATCTACGTATGGCGTGGACAGACGCCATTCGTGATGTCGATAGCTGCCACAAGGCAACGGTCTTCTCACCGTAGACCCAGGCATTGCGCGCCGTGAATCTTCATGATTTCGGTCTATCGTGACGGGTAGCGACTCTGAACAGTGATGTTTCAGGGTCAGACAGGAGTAGGGAAGGAATACAGAGAGACAATAATAATGGTAGATAGCAAGAAGCGCCCTGGCAAAGATCTCGACCGTATCGATCGTAACATTCTTAATGAGTTGCAAAAGGATGGGCGTATTTCTAACGTCGAGC | lrp |
| b1101_NC_000913_-200+100 | CATATGTTTTGTCAAAATGTGCAACTTCTCCAATGATCTGAAGTTGAAACGTGATAGCCGTCAAACAAATTGGCACTGAATTATTTTACTCTGTGTAATAAATAAAGGGCGCTTAGATGCCCTGTACACGGCGAGGCTCTCCCCCCTTGCCACGCGTGAGAACGTAAAAAAAGCACCCATACTCAGGAGCACTCTCAATTATGTTTAAGAATGCATTTGCTAACCTGCAAAAGGTCGGTAAATCGCTGATGCTGCCGGTATCCGTACTGCCTATCGCAGGTATTCTGCTGGGCGTCGGTT | ptsG |
| b1130_NC_000913_-200+100 | GAGCTATCACGATGGTTGATGAGCTGAAATAAACCTCGTATCAGTGCCGGATGGCGATGCTGTCCGGCCTGCTTATTAAGATTATCCGCTTTTTATTTTTTCACTTTACCTCCCCTCCCCGCTGGTTTATTTAATGTTTACCCCCATAACCACATAATCGCGTTACACTATTTTAATAATTAAGACAGGGAGAAATAAAAATGCGCGTACTGGTTGTTGAAGACAATGCGTTGTTACGTCACCACCTTAAAGTTCAGATTCAGGATGCTGGTCATCAGGTCGATGACGCAGAAGATGCCA | phoP |
| b1302_NC_000913_-200+100 | AGCCGGACGTTTGATTGCCGAACTGCTGCGCGGCGACGCCGAACGTTTCGATGCCTTCGCCAATCTGCCGCATTACCCGTTCCCCGGCGGGCGCACGCTGCGTGTGCCGTTTACCGCGATGGGCGCGGCGTATTACAGCCTGCGCGATCGTCTGGGCGTTTAATTTCCGATTAACCGTGAAGAGTCAAAAGGTGTGAAACATGAGCAACAATGAATTCCATCAGCGTCGTCTTTCTGCCACTCCGCGCGGGGTTGGCGTGATGTGTAACTTCTTCGCCCAGTCGGCTGAAAACGCCACGC | puuE |
| b1479_NC_000913_-200+100 | TAGTAAATAACCCAACCGGCAGAAAACGCCCCGCTGAAAAGTAATTCATAACCATCAGTCCTCAATGACGATTAAACACCATTGCCTGCGCAATGGTGTTTTTGTTTTTATCTGCTTTATACTTGAGGCCGACGCCCTGGCGGTAAAGCAAAGACGATAAAAGCCCCCCAGGGATGGATATTCAAAAAAGAGTGAGTGACATGGAACCAAAAACAAAAAAACAGCGTTCGCTTTATATCCCTTACGCTGGCCCTGTACTGCTGGAATTTCCGTTGTTGAATAAAGGCAGTGCCTTCAGCA | maeA |
| b1656_NC_000913_-200+100 | TCTCAGTGAAGACTACTGGCAGCGCCACTATGTTGGCGCTCGTCGGGTAATGACCCCAAAAACACTTCGCTAAAACTTTACCCTGTTGTTACGGCAACAGGGTAAGTTCATCTTTTGTCTCACCTTTTAATTTGCTACCCTATCCATACGCACAATAAGGCTATTGTACGTATGCAAATTAATAATAAAGGAGAGTAGCAATGTCATTCGAATTACCTGCACTACCATATGCTAAAGATGCTCTGGCACCGCACATTTCTGCGGAAACCATCGAGTATCACTACGGCAAGCACCATCAGA | sodB |
| b1740_NC_000913_-200+100 | TTCCGTCCTCTTGTTTATCAGCGTGTTAGATAAGCCTGGAATACATTGGGCGCTTTTTCAAGCCCGTGAACGAAACGGCTCCGCTTTCAGAGGATTCCTGTATGACGTTTTAACCACCATTCAGCCCGCTGTCGCTTGTCGTTTCAGTAGCAACGGGTTAGCTTTAAGGAAGTTTTGTCTTTTCTGTCTGGAGGGGTTCAATGACATTGCAACAACAAATAATAAAGGCGCTGGGCGCAAAACCGCAGATTAATGCTGAAGAGGAAATTCGTCGTAGTGTCGATTTTCTGAAAAGCTACC | nadE |
| b1761_NC_000913_-200+100 | TAACGGTAGCCGGGTGGCAAAACTTTAGCGTCTGAGTTATCGCATTTGGTTATGAGATTACTCTCGTTATTAATTTGCTTTCCTGGGTCATTTTTTTCTTGCTTACCGTCACATTCTTGATGGTATAGTCGAAAACTGCAAAAGCACATGACATAAACAACATAAGCACAATCGTATTAATATATAAGGGTTTTATATCTATGGATCAGACATATTCTCTGGAGTCATTCCTCAACCATGTCCAAAAGCGCGACCCGAATCAAACCGAGTTCGCGCAAGCCGTTCGTGAAGTAATGACCA | gdhA |
| b1824_NC_000913_-200+100 | GCCAGTTTAAGTATCTGCCTGAACTGGCAAGGTTAAGCACAATGATATATCGGCGCGTATTCCGTTGCATAAGTGTGCAAAAAAAGTGGAAGACGTATCGAGATTTGTGCGTCTGATCGAGACATGTTTAAAAATGGCTTGCCATAATTAACGTTGTATGTGATAACAGATTTCGGGTTAAACGAGGTACAGTTCTGTTTATGTGTGGCATTTTCAGTAAAGAAGTCCTGAGTAAACACGTTGACGTTGAATACCGCTTCTCTGCCGAGCCTTATATTGGTGCCTCATGCAGTAATGTGT | yobF |
| b1892_NC_000913_-200+100 | TCGATTTAGGAAAAATCTTAGATAAGTGTAAAGACCCATTTCTATTTGTAAGGACATATTAAACCAAAAAGGTGGTTCTGCTTATTGCAGCTTATCGCAACTATTCTAATGCTAATTATTTTTTACCGGGGCTTCCCGGCGACATCACGGGGTGCGGTGAAACCGCATAAAAATAAAGTTGGTTATTCTGGGTGGGAATAATGCATACCTCCGAGTTGCTGAAACACATTTATGACATCAACTTGTCATATTTACTACTTGCACAGCGTTTGATTGTTCAGGACAAAGCGTCCGCTATGT | flhD |
| b1901_NC_000913_-200+100 | TCCCGCTAAATTTATGCACGTTCTCACTGTAATTCTGCGATGTGATATTGCTCTCCTATGGAGAATTAATTTCTCGCTAAAACTATGTCAACACAGTCACTTATCTTTTAGTTAAAAGGTAATGCTTTGTTTTCCGATTAATTTAACGAATGTCATTCGTTTTTGCCCTACACAAAACGACACTAAAGCTGGAGAGAACCATGCACAAATTTACTAAAGCCCTGGCAGCCATTGGTCTGGCAGCCGTTATGTCACAATCCGCTATGGCGGAGAACCTGAAGCTCGGTTTTCTGGTGAAGC | araF |
| b2153_NC_000913_-200+100 | CTGTGAGTAACTTTCACTTCCGTATTTGCATAACGATGTTTTAACATCTGCTGATGAAAGGCAGCGGCAATTACAATAATTATCGCTGTGAATACTGGATTATGTGCGCCGCCTCACGCACAATAATCAGGCTGTAAATCAGCTTAATAACTTTGCCCCCACGCAGGGCGGAGGCGTCACACCTGCAGGAGAAATCATAAATGCCATCACTCAGTAAAGAAGCGGCCCTGGTTCATGAAGCGTTAGTTGCGCGAGGACTGGAAACACCGCTGCGCCCGCCCGTGCATGAAATGGATAACG | folE |
| b2155_NC_000913_-200+100 | GGATTGATAATTGTTATCGTTTGCATTATCGTTACGCCGCAATCAAAAAAGGCTGACAAATCAGAGGCTGTTCCGGCTTTCTGGGATGATCACCTGCATAAAAAATAAGTCCACCGCGATGCTGCCGTACGCAAGGGGACGTGAAGAAGATGTGAGCGATAACCCATTTTATTTTCGTAGTTACCTCATGGAGATATGGAATGTTTAGGTTGAACCCTTTCGTACGGGTCGGGCTGTGTTTGTCCGCTATTTCTTGTGCATGGCCTGTGTTAGCGGTCGATGATGATGGCGAAACGATGG | cirA |
| b2303_NC_000913_-200+100 | TGGGTTAATGCCTGGACTCGCCAGCGAATTGACCTAGCAATGTATCCGGCAGTCAAGAACTGGCATGAGCGGATCCGTTCGCGCCCTGCCACCGGGCAGGCACTGCTAAAAGCACAACTCGGTGATGAGCGTTCGGATAGTTAACAGAAACAGGTTCTCGTGTATTATTTCATCCTAAGTAAAACAACGGAGAACCTGCAATGGCACAACCTGCCGCTATTATTCGTATAAAGAACCTTCGTTTGCGTACGTTTATCGGAATTAAGGAAGAAGAAATTAACAACCGTCAGGATATTGTTA | folX |
| b2531_NC_000913_-200+100 | ATGGCGTTCACGCCGCATCCGACAACAGGTACAAACGCCACGATAAAAAAATGGCACTGAAGGTTAAATACCCGACTAAATCAGTCAAGTAAATAGTTGACCAATTTACTCGGGAATGTCAGACTTGACCCTGCTATGCAATACCCCCACTTTTACAATAAAAAACCCCGGGCAGGGGCGAGTTTGAGGTGAAGTAAGACATGAGACTGACATCTAAAGGGCGCTATGCCGTGACCGCAATGCTTGACGTTGCGCTCAACTCTGAAGCGGGCCCGGTACCGTTGGCTGATATTTCCGAAC | iscR |
| b2594_NC_000913_-200+100 | CCCCGAGCAACCCGCCAAAAACAGGCTTAGTGTGGCGGCTGCCACCAGATATTTCATGCGCGTCATGACGTTTTGACTTTCCTCAAAATGTAATACGGGAGATTCTCTGTTCCTGCTCCCGGTTAAGACCAGCTACAATAGCACACTATATTAAACGGCAAAGCCGTAAAACCCCAACGATAAACGAAGAAGCAGTATATATGGCACAACGAGTACAGCTCACTGCAACGGTGTCCGAAAACCAACTCGGTCAACGCTTAGATCAGGCTTTGGCCGAAATGTTCCCGGATTATTCACGTT | rluD |
| b2666_NC_000913_-200+100 | TTGCGCGAGTTCAGTCATATTTATTTAAGTATTTTCTAAATTAAGTAAACTCTAAACTAAAAATGCAACATATACCAGCCTCAGCAGCGTAAATGAGAGTAAAAGCGTAAGCTGAAACTGGCAGGCTCCGCTAAAATTACTACGCTTAAGAGATAAAATCTCTTTTTAAACAATGAGTAATTTTCTTATAGGGAGTACATATGGGTTTCTGGAGAATCGTCATCACCATCATTCTGCCGCCGCTCGGCGTGCTGCTCGGTAAAGGGTTCGGTTGGGCGTTCATTATTAATATTCTGTTGA | yqaE |
| b2687_NC_000913_-200+100 | GAAGCCGCTGATACCGAACCGTTTGCGGTGTGGCTGGAAAAACACGCCTGACAGAAAAGAAAAAGGCCACTCGTGAGTGGCCAAAATTTCATCTCTGAATTCAGGGATGATGATAACAAATGCGCGTCTTTCATATACTCAGACTCGCCTGGGAAGAAAGAGTTCAGAAAATTTTTAAAAAAATTACCGGAGGTGGCTAAATGCCGTTGTTAGATAGCTTCACAGTCGATCATACCCGGATGGAAGCGCCTGCAGTTCGGGTGGCGAAAACAATGAACACCCCGCATGGCGACGCAATCA | luxS |
genomic subsequences of NC_000913 -200+100 around start codon (continuation of Table A5).
| Locus_Genome_Range | Sequence | Gene |
|---|---|---|
| b2702_NC_000913_-200+100 | CGCACAAGGAAGCGGTAGTCACTGCCCGATACGGACTTTACATAACTCAACTCATTCCCCTCGCTATCCTTTTATTCAAACTTTCAAATTAAAATATTTATCTTTCATTTTGCGATCAAAATAACACTTTTAAATCTTTCAATCTGATTAGATTAGGTTGCCGTTTGGTAATAAAACAATAAATCCTGAAGGAGAGAACAATGATAGAAACCATTACTCATGGTGCAGAGTGGTTTATCGGGCTGTTCCAAAAGGGCGGAGAGGTGTTTACCGGGATGGTGACCGGCATTCTTCCGCTGT | srlA |
| b2715_NC_000913_-200+100 | TGCACAATCGGCGGGAAAAATATTCAGGTGACCGGTTTCACAAATATAAAAAATGAACAATTCACTCTCTTGCTTATTTAGTGACAACTATTCATGATTTTGTGAAACCGGTTTCTTAATTCCGTTTCAGCATCGGCATTTTTCCGTCACGTCGACTGATAACAACTACATCTACCCTACTGATAACAGGATAAAATCCGATGGCCAAAAATTATGCGGCGCTGGCACGCTCGGTGATAGCGGCACTGGGCGGCGTTGATAACATCTCGGCGGTCACGCACTGTATGACGCGGTTGCGCT | ascF |
| b2731_NC_000913_-200+100 | ACTGGGGAAAGACGCGGCGCTGATTGGTGAAGTGGTGGAACGTAAAGGTGTTCGTCTTGCCGGTCTGTATGGCGTGAAACGAACCCTCGATTTACCACACGCCGAACCGCTTCCGCGTATATGCTAATAAAATTCTAAATCTCCTATAGTTAGTCAATGACCTTTTGCACCGCTTTGCGGTGCTTTCCTGGAAGAACAAAATGTCATATACACCGATGAGTGATCTCGGACAACAAGGGTTGTTCGACATCACTCGGACACTATTGCAGCAGCCCGATCTGGCCTCGCTGTGTGAGGCTC | fhlA |
| b2741_NC_000913_-200+100 | CGGGAACAACAAGAAGTTAAGGCGGGGCAAAAAATAGCGACCATGGGTAGCACCGGAACCAGTTCAACACGCTTGCATTTTGAAATTCGTTACAAGGGGAAATCCGTAAACCCGCTGCGTTATTTGCCGCAGCGATAAATCGGCGGAACCAGGCTTTTGCTTGAATGTTCCGTCAAGGGATCACGGGTAGGAGCCACCTTATGAGTCAGAATACGCTGAAAGTTCATGATTTAAATGAAGATGCGGAATTTGATGAGAACGGAGTTGAGGTTTTTGACGAAAAGGCCTTAGTAGAACAGG | rpoS |
| b2801_NC_000913_-200+100 | ATGGTAGTCACATAAAGTCACCTTCTAGCTAATAAGTGTGACCGCCGTCATATTACAGAGCGTTTTTTATTTGAAAATGAATCCATGAGTTCATTTCAGACAGGCAAATATTCACTGATATGAAGCCCGAACTCGCTGGTTTTGCACTTTTGAAAACATAACCGATTACGTGCTTAAGCTTCTGAACCTAAGAGGATGCTATGGGAAACACATCAATACAAACGCAGAGTTACCGTGCGGTAGATAAAGATGCAGGGCAAAGCAGAAGTTACATTATTCCATTCGCGCTGCTGTGCTCAC | fucP |
| b3224_NC_000913_-200+100 | CGCTGTGCCGCAAACCGTTTGGACCGGTAGATGAAAAATATCTGCCAGAACTGAAGGCGCTGGCCCAGCAGTTGATGCAAGAGCGCGGGTGAGTTGTTTCCCCTCGCTCGCCCCTACCGGGTGAGGGGAAATAAACGCATCTGTACCCTACAATTTTCATACCAAAGCGTGTGGGCATCGCCCACCGCGGGAGACTCACAATGAGTACTACAACCCAGAATATCCCGTGGTATCGCCATCTCAACCGTGCACAATGGCGCGCATTTTCCGCTGCCTGGTTGGGATATCTGCTTGACGGTT | nanT |
| b3365_NC_000913_-200+100 | ATCTATTTCTATAAACCCGCTCATTTTGTCTATTTTTTGCACAAACATGAAATATCAGACAATTCCGTGACTTAAGAAAATTTATACAAATCAGCAATATACCCATTAAGGAGTATATAAAGGTGAATTTGATTTACATCAATAAGCGGGGTTGCTGAATCGTTAAGGTAGGCGGTAATAGAAAAGAAATCGAGGCAAAAATGAGCAAAGTCAGACTCGCAATTATCGGTAACGGTATGGTCGGCCATCGCTTTATCGAAGATCTTCTTGATAAATCTGATGCGGCCAACTTTGATATTA | nirB |
| b3546_NC_000913_-200+100 | CTAAAGTCTCTTTTCAAACTTGCATTTTTGTAAATTTGTGCTTCATGCACACTCTTTCCCCACACTTTTTCCCTTTGCTGTGGTCTACTTATTCGCGCGTGTAGATTTTACTTATCTGACTACCTCCGCACTTTTTCCCTGCCGGGCCTGAAAAGCCACTAAGCAGGGTGTTATCACCTGTTTGTCCAGGGTTTGTTTGCATGAGATACATCAAATCGATTACACAGCAGAAGCTGAGCTTTTTGCTTGCAATCTATATTGGCCTTTTTATGAATGGCGCGGTTTTTTACCGCCGCTTCG | eptB |
| b3962_NC_000913_-200+100 | TACGTACAGCGGAAACCTGCCGCTTAAACGGAGAGTATCGTCGATAAAAATCCAATAAAACGTCAGGGCAAAAGTAAGAAACAGACAAAGCAAAGGCCGCTCAGGATATAGCCAGATAAATGACGGGGATCAATTGGCTTACCCGCGATAAAATGTTACCATTCTGTTGCTTTTATGTATAAGAACAGGTAAGCCCTACCATGCCACATTCCTACGATTACGATGCCATAGTAATAGGTTCCGGCCCCGGCGGCGAAGGCGCTGCAATGGGCCTGGTTAAGCAAGGTGCGCGCGTCGCAG | sthA |
| b4311_NC_000913_-200+100 | GTATTTAATCTGGATCTCTGTTTATTTAAATAATGTGAAAAGAGATTTTTCACAGGAGACCTTATACAAAAAAATATAAAATACAGCTACCGGTTGCCAAAGACACTATAAGCCTGGCAAAAAAATATTACACAACATAAATGCTAATTGTTTATGCGGGCTTTGTATTGCTTTCTGTATCCTACAAATGAGTGAAATTTATGAAAAAGGCTAAAATACTTTCTGGCGTATTATTACTGTGCTTTTCGTCCCCATTAATTTCTCAGGCTGCGACACTGGACGTACGTGGTGGATATCGTA | nanC |
Figure 2Workflow of CopomuS to generate and rank CoM candidates based on IntaRNA RRI predictions and respective characteristics.
Figure 3Characteristics of CoMs known from the literature. (A) Distribution of mutation types where the first two letters encode the wildtype nucleotides and the last two the respective mutated bases (both lex-sorted to reduce classes). For instance, AUCG represents both an AU or a UA wildtype base pair mutated either to CG or GC; (B) Rank of the RRI containing a CoM base pair among IntaRNA’s energy-sorted suboptimal RRI list.
Figure 4(A) Minimum free energy (MFE) distributions of known single-nt GC-mutating CoMs and their background model. The ’Mutations’ data (blue hues) covers 22 CGCG CoMs known from literature, while the ’Background’ data (orange hues) aggregates all remaining 207 CGCG CoMs from the MFE RRIs containing the known CoMs. There are four possible sequence combinations, referring to sRNA-mRNA pairs with respective (w)ildtype/(m)utant annotation. That is, an interaction of wildtype sRNA with the mutated mRNA is denoted by ’wm’, while e.g., ’mm’ refers to the interaction of sRNA and mRNA mutant. Each subplot provides the p-value of the sample t-test comparing the respective distributions. Dotted lines mark mean values, while dashed black lines highlight an energy difference of zero; (B) Pairwise energy differences of mutant combinations compared to wildtype-only ’ww’ or mutant-only ’mm’ MFEs for the CoMs of both the Mutations and Background data set. That is, e.g., ’wm-mm’ refers to the energy difference of a ’wm’ interaction and the respective mutant-only ’mm’ interaction energy. For each data set, p-values of paired sample t-tests that compare the values with the respective reference MFEs (’ww’ or ’mm’) are provided. The minDeltaE feature is defined as (min(MFE(mw),MFE(wm))-max(MFE(ww),MFE(mm))). For further details, see text.
Figure 5Effect of thresholds on MFE-difference-based CoM classification. Each solid line represents the number of valid CoM candidates for an RNA pair with a rank not higher than the known CoM from literature (designated as CoM*; colors differentiate between the single CoMs). The left-most data points represent the overall numbers of CoM candidates in the RNA pair that harbors the respective CoM*. The black dotted line provides the average over all RNA pairs for . The red dotted line depicts the number of RNA pairs for which the known CoM* does not fulfill the energy constraint. (A) Results for equal values of and ; (B) Results for explicit value combinations of and in range [0.5, 2].
Figure 6Effect of different classification and sorting combinations on CoM candidate set sizes. Each solid line represents the number of valid CoM candidates for an RNA pair with a rank not higher than the known CoM from literature (designated as CoM*; colors differentiate between the single CoMs). The left-most data points represent the overall numbers of CoM candidates. The black dotted line provides the average over all RNA pairs. The red dotted line depicts the number of RNA pairs for which the known CoM* does not fulfill the constraints.