Junping Peng, Jian Yang, Qi Jin.
Abstract
BACKGROUND: The completion of numerous genome sequences introduced an era of whole-genome study. However, many genes are missed during genome annotation, including small RNAs (sRNAs) and small open reading frames (sORFs). In order to improve genome annotation, we aimed to identify novel sRNAs and sORFs in Shigella, the principal etiologic agents of bacillary dysentery. METHODOLOGY/PRINCIPAL
Year: 2011 PMID: 21483688 PMCID: PMC3071730 DOI: 10.1371/journal.pone.0018509
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Summary of newly confirmed sRNAs in chromosome and virulence plasmid of Shigella flexneri strain 301.
| sRNA gene | Adjacent genes | Strand | Northern size (nt) | 5′ end | 3′ end | Method |
| pssrA | CP0121/ipaJ | ←→→ | ∼90 | ∼103842 | 103931 | R/M/P |
| pssrB | virG/CP0183 | →→→ | ∼200 | ∼152821 | 153020 | R/M/P |
| cssrA | map/rpsB | ←→→ | ∼110 | ∼181629 | 181738 | R/M/P |
| cssrB | SF2021/SF2022 | →←→ | ∼180 | ∼2046404 | 2046225 | R/M/P |
| cssrC | SF2042/SF2043 | ←←← | ∼340 | ∼2064237 | 2063898 | R/M/P |
| cssrD | rpsP/ffh | ←←← | ∼200 | ∼2745060 | 2744861 | R/M/P |
| cssrE | yggN/yggL | ←←← | ∼140 | ∼3043882 | 3043743 | R/M/P |
| cssrF | dacB/yhbZ | →←← | ∼290 | ∼3322880 | 3322591 | R/M/P |
| cssrG | rbsB/rbsK | →→→ | ∼230 | ∼3946524 | 3946755 | R/M/P |
The middle arrow represents the sRNA gene, and the flanking arrows indicate the orientation of the adjacent genes. Genes present on the strand given in the S. flexneri strain 301 genome database are indicated by (→), and genes present on the complementary strand are indicated by (←).
The sRNA 3′ boundaries are from rho-independent terminator predictions. The 5′ boundaries are calculated from the 3′ ends and the Northern blot results.
sRNAs were predicted based on different methods. R, RNAz prediction; M, tiling array hybridization; P, RT-PCR verification.
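The 5′ boundary rule in the note above amounts to simple coordinate arithmetic. The following is a minimal sketch, not the authors' code. It assumes 1-based inclusive coordinates and treats the approximate Northern size as the transcript length. The function name `estimate_five_prime` and its parameters are hypothetical.

```python
def estimate_five_prime(three_prime: int, northern_size: int, forward_strand: bool) -> int:
    """Estimate the 5' coordinate of an sRNA from its predicted 3' end.

    Assumptions (not stated in the source): coordinates are 1-based and
    inclusive, and the Northern blot size approximates the transcript length.
    """
    if forward_strand:
        # Transcript runs left to right, so the 5' end lies upstream (smaller coordinate).
        return three_prime - northern_size + 1
    # Transcript on the complementary strand, so the 5' end has the larger coordinate.
    return three_prime + northern_size - 1


# pssrA: 3' end 103931, ~90 nt, database strand
print(estimate_five_prime(103931, 90, True))     # 103842
# cssrB: 3' end 2046225, ~180 nt, complementary strand
print(estimate_five_prime(2046225, 180, False))  # 2046404
```

Because the Northern sizes are approximate, any 5′ coordinate obtained this way is itself only an estimate, which is why the table marks those values with ∼.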
Figure 1. Detection of small RNAs by Northern blot analyses.
Northern blots were performed with total RNA using strand-specific probes as described in Materials and Methods. The sizes of the RNA markers are indicated on the left. 5S RNA was used as a control.
Summary of candidate sORFs in chromosome and virulence plasmid of Shigella flexneri strain 301.
| ID | Location | Length (amino acids) | Strand | Description |
| Chromosome | | | | |
| BIO00004 | 15610–15401 | 70 | − | regulatory protein mokC |
| BIO00051 | 259932–259741 | 64 | − | hypothetical protein |
| BIO00126 | 511407–511556 | 50 | + | putative small toxic membrane polypeptide |
| BIO00127 | 511910–512059 | 50 | + | putative small toxic membrane polypeptide |
| BIO00144 | 583036–582926 | 37 | − | putative outer membrane lipoprotein, cyd operon protein |
| BIO00301 | 1056382–1056486 | 35 | + | hypothetical protein |
| BIO00533 | 1577459–1577376 | 28 | − | hypothetical protein |
| BIO00534 | 1577818–1577543 | 92 | − | hypothetical protein |
| BIO00587 | 1717264–1717148 | 40 | − | hypothetical protein |
| BIO00620 | 1809435–1809527 | 31 | + | hypothetical protein |
| BIO00669 | 1894482–1894333 | 51 | − | hypothetical protein |
| BIO00670 | 1894620–1894501 | 40 | − | hypothetical protein |
| BIO00790 | 2213607–2213521 | 29 | − | hypothetical protein |
| BIO00803 | 2238453–2238557 | 35 | + | hypothetical protein |
| BIO00855 | 2421445–2421317 | 43 | − | hypothetical protein |
| BIO00864 | 2469896–2469615 | 94 | − | hypothetical protein |
| BIO00898 | 2585789–2585685 | 35 | − | hypothetical protein |
| BIO00936 | 2769587–2769432 | 52 | − | predicted membrane protein (regulated by cyaR sRNA) |
| BIO01076 | 3201904–3202023 | 40 | + | hypothetical protein |
| BIO01336 | 4066446–4066339 | 36 | − | hypothetical protein |
| Virulence plasmid (VP) | | | | |
| BIO01501b | 9285–9443 | 53 | + | hypothetical protein |
| BIO01567 | 67854–68126 | 91 | + | hypothetical protein |
| BIO01585 | 91670–91422 | 83 | − | hypothetical protein |
| BIO01587 | 91991–91860 | 44 | − | putative arylsulfatase regulatory protein |
| BIO01595 | 105022–105132 | 37 | + | hypothetical protein |
| BIO01608 | 135447–135677 | 77 | + | hypothetical protein |
| BIO01637 | 153138–153392 | 85 | + | adhesion protein, fragment |
| BIO01674 | 183288–183455 | 56 | + | hypothetical protein |
| BIO01675 | 183646–183792 | 49 | + | hypothetical protein |
Newly identified sORFs were not annotated in any genome.
These sORFs were annotated in only one E. coli or Shigella strain.