| Literature DB >> 24039725 |
Minenosuke Matsutani1, Motohiko Ogawa, Naohisa Takaoka, Nozomu Hanaoka, Hidehiro Toh, Atsushi Yamashita, Kenshiro Oshima, Hideki Hirakawa, Satoru Kuhara, Harumi Suzuki, Masahira Hattori, Toshio Kishimoto, Shuji Ando, Yoshinao Azuma, Mutsunori Shirai.
Abstract
Rickettsia japonica is an obligate intracellular alphaproteobacteria that causes tick-borne Japanese spotted fever, which has spread throughout East Asia. We determined the complete genomic DNA sequence of R. japonica type strain YH (VR-1363), which consists of 1,283,087 base pairs (bp) and 971 protein-coding genes. Comparison of the genomic DNA sequence of R. japonica with other rickettsiae in the public databases showed that 2 regions (4,323 and 216 bp) were conserved in a very narrow range of Rickettsia species, and the shorter one was inserted in, and disrupted, a preexisting open reading frame (ORF). While it is unknown how the DNA sequences were acquired in R. japonica genomes, it may be a useful signature for the diagnosis of Rickettsia species. Instead of the species-specific inserted DNA sequences, rickettsial genomes contain Rickettsia-specific palindromic elements (RPEs), which are also capable of locating in preexisting ORFs. Precise alignments of protein and DNA sequences involving RPEs showed that when a gene contains an inserted DNA sequence, each rickettsial ortholog carried an inserted DNA sequence at the same locus. The sequence, ATGAC, was shown to be highly frequent and thus characteristic in certain RPEs (RPE-4, RPE-6, and RPE-7). This finding implies that RPE-4, RPE-6, and RPE-7 were derived from a common inserted DNA sequence.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24039725 PMCID: PMC3767692 DOI: 10.1371/journal.pone.0071861
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Circular exhibition of the Rickettsia japonica YH genome.
The outermost scale is marked for nucleic acid position in Mbp, and Region-1 and -2 (red). From the outside (track 1), gene positions and directions (clockwise on the outside and anti-clockwise on the inside) of each gene were classified and colored based on COGs [45]. Track 2 and 3: tRNA (red) and rRNA (blue), respectively. Track 4: GC skew, outside yellow and inside purple indicate values >0 and values <0 as calculated by (G−C/G+C) [46]. Track 5: innermost, GC contents. Based on the cumulus of the GC skew values, accompanied by other Richettsia genomes, a hypothetical origin, shown as ori, was determined and the base numbers were counted from the origin.
List of genes with RPE-4, RPE-6 and RPE-7 and insertion lengths in R. japonica.
|
|
|
|
|
|
| |||
| Gene ID | RPE length | RPE group | RPE group | RPE group | RPE group | RPE group | RPE group |
|
| RJP_0284 | 84 | RPE-4 | RPE-4 | RPE-4 | ND | ND | ND | multisubunit Na+/H+ antiporter, mnhB subunit |
| RJP_0560 | 142 | RPE-4 | RPE-4 | RPE-4 | RPE-4 | ND | ND | putative aminomethyltransferase |
| RJP_0434 | 98 | RPE-4 | RPE-4 | RPE-4 | RPE-6 | ND | ND | recB family exonuclease |
| RJP_0891 | 114 | RPE-4 | RPE-4 | RPE-4 | RPE-6 | ND | ND | DNA polymerase III subunit alpha |
| RJP_0222 | 115 | RPE-4 | RPE-4 | RPE-4 | RPE-7 | ND | ND | tRNA ribosyltransferase-isomerase |
| RJP_0300 | 90 | RPE-6 | RPE-6 | RPE-6 | RPE-7 | ND | ND | NADH dehydrogenase subunit N |
| RJP_0790 | 192 | RPE-4 | RPE-4 | RPE-4 | RPE-7 | ND | ND | proline/betaine transporter |
| RJP_0384 | 137 | RPE-4 | RPE-4 | RPE-4 | RPE-7 | ND | RPE-6 | putative lipid A core - O-antigen ligase |
| RJP_0202 | 150 | RPE-6 | RPE-6 | RPE-6 | RPE-6 | ND | RPE-4 | bifunctional penicillin-binding protein 1C |
| RJP_0524 | 96 | RPE-6 | RPE-6 | RPE-6 | RPE-4 | ND | ND | biotin-(acetyl-CoA carboxylase) ligase |
| RJP_0634 | 101 | RPE-6 | RPE-6 | RPE-6 | RPE-4 | ND | ND | excinuclease ABC subunit C |
| RJP_0964 | 108 | RPE-6 | RPE-6 | RPE-6 | RPE-7 | ND | ND | tRNA (guanine-N(7)-)-methyltransferase |
| RJP_0013 | 68 | RPE-7 | RPE-7 | RPE-7 | RPE-7 | ND | RPE-7 | ABC transporter substrate binding protein |
| RJP_0722 | 144 | RPE-7 | RPE-7 | RPE-7 | RPE-6 | ND | ND | ATP-dependent DNA helicase |
| RJP_0124 | 114 | RPE-7 | RPE-7 | RPE-7 | RPE-6 | ND | ND | putative nucleoside-diphosphate-sugar epimerase |
| RJP_0449 | 108 | RPE-7 | RPE-7 | RPE-7 | RPE-7 | ND | ND | phenylalanyl-tRNA synthetase beta subunit |
| RJP_0691 | 144 | RPE-7 | RPE-7 | RPE-7 | RPE-7 | ND | ND | isoleucyl-tRNA synthetase |
| RJP_0552 | 96 | RPE-7 | RPE-7 | RPE-7 | RPE-7 | RPE-4 | ND | soluble lytic murein transglycosylase |
ND: typical RPE-4, RPE-6 and RPE-7 were not detected while orthologous genes and insertions exist.
Figure 2Multiple alignments of amino acid and DNA sequences, including inserted DNA sequences.
Each panel shows A–C) DNA polymerase III alpha chain; D–F) tRNA ribosyltransferase-isomerase; G–I) phenylalanyl-tRNA synthetase subunit beta; A, D, G) amino acid alignments; B, E, H) structure models; C, F, I) DNA alignments. Letters in alignments indicate green, typical RPE-4s; blue, typical RPE-6s; purple, typical RPE-7s; red, extended RPE-4 regions; black, boundary regions of genes with inserted DNA sequences. Amino acids encoded by inserted DNA sequences were shown by red balls in the protein structures. Gene sequences were extracted from each genome sequence; Rickettsia japonica (this work); R. conorii, AE006914 [13]; R. felis, CP000053 [28]; R. prowazekii, AJ235269 [29]; R. typhi, AE017197 [30]; Acetobacter pasteurianus IFO 3283-01, AP011121 [47]. Protein Data Bank Identifiers (PDB IDs) for amino acid sequences of Escherichia coli DNA polymerase III alpha chain, Bacillus subtilis tRNA ribosyltransferase-isomerase, and Thermus thermophilus phenylalanyl-tRNA synthetase subunit beta are 2hnhA, 1yy3A, and 1b70B, respectively. J) The highly conserved short sequence in e-RPEs were illustrated using a sequence LOGO format [33].
Frequency of the ATGAC sequences in extended RPE-4s (eRPE-4s).
| Regions | Category of eRPE-4 | Length (bp) | Amount of ATGAC | Frequency (per kb) | |
| eRPE-4 | eRPE-4 including RPE-4 | Total | 2,243 | 35 | 15.6 |
| RPE-4 | 806 | 27 | 33.5 | ||
| Extended | 1,437 | 8 | 5.6 | ||
| eRPE-4 including RPE-6 | Total | 1,805 | 35 | 19.4 | |
| RPE-6 | 1,235 | 20 | 16.2 | ||
| Extended | 570 | 15 | 26.3 | ||
| eRPE-4 including RPE-7 | Total | 2,291 | 33 | 14.4 | |
| RPE-7 | 878 | 18 | 20.5 | ||
| Extended | 1,413 | 15 | 10.6 | ||
| eRPE-4 without RPE-4,6,7 | Extended | 3,565 | 17 | 4.8 | |
| RPE-1,-2,-3,-5 | 24,452 | 3 | 0.1 | ||
| Other regions | 1,248,731 | 2,311 | 1.9 | ||
| Whole genome of | 1,283,087 | 2,417 | 1.9 | ||
ND: Not detected.