| Literature DB >> 28257489 |
Motoshige Yasuike1, Issei Nishiki1, Yuki Iwasaki1, Yoji Nakamura1, Atushi Fujiwara1, Yoshiko Shimahara2, Takashi Kamaishi3, Terutoyo Yoshida4, Satoshi Nagai1, Takanori Kobayashi5, Masaya Katoh1.
Abstract
Nocardiosis caused by Nocardia seriolae is one of the major threats in the aquaculture of Seriola species (yellowtail; S. quinqueradiata, amberjack; S. dumerili and kingfish; S. lalandi) in Japan. Here, we report the complete nucleotide genome sequence of N. seriolae UTF1, isolated from a cultured yellowtail. The genome is a circular chromosome of 8,121,733 bp with a G+C content of 68.1% that encodes 7,697 predicted proteins. In the N. seriolae UTF1 predicted genes, we found orthologs of virulence factors of pathogenic mycobacteria and human clinical Nocardia isolates involved in host cell invasion, modulation of phagocyte function and survival inside the macrophages. The virulence factor candidates provide an essential basis for understanding their pathogenic mechanisms at the molecular level by the fish nocardiosis research community in future studies. We also found many potential antibiotic resistance genes on the N. seriolae UTF1 chromosome. Comparative analysis with the four existing complete genomes, N. farcinica IFM 10152, N. brasiliensis HUJEG-1 and N. cyriacigeorgica GUH-2 and N. nova SH22a, revealed that 2,745 orthologous genes were present in all five Nocardia genomes (core genes) and 1,982 genes were unique to N. seriolae UTF1. In particular, the N. seriolae UTF1 genome contains a greater number of mobile elements and genes of unknown function that comprise the differences in structure and gene content from the other Nocardia genomes. In addition, a lot of the N. seriolae UTF1-specific genes were assigned to the ABC transport system. Because of limited resources in ocean environments, these N. seriolae UTF1 specific ABC transporters might facilitate adaptation strategies essential for marine environment survival. Thus, the availability of the complete N. seriolae UTF1 genome sequence will provide a valuable resource for comparative genomic studies of N. seriolae isolates, as well as provide new insights into the ecological and functional diversity of the genus Nocardia.Entities:
Mesh:
Year: 2017 PMID: 28257489 PMCID: PMC5336288 DOI: 10.1371/journal.pone.0173198
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Sequencing statistics of the N. seriolae UTF1 genome.
| Number of Reads | 227,796 |
| N50 Read Length (bp) | 7,364 |
| Mean Read Length (bp) | 4,756 |
| Number of Bases | 1,083,416,572 |
| Average Reference Coverage | 113.01 |
| Number of contigs | 2 |
Fig 1Comparative genomic map of N. seriolae UTF1 genome and the other four Nocardia complete genomes.
The BLASTN-based ring image was generated by BLAST Ring Image Generator (BRIG) version 0.95 [32]. The innermost two rings show GC content (black) and GC skew (purple/green). The remaining rings (rings 3–6) represent a BLASTN comparison with complete genome of N. farcinica IFM 10152 (magenta), N. brasiliensis HUJEG-1(cyan), N. cyriacigeorgica GUH-2 (blue) and N. nova SH22a (pale blue). Bars indicate the position of mobile-element related genes in the N. seriolae UTF1 genome such as transposases (black), endonuclease DDE (blue) and integrase (green). The outermost red boxes highlight the 8 mce operons found in the N. seriolae UTF1 genome.
Comparison of genomic features of N. seriolae UTF1 and the other four Nacardia spp. complete genomes.
| Species | host | Accesion number | Size (bp) | GC% | CDS | rRNA | tRNA | Reference |
|---|---|---|---|---|---|---|---|---|
| AP017900 | 8,121,733 | 68.1 | 7,697 | 4 | 62 | This study | ||
| AP006618 | 6,021,225 | 70.8 | 5,674 | 3 | 53 | [ | ||
| CP003876 | 9,436,348 | 68.0 | 8,414 | 3 | 51 | [ | ||
| FO082843 | 6,194,645 | 68.4 | 5,491 | 3 | 49 | [ | ||
| A root of | CP006850 | 8,348,532 | 67.8 | 7,583 | 3 | 49 | [ |
Virulence factor candidates in the N. seriolae UTF1 by comparison with the virulence genes data set of N. farcinica.
| Category | Gene | Description | %Identity | %Similarity | |||
|---|---|---|---|---|---|---|---|
| Cell wall proteins | ORF-145 | nfa1810 | mycolyltransferase | 70.9 | 79.9 | 5.00E-159 | |
| ORF-7590 | nfa1820 | mycolyltransferase | 56.4 | 75.9 | 7.00E-67 | ||
| ORF-150 | nfa1830 | mycolyltransferase | 57.7 | 73.0 | 2.00E-37 | ||
| Metal importers | ORF-5098 | nfa37790 | transcriptional regulator | 66.0 | 79.4 | 4.00E-84 | |
| ORF-950 | nfa7630 | thioesterase | 85.7 | 90.2 | 0 | ||
| ORF-951 | nfa7640 | polyketide synthase | 86.4 | 90.0 | 5.00E-60 | ||
| ORF-952 | nfa7650 | polyketide synthase | 87.3 | 93.0 | 1.00E-143 | ||
| ORF-953 | nfa7660 | non-ribosomal peptide synthetase | 88.5 | 93.8 | 0 | ||
| ORF-954 | nfa7670 | non-ribosomal peptide synthetase | 91.6 | 96.2 | 4.00E-161 | ||
| ORF-750 | nfa7680 | non-ribosomal peptide synthetase | 93.0 | 97.1 | 2.00E-108 | ||
| ORF-948 | nfa7610 | lysine-N-oxygenase | 84.7 | 93.1 | 0 | ||
| ORF-747 | nfa6190 | salicylate synthase | 81.4 | 88.5 | 4.00E-126 | ||
| ORF-749 | nfa6200 | salicylate-AMP ligase | 82.5 | 91.3 | 4.00E-126 | ||
| Oxidative and nitrosative stresses | ORF-5118 | nfa37890 | alkylhydroperoxide reductase | 68.0 | 79.0 | 5.00E-149 | |
| ORF-5119 | nfa37900 | alkylhydroperoxidase | 68.8 | 79.3 | 0 | ||
| ORF-325 | nfa55390 | catalase | 79.0 | 87.8 | 0 | ||
| ORF-4212 | nfa29500 | catalase-peroxidase | 63.8 | 77.8 | 1.00E-169 | ||
| ORF-6237 | nfa45490 | nitrate reductase alpha subunit | 71.3 | 84.8 | 0 | ||
| ORF-6238 | nfa45500 | nitrate reductase beta subunit | 72.4 | 83.3 | 1.00E-78 | ||
| ORF-6240 | nfa45520 | nitrate reductase gamma subunit | 75.3 | 84.6 | 0 | ||
| ORF-6239 | nfa45510 | nitrate reductase delta subunit | 74.1 | 83.8 | 1.00E-157 | ||
| ORF-6265 | nfa45610 | nitrite reductase (NAD(P)H) subunit | 76.9 | 86.9 | 0 | ||
| ORF-6264 | nfa45600 | nitrite reductase (NAD(P)H) subunit | 75.6 | 84.0 | 1.00E-173 | ||
| ORF-5117 | nfa37880 | hydrogen peroxide sensing transcriptional regulator | 66.9 | 78.6 | 2.00E-165 | ||
| ORF-418 | nfa52980 | superoxide dismutase | 78.2 | 86.4 | 1.00E-140 | ||
| ORF-68 | nfa1210 | superoxide dismutase | 40.5 | 57.5 | 3.00E-112 | ||
| Penetration into mammalian cells | ORF-4357 | nfa34810 | invasin | 65.6 | 80.4 | 2.00E-119 | |
| Phagosome arresting | ORF-5937 | nfa13510 | nucleoside diphosphate kinase | 44.8 | 57.6 | 2.00E-45 | |
| ORF-5498 | nfa16310 | protein-tyrosine phosphatase | 48.7 | 61.4 | 2.00E-88 | ||
| ORF-2644 | nfa18680 | protein-tyrosine phosphatase | 60.2 | 70.7 | 0 | ||
| other | ORF-2873 | nfa19960 | putative cytotoxin/hemolysin | 77.4 | 83.2 | 6.00E-135 |
* according to the Nocardia farcinica Genome Project Page (http://nocardia.nih.go.jp/).
Fig 2The organization of 8 mce operons in N. seriolae UTF1 genome.
The figure was generated using Easyfig 2.1 [29]. Arrows represent yrbE genes, mce genes and two additional genes, DDE (endonuclease DDE) and orf3406 (hypothetical protein) in mce3. Values indicate the number of base positions. Asterisks (mce6* and mce7*) indicate reverse complement orientation.
Potential antibiotic resistance genes of N. seriolae UTF1 according to the Antibiotic Resistance Genes Database (ARDB).
| Resistance gene type | Antibiotic resistance | Description | UTF1 ORF ID | Best Hit Accession | Score | |
|---|---|---|---|---|---|---|
| Amikacin, dibekacin, isepamicin, netilmicin, sisomicin, tobramycin | Aminoglycoside N-acetyltransferase, which modifies aminoglycosides by acetylation. | ORF-1348 | AAA26549 | 6.00E-32 | 130 | |
| Bacitracin | Undecaprenyl pyrophosphate phosphatase, which consists in the sequestration of Undecaprenyl pyrophosphate. | ORF-4188 | CAL13705 | 8.00E-23 | 101 | |
| Penicillin | Class A beta-lactamase. This enzyme breaks the beta-lactam antibiotic ring open and deactivates the molecule's antibacterial properites. | ORF-6138 | YP_089932 | 8.00E-65 | 241 | |
| Lincosamide, macrolide, streptogramin b | ABC transporter system, Macrolide-Lincosamide-Streptogramin B efflux pump. | ORF-4637 | AAC32027 | 2.00E-10 | 58 | |
| Chloramphenicol | Group B chloramphenicol acetyltransferase, which can inactivate chloramphenicol. Also referred to as xenobiotic acetyltransferase. | ORF-1707 | AAK88712 | 3.00E-06 | 45 | |
| Trimethoprim | Group A drug-insensitive dihydrofolate reductase, which can not be inhibited by trimethoprim. | ORF-6939 | CAL48457 | 8.00E-29 | 120 | |
| Macrolide | Resistance-nodulation-cell division transporter system. Multidrug resistance efflux pump. Macrolide-specific efflux system. | ORF-1987 | YP_001453760 | 9.00E-54 | 204 | |
| ORF-4813 | YP_001571041 | 1.00E-46 | 180 | |||
| ORF-6113 | YP_001334578 | 5.00E-47 | 181 | |||
| ORF-6376 | YP_001453760 | 2.00E-49 | 189 | |||
| Fluoroquinolone | Pentapeptide repeat family, which protects DNA gyrase from the inhibition of quinolones. | ORF-3773 | ABL05132 | 6.00E-08 | 52 | |
| Tetracycline | Major facilitator superfamily transporter, tetracycline efflux pump. | ORF-1750 | AAD04032 | 1.00E-105 | 375 | |
| Lincosamide, macrolide, streptogramin b | ABC transporter system, Macrolide-Lincosamide-Streptogramin B efflux pump. | ORF-4948 | CAA45050 | 9.00E-68 | 250 | |
| Tetracenomycin c | Major facilitator superfamily transporter. Resistance to tetracenomycin C by an active tetracenomycin C efflux system which is probably energized by transmembrane electrochemical gradients. | ORF-4285 | AAA67509 | 3.00E-05 | 40 | |
| Tetracycline | Major facilitator superfamily transporter, tetracycline efflux pump. | ORF-6293 | BAA07390 | 8.00E-99 | 355 | |
| Vancomycin | VanB type vancomycin resistance operon genes, which can synthesize peptidoglycan with modified C-terminal D-Ala-D-Ala to D-alanine—D-lactate. | ORF-5538 | ABB53368 | 1.00E-13 | 70 | |
| ORF-7016 | ABB53368 | 3.00E-16 | 79 | |||
| Vancomycin | VanC type vancomycin resistance operon genes, which can synthesize peptidoglycan with modified C-terminal D-Ala-D-Ala to D-alanine—D-serine. | ORF-2855 | AAY67971 | 3.00E-31 | 129 | |
| Streptogramin a | Virginiamycin A acetyltransferase, which can inactivate the target drug. | ORF-4237 | YP_001038094 | 9.00E-10 | 57 | |
| Streptogramin a | Virginiamycin A acetyltransferase, which can inactivate the target drug. | ORF-3078 | AAG21695 | 6.00E-49 | 187 |
Fig 3Dot plot comparisons of Nucleotide-based alignments were performed with MUMmer version 3.22 and dot plots were generated by the mummerplot script and the Unix program gnuplot [31].
Average nucleotide identity (ANI, upper grids) and average amino acid identity (AAI, lower grids) values (in percent) calculated between Nocardia genomes.
| - | 79.88 | 79.21 | 79.88 | 79.63 | |
| 69.17 | - | 81.36 | 81.61 | 79.77 | |
| 68.96 | 73.61 | - | 81.19 | 79.43 | |
| 69.62 | 74.99 | 73.68 | - | 79.89 | |
| 69.17 | 69.21 | 68.06 | 69.82 | - |
Fig 4Functional profiling of the five Nocardia complete genomes.
Heat map shows the abundance of the top 50 subsystems [36, 37] enriched in the five Nocardia genomes. The color scale indicates the abundance of gene content for each category.
Fig 5Comparison of COG distribution of N. seriolae UTF1 and the other four Nocardia genomes.
COG definitions are described as follows: A, RNA processing and modification; B, Chromatin structure and dynamics; C, Energy production and conversion; D, Cell cycle control, cell division, chromosome partitioning; E, Amino acid transport and metabolism; F, Nucleotide transport and metabolism; G, Carbohydrate transport and metabolism; H, Coenzyme transport and metabolism; I, Lipid transport and metabolism; J, Translation, ribosomal structure and biogenesis; K, Transcription; L, Replication, recombination and repair; M, Cell wall/membrane/envelope biogenesis; N, Cell motility; O, Posttranslational modification, protein turnover, chaperones; P, Inorganic ion transport and metabolism; Q, Secondary metabolites biosynthesis, transport and catabolism; R, General function prediction only; S, Function unknown; T, Signal transduction mechanisms; U, Intracellular trafficking, secretion, and vesicular transport; V, Defense mechanisms; W, Extracellular structures; X, Mobilome: prophages, transposons; Y, Nuclear structure; Z, Cytoskeleton. ‘Uk’ indicate unknown (unassigned genes). The number of ORFs with each COG category are listed in S4 Table.
Fig 6Identification of The orthologous and species-specific genes were identified using OrthoMCL [34] (A). The protein sequences of 1,982 N. seriolae UTF1-specific genes were functionally annotated with metabolic information from the Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway database using KEGG Orthology And Links Annotation (BlastKOALA) program [27] (B).
Functional classification of unique genes of N. seriolae UTF1 by KEGG modules.
| KEGG Module | Module ID | Description | Number of genes |
|---|---|---|---|
| ABC-2 type and other transport systems | M00254 | ABC-2 type transport system | 12 |
| M00258 | Putative ABC transport system | 2 | |
| Mineral and organic ion transport system | M00188 | NitT/TauT family transport system | 1 |
| M00190 | Iron(III) transport system | 2 | |
| M00299 | Spermidine/putrescine transport system | 4 | |
| Phosphotransferase system (PTS) | M00273 | PTS system, fructose-specific II component | 3 |
| Phosphate and amino acid transport system | M00233 | Glutamate transport system | 1 |
| Two-component regulatory system | M00452 | CusS-CusR (copper tolerance) two-component regulatory system | 2 |
| M00475 | BarA-UvrY (central carbon metabolism) two-component regulatory system | 1 | |
| Bacterial secretion system | M00335 | Sec (secretion) system | 1 |
| Drug efflux transporter/pump | M00713 | Fluoroquinolone resistance, efflux pump LfrA | 1 |
| Drug resistance | M00742 | Aminoglycoside resistance, protease FtsH | 1 |
| M00743 | Aminoglycoside resistance, protease HtpX | 1 | |
| M00727 | Cationic antimicrobial peptide (CAMP) resistance, N-acetylmuramoyl-L-alanine amidase AmiA and AmiC | 1 | |
| M00745 | Imipenem resistance, repression of porin OprD | 2 | |
| M00714 | Multidrug resistance, efflux pump QacA | 1 | |
| ATP synthesis | M00159 | V-type ATPase, prokaryotes | 5 |
| M00149 | Succinate dehydrogenase, prokaryotes | 4 | |
| M00151 | Cytochrome bc1 complex respiratory unit | 1 | |
| Nitrogen metabolism | M00530 | Dissimilatory nitrate reduction, nitrate = > ammonia | 1 |
| Methane metabolism | M00345 | Formaldehyde assimilation, ribulose monophosphate pathway | 1 |
| Fatty acid metabolism | M00083 | Fatty acid biosynthesis, elongation | 3 |
| M00082 | Fatty acid biosynthesis, initiation | 1 | |
| Central carbohydrate metabolism | M00307 | Pyruvate oxidation, pyruvate = > acetyl-CoA | 3 |
| Serine and threonine metabolism | M00555 | Betaine biosynthesis, choline = > betaine | 1 |
| Cysteine and methionine metabolism | M00021 | Cysteine biosynthesis, serine = > cysteine | 2 |
| Other amino acid metabolism | M00027 | GABA (gamma-Aminobutyrate) shunt | 1 |
| Ribosome | M00178 | Ribosome, bacteria | 1 |