| Literature DB >> 28469601 |
Hasan C Tekedar1, Attila Karsi1, Joseph S Reddy2, Seong W Nho1, Safak Kalindamar1, Mark L Lawrence1.
Abstract
Flavobacterium columnare is a Gram-negative fish pathogen causing columnaris disease in wild and cultured fish species. Although the pathogen is widespread in aquatic environments and fish worldwide, little is known about biology of F. columnare and mechanisms of columnaris disease pathogenesis. Previously we presented the complete genome sequence of F. columnare strain ATCC 49512. Here we present a comparison of the strain ATCC 49512 genome to four other Flavobacterium genomes. In this analysis, we identified predicted proteins whose functions indicate F. columnare is capable of denitrification, which would enable anaerobic growth in aquatic pond sediments. Anaerobic growth of F. columnare ATCC 49512 with nitrate supplementation was detected experimentally. F. columnare ATCC 49512 had a relatively high number of insertion sequences and genomic islands compared to the other Flavobacterium species, suggesting a larger degree of horizontal gene exchange and genome plasticity. A type VI subtype III secretion system was encoded in F. columnare along with F. johnsoniae and F. branchiophilum. RNA sequencing proved to be a valuable technique to improve annotation quality; 41 novel protein coding regions were identified, 16 of which had a non-traditional start site (TTG, GTG, and CTT). Candidate small noncoding RNAs were also identified. Our results improve our understanding of F. columnare ATCC 49512 biology, and our results support the use of RNA sequencing to improve annotation of bacterial genomes, particularly for type strains.Entities:
Keywords: Flavobacterium columnare; RNA-Seq; comparative genomics analysis; denitrification; fish health
Year: 2017 PMID: 28469601 PMCID: PMC5395568 DOI: 10.3389/fmicb.2017.00588
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
Chromosome features of .
| Genome size | 3,162,432 | 2,861,988 | 3,559,884 | 6,096,872 | 2,993,089 |
| G+C content (%) | 31.5 | 32.5 | 32.9 | 34.1 | 31.4 |
| Number of rRNA operons | 5 | 6 | 3 | 6 | 4 |
| Number of tRNA genes | 74 | 49 | 44 | 62 | 55 |
| Protein coding genes | 2,642 | 2,432 | 2,867 | 5,056 | 2,671 |
| Average gene length (bp) | 1,021 | 1,003 | 1,030 | 1,061 | Unknown |
| Plasmid | No | Yes | Yes | No | No |
Figure 1Comparison of Amino acid composition for Flavobacterium strains. Letter abbreviations: A, Alanine; C, Cysteine; D, Aspartic Acid; E, Glutamic Acid; F, Phenylalanine; G, Glycine; H, Histidine; I, Isoleucine; K, Lysine; L, Leucine; M, Methionine; N, Asparagine; P, Proline; Q, Glutamine; R, Arginine; S, Serine; T, Threonine; V, Valine; W, Tryptophan; Y, Tyrosine. (B) Predicted protein lengths for Flavobacterium strains. (C) Predicted subcellular localization of proteins encoded in Flavobacterium genomes using PSORTb. (D) Genome to genome distance calculation for Flavobacterium genomes. Species abbreviations: FB, F. branchiophilum FL15; FC, F. columnare ATCC 49512; FI, F. indicum GPTSA100-9; FJ, F. johnsoniae UW101; and FP, F. psychrophilum JIB02-86.
Figure 2. (A) Core vs. pan-genome plot of the five evaluated Flavobacterium genomes. (B) Singleton (unique genes) analyses for Flavobacterium genomes. A decreasing number of singletons are observed with each sequential addition of a genome. (C) Venn diagram of Flavobacterium pan and core genomes based on orthology analysis.
Figure 3Orthology analysis of . Circles display percent identity of proteins from each listed species compared to F. columnare. From outside to inside: Flavobacterium branchiophilum FL-15, Flavobacterium indicum GPTSA100-9, Flavobacterium johnsoniae UW101, and Flavobacterium psychrophilum JIP02/86. Boxed region 1 indicates a unique F. columnare genomic island. Boxed region 2 is conserved in all the evaluated Flavobacterium genomes; the genes in this region encode ribosomal subunit elements.
Transcriptional factors and two-component systems in .
| 8 | 9 | 6 | 36 | 9 | 11 | 4 | 1 | 2 | 2 | 3 | 4 | 1 | 1 | 3 | |||
| 5 | 12 | 9 | 42 | 4 | 11 | 1 | 2 | 4 | 2 | 1 | 5 | 1 | 1 | 4 | |||
| 11 | 14 | 8 | 57 | 5 | 11 | 9 | 2 | 1 | 4 | 5 | 2 | 7 | 1 | 2 | 3 | ||
| 41 | 43 | 30 | 171 | 7 | 53 | 14 | 14 | 11 | 8 | 4 | 20 | 1 | 19 | 7 | |||
| 8 | 20 | 11 | 42 | 5 | 21 | 5 | 4 | 5 | 6 | 9 | 1 | 4 | 8 | ||||
OCS, One-component systems; RR, DNA-binding response regulators; SF, sigma factors; TR, transcriptional regulators; ODP; other DNA-binding proteins.
CRISPR-Cas systems analysis of .
| CRISPR-1 | Confirmed | GTTGTGGTTTGATTAAAGATTAGAAAACACGATATT | 44 | 29–78 | 391564 | 2,912 | cas9_TypOeII, cas1_TypeII | |
| CRISPR-2 | Confirmed | GTTGGGAAAGCCCTTATTTTGAAGGGTATCTACAAC | 9 | 30 | 1679967 | 563 | ||
| CRISPR-3 | Questionable | GTTGTAAATTGCTTTCAATTTTT | 2 | 54 | 1994455 | 99 | ||
| CRISPR-19 | Confirmed | CAATAGAAGTTACAGAATTAGGAAT | 5 | 44 | 232904 | 300 | cas9_TypeII, cas1_TypeII, cas2_TypeI-II-III | |
| CRISPR-20 | Confirmed | GTTGGTAATTATAAGCTAAAATACAATTTTGAAAGCAATTCACAAC | 21 | 29–30 | 1325548 | 1,560 | ||
| CRISPR-21 | Questionable | TTAATCTATATTCAAAAATCAAAT | 2 | 30 | 17425 | 77 | ||
| CRISPR-22 | Questionable | ACCGAACTAAACGCAAAAAAGTGTTGAGA | 3 | 27–56 | 2020986 | 170 | ||
| CRISPR-23 | Questionable | ACCGAACTAAACGCAAAAAAGTGTTGAGA | 3 | 27–56 | 2021433 | 169 | ||
| CRISPR-24 | Questionable | TCTCAACACTTTTTTGCGTTCAGTTCGGT | 2 | 56 | 2026310 | 113 | ||
| CRISPR-25 | Questionable | AGTTTTTTCTCCACACTTTTTTGCGTTCAGTTCGG | 2 | 55 | 2084863 | 124 | ||
| CRISPR-4 | Confirmed | CATATCGTGTTTTCTAATCTTTAATCAAACCACAAC | 29 | 29–78 | 1899989 | 1,928 | ||
| CRISPR-5 | Confirmed | GTTGTAACTGCCCTTATTTTGAAGGGTAAACACAAC | 12 | 29–30 | 2549336 | 756 | ||
| CRISPR-6 | Confirmed | GTTGTAACTGCCCTTATTTTGAAGGGTAAACACAAC | 10 | 29–30 | 2550434 | 628 | ||
| CRISPR-7 | Confirmed | GTTGTAACTGCCCTTATTTTGAAGGGTAAACACAAC | 6 | 29–30 | 2551183 | 364 | ||
| CRISPR-8 | Confirmed | GTTTAAAACCACTTTAAAATTTCTACTATTGTAGAT | 38 | 26–32 | 3018429 | 2,392 | ||
| CRISPR-9 | Questionable | CTCCTGCAAGGTTTCCAAAACCTTGTAGGTG | 3 | 25-41 | 85275 | 158 | ||
| CRISPR-10 | Questionable | ATGGAAAAATTATTTGCTTAATCCCCATAAAATTCATGTGAACC | 2 | 31 | 1392707 | 118 | ||
| CRISPR-11 | Questionable | TTTATTTTTTATAGGTATTCGTT | 2 | 38 | 1460985 | 83 | ||
| CRISPR-12 | Questionable | ATGGATTTTTATTTTTTATAGGTATTCGTT | 2 | 31 | 1697623 | 90 | ||
| CRISPR-13 | Questionable | GTTGTAACTGCCCTTATTTTGAAGGGTAAACACAAC | 2 | 29 | 2550212 | 101 | ||
| CRISPR-14 | Questionable | TAAAAAAAAACAGATAAAAATCCGTGTTTACGAAGTATCCGT | 2 | 58 | 2858645 | 141 | ||
| CRISPR-15 | Questionable | GGATTTTATTTTTTATAGGTATTCGTT | 2 | 34 | 3071863 | 87 | ||
| CRISPR-26 | Confirmed | AATCAATTTACAGGTACAATTCC | 5 | 49 | 644859 | 310 | ||
| CRISPR-27 | Confirmed | ACTTTTACTTAACATCGTTAGATAATTTAAT | 4 | 26–48 | 5194431 | 227 | ||
| CRISPR-28 | Questionable | AATCAATTTACAGGTACAATTCC | 2 | 49 | 644499 | 95 | ||
| CRISPR-29 | Questionable | TTAAAACCATATAAATCTATATG | 2 | 36 | 3182380 | 81 | ||
| CRISPR-30 | Questionable | TTTTATTTCTGAACAAAGATTTCAAAAC | 2 | 61 | 5739460 | 116 | ||
| CRISPR-16 | Questionable | TTCTATGCGCCCCTCTTTGTCAT | 2 | 39 | 1744952 | 84 | ||
| CRISPR-17 | Questionable | AGAACAAATACTACAACACAACCTAGA | 2 | 51 | 2058296 | 104 | ||
| CRISPR-18 | Questionable | TACTGTAGGACATTTGTCATCTTTATC | 2 | 60 | 2872075 | 113 |
Figure 4Insertion elements distribution in .
Figure 5Growth of Growth under aerobic conditions and (B) growth under anaerobic conditions.
Novel protein coding regions.
| 39037 | 39636 | − | 200 | WP_065213772.1 | Hypothetical protein | No hit | |
| 85376 | 85573 | − | 66 | ANO48202.1 | Hypothetical protein Pf1_02748 | TTG | No hit |
| 95864 | 96004 | + | 47 | ANO48210.1 | Hypothetical protein Pf1_02756 | CNF modifies all members of the Rho family of small GTPases, leading to their activation | |
| 134375 | 124572 | + | 66 | WP_065213298.1 | Hypothetical protein | CTT | capsular polysaccharide biosynthesis protein Cps4D [Streptococcus pneumoniae TIGR4] |
| 141945 | 142067 | + | 41 | ANO48250.1 | Hypothetical protein Pf1_02796 | TTG | No hit |
| 200731 | 200895 | + | 55 | ANO49209.1 | Hypothetical protein Pf1_00961 | hypothetical protein Z5095 [Escherichia coli O157:H7 EDL933] | |
| 468693 | 469853 | + | 387 | WP_014164538.1 | Molybdopterin molybdenumtransferase MoeA | Vacuolating cytotoxin; | |
| 554290 | 554439 | − | 50 | ANO48894.1 | Hypothetical protein Pf1_00646 | GTG | toxin co-regulated pilus biosynthesis protein T [Vibrio cholerae O1 biovar El Tor str. N16961] |
| 594135 | 594797 | − | 221 | WP_065213557.1 | ATPase | AatC ATB binding protein of ABC transporter [Escherichia coli] | |
| 680568 | 680756 | − | 63 | ANO49370.1 | Hypothetical protein Pf1_01125 | TTG | Zinc metalloproteinase/disintegrin |
| 692884 | 693144 | − | 87 | ANO49363.1 | Hypothetical protein Pf1_01118 | trans-sialidase [Trypanosoma cruzi strain CL Brener] | |
| 727112 | 727264 | − | 51 | ANO49336.1 | Hypothetical protein Pf1_01091 | Group III snake venom metalloproteinase | |
| 744844 | 744960 | + | 39 | ANO49317.1 | Hypothetical protein Pf1_01072 | two-component system sensor histidine kinase [Bacteroides thetaiotaomicron VPI-5482] | |
| 745653 | 745790 | + | 46 | ANO49315.1 | Hypothetical protein Pf1_01070 | TTG | Streptokinase |
| 973850 | 974353 | + | 168 | WP_065212991.1 | Rod shape-determining protein MreD | No hit | |
| 1067797 | 1067946 | − | 50 | ANO47321.1 | Hypothetical protein Pf1_01864 | No hit | |
| 1171749 | 1171907 | + | 53 | ANO47901.1 | Hypothetical protein Pf1_02447 | TTG | hypothetical protein XF1753 [Xylella fastidiosa 9a5c] |
| 1404311 | 1404457 | + | 49 | ANO48160.1 | Hypothetical protein Pf1_02706 | chloramphenicol acetyltransferase [Vibrio cholerae 1587] | |
| 1583470 | 1583604 | − | 45 | ANO48610.1 | Hypothetical protein Pf1_00362 | TTG | erythrocyte membrane protein 1 PfEMP1 [Plasmodium falciparum 3D7] |
| 1583723 | 1583881 | − | 53 | ANO48609.1 | Hypothetical protein Pf1_00361 | TTG | No hit |
| 1606964 | 1607101 | − | 46 | ANO48589.1 | Hypothetical protein Pf1_00341 | No hit | |
| 1682815 | 1682925 | + | 37 | ANO48526.1 | Hypothetical protein Pf1_00278 | TTG | No hit |
| 1691105 | 1691263 | − | 54 | ANO48516.1 | Hypothetical protein Pf1_00268 | Outer membrane protein | |
| 1712008 | 1712136 | + | 43 | ANO48500.1 | Hypothetical protein Pf1_00252 | TTG | No hit |
| 1848535 | 1848669 | + | 45 | ANO49588.1 | Hypothetical protein Pf1_01346 | TTG | flagellum-specific ATP synthase [Campylobacter jejuni subsp. jejuni NCTC 11168] |
| 1973301 | 1973432 | − | 44 | ANO49696.1 | Hypothetical protein Pf1_01455 | TTG | minor fimbrial subunit [Salmonella typhimurium LT2] |
| 1975257 | 1975415 | − | 53 | ANO49696.1 | Hypothetical protein Pf1_01458 | TTG | minor fimbrial subunit [Salmonella typhimurium LT2] |
| 1994443 | 1994565 | − | 41 | ANO49717.1 | Hypothetical protein Pf1_01476 | Phosphoenolpyruvate-Protein Phosphotransferase Phosphotransferase System,Enyzme I; EC = 2.7.3.9 | |
| 2131190 | 2131318 | − | 43 | ANO47037.1 | Hypothetical protein Pf1_01580 | SecD/SecF fusion protein | |
| 2142368 | 2142484 | − | 39 | ANO47045.1 | Hypothetical protein Pf1_01588 | TTG | Hemolysin-activating lysine-acyltransferase |
| 2172014 | 2172355 | − | 114 | WP_065212880.1 | Hypothetical protein | erythrocyte membrane protein 1 PfEMP1 [Plasmodium falciparum 3D7] | |
| 2174006 | 2174338 | − | 111 | WP_036217889.1 | Hypothetical protein | FUR protein | |
| 2216770 | 2216907 | + | 46 | ANO47112.1 | Hypothetical protein Pf1_01655 | No hit | |
| 2298050 | 2298268 | + | 73 | ANO47180.1 | Hypothetical protein Pf1_01723 | flavin-binding monooxygenase-like, L-ornithine 5-monooxygense | |
| 2472639 | 2472761 | − | 41 | ANO47787.1 | Hypothetical protein Pf1_02332 | Arginine succinyltransferase | |
| 2684501 | 2684620 | + | 40 | ANO47598.1 | Hypothetical protein Pf1_02143 | No hit | |
| 2775026 | 2775151 | + | 42 | ANO47538.1 | Hypothetical protein Pf1_02083 | putative ABC exporter outer membrane component [Salmonella typhimurium LT2] | |
| 2847516 | 2847671 | − | 52 | ANO48075.1 | Hypothetical protein Pf1_02621 | putative O-antigen acetylase | |
| 2848787 | 2848936 | − | 50 | WP_014166478.1 | DUF4295 domain-containing protein | merozoite surface antigen 2c [Babesia bovis] | |
| 2898382 | 2898615 | + | 78 | WP_014166523.1 | Hypothetical protein | TTG | motor torque providing |
| 2947619 | 2948902 | + | 428 | ANO48353.1 | Teichoic-acid-transporting ATPase | ABC transporter, ATP-binding protein [Enterococcus faecalis V583] |
| 1 | 1 | 1 | 1 | 1 | 1 | 2 | |||||||||||||||||||||||||||||||||||
| 1 | 1 | 1 | 2 | 1 | 1 | 2 | |||||||||||||||||||||||||||||||||||
| 1 | 2 | ||||||||||||||||||||||||||||||||||||||||
| 1 | 1 | 1 | 1 | 2 | |||||||||||||||||||||||||||||||||||||
| 1 | 2 | ||||||||||||||||||||||||||||||||||||||||
| 2 | 8 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 4 | 1 | 1 | ||||||||||||||||||||||||||||||||
| 1 | 1 | ||||||||||||||||||||||||||||||||||||||||||
| 1 | 4 | 1 | 1 | 1 | 1 | 1 | 1 | 2 | 3 | 1 | 1 | 1 | |||||||||||||||||||||||||||||||
| 1 | 2 | 1 | 4 | 2 | 1 | 1 | 1 | 1 | 2 | 1 | 5 | 1 | 1 | ||||||||||||||||||||||||||||||
| 2 | 1 | ||||||||||||||||||||||||||||||||||||||||||
| 6 | 3 | 4 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 2 | ||||||||||||||||||||||||||||
| 7 | 3 | 5 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 2 | |||||||||||||||||||||||||||
| 5 | 2 | 4 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 2 | |||||||||||||||||||||||||||
| 11 | 5 | 15 | 3 | 3 | 3 | 3 | 3 | 3 | 7 | 1 | 1 | 1 | 1 | 2 | 1 | 1 | 1 | 1 | 1 | 1 | 2 | ||||||||||||||||||||
| 8 | 5 | 7 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 2 | ||||||||||||||||||||||||||||
M.G., mandatory genes (genes that must be present in the genome to define this system); A.G., accessory genes (genes that can be present, but do not have to be found in every case); and F.G., forbidden genes (genes that must not be present in the system).