| Literature DB >> 24501637 |
Jordy P M Coolen1, Andreas Sjödin2, Boulos Maraha3, Gerard F Hajer4, Mats Forsman2, Ellen Verspui5, Hendrina M E Frenay3, Daan W Notermans6, Maaike C de Vries6, Frans A G Reubsaet6, Armand Paauw1, Guus Roeselers1.
Abstract
Francisella tularensis is a facultative intracellular bacterium in the class Gammaproteobacteria. This strain is of interest because it is the etiologic agent of tularemia and a highly virulent category A biothreat agent. Here we describe the draft genome sequence and annotation of Francisella tularensis subsp. holarctica BD11-00177, isolated from the first case of indigenous tularemia detected in The Netherlands since 1953. Whole genome DNA sequence analysis assigned this isolate to the genomic group B.FTNF002-00, which previously has been exclusively reported from Spain, France, Italy, Switzerland and Germany. Automatic annotation of the 1,813,372 bp draft genome revealed 2,103 protein-coding and 46 RNA genes.Entities:
Keywords: Netherlands; biodefence; phylogeography; tularaemia; zoonotic infection
Year: 2013 PMID: 24501637 PMCID: PMC3910693 DOI: 10.4056/sigs.4217923
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Classification and general features
| | | | |
|---|---|---|---|
| Domain | TAS [ | ||
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Current classification | Order | TAS [ | |
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Subspecies | TAS [ | ||
| Strain BD11-00177 | NAS | ||
| Gram stain | negative | TAS [ | |
| Cell shape | short rods or coccoid forms | TAS [ | |
| Motility | No | TAS [ | |
| Sporulation | No | TAS [ | |
| Temperature range | Mesophilic | TAS [ | |
| Optimum temperature | 37 | IDA | |
| Carbon source | Carbohydrates | TAS [ | |
| Energy source | Chemoorganotrophic | TAS [ | |
| Terminal electron receptor | Facultative anaerobe | TAS [ | |
| MIGS-6 | Habitat | Host | TAS [ |
| MIGS-15 | Biotic relationship | Obligate host-dependent | TAS [ |
| MIGS-16 | Host name | Homo sapiens | TAS [ |
| MIGS-14 | Pathogenicity | Pathogen | TAS [ |
| MIGS-4 | Geographic location | The Netherlands | IDA |
| MIGS-5 | Sample collection time | October 2011 | IDA |
| MIGS-4.1 | Latitude | unknown | |
| MIGS-4.2 | Longitude | unknown | |
| MIGS-4.3 | Depth | unknown | |
| MIGS-4.4 | Altitude | unknown | |
| MIGS-4.5 | Isolation site | Human host | IDA |
aEvidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement [i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement [i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project.
Figure 1Maximum likelihood tree illustrating the phylogenetic relationships among several members of the genus and members of the order based on full-length 16S rRNA gene sequences.
Project information
| MIGS ID | Property | Term |
|---|---|---|
| MIGS-31 | Finishing quality | Standard Draft |
| MIGS-29 | Sequencing platforms | Illumina MiSeq, 454 Roche GS Junior |
| MIGS-31.2 | Fold coverage | 713× |
| MIGS-30 | Assemblers | Ray Assembler V2.1 |
| MIGS-32 | Gene calling method | Prodigal [ |
| GOLD ID | Gi21611 | |
| MIGS-38 | Project relevance | Medical, biodefence |
Nucleotide content and gene count levels of the genome
| Value | % of totala | |
|---|---|---|
| Genome Size (bp) | 1,813,372 | 100.00% |
| DNA coding region (bp) | 1,611,603 | 88.87% |
| DNA G+C content (bp) | 584,435 | 32.23% |
| Total genesb | 2149 | 100.00% |
| RNA genes | 46 | 2.14% |
| Protein-coding genes | 2103 | 97.86% |
| Genes in paralog clusters | 1262 | 58.72% |
| Genes assigned to COGs | 1584 | 73.71% |
| Protein coding genes connected to KEGG pathways | 611 | 28.43% |
| not connected to KEGG pathways | 1492 | 69.43% |
| Genes with signal peptides | 111 | 5.17% |
| Genes with transmembrane helices | 573 | 26.66% |
a) The total is based either on the size of the genome in base pairs or on the total number of protein coding genes in the annotated genome.
Number of genes associated with the 25 general COG functional categories
| | | | |
|---|---|---|---|
| J | 152 | 8.79 | Translation |
| A | 1 | 0.06 | RNA processing and modification |
| K | 65 | 3.76 | Transcription |
| L | 198 | 11.45 | Replication, recombination and repair |
| B | - | - | Chromatin structure and dynamics |
| D | 18 | 1.04 | Cell cycle control, mitosis and meiosis |
| Y | - | - | Nuclear structure |
| V | 31 | 1.79 | Defense mechanisms |
| T | 24 | 1.39 | Signal transduction mechanisms |
| M | 112 | 6.47 | Cell wall/membrane biogenesis |
| N | 19 | 1.1 | Cell motility |
| Z | 1 | 0.06 | Cytoskeleton |
| W | - | - | Extracellular structures |
| U | 44 | 2.54 | Intracellular trafficking and secretion |
| O | 66 | 3.82 | Posttranslational modification, protein turnover, chaperones |
| C | 107 | 6.18 | Energy production and conversion |
| G | 118 | 6.82 | Carbohydrate transport and metabolism |
| E | 158 | 9.13 | Amino acid transport and metabolism |
| F | 65 | 3.76 | Nucleotide transport and metabolism |
| H | 96 | 5.55 | Coenzyme transport and metabolism |
| I | 64 | 3.7 | Lipid transport and metabolism |
| P | 71 | 4.1 | Inorganic ion transport and metabolism |
| Q | 37 | 2.14 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 172 | 9.94 | General function prediction only |
| S | 111 | 6.42 | Function unknown |
| - | 565 | 26.29 | Not in COGs |
a) The total is based on the total number of protein coding genes in the annotated genome.
Figure 2BRIG diagram of the BD11-00177, FTNF002-00 and SCHU S4 genomes using the FSC200 genome as a reference backbone. White regions represent absent genetic regions.
Figure 3A) Overview of the genus phylogeny based on 52 public whole genome sequences. B) The phylogeny of strains based on whole genome sequences. The new isolate, BD11-00177 belongs to the FTNF002-00 genomic group inside the B.IV clade.