Literature DB >> 24501637

Draft genome sequence of Francisella tularensis subsp. holarctica BD11-00177.

Jordy P M Coolen1, Andreas Sjödin2, Boulos Maraha3, Gerard F Hajer4, Mats Forsman2, Ellen Verspui5, Hendrina M E Frenay3, Daan W Notermans6, Maaike C de Vries6, Frans A G Reubsaet6, Armand Paauw1, Guus Roeselers1.   

Abstract

Francisella tularensis is a facultative intracellular bacterium in the class Gammaproteobacteria. This strain is of interest because it is the etiologic agent of tularemia and a highly virulent category A biothreat agent. Here we describe the draft genome sequence and annotation of Francisella tularensis subsp. holarctica BD11-00177, isolated from the first case of indigenous tularemia detected in The Netherlands since 1953. Whole genome DNA sequence analysis assigned this isolate to the genomic group B.FTNF002-00, which previously has been exclusively reported from Spain, France, Italy, Switzerland and Germany. Automatic annotation of the 1,813,372 bp draft genome revealed 2,103 protein-coding and 46 RNA genes.

Entities:  

Keywords:  Netherlands; biodefence; phylogeography; tularaemia; zoonotic infection

Year:  2013        PMID: 24501637      PMCID: PMC3910693          DOI: 10.4056/sigs.4217923

Source DB:  PubMed          Journal:  Stand Genomic Sci        ISSN: 1944-3277


Introduction

is a Gram negative, non-motile, non-spore forming, facultative intracellular bacterium appearing as short rods or coccoid forms [1]. is the etiologic agent of tularemia, a zoonotic infection also known as rabbit fever and deer-fly fever. Transmission to humans has been reported by direct contact with infected animals, arthropod bites, inhalation of contaminated dust or ingestion of contaminated food or water. This pathogen is highly infectious as it can cause infection upon inhalation of as few as 10 cells. This extremely low infectious dose makes transmission via aerosols easy, and previous attempts to weaponize this microorganism have led to its recognition as a category A biothreat agent (CDC classification) [2,3]. contains three subspecies that are infectious to humans; the highly virulent , which often causes a lethal multi-systemic disease with a fatality rate of up to 30%, the less virulent subsp. holartica and , which both seldom cause infectious in humans. Here we present a summary classification together with the description of the draft genome sequence and annotation of BD11-00177, that was isolated from a vesicle on the forehead of a 72-year-old male living in The Netherlands. As the patient had not been abroad for years, this was the first documented case of indigenous tularemia in The Netherlands since 1953.

Classification and features

is the only genus within the family and is a member of the order and the class [4] [Table 1]. Besides , the genus includes the species , and the misclassified [4,17, Figure 1]. Only rare human infections with and , and are described, often caused after nearly drowning [18,19]. is capable of infecting hundreds of different vertebrate and invertebrate hosts [20]. The most widely distributed subspecies is , which is found throughout much of the Northern Hemisphere and is the only subspecies naturally occurring in Europe [21].
Table 1

Classification and general features

MIGS ID     Property      Term    Evidence codea
      Domain Bacteria    TAS [5]
      Phylum Proteobacteria    TAS [6]
      Class Gammaproteobacteria    TAS [7,8]
     Current classification      Order Thiotrichales    TAS [7,9]
      Family Francisellaceae    TAS [7-10]
      Genus Francisella    TAS [11-14]
      Species Francisella tularensis    TAS [11,12]
      Subspecies Francisella tularensis holarctica    TAS [15,16]
      Strain BD11-00177    NAS
     Gram stain      negative    TAS [1].
     Cell shape      short rods or coccoid forms    TAS [1].
     Motility      No    TAS [1].
     Sporulation      No    TAS [1].
     Temperature range      Mesophilic    TAS [1].
     Optimum temperature      37    IDA
     Carbon source      Carbohydrates    TAS [1].
     Energy source      Chemoorganotrophic    TAS [1].
     Terminal electron receptor      Facultative anaerobe    TAS [1].
MIGS-6     Habitat      Host    TAS [1].
MIGS-15     Biotic relationship      Obligate host-dependent    TAS [1].
MIGS-16     Host name     Host taxon ID     Host gender      Homo sapiens      9606      Male    TAS [1].    TAS [1].    NAS
MIGS-14     Pathogenicity     Biosafety Level      Pathogen      3    TAS [2].
MIGS-4     Geographic location      The Netherlands    IDA
MIGS-5     Sample collection time      October 2011    IDA
MIGS-4.1      Latitude      unknown
MIGS-4.2     Longitude      unknown
MIGS-4.3     Depth      unknown
MIGS-4.4     Altitude      unknown
MIGS-4.5     Isolation site     Isolation source      Human host      vesicle on the forehead    IDA    IDA

aEvidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement [i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement [i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project.

Figure 1

Maximum likelihood tree illustrating the phylogenetic relationships among several members of the genus and members of the order based on full-length 16S rRNA gene sequences.

aEvidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement [i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement [i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project. Maximum likelihood tree illustrating the phylogenetic relationships among several members of the genus and members of the order based on full-length 16S rRNA gene sequences.

Genome sequencing information

Genome project history

Strain BD11-00177 was sequenced because of its relevance to biodefense. The draft genome sequence was finished in August 2012. The GenBank accession number for the project is 177784. The genome project is listed in the Genome OnLine Database (GOLD) [22] as project Gi21611. Sequencing was carried out at the Dutch Organization for Applied Scientific Research (TNO) and the Swedish Defense Research Agency (FOI). Initial automatic annotation was performed using the DOE-JGI Microbial Annotation Pipeline (DOE-JGI MAP). Table 2 shows the project information and its association with MIGS 2.0 compliance.
Table 2

Project information

MIGS ID       Property      Term
MIGS-31       Finishing quality      Standard Draft
MIGS-29       Sequencing platforms      Illumina MiSeq, 454 Roche GS Junior
MIGS-31.2       Fold coverage      713×
MIGS-30       Assemblers      Ray Assembler V2.1
MIGS-32       Gene calling method      Prodigal [23]
       GOLD ID       IMG Taxon ID       NCBI PROJECT ID      Gi21611      1244086      177784
MIGS-38       Project relevance      Medical, biodefence

Growth conditions and DNA isolation

For DNA preparation, strain BD11-00177 was grown on 5% sheep blood agar plates for 72 h at 35°C in the presence of 5% CO2. DNA was extracted using the Qiamp DNA Micro Kit according manufacturers guidelines (Qiagen, Westburg b.v., Leusden, The Netherlands).

Genome sequencing and assembly

Sequencing was performed by the Microbiology and Systems Biology group at TNO and the Division for CBRN Defence and Security at FOI using 454 Roche GS Junior and the Illumina MiSeq platforms. The initial draft assembly yielded 95 large (>1,000 bp) and 86 small (<1,000 bp), non-redundant contigs of 1,813,372 bp by combing 75,245 Roche/454 reads at 23× coverage and 8,289,332 Illumina reads at 690× coverage by hybrid assembly through the Ray Assembler V2.1 [24].

Genome annotation

Open Reading Frames (ORFs) were predicted using the Prodigal gene prediction algorithm [23] as part of the DOE-JGI Microbial Annotation Pipeline (DOE-JGI MAP) using default parameters, followed by a round of manual curation. CRISPR elements were predicted using CRT and PILERCR [25]. Predictions from both methods were concatenated. Identification of tRNAs was performed using tRNAScan. Ribosomal RNA genes (5S, 16S, 23S) are predicted using the program RNAmmer [26]. With the exception of tRNA and rRNA, all models from Rfam [27] are used to search the genome sequence. For faster detection, sequences are first compared to a database containing all the ncRNA genes in the Rfam database using BLAST, with a very loose cutoff. Subsequently, sequences that have hits to any genes belonging to an Rfam model are searched using the program INFERNAL [27]. Protein coding genes were compared to protein families (e.g., COGs, Pfam, KEGG) and the proteome of selected “core” genomes, which are publicly available, and the product names were assigned based on the results of these comparisons.

Genome properties

The genome was assembled into 95 large (>1,000 bp) contigs and includes one circular chromosome with a total size of 11,813,372 bp (32.23% GC content). A total of 2,149 genes were predicted, 2,103 of which are protein-coding genes. Of the protein coding genes, 1,592 were assigned to a putative function, with the remaining being annotated as hypothetical proteins. The properties and the statistics of the genome are summarized in Tables 3 and 4.
Table 3

Nucleotide content and gene count levels of the genome

Attribute     Value     % of totala
Genome Size (bp)     1,813,372     100.00%
DNA coding region (bp)     1,611,603     88.87%
DNA G+C content (bp)     584,435     32.23%
Total genesb     2149     100.00%
RNA genes     46     2.14%
Protein-coding genes     2103     97.86%
Genes in paralog clusters     1262     58.72%
Genes assigned to COGs     1584     73.71%
Protein coding genes connected to KEGG pathways     611     28.43%
not connected to KEGG pathways     1492     69.43%
Genes with signal peptides     111     5.17%
Genes with transmembrane helices     573     26.66%

a) The total is based either on the size of the genome in base pairs or on the total number of protein coding genes in the annotated genome.

Table 4

Number of genes associated with the 25 general COG functional categories

Code     Value     %agea      Description
J     152     8.79      Translation
A     1     0.06      RNA processing and modification
K     65     3.76      Transcription
L     198     11.45      Replication, recombination and repair
B     -     -      Chromatin structure and dynamics
D     18     1.04      Cell cycle control, mitosis and meiosis
Y     -     -      Nuclear structure
V     31     1.79      Defense mechanisms
T     24     1.39      Signal transduction mechanisms
M     112     6.47      Cell wall/membrane biogenesis
N     19     1.1      Cell motility
Z     1     0.06      Cytoskeleton
W     -     -      Extracellular structures
U     44     2.54      Intracellular trafficking and secretion
O     66     3.82      Posttranslational modification, protein turnover, chaperones
C     107     6.18      Energy production and conversion
G     118     6.82      Carbohydrate transport and metabolism
E     158     9.13      Amino acid transport and metabolism
F     65     3.76      Nucleotide transport and metabolism
H     96     5.55      Coenzyme transport and metabolism
I     64     3.7      Lipid transport and metabolism
P     71     4.1      Inorganic ion transport and metabolism
Q     37     2.14      Secondary metabolites biosynthesis, transport and catabolism
R     172     9.94      General function prediction only
S     111     6.42      Function unknown
-     565     26.29      Not in COGs

a) The total is based on the total number of protein coding genes in the annotated genome.

a) The total is based either on the size of the genome in base pairs or on the total number of protein coding genes in the annotated genome. a) The total is based on the total number of protein coding genes in the annotated genome.

Comparisons with other fully sequenced genomes

Comparison of the assembled draft genome sequence of strain BD11-00177 with publicly available genome sequences revealed that it clusters in the FTNF002-00 genomic group (B.Br.FTNF002-00 and BIV.FTNF002-00) defined by the FTNF002-00 genome sequence [28-30] within the B.IV clade. The presence of the 1.59 kb RD23 deletion event [31] as well as the 464 bp size of the MLVA marker FtM24 [32], both typical for the FTNF002-00 genomic group, were confirmed in silico. Notably, isolates from this genomic group had previously been exclusively reported from Spain, France, Italy, Switzerland and Germany [28,31-35]. A BLAST Ring Image Generator (BRIG) analysis comparing the BD11-00177 genome against the genomes of F92, LVS, and FTNF002-00 revealed that the BD11-00177 draft genome shows considerable resemblance to FTNF002-00 (Figure 2).
Figure 2

BRIG diagram of the BD11-00177, FTNF002-00 and SCHU S4 genomes using the FSC200 genome as a reference backbone. White regions represent absent genetic regions.

BRIG diagram of the BD11-00177, FTNF002-00 and SCHU S4 genomes using the FSC200 genome as a reference backbone. White regions represent absent genetic regions. Evolutionary history of strain BD11-00177 was inferred using publicly available whole genome sequences. The trees in Figure 3 A and B are drawn to scale, with branch lengths in the same units as those of the evolutionary distances used to infer the phylogenetic tree. The evolutionary distances were computed using the number of differences method and are in the units of the number of base differences per sequence. The overview of genus involved 52 public genome sequences using Piscirickettia salmonis as outgroup (Figure 3A). The detailed analysis involved 14 genome sequences using strain SCHU S4 as outgroup (Figure 3B) [17,30,33,36-41]. All positions containing gaps and missing data were eliminated. There were a total of 1,599,589 positions in the final dataset.
Figure 3

A) Overview of the genus phylogeny based on 52 public whole genome sequences. B) The phylogeny of strains based on whole genome sequences. The new isolate, BD11-00177 belongs to the FTNF002-00 genomic group inside the B.IV clade.

A) Overview of the genus phylogeny based on 52 public whole genome sequences. B) The phylogeny of strains based on whole genome sequences. The new isolate, BD11-00177 belongs to the FTNF002-00 genomic group inside the B.IV clade.

Conclusion

Here we have presented the draft genome of the first member of FTNF002-00 genomic group of . As more genetic information of members from this genomic group becomes available, a better understanding of the evolution and biogeography of this pathogen will be gained. This knowledge may help us to understand the epidemiology and potential expansion of the geographical distribution of this genomic group. Despite potential biases associated with discontinuous draft genomes, we would like to focus on the added value of draft bacterial genome sequencing. Taking advantage of low cost and high-throughput sequencing platforms allows us to probe the vast microbial diversity present in nature and rapidly respond to clinical outbreaks and acute biosecurity hazards. From an evolutionary ecology perspective, increased sequencing efforts allow us to characterize the biogeography of microbial taxa and differentiate between neutral and conserved genome contents.
  31 in total

1.  A new organism resembling P. tularensis isolated from water.

Authors:  C L LARSON; W WICHT; W L JELLISON
Journal:  Public Health Rep       Date:  1955-03       Impact factor: 2.792

2.  Validation of publication of new names and new combinations previously effectively published outside the IJSEM.

Authors: 
Journal:  Int J Syst Evol Microbiol       Date:  2005-05       Impact factor: 2.747

3.  Chromosome rearrangement and diversification of Francisella tularensis revealed by the type B (OSU18) genome sequence.

Authors:  Joseph F Petrosino; Qin Xiang; Sandor E Karpathy; Huaiyang Jiang; Shailaja Yerrapragada; Yamei Liu; Jason Gioia; Lisa Hemphill; Arely Gonzalez; T M Raghavan; Akif Uzman; George E Fox; Sarah Highlander; Mason Reichard; Rebecca J Morton; Kenneth D Clinkenbeard; George M Weinstock
Journal:  J Bacteriol       Date:  2006-10       Impact factor: 3.490

4.  Phylogeography of Francisella tularensis ssp. holarctica in France.

Authors:  A J Vogler; D N Birdsell; J Lee; J Vaissaire; C L Doujet; M Lapalus; D M Wagner; P Keim
Journal:  Lett Appl Microbiol       Date:  2011-02       Impact factor: 2.858

5.  Description of Francisella hispaniensis sp. nov., isolated from human blood, reclassification of Francisella novicida (Larson et al. 1955) Olsufiev et al. 1959 as Francisella tularensis subsp. novicida comb. nov. and emended description of the genus Francisella.

Authors:  Birgit Huber; Raquel Escudero; Hans-Jürgen Busse; Erik Seibold; Holger C Scholz; Pedro Anda; Peter Kämpfer; Wolf D Splettstoesser
Journal:  Int J Syst Evol Microbiol       Date:  2009-09-25       Impact factor: 2.747

6.  Genomic deletion marking an emerging subclone of Francisella tularensis subsp. holarctica in France and the Iberian Peninsula.

Authors:  M P Dempsey; M Dobson; C Zhang; M Zhang; C Lion; C B Gutiérrez-Martín; P C Iwen; P D Fey; M E Olson; D Niemeyer; S Francesconi; R Crawford; M Stanley; J Rhodes; D M Wagner; A J Vogler; D Birdsell; P Keim; A Johansson; S H Hinrichs; A K Benson
Journal:  Appl Environ Microbiol       Date:  2007-09-21       Impact factor: 4.792

7.  Genome sequence of Francisella tularensis subspecies holarctica strain FSC200, isolated from a child with tularemia.

Authors:  Kerstin Svensson; Andreas Sjödin; Mona Byström; Malin Granberg; Mitchell J Brittnacher; Laurence Rohmer; Michael A Jacobs; Elizabeth H Sims-Day; Ruth Levy; Yang Zhou; Hillary S Hayden; Regina Lim; Jean Chang; Donald Guenthener; Allison Kang; Eric Haugen; Will Gillett; Rajinder Kaul; Mats Forsman; Pär Larsson; Anders Johansson
Journal:  J Bacteriol       Date:  2012-12       Impact factor: 3.490

8.  The Genomes On Line Database (GOLD) in 2009: status of genomic and metagenomic projects and their associated metadata.

Authors:  Konstantinos Liolios; I-Min A Chen; Konstantinos Mavromatis; Nektarios Tavernarakis; Philip Hugenholtz; Victor M Markowitz; Nikos C Kyrpides
Journal:  Nucleic Acids Res       Date:  2009-11-13       Impact factor: 16.971

9.  Genome characterisation of the genus Francisella reveals insight into similar evolutionary paths in pathogens of mammals and fish.

Authors:  Andreas Sjödin; Kerstin Svensson; Caroline Ohrman; Jon Ahlinder; Petter Lindgren; Samuel Duodu; Anders Johansson; Duncan J Colquhoun; Pär Larsson; Mats Forsman
Journal:  BMC Genomics       Date:  2012-06-22       Impact factor: 3.969

10.  Rfam: annotating non-coding RNAs in complete genomes.

Authors:  Sam Griffiths-Jones; Simon Moxon; Mhairi Marshall; Ajay Khanna; Sean R Eddy; Alex Bateman
Journal:  Nucleic Acids Res       Date:  2005-01-01       Impact factor: 16.971

View more
  3 in total

Review 1.  Whole-Genome Sequencing of Bacterial Pathogens: the Future of Nosocomial Outbreak Analysis.

Authors:  Scott Quainoo; Jordy P M Coolen; Sacha A F T van Hijum; Martijn A Huynen; Willem J G Melchers; Willem van Schaik; Heiman F L Wertheim
Journal:  Clin Microbiol Rev       Date:  2017-10       Impact factor: 26.132

2.  Long-range dispersal moved Francisella tularensis into Western Europe from the East.

Authors:  Chinmay Dwibedi; Dawn Birdsell; Adrian Lärkeryd; Kerstin Myrtennäs; Caroline Öhrman; Elin Nilsson; Edvin Karlsson; Christian Hochhalter; Andrew Rivera; Sara Maltinsky; Brittany Bayer; Paul Keim; Holger C Scholz; Herbert Tomaso; Matthias Wittwer; Christian Beuret; Nadia Schuerch; Paola Pilo; Marta Hernández Pérez; David Rodriguez-Lazaro; Raquel Escudero; Pedro Anda; Mats Forsman; David M Wagner; Pär Larsson; Anders Johansson
Journal:  Microb Genom       Date:  2016-12-12

3.  Phylogeographic Distribution of Human and Hare Francisella Tularensis Subsp. Holarctica Strains in the Netherlands and Its Pathology in European Brown Hares (Lepus Europaeus).

Authors:  Miriam Koene; Jolianne Rijks; Miriam Maas; Robin Ruuls; Marc Engelsma; Peter van Tulden; Marja Kik; Jooske IJzer; Daan Notermans; Maaike de Vries; Ewout Fanoy; Roan Pijnacker; Marcel Spierenburg; Herjan Bavelaar; Hanneke Berkhout; Sanjay Sankatsing; Rob Diepersloot; Kerstin Myrtennas; Malin Granberg; Mats Forsman; Hendrik-Jan Roest; Andrea Gröne
Journal:  Front Cell Infect Microbiol       Date:  2019-02-11       Impact factor: 5.293

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.