Literature DB >> 27942352

Complete genome anatomy of the emerging potato pathogen Dickeya solani type strain IPO 2222T.

Slimane Khayi1, Pauline Blin1, Teik Min Chong2, Kok-Gan Chan2, Denis Faure1.   

Abstract

Several species of the genus Dickeya provoke soft rot and blackleg diseases on a wide range of plants and crops. Dickeya solani has been identified as the causative agent of diseases outbreaks on potato culture in Europe for the last decade. Here, we report the complete genome of the D. solani IPO 2222T. Using PacBio and Illumina technologies, a unique circular chromosome of 4,919,833 bp was assembled. The G + C content reaches 56% and the genomic sequence contains 4,059 predicted proteins. The ANI values calculated for D. solani IPO 2222T vs. other available D. solani genomes was over 99.9% indicating a high genetic homogeneity within D. solani species.

Entities:  

Keywords:  Blackleg; Dickeya solani; Genome; Potato; Short genome report; Soft rot

Year:  2016        PMID: 27942352      PMCID: PMC5127095          DOI: 10.1186/s40793-016-0208-0

Source DB:  PubMed          Journal:  Stand Genomic Sci        ISSN: 1944-3277


Introduction

are pectinolytic enterobacteria that cause soft rot and blackleg diseases on a wide range of crops worldwide including potato plants () [1, 2]. They are equipped with an arsenal of plant-cell wall degrading enzymes that macerate tuber and stem tissues provoking disease symptoms [3]. In the beginning of the 2000′s, emerged as a novel species causing blackleg and soft rot diseases on potato in Europe and Mediterranean Basin [4]. Initially, several pectinolytic strains isolated from potatoes grown in Europe and Israel, were identified as members of the genus, but shown to exhibit distinctive genetic and physiological traits (biovar 3). Thereafter, additional phylogenetic and biochemical analyses have brought these isolates into a distinct clade called [5-8]. The strain IPO 2222 T was isolated from infected potato plants in The Netherlands in 2007 [9]. To date, 12 draft genomes of are available in GenBank databases. Among them, the genome of the strain IPO 2222 T was sequenced using 454-pyrosequencing with a low average genome coverage (14×). The resulting draft genome is composed of 91 contigs that were assembled in a single scaffold [9]. In this report, we combined Illumina and Pacific Biosciences technologies to provide a complete genome sequence of the strain IPO 2222 T. We also highlighted some phylogenetic and phenotypic key-features of the species.

Organism information

Classification and features

IPO 2222 T belongs to the order of Enterobacteria and the class of . The gapA-based phylogenetic tree (Fig. 1) was congruent with the previously reported trees inferred from MLSA [8, 10], gathering all strains in a distinct clade within the genus. The gapA housekeeping gene was chosen instead of 16S rRNA gene because the sequence analysis of gapA permit a highly resolved view of distinction between members of the genus [8, 10].
Fig. 1

Phylogenetic tree highlighting the relative position of D. solani IPO 2222T within other Dickeya and Pectobacterium species. The unique gapA gene was retrieved from each of the complete and draft genomes that are available in NCBI database; alignment was generated using MUSCLE [23]; the evolutionary history was inferred using the Neighbor-Joining method [24] and the evolutionary distances were computed using the Maximum Composite Likelihood method [25]. Phylogenetic analyses were conducted using MEGA7 software [26]

Phylogenetic tree highlighting the relative position of D. solani IPO 2222T within other Dickeya and Pectobacterium species. The unique gapA gene was retrieved from each of the complete and draft genomes that are available in NCBI database; alignment was generated using MUSCLE [23]; the evolutionary history was inferred using the Neighbor-Joining method [24] and the evolutionary distances were computed using the Maximum Composite Likelihood method [25]. Phylogenetic analyses were conducted using MEGA7 software [26] IPO 2222 T is a Gram negative, non-spore-forming, motile and facultative anaerobic bacterium with rod shaped cells (0.9x2.0 μm) (Fig. 2) [8]. The strain IPO 2222 T grows in TY medium (tryptone 5 g/L, yeast extract 3 g/L and agar 1.5%) at 28 °C forming 1–2 mm colonies within 24 h. It produces phosphatase and indole and belongs to biovar 3 as described previously [10]. Distinctive metabolic abilities of species were described using BIOLOG system [11]; among them, IPO 2222 T uses urea as sole nitrogen source (Additional file 1: Figure S1). IPO 2222 T was recovered form naturally infected potato plants showing blackleg and soft rot symptoms. Its aggressiveness was confirmed by infecting potato tubers and plants in greenhouse assays (Additional file 2: Figure S2). In addition, its ability to colonize the roots and stem tissues and to provoke disease symptoms has been reported using green fluorescent protein-tagged strain [12].
Fig. 2

Photomicrographs of D. solani IPO 2222T using DAPI (4′,6-diamidino-2-phenylindole) staining (a), differential interference contrast (b) and blue methylene staining (c). These photomicrographs show the rod shaped forms of D. solani species

Photomicrographs of D. solani IPO 2222T using DAPI (4′,6-diamidino-2-phenylindole) staining (a), differential interference contrast (b) and blue methylene staining (c). These photomicrographs show the rod shaped forms of D. solani species The strain IPO 2222 T has been registered at the Belgian Co-ordinated Collections of Micro-organisms (LMG 25993 T), the National Collection of Plant Pathogenic Bacteria in UK (NCPPB 4479 T), and the International Center for Microbial Resources - French collection of plant-associated bacteria (CFBP 8199 T). MIGS of strain IPO 2222 T is summarized in Table 1.
Table 1

Classification and general features of Dickeya solani strain IPO 2222T [13]

MIGS IDPropertyTermEvidence codea
ClassificationDomain Bacteria TAS [15]
Phylum Proteobacteria TAS [27]
Class Gammaproteobacteria TAS [28, 29]
Order “Enterobacteriales” TAS [28, 29]
Family Enterobacteriaceae TAS [30]
Genus Dickeya TAS [1]
Species Dickeya solani TAS [8]
Type strain: IPO 2222T (CP015137)
Gram stainnegativeTAS [8]
Cell shapeRodTAS [8]
MotilityMotileIDA
SporulationNon sporulatingNAS [8]
Temperature rangeMesophilicTAS [8]
Optimum temperature39°CTAS [8]
pH range; OptimumNot reported;7IDA
Carbon sourceD-Arabinose, MannitolTAS [8]
MIGS-6HabitatRhizosphereTAS [8]
MIGS-6.3Salinity0.5% NaCl (w/v)TAS [31]
MIGS-22Oxygen requirementFacultatively anaerobicTAS [8]
MIGS-15Biotic relationshipfree-livingTAS [8]
MIGS-14PathogenicityPathogenicNAS [8]
MIGS-4Geographic locationNetherlandsTAS [8, 9]
MIGS-5Sample collection2007TAS [8, 9]
MIGS-4.1LatitudeNot reportedNAS
MIGS-4.2LongitudeNot reportedNAS
MIGS-4.4AltitudeNot reportedNAS

aEvidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [32]

Classification and general features of Dickeya solani strain IPO 2222T [13] aEvidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [32]

Genome sequencing information

Genome project history

The genome sequence of strain IPO 2222 T was sequenced using two technologies, PacBio RSII and Illumina NextSeq 500. This organism was selected based on the agricultural relevance as an emerging pathogen with a significant impact on the potato production and trade in Europe and around the world. Project information is available from Genome Online database number Gp0138842 under the Gold study number Gs0118682 at Joint Genome Institute. The complete genome sequence is also deposited in GenBank under the accession number CP015137. In Table 2, we provide a summary of the project information and its association with MIGS [13].
Table 2

Project information

MIGS IDPropertyTerm
MIGS 31Finishing qualityComplete genome
MIGS-28Libraries usedPaired-end
MIGS 29Sequencing platformsIllumina NextSeq500, PacBio
MIGS 31.2Fold coverage450X
MIGS 30AssemblersCLC Genomics
MIGS 32Gene calling methodNCBI Prokaryotic Genome Annotation Pipeline
Locus TagA4U42
Genbank IDCP015137
GenBank Date of Release16 Mai 2016
GOLD IDGp0138842
BIOPROJECTPRJNA317288
MIGS 13Source Material IdentifierIPO 2222T
Project relevanceAgricultural
Project information

Growth conditions and genomic DNA preparation

IPO 2222 T was routinely cultured in TY medium at 28 °C. Genomic DNA extraction was performed from 5 mL overnight culture using a phenol-chloroform purification method followed by an ethanol precipitation as described by Wilson [14]. Quantification and quality control of the DNA was completed using a NanoDrop (ND 1000) device, Qubit® 2.0 fluorometer and agarose (1.0%) gel electrophoresis.

Genome sequencing and assembly

Second generation sequencing was performed using NextSeq 500 (Illumina, CA, USA) at the I2BC platform (Gif-sur-Yvette, France). A paired-end library was constructed with an insert size of 390 bp and sequencing was carried out using 2 × 151 bp paired-end read module. The de novo assembly (length fraction, 0.5; similarity, 0.8) was performed using CLC Genomics Workbench (v8.0) software (CLC Inc, Aarhus, Denmark). After quality (quality score threshold 0.05) and length (above 40 nucleotides) trimming of the sequences, 33 contigs (N50 = 266,602 bp) were generated (CLC parameters: automatic determination of the word and bubble sizes with no scaffolding) with a 450× average genome coverage. The largest contig length was 617,431 bp. Third generation sequencing was performed using PacBio RSII (Pacific Biosciences, CA, USA) at the University of Malaya (Kuala Lumpur, Malaysia). The SMRTbell template library at the size of 20 kbp was constructed using the commercial Template Preparation Kit (Pacific Biosciences, CA, USA) followed by sequencing using P6/C4 sequencing chemistry with sequence collection time set at 240 min. Prior to assembly, short reads (less than 500 bp) were filtered off and the minimum polymerase read quality used for mapping of sub-reads from a single zero-mode waveguides was set at 0.75. In total 146,263 reads were obtained (N50 value was 9,161 bp) and total base pair number was at 1,070,191,526 resulting in a 217× average genome coverage. Reads were assembled using RS_HGAP_Assembly software (V2.0). The cut-off length of seeding reads was set at 13,304 bp in order to serve as a reference for the recruitment of shorter reads for preassembly. The resulted consensus accuracy based on multiple sequence alignment of the sub-reads was at 99.99%. The de novo Illumina-contigs were used to verify the RS_HGAP assembly by blasting them against the PacBio sequence. In addition, the trimmed Illumina reads were mapped (length fraction, 0.5; similarity, 0.8) against the PacBio sequence and errors (SNPs and InDels), that might be generated by homopolymers during PacBio sequencing, were searched and corrected using basic variant calling tool from CLC genomic workbench. Using these two sets of sequences, the complete genome sequence was approved and circularized.

Genome annotation

The complete genome of IPO 2222 T was annotated using the NCBI prokaryotic genome annotation pipeline [15]. The protein coding gene prediction process begin by an alignment using ProSplign [16] where only complete alignments with 100% identity to a reference protein are kept for final annotation. Then the remaining frameshift or partial alignments were further analyzed by GeneMarkS+ [17]. To identify structural rRNA, the pipeline uses BLASTn search against the curated reference set. tRNAscan-SE was used to identify the tRNAs [18]. The CRISPRs are identified by using the CRISPR database [15].

Genome properties

The detailed information about IPO 2222 T genome is provided in Table 3. The genome is constituted of one circular chromosome, 4,919,833 bp in size. The annotation predicted 4,208 genes including 4,059 CDSs (Table 4), 104 RNA genes (75 tRNA, 22 rRNA and 7 ncRNA genes) and 45 pseudo genes. The G + C reached 56%. The graphical genome map is provided in the Fig. 3.
Table 3

Genome statistics

AttributeValue% of total
Genome size (bp)4,919,833100.00
DNA coding (bp)4,243,94486.33
DNA G + C (bp)2,767,15556.24
DNA scaffolds1100.00
Total genes4,208100.00
Protein coding genes4,10497.5
RNA genes1042.5
Pseudo genes451.06
Genes in internal clusters1,09325.97
Genes with function prediction3,67087.21
Genes assigned to COGs3,36579.97
Genes with Pfam domains3,78899.02
Genes with signal peptides3869.17
Genes with transmembrane helices95322.65
CRISPR repeats1-
Table 4

Number of genes associated with general COG functional categories

CodeValue% agea Description
J2346.09Translation, ribosomal structure and biogenesis
A10.03RNA processing and modification
K3057.94Transcription
L1122.92Replication, recombination and repair
B00.00Chromatin structure and dynamics
D421.09Cell cycle control, Cell division, chromosome partitioning
V902.34Defense mechanisms
T2165.62Signal transduction mechanisms
M2456.38Cell wall/membrane biogenesis
N1062.76Cell motility
U822.14Intracellular trafficking and secretion
O1383.59Posttranslational modification, protein turnover, chaperones
C2225.78Energy production and conversion
G3248.44Carbohydrate transport and metabolism
E43811.41Amino acid transport and metabolism
F962.5Nucleotide transport and metabolism
H1925.0Coenzyme transport and metabolism
I1303.39Lipid transport and metabolism
P2817.32Inorganic ion transport and metabolism
Q932.42Secondary metabolites biosynthesis, transport and catabolism
R2827.34General function prediction only
S1754.56Function unknown
-84320.03Not in COGs
bTotal4,683120

aThe percentage is based on the total number of protein coding genes in the annotated genome

bThe total does not correspond to 4,208 CDS because some genes are associated with more than one COG functional categories

Fig. 3

Graphical circular map of D. solani IPO 2222T chromosome

Genome statistics Number of genes associated with general COG functional categories aThe percentage is based on the total number of protein coding genes in the annotated genome bThe total does not correspond to 4,208 CDS because some genes are associated with more than one COG functional categories Graphical circular map of D. solani IPO 2222T chromosome

Insights from the genome sequence

species is genetically highly homogenous with 99.9% in genomic similarity (ANI value) [19, 20]. Between two given genomes, the number of variations (SNPs/InDels) is below one hundred. For example, when strain 3337 and strain IPO 2222 T were compared, 49 variations were observed: 15 were located out of CDS and 34 within CDS [19]. Only a few of genomes (strains RNS 07.7.3B, PPO 9019 and PPO 9134) exhibited a higher number of variations (>1000) because they acquired genes by horizontal gene transfer [19]. None horizontal gene transfer from was observed in strain IPO 2222 T. Plant-cell wall degrading enzymes comprising pectinases, proteinases and cellulases, play a major role in the plant tissue maceration process [21]. Indeed, 10 pectates lyase enzymes (genes pelABCDEILXWZ) were predicted in strain IPO 2222 T genome; they showed a 93.3% average nucleotide identity when compared to the orthologous genes of 3937. Recent comparative analyses underlined the major genetic and metabolic divergences between species and the nearest clades that are D. dandatii (ANI 94%) and (ANI 92%) [11, 19]. is characterized by a low content of phages elements and CRISPR system: in strain IPO 2222 T genome, only one CRISPR cluster (208 bp) was identified. Using PHAST tool [22], the strain IPO 2222 T harbors one questionable prophage (11 CDSs) in a 10,687 bp region. In addition, some genomic regions were shown to be specific for species and contain some metabolic and NRPS/PKS encoding genes [11].

Conclusions

The complete sequence of IPO 2222 T is the first complete genome of a member of this species, the type strain. This work provides a substantial resource in terms of knowledge of the bacterial genetic material. It may help to understand the successful fitness of in invading potato fields, opening the way to new control strategies against this phytopathogen.
  23 in total

1.  MUSCLE: multiple sequence alignment with high accuracy and high throughput.

Authors:  Robert C Edgar
Journal:  Nucleic Acids Res       Date:  2004-03-19       Impact factor: 16.971

2.  tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence.

Authors:  T M Lowe; S R Eddy
Journal:  Nucleic Acids Res       Date:  1997-03-01       Impact factor: 16.971

3.  The neighbor-joining method: a new method for reconstructing phylogenetic trees.

Authors:  N Saitou; M Nei
Journal:  Mol Biol Evol       Date:  1987-07       Impact factor: 16.240

4.  Systemic colonization of potato plants by a soilborne, green fluorescent protein-tagged strain of Dickeya sp. biovar 3.

Authors:  Robert Czajkowski; Waldo J de Boer; Henk Velvis; Jan M van der Wolf
Journal:  Phytopathology       Date:  2010-02       Impact factor: 4.025

5.  The minimum information about a genome sequence (MIGS) specification.

Authors:  Dawn Field; George Garrity; Tanya Gray; Norman Morrison; Jeremy Selengut; Peter Sterk; Tatiana Tatusova; Nicholas Thomson; Michael J Allen; Samuel V Angiuoli; Michael Ashburner; Nelson Axelrod; Sandra Baldauf; Stuart Ballard; Jeffrey Boore; Guy Cochrane; James Cole; Peter Dawyndt; Paul De Vos; Claude DePamphilis; Robert Edwards; Nadeem Faruque; Robert Feldman; Jack Gilbert; Paul Gilna; Frank Oliver Glöckner; Philip Goldstein; Robert Guralnick; Dan Haft; David Hancock; Henning Hermjakob; Christiane Hertz-Fowler; Phil Hugenholtz; Ian Joint; Leonid Kagan; Matthew Kane; Jessie Kennedy; George Kowalchuk; Renzo Kottmann; Eugene Kolker; Saul Kravitz; Nikos Kyrpides; Jim Leebens-Mack; Suzanna E Lewis; Kelvin Li; Allyson L Lister; Phillip Lord; Natalia Maltsev; Victor Markowitz; Jennifer Martiny; Barbara Methe; Ilene Mizrachi; Richard Moxon; Karen Nelson; Julian Parkhill; Lita Proctor; Owen White; Susanna-Assunta Sansone; Andrew Spiers; Robert Stevens; Paul Swift; Chris Taylor; Yoshio Tateno; Adrian Tett; Sarah Turner; David Ussery; Bob Vaughan; Naomi Ward; Trish Whetzel; Ingio San Gil; Gareth Wilson; Anil Wipat
Journal:  Nat Biotechnol       Date:  2008-05       Impact factor: 54.908

6.  PHAST: a fast phage search tool.

Authors:  You Zhou; Yongjie Liang; Karlene H Lynch; Jonathan J Dennis; David S Wishart
Journal:  Nucleic Acids Res       Date:  2011-06-14       Impact factor: 16.971

7.  Draft Genome Sequences of Four Dickeya dianthicola and Four Dickeya solani Strains.

Authors:  Leighton Pritchard; Sonia Humphris; Steve Baeyen; Martine Maes; Johan Van Vaerenbergh; John Elphinstone; Gerry Saddler; Ian Toth
Journal:  Genome Announc       Date:  2013-07-25

8.  Genome Sequence of the Emerging Plant Pathogen Dickeya solani Strain RNS 08.23.3.1A.

Authors:  Slimane Khayi; Samuel Mondy; Amélie Beury-Cirou; Mohieddine Moumni; Valérie Hélias; Denis Faure
Journal:  Genome Announc       Date:  2014-01-30

9.  Population genomics reveals additive and replacing horizontal gene transfers in the emerging pathogen Dickeya solani.

Authors:  Slimane Khayi; Pauline Blin; Jacques Pédron; Teik-Min Chong; Kok-Gan Chan; Mohieddine Moumni; Valérie Hélias; Frédérique Van Gijsegem; Denis Faure
Journal:  BMC Genomics       Date:  2015-10-14       Impact factor: 3.969

10.  Genomic and metabolic comparison with Dickeya dadantii 3937 reveals the emerging Dickeya solani potato pathogen to display distinctive metabolic activities and T5SS/T6SS-related toxin repertoire.

Authors:  Jacques Pédron; Samuel Mondy; Yannick Raoul des Essarts; Frédérique Van Gijsegem; Denis Faure
Journal:  BMC Genomics       Date:  2014-04-15       Impact factor: 3.969

View more
  5 in total

1.  Comparative genomics and pangenome-oriented studies reveal high homogeneity of the agronomically relevant enterobacterial plant pathogen Dickeya solani.

Authors:  Agata Motyka-Pomagruk; Sabina Zoledowska; Agnieszka Emilia Misztak; Wojciech Sledz; Alessio Mengoni; Ewa Lojkowska
Journal:  BMC Genomics       Date:  2020-06-29       Impact factor: 3.969

2.  Comparison of Highly and Weakly Virulent Dickeya solani Strains, With a View on the Pangenome and Panregulon of This Species.

Authors:  Malgorzata Golanowska; Marta Potrykus; Agata Motyka-Pomagruk; Michal Kabza; Giovanni Bacci; Marco Galardini; Marco Bazzicalupo; Izabela Makalowska; Kornelia Smalla; Alessio Mengoni; Nicole Hugouvieux-Cotte-Pattat; Ewa Lojkowska
Journal:  Front Microbiol       Date:  2018-08-31       Impact factor: 5.640

3.  Resistance of Dickeya solani strain IPO 2222 to lytic bacteriophage ΦD5 results in fitness tradeoffs for the bacterium during infection.

Authors:  Przemyslaw Bartnik; Kinga Lewtak; Marta Fiołka; Paulina Czaplewska; Magdalena Narajczyk; Robert Czajkowski
Journal:  Sci Rep       Date:  2022-06-24       Impact factor: 4.996

4.  Oxygen Availability Influences Expression of Dickeya solani Genes Associated With Virulence in Potato (Solanum tuberosum L.) and Chicory (Cichorium intybus L.).

Authors:  Wioletta Lisicka; Jakub Fikowicz-Krosko; Sylwia Jafra; Magdalena Narajczyk; Paulina Czaplewska; Robert Czajkowski
Journal:  Front Plant Sci       Date:  2018-03-21       Impact factor: 5.753

5.  The Periplasmic Oxidoreductase DsbA Is Required for Virulence of the Phytopathogen Dickeya solani.

Authors:  Tomasz Przepiora; Donata Figaj; Aleksandra Bogucka; Jakub Fikowicz-Krosko; Robert Czajkowski; Nicole Hugouvieux-Cotte-Pattat; Joanna Skorko-Glonek
Journal:  Int J Mol Sci       Date:  2022-01-09       Impact factor: 5.923

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.