| Literature DB >> 32381115 |
Sébastien Bridel1,2,3, Frédéric Bourgeon4, Arnaud Marie2, Denis Saulnier5, Sophie Pasek6, Pierre Nicolas7, Jean-François Bernardet1, Eric Duchaud8.
Abstract
Tenacibaculum maritimum is responsible for tenacibaculosis, a devastating marine fish disease. This filamentous bacterium displays a very broad host range and a worldwide geographical distribution. We analyzed and compared the genomes of 25 T. maritimum strains, including 22 newly draft-sequenced genomes from isolates selected based on available MLST data, geographical origin and host fish. The genome size (~3.356 Mb in average) of all strains is very similar. The core genome is composed of 2116 protein-coding genes accounting for ~75% of the genes in each genome. These conserved regions harbor a moderate level of nucleotide diversity (~0.0071 bp-1) whose analysis reveals an important contribution of recombination (r/m ≥ 7) in the evolutionary process of this cohesive species that appears subdivided into several subgroups. Association trends between these subgroups and specific geographical origin or ecological niche remains to be clarified. We also evaluated the potential of MALDI-TOF-MS to assess the variability between T. maritimum isolates. Using genome sequence data, several detected mass peaks were assigned to ribosomal proteins. Additionally, variations corresponding to single or multiple amino acid changes in several ribosomal proteins explaining the detected mass shifts were identified. By combining nine polymorphic biomarker ions, we identified combinations referred to as MALDI-Types (MTs). By investigating 131 bacterial isolates retrieved from a variety of isolation sources, we identified twenty MALDI-Types as well as four MALDI-Groups (MGs). We propose this MALDI-TOF-MS Multi Peak Shift Typing scheme as a cheap, fast and an accurate method for screening T. maritimum isolates for large-scale epidemiological surveys.Entities:
Mesh:
Year: 2020 PMID: 32381115 PMCID: PMC7204230 DOI: 10.1186/s13567-020-00782-0
Source DB: PubMed Journal: Vet Res ISSN: 0928-4249 Impact factor: 3.683
General genome features.
| Strain | Country | Host | Tissue | Date of isolation | Technology | Reads (post-trimming) | Contigs (> 2000pb) | Total length | GC % | Coverage | Number of predicted genomic islands | SNPs vs. NCIMB 2154T | Predicted CDS | Genbank Assembly |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| NCIMB 2154T | Japan | Skin | 1977 | [ | n/a | 1 | 3453 971 | 32.01 | 1734 | 29 | 0 | 2774 | GCA_900119795.1 | |
| TM-KORJJ | Korea | n/a | n/a | PacBio | n/a | 1 | 3333 272 | 31.98 | 300 | 24 | 15 049 | 2735 | GCA_004803875.1 | |
| NBRC 15946 | Japan | n/a | n/a | n/a | HiSeq | n/a | 96 | 3 240 791 | 31.80 | 123 | 20 | 10 084 | 2692 | GCA_000509405.1 |
| P1-39 | France | Liver | 2010 | HiSeq (2 × 100 bp) | 56 562 140 | 104 | 3 337 690 | 31.79 | 93 | 23 | 15 401 | 2788 | GCA_902705535 | |
| P4-45 | France | Skin | 2010 | HiSeq (2 × 100 bp) | 54 681 392 | 73 | 3 349 546 | 31.81 | 70 | 27 | 15 707 | 2843 | GCA_902705495 | |
| 902 | France | Skin | 2013 | HiSeq (2 × 100 bp) | 88 844 854 | 68 | 3 372 337 | 31.80 | 85 | 24 | 15 521 | 2851 | GCA_902705365 | |
| Aq16-85 | French Polynesia | Skin | 2016 | MiSeq (2 × 300 bp) | 2 174 369 | 43 | 3 198 696 | 31.88 | 220 | 21 | 14 929 | 2664 | GCA_902705305 | |
| Aq16-88 | French Polynesia | Skin | 2016 | MiSeq (2 × 300 bp) | 3 434 486 | 45 | 3 196 671 | 31.88 | 301 | 21 | 14 966 | 2665 | GCA_902705275 | |
| Aq16-89 | French Polynesia | Skin | 2016 | MiSeq (2 × 300 bp) | 2 156 781 | 45 | 3 196 642 | 31.89 | 171 | 21 | 14 914 | 2666 | GCA_902705375 | |
| TFA4 | French Polynesia | Skin | 2013 | MiSeq (2 × 300 bp) | 2 369 550 | 66 | 3 356 632 | 31.82 | 226 | 33 | 14 961 | 2865 | GCA_902705565 | |
| FS08(1) | Italy | Skin | 2006 | MiSeq (2 × 300 bp) | 1 256 466 | 54 | 3 399 437 | 31.81 | 87 | 30 | 4424 | 2866 | GCA_902705395 | |
| NAC SLCC MFF | Malta | Skin | 1995 | MiSeq (2 × 300 bp) | 1 091 150 | 80 | 3 352 671 | 31.81 | 76 | 29 | 40 598 | 2795 | GCA_902705345 | |
| USC SP9.1 | Spain | Skin | 1993 | MiSeq (2 × 300 bp) | 727 120 | 80 | 3 395 385 | 31.84 | 46 | 30 | 31 252 | 2779 | GCA_902705515 | |
| DPIF 89/3001-6.2 | Tasmania | Skin | 1989 | MiSeq ( 2 × 300 bp) | 1 019 320 | 129 | 3 448 890 | 31.77 | 66 | 33 | 26 962 | 2788 | GCA_902705315 | |
| DPIF 89/0239-1 | Tasmania | Skin | 1989 | MiSeq (2 × 300 bp) | 1 496 188 | 55 | 3 353 931 | 31.90 | 92 | 28 | 17 743 | 2773 | GCA_902705355 | |
| USC SE30.1 | Spain | Mouth | 1993 | MiSeq (2 × 300 bp) | 749 406 | 111 | 3 544 405 | 31.75 | 52 | 30 | 31 928 | 2928 | GCA_902705525 | |
| UCD SB2 | California | n/a | 1995 | MiSeq (2 × 300 bp) | 1 323 140 | 50 | 3 308 376 | 31.91 | 85 | 23 | 17 064 | 2715 | GCA_902705445 | |
| JIP 32/91-4 | France | Skin | 1991 | MiSeq (2 × 300 bp) | 1 031 822 | 59 | 3 447 003 | 31.80 | 59 | 31 | 17 265 | 2872 | GCA_902705385 | |
| CVI1001048 | Holland | Skin | 2010 | MiSeq (2 × 300 bp) | 1 242 180 | 42 | 3224 047 | 31.94 | 90 | 16 | 1659 | 2670 | GCA_902705265 | |
| FC | Chile | Eye | 1998 | MiSeq (2 × 300 bp) | 1086 816 | 57 | 3 505 634 | 32.01 | 70 | 27 | 16 571 | 2921 | GCA_902705415 | |
| P2-48 | France | Skin | 2010 | MiSeq (2 × 300 bp) | 1 349 672 | 55 | 3 418 994 | 31.86 | 90 | 34 | 43 607 | 2910 | GCA_902705555 | |
| P2-27 | Spain | Skin | 2011 | MiSeq (2 × 300 bp) | 1 087 036 | 88 | 3 371 677 | 31.82 | 76 | 24 | 19 639 | 2821 | GCA_902705465 | |
| JIP 46/00 | France | Skin | 2000 | MiSeq (2 × 300 bp) | 1 155 334 | 56 | 3 371 335 | 31.89 | 73 | 25 | 17 215 | 2781 | GCA_902705435 | |
| JIP 10/97 | France | Skin | 1997 | MiSeq (2 × 300 bp) | 1 345 346 | 52 | 3 333 073 | 31.86 | 86 | 22 | 17 441 | 2746 | GCA_902705285 | |
| NCIMB 2158 | Scotland | Skin | 1981 | MiSeq (2 × 300 bp) | 1 138 826 | 74 | 3 369 590 | 31.87 | 79 | 28 | 17 387 | 2797 | GCA_902705425 |
The list of contributors is available in Additional file 1.
Figure 1Neighbor-joining tree based on a Jukes-Cantor distance of the 25genomes. The tree is based on snippy alignment without gap regions. It is reconstructed by NJ method using a Jukes-Cantor distance. 100 bootstrap replicates were made. A black line denotes a branch support of 100/100, while a grey line denotes a value between 80/100 and 99/100 and a grey dotted line a value below 80/100. The origin of strains is indicated next to the tree and the meanings of each abbreviation are detailed below. The MALDI-Type, the MALDI-Group and the corresponding MALDI isomorphic profile are plotted on the right hand side of the figure. Origin: Australia (Tasmania), AU; Chile, CL; France, FR; French Polynesia, FP; Italy, IT; Japan, JP; Malta, MT; Spain, SP; United Kingdom (Scotland), UK; USA (California), US; no data available, n/d. Fish host species: ASI, Acanthopagrus shlegeli; ANS, Atrasctoscion nobilis; CTS, Carcharias taurus; DLX, Dicentrarchus labrax; EMX, Engraulis mordax; LLA, Latris lineata; OKH, Onchorynchus kisutch; OMS, Onchorynchus mykiss; PMR, Pagrus major; POL, Paralichthys olivaceus; POS, Platax orbicularis; SSR, Salmo salar; SMS, Scophtalmus maximus; SQA, Seriola quinqueradiata; SSS, Solea senegalensis; SAA, Sparus aurata; n/d no data available.
The retained polymorphic biomarkers.
| Biomarkers | First methionine cleavage | H+ | Predicted mass | Observed mass | Delta ppm | Isoform |
|---|---|---|---|---|---|---|
| RpmD | S(2) 91% | 1 | 6700.56 | 6703.90 | 498 | IF1 |
| 6638.48 | 6641.21 | 411 | IF2 | |||
| 6622.48 | 6625.88 | 513 | IF3 | |||
| RpmC | M(1) 99% | 1 | 7245.24 | 7248.41 | 437 | IF1 |
| 7273.26 | 7275.95 | 273 | IF2 | |||
| 7259.31 | 7262.33 | 416 | IF3 | |||
| 7301.31 | 7303.94 | 360 | IF4 | |||
| RpsP | P(2) 98% | 2 | 9168.45 | 9169.730 | 139 | IF1 |
| 9197.49 | 9199.03 | 167 | IF2 | |||
| 9183.47 | 9184.97 | 163 | IF3 | |||
| 9190.48 | 9191.34 | 93 | IF4 | |||
| 9161.44 | 9163.107 | 181 | IF5 | |||
| RpsT | A(2) 97% | 1 | 9404.57 | 9406.69 | 225 | IF1 |
| 9376 | 9379.19 | 225 | IF2 | |||
| RpsN | A(2) 97% | 1 | 10049.41 | 10051.07 | 165 | IF1 |
| 10061.46 | 10064.14 | 266 | IF2 | |||
| RpsQ | M(1) 99% | 1 | 10097.52 | 10099.99 | 244 | IF1 |
| 10070.50 | 10071.79 | 68 | IF2 | |||
| RpsO | M(1) 99% | 1 | 10521.89 | 10524.30 | 229 | IF1 |
| 10507.82 | 10510.00 | 207 | IF2 | |||
| RplX | M(1) 99% | 1 | 11117.46 | 11119.12 | 146 | IF1 |
| 11135.48 | 11136.94 | 131 | IF2 | |||
| RplT | P(2) 88% | 1 | 13171.38 | 13172.27 | 67 | IF1 |
| 13157.34 | 13157.79 | 34 | IF2 | |||
| 13144 | 13145.75 | 57 | IF3 |
Figure 2Characteristic peak shifts used for strain characterization and MALDI-Types definition. The 24 average spectra are plotted, corresponding to all except one sequenced isolates. Intensity varies from 200 to 1000. The RpmC IF4 (strain NAC SLCC MFF) is present but partially masked by a stronger peak shared by the other 23 isolates. The central figure displays 3 different biomarkers that correspond to 5 different peaks. The first at 10 300 m/z is shared by all isolates in our collection and corresponds to RpsS (species biomarker). The two peaks in the middle correspond to RpsN and the last two peaks correspond to RpsO, isoforms 1 and 2 respectively.
Figure 3eBurst network, visual representation of the links between the MALDI-Types and the hierarchical tree based on MALDI-Types isomorphic profile. A eBurst defines clusters when isolates share 8 out of the 9 polymorphic biomarkers. These clusters may be considered as MALDI-Groups. Each of them contains a central MT, which represents the “founder” MT of the corresponding group. B The classification tree, based on average-link hierarchical clustering (average-link method), offers another view of the links existing between MALDI-Types and MALDI-Groups. The distance used for this tree corresponds to the number of differences between each MALDI profile.