Literature DB >> 30040063

Genomic epidemiology of the commercially important pathogen Renibacterium salmoninarum within the Chilean salmon industry.

Sion C Bayliss1, David W Verner-Jeffreys2, David Ryder2, Rudy Suarez3,4, Roxana Ramirez3, Jaime Romero5, Ben Pascoe1, Sam K Sheppard1, Marcos Godoy4,6,7,3, Edward J Feil1.   

Abstract

Renibacterium salmoninarum is the causative agent of bacterial kidney disease (BKD), which is a commercially important disease of farmed salmonids. Typing by conventional methods provides limited information on the evolution and spread of this pathogen, as there is a low level of standing variation within the R. salmoninarum population. Here, we apply whole-genome sequencing to 42 R. salmoninarum isolates from Chile, primarily from salmon farms, in order to understand the epidemiology of BKD in this country. The patterns of genomic variation are consistent with multiple introductions to Chile, followed by rapid dissemination over a 30 year period. The estimated dates of introduction broadly coincide with major events in the development of the Chilean aquaculture industry. We find evidence for significant barriers to transmission of BKD in the Chilean salmon production chain that may also be explained by previously undescribed signals of host tropism in R. salmoninarum. Understanding the genomic epidemiology of BKD can inform disease intervention and improve sustainability of the economically important salmon industry. This article contains data hosted by Microreact.

Entities:  

Keywords:  Renibacterium salmoninarum; aquaculture; bacterial kidney disease; epidemiology; whole-genome sequencing

Mesh:

Year:  2018        PMID: 30040063      PMCID: PMC6202448          DOI: 10.1099/mgen.0.000201

Source DB:  PubMed          Journal:  Microb Genom        ISSN: 2057-5858


Data Summary

Short-read data for isolates sequenced as a part of this work have been archived in the EMBL-EBI database under study accession number PRJEB26486 (www.ebi.ac.uk/ena/data/search?query=PRJEB26486). Assemblies have been deposited in the BIGSdb aquaculture database (url - https://pubmlst.org/bigsdb?db=pubmlst_fish_isolates). Metadata and the tree file are available for download from the Microreact project at https://microreact.org/project/SkR7uB10M. Bacterial kidney disease (BKD) causes a high level of mortality and morbidity in salmonid aquaculture worldwide. The causative agent of BKD, Renibacterium salmoninarum, has a slow mutation rate and a highly conserved genome, which limits the discriminatory power of conventional typing techniques such as multi-locus sequence typing and pulsed-field gel electrophoresis. Whole-genome sequencing (WGS) allows for almost all genomic variation to be used for molecular tracing and phylogenetic reconstruction. This technique has been successfully applied to the fine-scale epidemiology of monomorphic human pathogens, such as Mycobacterium tuberculosis and Yersinia pestis, and a global collection of R. salmoninarum [Traynor BJ. Neuroepidemiology 2009;33:276–9; Brynildsrud O, Feil EJ, Bohlin J, Castillo-Ramirez S, Colquhoun D et al. ISME J 2014;8:746–56]. For the first time, we apply WGS to a collection of R. salmoninarum isolates from Chile, a country that hosts the second largest salmon industry in the world and in which BKD imposes a significant commercial burden. We use WGS for fine-scale molecular epidemiology of R. salmoninarum, and to place the Chilean isolates within the context of a global isolate collection. Our results point to multiple introductions of BKD into Chile from global sources over a 30 year period, followed by rapid spread within Chile. This study provides a basis for the development of more informed disease management strategies.

Introduction

Aquaculture is the fastest growing livestock production sector, and has a critical role in the economic welfare of many low- and middle-income countries. Since 2014, the majority of finfish consumed globally are from farmed stocks [1]. However, the rapid expansion and intensification of aquaculture presents serious challenges with respect to sustainability and infectious-disease management. A notorious example is the devastating outbreak of infectious salmon anemia (ISA) virus that swept through Chilean salmon farms in 2007, bankrupting the industry and leaving debts of 1.8 billion US dollars (1.4 billion pounds) [2]. The Chilean Atlantic salmon industry has subsequently recovered, and is now the second largest in the world with an annual production of 502 000 tonnes in 2016 [1]. However, other diseases now pose a major threat to the sustainable development of aquaculture in Chile. One such example is bacterial kidney disease (BKD), caused by the Gram-positive bacillus Renibacterium salmoninarum, a chronic granulomatous disease that affects salmonid species worldwide. R. salmoninarum was first isolated in Chile from chum salmon (Oncorhynchus keta, Walbaum) reared in seawater cages [3], and has gone on to cause disease in Atlantic salmon (Salmo salar), rainbow trout (Oncorhynchus mykiss) and coho salmon (Oncorhynchus kisutch). Treatment of BKD places a significant burden on the industry and currently requires the second largest application of antibiotics in both saltwater and freshwater aquaculture in Chile [4]. Whole-genome sequencing (WGS) provides a powerful approach to tracking the emergence and spread of pathogens [5]. Many of the analytical methods and platforms developed to manage pathogens of relevance to public health can be directly applied to aquaculture [6]. In addition to understanding the epidemiology of R. salmoninarum, WGS data can also inform vaccine design and improve understanding of host switching and adaptation. R. salmoninarum was the first aquaculture pathogen to which WGS was applied in order to understand evolutionary and epidemiological dynamics [7]. The study revealed a major phylogenetic division within the R. salmoninarum population. The data indicated that the pathogen had been introduced to Europe from North America on multiple occasions over the last 50 years via trade in eggs and fry. The data also pointed to frequent host switching of the pathogen between host species. Despite the commercial burden of R. salmoninarum in Chile, the origins, diversity and patterns of spread of this pathogen within this country remain unknown. A recent study used a range of methods to demonstrate that 39 R. salmoninarum isolates recovered from four different Chilean salmon farms between 2014 and 2016 were genetically and phenotypically homogeneous [8]. However, to date, the level of genetic heterogeneity within the Chilean population has not been investigated using WGS. Here, we describe an analysis of WGS data for 42 R. salmoninarum isolates representing geographically widespread Chilean fish farms. All the isolates were collected as a part of the General Sanitary Mortality Management Program (PSGM) and General Sanitary Management Reproduction Fish Program (PSGR) surveillance programmes. By comparing our data to data from a global collection of R. salmoninarum genomes, we identified multiple introductions of the BKD-causing R. salmoninarum into Chile. The data also points to a host-restricted lineage, indicating that the frequency of host switching may not always be as high as previously thought. Alternatively, it suggests that other barriers to inter-host transmission may exist in the Chilean fish production chain that have had a strong influence on the spread of R. salmoninarum in Chile.

Methods

Isolate collection

BKD was listed as a High Risk Disease by the Chilean government as a part of the EAR (Enfermedades de Alto Riesgo) classification in July 9 2013 [9]. A total of 42 R. salmoninarum samples were collected and analysed for the current study (Table 1). These isolates were collected as part of two ongoing Chilean surveillance programmes. The General Sanitary Mortality Management Program (PSGM) involves companies and private laboratories performing passive surveillance of diseases according to sanitary/productive criteria [10]. The Servicio Nacional de Pesca (National Fisheries Service) produces sanitary and production PSGM reports at weekly and monthly intervals respectively. The General Sanitary Management Reproduction Fish Program (PSGR) is a programme that mandates screening of all breeding females and has encouraged the voluntary screening of breeding males of all salmonid species since January 24 2003 [11].
Table 1.

Sample metadata

Additional metadata is available for download from the Microreact project at: https://microreact.org/project/SkR7uB10M.

Sample IDLineageHostCountryRegionYearAccession no.
NCIMB_11111UnknownUSA/UKUnknownNot knownERR327924
Car_961O. tshawytschaUSAWashington1996ERR327957
WR99_c21O. kisutchUSAWashington1999ERR327960
ATCC_332091O. tshawytschaUSAOregon1974NC_010168.1
D61O. tshawytschaUSAOregon1982ERR327961
NCIMB_22351O. tshawytschaUSAOregon1974ERR327911
GR51T. thymallusUSAMontana1997ERR327959
05372K1O. tshawytschaUSAGrande Ronde Basin, Oregon2005ERR327906
Cow-chs-941O. tshawytschaUSACowlitz River, Washington1994ERR327915
Carson_5b1O. tshawytschaUSAConfluence Tyee Creek and Wind River, Washington1994ERR327905
993261O. mykissUKWales, site Y1999ERR327938
993321O. mykissUKWales, site Y1999ERR327943
993291O. mykissUKWales, site X1998ERR327937
993331O. mykissUKWales, site X1998ERR327921
993451O. mykissUKWales, site X1998ERR327948
MT4441S. salarUKWestern Isles, Scotland1988ERR327916
MT14701O. mykissUKTayside, Scotland1994ERR327910
MT15111O. mykissUKStrathclyde, Scotland, site B1994ERR327914
MT33151O. mykissUKStrathclyde, Scotland, site B2008ERR327928
MT13631O. mykissUKStrathclyde, Scotland1993ERR327920
MT18801S. salarUKStrathclyde, Scotland1996ERR327909
MT31061O. mykissUKStrathclyde, Scotland2006ERR327939
MT34821S. salarUKStrathclyde, Scotland2009ERR327934
MT34831S. salarUKStrathclyde, Scotland2009ERR327941
MT34791S. salarUKOrkney, Scotland2008ERR327933
MT12621S. salarUKHighlands, Scotland1992ERR327922
MT13511S. salarUKHighlands, Scotland1993ERR327904
MT29431S. salarUKHighlands, Scotland2005ERR327936
MT29791O. mykissUKHighlands, Scotland2005ERR327935
MT8391S. salarUKHighlands, Scotland1990ERR327917
MT4521O. mykissUKDumfries and Galloway, Scotland, site A1988ERR327918
MT32771O. mykissUKDumfries and Galloway, Scotland, site A2008ERR327926
MT33131O. mykissUKCentral Scotland2008ERR327925
50071O. mykissUKScotland2005ERR327923
MT2391S. salarUKScotland1988ERR327913
MT8611S. salarUKScotland1990ERR327919
90251O. mykissUKYorkshire, England2009ERR327912
960711O. mykissUKHampshire, England, site Z1996ERR327927
993411O. mykissUKHampshire, England, site Z1998ERR327949
993441O. mykissUKHampshire, England1998ERR327940
12051O. mykissUK2001ERR327930
71051O. mykissUK2007ERR327932
993271O. mykissUK1997ERR327931
74391S. salarNorwaySognefjorden, Sogn og Fjordane1984ERR327971
68631O. mykissNorwayOsterøy, Hordaland2009ERR327965
52231S. salarNorwayKvinnherad, Hordaland2005ERR327964
65531S. salarNorwayHemne, Sør-Trøndelag2008ERR327955
66421S. salarNorwayHemne, Sør-Trøndelag2008ERR327956
66941O. mykissNorwayHemne, Sør-Trøndelag2008ERR327962
66951O. mykissNorwayHemne, Sør-Trøndelag2008ERR327968
Rs_81S. salarCanadaNew Brunswick2008ERR327944
Rs_101S. salarCanadaNew Brunswick2009ERR327945
Rs_21S. salarCanadaNew Brunswick2005ERR327951
Rs_31S. salarCanadaNew Brunswick2005ERR327947
Rs_41S. salarCanadaNew Brunswick2006ERR327946
Rs_51S. salarCanadaNew Brunswick2007ERR327950
Rs_61S. salarCanadaNew Brunswick2007ERR327953
BPS_911O. gorbuschaCanadaNanaimo, British Columbia1991ERR327952
BQ96_91–11O. kisutchCanadaNanaimo, British Columbia1996ERR327963
50061O. kisutchCanadaBella Bella, British Columbia1996ERR327942
DR1431S. fontinalisCanadaAlberta1972ERR327954
NCIMB_11142S. salarUKRiver Dee, Scotland1962ERR327908
NCIMB_11162S. salarUKRiver Dee, Scotland1962ERR327907
74482S. salarNorwayStranda, Møre og Romsdal1986ERR327970
74412S. salarNorwayStorfjord, Møre og Romsdal1985ERR327966
74492S. salarNorwaySkjervøy, Troms1987ERR327969
6842S. truttaNorwayAurland, Sognefjorden, Sogn og Fjordane1987ERR327958
74502S. salarNorwayAskøy, Hordaland1987ERR327967
Ch21aS. salarChileXI2014ERS2444820
Ch411cO. mykissChileXII2016ERS2444846
Ch31bS. salarChileX2013ERS2444838
Ch271cS. salarChileXII2016ERS2444843
Ch301aS. salarChileXII2015ERS2444817
Ch11aS. salarChileXI2015ERS2444819
Ch91cO. kisutchChileXI2013ERS2444851
Ch221aS. salarChileX2013ERS2444827
Ch331aS. salarChileX2012ERS2444825
Ch161bS. salarChileX2012ERS2444837
Ch211dS. salarChileVIII2014ERS2444844
Ch361aS. salarChileXI2016ERS2444823
Ch61aO. kisutchChileX2013ERS2444847
Ch101bS. salarChileX2012ERS2444840
Ch181bS. salarChileX2013ERS2444839
Ch251bO. kisutchChileX2013ERS2444850
Ch241aS. salarChileXII2015ERS2444818
Ch371bO. mykissChileX2013ERS2444845
Ch341aS. salarChileXI2016ERS2444824
Ch201aS. salarChileXI2016ERS2444822
Ch401bS. salarChileXI2016ERS2444836
Ch291aO. kisutchChileX2013ERS2444848
Ch131aO. kisutchChileX2013ERS2444849
Ch311cO. kisutchChileX2015ERS2444856
Ch231cO. kisutchChileX2015ERS2444857
Ch261cO. kisutchChileX2015ERS2444858
Ch351cO. kisutchChileX2013ERS2444854
Ch391cO. kisutchChileX2015ERS2444853
Ch191aS. salarChileIX2013ERS2444831
Ch281aS. salarChileIX2012ERS2444835
Ch71aS. salarChileIX2015ERS2444829
Ch151aS. salarChileIX2015ERS2444830
Ch141aS. salarChileIX2013ERS2444834
Ch51aS. salarChileIX2013ERS2444832
Ch81aS. salarChileIX2013ERS2444833
Ch111aS. salarChileX2012ERS2444828
Ch381aS. salarChileX2016ERS2444826
Ch41cO. kisutchChileX2015ERS2444852
Ch421cO. kisutchChileX2016ERS2444855
Ch321aS. salarChileXI2015ERS2444821
Ch171bS. salarChileIX2013ERS2444841
Ch121bS. salarChileIX2013ERS2444842

Sample metadata

Additional metadata is available for download from the Microreact project at: https://microreact.org/project/SkR7uB10M.

WGS

R. salmoninarum isolates were grown on KDM-2 solid medium [12] at 15 °C, with single colonies being selected and grown for DNA extraction. Paired-end sequencing was performed on the Illumina MiSeq platform with a read length of 300 bp. Short reads have been deposited in the ENA archive under project accession number PRJEB26486 (Table 1). Short-read sequences from Brynildsrud et al. [7] were included as a reference collection and analysed using the same tools and settings detailed below [7].

Mapping and variant calling

Reads from the current study alongside those from the reference collection were mapped to the reference genome ATCC 33209 using smalt 0.7.6 (default settings) and a mean insert size of 300 bp [13]. Reads containing insertions or deletions were realigned using the Genome Analysis Toolkit's Indel Realigner 3.2.2 [14]. Single nucleotide polymorphisms (SNPs) were called using SAMtools 0.1.18 [15]. Variants were filtered using in-house scripts to include only SNPs with >4× read depth per base (>2 per strand), >95 % read support for the alternative variant and mapping quality >30. All bioinformatics analyses were performed using MRC CLIMB [16].

Phylogenetic analysis

The consensus sequences from aligning short reads to the complete genome sequence of ATCC 33209 were used to reconstruct the phylogeny of R. salmoninarum. Regions in repetitive tracts of DNA, or within 50 bp of a repetitive region as identified by Nucmer, were masked from downstream analyses [17]. A maximum-likelihood (ML) tree using RAxML v8.2.10 produced trees that exhibited poor bootstrap support values at multiple nodes [18]. Initial observation of a distance matrix of the SNP-based alignment indicated that a number of isolates shared zero SNP differences and visualization of the bootstrap tree using treespace indicated that isolates with zero SNP distances were being variably positioned within the phylogeny tree, negatively impacting on the reproducibility of the tree [19]. All isolates with zero SNP distances were clustered together and a representative sequence was selected from the cluster using the isolate with the lowest number of missing sites from an alignment of variable sites [20]. The date of the earliest isolate from each cluster was used for subsequent dating analyses. Removal of the redundant sequences produced an alignment of 85 single genome and representative sequences for downstream analysis. There were no recombination regions identified in the data set by either ClonalFrameML 1.11 or Gubbins 2.3.1 [21, 22]. ML trees were reconstructed using a rapid bootstrap analysis best-scoring tree search in RAxML with 100 bootstraps to generate support values [18]. Trees and figures were visualized using ggtree 1.09.00 and ggplot 2.2.1 in the R software environment [23, 24].

Establishing the timescale of disease emergence

The root-to-tip distance of the ML tree produced by RAxML was analysed using TempEst 1.5.1 (supplementary analysis) using the heuristic residual mean squared value to assess regression fit [25]. Root-to-tip analysis indicated that there was a clock-like signal within the data set. Therefore, beast was used to further investigate the temporal signal within the data [26]. The data set containing both single isolate genomes and genome sequences representative of multiple isolates detailed previously was used for tip-based divergence dating. Genome sequences that represented multiple identical genome sequences (0 pairwise SNP differences) sampled at different times were assigned the date of isolation of the oldest isolate. An initial beast run was performed to test for deviations from a strict clock using an uncorrelated log normal relaxed clock model and a Bayesian Skyline tree prior (exponential prior, mean=10, stdev=0.3337). Three individual runs were performed and these converged to produce comparable parameter estimates. The standard deviation from the mean of the uncorrelated lognormal relaxed clock was small (mean of combined runs: 0.449), indicating that there was low variation in rates among branches. When this parameter takes values greater than 1 then the standard deviation in branch rates is greater than the mean rate, and the data is considered to exhibit substantial rate heterogeneity among lineages. This was considered sufficient evidence to test the assumptions of a strict molecular clock (normal prior, mean=3.324×10–4, sigma=0.477×10–4) on the data using a prior estimate of 3.324×10–4 mutations per site per year as calculated by previous authors [7]. Both strict and relaxed clock models were performed using the general time reversible (GTR) model of nucleotide substitutions (exponential prior with an alpha of 2 and beta of 0.5) with a Bayesian skyline tree model (Jeffreys prior). The Markov chain Monte Carlo (MCMC) value was sampled every 1000 of 200 000 000 iterations. The tree produced by these analyses with the highest maximum clade credibility was annotated with the dates and associated confidence intervals (CIs) assigned to each node. Each model was repeated in three independent runs and compared to ensure consistently good mixing and consensus results. In addition to this, a run sampling only from the prior was performed for both analyses to ensure that none of the estimates were expressions of the priors. The relaxed clock was applied to downstream analysis as previous authors suggest that it represents real phylogenies more accurately than a strict clock model [27].

Results and Discussion

Phylogenetic analysis reveals multiple lineages of R. salmoninarum in Chile

The phylogenetic analysis of WGS sequences from 42 Chilean R. salmoninarum isolates, combined with the global collection from Brynildsrud et al. [7] is presented in Fig. 1. A Microreact project has been created for the data presented in Fig. 1, which allows for visualization of the tree and associated metadata in an interactive user interface [28]. The Microreact project is available at https://microreact.org/project/SkR7uB10M. Whole-genome assemblies have been deposited in the BIGSdb aquaculture database (https://pubmlst.org/fish/).
Fig. 1.

ML phylogenetic tree of 110 R. salmoninarum genomes. Short reads were mapped against reference genome ATCC 33209 and consensus sequence alignments were passed to RAxML for phylogenetic reconstruction. Inset: phylogenetic tree of all R. salmoninarum isolates. Clade 1 isolates are highlighted in blue. Main panel: phylogenetic tree of R. salmoninarum clade 1. The multi-isolate Chilean lineages are labelled. Chilean isolates are highlighted in red. Bootstrap values are shown at each node.

ML phylogenetic tree of 110 R. salmoninarum genomes. Short reads were mapped against reference genome ATCC 33209 and consensus sequence alignments were passed to RAxML for phylogenetic reconstruction. Inset: phylogenetic tree of all R. salmoninarum isolates. Clade 1 isolates are highlighted in blue. Main panel: phylogenetic tree of R. salmoninarum clade 1. The multi-isolate Chilean lineages are labelled. Chilean isolates are highlighted in red. Bootstrap values are shown at each node. The phylogenetic tree confirms two deep-branching clades within the R. salmoninarum population, as noted by Brynilsrud et al., with little intra-clade heterogeneity [7] (Fig. 1, inset). All of the Chilean isolates sequenced as a part of the present study were found within the much more deeply sampled, aquaculture-associated clade 1 [7]. This clade contains nearly all of the known genome sequences of R. salmoninarum recovered from salmonids from North America and Europe. Clade 1 has low genetic heterogeneity, with less than 500 SNP differences between the most distant isolates within the clade. This observation broadly supports a recent study that used a range of methods to demonstrate that 39 R. salmoninarum isolates recovered from four different Chilean salmon farms between 2014 and 2015 were genetically and phenotypically homogeneous [8]. The Chilean isolates belong to three multi-isolate lineages (1a, 1b, 1c) and one singleton lineage (1d) (Fig. 1; Microreact – https://microreact.org/project/SkR7uB10M). The nomenclature 1a, 1b, 1c and 1d is used here to describe the Chilean R. salmoninarum lineages, but these groupings are distinct from those used by previous authors to delineate phylogenetic groupings within clade 1 [7]. None of these clusters contained isolates from outside of Chile, providing initial evidence that there has been very limited transmission of the pathogen from Chile to other countries. However, the data reveal that each of the lineages, except for the singleton lineage 1d, have spread between multiple administrative regions within Chile (Fig. 2). Lineage 1a is the largest of the three Chilean lineages and contains 22 isolates sampled between 2012 and 2016 from the three central regions (IX, X and XI) and one southern region (XII). Lineage 1b contains nine isolates collected from three regions (IX, X and XI) in central Chile between 2012 and 2016. Lineage 1c contains ten isolates collected from central and southern regions X, XI and XII between 2013 and 2016. The single representative isolate from lineage 1d was recovered from region VIII in 2014.
Fig. 2.

A map of Chile with administrative regions highlighted. The five regions from which R. salmoninarum isolates were collected as part of the current study are indicated. Bar charts show the number of isolates collected from each lineage in each region. The maximum value of the y-axis in all bar charts is seven.

A map of Chile with administrative regions highlighted. The five regions from which R. salmoninarum isolates were collected as part of the current study are indicated. Bar charts show the number of isolates collected from each lineage in each region. The maximum value of the y-axis in all bar charts is seven. Within each lineage, the Chilean isolates are more closely related to one another than to isolates from other countries, consistent with clonal dissemination within Chile. However, two of the Chilean lineages share a recent common ancestor with isolates from other countries; thus, indicating potential sources of these introductions. Chilean lineage 1a is closely related, and positioned immediately proximal to, two North American isolates, Car96 and Rs_4. A cluster of four Canadian isolates (plus a single exceptional Norwegian isolate) are positioned immediately basal to those two North American isolates. This points to a North American origin of lineage 1a. Similarly, lineage 1b is immediately proximal to four isolates from Scotland, consistent with a transmission from Scotland to Chile. There were no isolates within the collection immediately basal to lineage 1c. Strains GR5 (USA), MT444 (UK) and 5006 (Canada) were located basally, but were too distant to suggest a recent introduction from these sources. The singleton representative isolate of lineage 1d does not cluster with any of the other isolates, Chilean or otherwise, within the collection. Both lineage 1c and 1d are sufficiently distant from other isolates within the collection that no inference of transmission source can be drawn. These lineages likely represent introductions into Chile from source populations that have not been sampled within the current study, such as those in Asia.

Spread of introduced lineages within the Chilean aquaculture network

Raising of salmonids for farming is typically separated into at least two stages. Eggs are produced from managed breeding lines (broodstock), hatched and raised to a juvenile stage (smolt) in freshwater hatcheries, before being transferred to seawater sites to ‘grow-out’ until the adult animal is of sufficient size for harvesting. Information on the stage in the food production chain at which a diseased individual was sampled was available for the majority of the Chilean isolates as given in the Microreact project (Fig. 3). This included anonymized information on the company managing the hatcheries and farm sites, the freshwater hatchery at which it was raised and the point of sampling, whether that was a freshwater hatchery for broodstock, fingerlings and smolt or a seawater/freshwater farm site for post-smolt to adult fish.
Fig. 3.

Sampling site and history within the food production network for all Chilean isolates. A timed phylogeny for each lineage is presented on the left. Tip labels contain the life stage of the host animal and the year in which a sample was taken. Tips are coloured by host species (Atlantic salmon, blue; rainbow trout, orange; coho salmon, red). The company, hatchery and point of sample origin are shown as coloured columns to the right of the phylogeny. Each unique company, hatchery and farm site is designated with a unique colour. In those cases in which the colour provided for the point of origin matches the colour provided for the hatchery then this indicates that the sample was taken in the hatchery. Metadata fields for which there is no data were left blank.

Sampling site and history within the food production network for all Chilean isolates. A timed phylogeny for each lineage is presented on the left. Tip labels contain the life stage of the host animal and the year in which a sample was taken. Tips are coloured by host species (Atlantic salmon, blue; rainbow trout, orange; coho salmon, red). The company, hatchery and point of sample origin are shown as coloured columns to the right of the phylogeny. Each unique company, hatchery and farm site is designated with a unique colour. In those cases in which the colour provided for the point of origin matches the colour provided for the hatchery then this indicates that the sample was taken in the hatchery. Metadata fields for which there is no data were left blank. There is no clear clustering of strains or lineages by company or freshwater or saltwater site (Fig. 3; Microreact – https://microreact.org/project/SkR7uB10M). Each of the lineages was sampled from hatchery or farm sites of two or more companies. In three cases, a cluster of effectively identical isolates, such as Ch7 and Ch30, were sampled from sites owned by two separate companies. Moreover, phylogenetically distinct lineages were observed to have been sampled concurrently from the same site, for example multiple isolates from lineage 1a and a single isolate from lineage 1b (Ch40) were collected from freshwater site 1 (Fig. 3, dark green) in 2016. This indicates that there has been frequent transmission between sites owned by different companies, and our analysis has not been overly biased by over-sampling from specific sites or stages of the food production process. It is unknown at which stage in the food production chain R. salmoninarum infection is acquired. Highly similar strains were observed in isolates raised on the same hatchery sites over consecutive years, such as isolates Ch8, Ch5, Ch19 and Ch28 between 2012–2013 (Fig. 3, light green), suggesting related dominant strains, private to individual sites, persist from year to year. Conversely, R. salmoninarum from diseased fish sampled from hatchery site 6 and from farm sites stocked by smolt from site 6 were found to be representatives of all three major Chilean lineages (1a, 1b and 1c). These data suggest that hatchery sites may be colonized by multiple unrelated lineages. An alternative explanation would be that strains contributing to outbreaks on farms represent a mixture of strains acquired from both farm and hatchery sites. Further sampling of ‘house strains’ within individual hatchery and farm sites would be required to quantify to what extent isolates acquired in seawater or freshwater farm sites contribute to disease in the field.

Limited host association amongst Chilean lineages

R. salmoninarum has been previously shown to have a high rate of host switching, with genetically identical, or near-identical, isolates capable of causing disease in multiple host species [7]. The data presented here broadly support this, as each of the Chilean lineages co-occurred in multiple host species. Lineage 1a was predominantly from Atlantic salmon (19/22), with the remaining three strains isolated from coho salmon. One of the coho salmon isolates, Ch13, was genetically identical to a strain isolated from Atlantic salmon, Ch20, suggesting a very recent host transmission. The three coho salmon isolates also clustered closely with five Atlantic salmon isolates. Lineage 1b was also predominantly from Atlantic salmon hosts (7/9), with single strains from rainbow trout and coho salmon. In contrast, lineage 1c was predominantly (8/10) isolated from coho salmon, with single isolates from Atlantic salmon and rainbow trout. Lineage 1c is significantly associated with a coho salmon host when compared the rest of the collection (Table S1, available in the online version of this article; Fishers exact P=<0.005). Despite this, lineage 1c contains another example of a very recent inter-host transmission, as rainbow trout isolate Ch41 is genetically identical to coho salmon isolate Ch26. The association between lineage 1c and coho salmon may imply a previously undescribed signal of host association in R. salmoninarum. However, it seems likely that the clustering of isolates from coho salmon in lineage 1c is a product of the surveillance programme sampling efforts, as all but one of the coho salmon isolates in lineage 1c were collected from farms supplied by a single hatchery or from the hatchery itself (FW3, Fig. 3; Microreact - https://microreact.org/project/SkR7uB10M). The only example of lineage 1c in Atlantic salmon was also from a site owned by the same company. Therefore, lineage 1c may have had limited opportunity to spread outside of the immediate production network since its initial introduction. However, farm sites in Chile often contain multiple farmed species in close proximity, which would provide many opportunities for host-switching. The presence of isolates Ch41 and Ch27, from rainbow trout and Atlantic salmon, respectively, suggests that lineage 1c has the capacity for inter-host transmission.

Dating the introductions of R. salmoninarum into Chile

Having described four introductions of R. salmoninarum to Chile, three of which resulted in significant onward spread within Chile, we used the WGS data to estimate the dates of these international transmission events. The low rate of recombination in this species facilitates such an analysis, and leads to more clock-like accumulation of mutations [7]. A root-to-tip analysis of a ML phylogeny confirmed a significant temporal signal within the R. salmoninarum data set (supplementary analysis, Fig. S1). In order to estimate the rate of molecular evolution, beast2, a software platform for Bayesian analysis of molecular sequences using the Markov chain Monte Carlo (MCMC) method, was applied to a tip-dated phylogeny to generate a tree with estimates of divergence dates assigned to the nodes within the phylogeny [26]. Application of a relaxed clock model suggested that there was a clock-like signal within the data (standard deviation from the mean=0.449). The results of a relaxed and strict clock analysis produced similar estimates of divergence dates (Table S2, Fig. 4). The mutation rate predicted by the relaxed clock model was 2.09×10−7 mutations per genome per year, which is only marginally slower than the previous estimate from the global data alone of 3.79×10−7 mutations per genome per year [7] (Table S2). These values are also comparable to mutation rate estimates from monomorphic, human pathogens such as Mycobacterium tuberculosis [29]. The slight decrease in our estimate from the previous estimate for this species may be attributed to the removal of SNPs from repetitive regions in the current study. This produced 3504 variable sites compared to 3600 in the previous study. Repetitive regions are both more likely to contain false positive SNPs due to problematic mapping in repetitive sequences and also to contain horizontally acquired mobile genetic elements that have the potential for higher mutation and recombination rates [30]. It was observed during the root-to-tip regression analysis that older isolates had, on average, more mutations than the model expectation. This may be due to mutations accrued during the long-term storage and sub-culturing of historical isolates. We did not account for this bias in the current work, but incorporation of this effect into the clock model should be a consideration for future researchers working with historical isolates.
Fig. 4.

beast reconstruction of the tip-dated phylogenetic tree of R. salmoninarum clade 1 using a relaxed-clock model. Short reads were mapped against reference genome ATCC 33209 and consensus sequence alignments were passed to beast for phylogenetic reconstruction. Left panel: phylogenetic tree of all R. salmoninarum clade 1 isolates. Chilean isolates are highlighted in red. Chilean multi-isolate lineages are labelled. Tip labels of Chilean isolates are highlighted in red. Right panel: phylogenetic trees of the three Chilean multi-isolate lineages (1a, 1b, 1c). Tips are coloured by host species (Atlantic salmon, blue; rainbow trout, orange; coho salmon, red). Estimated divergence dates are annotated at each node and 95 % CIs indicated (red bar).

beast reconstruction of the tip-dated phylogenetic tree of R. salmoninarum clade 1 using a relaxed-clock model. Short reads were mapped against reference genome ATCC 33209 and consensus sequence alignments were passed to beast for phylogenetic reconstruction. Left panel: phylogenetic tree of all R. salmoninarum clade 1 isolates. Chilean isolates are highlighted in red. Chilean multi-isolate lineages are labelled. Tip labels of Chilean isolates are highlighted in red. Right panel: phylogenetic trees of the three Chilean multi-isolate lineages (1a, 1b, 1c). Tips are coloured by host species (Atlantic salmon, blue; rainbow trout, orange; coho salmon, red). Estimated divergence dates are annotated at each node and 95 % CIs indicated (red bar). The date for the introduction of lineage 1a was estimated as 1996 (95 % CI: 1988–2003), lineage 1b as 2008 (95 % CI: 2005–2011) and lineage 1c as 1986 (95 % CI: 1967–2002) (Fig. 4, Table S2). The estimated date of lineage 1c coincides with the rise of large-scale commercial aquaculture in Chile in the 1980s (Fig. S2). The first confirmed report of outbreaks of BKD in Chile was in 1983 in saltwater net pen reared Oncorhynchus from region XI [3]. Further outbreaks were subsequently confirmed in 1984 in the southern region X hatcheries. The current analysis would suggest that lineage 1c isolates in the current study represent descendants of this original outbreak that have diversified and spread around the network of farms that raise coho salmon, with limited spread outside of this network. The first salmonids introduced to Chile were Oncorhynchus species from the USA and Japan that were introduced as part of ranching programmes [31]. These were hatchery raised and released into rivers. As the current collection contains no isolates from Japan and few isolates from the USA, we are unable to support or refute a North American or Japanese origin of the initial 1980s outbreaks of BKD in Chile. The tree dating analysis suggests that lineages 1a (1996) and 1b (2008) were introduced after the rise of Atlantic salmon aquaculture, and that ongoing introductions of R. salmoninarum from other countries further contributed to cases of BKD in Chile. The estimated introduction date of lineage 1a in ~1996 (likely from North America) coincides with the shift in the Chilean aquaculture industry from coho salmon to Atlantic salmon production in the mid-1990s (Fig. S2) [32]. Consistent with this, lineage 1a was predominantly identified in Atlantic salmon stock. The previous analysis suggests that lineage 1b was introduced at a more recent date coinciding with the major outbreaks of ISA in Chile in 2007 [33]. The phylogenetic analysis suggests that lineage 1b is closely related to isolates recovered from Atlantic salmon farmed in Scotland. Although there is some dispute, studies have implicated transfer of Atlantic salmon ova from Europe as a likely source of introduction of the virus ISA into Chile [34]. Indeed, the volume of imported Atlantic salmon ova peaked in 2008, with the two largest exporters being Norway (149 710 000 eggs) and Scotland (55 889 500 eggs) (Fig. S3). These data are, thus, consistent with the evidence provided by the WGS data in support for the introduction of lineage 1b into Chile from Scotland in around 2008. The source and date of introduction of lineage 1d into Chile is unclear. The tree dating analysis estimates the date at which lineage 1d (5853) diverged from the most recent common ancestor of the most closely related isolate in the collection (MT2943) as 1924 (95 % CI: 1883–1964), which could represent a pre-industrial introduction of R. salmoninarum to Chile. Alternatively, the isolate could have arisen from a geographically-distinct source population that was not sampled as a part of the current collection. There is currently insufficient evidence to test either hypothesis, but we consider the latter hypothesis to be more likely.

Conclusion

Our WGS analysis reveals that there have been at least four introductions of R. salmoninarum into Chile, three of which resulted in substantial onward spread. All four of these Chilean lineages arose from within clade 1, which has previously been described as a successful, aquaculture-associated, clade that has spread internationally. As there are no salmon species native to South America, the most likely mechanism by which R. salmoninarum was introduced is via live fish or egg movements from other countries. Phylogenetic reconstruction of a collection of R. salmoninarum from Chile, North America and Europe indicated that introductions have taken place repeatedly over the last 35 years, most likely via transfer of Oncorhynchus species and Atlantic salmon for stocking of rivers and aquaculture purposes. These data would suggest that the source of the introductions was most likely of North American and European origin, but that sample coverage in these regions was insufficient to confidently establish the exact sources of these introductions into Chile. Furthermore, two of the Chilean lineages are not closely related to any of the other clade 1 isolates, which suggests that the source of lineages 1c and 1d remained unsampled within the current collection. Historical records suggest that further sampling of both North American and Japanese isolates may help to identify the populations from which these endemic lineages arose. The Chilean lineages have spread pervasively throughout the aquaculture production network in this country, with representative samples isolated from farms and hatcheries owned by multiple companies. We suggest that the observed phylogeny is not adequately explained by any single stage in the food production chain. This is likely to be a product of the spread of strains via the trade in eggs and contributed to by sub-lineages endemic to individual farms and hatcheries. These possibilities have important implications for understanding the spread of R. salmoninarum and the management of BKD in fish farming. The Chilean lineages were found in multiple host species, but lineage 1c was significantly associated with a single host species, coho salmon. This lineage was predominantly sampled from sites owned by a single company, but may represent the early stages of adaptation to a host species, something not previously observed in R. salmoninarum. This study suggests that R. salmoninarum has been introduced into Chile multiple times since the development of large-scale aquaculture. The presence of distinct, low-prevalence lineages, such as lineage 1d, may indicate that these introductions are common, but that these lineages fail to become established over large geographical ranges. The parity between the estimated introduction dates and key developments in the salmon industry would suggest that perturbations in the industry were formative events in the dissemination of these successful lineages throughout Chile. Potentially unsampled reservoirs of R. salmoninarum in the form of salmon that run the rivers in Chile should not be discounted as factors that have contributed to the longevity of these successful lineages. Further sampling of R. salmoninarum over a range of geographical locations, both within Chile and worldwide, and from wild host species is necessary to build a deeper understanding of the current and historical movements of R. salmoninarum. Brynildsrud O, Feil EJ, Bohlin J, Castillo-Ramirez S, Colquhoun D et al. European Bioinformatics Institute (EBI) Short Read Archive. Accession numbers ERR327904 to ERR327971 inclusive (2014). Click here for additional data file.
  21 in total

1.  The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data.

Authors:  Aaron McKenna; Matthew Hanna; Eric Banks; Andrey Sivachenko; Kristian Cibulskis; Andrew Kernytsky; Kiran Garimella; David Altshuler; Stacey Gabriel; Mark Daly; Mark A DePristo
Journal:  Genome Res       Date:  2010-07-19       Impact factor: 9.043

2.  Evidence by the fluorescent antibody test for the occurrence of Renibacterium salmoninarum among salmonid fish in Chile.

Authors:  J E Sanders; M J Barros
Journal:  J Wildl Dis       Date:  1986-04       Impact factor: 1.535

3.  The Sequence Alignment/Map format and SAMtools.

Authors:  Heng Li; Bob Handsaker; Alec Wysoker; Tim Fennell; Jue Ruan; Nils Homer; Gabor Marth; Goncalo Abecasis; Richard Durbin
Journal:  Bioinformatics       Date:  2009-06-08       Impact factor: 6.937

4.  ISA virus in Chile: evidence of vertical transmission.

Authors:  Siri Vike; Stian Nylund; Are Nylund
Journal:  Arch Virol       Date:  2008-11-26       Impact factor: 2.574

5.  treespace: Statistical exploration of landscapes of phylogenetic trees.

Authors:  Thibaut Jombart; Michelle Kendall; Jacob Almagro-Garcia; Caroline Colijn
Journal:  Mol Ecol Resour       Date:  2017-05-15       Impact factor: 7.090

6.  CLIMB (the Cloud Infrastructure for Microbial Bioinformatics): an online resource for the medical microbiology community.

Authors:  Thomas R Connor; Nicholas J Loman; Simon Thompson; Andy Smith; Joel Southgate; Radoslaw Poplawski; Matthew J Bull; Emily Richardson; Matthew Ismail; Simon Elwood- Thompson; Christine Kitchen; Martyn Guest; Marius Bakke; Samuel K Sheppard; Mark J Pallen
Journal:  Microb Genom       Date:  2016-09-20

7.  Genome-scale rates of evolutionary change in bacteria.

Authors:  Sebastian Duchêne; Kathryn E Holt; François-Xavier Weill; Simon Le Hello; Jane Hawkey; David J Edwards; Mathieu Fourment; Edward C Holmes
Journal:  Microb Genom       Date:  2016-11-30

8.  Whole genome analysis of Yersinia ruckeri isolated over 27 years in Australia and New Zealand reveals geographical endemism over multiple lineages and recent evolution under host selection.

Authors:  Andrew C Barnes; Jerome Delamare-Deboutteville; Nicholas Gudkovs; Cara Brosnahan; Richard Morrison; Jeremy Carson
Journal:  Microb Genom       Date:  2016-11-30

9.  First detection, isolation and molecular characterization of infectious salmon anaemia virus associated with clinical disease in farmed Atlantic salmon (Salmo salar) in Chile.

Authors:  Marcos G Godoy; Alejandra Aedo; Molly J T Kibenge; David B Groman; Carmencita V Yason; Horts Grothusen; Angelica Lisperguer; Marlene Calbucura; Fernando Avendaño; Marcelo Imilán; Miguel Jarpa; Frederick S B Kibenge
Journal:  BMC Vet Res       Date:  2008-08-04       Impact factor: 2.741

10.  BEAST 2: a software platform for Bayesian evolutionary analysis.

Authors:  Remco Bouckaert; Joseph Heled; Denise Kühnert; Tim Vaughan; Chieh-Hsi Wu; Dong Xie; Marc A Suchard; Andrew Rambaut; Alexei J Drummond
Journal:  PLoS Comput Biol       Date:  2014-04-10       Impact factor: 4.475

View more
  3 in total

1.  Renibacterium salmoninarum and Mycobacterium spp.: two bacterial pathogens present at low levels in wild brown trout (Salmo trutta fario) populations in Austrian rivers.

Authors:  M R Delghandi; S Menanteau-Ledouble; K Waldner; M El-Matbouli
Journal:  BMC Vet Res       Date:  2020-02-03       Impact factor: 2.741

2.  Assessing the presence and spread of Renibacterium salmoninarum between farmed and wild fish in Sweden.

Authors:  David B Persson; Anna Aspán; Paulina Hysing; Eva Blomkvist; Eva Jansson; Ludvig Orsén; Hampus Hällbom; Charlotte Axén
Journal:  J Fish Dis       Date:  2022-01-29       Impact factor: 2.580

3.  Atlantic Salmon Pre-smolt Survivors of Renibacterium salmoninarum Infection Show Inhibited Cell-Mediated Adaptive Immune Response and a Higher Risk of Death During the Late Stage of Infection at Lower Water Temperatures.

Authors:  Marco Rozas-Serri; Carlos Lobos; Rodolfo Correa; Ricardo Ildefonso; Jorge Vásquez; Ariel Muñoz; Lucerina Maldonado; Victoria Jaramillo; Darling Coñuecar; Camila Oyarzún; Romina Walker; Carolina Navarrete; Jorge Gayosa; Patricio Mancilla; Andrea Peña; Carolina Senn; Francisco Schwerter
Journal:  Front Immunol       Date:  2020-06-30       Impact factor: 7.561

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.