| Literature DB >> 30519443 |
Audrey Rohfritsch1, Maxime Galan1, Mathieu Gautier1, Karim Gharbi2, Gert Olsson3, Bernhard Gschloessl1, Caroline Zeimes4, Sophie VanWambeke4, Renaud Vitalis1, Nathalie Charbonnel1.
Abstract
Natural reservoirs of zoonotic pathogens generally seem to be capable of tolerating infections. Tolerance and its underlying mechanisms remain difficult to assess using experiments or wildlife surveys. High-throughput sequencing technologies give the opportunity to investigate the genetic bases of tolerance, and the variability of its mechanisms in natural populations. In particular, population genomics may provide preliminary insights into the genes shaping tolerance and potentially influencing epidemiological dynamics. Here, we addressed these questions in the bank vole Myodes glareolus, the specific asymptomatic reservoir host of Puumala hantavirus (PUUV), which causes nephropathia epidemica (NE) in humans. Despite the continuous spatial distribution of M. glareolus in Sweden, NE is endemic to the northern part of the country. Northern bank vole populations in Sweden might exhibit tolerance strategies as a result of coadaptation with PUUV. This may favor the circulation and maintenance of PUUV and lead to high spatial risk of NE in northern Sweden. We performed a genome-scan study to detect signatures of selection potentially correlated with spatial variations in tolerance to PUUV. We analyzed six bank vole populations from Sweden, sampled from northern NE-endemic to southern NE-free areas. We combined candidate gene analyses (Tlr4, Tlr7, and Mx2 genes) and high-throughput sequencing of restriction site-associated DNA (RAD) markers. Outlier loci showed high levels of genetic differentiation and significant associations with environmental data including variations in the regional number of NE human cases. Among the 108 outliers that matched to mouse protein-coding genes, 14 corresponded to immune-related genes. The main biological pathways found to be significantly enriched corresponded to immune processes and responses to hantavirus, including the regulation of cytokine productions, TLR cascades, and IL-7, VEGF, and JAK-STAT signaling. In the future, genome-scan replicates and functional experimentations should enable to assess the role of these biological pathways in M. glareolus tolerance to PUUV.Entities:
Keywords: RAD sequencing; adaptation; hantavirus; immunity; molecular epidemiology; selection; tolerance; voles
Year: 2018 PMID: 30519443 PMCID: PMC6262921 DOI: 10.1002/ece3.4603
Source DB: PubMed Journal: Ecol Evol ISSN: 2045-7758 Impact factor: 2.912
Figure 1Maps showing the localities of bank vole sampling in Sweden. The red lines delimit the counties with 90% of human hantavirus cases reported. Geographic variations in ecoregions are represented with yellow, orange, and green colors. The contact zone between the two mitochondrial lineages of Myodes glareolus is indicated with a black line
Sampling information
| Localities | County | Latitude | Longitude |
| Date of sampling |
PUUV—human cases |
|---|---|---|---|---|---|---|
| Hörnefors | Västerbotten | N63°33′ 50″ | E19°47′20″ | 49 | Apr 2012 | 13/808/2,152/259,239 |
| Härnösand | Västernorrland | N62°29′30″ | E17°49′00″ | 57 | Apr 2012 | 14/391/1,289/242,347 |
| Njurunda | Västernorrland | N62°15′15″ | E17°29′45″ |
20 |
Apr 2012 | 14/391/1,289/242,347 |
| Gnarp | Gävleborg | 61°51′15″ | 17°12′10″ | 47 | Oct 2012 | 4/58/219/276,323 |
| Enånger | Gävleborg | 61°25′25″ | 16°57′58″ | 28 | Oct 2012 | 4/58/219/276,323 |
| Gimo | Uppsala | 60°33′36″ | 17°50′06″ |
25 |
Apr 2012 | 1/15/95/200,032 |
Localities of sampling and their administrative county, geographic coordinates (center point from where the voles are trapped, voles are all caught within 1 kilometer from that point), the number N of voles trapped, the date of sampling, and the minimum (over 2001–2011), maximum (over 2001–2011) and total number of human cases reported per county between 2001 and 2011 (SMI data) are reported. Note that southern human cases most often correspond to residents spending their holidays in the north of Sweden (Olsson et al., 2009).
Environmental parameters and their potential impacts on Myodes glareolus and Puumala
| Variables | Bank voles | Virus | Resolution | Units | Sources |
|---|---|---|---|---|---|
| PUUV presence | |||||
| Total number of human cases between 2001 and 2011 (SWI dataset) | x | County | |||
| Climatic variables | |||||
| Minimum temperature in winter (MinTempWinter) | x | x |
0.0083° | °C | WorldClim |
| Maximum temperature in summer (MaxTempSummer) | x |
0.0083° | °C | WorldClim | |
| Snow cover (Snow) | x | x |
0.005° | Area percentage | MODIS |
| Annul precipitation (Pp) | x | x |
0.0083° | mm | WorldClim |
| Forest and soil indices | |||||
| Proportion of forest | x | 100 m | Area percentage | Corine 2006 (EEA) | |
| Proportion of broadleaved forest | x | 100 m | Area percentage | Corine 2006 (EEA) | |
| Proportion of coniferous forest | x | 100 m | Area percentage | Corine 2006 (EEA) | |
| Proportion of mixed forest | x | 100 m | Area percentage | Corine 2006 (EEA) | |
| Mean volume of spruce | x | 25 m | m3/ha | SLU Skogskarta | |
| Standard deviation of the volume of spruce | x | 25 m | m3/ha | SLU Skogskarta | |
| Mean volume of pine | x | 25 m | m3/ha | SLU Skogskarta | |
| Standard deviation of the volume of pine | x | 25 m | m3/ha | SLU Skogskarta | |
| Average contiguity index of forest patches | x | 1:20,000 | Relative unit | Lantmäteriet | |
| Average shape index of forest patches | x | 1:20,000 | Relative unit | Lantmäteriet | |
| Perimeter of the forest patches | x | 1:20,000 | m | Lantmäteriet | |
| Soil water index (SWI) | x | x |
25 km | Relative unit | TU‐WIEN |
“x” indicates that the environmental factor may affect bank voles’ abundance or virus survival outside the host.
Figure 2Principal component analysis (PCA) plot of the environmental data (black labels) and of the six populations (blue labels) analyzed. “Dim 1” and “Dim 2” indicate the proportions of variance explained by each axis. Details about environmental variables are provided in Table 2. The inset graph indicates the proportion of variance explained by each subsequent eigenvalues of the covariance matrix of the data
Diversity indices per sampling locality and SNP
| SNP | Indices | Localities | |||||
|---|---|---|---|---|---|---|---|
| Hörnefors | Härnösand | Njurunda | Gnarp | Enånger | Gimo | ||
|
|
| 0.4639 | 0.1759 | 0.000 | 0.000 | 0.000 | 0.000 |
|
| 0.4694 | 0.1579 | 0.000 | 0.000 | 0.000 | 0.000 | |
|
| −0.012 | 0.103 | – | – | – | – | |
|
|
| 0.000 | 0.000 | 0.0294 | 0.1311 | 0.1726 | 0.5065 |
|
| 0.000 | 0.000 | 0.0294 | 0.1389 | 0.1875 | 0.2703 | |
|
| – | – | 0.000 | −0.061 | −0.088 |
| |
|
|
| 0.0600 | 0.0517 | 0.2950 | 0.4261 | 0.4583 | 0.2755 |
|
| 0.0612 | 0.0526 | 0.2941 | 0.4286 | 0.5000 | 0.1622 | |
|
| −0.021 | −0.018 | 0.003 | −0.006 | −0.093 |
| |
|
|
| 0.0791 | 0.0517 | 0.2950 | 0.4409 | 0.4583 | 0.2755 |
|
| 0.0816 | 0.0526 | 0.2941 | 0.4167 | 0.5000 | 0.1622 | |
|
| −0.032 | −0.018 | 0.003 | 0.056 | −0.093 |
| |
|
|
| 0.0600 | 0.0517 | 0.3021 | 0.4343 | 0.4583 | 0.2936 |
|
| 0.0612 | 0.0526 | 0.3030 | 0.4054 | 0.5000 | 0.1892 | |
|
| −0.021 | −0.018 | −0.003 | 0.067 | −0.093 | 0.359 | |
|
|
| 0.0412 | 0.0517 | 0.2950 | 0.4409 | 0.4583 | 0.2755 |
|
| 0.0417 | 0.0526 | 0.2941 | 0.4167 | 0.5000 | 0.1622 | |
|
| −0.011 | −0.018 | 0.003 | 0.056 | −0.093 |
| |
|
|
| 0.0000 | 0.3552 | 0.1383 | 0.4261 | 0.5079 | 0.0000 |
|
| 0.0000 | 0.1404 | 0.1471 | 0.3143 | 0.3750 | 0.0000 | |
|
| – |
| −0.065 | 0.265 | 0.265 | – | |
|
|
| 0.4033 | 0.4988 | 0.3652 | 0.2950 | 0.2285 | 0.4832 |
|
| 0.0204 | 0.0175 | 0.1765 | 0.2941 | 0.1935 | 0.2432 | |
|
|
|
|
| 0.003 | 0.155 |
| |
H e is the gene diversity among individuals within samples; H o is the gene diversity within individuals; F IS is the inbreeding coefficient. All these parameters were estimated with Genepop. Values in bold indicate significant deviation from Hardy–Weinberg equilibrium. p‐values are indicated in brackets.
Figure 3Graphical representations of the Myodes glareolus population genetic structure based on the 95,988 SNPS. (a) Principal component analysis (PCA) based on the variance covariance matrix of the six bank vole populations studied, estimated using BayPass and based on the 95,988 SNPs included in the statistical analyses. (b) Representation of the scaled covariance matrix as estimated from BayPass under the core model with = 1. (c) Multilocus estimates of pairwise genetic differentiation, computed as F ST/(1 − F ST), plotted against logarithm of geographic distances between all possible pairs of bank vole populations. The regression is represented as a dashed line
Figure 4SelEstim analysis of 95,988 SNPs in the six bank vole populations. (a) Kullback–Leibler divergence (KLD) as a function of F ST estimates for all SNPs. Horizontal lines indicate the 99.90% (solid), 99.95% (dashed), and 99.99% (dotted) quantiles computed using the KLD calibration procedure. Outlier SNPs detected at the 99.90% threshold value are depicted in blue. (b) Regression of the locus‐ and population‐specific coefficient of selection, scaled as: ζij ≡ 2 σij (κij − 0.5) (see the Materials and Methods section), to the coordinates of the first principal components of the environmental variables. The blue lines correspond to outlier SNPs detected at the 99.90% threshold value, and the thin gray lines correspond to all other markers
Figure 5Outlier detection based on 95,988 SNPs in the six bank vole populations using BayPass. (a) Correlation between the Kullback–Leibler divergence (KLD) for all SNPs and the X T X statistics estimated using BayPass. Horizontal lines correspond to the 99.90% (solid), 99.95% (dashed), and 99.99% (dotted) quantiles calculated from the KLD calibration procedure. Vertical lines correspond to the 99.90% (solid), 99.95% (dashed), and 99.99% (dotted) quantiles of the X T X distribution calculated using simulations from a predictive distribution based on the observed dataset. SNPs that were detected at the 99.90% threshold value of KLD with SelEstim are depicted in green. SNPs that were detected at the 99.90% threshold value of X T X with BayPass are depicted in blue. SNPs that were detected by both SelEstim and BayPass are depicted in red. (b) Correlation between the statistics eBP (with the environmental synthetic variable being PCA axis 1) and X T X estimated using BayPass. Horizontal and vertical lines correspond to the 99.90%, 99.95% and 99.99% quantiles calculated from the simulations described above. (c) Correlation between the statistics eBP (with the environmental synthetic variable being PCA axis 2) and X T X estimated using BayPass. Horizontal and vertical lines correspond to the 99.90%, 99.95% and 99.99% quantiles calculated from the simulations described above
Outlier SNPs identified using at least one of the three methods implemented to detect signatures of selection: SelEstim, BayPass and BayPass with environmental variables (respectively when associations were found with PC1 or PC2). Only SNPS that belonged to contigs that aligned to the mouse genome and corresponded to genes coding for proteins related to immunity or involved in responses to hantavirus infections are indicated. Gene name and description were obtained in ENSEMBL. Pathways are indicated following KEGG or Reactome results
| Gene ID (ENSEMBL) | SNP (MRK_RAD) | Consensus Sequence | Gene abbreviation and synonyms | Gene name and description | Method of outlier detection | Pathway |
|---|---|---|---|---|---|---|
| ENSMUSG00000026395 | 13,196 | C134235 | Ptprc, B220, CD45R, Cd45, L‐CA, Ly‐5, Lyt‐4, T200, loc | Protein tyrosine phosphatase, receptor type C; protein tyrosine‐protein phosphatase required for T‐cell activation through the antigen receptor. | KLD |
Cell adhesion molecules (CAMs) |
| ENSMUSG00000053158 | 12,626 | C132789 | Fes | Feline sarcoma oncogene | KLD | Developmental biology pathway; axon guidance pathway; IL‐3, IL‐4, IL‐6, and IL‐11 pathways |
| ENSMUSG00000044583 | Candidate gene | Tlr7 | Toll‐like receptor 7; key component of innate and adaptive immunity |
KLD | Toll‐like receptor signaling pathway | |
| ENSMUSG00000000791 | 89,604 | C84915 | Il12rb1, CD212, IL‐12R[b] | Interleukin‐12 receptor, beta 1; functions as an interleukin receptor which binds interleukin‐12 with low affinity and is involved in IL‐12 transduction | eBP‐1 |
Cytokine–cytokine receptor interaction |
| ENSMUSG00000016024 | 94,448 | C96304 | Lbp, Bpifd2, Ly88 | Lipopolysaccharide binding protein; binds to the lipid A moiety of bacterial lipopolysaccharides (LPS) and acts as an affinity enhancer for CD14 | eBP‐1 |
NF‐kappa B signaling pathway |
| ENSMUSG00000024965 | 71,214 | C41693 | Fermt3, C79673, Kindlin3 | Fermitin family homolog 3 (Drosophila); plays a central role in cell adhesion in hematopoietic cells and acts by activating the integrin beta‐1‐3 required for integrin‐mediated platelet adhesion and leukocyte adhesion to endothelial cells | eBP‐1 | Platelet activation |
| ENSMUSG00000039304 | 42,318 | C217696 | Tnfsf10 | Tumor necrosis factor (ligand) superfamily, member 10 | eBP‐1 | Cytokine–cytokine receptor interaction, apoptosis, natural killer cell‐mediated cytotoxicity |
| ENSMUSG00000040254 | 81,126 | C65031 | Sema3d | Semaphorin 3D | eBP‐1 | Cell adhesion molecules and their ligands, axon guidance pathway, immunoglobulin superfamily |
| ENSMUSG00000039191 | 13,804 | C13591 | Rbpj | Recombination signal binding protein for immunoglobulin kappa J region | eBP‐1 | Notch signaling pathway, disease pathway, Th1 and Th2 cell differentiation |
| ENSMUSG00000034218 | 86,924 | C78794 | Atm | Ataxia telangiectasia mutated family protein | eBP‐1 | NF‐kappa B signaling pathway |
| ENSMUSG00000022064 | 94,686 | C96906 | Pibf1 | Progesterone immunomodulatory binding factor 1 | eBP‐1 | Immune system process, regulation of interleukin production |
| ENSMUSG00000052430 | 1560 | C104095 | Bmpr1b | Bone morphogenetic protein receptor, type 1B | eBP‐1 | Cytokine–cytokine receptor interaction pathway, TGF‐beta signaling pathway |
| ENSMUSG00000031309 | 3,690 | C10961 | RpS6ka3 | Ribosomal protein S6 kinase polypeptide 3 | eBP‐1 | Toll‐like receptor 3 (TLR3) cascade pathway, TRAF6 mediated induction of NFkB and MAP kinases upon TLR7/8 or 9 activation pathway, Toll‐like receptor 2 (TLR2) cascade pathway |
| ENSMUSG00000020516 | 18,252 | C147558 | RpS6kb1 | Ribosomal protein S6 kinase, polypeptide 1 | eBP‐1 | TGF‐beta signaling pathway, Fc gamma R‐mediated phagocytosis pathway |
Figure 6TreeMap view of REVIGO Biological Process analyses. Each rectangle represents a single cluster; the clusters are grouped into “superclusters” of related terms, represented with different colors. The size of the rectangles reflects the frequency of the GO term in the set of outliers included in this analysis