| Literature DB >> 25775214 |
Jacob A Tennessen1, André Théron2, Melanie Marine1, Jan-Ying Yeh1, Anne Rognon2, Michael S Blouin1.
Abstract
Schistosomiasis, a neglected global pandemic, may be curtailed by blocking transmission of the parasite via its intermediate hosts, aquatic snails. Elucidating the genetic basis of snail-schistosome interaction is a key to this strategy. Here we map a natural parasite-resistance polymorphism from a Caribbean population of the snail Biomphalaria glabrata. In independent experimental evolution lines, RAD genotyping shows that the same genomic region responds to selection for resistance to the parasite Schistosoma mansoni. A dominant allele in this region conveys an 8-fold decrease in the odds of infection. Fine-mapping and RNA-Seq characterization reveal a <1Mb region, the Guadeloupe Resistance Complex (GRC), with 15 coding genes. Seven genes are single-pass transmembrane proteins with putative immunological roles, most of which show strikingly high nonsynonymous divergence (5-10%) among alleles. High linkage disequilibrium among three intermediate-frequency (>25%) haplotypes across the GRC, a significantly non-neutral pattern, suggests that balancing selection maintains diversity at the GRC. Thus, the GRC resembles immune gene complexes seen in other taxa and is likely involved in parasite recognition. The GRC is a potential target for controlling transmission of schistosomiasis, including via genetic manipulation of snails.Entities:
Mesh:
Year: 2015 PMID: 25775214 PMCID: PMC4361660 DOI: 10.1371/journal.pgen.1005067
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Fig 1Resistance phenotypes.
Standard errors of proportions are indicated by vertical bars. (A) Susceptibility declined rapidly over five generations in snail lines exposed to 10 (R10) or 30 (R30) miracidia, when only uninfected snails were allowed to contribute to the next generation (N = 46–100). A control line (GUA) that was not exposed to the parasite showed no comparable change (N = 46–65). (B) Among 289 snails, the R allele at the GRC locus grc1 is strongly correlated with resistance in a dominant fashion. There are no significant differences in resistance among genotypes with R, nor among genotypes without R.
Fig 2FST values between the unselected control line (GUA) and each selected line (R10 and R30).
Three null RAD markers, aligned with BWA [65] to Scaffold1, all showed unusually high FST in one or both comparisons (red). These markers conform to perfect linkage disequilibrium with the haplotypes as determined by Sanger sequencing, with the different FST values owing only to the error inherent in estimating allele frequencies from null markers. An additional unaligned marker with high FST in both comparisons was identified with Stacks [35] and found to reside on Scaffold4 (blue).
Markers used in Sanger sequencing to delimit borders of GRC.
| Site | r |
|---|---|
| Scaffold1_1501k | 0.41 |
| Scaffold1_1541k | 1 |
| Scaffold1_1732k | 1 |
| Scaffold1_1904k | 1 |
| Scaffold1_1940k | 1 |
| Scaffold4_1468k | 1 |
| Scaffold4_1305k | 1 |
| Scaffold4_1187k | 0.99 |
| Scaffold4_1081k | 0.58 |
aPosition in B. glabrata reference genome v. BglaB1 (scaffold, followed by site position in kb)
bcorrelation coefficient (r) with grc1 among 94 snails
c grc1
d grc2
Fig 3Pairwise divergence of GRC genes.
Genes are aligned to their approximate genomic position on the x-axis, staggered slightly for ease of visualization. Scaffold7477 has been inserted into its inferred position within Scaffold1. Scaffold4 has been inverted relative to its arbitrarily designated reference genome orientation to indicate that only the end of this scaffold is part of the GRC. Pairwise divergence for all three haplotype combinations (indicated by color) is shown in the left y-axis. Gene symbols vary for ease of distinguishing adjacent genes, and to show divergence type: solid symbols represent nonsynonymous divergence (dN) in coding genes, open symbols represent silent (synonymous and noncoding) divergence in coding genes (dS), and other symbols represent divergence in noncoding genes. Only the six TM1 genes in the center of the region of association (grctm2–7) show high (>1%) dN. The names of the two most promising candidates, grctm5 and grctm6, are highlighted in red. Brown lines and squares indicate the boundaries of the GRC region of statistical association as determined by Sanger sequencing markers: for these markers, the right y-axis indicates linkage disequilibrium correlation (r; Table 1) with marker locus grc1 (labeled). The gap between Scaffold1 and Scaffold4 (dotted line) is of unknown size, but contains no expressed genes with sequence or expression differences among haplotypes, as these would have been detected in our RNA-Seq analysis.
Genes in the GRC.
| Gene | Description | E-value | Site | ORF | RS1 dN
| RS1 dS
| RS2 dN
| RS2 dS
| S1S2 dN
| S1S2 dS
|
|---|---|---|---|---|---|---|---|---|---|---|
|
| hyaluronidase | 1.00E-162 | Scaffold1 1520–1531k | 1479 | 0.68 | 3.71 | 0.68 | 3.92 | 0.51 | 3.52 |
|
| noncoding | NA | Scaffold1 1534–1536k | 0 | NA | 2.69 | NA | 4.58 | NA | 3.68 |
|
| myosin light chain kinase | 2.00E-12 | Scaffold1 1541–1553k | 5604 | 0.46 | 4.06 | 0.29 | 2.3 | 0.43 | 2.59 |
|
| noncoding | NA | Scaffold1 1571–1573k | 0 | NA | 7.07 | NA | 8.24 | NA | 4.21 |
|
| noncoding | NA | Scaffold1 1573–1574k | 0 | NA | 7.92 | NA | 5.16 | NA | 5 |
|
| noncoding | NA | Scaffold1 1574–1575k | 0 | NA | 4.15 | NA | 12.04 | NA | 10.71 |
|
| noncoding | NA | Scaffold1 1607–1609k | 0 | NA | 1.57 | NA | 1.05 | NA | 1.95 |
|
| single-pass transmembrane | NA | Scaffold1 1610–1621k | 1002 | 0.64 | 0.19 | 0.26 | 0.19 | 0.64 | 0.38 |
|
| noncoding | NA | Scaffold1 1624–1626k | 0 | NA | 3.1 | NA | 1.93 | NA | 1.76 |
|
| noncoding | NA | Scaffold1 1626–1627k | 0 | NA | 0.62 | NA | 1.86 | NA | 2.48 |
|
| noncoding | NA | Scaffold1 1786–1787k | 0 | NA | NA | NA | NA | NA | NA |
|
| receptor-type tyrosine-protein phosphatase | 1.00E-28 | Scaffold1 1792–1806k, Scaffold7477 2–3k | 2013 | 10.75 | 13.26 | 2.44 | 3.6 | 10.91 | 13.45 |
|
| chitinase | 6.00E-12 | Scaffold1 1821–1837k | 864 | 1.63 | 3.38 | 7.1 | 16.5 | 7.38 | 17.79 |
|
| noncoding | NA | Scaffold1 1865–1867k | 0 | NA | 5.36 | NA | 4.02 | NA | 3.51 |
|
| chitinase | 5.00E-13 | Scaffold1 1863–1877k | 1104 | 4.35 | 5.52 | 6.37 | 6.35 | 5.55 | 6.06 |
|
| noncoding | NA | Scaffold1 1884–1885k | 0 | NA | 4.06 | NA | 7.55 | NA | 6.53 |
|
| chitinase | 9.00E-14 | Scaffold1 1886–1895k | 855 | 5.53 | 7.51 | 7.46 | 7.9 | 6.94 | 6.92 |
|
| single-pass transmembrane | NA | Scaffold1 1906–1936k | 2013 | 8.74 | 5.66 | 7.85 | 3.46 | 8.57 | 6.16 |
|
| single-pass transmembrane | NA | Scaffold1 1922–1934k | 1359 | 8.1 | 3.52 | 5.44 | 8.4 | 9.36 | 11.39 |
|
| noncoding | NA | Scaffold4 1497–1500k | 0 | NA | 0.66 | NA | 0.9 | NA | 1.56 |
|
| hemolysin | 3.00E-16 | Scaffold4 1477–1501k | 2385 | 0.32 | 0.52 | 0.22 | 1.17 | 0.32 | 0.91 |
|
| paraspeckle component | 5.00E-89 | Scaffold4 1441–1456k | 2028 | 0.19 | 1.27 | 0 | 0.59 | 0.19 | 0.85 |
|
| zinc finger | 0 | Scaffold4 1406–1423k | 5400 | 0.35 | 1.26 | 0.52 | 0.81 | 0.4 | 1.17 |
|
| zinc finger | 1.00E-130 | Scaffold4 1395–1403k | 2256 | 0.28 | 1.31 | 0.5 | 1.07 | 0.67 | 1.39 |
|
| ADP ribosylation factor GTPase-activating protein | 5.00E-151 | Scaffold4 1368–1387k | 1416 | 0 | 1.52 | 0.18 | 0.88 | 0.18 | 1.09 |
|
| noncoding | NA | Scaffold4 1365–1366k | 0 | NA | 1.22 | NA | 0.91 | NA | 1.11 |
|
| palmitoyltransferase | 4.00E-68 | Scaffold7477 5–8k | 903 | 1.28 | 3.71 | 0.71 | 6.44 | 0.99 | 5.44 |
agene name following Bayne [70]
bfor coding genes, putative protein description based on BLAST hits and secondary structure
clowest E-value for protein sequence BLASTed against NCBI non-redundant protein sequences. "NA" means non-coding or no E-values < 10-2
dapproximate location in B. glabrata reference genome v. BglaB1 via BLAST (site position in kb)
esize of open reading frame in bp
fnonsynonymous divergence between R and S1 (%)
gsilent (synonymous and noncoding) divergence between R and S1 (%)
hnonsynonymous divergence between R and S2 (%)
isilent (synonymous and noncoding) divergence between R and S2 (%)
jnonsynonymous divergence between S1 and S2 (%)
ksilent (synonymous and noncoding) divergence between S1 and S2 (%)
Fig 4Allele-specific nonsynonymous substitution among alleles of the seven TM1 genes.
Alleles of the same gene are aligned to each other (gaps indicated by whitespace), but across genes only the transmembrane domain (TM) is aligned (extracellular (N-terminal) regions are to the left of the TM). If a haplotype includes two copies of a gene that is single-copy on the other haplotypes, both copies are shown (e.g. gene grctm5 is duplicated on the R haplotype, so we label those sequences Ra and Rb). For each allele, we calculated allele-specific nonsynonymous substitution (i.e. divergence from the inferred ancestral sequence) in 75bp sliding windows, indicated by color (“allele-specific dN”). Regions with no sequence similarity are indicated in red. Substitution across a 75bp window could not be calculated in black sections. Fibronectin III domains (FN3) are shown. The premature stop codon in grctm1 S2 is shown with an asterisk. Nonsynonymous substitution is extremely high across the TM1 genes, exceeding 15% in some windows for all genes except grctm1, and occasionally reaching over 30%. Only grctm5 and grctm6 show high nonsynonymous substitution specific to R alleles.