| Literature DB >> 22174757 |
Daniela Brites1, Francisco Encinas-Viso, Dieter Ebert, Louis Du Pasquier, Christoph R Haag.
Abstract
In insects and crustaceans, the Down syndrome cell adhesion molecule (Dscam) occurs in many different isoforms. These are produced by mutually exclusive alternative splicing of dozens of tandem duplicated exons coding for parts or whole immunoglobulin (Ig) domains of the Dscam protein. This diversity plays a role in the development of the nervous system and also in the immune system. Structural analysis of the protein suggested candidate epitopes where binding to pathogens could occur. These epitopes are coded by regions of the duplicated exons and are therefore diverse within individuals. Here we apply molecular population genetics and molecular evolution analyses using Daphnia magna and several Drosophila species to investigate the potential role of natural selection in the divergence between orthologs of these duplicated exons among species, as well as between paralogous exons within species. We found no evidence for a role of positive selection in the divergence of these paralogous exons. However, the power of this test was low, and the fact that no signs of gene conversion between paralogous exons were found suggests that paralog diversity may nonetheless be maintained by selection. The analysis of orthologous exons in Drosophila and in Daphnia revealed an excess of non-synonymous polymorphisms in the epitopes putatively involved in pathogen binding. This may be a sign of balancing selection. Indeed, in Dr. melanogaster the same derived non-synonymous alleles segregate in several populations around the world. Yet other hallmarks of balancing selection were not found. Hence, we cannot rule out that the excess of non-synonymous polymorphisms is caused by segregating slightly deleterious alleles, thus potentially indicating reduced selective constraints in the putative pathogen binding epitopes of Dscam.Entities:
Mesh:
Substances:
Year: 2011 PMID: 22174757 PMCID: PMC3236188 DOI: 10.1371/journal.pone.0027947
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Dscam of Daphnia magna.
A) Protein domains; Ig-immunoglobulin domains; FNIII- fibronectin III domains. The grey and black boxes represent the transmembrane and cytoplasmic domains. B) mRNA, each box corresponds to a constitutive exon and the colored boxes 4, 6 and 11, correspond to exons that are the result of mutual exclusive alternative splicing of arrays of duplicated exons, as indicated in C) * Dscam exons of Daphnia sampled in the present study.
Figure 2Dscam horse-shoe structure outline and detailed epitope II.
A) Outline of the Dscam horse-shoe structure formed by the first four Ig domains (D1–D4). B & C) Detail of Epitope II, formed by the two interstrand loops C′-D of exon 4 and A′-B of exon 6, respectively. Each strand is indicated by an encircled letter. The Drosophila aminoacid residues corresponding to the actual structures are in black uppercase initials (exon 4.1 and 6.34 of Dr. melanogaster). Da. magna residues have been positioned in function of the known homology of the molecule in the region coded by exon 4 and 6 (BRITES et al. 2008) and are represented by red lowercase initials. Polymorphic sites at exons 6 for Da. magna and Dr. melanogaster are represented by lowercase initials, each color corresponds to positions on Epitope II coding regions in different paralogous exons 6.
Geographic origin of the Da. magna populations sampled.
| Genotype | Geographic origin | Latitude | Longitude |
|
| Tvärminne, Finland | 59°50.18′N | 23°14.16′E |
|
| Tvärminne, Finland | 59°49.43′N | 23°15.15′E |
|
| Tvärminne, Finland | 59°48.42′N | 23°12.31′E |
|
| Åland Islands, Finland | 60°01.30′N | 19°54.15′E |
|
| Hungary | 46°48′N | 19°08′E |
|
| Hungary | 46°48′N | 19°08′E |
|
| Hungary | 46°48′N | 19°08′E |
|
| Kniphagen, Germany | 54°10.45′N | 10°47.3′E |
|
| Munich, Germany | 48°12.23′N | 11°42.34′E |
|
| Munich, Germany | 48°12.23′N | 11°42.34′E |
|
| Ismaning, Germany | 48°12.23′N | 11°42.34′E |
|
| Leitholm, UK | 55°43.9′N | 02°20.43′W |
|
| Cummor, UK | 51°43.9′N | 01°20.4′W |
|
| Sedlec, Czech Republic | 48°46.52′N | 16°43.41′E |
|
| Leuven, Belgium | 50°52′N | 04°41′E |
|
| Kenia | 0°26.25′N | 35°18.16′E |
|
| Sweden, East coast | 60°25.93′N | 18°31.34′E |
Genotypes for which only array 6 exons were amplified, and which were only used in parts of the analysis.
Number of sites and number of polymorphic sites per Dscam genomic region analyzed in Da. magna (Dmag) and Dr. melanogaster (Dmel), the latter obtained from [10].
| Gene region | N of sites (L) | N of polymorphic sites (S) | ||||||||
| Dmag | Dmel | Dmag | Dmel | |||||||
| Ls | La | Lnc | Ls | La | Ss | Sa | Snc | Ss | Sa | |
| Array 4 total | 218 | 731 | 778 | 458 | 1524 | 4 | 6 | 20 | 11 | 9 |
| Epitopes I | 34 | 117 | n.a. | n.a. | n.a. | 0 | 0 | n.a. | n.a. | n.a. |
| Epitopes II | 56 | 187 | n.a. | 120 | 447 | 2 | 1 | n.a. | 2 | 4 |
| Remaining | 128 | 427 | n.a. | 338 | 1077 | 2 | 5 | n.a. | 9 | 5 |
| Array 6 total | 213 | 628 | 728 | 1443 | 4325 | 17 | 10 | 27 | 60 | 46 |
| Epitopes I | 44 | 124 | n.a. | n.a. | n.a. | 1 | 1 | n.a. | n.a. | n.a. |
| Epitopes II | 40 | 128 | n.a. | 278 | 864 | 0 | 5 | n.a. | 29 | 17 |
| Remaining | 129 | 376 | na | 1164 | 3461 | 16 | 4 | na | 77 | 29 |
| Ig6 coding exon | 81 | 246 | 0 | 60 | 173 | 6 | 4 | 0 | 25 | 0 |
Estimates of Dscam nucleotide diversity (π in Da magna, θ in Dr melanogaster), divergence of orthologous sequences between Da. magna and Da. lumholtzi, and amino acid divergence between paralogous regions of Da. magna, as well as divergence of orthologous sequences between Dr. melanogaster and a reconstructed ancestral sequence estimated in [10].
| Species | Gene region | Diversity (π, θ) | Divergence (k)2 | |||||||
|
| Array 4 Total | 0.0014 | 0.004 | 0.005 | 0.0008 | 0.2 | 0.132 | 0.013 | 0.098 | 0.837 |
| Epitopes I | 0 | n.a. | 0 | 0 | n.a. | 0.118 | 0.000 | 0 | 0.980 | |
| Epitopes II | 0.0014 | n.a. | 0.005 | 0.0009 | 0.18 | 0.164 | 0.032 | 0.195 | 1.431 | |
| Remaining | 0.0014 | n.a. | 0.005 | 0.0004 | 0.08 | 0.137 | 0.004 | 0.029 | 0.567 | |
| Array6 Total | 0.0064 | 0.01 | 0.017 | 0.003 | 0.176 | 0.148 | 0.013 | 0.088 | 0.593 | |
| Epitopes I | 0.003 | n.a | 0.003 | 0.0006 | 0.1 | 0.139 | 0.008 | 0.057 | 1.379 | |
| Epitopes II | 0.007 | n.a. | 0.000 | 0.009 | n.a. | 0.178 | 0.031 | 0.174 | 1.616 | |
| Remaining | 0.007 | n.a. | 0.023 | 0.001 | 0.04 | 0.144 | 0.004 | 0.028 | 0.211 | |
| Exon10 (Ig6) | 0.006 | n.a. | 0.011 | 0.005 | 0.454 | 0.149 | 0.003 | 0.02 | n.a. | |
|
| Array 4 Total | 0.01 | n.a. | 0.024 | 0.006 | 0.25 | 0.039 | 0.003 | 0.077 | n.a. |
| Epitopes II | 0.0106 | n.a. | 0.017 | 0.009 | 0.53 | 0.033 | 0.005 | 0.151 | n.a. | |
| Array 6 Total | 0.018 | n.a. | 0.042 | 0.011 | 0.26 | 0.076 | 0.008 | 0.105 | n.a. | |
| Epitopes II | 0.0253 | n.a. | 0.043 | 0.006 | 0.14 | 0.082 | 0.01 | 0.121 | n.a. | |
| Exon7 (Ig6) | 0.008 | n.a. | 0.033 | 0 | n.a. | 0.083 | 0 | n.a. | n.a. | |
| Remaining Dscam4 | 0.019 | n.a. | 0.048 | 0.009 | 0.18 | 0.067 | 0.005 | 0.075 | n.a. | |
| Control genes5 | n.a. | n.a. | 0.015 | 0.002 | 0.13 | n.a. | n.a. | n.a. | n.a. | |
| Immune genes5 | n.a. | n.a. | 0.016 | 0.009 | 0.56 | n.a. | n.a. | n.a. | n.a. | |
MacDonald Kreitman tests for the comparison between Da. magna and Da. lumholtzi.
| Gene region | Raw values | Corrected MAF | ||||||||
| Fixed | Polymorphic |
| Fixed | Polymorphic |
| |||||
| Syn | Nonsyn | Syn | Nonsyn | Syn | Nonsyn | Syn | Nonsyn | |||
| Array 4 Total | 28 | 9 | 4 | 6 | 0.05 | 28 | 9 | 1 | 0 | 1 |
| Epitopes II | 10 | 7 | 2 | 2 | 1 | 10 | 7 | 0 | 0 | n.a. |
| Array 6 Total | 26 | 7 | 17 | 10 | 0.25 | 29 | 7 | 4 | 2 | 0.6 |
| Epitopes II | 6 | 4 | 0 | 5 | 0.04 | 6 | 4 | 0 | 2 | 0.4 |
| Exon 10 (Ig6) | 10 | 0 | 6 | 4 | 0.08 | 12 | 0 | 0 | 2 | 0.01 |
p values are according to a two-tailed Fisher's exact test. n.a., not assessed.
The test was performed on raw frequencies of alleles as well on frequencies after correcting for minor allele frequency (MAF). This correction was done by eliminating all allele frequencies lower than 0.15 when considering all Da. magna populations.
MacDonald Kreitman tests for the comparison between Dr. melanogaster and an ancestral sequence inferred by [10].
| Gene region | Raw values | Corrected MAF | ||||||||
| Fixed | Polymorphic |
| Fixed | Polymorphic |
| |||||
| Syn | Nonsyn | Syn | Nonsyn | Syn | Nonsyn | Syn | Nonsyn | |||
| Array 4 Total | 13 | 0 | 11 | 9 | 0.005 | 13 | 0 | 5 | 0 | n.a |
| Epitopes II | 3 | 0 | 2 | 4 | 0.16 | 3 | 0 | 0 | 0 | n.a |
| Array 6 Total | 81 | 14 | 60 | 46 | <0.001 | 86 | 18 | 18 | 8 | 0.1 |
| Epitopes II | 17 | 7 | 12 | 17 | 0.051 | 19 | 7 | 2 | 7 | 0.01 |
| Exon 7 (Ig6) | 4 | 0 | 2 | 5 | n.a | 4 | 0 | 1 | 0 | n.a |
p values are according to a two-tailed Fisher's exact test. n.a., not assessed.
The test was performed on raw frequencies of alleles as well on frequencies corrected for minor allele frequency effects (MAF). This correction was done by eliminating all allele frequencies lower than 0.15 when considering all Dr. melanogaster populations.
Likelihood ratio tests and maximum likelihood estimates of dN/dS for six Drosophila species of the melanogaster group.
| Gene region (Models tested) | N° variable sites | LRT | Parameter estimates |
| Array 4 total | |||
| (M1a | 292 | n.s. | ω0 = 0.009 (96%) |
| (M7 vs. M8) | ω1&2 = 1 (4%) | ||
| Epitopes II | 84 | χ2 = 52 | dN/dS = 0.11 |
| Remaining | 208 | p<0.001 | dN/dS = 0.006 |
| Array 6 total | |||
| (M1a | 784 | n.s. | ω0 = 0.03 (94%) 3 |
| (M7 vs. M8) | ω1&2 = 1 (6%) 3 | ||
| Epitopes II | 242 | χ2 = 119 | dN/dS = 0.19 |
| Remaining | 542 | p<0.001 | dN/dS = 0.03 |
Abbreviation: LRT, Likelihood ratio test.
M1a: ω0 varies between 0 and 1 whereas ω1 = 1;
M2a adds to M1a, ω2>1, which is estimated from the data;
proportions of sites under ω0, ω1, and ω2.
Tests whether the dN/dS relative to the two partitions are significantly different from each other.