Literature DB >> 23308258

Patterns of nucleotide diversity at the regions encompassing the Drosophila insulin-like peptide (dilp) genes: demography vs. positive selection in Drosophila melanogaster.

Sara Guirao-Rico1, Montserrat Aguadé.   

Abstract

In Drosophila, the insulin-signaling pathway controls some life history traits, such as fertility and lifespan, and it is considered to be the main metabolic pathway involved in establishing adult body size. Several observations concerning variation in body size in the Drosophila genus are suggestive of its adaptive character. Genes encoding proteins in this pathway are, therefore, good candidates to have experienced adaptive changes and to reveal the footprint of positive selection. The Drosophila insulin-like peptides (DILPs) are the ligands that trigger the insulin-signaling cascade. In Drosophila melanogaster, there are several peptides that are structurally similar to the single mammalian insulin peptide. The footprint of recent adaptive changes on nucleotide variation can be unveiled through the analysis of polymorphism and divergence. With this aim, we have surveyed nucleotide sequence variation at the dilp1-7 genes in a natural population of D. melanogaster. The comparison of polymorphism in D. melanogaster and divergence from D. simulans at different functional classes of the dilp genes provided no evidence of adaptive protein evolution after the split of the D. melanogaster and D. simulans lineages. However, our survey of polymorphism at the dilp gene regions of D. melanogaster has provided some evidence for the action of positive selection at or near these genes. The regions encompassing the dilp1-4 genes and the dilp6 gene stand out as likely affected by recent adaptive events.

Entities:  

Mesh:

Substances:

Year:  2013        PMID: 23308258      PMCID: PMC3538593          DOI: 10.1371/journal.pone.0053593

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

In Drosophila, like in all holometabolous insect species, adult body size is mainly determined during the larval stages, as a product of both the growth rate and the duration of the growth period in each larval phase. Nutrition plays also a critical role in determining adult body size, since variation in caloric intake (quality and amount) in larval stages causes variation in growth rate, which in turn affects size at different larval stages. In insects, the insulin-signaling pathway is the main known metabolic pathway involved in establishing adult body size [1]–[4]. This pathway also plays a central role in fundamental biological processes such as metabolism, reproduction, aging and growth [4]–[6]. Multiple observations concerning variation in Drosophila adult body size are indicative of its adaptive character [7]–[23]. Moreover, life history traits such as fertility and lifespan generally reflect adaptive responses to environmental pressures and, thus, both positive and negative selection might have played an important role in the molecular evolution of the genes underlying such characters [24]. Genes involved in this pathway are, therefore, good candidates to have experienced adaptive changes and to reveal the footprint of positive selection. It has, indeed, been recently shown that the insulin-like receptor, which is the first component of the insulin-signaling pathway, has undergone adaptive change in its evolutionary past [25], [26]. The Drosophila insulin-like peptides (DILPs) are the ligands that trigger the insulin-signaling cascade. In Drosophila melanogaster, there are several dilp genes encoding proteins that are structurally similar to mammalian insulin. In D. melanogaster, five dilp genes are located on the 3L chromosomal arm at cytological position 67C8-9 (genes dilp1-5) whereas genes dilp6 and dilp7 are on the X chromosome at cytological positions 3A1 and 3E2, respectively. The autosomal genes consist of a cluster with four contiguous genes (dilp1-4) and a fifth gene (dilp5) separated from the rest by one intervening gene (figures S1 and S2). Over the last decade, the isolation and characterization of diverse D. melanogaster mutants for the dilp genes has provided experimental support for the involvement of these genes in regulating body size [27]–[29]. dilp genes are independently transcriptionally regulated in response to nutrition, as well as in a tissue- and stage-specific manner during development [27], [30]. It has been shown that genes dilp1, dilp2 and dilp3 are consecutively expressed during larval stages, whereas dilp 6 controls growth specifically during the pupal stage. Among dilp genes, dilp2 is the most potent growth promoter and dilp3 is the gene most responsive to diet changes [29]. It has also been shown that DILPs can act redundantly [29]. There is, therefore, some evidence for both functional differentiation and functional redundancy among DILPs. In new environments, like those encountered by D. melanogaster in its colonization and expansion through Europe, there would be ample opportunities for selection to have acted either separately or jointly on dilp genes. Moreover, the out-of-Africa expansion of the species imposed demographic changes during this process. In order to detect the putative action of positive selection in this very recent past of dilp genes, we surveyed nucleotide variation at each of the dilp genes in a European population of D. melanogaster, because levels and patterns of polymorphism can be informative on recent adaptive changes. We also compared levels of polymorphism and divergence from D. simulans at synonymous and nonsynonymous sites of coding regions, because this comparison can be informative of adaptive amino acid replacements after the species split. These analytical approaches capture different aspects of the footprint left by positive selection on DNA sequences, and in the window of evolutionary time in which they are able to detect this footprint. Our comparison of polymorphism and divergence at synonymous and nonsynonymous sites at the dilp genes has provided no evidence for amino acid adaptive substitutions in any of the DILPs since the split of D. melanogaster and D. simulans. In contrast, our survey of polymorphism at dilp gene regions of D. melanogaster has provided some evidence for the recent action of positive selection at or near these genes. Indeed, the dilp1-4 region exhibited a significant excess of high-frequency derived variants (as indicated by a highly negative H value), whereas the spatial pattern of variation at the dilp6 region had a significantly better fit to a selective than to a non-selective model.

Materials and Methods

Drosophila Strains

Twelve isochromosomal lines for each the third chromosome (CNIII) and the X chromosome (CNX) that had been extracted from a natural population of Drosophila melanogaster (Sant Sadurní d’Anoia, Barcelona, Spain) were used to obtain the sequences of the dilp genes located on the third (dilp1-5) and X (dilp6 and dilp7) chromosomes, respectively. These lines were obtained and kindly provided by D. Orengo [31]. In addition, two highly inbred Drosophila simulans lines –SAL and VSAL from a natural population in Alella (Barcelona, Spain) obtained by 10 generations of sib mating– were used to sequence the autosomal dilp genes, whereas one X-isochromosomal line –MO [32] from a natural population in Montblanc (Tarragona, Spain)– was used to sequence the X-linked genes.

Polymerase Chain Reaction (PCR) Amplification and Sequencing

DNA was extracted from 1 and 10 individuals (in case of highly inbred lines and isochromosomal lines, respectively) using either a modification of protocol 48 in Ashburner [33] or the Puregene DNA purification kit (Puregene, Gentra systems). The Oligo version 4.1 program [34] was used to design oligonucleotides, for both PCR amplification and DNA sequencing, based on the dilp genes sequences retrieved from Flybase [35] available at http://flybase.bio.indiana.edu/. The amplification products were purified either with a single-strand DNA enzymatic hydrolysis reaction (“ExoSAP-IT” method, USB), or with Amicon Microcon-PCR columns (Millipore). All fragments were cycle sequenced using the ABI PRISM Dye Terminator Cycle Sequencing Ready Reaction kit (Applied Biosystems) and subsequently separated on ABI PRISM 3700 automated DNA sequencer (ABI Applied Biosystems). DNA sequences were obtained on both strands for each line. All new sequences in this article have been deposited in the EMBL database under the accession numbers HE654131-HE654180.

DNA Sequence Analysis

For each line and sequenced region, the DNA sequences were assembled using the SeqMan version 5.53 program (DNASTAR, Madison, WI). Sequences were multiply aligned and manually edited using the ClustalW program [36] and the MacClade version 3.05 program [37], respectively. Intraspecific and interspecific analyses were performed using the DnaSP version 5.10.01 program [38]. Nucleotide polymorphism was estimated by the number of segregating sites (S), nucleotide diversity (π; [39]), the number of haplotypes (h) and haplotype diversity (Hd; [39]). Interspecific divergence was estimated as the number of nucleotide substitutions per site (K). Synonymous (Ks) and nonsynonymous (Ka) divergence was estimated with the Nei and Gojobori method [40], and subsequently corrected according to Jukes and Cantor [41].

Neutrality Tests

The multilocus HKA test ([42]; J. Hey, http://lifesci.rutgers.edu/~heylab/index.html) was used to evaluate the putative heterogeneity across regions of the polymorphism to divergence ratio. Confidence intervals were established via coalescent simulations as well as using the Chi-square approximation. The McDonald and Kreitman (MK) test [43] was performed to examine whether the ratio of polymorphic to fixed changes was similar at synonymous and nonsynonymous sites. The Tajima [44] and Fay and Wu [45], [46] tests were conducted to examine whether the frequency spectrum of polymorphic nucleotide mutations conformed to neutral expectations. For each gene or sequenced region, the orthologous D. simulans sequence was used as outgroup in the normalized Fay and Wu H test, which is based on the unfolded frequency spectrum. Monte Carlo simulations based on the coalescent process [47] were carried out to obtain the P-values of these tests both under the standard neutral model (hereafter SNM), and under a previously described bottleneck model [48], [49]. Confidence intervals were established from the distribution of each test statistic obtained from 1000 simulated replicates. All simulations were carried out using the mlcoalsim version 1 program [50]. Simulations were performed fixing the number of segregating sites (S) and taking into account the uncertainty of θ ([51]; Rejection Algorithm method, RA), and with the possibility of multiple hits. The simulations were performed using a uniform prior distribution of θ values (ranging from 0.001 to 0.06) and two estimates of the population recombination rate –RM, which is obtained from the comparison of the physical and genetic maps ([52]; Release 5.19 of the Drosophila melanogaster Recombination Rate Calculator)–, and R0.25 =  RM/4 as an intermediate value of R.

Composite Likelihood Ratio (CLR) Test and Goodness-of Fit (GOF) Test

The statistical composite likelihood method implemented in the clsw program [53] was used to evaluate whether there is any evidence for a recent selective event (hitchhiking effect) in any of the four regions studied. We applied test B, which uses the level of variation estimated from the data (θ; [54]) to calculate the likelihood, and option 1, which uses information from an outgroup (D. simulans) to distinguish between ancestral and derived alleles. Significance was established through comparison of the CLR value obtained from the data to the CLR values obtained from simulations under the SNM and also under a bottleneck model [48], [49]. When the CLR test revealed a better fit of the data to the selective sweep than to both the SNM model and the bottleneck (BN) model, the GOF test [55] was performed to discriminate false positives. In this test, the null model is the selective sweep model and the alternative model is a general model in which the number of sequences at each position that carry the derived mutation are binomially distributed. The null distribution was obtained by applying the GOF test to a simulated data set obtained under the selective sweep model using the mlcoalsim version 1 program [50].

Results

Polymorphism in Drosophila Melanogaster

Nucleotide polymorphism was estimated both for the four dilp regions (table 1, tables S1 and S2) and for each of the seven dilp genes (table 2). A total of 204 segregating sites (figures S1, S2, S3) were detected with singletons in over half of the sites (i. e., 113 out of 204). The estimated level of nucleotide diversity varied from 0.003 to 0.008 for π total and from 0.003 to 0.011 for π silent. Table 2 summarises nucleotide polymorphism at different functional classes of the dilp genes. A total of 65 segregating sites were detected in these genes, 42 of which were singletons. Levels of silent nucleotide diversity were similar at the dilp2, 3, 4, and 7 genes (table 2). These estimates were approximately one order of magnitude higher than those at the dilp1 and dilp5 genes. Synonymous sites were in general more polymorphic than intronic sites. Estimates of synonymous nucleotide variation at the dilp2, 3, 4 and 7 genes were similar to the average values reported for other genes in this species (π  = 0.0134, [56]; π  = 0.0165, [57]), and one order of magnitude higher than estimates at the dilp1, dilp5 and dilp6 genes. Additionally, levels of silent polymorphism were more variable at both flanking and intergenic regions than at coding regions (varying from 0.0002 to 0.012 vs 0.001 to 0.012, respectively; tables S1 and S2). Nonsynonymous nucleotide diversity (π) was low in all genes except dilp1, which exhibited similar levels of synonymous and nonsynonymous nucleotide diversity. Indeed, DILP1 was the most variable DILP protein, with 5 amino acid polymorphisms (figure S4), 4 located in the C peptide and 1 in the B chain. The DILP6 and DILP7 proteins were the only other polymorphic proteins, each with a single polymorphic residue in the A chain.
Table 1

Nucleotide polymorphism and divergence at the four dilp gene regions.

dilp1-4 1 dilp5 dilp6 2 dilp7
No. sites
Silent3 6186.72960.22106.8334.1
Total742432012352699
S
Silent91 (77)66 (12)27 (13)13 (5)
Total96 (81)66 (12)28 (14)14 (6)
π
Silent0.0030.0090.0040.011
Total0.0030.0080.0030.005
h 1010129
Hd 1.0000.9701.0000.910
K
Silent0.0590.0510.0610.164
Total0.0520.0480.0560.073

Sample size was 12 for all regions except dilp1-4 (10).

The sequenced region consist of two fragments separated by a ∼390-bp stretch located at the first intron of the dilp6 gene.

Silent refers to variation at non-coding sites and at synonymous sites of coding regions.

S, number of segregating sites (number of singletons in parentheses); π, nucleotide diversity; h, number of haplotypes; Hd, haplotype diversity; K, nucleotide divergence.

Table 2

Nucleotide polymorphism and divergence at the seven dilp genes.

dilp1 dilp2 dilp3 dilp4 dilp5 dilp6 dilp7
No. sites
Intronic073726171870170
Synonymous110.398.585.8103.180.275.8112.1
Silent1 110.3171.5157.8164.1151.21312.8282.1
Nonsynonymous351.7312.5274.2298.9240.8245.2364.9
Total4624844324633921558647
S
Intronicn. a.2 (1)2 (1)0018 (10)5 (0)
Synonymous1 (1)6 (6)6 (5)7 (6)1 (1)06 (4)
Silent1 (1)8 (7)8 (6)7 (6)1 (1)22 (11)11 (4)
Nonsynonymous5 (4)00001 (1)1 (1)
Total6 (5)8 (7)8 (6)7 (6)1 (1)23 (12)12 (5)
π
Intronicn. a.0.0080.008000.0060.010
Synonymous0.0020.0120.0160.0150.00200.012
Silent0.0020.0100.0120.0090.0010.0050.011
Nonsynonymous0.00300000.00070.0005
Total0.0030.0040.0040.0030.00040.0040.005
K
Intronicn. a.0.0820.0540.0510.0300.0620.166
Synonymous0.1160.1430.1970.1590.0940.1770.118
Silent0.1160.1160.1330.1170.0640.0710.143
Nonsynonymous0.0310.0100.0070.0200.0080.0210.010
Total0.0510.0460.0490.0530.0290.0620.059
ω 0.2700.0700.0370.1280.0890.1200.084

Silent refers to variation at non-coding sites and at synonymous sites of coding regions.

S, number of segregating sites (number of singletons in parentheses); π, nucleotide diversity; K, nucleotide divergence; ω, K/K ratio; n. a., not applicable.

Sample size was 12 for all regions except dilp1-4 (10). The sequenced region consist of two fragments separated by a ∼390-bp stretch located at the first intron of the dilp6 gene. Silent refers to variation at non-coding sites and at synonymous sites of coding regions. S, number of segregating sites (number of singletons in parentheses); π, nucleotide diversity; h, number of haplotypes; Hd, haplotype diversity; K, nucleotide divergence. Silent refers to variation at non-coding sites and at synonymous sites of coding regions. S, number of segregating sites (number of singletons in parentheses); π, nucleotide diversity; K, nucleotide divergence; ω, K/K ratio; n. a., not applicable.

Divergence between Drosophila Melanogaster and Drosophila Simulans

The sequences newly obtained for the four regions spanning the seven dilp genes in D. simulans were used to estimate nucleotide divergence between this species and D. melanogaster both for the four dilp regions (table 1) and for each dilp gene (table 2). Levels of nucleotide divergence at the dilp regions, when considering all sites and only silent sites, were similar for all regions except for the dilp7 region, which exhibited the highest level of divergence (table 1). Among genes, dilp5 showed the lowest silent divergence estimate (table 2). The estimates of synonymous divergence (K) were similar to the average values reported previously for other genes between these species (K  = 0.11, [58]; K  = 0.11, [59]; K  = 0.13, [60]) except at genes dilp5 and dilp6. At the dilp genes, nucleotide divergence was more heterogeneous at nonsynonymous sites (K) than at synonymous sites (table 2). The ratio of nonsynonymous to synonymous divergence (ω = K/K) was <1 in all cases (table 2), reflecting the action of purifying selection against nonsynonymous changes. Similarly to K estimates, the ω estimates varied among genes (table 2). Amino acid divergence between the D. melanogaster and D. simulans DILPs (with 28 amino acid replacements; figure S4) also varied among peptides. Indeed, divergence at DILP1, DILP4 and DILP6 (with 8, 6 and 5 amino acid changes, respectively) was higher than at the rest of DILPs. Moreover, most of the detected amino acid replacements (25 out of 28) were located either in the signal peptide or in the C peptide, which might be indicative of weaker purifying selection acting on these protein domains.

Polymorphism and Divergence: the HKA and MK Tests

The multilocus HKA test ([42]; J. Hey, http://lifesci.rutgers.edu/~heylab/index.html) was used to evaluate whether the levels of silent nucleotide polymorphism and divergence at the dilp genes were correlated. No significant heterogeneity in the polymorphism to divergence ratio was detected. We also conducted the MK test [43] that compares the amount of variation within and between species at two different types of sites (synonymous and nonsynonymous). No significant departure from the proportionality between polymorphism and divergence expected under the SNM was detected at any of the seven dilp genes, either when changes fixed between D. melanogaster and D. simulans or when the D. melanogaster lineage-specific fixed changes, were considered. There is no evidence, therefore, for adaptive protein evolution of the DILP proteins since the D. melanogaster and D. simulans split.

Patterns of Nucleotide Diversity: Tajima’s D and Fay and Wu’s H Test Statistics

Tajima D’s [44] and Fay and Wu H’s [45] test statistics were calculated separately for each of the four regions studied (table 3). D values were negative for 3 regions (dilp1-4, dilp6 and dilp7), whereas H values were negative for 2 (dilp1-4 and dilp5). For 3 regions (dilp5, dilp6 and dilp7), Tajima’s D and Fay and Wu’s normalized H values did not depart significantly from expectations of either the SNM or the bottleneck (BN) model, irrespective of the recombination rate estimate used. For the fourth region (dilp1-4), the Tajima test and the Fay and Wu test revealed significant departures from SNM expectations, which would indicate a significant excess of high-frequency derived variants (table 3). However, under the most realistic bottleneck scenario, only the estimated H value remained significant (table 3). According to the test results, this excess cannot be solely due to the demographic history of the population studied. Indeed, when the observed D and H values were compared to the corresponding empirical distributions obtained from multilocus surveys of variation in the same population, for either X-linked or autosomal loci ([61], unpublished data), only the H value estimated for the dilp1-4 region fell within the bottom 5% of the empirical Fay and Wu’s H distribution.
Table 3

Neutrality test of the dilp gene regions.

Tajima’s D Fay and Wu’s H
Region D valueRSNMBN H valueRSNMBN
dilp1-4 −1.72RM <0.001 0.11−3.55RM <0.001 0.01
R0.25 <0.001 0.22R0.25 <0.001 0.04
dilp5 0.87RM 0.990.68−0.23RM 0.360.94
R0.25 0.980.60R0.25 0.810.85
dilp6 −0.59RM 0.080.910.06RM 0.570.84
R0.25 0.160.93R0.25 0.730.85
dilp7 −0.71RM 0.130.480.44RM 0.780.90
R0.25 0.210.49R0.25 0.660.94

R, population recombination rate per nucleotide (see Materials and Methods); D, Tajima’s D; H, normalized Fay and Wu’s H; SNM, standard neutral model; BN, bottleneck model. Statistically significant values are indicated in bold.

R, population recombination rate per nucleotide (see Materials and Methods); D, Tajima’s D; H, normalized Fay and Wu’s H; SNM, standard neutral model; BN, bottleneck model. Statistically significant values are indicated in bold.

The Composite Likelihood Ratio (CLR) Test and the Goodness-of-fit (GOF) Test: Sweep Detection and Localization by Maximum Likelihood

A composite-likelihood method was used to distinguish selective sweeps from stochastic neutral variation [53]. The CLR test was applied separately to each of the four regions studied: the dilp1-4 gene cluster and the dilp5, dilp6 and dilp7 regions (table 4). This analysis yielded a significant better fit of variation at the dilp1-4 gene cluster to the selective sweep model than to the SNM. Variation at the dilp5 and dilp6 regions also exhibited a better fit to the selective sweep model than to the SNM but unlike at the dilp1-4 region, only under a particular recombination rate value (RM and R0.25 values, respectively). For each dilp region, the observed CLR values were additionally tested against the null distribution built from bottleneck simulations in order i) to evaluate the possibility of false positives in cases where the CLR test yielded significant results, and ii) to confirm that for those regions that conformed to the SNM predictions, the results were robust to demographic change. This analysis yielded a significant better fit of variation at the dilp1-4 gene cluster to the selective sweep model than to the bottleneck model, irrespective of the recombination rate estimate used. Also, variation at the dilp6 region showed a better fit to the selective sweep model than to the bottleneck model but only under the R0.25 recombination rate value (table 4). Estimates of the strength of selection and of the location of the putative target of selection (using the clsw program) for each the dilp1-4 and dilp6 regions were used to perform the GOF test, which allows further discrimination of false positives. Only for the dilp6 region under the R0.25 recombination value, this test fails to show a significantly better fit to the more general model than to the selective model (P-value  = 0.214).
Table 4

Composite likelihood ratio test and Goodness-of-fit test.

Region L 1 S 1 RSNMBN a X GOF
dilp1-4 753493RM 0.063 <0.001 0.007 3461.55100.035
R0.25 0.016 <0.001 0.005 716.30260.041
dilp5 326656RM 0.063 0.010 0.288n. a.n. a.n. a.
R0.25 0.0160.0510.197n. a.n. a.n. a.
dilp6a 237824RM 0.0470.2930.071n. a.n. a.n. a.
R0.25 0.012 0.008 0.001 22.762604 0.214
dilp7 71011RM 0.0580.8920.968n. a.n. a.n. a.
R0.25 0.0140.7580.952n. a.n. a.n. a.

Sites with missing or ambiguous information in the outgroup were excluded from this analysis.

L, number of nucleotides of the multiply alignment region; S, number of segregating sites; R, population recombination rate per nucleotide (in 2N units); SNM, standard neutral model; BN, bottleneck model; α strength of positive selection (in 2Ns units); X, location of the target of positive selection; GOF, goodness-of-fit test; n. a., not applicable. For the GOF test, probability values that do not support a better fit to the general alternative model than to the selective sweep model are indicated in bold.

Sites with missing or ambiguous information in the outgroup were excluded from this analysis. L, number of nucleotides of the multiply alignment region; S, number of segregating sites; R, population recombination rate per nucleotide (in 2N units); SNM, standard neutral model; BN, bottleneck model; α strength of positive selection (in 2Ns units); X, location of the target of positive selection; GOF, goodness-of-fit test; n. a., not applicable. For the GOF test, probability values that do not support a better fit to the general alternative model than to the selective sweep model are indicated in bold.

Discussion

Changes in the biotic and abiotic environment of organisms promote adaptation, i.e., evolutionary change driven by positive selection. Even if populations may be constantly exposed to environmental change, it is easy to visualize certain scenarios, like the range expansion of a species, in which a population encounters a higher than average degree of change. These scenarios can be considered candidates for the species and/or populations involved to have experienced bursts of adaptive change. It is for this reason that surveys of nucleotide variation often target derived populations. In Drosophila, the multilocus analysis of polymorphism and divergence at coding regions has revealed that ∼50% of the amino acid substitutions detected between closely related species had been driven by positive selection [62], [63]. Moreover, the comparison of coding and non-coding regions similarly has revealed that adaptive changes at non-coding regions might have been considerably common in the evolution of Drosophila melanogaster [64]. Application of the MK test [43] to the 7 dilp coding regions has provided no evidence for adaptive amino acid substitutions in any of the DILP peptides since the split of the D. melanogaster and D. simulans lineages. A similar study on the insulin receptor, a transmembrane receptor with an extracellular part that binds insulin and a cytosolic part with signal-transduction capacities, was previously conducted [25]. Indeed, this study yielded a negative result for the extracellular part of the receptor, but not for its cytosolic part, which together with the present result would indicate that selection had not favoured changes in the ligand-receptor (DILP-InR) interaction but on the signal-transduction capacity of the receptor upon its activation by the ligand. Drosophila melanogaster is a cosmopolitan species that originated in central Africa and later expanded its distribution area worldwide [65]. European populations of this species are non-stationary derived populations, as confirmed by multilocus analysis of variation at non-coding regions [31], [66], [67]. These surveys revealed that a simple bottleneck scenario could explain, despite not completely, the pattern of variation detected at these regions [31], [48], [49], [66], [67], but see also [68], [69]. Moreover, they provided estimates for the parameters of the proposed bottleneck model, which can thereafter be used in hypothesis testing. Indeed, this approach has already led to the identification of a few regions that were the targets of recent selective events [70]–[74]. The present survey of polymorphism at the regions encompassing the dilp genes in D. melanogaster has provided some evidence for the recent action of positive selection at or near some of these genes, more specifically at the dilp1-4 and dilp6 regions. Indeed, the pattern of variation at the dilp1-4 region exhibited a significant excess of high-frequency derived variants (as indicated by the highly negative H value) relative not only to expectations of the SNM and the more realistic bottleneck model (table 3) but also when compared to the corresponding empirical distributions of the H test statistic obtained from multilocus surveys of variation in the same population ([31], unpublished data). Although the Kim and Stephan test [53] also yielded a significant result for this region when using bottleneck simulations, this result was not clearly supported by the GOF test. In contrast to these results for the dilp1-4 region, the frequency spectrum at the dilp6 region did not depart from bottleneck expectations, whereas the GOF test for this region clearly supported the significant result of the Kim and Stephan test under an intermediate level of recombination (but not under the higher level estimated from the genetic map). The dependency of the Kim and Stephan test result on the level of recombination clearly points to the need for accurate estimates of this parameter and therefore for new experimental efforts using a large and dense set of markers to obtain fine-scale genetic maps [75], [76]. There is also some degree of uncertainty in the effect of the particular bottleneck scenario considered on our conclusions [77]. In summary, present estimates of the level and pattern of polymorphism at the dilp genes do not provide strong evidence for recent adaptive changes either in the genes themselves or in their vicinity, although the dilp1-4 and dilp6 regions stand out as likely affected by such events. It is worth noting that our population-genetic analysis has unveiled the footprint of positive selection at the dilp1-4 cluster region and the dilp6 region. These regions encompass four of the genes (dilp1, dilp2, dilp3 and dilp6) that are involved in establishing adult body size by promoting growth at the larval and pupal stages, with one of them (dilp3) also involved in the response to nutritional changes [29]. The signals detected at the dilp6 gene and at the dilp1-4 cluster might reflect that gene dilp6 and at least one of the genes in the cluster might have been the target of selection acting on their distinct functional roles. Selection might have also acted on DILP copies with partially redundant functions as a way to downplay stochastic variations in DILP synthesis or secretion in response to varying external conditions [78]. The out-of-Africa expansion of D. melanogaster exposed the colonizing populations to new environmental physical conditions as well as to new food sources. Selective pressures resulting from the flies’ exposure to these new environments led to many adaptive changes, among which those in adult body size and response to nutritional conditions might have targeted genes in the insulin-signaling pathway. (A) Genomic organization of the dilp1-4 gene region of D. melanogaster. Genomic DNA is represented by a line. The black arrow head points to the centromere. In genes, arrows indicate the direction of transcription. Colored boxes indicate exons of dilp genes. Introns are represented by a V symbol. (B) Nucleotide polymorphism at the dilp1-4 gene region of D. melanogaster. The last row shows nucleotide information present in D. simulans for each polymorphic site detected in D. melanogaster. *, nonsynonymous polymorphism. Dots indicate nucleotide variants identical to the first sequence and dashes indicate gaps. d, deletion; i, insertion; E, exon. (PDF) Click here for additional data file. (A) Genomic organization of the dilp5 gene region of D. melanogaster. Genomic DNA is represented by a line. The black arrow head points to the centromere. In gene, arrow indicates the direction of transcription. Colored boxes indicate exons of dilp5 gene. Intron are represented by a V symbol. (B) Nucleotide polymorphism at the dilp 5 gene region of D. melanogaster. The last row shows nucleotide information present in D. simulans for each polymorphic site detected in D. melanogaster. Dots indicate nucleotide variants identical to the first sequence and dashes indicate gaps. d, deletion; i, insertion; E, exon. (PDF) Click here for additional data file. (A) Genomic organization of the dilp6 and dilp7 gene regions of D. melanogaster. Genomic DNA is represented by a line. The black arrow head points to the centromere. In genes, arrows indicate the direction of transcription. Colored boxes indicate exons of dilp genes Introns are represented by a V symbol. (B) Nucleotide polymorphism at the dilp6 and dilp7 gene regions of D. melanogaster. The last row shows nucleotide information present in D. simulans for each polymorphic site detected in D. melanogaster. *, nonsynonymous polymorphism. Dots indicate nucleotide variants identical to the first sequence and dashes indicate gaps. i, insertion; E, exon. (PDF) Click here for additional data file. (A) Schematic representation of the predicted structure of the DILP1-5 proteins of D. melanogaster. SP, signal peptide; B, B chain; C, C peptide; A, A chain. The active peptide chains are denoted by colors. (B) Amino acid polymorphism in D. melanogaster and amino acid replacements between D. melanogaster and D. simulans at the DILP1-5 proteins. The last row shows the amino acid present in D. simulans for each polymorphic site detected in D. melanogaster and also for the sites with fixed differences between species. Dots indicate amino acid variants identical to the first sequence and dashes indicate deletions. (PDF) Click here for additional data file. (A) Schematic representation of the predicted structure of the DILP6 and DILP7 proteins of D. melanogaster. SP, signal peptide; B, B chain; C, C peptide; A, A chain. The active peptide chains are denoted by colors. (B) Amino acid polymorphism in D. melanogaster and amino acid replacements between D. melanogaster and D. simulans at the DILP6 and DILP7 proteins. The last row shows the amino acid present in D. simulans for each polymorphic site detected in D. melanogaster and also for the sites with fixed differences between species. Dots indicate amino acid variants identical to the first sequence and dashes indicate deletions. (PDF) Click here for additional data file. Nucleotide polymorphism and divergence at the autosomal dilp1-4 and dilp5 gene regions. (PDF) Click here for additional data file. Nucleotide polymorphism and divergence at the X-linked dilp6 and dilp7 gene regions. (PDF) Click here for additional data file.
  64 in total

1.  On the number of segregating sites in genetical models without recombination.

Authors:  G A Watterson
Journal:  Theor Popul Biol       Date:  1975-04       Impact factor: 1.570

2.  Demography and natural selection have shaped genetic variation in Drosophila melanogaster: a multi-locus approach.

Authors:  Sascha Glinka; Lino Ometto; Sylvain Mousset; Wolfgang Stephan; David De Lorenzo
Journal:  Genetics       Date:  2003-11       Impact factor: 4.562

Review 3.  Why size matters: altering cell size.

Authors:  Leslie J Saucedo; Bruce A Edgar
Journal:  Curr Opin Genet Dev       Date:  2002-10       Impact factor: 5.578

4.  Inferring the effects of demography and selection on Drosophila melanogaster populations from a chromosome-wide scan of DNA variation.

Authors:  Lino Ometto; Sascha Glinka; David De Lorenzo; Wolfgang Stephan
Journal:  Mol Biol Evol       Date:  2005-06-29       Impact factor: 16.240

5.  Statistical tests for detecting positive selection by utilizing high-frequency variants.

Authors:  Kai Zeng; Yun-Xin Fu; Suhua Shi; Chung-I Wu
Journal:  Genetics       Date:  2006-09-01       Impact factor: 4.562

6.  Statistical method for testing the neutral mutation hypothesis by DNA polymorphism.

Authors:  F Tajima
Journal:  Genetics       Date:  1989-11       Impact factor: 4.562

7.  Multilocus patterns of nucleotide variability and the demographic and selection history of Drosophila melanogaster populations.

Authors:  Penelope R Haddrill; Kevin R Thornton; Brian Charlesworth; Peter Andolfatto
Journal:  Genome Res       Date:  2005-06       Impact factor: 9.043

8.  Evidence of gene conversion associated with a selective sweep in Drosophila melanogaster.

Authors:  Sascha Glinka; David De Lorenzo; Wolfgang Stephan
Journal:  Mol Biol Evol       Date:  2006-07-25       Impact factor: 16.240

9.  Uncovering the footprint of positive selection on the X chromosome of Drosophila melanogaster.

Authors:  Dorcas J Orengo; Montserrat Aguadé
Journal:  Mol Biol Evol       Date:  2010-01       Impact factor: 16.240

10.  Heritability of two morphological characters within and among natural populations of Drosophila melanogaster.

Authors:  J A Coyne; E Beecham
Journal:  Genetics       Date:  1987-12       Impact factor: 4.562

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.