Literature DB >> 26297726

Similar Efficacies of Selection Shape Mitochondrial and Nuclear Genes in Both Drosophila melanogaster and Homo sapiens.

Brandon S Cooper1, Chad R Burrus2, Chao Ji2, Matthew W Hahn3, Kristi L Montooth4.   

Abstract

Deleterious mutations contribute to polymorphism even when selection effectively prevents their fixation. The efficacy of selection in removing deleterious mitochondrial mutations from populations depends on the effective population size (Ne) of the mitochondrial DNA and the degree to which a lack of recombination magnifies the effects of linked selection. Using complete mitochondrial genomes from Drosophila melanogaster and nuclear data available from the same samples, we reexamine the hypothesis that nonrecombining animal mitochondrial DNA harbor an excess of deleterious polymorphisms relative to the nuclear genome. We find no evidence of recombination in the mitochondrial genome, and the much-reduced level of mitochondrial synonymous polymorphism relative to nuclear genes is consistent with a reduction in Ne. Nevertheless, we find that the neutrality index, a measure of the excess of nonsynonymous polymorphism relative to the neutral expectation, is only weakly significantly different between mitochondrial and nuclear loci. This difference is likely the result of the larger proportion of beneficial mutations in X-linked relative to autosomal loci, and we find little to no difference between mitochondrial and autosomal neutrality indices. Reanalysis of published data from Homo sapiens reveals a similar lack of a difference between the two genomes, although previous studies have suggested a strong difference in both species. Thus, despite a smaller Ne, mitochondrial loci of both flies and humans appear to experience similar efficacies of purifying selection as do loci in the recombining nuclear genome.
Copyright © 2015 Cooper et al.

Entities:  

Keywords:  cytoplasmic sweep; mtDNA; neutrality index; tests of selection

Mesh:

Substances:

Year:  2015        PMID: 26297726      PMCID: PMC4592998          DOI: 10.1534/g3.114.016493

Source DB:  PubMed          Journal:  G3 (Bethesda)        ISSN: 2160-1836            Impact factor:   3.154


The effective size of a population (N) impacts how effectively selection removes deleterious mutations and fixes advantageous mutations. The unique genetics of the mitochondrial genome (mitochondrial DNA; mtDNA) are thought to reduce its N relative to the nuclear genome, via haploid, uniparental inheritance, the mitochondrial bottleneck in the maternal germline, and a lack of recombination that decreases N via selection on linked sites (Hill and Robertson 1966; Maynard Smith and Haigh 1974; Gillespie 2000; Meiklejohn ; White ; Charlesworth 2012). In addition, cytoplasmic transmission can link the mtDNA to selfish cytoplasmic elements (e.g., Wolbachia in insects) that may sweep through populations, further decreasing mitochondrial N and possibly increasing mitochondrial substitution rates via the fixation of slightly deleterious mutations (Shoemaker ). For these reasons it has been widely hypothesized that selection is less effective in mitochondrial genomes than in their nuclear counterparts and that mitochondrial genomes may accumulate greater numbers of deleterious substitutions (Lynch 1996, 1997). Analyses of sequence data in Drosophila and mammals have largely supported the conclusion that mtDNA harbors significant levels of slightly deleterious polymorphism (Ballard and Kreitman 1994; Rand and Kann 1996; Nachman 1998; Rand and Kann 1998; Weinreich and Rand 2000). N is not the only evolutionary parameter that distinguishes mitochondrial and nuclear genomes. The distinct functional landscape of the mitochondrial genome likely affects the distribution of selective effects (s) of mutations that arise in this genome. Animal mitochondrial genomes typically encode regulatory information for replication and transcription nested within a hypervariable region (also known as the D-loop, control, or A+T-rich region), 22 transfer RNAs (tRNAs), two ribosomal components, and 13 protein-coding genes—all core components of oxidative phosphorylation (OXPHOS). Outside of the hypervariable region, there is little noncoding DNA in animal mtDNAs. In Drosophilids, 99% of the genome outside of the hypervariable region encodes DNA and RNA genes with highly conserved sequences that function in mitochondrial protein synthesis and aerobic respiration (Clary and Wolstenholme 1985; Wolstenholme and Clary 1985; Ballard 2000; Montooth ), which suggests that the distribution of selective effects in the mtDNA may be shifted toward larger negative effects on fitness. The mutational landscape of the mtDNA also differs from the nuclear genome. In most animal taxa, the mitochondrial mutation rate greatly exceeds that of the nuclear genome (Lynch ), and the mitochondrial mutational process is also highly biased (Montooth and Rand 2008). For example, nearly all mitochondrial mutations in D. melanogaster change a G:C base pair to an A:T (Haag-Liautard ). When combined with the strong A+T-bias in this mitochondrial genome, where 95% of third codon positions are an A or a T (Montooth ), this indicates that the most commonly occurring mutations in protein-coding loci of the Drosophila mtDNA will change an amino acid. Relative to the nuclear genome, animal mitochondrial genomes thus experience a greater mutational pressure that can also be biased in some taxa toward nonsynonymous mutations; these are likely to have deleterious effects in a molecule that encodes such highly conserved functions. Some of the strongest population genetic patterns in support of distinct selective pressures acting on mitochondrial and nuclear genomes come from analyses of the neutrality index (NI) (Rand and Kann 1996; Nachman 1998; Rand and Kann 1998; Weinreich and Rand 2000). NI is a summary statistic of the deviation from the neutral expectation in the McDonald-Kreitman (MK) test (McDonald and Kreitman 1991; Rand and Kann 1996) and is calculated from counts of synonymous and nonsynonymous polymorphic and divergent sites within and between related species. Weakly deleterious nonsynonymous mutations that segregate in the population, but that will not contribute to divergence, lead to a value of NI greater than 1. When the efficacy of selection is decreased, the expectation is that the number of segregating weakly deleterious polymorphisms will increase; this is the pattern that has been observed in mtDNA. Meta-analyses of MK tables and their associated NI values for mitochondrial and nuclear loci in animals have concluded that NI is predominantly greater than the neutral expectation of 1 for the mtDNA (Rand and Kann 1996; Nachman 1998; Rand and Kann 1998; Betancourt ) and exceeds the average NI of the nuclear genome (Weinreich and Rand 2000). Although the relative sparseness of the data was recognized early on (Nachman 1998), and the conclusions were largely limited to how selection shapes animal mtDNA, these patterns are often taken as evidence that selection is largely ineffective in the mtDNA because of its reduced N and that mitochondrial genomes are expected to harbor more deleterious polymorphisms than do their nuclear counterparts. Here we revisit this pattern using new, complete mitochondrial genomes from D. melanogaster that we compare with published nuclear data from the same samples (Langley ) and with available human data from both mitochondrial and nuclear genomes (Bustamante ; Just ; Rubino ). We find little evidence that the effects of purifying selection differ on average between mitochondrial and nuclear genomes within flies or within humans, despite evidence that there is much reduced N due to a lack of recombination and linkage with the cytoplasm. We discuss reasons why NI is, on average, similar between mitochondrial and nuclear loci, despite the distinct population genetic properties of these two genomes.

Materials and Methods

D. melanogaster mtDNA assembly, annotation, and estimates of sequence diversity

Raw sequence read files from 38 genetic lines of D. melanogaster from Raleigh, North Carolina (Mackay ), sequenced by the 50 Genomes subproject of the Drosophila Population Genomics Project (Langley ) were downloaded from the National Center for Biotechnology Information Sequence Read Archive. We used the Burrows-Wheeler Aligner, and specifically the fast and accurate short read alignment with Burrows-Wheeler Transform (Li and Durbin 2009), to map sequence reads to the D. melanogaster mitochondrial reference genome (NC_001709). We allowed up to five gaps, five gap extensions, and five mismatches per aligned read, but few reads needed such flexibility and most were filtered out in later steps. Using SAMtools, we postprocessed the alignments to filter out low-quality alignments and to detect single-nucleotide polymorphisms (SNPs) (Li ). SNPs with a quality score greater than 20 and indels with a quality score greater than 50 were kept for further analyses. We then generated a consensus sequence for each of the D. melanogaster mtDNAs listed in Supporting Information, Table S1. Because of the high variance in coverage across the hypervariable region, we did not include this region in our final assemblies or analyses. We annotated the consensus sequence for each mtDNA using the GenBank annotation of the D. melanogaster reference sequence (NC_001709). Using ClustalW (Larkin ), we performed a whole-genome alignment, as well as gene-specific alignments, of each consensus sequence to the reference sequence and to the outgroup species Drosophila yakuba (NC_001322). There are very few indels in the protein-coding regions of Drosophila mtDNA (Montooth ), making alignment straightforward. From these alignments we calculated expected heterozygosity (π), the number of segregating sites (S), and Watterson’s θ (Watterson 1975) as measures of sequence diversity. The mitochondrial haplotype network was inferred from 80 segregating sites in the coding region of the mtDNA for which there were no missing or ambiguous data using TCS version 1.21 (Clement ).

Tests for recombination in the D. melanogaster mtDNA

We estimated linkage disequilibrium (LD) between all pairs of mitochondrial SNPs using the statistic D′ (Lewontin 1964), where D′ = 0 indicates no LD and |D′| = 1 indicates perfect LD. Because recombination erodes LD as a function of distance, a negative correlation between |D′| and genetic distance between pairs of SNPs has been used as evidence for recombination in mtDNA (Awadalla ). To test this prediction, we looked for significant negative correlations between |D′| and genetic distance. We also conducted these same tests using another statistical measure of association, r (Hill and Robertson 1966), which is more robust to variation in mutation rates (Awadalla ; Meunier and Eyre-Walker 2001; Innan and Nordborg 2002). We calculated these correlations by using a variety of minor allele frequency cutoffs. We also tested for the presence of all four genotypes at pairs of SNPs (the “four-gamete test”; Hudson and Kaplan 1985) using DNAsp version 5 (Rozas ).

Neutrality tests

Using π and θ, we calculated Tajima’s D (Tajima 1989), which is expected to be 0 for a neutrally evolving locus. Demographic effects will skew the site-frequency spectrum of both synonymous and nonsynonymous polymorphisms at a locus. Contrasting Tajima’s D between nonsynonymous and synonymous polymorphisms therefore tests whether nonsynonymous alleles experience a greater skew in frequency relative to putatively neutral synonymous alleles, indicative of selection (Rand and Kann 1996). We implemented this analysis using the heterogeneity test (Hahn ), which simulates 10,000 genealogies with no recombination by using the values of synonymous and nonsynonymous S calculated from the data and compares the estimated difference in Tajima’s D to the random distribution of differences between synonymous and nonsynonymous polymorphisms. We calculated several other summaries of the site-frequency spectrum, including Fu and Li’s D, which characterizes the proportion of mutations on external and internal branches of a genealogy (Fu and Li 1993) and Fay and Wu’s H, which tests for an excess of high-frequency, derived alleles in a sample relative to the neutral expectation (Fay and Wu 2000). These latter statistics were calculated using a set of 80 segregating sites in the coding region of the mtDNA for which there were no missing or ambiguous data. Significance was determined using 10,000 coalescent simulations as implemented in DNAsp version 5 (Rozas ). We constructed MK (McDonald and Kreitman 1991) two-by-two contingency tables of counts of nonsynonymous and synonymous polymorphisms (P and P) within D. melanogaster and nonsynonymous and synonymous fixed differences (D and D) between D. melanogaster and either D. yakuba or Drosophila simulans. Polymorphic sites within D. melanogaster only contributed to fixed differences if the allele in the outgroup sequence was not present in D. melanogaster. We tested for significant deviations from neutrality by using the Fisher’s exact tests of the MK table in R version 2.15.1 (R Core Team 2012). We calculated NI—the ratio of P to D—as a summary statistic of the MK table (Rand and Kann 1996). Assuming that selection is constant, the neutral expectation is that D will equal P (Kimura 1983; McDonald and Kreitman 1991), and NI is expected to be 1. When calculating NI for any gene with a count of 0 in any cell of the MK table, we added a count of 1 to all cells (Sheldahl ; Presgraves 2005). Twenty-three percent of 13 D. melanogaster mitochondrial genes, 9.5% of 6113 D. melanogaster nuclear genes, 0% of 13 H. sapiens mitochondrial genes, and 73% of 11,624 H. sapiens nuclear genes required these additional counts. If the MK test is significant, an NI value less than 1 indicates a significant excess of nonsynonymous fixed differences, whereas an NI value greater than 1 indicates a significant excess of nonsynonymous polymorphisms. We also calculated , as in Presgraves (2005), the sign of which is more intuitive; negative values are consistent with an excess of weakly deleterious (negatively selected) polymorphisms and positive values are consistent with an excess of advantageous (positively selected) substitutions. The short length and low D values for mitochondrial genes upwardly biases NI (Weinreich and Rand 2000), and we initially used D. yakuba as the outgroup to increase the amount of divergence. However, using D. yakuba to calculate divergence also increases the potential for multiple substitutions at silent sites. Because of this, we constructed MK tables for the 13 mitochondrial protein-coding genes using a range of taxa and methods that capture different amounts of sequence divergence (Table S2 and Table S3). We used both D. simulans and D. yakuba to polarize changes on the branch leading to D. melanogaster, which resulted in very few nonsynonymous substitutions and highly variable NI values (Table S2). We also used either D. simulans or D. yakuba singly to calculate divergence with D. melanogaster in two ways. The “more-inclusive” approach included codons that were missing data in some lines, and averaged across all possible mutational pathways between codons with multiple substitutions to estimate D and D (Nei and Gojobori 1986). The “less-inclusive” method omitted codons with any missing data, omitted mtDNA SRX022291 (which contained more missing data than any other mtDNA) and calculated divergence using the mutational path that minimized D between codons with multiple substitutions. Unless otherwise noted, the more-inclusive method using D. yakuba as outgroup is presented and discussed. In addition to using counts for single genes, we also analyzed MK tables of the summed counts of polymorphic and divergent sites for each of the mitochondrial-encoded OXPHOS complexes: Complex I (NADH dehydrogenase, ND), Complex IV (cytochrome c oxidase), and Complex V (ATP synthase). Cytochrome B is the only Complex III gene encoded by the mtDNA. Stoletzki and Eyre-Walker (2011) emphasize that contingency data generally should not be summed, particularly when there is heterogeneity among contingency tables, and they provide an unbiased estimator of overall NI for combining counts across genes, . We calculated this statistic and used the DoFE software package (Stoletzki and Eyre-Walker 2011) to calculate bootstrap confidence intervals and to conduct Woolf’s tests of homogeneity (Woolf 1955). The only data set with significant heterogeneity was the D. melanogaster nuclear gene set (P < 0.0001). Similar statistics were used to analyze polymorphism and divergence in the human data sets, as well as in a subset of the mitochondrial haplotypes reported in our study that were independently sequenced and assembled by Richardson (Table S4).

Comparisons of mitochondrial and nuclear NI in flies and humans

To compare patterns of polymorphism and divergence between mitochondrial and nuclear genomes, we obtained existing data for nuclear genes in D. melanogaster and for nuclear and mitochondrial genes in Homo sapiens. Counts of polymorphism and divergence for D. melanogaster nuclear genes were obtained from the Drosophila Population Genomics Project analysis of the same 38 genomes from Raleigh, North Carolina, with divergence polarized along the D. melanogaster lineage and using the mutational path that minimized D between codons with multiple substitutions (Langley ). The human nuclear data from Bustamante included counts of polymorphic and divergent sites from 19 African Americans and 20 European Americans, using the chimpanzee Pan troglodytes as an outgroup. We calculated the number of polymorphic and divergent sites for human mitochondrial genes by using mtDNA sequences from 19 African Americans (Just ), 20 European Americans (Rubino ), and the chimpanzee mitochondrial reference genome D38113.1 (Horai ) using the “more-inclusive” method described previously. We also analyzed subsets of the human mitochondrial data to illustrate the sensitivity of NI to sampling (Table S5 and Table S6). Comparisons of the distributions of NI and Z* between gene sets were performed using Mann−Whitney U-tests in R version 2.15.1 (R Core Team 2012).

Data availability

File S1 contains the 38 D. melanogaster assembled mtDNA genomes used in this study aligned with D. yakuba (NC_001322). Human mtDNA sequence data are available in GenBank, and the accession numbers are listed in Table S6.

Results

An excess of low- and high-frequency, derived mitochondrial polymorphisms

We assembled 14,916 bp of sequence containing the transcribed regions of the mtDNA with a median coverage of 32x for 38 genetic lines sampled from a single population of D. melanogaster in Raleigh, North Carolina (Langley ; Mackay ) (Table S1). More than 98% of these nucleotides encode the 13 protein-coding genes, 22 tRNAs, and two ribosomal RNAs. The per-site expected heterozygosity in this region (π) of the mtDNA was 0.0008. We identified 137 segregating sites in this population sample, 103 of which were in protein-coding genes. Median heterozygosity in protein-coding genes was 0.0023 per synonymous site and 0.0002 per nonsynonymous site. Silent site heterozygosity was significantly lower in mitochondrial genes relative to nuclear genes (Mann−Whitney U, mtDNA vs. X chromosome, P = 0.00002; mtDNA vs. autosomes, P < 0.00001) and was only 0.16 times that of the autosomes (Figure 1A), lower than what is expected if the mtDNA has an effective population size that is one-quarter that of the autosomes.
Figure 1

Effect of genomic location on silent-site heterozygosity and on summary statistics of polymorphism and divergence in D. melanogaster. (A) Genomic location has a significant effect on per-site silent-site heterozygosity (P < 0.001 for all pairwise contrasts), consistent with predicted differences in the effective population size (N). The ratio of median mitochondrial to autosomal silent site heterozygosity was 0.157, less than predicted for neutral sites if mitochondrial N is one quarter that of the autosomes. mitochondrial DNA (mtDNA), X-chromosome, and autosome data sets contained 12, 1255, and 8073 genes, respectively. (B, C) Distributions of neutrality index (NI) and Z* are similar between mitochondrial and autosomal (abbreviated as A) genes, with moderately significant differences between mitochondrial and X-linked (abbreviated as X) genes. The four mtDNA boxes represent estimates from the corresponding MK tables in Table S2, B−E that used either D. simulans (Table S2, B−C) or D. yakuba (Table S2, D−E) as the outgroup. Dashed lines represent the neutral expectations for these statistics. Three nuclear loci for which NI exceeded 50 were excluded from (B) to improve visualization. Statistical results are presented in Table S2. mtDNA, X-chromosome, and autosome data sets contained 13, 712, and 5401 genes, respectively.

Effect of genomic location on silent-site heterozygosity and on summary statistics of polymorphism and divergence in D. melanogaster. (A) Genomic location has a significant effect on per-site silent-site heterozygosity (P < 0.001 for all pairwise contrasts), consistent with predicted differences in the effective population size (N). The ratio of median mitochondrial to autosomal silent site heterozygosity was 0.157, less than predicted for neutral sites if mitochondrial N is one quarter that of the autosomes. mitochondrial DNA (mtDNA), X-chromosome, and autosome data sets contained 12, 1255, and 8073 genes, respectively. (B, C) Distributions of neutrality index (NI) and Z* are similar between mitochondrial and autosomal (abbreviated as A) genes, with moderately significant differences between mitochondrial and X-linked (abbreviated as X) genes. The four mtDNA boxes represent estimates from the corresponding MK tables in Table S2, B−E that used either D. simulans (Table S2, B−C) or D. yakuba (Table S2, D−E) as the outgroup. Dashed lines represent the neutral expectations for these statistics. Three nuclear loci for which NI exceeded 50 were excluded from (B) to improve visualization. Statistical results are presented in Table S2. mtDNA, X-chromosome, and autosome data sets contained 13, 712, and 5401 genes, respectively. In addition to the scarcity of segregating sites in the D. melanogaster mtDNA, polymorphisms at these sites were skewed toward low frequencies (Figure 2, A and B), as evidenced by consistently negative values of Tajima’s D (Table 1). Tajima’s D across the mtDNA was −2.607 and differed significantly from the neutral expectation of 0 (S = 80 for coding sites with no missing data, P < 0.0001), as did Fu and Li’s D (D = −2.67, P < 0.05). The minor allele frequency for unpolarized synonymous polymorphisms was always less than 11%, and all but one of the nonsynonymous polymorphisms were singletons (Figure 2, A and B). Using D. yakuba as an outgroup revealed that the derived allele was nearly fixed for 44% of segregating synonymous sites, whereas there was only a single derived nonsynonymous polymorphism at high frequency (Figure 2, C and D). Using both D. simulans and D. yakuba to polarize mutations did not qualitatively change this result (Figure S1). Thus, the mitochondrial genome was essentially devoid of intermediate-frequency polymorphisms, with derived synonymous mutations at either very high (greater than 89%) or very low (less than 11%) frequencies and nearly all derived nonsynonymous polymorphisms at frequencies less than 5%. This skew toward high-frequency, derived alleles resulted in a significant negative value of Fay and Wu’s H statistic (H = −41.2, P = 0.005).
Figure 2

Site-frequency spectra of synonymous and nonsynonymous polymorphisms in the D. melanogaster mitochondrial DNA. (A, B) Folded site-frequency spectra for synonymous and nonsynonymous segregating sites across the mitochondrial protein-coding region reveal that mitochondrial polymorphisms are skewed to low frequencies. (C, D) Unfolded site-frequency spectra reveal that derived, synonymous polymorphisms are almost equally likely to be at low frequency (56% of 59 sites at frequencies less than 0.11) or nearly fixed (44% of 59 sites at frequencies greater than 0.89), while derived, nonsynonymous polymorphisms are nearly always present as singletons (94% of 32 sites). There are essentially no mitochondrial polymorphisms at intermediate frequencies. Sites were omitted from the unfolded site frequency spectra if neither allelic state was shared with D. yakuba. The number of sites included in each distribution is 67 (A), 35 (B), 59 (C), and 32 (D).

Table 1

Synonymous and nonsynonymous variation in D. melanogaster mitochondrial genes and OXPHOS complexes

Gene/ComplexaSynonymous SitesbNonsynonymous Sites
n (bp)SπӨWDn (bp)SπӨWD
COI347161.573.81−1.911183000Undef
COII14250.311.19−1.9053910.050.24−1.13
COIII169.3390.722.14−1.96613.6710.050.24−1.13
ND119430.160.71−1.7273930.160.71−1.72
ND219850.411.19−1.6982260.341.43−2.07
ND36810.050.24−1.1228010.050.24−1.12
ND4273.3370.511.67−1.951061.6770.381.67−2.17
ND4L5610.050.24−1.1322920.110.48−1.49
ND5351100.682.38−2.16136540.210.95−1.88
ND6103.3340.260.95−1.75415.6720.110.48−1.49
Complex I1243.66312.127.38−2.484912.34251.355.95−2.64
Complex III23950.551.19−1.3889220.160.48−1.29
Complex IV658.33302.607.14−2.212335.6720.110.48−1.49
Complex V174.6720.160.71−1.72650.3360.321.43−2.10
Total2315.66685.4316.42−2.448790.34351.938.33−2.70

OXPHOS, oxidative phosphorylation; Undef, undefined; CO, cytochrome c oxidase; ND, NADH dehydrogenase; ATPase, ATP synthase.

Complex I (ND), 7 loci; Complex III (Cytochrome B), 1 locus; Complex IV (CO), 3 loci; Complex V (ATPase), 2 overlapping loci.

For synonymous and nonsynonymous sites, we calculated the number of segregating sites (S), heterozygosity (π), Watterson’s Ө, and Tajima’s D. The heterogeneity test for differences between synonymous and nonsynonymous D was never significant (P > 0.35 for all comparisons).

Site-frequency spectra of synonymous and nonsynonymous polymorphisms in the D. melanogaster mitochondrial DNA. (A, B) Folded site-frequency spectra for synonymous and nonsynonymous segregating sites across the mitochondrial protein-coding region reveal that mitochondrial polymorphisms are skewed to low frequencies. (C, D) Unfolded site-frequency spectra reveal that derived, synonymous polymorphisms are almost equally likely to be at low frequency (56% of 59 sites at frequencies less than 0.11) or nearly fixed (44% of 59 sites at frequencies greater than 0.89), while derived, nonsynonymous polymorphisms are nearly always present as singletons (94% of 32 sites). There are essentially no mitochondrial polymorphisms at intermediate frequencies. Sites were omitted from the unfolded site frequency spectra if neither allelic state was shared with D. yakuba. The number of sites included in each distribution is 67 (A), 35 (B), 59 (C), and 32 (D). OXPHOS, oxidative phosphorylation; Undef, undefined; CO, cytochrome c oxidase; ND, NADH dehydrogenase; ATPase, ATP synthase. Complex I (ND), 7 loci; Complex III (Cytochrome B), 1 locus; Complex IV (CO), 3 loci; Complex V (ATPase), 2 overlapping loci. For synonymous and nonsynonymous sites, we calculated the number of segregating sites (S), heterozygosity (π), Watterson’s Ө, and Tajima’s D. The heterogeneity test for differences between synonymous and nonsynonymous D was never significant (P > 0.35 for all comparisons).

A partial sweep in the D. melanogaster mtDNA

The large fraction of derived alleles at high frequencies is a consequence of the haplotype structure of this sample (Figure 3). Nearly 30% of individuals in this population shared an identical mitochondrial haplotype, and an additional 66% of individuals differed from this haplotype by only one to five mutations. The two remaining haplotypes (RAL-639 and RAL-335) were highly divergent from this common haplotype group, contributing nearly half of the segregating sites to the population sample. These two haplotypes shared the ancestral state with D. yakuba at 17 of the 23 derived high-frequency synonymous polymorphisms (i.e., they have the low-frequency ancestral allele). When these two haplotypes were removed from the analysis, there remained a strong skew toward rare alleles (Tajima’s D = −2.31, P < 0.01; Fu and Li’s D = −3.14, P < 0.02), but Fay and Wu’s H, which is sensitive to the number of high-frequency derived alleles, was only weakly significant (H = −10.34, P = 0.043). The remaining six derived, high-frequency synonymous polymorphisms, as well as the single derived, high-frequency nonsynonymous polymorphism, were the result of single mtDNAs within the common haplotype group having the same allelic state as D. yakuba. Given the lack of recombination in the mtDNA, these are likely new, rather than ancestral, mutations. Six of these seven mutations would have changed a C or G to a T or A, consistent with the mutation bias in the D. melanogaster mtDNA (Haag-Liautard ).
Figure 3

Haplotype network for 38 D. melanogaster mitochondrial DNAs (mtDNAs) sampled from Raleigh, North Carolina. The network, inferred from 80 coding region single-nucleotide polymorphisms (SNPs) with no missing information, reveals that nearly 30% of individuals sampled (11/38) share the same common haplotype (red) and an additional 65% of individuals carry a haplotype only a few mutations away from this haplotype. This common set of mitochondrial haplotypes is highly diverged from the two other mtDNAs sampled in the population; lines RAL-639 and RAL-335 differ from the common haplotype at 14 and 34 SNPs, respectively. At least one of these two haplotypes carries the ancestral state (shared with D. yakuba) at 38% of these SNPs. Numbers represent the Raleigh line carrying the haplotype. Red, yellow, blue, and white nodes were present in 11, 3, 2, and 1 lines, respectively.

Haplotype network for 38 D. melanogaster mitochondrial DNAs (mtDNAs) sampled from Raleigh, North Carolina. The network, inferred from 80 coding region single-nucleotide polymorphisms (SNPs) with no missing information, reveals that nearly 30% of individuals sampled (11/38) share the same common haplotype (red) and an additional 65% of individuals carry a haplotype only a few mutations away from this haplotype. This common set of mitochondrial haplotypes is highly diverged from the two other mtDNAs sampled in the population; lines RAL-639 and RAL-335 differ from the common haplotype at 14 and 34 SNPs, respectively. At least one of these two haplotypes carries the ancestral state (shared with D. yakuba) at 38% of these SNPs. Numbers represent the Raleigh line carrying the haplotype. Red, yellow, blue, and white nodes were present in 11, 3, 2, and 1 lines, respectively.

No evidence for recombination in the D. melanogaster mtDNA

We tested for a negative correlation between LD and the distance between each pair of polymorphic sites in the D. melanogaster mitochondrial genome, as a signature of the decay of LD over distance via recombination (Awadalla ). There was no evidence for a decrease in LD with increasing distance between sites, regardless of the measure of LD or the minor allele cutoff used (Table S7). There were no pairs of polymorphic sites for which all four gametes were present (Hudson and Kaplan 1985; Bruen ), further supporting an absence of effective recombination.

Weakly deleterious polymorphism in the D. melanogaster mtDNA

The skew in the site-frequency spectrum toward rare alleles (Figure 2) resulted in negative values of Tajima’s D across the entire mtDNA (Table 1). However, there was no evidence that the skew toward rare alleles differed between synonymous and nonsynonymous polymorphisms (Figure 2, A and B)—heterogeneity tests (Hahn ) of Tajima’s D between synonymous and nonsynonymous sites were never significant (P > 0.35 for all genes and complexes). However, unfolding the site frequency spectra revealed that the large number of high-frequency, derived sites were nearly all synonymous (Figure 2, C and D), suggesting that the haplotype that has increased in frequency carried many more synonymous than nonsynonymous polymorphisms. Given that the mutation bias in the D. melanogaster mtDNA greatly favors nonsynonymous mutations (Haag-Liautard ), this pattern suggests a history of effective purifying selection removing mitochondrial haplotypes that contain nonsynonymous polymorphisms. Furthermore, all nonsynonymous polymorphisms that have arisen on the common mitochondrial haplotype are present at very low frequencies. The current distribution of polymorphisms relative to divergence in D. melanogaster showed little evidence for a large and significant excess of segregating deleterious polymorphisms. Across MK tables, no single gene departed significantly from the neutral expectation after Bonferroni correction (P < 0.05/13) (Table 2 and Table S2). For the entire set of protein-coding mitochondrial genes, there was a slight excess of nonsynonymous polymorphism relative to the neutral expectation, as indicated by moderately significant MK tests [Fisher’s exact test, P ranged from 0.0004 to 0.041 across methods (Table S2)] and values of NI that ranged from 1.67 to 2.57 across methods, with confidence intervals that did not include the neutral expectation of 1 (Table S2). There was some OXPHOS-complex specificity to this result—Complexes I (ND) and V (ATPase) tended to deviate significantly from neutrality with NI values greater than 1, whereas Complex IV (CO) was consistent with the neutral expectation (Table 3 and Table S3).
Table 2

Counts of polymorphic (P) and divergent (D) nonsynonymous () and synonymous () sites along with summary statistics of the MK table for D. melanogaster mitochondrial genes

GenePNaPSaDNaDSaNIbZ*cPFETd
ATPase65211357.955−0.7780.021
ATPase810286.000−0.7780.273
COI01681010.6670.1760.595
COII156391.300−0.2801.000
COIII198.547.50.621−0.0091.000
Cyt-b2517.567.51.543−0.2670.641
ND13311454.091−0.5840.122
ND26525411.968−0.2750.334
ND3115224.400−0.5840.377
ND47724632.625−0.4080.120
ND4L211714.00−0.7780.152
ND541055.833107.1670.7680.0630.775
ND62421.522.50.5230.2030.669

MK, McDonald-Kreitman; NI, neutrality index; ATPase, ATP synthase; CO, cytochrome c oxidase; Cyt-b, Cytochrome B; ND, NADH dehydrogenase.

MK counts from the “more-inclusive” method. Values from other methods are in Table S2.

A count of 1 was added to each cell when calculating for any gene with a zero count in any cell.

, as in Presgraves (2005).

P-value from Fisher’s exact test of the MK table.

Table 3

Summary statistics of the MK table for mitochondrially encoded OXPHOS complexes and nuclear genes

SpeciesGenomeGene SetaNIbNITGcZbPFETd
D. melmtDNAeComplex I1.731.59 (0.94, 3.17)−0.2380.070
mtDNAComplex IV0.560.55 (0, 1.30)0.2550.750
mtDNAComplex V9.929.64 (undef)−0.9970.007
mtDNAAll coding1.59 (1.97,3.57,3.89)1.67 (1.03, 2.86)−0.201 (-0.28,-0.33,0.36)0.041
NuclearAutosomes1.16 (1.52,2.83,4.25)1.39 (1.35, 1.43)−0.064 (-0.13,-0.15,0.42)<1e-6
NuclearX chrom0.91 (1.17,2.21,3.24)1.02 (0.94, 1.10)0.041 (-0.07,-0.06,0.41)0.001
H. sapmtDNAComplex I1.191.20 (0.56, 2.40)−0.0740.475
mtDNAComplex IV1.982.00 (0.86, 3.61)−0.2960.402
mtDNAComplex V1.621.66 (0.79, 2.42)−0.2090.127
mtDNAAll coding1.46 (1.38,1.95,1.38)1.48 (0.90, 2.24)−0.165 (-0.23,-0.23,0.32)0.034
NuclearAll coding1.51 (1.39,2.28,2.85)1.57 (1.51, 1.63)−0.180 (-0.12,-0.13,0.40)<1e-6

MK, McDonald-Kreitman; OXPHOS, oxidative phosphorylation; NI, neutrality index; D. mel, D. melanogaster; mtDNA, mitochondrial DNA; undef, undefined; X chrom, X chromosomes; H. sap, H. sapiens.

Complex I (ND) seven loci; Complex IV (CO) three loci; Complex V (ATPase), two overlapping loci. Complex II is nuclear encoded and Complex III has only a single mitochondrial locus, Cyt-b.

NI and were calculated using counts of P, P, D, and D summed across genes within gene sets. Median, mean, and SD provided for whole genome.

with confidence intervals from 5000 bootstrap samples (Stoletzki and Eyre-Walker 2011).

P-value from Fisher’s exact test of the MK table.

D. melanogaster mtDNA data from the “more-inclusive” method. Values from other methods are in Table S3.

MK, McDonald-Kreitman; NI, neutrality index; ATPase, ATP synthase; CO, cytochrome c oxidase; Cyt-b, Cytochrome B; ND, NADH dehydrogenase. MK counts from the “more-inclusive” method. Values from other methods are in Table S2. A count of 1 was added to each cell when calculating for any gene with a zero count in any cell. , as in Presgraves (2005). P-value from Fisher’s exact test of the MK table. MK, McDonald-Kreitman; OXPHOS, oxidative phosphorylation; NI, neutrality index; D. mel, D. melanogaster; mtDNA, mitochondrial DNA; undef, undefined; X chrom, X chromosomes; H. sap, H. sapiens. Complex I (ND) seven loci; Complex IV (CO) three loci; Complex V (ATPase), two overlapping loci. Complex II is nuclear encoded and Complex III has only a single mitochondrial locus, Cyt-b. NI and were calculated using counts of P, P, D, and D summed across genes within gene sets. Median, mean, and SD provided for whole genome. with confidence intervals from 5000 bootstrap samples (Stoletzki and Eyre-Walker 2011). P-value from Fisher’s exact test of the MK table. D. melanogaster mtDNA data from the “more-inclusive” method. Values from other methods are in Table S3. Analysis of 36 of the 38 mitochondrial haplotypes in our sample that were independently sequenced and assembled by Richardson confirmed these patterns (Table S4). When counts of polymorphism and divergence differed between datasets, they typically differed by a single count. The exception was in several Complex I (ND) genes, for which our assembled mtDNAs had a small number of additional nonsynonymous polymorphisms relative to the Richardson data set (ND genes, df = 6, P, = 0.021; all other genes, df = 5, P, = 1) that resulted in slightly greater values of NI (ND included, df = 12, P, = 0.016; all other genes, df = 5, P, = 1). This was not due to absence of two mitochondrial haplotypes in the Richardson data set, and the additional polymorphisms in our data were not clustered on any single mitochondrial haplotype. The reduced number of nonsynonymous polymorphisms in the Richardson data provided even less support for an excess of nonsynonymous segregating variation in the mitochondrial genome. Summed counts of polymorphism and divergence for the entire set of mitochondrial-encoded proteins in this dataset did not deviate from the neutral expectation (P = 0.423), and the confidence intervals on NI for mitochondrial-encoded proteins contained the neutral expectation of 1 (NI = 0.821, 95% confidence interval = 0.386 to 1.90).

On average, NI is similar for mitochondrial and nuclear genes in flies

Although there is a weak signature of an excess of nonsynonymous segregating variation in the D. melanogaster mitochondrial genome, both mitochondrial and nuclear gene sets have median NI, Z*, or NI values that deviate in the same manner from the neutral expectation, indicative of both genomes harboring weakly deleterious polymorphisms. Furthermore, the distribution of D. melanogaster mitochondrial gene NI values was contained within that of the nuclear genes, with many nuclear genes having both more positive and more negative values of NI and Z* (Figure 1, B and C). Weakly significant differences between mitochondrial and nuclear gene NI were affected by the genomic location of nuclear genes (Table S2), as X-linked genes had significantly lower values of NI and more positive Z* values relative to autosomal genes (P < 1e-6 for both statistics) (Figure 1, B and C). Because of this, mitochondrial gene NI differed significantly from X-linked genes (P ranged from 0.007 to 0.065) but not from autosomal genes (P ranged from 0.047 to 0.325), with similar patterns for Z* (mtDNA vs. X, P ranged from 0.002 to 0.019; mtDNA vs. autosomes, P ranged from 0.012 to 0.104). The lower counts of polymorphic sites in the assembled mtDNAs from Richardson provided less support for genomic differences in MK statistics. Neither NI nor Z* differed significantly between the mitochondria and either the X or the autosomes (NI, P > 0.441 for both comparisons; Z*, P > 0.471 for both comparisons). The moderate levels of significance associated with some of these contrasts, and the sensitivity of these contrasts to small differences in MK counts and methods, suggest that although there is a trend for mitochondrial genes to have larger NI (and more negative Z*) values relative to nuclear genes, the differences between genomes are not large. Contrasts with mitochondrial genomes may have low power due to the smaller number of genes and low levels of nonsynonymous polymorphism and divergence, relative to nuclear genomes. However, the mitochondrial data are not a sample of genes, as they represent the complete protein-coding complement of this genome. Nevertheless, a traditional power analysis suggests that we would require an 18-fold increase in the number of mitochondrial genes for the smallest effect size (Table 3) to reach statistical significance. To provide biological context for the small differences in MK summary statistics that we observed between genomes, we calculated and contrasted effect sizes as the difference in means between genomes divided by the root mean square of the SD for NI and Z*. Across MK tables, the differences in NI between mitochondrial and autosomal genes yielded effect sizes that range from 0.18 to 0.71, smaller than those reported in the meta-analyses of Weinreich and Rand (2000), where the difference in mean NI between mitochondrial and nuclear genes was 3.2, with an effect size of 0.96. In an analysis of 98 nuclear loci in D. melanogaster, Presgraves (2005) reported significant differences in Z* for genes located in regions of high and low recombination for which the effect size was 0.96, over twice that which we observed between mitochondrial and autosomal gene Z* (Table 3).

NI does not differ between mitochondrial and nuclear genes in humans

Summary statistics of the MK table also did not differ between mitochondrial and nuclear genes in H. sapiens (NI, P = 0.657; Z*, P = 0.243), nor did the site-frequency spectrum differ between nonsynonymous and synonymous mitochondrial polymorphisms in humans (heterogeneity test, P > 0.36 for all genes). Values of NI and Z* for mitochondrial genes in humans were well within the distribution of these statistics for nuclear genes (Figure 4), and the confidence intervals around NI for the mitochondrial and nuclear genomes were overlapping (Table 3). Similar to the fly mtDNA and nuclear genome, the median values of NI and Z* for both the human mtDNA and nuclear genome were consistent with a slight excess of nonsynonymous polymorphism (Table 3). The distributions of NI and Z* were also largely overlapping and did not differ significantly between D. melanogaster and H. sapiens mitochondrial genes (NI, P = 0.545; Z*, P = 0.441) (Figure 4), despite differing nuclear N between these species. This further supports the idea that the efficacy of purifying selection in these mitochondrial genomes is largely independent of N.
Figure 4

Distributions of (A) neutrality index (NI) and (B) Z* for mitochondrial and nuclear genes in D. melanogaster and H. sapiens. Three nuclear genes in flies and two nuclear genes in humans that had NI values greater than 50 were removed to improve visualization. Dashed lines represent the neutral expectation for each statistic. The D. melanogaster mitochondrial DNA (mtDNA) and nuclear sets contained 13 and 6113 genes, respectively. The H. sapiens mtDNA and nuclear sets contained 13 and 11,624 genes, respectively. See Table S2 and Table 3 and main text for statistical results.

Distributions of (A) neutrality index (NI) and (B) Z* for mitochondrial and nuclear genes in D. melanogaster and H. sapiens. Three nuclear genes in flies and two nuclear genes in humans that had NI values greater than 50 were removed to improve visualization. Dashed lines represent the neutral expectation for each statistic. The D. melanogaster mitochondrial DNA (mtDNA) and nuclear sets contained 13 and 6113 genes, respectively. The H. sapiens mtDNA and nuclear sets contained 13 and 11,624 genes, respectively. See Table S2 and Table 3 and main text for statistical results. Using data from flies and humans, we tested whether contrasts between nuclear and mitochondrial genes with similar function in OXPHOS and putatively similar selective effects of mutations (s) would reveal greater differences in NI between mitochondrial and nuclear genomes as a function of differing N. For humans, there was no difference in NI or Z* between OXPHOS genes encoded in the mitochondrial and nuclear genomes (P > 0.46 for both statistics), whereas for flies there was a weakly significant difference that was driven by the fact that the nuclear OXPHOS genes in our sample had values of NI and Z* that were more consistent with an excess of nonsynonymous substitutions (NI, P = 0.026; Z*, P = 0.022) (Figure S2). However, these data should be treated with some caution, as there were only 11 genes in our nuclear data set annotated to have OXPHOS function, and nine of these genes are part of Complex I (ND). Mitochondrial ND genes accumulate more amino acid substitutions than do other OXPHOS-complex genes in Drosophila (Ballard 2000; Montooth ), potentially reflecting differences in functional constraint among complexes that are consistent with the OXPHOS-complex differences in NI that we observed in this study (Table 3 and Table S3). Finally, we used the human data to illustrate the sensitivity of NI to sampling. When only a few individuals are sampled, the choice of genomes can lead to high variability and extreme values in NI, potentially as a result of single haplotypes that may carry multiple polymorphisms, as appears to be the case for human ND6 (Table 4 and Table S5). For example, depending on which Japanese individuals we included in our analyses, NI for ND6 takes on values of 30.71 (MK test, P = 0.001), 5.50 (P = 0.308), or 1.79 (P = 0.522) when sampling only three mtDNAs. As more mtDNAs are sampled, NI and Z* for each mitochondrial gene become more similar to the neutral expectation (Table 4 and Table S5). Overall, these analyses using D. melanogaster and H. sapiens mitochondrial genomes highlight the sensitivity of these MK statistics to the number of genomes sampled, the amount of divergence between species, and the low levels of polymorphism in these genes.
Table 4

The sensitivity of NI to sampling

Sample1 African1 African1 African19 African-American30 African-American
1 European1 European1 European20 European-Americanc30 European-Americanc
1 Japanesea1 Japaneseb1 Japaneseb
NIdZ*NIZ*NIZ*NIZ*NIZ*
ATPase2.27−0.372.78−0.442.84−0.451.62−0.221.65−0.22
COI12.50-1.086.89−0.8810.33-1.033.61-0.563.68-0.56
COII3.33−0.525.80−0.823.28−0.520.86−0.130.64−0.01
COIII2.36−0.531.38−0.141.60−0.391.38−0.230.95−0.08
Cyt-b3.78-0.573.64−0.553.64−0.552.29−0.363.23*-0.50
ND11.58−0.282.51−0.432.63−0.451.49−0.201.49−0.19
ND25.86-0.765.86-0.767.73-0.863.59−0.552.95-0.46
ND32.83−0.526.80−0.772.83−0.521.33−0.281.52−0.25
ND40.93−0.090.520.050.520.050.130.600.110.70
ND4L2.20−0.342.63−0.422.63−0.425.00−0.624.00−0.54
ND52.00−0.322.38−0.392.23−0.371.20−0.091.44−0.16
ND630.71*-1.225.50−0.741.79−0.251.30−0.201.17−0.16
All coding2.88*-0.462.50*-0.402.41*-0.391.46-0.171.55-0.19
NITG (C.I.)e2.85 (1.83, 4.99)2.60 (1.65, 3.91)2.56 (1.59, 3.81)1.48 (0.90, 2.24)1.59 (0.93, 2.36)

NI, neutrality index; ATPase, ATP synthase; CO, cytochrome c oxidase; Cyt-b, Cytochrome B; ND, NADH dehydrogenase; C.I., confidence interval; MK, McDonald-Kreitman.

MK table counts from Nachman .

MK table counts as mentioned previously, but substituting two different, randomly chosen Japanese samples.

MK table counts from African-American and European-American sequences sampled from (Just ) and (Rubino ) with the chimpanzee mitochondrial reference genome as an outgroup (Horai ).

A count of 1 was added to each cell when calculating NI for any locus with a zero count in any cell. Values in bold indicate P ≤ 0.05; * indicates significant sample-wise Bonferroni-corrected P-value of less than 0.004 for Fisher’s exact test of the MK table.

Calculated as in Table 3. No sample rejected Woolf’s test of homogeneity (P > 0.19 for all samples). Values in bold indicate that the confidence intervals do not overlap the neutral expectation of 1.

NI, neutrality index; ATPase, ATP synthase; CO, cytochrome c oxidase; Cyt-b, Cytochrome B; ND, NADH dehydrogenase; C.I., confidence interval; MK, McDonald-Kreitman. MK table counts from Nachman . MK table counts as mentioned previously, but substituting two different, randomly chosen Japanese samples. MK table counts from African-American and European-American sequences sampled from (Just ) and (Rubino ) with the chimpanzee mitochondrial reference genome as an outgroup (Horai ). A count of 1 was added to each cell when calculating NI for any locus with a zero count in any cell. Values in bold indicate P ≤ 0.05; * indicates significant sample-wise Bonferroni-corrected P-value of less than 0.004 for Fisher’s exact test of the MK table. Calculated as in Table 3. No sample rejected Woolf’s test of homogeneity (P > 0.19 for all samples). Values in bold indicate that the confidence intervals do not overlap the neutral expectation of 1.

Discussion

Using a large sample of whole-genome sequence data, we have tested a number of hypotheses about mtDNA evolution, and about differences in the efficacy of selection on mitochondrial vs. nuclear genes. Our data confirm that mtDNA do not have a signature of recombination and have lower silent-site diversity than do nuclear genes in D. melanogaster, which supports the prediction that the mitochondrial genome has a lower N than the nuclear genome. We also show a skew in the site-frequency spectrum toward rare alleles in D. melanogaster that likely has two sources: 1) the accumulation of new mutations on what appears to be a mtDNA haplotype that has swept to high frequency in the recent past, and 2) the ancestral polymorphisms contained on migrant or remnant haplotypes that are now rare in this population. Despite the apparent reduction in N for mtDNA, our findings indicate that selection is similarly effective at purging deleterious polymorphisms from the mitochondrial and nuclear genomes of D. melanogaster, and that the same is true in H. sapiens. Although all genomes that we analyzed showed some evidence of an excess of nonsynonymous polymorphism relative to the neutral expectation, the only significant differences in NI and Z* were between D. melanogaster mitochondrial genes and X-linked genes. X-linked genes in Drosophila have a greater proportion of beneficial substitutions than do autosomes (Langley ; Mackay ; Meisel and Connallon 2013; Garrigan ), suggesting that what differs between mitochondrial genes and nuclear genes is likely the fraction of beneficial substitutions rather than the efficacy of purifying selection, which appears to be largely independent of N in the D. melanogaster and H. sapiens mitochondrial genomes that we have analyzed. Given its uniparental and haploid transmission, the expectation under neutrality is that the mtDNA has one-quarter the population size of the autosomes. This reduced value of N (and subsequently N) matches that expected for the Y (or W) chromosome, and, like the Y chromosome, the mtDNA has little to no recombination. However, very much unlike the Y chromosomes that have been sequenced (e.g., Charlesworth and Charlesworth 2000; Carvalho ; Carvalho and Clark 2013; Bellott ), animal mtDNA genomes do not show an accumulation of transposable elements, and the gene content of the animal mitochondrial genome is remarkably stable, with few gene losses and even fewer pseudogenes (Boore 1999; Ballard and Rand 2005). Furthermore, d/d is two to 15 times lower for mitochondrial genes than for nuclear genes in mammals (Popadin ), and average values of d/d for mitochondrial genes are well under 0.1 and are, on average, only 13% that of nuclear genes in Drosophila (Bazin ; Montooth ). This pattern of amino acid conservation is particularly striking, given that the mutation rate in the D. melanogaster mtDNA is an order of magnitude greater than the per-site mutation rate in the nuclear genome, with an extreme bias toward nonsynonymous mutations in the mitochondrial genome (Haag-Liautard ; Haag-Liautard ). Although heteromorphic Y chromosomes do show signatures of less effective purifying selection, such as proliferation of satellite repeats and reduced codon bias (Bachtrog 2013; Singh ), the single copy, X-degenerate genes that have remained on the human Y chromosome experience effective purifying selection (Rozen ; Bellott ), as do the protein sequences of Drosophila Y-linked genes (Singh ). Thus, despite early loss of many genes when heteromorphic Y chromosomes and mtDNA formed, both these nonrecombining chromosomes contain genes maintained by effective purifying selection in the presence of reduced N. Many researchers have cited the early work on NI in Drosophila and mammals in support of the idea that mtDNA accumulate deleterious mutations (e.g., Meiklejohn ; Green ; Neiman and Taylor 2009; Akashi ). In fact, this idea has become so engrained that it is regularly cited in reviews of mitochondrial gene evolution (e.g., Ballard and Whitlock 2004; Lynch 2007). What is perhaps surprising about this conversion of a small set of intriguing initial studies into dogma is that the early studies themselves were quite circumspect about the implications of their results. For instance, Nachman (1998), in noting that very few nuclear loci were available for comparison, stated “It is also unclear whether the patterns reported here are unique to mitochondrial DNA.” Data from the few nuclear genes that had been sequenced raised “the possibility that the patterns reported here for mtDNA may also be found at some nuclear loci” (Nachman 1998). Even studies that did have access to additional nuclear datasets were only able to calculate NI for 36 nuclear loci (Weinreich and Rand 2000), and the NI values that were available often did not deviate significantly from neutrality (Nachman 1998; Weinreich and Rand 2000). Those that did reject neutrality tended to do so weakly, perhaps due to the small number of polymorphisms in mitochondrial samples even when the number of individuals sampled is high (e.g., ND3, Nachman ). Nevertheless, there are mitochondrial genes that do strongly reject neutrality, and some of these had NI values that greatly exceeded NI for the sampled nuclear loci. On the basis of these and similar comparisons, many authors have reached the conclusion that mtDNA evolves in a manner distinct from the nuclear genome. Our results using the whole genomes of flies and humans, combined with observations of low mitochondrial d/d, suggest that the mitochondrial genomes of flies and humans are not suffering less effective purifying selection relative to the nuclear genome, and that differences in selection between these genomes may lie in differing rates of adaptive evolution. Reductions in N—due either to reductions in census population size or to the increased effect of linked, selected variants in regions of low recombination—are expected to result in a reduction in the efficacy of purifying selection. Indeed, comparisons of MK test results across a range of species with different values of N have revealed this expected relationship (e.g., Li ; Wright and Andolfatto 2008; Gossmann ), as have comparisons of NI across regions of the D. melanogaster nuclear genome with different recombination rates (Presgraves 2005; Langley ). Therefore, all things being equal, mitochondrial loci would be expected to harbor an excess of nonsynonymous polymorphisms relative to nuclear loci due to reduced N. Our results suggest that all things are not equal between these two cellular compartments, and that there may be features of the mitochondrion that make it less likely to accumulate deleterious mutations. One such feature is the “bottleneck” that occurs in the number of mtDNAs that are passed from mother to offspring—this event makes it possible for selection to act within hosts, possibly increasing the power of selection to remove deleterious mutations (Bergstrom and Pritchard 1998; Rand 2011) and reducing variability in mitochondrial N among taxa, relative to nuclear genomes. The additional layers of selection imposed by mitochondrial inheritance, combined with stronger negative selective effects of amino acid changing mutations in mitochondrial genes (e.g., Popadin ), may allow the mtDNA to escape the accumulation of deleterious mutation, resulting in relatively similar values of NI between nucleus and mitochondria. If the selective effects of mutations in mitochondrial genes are beyond the “horizon” where all mutations will behave similarly regardless of N (Nachman 1998; Eyre-Walker and Keightley 2007), then we expect patterns of mitochondrial polymorphism and divergence to be largely independent of N. Our results come with several caveats. First, we have only studied two organisms—it may be that a more comprehensive review of NI in mtDNA and nuclear loci across many species will reveal a difference in the average efficacy of purifying selection or highlight lineage-specific patterns. The early meta-analyses of NI contained loci from a wide range of animals (Nachman 1998; Weinreich and Rand 2000), and using data from only Drosophila and humans may provide a limited perspective. Nevertheless, these are two model organisms for evolutionary biology that span a large range of mtDNA:nuclear substitution rates, and studies of these species have led the way for much of modern population genetics. Second, it is clear from our analysis of the D. melanogaster mtDNA that it is not at equilibrium, and may be recovering from a partial cytoplasmic sweep that may be associated with Wolbachia (Richardson ). Much of the theory used to predict NI values from N and s assumes mutation-selection-drift balance (see, e.g., Nachman 1998), and deviations from this equilibrium can result in more complex relationships between N, s, and NI (Messer and Petrov 2013). Although nonequilibrium histories may mean that mtDNA NI values are not at equilibrium, it is equally likely that nuclear genes from D. melanogaster are not at mutation-selection-drift equilibrium (Hahn 2008; Langley ). Whether or not the mtDNA is at equilibrium, and whether or not the NI values calculated from this snapshot of two species represent equilibrium values, our results still imply that there is little difference between nuclear and mitochondrial measures of the efficacy of purifying selection. Despite the mitochondrial genome experiencing a distinct population genetic environment relative to the nuclear genome, our whole-genome analyses uncovered little evidence for an excess accumulation of slightly deleterious mutations in mitochondrial genomes, relative to nuclear genomes. In fact, the only strong evidence for a reduced efficacy of selection in animal mtDNA, relative to nuclear genomes, comes from comparative studies of nuclear and mitochondrial tRNAs (Lynch 1996; Lynch 1997). As discussed previously in this article, in the absence of a pattern in NI, there are few patterns of molecular evolution in animal mtDNA indicative of deleterious mutation accumulation [but see Osada and Akashi (2012)]. This pattern is in stark contrast to the patterns found in analogous nuclear regions with reduced N and low recombination, like the Y chromosome. Determining whether mtDNA accumulate deleterious polymorphisms and substitutions more readily than nuclear DNA in a larger sample of species (and what type of loci may be affected) will be a particularly fruitful avenue for future studies.
  77 in total

1.  On the number of segregating sites in genetical models without recombination.

Authors:  G A Watterson
Journal:  Theor Popul Biol       Date:  1975-04       Impact factor: 1.570

2.  On estimating the relation between blood group and disease.

Authors:  B WOOLF
Journal:  Ann Hum Genet       Date:  1955-06       Impact factor: 1.670

3.  Population size does not influence mitochondrial genetic diversity in animals.

Authors:  Eric Bazin; Sylvain Glémin; Nicolas Galtier
Journal:  Science       Date:  2006-04-28       Impact factor: 47.728

4.  Statistical method for testing the neutral mutation hypothesis by DNA polymorphism.

Authors:  F Tajima
Journal:  Genetics       Date:  1989-11       Impact factor: 4.562

5.  The hitch-hiking effect of a favourable gene.

Authors:  J M Smith; J Haigh
Journal:  Genet Res       Date:  1974-02       Impact factor: 1.588

6.  Comparative genomics of mitochondrial DNA in members of the Drosophila melanogaster subgroup.

Authors:  J W Ballard
Journal:  J Mol Evol       Date:  2000-07       Impact factor: 2.395

7.  Statistical tests of neutrality of mutations.

Authors:  Y X Fu; W H Li
Journal:  Genetics       Date:  1993-03       Impact factor: 4.562

8.  The effect of linkage on limits to artificial selection.

Authors:  W G Hill; A Robertson
Journal:  Genet Res       Date:  1966-12       Impact factor: 1.588

9.  The spectrum of mitochondrial mutation differs across species.

Authors:  Kristi L Montooth; David M Rand
Journal:  PLoS Biol       Date:  2008-08-26       Impact factor: 8.029

10.  Genome diversity and divergence in Drosophila mauritiana: multiple signatures of faster X evolution.

Authors:  Daniel Garrigan; Sarah B Kingan; Anthony J Geneva; Jeffrey P Vedanayagam; Daven C Presgraves
Journal:  Genome Biol Evol       Date:  2014-09-04       Impact factor: 3.416

View more
  20 in total

1.  Mitochondrial Mutation Rate, Spectrum and Heteroplasmy in Caenorhabditis elegans Spontaneous Mutation Accumulation Lines of Differing Population Size.

Authors:  Anke Konrad; Owen Thompson; Robert H Waterston; Donald G Moerman; Peter D Keightley; Ulfar Bergthorsson; Vaishali Katju
Journal:  Mol Biol Evol       Date:  2017-06-01       Impact factor: 16.240

2.  Sex and Mitonuclear Adaptation in Experimental Caenorhabditis elegans Populations.

Authors:  Riana I Wernick; Stephen F Christy; Dana K Howe; Jennifer A Sullins; Joseph F Ramirez; Maura Sare; McKenna J Penley; Levi T Morran; Dee R Denver; Suzanne Estes
Journal:  Genetics       Date:  2019-01-22       Impact factor: 4.562

Review 3.  What can we infer about the origin of sex in early eukaryotes?

Authors:  Dave Speijer
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2016-10-19       Impact factor: 6.237

4.  The Roles of Mutation, Selection, and Expression in Determining Relative Rates of Evolution in Mitochondrial versus Nuclear Genomes.

Authors:  Justin C Havird; Daniel B Sloan
Journal:  Mol Biol Evol       Date:  2016-08-25       Impact factor: 16.240

5.  Assessing the fitness consequences of mitonuclear interactions in natural populations.

Authors:  Geoffrey E Hill; Justin C Havird; Daniel B Sloan; Ronald S Burton; Chris Greening; Damian K Dowling
Journal:  Biol Rev Camb Philos Soc       Date:  2018-12-26

6.  High mitochondrial mutation rates in Silene are associated with nuclear-mediated changes in mitochondrial physiology.

Authors:  Ryan J Weaver; Gina Carrion; Rachel Nix; Gerald P Maeda; Samantha Rabinowitz; Erik N K Iverson; Kiley Thueson; Justin C Havird
Journal:  Biol Lett       Date:  2020-09-16       Impact factor: 3.703

7.  Maternal loading of a small heat shock protein increases embryo thermal tolerance in Drosophila melanogaster.

Authors:  Brent L Lockwood; Cole R Julick; Kristi L Montooth
Journal:  J Exp Biol       Date:  2017-11-02       Impact factor: 3.312

8.  Mitigating Mitochondrial Genome Erosion Without Recombination.

Authors:  Arunas L Radzvilavicius; Hanna Kokko; Joshua R Christie
Journal:  Genetics       Date:  2017-09-11       Impact factor: 4.562

9.  Repeated replacement of an intrabacterial symbiont in the tripartite nested mealybug symbiosis.

Authors:  Filip Husnik; John P McCutcheon
Journal:  Proc Natl Acad Sci U S A       Date:  2016-08-29       Impact factor: 11.205

Review 10.  Cytonuclear integration and co-evolution.

Authors:  Daniel B Sloan; Jessica M Warren; Alissa M Williams; Zhiqiang Wu; Salah E Abdel-Ghany; Adam J Chicco; Justin C Havird
Journal:  Nat Rev Genet       Date:  2018-10       Impact factor: 53.242

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.