Literature DB >> 22628461

Evidence for widespread GC-biased gene conversion in eukaryotes.

Eugénie Pessia1, Alexandra Popa, Sylvain Mousset, Clément Rezvoy, Laurent Duret, Gabriel A B Marais.   

Abstract

GC-biased gene conversion (gBGC) is a process that tends to increase the GC content of recombining DNA over evolutionary time and is thought to explain the evolution of GC content in mammals and yeasts. Evidence for gBGC outside these two groups is growing but is still limited. Here, we analyzed 36 completely sequenced genomes representing four of the five major groups in eukaryotes (Unikonts, Excavates, Chromalveolates and Plantae). gBGC was investigated by directly comparing GC content and recombination rates in species where recombination data are available, that is, half of them. To study all species of our dataset, we used chromosome size as a proxy for recombination rate and compared it with GC content. Among the 17 species showing a significant relationship between GC content and chromosome size, 15 are consistent with the predictions of the gBGC model. Importantly, the species showing a pattern consistent with gBGC are found in all the four major groups of eukaryotes studied, which suggests that gBGC may be widespread in eukaryotes.

Entities:  

Mesh:

Year:  2012        PMID: 22628461      PMCID: PMC5635611          DOI: 10.1093/gbe/evs052

Source DB:  PubMed          Journal:  Genome Biol Evol        ISSN: 1759-6653            Impact factor:   3.416


During meiotic recombination, parental chromosomes undergo not only large-scale genetic exchanges by crossover but also small-scale exchanges by gene conversion. These events of gene conversion can be biased. In particular, there is evidence that in some species gene conversion affecting G/C:A/T heterozygous sites yields more frequently to G/C than to A/T alleles, a phenomenon called GC-biased gene conversion (gBGC) (Eyre-Walker 1993; Galtier et al. 2001; Marais 2003; Duret and Galtier 2009a). gBGC is expected to increase the GC content of recombining DNA over evolutionary time and is considered a major contributor to the variation in GC content within and between genomes (Eyre-Walker 1993; Galtier et al. 2001; Marais 2003; Duret and Galtier 2009a). gBGC has caught a lot of attention because it affects the probability of fixation of GC alleles and looks like selection for increasing GC, which can mislead several tests designed to detect positive selection (Galtier and Duret 2007; Berglund et al. 2009; Duret and Galtier 2009b; Galtier et al. 2009; Ratnakumar et al. 2010; Webster and Hurst 2012). It has been demonstrated that gBGC occurs during meiosis in budding yeast (Birdsell 2002; Mancera et al. 2008), and there is strong indirect evidence that this process also affects mammals, where clear-cut relationships between local GC content and recombination rates and many other observations consistent with gBGC have been reported (Galtier 2003; Montoya-Burgos et al. 2003; Spencer et al. 2006; Duret and Arndt 2008; Romiguier et al. 2010). Other studies have investigated gBGC in several organisms such as opossum, chicken, sticklebacks, Drosophila, honeybees, Caenorhabditis elegans, Arabidopsis, wheat, rice, the marine unicellular algae Ostreococcus, and the ciliate Paramecium (Marais et al. 2001, 2003; International Chicken Genome Sequencing Consortium 2004; Marais et al. 2004; Beye et al. 2006; Galtier et al. 2006; Mikkelsen et al. 2007; Duret et al. 2008; Haudry et al. 2008; Jancek et al. 2008; Escobar et al. 2010; Capra and Pollard 2011; Muyle et al. 2011; Nabholz et al. 2011). However, most of the currently available data comes from animals and plants, and we lack a global picture on gBGC in eukaryotes. Here we wanted to investigate whether gBGC has affected genome evolution in other eukaryotic groups. One typical signature of gBGC is that, on the long term, this process leads to a positive correlation between local GC content and recombination rates (reviewed in Marais 2003; Duret and Galtier 2009a). We thus looked for such a relationship in eukaryotic species for which the genome was entirely sequenced. We focused our analyses on taxa for which the genome sequence was assembled and anchored on chromosomes. We included all species available, except for metazoans, which are clearly over-represented in genomic sequence databases, and for which we only selected a representative sample. Our dataset includes 36 species from four of the five major eukaryotic groups: Unikonts, Excavates, Stramenopiles and Plantae ([Keeling et al. 2005], see fig. 1). Recombination data are available for 17 of these species, mostly Metazoan (Unikonts) and Plantae (see table 1). Among these 17 species, 6 show a significant correlation between chromosome-averaged recombination rate and GC content (table 1). Interestingly, out of these six correlations, five are positive. Thus, when a significant correlation is detected, it is in most cases consistent with gBGC. Moreover, the mean correlation coefficient is significantly >0 (0.31, P = 0.0015), again consistent with gBGC.
F

Phylogenetic tree of the 36 species studied. Major groups in eukaryotes (see Keeling et al. 2005) are indicated. Green circles indicate significant positive correlations between GC content (total GC content and/or GC3) and recombination rates (measured directly or using chromosome size as a proxy), consistent with gBGC (this work and others). Red circles indicate significant negative correlations between GC content and recombination rates, not consistent with gBGC. Filled circles indicate new observations from the present study. The “?” indicates when results using direct or indirect measures of recombination rates are not fully consistent.

Table 1

Correlation between Recombination Rates and GC Content among Eukaryotes

SpeciesEukaryotic groupsaChromosome numberGenetic mapbTotal GC/rec ratescChrom size/rec ratesc
Saccharomyces cerevisiae Unikonts168610.62* (*)−0.6* (*)
Cryptococcus neoformans Unikonts132850.04 ns (ns)−0.14 ns (ns)
Monodelphis domestica Unikonts81500.29 ns (ns)−0.05 ns (ns)
Mus musculus Unikonts19101950.68* (*)−0.5* (ns)
Homo sapiens Unikonts22281210.75*** (**)−0.87*** (***)
Gallus gallus Unikonts2792680.89*** (***)−0.97*** (***)
Ciona intestinalis Unikonts13276−0.59* (ns)0.21 ns (ns)
Caenorhabditis elegans Unikonts57800.5 ns (ns)−1* (*)
Drosophila melanogaster Unikonts4d670.8 ns (ns)−0.4 ns (ns)
Apis mellifera Unikonts1620080.74* (*)−0.35 ns (ns)
Trypanosoma brucei Excavates111190.14 ns (ns)−0.09 ns (ns)
Plasmodium falciparum Chromal1434380.37 ns (ns)−0.54* (ns)
Arabidopsis thaliana Plantae56760 ns (ns)−0.2 ns (ns)
Populus trichocarpa Plantae195400.06 ns (ns)−0.28 ns (ns)
Vitis vinifera Plantae19515−0.33 ns (ns)−0.56* (*)
Oryza sativa Plantae121202−0.18 ns (ns)0.53 ns (ns)
Sorghum bicolor Plantae1020290.5 ns (ns)0.21 ns (ns)

aThe eukaryotic groups relate to those shown in figure 1. Chromal, Chromalveolates.

bNumber of markers in genetic maps.

cValues are Spearman correlation coefficients, then come P values: ns, nonsignificant, * <0.05, ** <10−3, *** <10−4 and q values (from FDR corrections for multiple tests) are indicated in parentheses.

dHere is indicated the number of chromosome arms instead of the number of chromosomes.

Phylogenetic tree of the 36 species studied. Major groups in eukaryotes (see Keeling et al. 2005) are indicated. Green circles indicate significant positive correlations between GC content (total GC content and/or GC3) and recombination rates (measured directly or using chromosome size as a proxy), consistent with gBGC (this work and others). Red circles indicate significant negative correlations between GC content and recombination rates, not consistent with gBGC. Filled circles indicate new observations from the present study. The “?” indicates when results using direct or indirect measures of recombination rates are not fully consistent. Correlation between Recombination Rates and GC Content among Eukaryotes aThe eukaryotic groups relate to those shown in figure 1. Chromal, Chromalveolates. bNumber of markers in genetic maps. cValues are Spearman correlation coefficients, then come P values: ns, nonsignificant, * <0.05, ** <10−3, *** <10−4 and q values (from FDR corrections for multiple tests) are indicated in parentheses. dHere is indicated the number of chromosome arms instead of the number of chromosomes. To investigate gBGC in a larger sample of species, including those without recombination data, we used chromosome size as a proxy for recombination rates. It has been shown that chromosome size and recombination rates are inversely correlated in many eukaryotes (e.g., Kaback 1996; Copenhaver et al. 1998; Kaback et al. 1999). This pattern reflects the fact that in many species, the proper segregation of chromosomes during meiosis requires having at least one crossover per chromosome, and that the occurrence of a crossover on a given chromosome decreases the probability of having a second one on the same chromosome (a process termed “crossover interference”). These constraints lead to a lower crossover rate (per Mb) in large chromosomes compared with small ones (Kaback 1996; Copenhaver et al. 1998; Kaback et al. 1999). Among species for which genetic maps are available, we found that in most cases (14/17) chromosome size indeed correlates negatively with recombination rates (table 1), and all significant correlations are negative (7/7). The gBGC model therefore predicts a negative correlation between chromosome size and GC content (although other explanations are possible, see Discussion below). Accordingly, this expected correlation has been found in yeast—for which there is direct evidence of gBGC—and mammals—for which there is strong indirect evidence of gBGC (Bradnam et al. 1999; Meunier and Duret 2004). Table 2 shows that among the 36 eukaryotic species studied, 13 show a significant correlation between chromosome size and chromosome-wide GC content (12 after correction for multiple testing, see table 2). Out of these 13 correlations, 12 are consistent with gBGC—that is, negative. The single exception is Trypanosoma brucei, which shows a significant positive correlation between chromosome size and GC content. Figure 2 shows three examples illustrating the different types of situations that we observed: Leishmania major (significant negative correlation), T. brucei (significant positive correlation) and Guillardia theta (no significant correlation).
Table 2

Correlation between Chromosome Size and GC Content among Eukaryotes

SpeciesEukaryotic groupsaChromosome numberMean GC content (%)Statistical powerb (%)GC total/chrom sizecGC3/chrom sizec
Encephalitozoon cuniculi Unikonts1147410.3 ns (ns)0.06 ns (ns)
Schizosaccharomyces pombe Unikonts3360−0.5 ns (ns)−0.5 ns (ns)
Saccharomyces cerevisiae Unikonts163870−0.83*** (**)−0.87*** (***)
Candida glabrata Unikonts133952−0.69* (*)−0.71* (*)
Pichia stipitis Unikonts841250.24 ns (ns)0.71 ns (ns)
Yarrowia lipolytica Unikonts649140.77 ns (ns)−0.09 ns (ns)
Aspergillus fumigatus Unikonts850250.71 ns (ns)0.26 ns (ns)
Magnaporthe grisea Unikonts75222−0.11 ns (ns)0.07 ns (ns)
Gibberella zeae Unikonts44800.4 ns (ns)0.4 ns (ns)
Ustilago maydis Unikonts2354100−0.46* (ns)−0.47* (*)
Cryptococcus neoformans Unikonts144958−0.33 ns (ns)−0.72* (*)
Coprinopsis cinerea Unikonts135252−0.91*** (***)−0.68* (*)
Monodelphis domestica Unikonts93828−0.1 ns (ns)−0.07 ns (ns)
Mus musculus Unikonts204299−0.28 ns (ns)−0.26 ns (ns)
Homo sapiens Unikonts2341100−0.57* (*)−0.54* (*)
Gallus gallus Unikonts2941100−0.93*** (***)−0.97*** (***)
Ciona intestinalis Unikonts1336520.2 ns (ns)0.29 ns (ns)
Caenorhabditis elegans Unikonts63514−0.54 ns (ns)−0.26 ns (ns)
Drosophila melanogaster Unikonts5d427−0.3 ns (ns)−0.5 ns (ns)
Apis mellifera Unikonts163570−0.03 ns (ns)−0.16 ns (ns)
Dictyostelium discoideum Unikonts62214−0.94* (*)−0.6 ns (ns)
Trypanosoma brucei Excavates1146410.73* (*)0.48 ns (ns)
Leishmania major Excavates3660100−0.85*** (***)−0.82*** (***)
Guillardia theta Chromal3260−1 ns (ns)−1 ns (ns)
Paramecium tetraurelia Chromal11428100−0.84*** (***)−0.89*** (***)
Plasmodium falciparum Chromal141958−0.8** (*)−0.77* (*)
Cryptosporidium parvum Chromal830250.1 ns (ns)−0.1 ns (ns)
Thalassiosira pseudonana Chromal2347100−0.06 ns (ns)−0.87*** (***)
Phaeodactylum tricornutum Chromal3349100−0.18 ns (ns)−0.46* (*)
Ostreococcus lucimarinus Plantae196094−0.72** (*)−0.66* (*)
Micromonas pusilla Plantae156463−0.83** (**)−0.42 ns (ns)
Arabidopsis thaliana Plantae5367−0.2 ns (ns)−0.3 ns (ns)
Vitis vinifera Plantae1934940.37 ns (ns)−0.31 ns (ns)
Populus trichocarpa Plantae1933940.22 ns (ns)0.36 ns (ns)
Oryza sativa Plantae1244460.47 ns (ns)0.48 ns (ns)
Sorghum bicolor Plantae1044360.3 ns (ns)0.68* (*)

aThe eukaryotic groups relate to those shown in figure 1. Chromal, Chromalveolates.

bStatistical power for chromosome number ≥23 is set to 100%.

cValues are Spearman correlation coefficients, then come P values: ns, nonsignificant, * <0.05,** <10−3,*** <10−4 and q values (from FDR corrections for multiple tests) are indicated in parentheses.

dHere is indicated the number of chromosome arms instead of the number of chromosomes.

F

Examples of relationships between chromosome size and total GC content. (a) L. major. (b) T. brucei. (c) G. theta. The values above the plots indicate the number of similar observations that were made among the 36 species (e.g., N = 12 for [a] means 12 significant positive correlations). ρ = Spearman coefficient. Statistical significance: ns, nonsignificant, * <0.05, ** <10−3, *** <10−4.

Examples of relationships between chromosome size and total GC content. (a) L. major. (b) T. brucei. (c) G. theta. The values above the plots indicate the number of similar observations that were made among the 36 species (e.g., N = 12 for [a] means 12 significant positive correlations). ρ = Spearman coefficient. Statistical significance: ns, nonsignificant, * <0.05, ** <10−3, *** <10−4. Correlation between Chromosome Size and GC Content among Eukaryotes aThe eukaryotic groups relate to those shown in figure 1. Chromal, Chromalveolates. bStatistical power for chromosome number ≥23 is set to 100%. cValues are Spearman correlation coefficients, then come P values: ns, nonsignificant, * <0.05,** <10−3,*** <10−4 and q values (from FDR corrections for multiple tests) are indicated in parentheses. dHere is indicated the number of chromosome arms instead of the number of chromosomes. The evolution of chromosomal GC content can be driven by various processes: point substitutions, deletions, or insertions (including repeated sequences). Interestingly, we observed similar correlations when using GC at third codon position (GC3) instead of total GC content (table 2). Given that third codon positions can only evolve by base replacement, this shows that the observed correlation is due to variation in the pattern of point substitutions, and not to variation in DNA repeat content across chromosomes (table 2). In several cases, the statistical significance of the correlation changed from the total GC content analysis to the GC3 one, but the total number of species showing data consistent with gBGC is similar (significant negative correlation: 13/36, significant positive correlation: 1/36). Both analyses gave qualitatively the same results, with—as expected—changes in statistical significance caused by slight changes of the coefficients of correlation in case of species with low chromosome number (i.e., Dictyostelium discoideum, Sorghum bicolor, T. brucei, Cryptococcus neoformans, Micromonas pusilla). Thalassiosira pseudonana and Phaeodactylum tricornutum, two diatoms with a relatively large number of chromosomes, show results consistent with gBGC only for GC3, which raises the possibility of different mutation patterns affecting coding and noncoding regions in these species. The fact that about half of the species shows the footprint of gBGC (i.e., a significant negative correlation) may indicate gBGC is absent in the other half. It may also indicate that our approach fails to detect gBGC in many species. Indeed, the statistical significance of the correlations strongly depends on the number of chromosomes. For species with few chromosomes, our ability to detect the signature of gBGC is limited. For instance, G. theta shows a strong negative correlation between chromosome size and GC content (fig. 2c), but with only three chromosomes, the P value is obviously nonsignificant. We thus performed a statistical power analysis using human as a reference (see Materials and Methods). Table 2 shows the statistical power (from 0 to 100%) for all species of our dataset. Most species have too few chromosomes to detect any significant correlation between GC content and chromosome size. Among the 19 species for which the estimated power of our test is >50%, 14 (74%) show a significant correlation with total or third position GC content, and in all cases the correlation is consistent with gBGC. Similarly, another power analysis using a more conservative reference (yeast) revealed that 14 out of the 28 species with a power of >50% show results consistent with gBGC. Moreover, the combined analysis of all species indicated a strong significant negative correlation (for total GC content and chromosomes size: P value = 10−50, for GC3 and chromosome size: P value = 10−63). However, focusing only on the species that show individually nonsignificant correlations, the combined analysis is not significant. There is thus no clear trend emerging from this subset of species. Given that chromosomal size is only a rough proxy for recombination rate, this result is most likely an underestimate of how widespread this pattern is in our set of species. For example, Mus musculus and Apis mellifera, which contain a high number of chromosomes, show no significant correlation between chromosome size and GC content (table 2). Yet, in both species, studies using recombination data inferred from genetic maps showed a significant positive correlation between local GC content and crossover rates (Beye et al. 2006; Khelifi et al. 2006; see table 1). In M. musculus, the absence of significant correlation between chromosome size and GC content can be explained by the lack of variance in chromosome size in that species (Meunier and Duret 2004). In A. mellifera, as in several other eukaryotes (e.g., Schizosaccharomyces pombe), chromosomes experience little or no crossover interference, and their mean recombination rate is therefore not correlated to their size, which explains that we do not observe any correlation between chromosome size and GC content in these species. Finally, it should be noted that the evolution of GC content is a slow process. If a genome has undergone recent chromosomal rearrangements, it might not show any significant correlation between chromosome size and GC content, simply because there was not enough time to establish the pattern (Duret and Arndt 2008). Given all these limitations of our test, it is remarkable that a majority of species (50–74% of all species with statistical power >50%) show correlations consistent with the predictions of the gBGC model. Several species, however, do not fit into this general pattern: Ciona instestinalis, C. neoformans, S. bicolor and T. brucei. Cryptococcus neoformans is a species with evidence for gBGC from table 2 but not (or incompletely) from table 1. This can look surprising at first sight since we use recombination data in table 1, which is a more direct way of testing for gBGC. However, this assumption is correct if recombination data are of high quality, which might not be the case for most of the species in table 1 with a small number of markers. Too few markers will tend to shorten genetic maps, underestimating recombination rates (other important parameters are the number of meioses analyzed, the distribution of markers along chromosomes). Cryptococcus neoformans and other species in table 1 may be in this situation. It is possible that in such species, chromosome length gives a better idea of the average chromosome-wide recombination rates, which could explain why we report comparatively more species showing evidence of gBGC in table 2 than in table 1. In C. neoformans, the use of two different strains for the available genetic map and the complete genome could be an additional problem for correlating GC content and recombination rates reliably. The conflicting results in Ciona intestinalis may also come from the poor-quality map found in this species (only 276 markers, see table 1). Using two genetic maps in Plasmodium falciparum, one from 1999 with 900 markers (Su et al. 1999) and a more recent one with 3,438 markers (Jiang et al. 2011), we found very different results (GC/recombination: −0.31 nonsignificant with the 1999 version map, 0.34 nonsignificant with the 2011 version map, Chromosome size/recombination: 0.23 nonsignificant with the 1999 version map, −0.54, P < 0.05 with the 2011 version map), which confirms that the quality of recombination data is critical. Trypanosoma brucei shows a significant positive correlation between chromosome size and GC content (fig. 2b). However, it turns out that, for an unknown reason, chromosome size is not a good proxy for recombination rate in this species: the two parameters are not correlated (ρ = −0.09; P = 0.797, see table 1). Table 1 reveals that GC content correlates positively with recombination rates in T. brucei (ρ = 0.14), although not significantly (P = 0.694). It thus appears that T. brucei is not an exception to the general pattern consistent with gBGC. Again, a better map in this species would help understand more clearly the relationships between GC content, chromosome size and recombination rates (there are only 119 markers in this species, see table 1). In S. bicolor, GC3 correlates strongly with chromosome size in a positive manner (table 2). We do not have explanations for this significant correlation, which is not in agreement with gBGC. Sorghum bicolor seems therefore to represent a true exception to the general pattern. In conclusion, we found 17 species with a significant correlation between chromosome-wide GC content and chromosome size, as a rough proxy for recombination rate. Most of them (15/17) showed a negative correlation, consistent with the gBGC model. Our results were unaltered when considering GC3, which rules out the insertion of transposable elements as a general explanation for the observed pattern. Other explanations are of course possible (mutational biases, selection on GC content). In species where these various hypotheses have been tested, gBGC has always come out as the most likely explanation (reviewed in Marais 2003; Duret and Galtier 2009a). More work will be needed, however, to test these alternative explanations and firmly establish gBGC in the species where we report data consistent with gBGC for the first time. Figure 1 shows, in our set of 36 eukaryotes, the species with a positive correlation between GC content and recombination rates (measured directly or using chromosome size as a proxy), consistent with gBGC. Remarkably, this correlation is found in all four major eukaryotic groups studied, which suggests gBGC is widespread in eukaryotes. This is in agreement with a recent study using GC content of ribosomal DNA as a proxy for gBGC, in which gBGC was inferred in several distantly related eukaryotes (Escobar et al. 2011). Firm evidence for gBGC is only available for a handful of species (yeasts and mammals) and our work suggests that gBGC should be further studied in many more species, where it could have important effects on genome evolution.

Materials and Methods

Genome Data

We selected species for which a complete genome assembly, anchored on chromosomes, was available. Animal species are clearly over-represented in public databases. As gBGC is already established in animals, we only selected a subset of species representing the main animal groups. Genome data were extracted from Hogenom version 3 (17 species [Penel et al. 2009]), the NCBI website (15 species, http://www.ncbi.nlm.nih.gov/), and the JGI website (4 species, http://www.jgi.doe.gov/). For the Paramecium genome, we selected the scaffolds that were at least chromosomal arms (Gout J-F, personal communications). The relationship between chromosome size and recombination rate only stands for recombining chromosomes and we therefore removed all the nonrecombining chromosomes (chromosomes 4 from Drosophila melanogaster, 2 and 18 from Ostreococcus lucimarinus, 1 and 17 from M. pusilla, Y and W chromosomes from mammals and chicken, respectively). For our 36 species, we thus had chromosome sizes and sequences to estimate the GC content.

Recombination Data

We got recombination data for C. elegans directly from MareyMap (Rezvoy et al. 2007), D. melanogaster from Flybase (http://flybase.org) and Saccharomyces cerevisiae from http://www.yeastgenome.org/pgMaps/pgI.shtml. Recombination data for other species was obtained from specific papers: M. musculus (Cox et al. 2009), Homo sapiens (Matise et al. 2007), Gallus gallus (Groenen et al. 2009), Monodelphis domestica (Samollow et al. 2007), A. mellifera (Beye et al. 2006), T. brucei (Cooper et al. 2008), P. falciparum (Jiang et al. 2011), Arabidopsis thaliana (Singer et al. 2006), C. intestinalis (Kano et al. 2006), S. bicolor (Mace et al. 2009), Populus trichocarpa (Yin et al. 2004), Vitis vinifera (Doligez et al. 2006), Oryza sativa (Muyle et al. 2011), and C. neoformans (Marra et al. 2004). The number of chromosomes indicated in table 1 may differ from the true chromosome number: the X and Z chromosomes were excluded from this analysis because they recombine only in one sex, and recombination patterns are thus different from those in the autosomes, and the recombination data are not available for some chromosomes (for instance, chromosome 10 for C. neoformans). The recombination rates were computed by dividing the genetic map length of each chromosome by its physical size (in bp), and are thus chromosomal-averaged estimates.

GC Content Analysis

The total GC content was computed using whole-chromosome sequences. The GC content at third codon position (GC3) was computed by collecting all the available CDS from a genome (extracting CDS from Hogenom or Ensembl, or using CDS files from JGI or Broad Institute). For both total GC content and GC3 estimates, ambiguous nucleotides were excluded. Chromosome-averaged GC values were then computed. R was used to obtain bilateral Spearman coefficients of correlation, P values, and q values (P values corrected for multiple testing using the false discovery rate method). The combined analysis was performed by first getting the P values (P) from unilateral tests on Spearman coefficients in order to test for a general trend for a negative correlation between GC content and chromosome size (null hypothesis: GC content and chromosome size are not correlated negatively). The sum of the −2 * log (P) for all species follows a chi-squared distribution with 2n degrees of freedom, n being the number of species, which gave the P value of the combined analysis (Sokal and Rohlf 2012).

Statistical Power Analysis

To estimate the power of our approach according to the number of chromosomes (N) in a given genome, we performed the following test: we took the human genome (for which there is clear evidence of gBGC and which shows a significant negative correlation between chromosome size and GC content) and we asked what would be the probability to detect a significant correlation if this genome only contained N chromosomes. We thus randomly sampled N human chromosomes, computed the Spearman coefficient between their size and GC content, repeated this for all the possible combinations (up to 50,000 samples) and measured the fraction of significant Spearman correlations in the simulated data using R. We took this fraction as the statistical power of our test for a given number of chromosomes N. The same was done using S. cerevisiae as a reference.
  54 in total

1.  A genetic map and recombination parameters of the human malaria parasite Plasmodium falciparum.

Authors:  X Su; M T Ferdig; Y Huang; C Q Huynh; A Liu; J You; J C Wootton; T E Wellems
Journal:  Science       Date:  1999-11-12       Impact factor: 47.728

2.  Detecting positive selection within genomes: the problem of biased gene conversion.

Authors:  Abhirami Ratnakumar; Sylvain Mousset; Sylvain Glémin; Jonas Berglund; Nicolas Galtier; Laurent Duret; Matthew T Webster
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2010-08-27       Impact factor: 6.237

3.  Contrasting GC-content dynamics across 33 mammalian genomes: relationship with life-history traits and chromosome sizes.

Authors:  Jonathan Romiguier; Vincent Ranwez; Emmanuel J P Douzery; Nicolas Galtier
Journal:  Genome Res       Date:  2010-06-07       Impact factor: 9.043

4.  GC content evolution of the human and mouse genomes: insights from the study of processed pseudogenes in regions of different recombination rates.

Authors:  Adel Khelifi; Julien Meunier; Laurent Duret; Dominique Mouchiroud
Journal:  J Mol Evol       Date:  2006-04-28       Impact factor: 2.395

5.  Does recombination improve selection on codon usage? Lessons from nematode and fly complete genomes.

Authors:  G Marais; D Mouchiroud; L Duret
Journal:  Proc Natl Acad Sci U S A       Date:  2001-04-24       Impact factor: 11.205

Review 6.  Biased gene conversion and the evolution of mammalian genomic landscapes.

Authors:  Laurent Duret; Nicolas Galtier
Journal:  Annu Rev Genomics Hum Genet       Date:  2009       Impact factor: 8.929

7.  A genetic linkage map of Cryptococcus neoformans variety neoformans serotype D (Filobasidiella neoformans).

Authors:  Robert E Marra; Johnny C Huang; Eula Fung; Kirsten Nielsen; Joseph Heitman; Rytas Vilgalys; Thomas G Mitchell
Journal:  Genetics       Date:  2004-06       Impact factor: 4.562

8.  Recombination drives the evolution of GC-content in the human genome.

Authors:  Julien Meunier; Laurent Duret
Journal:  Mol Biol Evol       Date:  2004-02-12       Impact factor: 16.240

9.  Genome of the marsupial Monodelphis domestica reveals innovation in non-coding sequences.

Authors:  Tarjei S Mikkelsen; Matthew J Wakefield; Bronwen Aken; Chris T Amemiya; Jean L Chang; Shannon Duke; Manuel Garber; Andrew J Gentles; Leo Goodstadt; Andreas Heger; Jerzy Jurka; Michael Kamal; Evan Mauceli; Stephen M J Searle; Ted Sharpe; Michelle L Baker; Mark A Batzer; Panayiotis V Benos; Katherine Belov; Michele Clamp; April Cook; James Cuff; Radhika Das; Lance Davidow; Janine E Deakin; Melissa J Fazzari; Jacob L Glass; Manfred Grabherr; John M Greally; Wanjun Gu; Timothy A Hore; Gavin A Huttley; Michael Kleber; Randy L Jirtle; Edda Koina; Jeannie T Lee; Shaun Mahony; Marco A Marra; Robert D Miller; Robert D Nicholls; Mayumi Oda; Anthony T Papenfuss; Zuly E Parra; David D Pollock; David A Ray; Jacqueline E Schein; Terence P Speed; Katherine Thompson; John L VandeBerg; Claire M Wade; Jerilyn A Walker; Paul D Waters; Caleb Webber; Jennifer R Weidman; Xiaohui Xie; Michael C Zody; Jennifer A Marshall Graves; Chris P Ponting; Matthew Breen; Paul B Samollow; Eric S Lander; Kerstin Lindblad-Toh
Journal:  Nature       Date:  2007-05-10       Impact factor: 49.962

10.  The impact of recombination on nucleotide substitutions in the human genome.

Authors:  Laurent Duret; Peter F Arndt
Journal:  PLoS Genet       Date:  2008-05-09       Impact factor: 5.917

View more
  69 in total

1.  Phylogenetic sequence analysis and functional studies reveal compensatory amino acid substitutions in loop 2 of human ribonucleotide reductase.

Authors:  Andrew J Knappenberger; Sneha Grandhi; Reena Sheth; Md Faiz Ahmad; Rajesh Viswanathan; Michael E Harris
Journal:  J Biol Chem       Date:  2017-08-14       Impact factor: 5.157

2.  Great majority of recombination events in Arabidopsis are gene conversion events.

Authors:  Sihai Yang; Yang Yuan; Long Wang; Jing Li; Wen Wang; Haoxuan Liu; Jian-Qun Chen; Laurence D Hurst; Dacheng Tian
Journal:  Proc Natl Acad Sci U S A       Date:  2012-12-03       Impact factor: 11.205

3.  Insights into three whole-genome duplications gleaned from the Paramecium caudatum genome sequence.

Authors:  Casey L McGrath; Jean-Francois Gout; Thomas G Doak; Akira Yanagi; Michael Lynch
Journal:  Genetics       Date:  2014-05-19       Impact factor: 4.562

4.  A Fine-Scale Genetic Map for Vervet Monkeys.

Authors:  Susanne P Pfeifer
Journal:  Mol Biol Evol       Date:  2020-07-01       Impact factor: 16.240

5.  Variable Spontaneous Mutation and Loss of Heterozygosity among Heterozygous Genomes in Yeast.

Authors:  Duong T Nguyen; Baojun Wu; Hongan Long; Nan Zhang; Caitlyn Patterson; Stephen Simpson; Krystalynne Morris; W Kelley Thomas; Michael Lynch; Weilong Hao
Journal:  Mol Biol Evol       Date:  2020-11-01       Impact factor: 16.240

6.  Biased Gene Conversion in Rhizobium etli Is Caused by Preferential Double-Strand Breaks on One of the Recombining Homologs.

Authors:  Fares Osam Yáñez-Cuna; Mildred Castellanos; David Romero
Journal:  J Bacteriol       Date:  2015-11-23       Impact factor: 3.490

7.  Genetic and epigenetic determinants mediate proneness of oncogene breakpoint sites for involvement in TCR translocations.

Authors:  N S D Larmonie; A van der Spek; A J J C Bogers; J J M van Dongen; A W Langerak
Journal:  Genes Immun       Date:  2013-12-05       Impact factor: 2.676

8.  Biased Gene Conversion Constrains Adaptation in Arabidopsis thaliana.

Authors:  Tuomas Hämälä; Peter Tiffin
Journal:  Genetics       Date:  2020-05-15       Impact factor: 4.562

9.  Genome sequence of a diabetes-prone rodent reveals a mutation hotspot around the ParaHox gene cluster.

Authors:  Adam D Hargreaves; Long Zhou; Josef Christensen; Ferdinand Marlétaz; Shiping Liu; Fang Li; Peter Gildsig Jansen; Enrico Spiga; Matilde Thye Hansen; Signe Vendelbo Horn Pedersen; Shameek Biswas; Kyle Serikawa; Brian A Fox; William R Taylor; John Frederick Mulley; Guojie Zhang; R Scott Heller; Peter W H Holland
Journal:  Proc Natl Acad Sci U S A       Date:  2017-07-03       Impact factor: 11.205

Review 10.  Codon usage and codon pair patterns in non-grass monocot genomes.

Authors:  Purabi Mazumdar; RofinaYasmin Binti Othman; Katharina Mebus; N Ramakrishnan; Jennifer Ann Harikrishna
Journal:  Ann Bot       Date:  2017-11-28       Impact factor: 4.357

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.