Matthew Z Anderson1, Lauren J Wigen1, Laura S Burrack1, Judith Berman2. 1. Department of Genetics, Cell Biology and Development, University of Minnesota, Twin Cities, Minneapolis, Minnesota 55455. 2. Department of Genetics, Cell Biology and Development, University of Minnesota, Twin Cities, Minneapolis, Minnesota 55455 Department of Microbiology and Biotechnology, George S. Wise Faculty of Life Sciences, Tel Aviv University, Ramat Aviv 69978, Israel judithberman11@gmail.com.
Abstract
Subtelomeric regions of the genome are notable for high rates of sequence evolution and rapid gene turnover. Evidence of subtelomeric evolution has relied heavily on comparisons of historical evolutionary patterns to infer trends and frequencies of these events. Here, we describe evolution of the subtelomeric TLO gene family in Candida albicans during laboratory passaging for over 4000 generations. C. albicans is a commensal and opportunistic pathogen of humans and the TLO gene family encodes a subunit of the Mediator complex that regulates transcription and affects a range of virulence factors. We identified 16 distinct subtelomeric recombination events that altered the TLO repertoire. Ectopic recombination between subtelomeres on different chromosome ends occurred approximately once per 5000 generations and was often followed by loss of heterozygosity, resulting in the complete loss of one TLO gene sequence with expansion of another. In one case, recombination within TLO genes produced a novel TLO gene sequence. TLO copy number changes were biased, with some TLOs preferentially being copied to novel chromosome arms and other TLO genes being frequently lost. The majority of these nonreciprocal recombination events occurred either within the 3' end of the TLO coding sequence or within a conserved 50-bp sequence element centromere-proximal to TLO coding sequence. Thus, subtelomeric recombination is a rapid mechanism of generating genotypic diversity through alterations in the number and sequence of related gene family members.
Subtelomeric regions of the genome are notable for high rates of sequence evolution and rapid gene turnover. Evidence of subtelomeric evolution has relied heavily on comparisons of historical evolutionary patterns to infer trends and frequencies of these events. Here, we describe evolution of the subtelomeric TLO gene family in Candida albicans during laboratory passaging for over 4000 generations. C. albicans is a commensal and opportunistic pathogen of humans and the TLO gene family encodes a subunit of the Mediator complex that regulates transcription and affects a range of virulence factors. We identified 16 distinct subtelomeric recombination events that altered the TLO repertoire. Ectopic recombination between subtelomeres on different chromosome ends occurred approximately once per 5000 generations and was often followed by loss of heterozygosity, resulting in the complete loss of one TLO gene sequence with expansion of another. In one case, recombination within TLO genes produced a novel TLO gene sequence. TLO copy number changes were biased, with some TLOs preferentially being copied to novel chromosome arms and other TLO genes being frequently lost. The majority of these nonreciprocal recombination events occurred either within the 3' end of the TLO coding sequence or within a conserved 50-bp sequence element centromere-proximal to TLO coding sequence. Thus, subtelomeric recombination is a rapid mechanism of generating genotypic diversity through alterations in the number and sequence of related gene family members.
TELOMERES, the sequences at the ends of linear chromosomes, protect the chromosome from terminal degradation. Telomere-adjacent sequences, termed subtelomeres (Pryde ), are often defined by repeat-rich and gene-poor heterochromatic sequence of variable length (Mefford and Trask 2002; Kupiec 2014). The repetitive nature and high sequence similarity (Louis and Haber 1990b; Gardner ) of subtelomeres complicates their inclusion in whole-genome sequence assemblies (Kellis ) and, as a result, subtelomere sequence and organization are often poorly characterized. Accordingly, genes within subtelomeric regions are often excluded from comparative genomics analyses, due to a lack of accurate assembly and sequence read assignment (Hahn ; Wapinski ).Subtelomeres are composed of two segments with different levels of repetitiveness and divergence. Telomere-proximal sequences include short tandem repeats, whereas, telomere-distal domains encode unique genes, gene families, and repetitive elements of varying frequency (Pryde ). Across eukaryotes, subtelomere variability is due, in part, to frequent acquisition of single-nucleotide polymorphisms (SNPs) and insertions/deletions (indels) (Cuomo ; Dreszer ; Anderson ; Carreto ), as well as copy number variation in subtelomeric gene families (Brown ). Thus, subtelomeres are often the most variable region of the genome (Winzeler ; Cuomo ; Carreto ; Kasuga ; Brown ) both in genome-wide studies of DNA sequence variation (Farman and Kim 2005; Dreszer ) and in studies of gene copy number variation (Dujon ; Carreto ; Gresham ). As such, subtelomeres provide ideal environments for the rapid evolution of genes upon which selection can then act. Accordingly, comparison of Saccharomyces cerevisiae strains that produce beer, sherry, or champagne revealed increased copy numbers of subtelomeric MAL, SUC, or MEL genes, respectively (Naumov ; Naumova ; Brown ; Dunn ). This highlights the ability of subtelomeric regions to facilitate the rapid amplification of genes that provide a selective advantage under different growth conditions. However, the mechanisms that give rise to these changes and how rapidly they arise have not been well studied.Recombination operates extensively at subtelomeres in all eukaryote kingdoms, including protists, animals, and plants (Louis ; Linardopoulou ; Gaut ; Rudd ). Studies in protozoa and fungi found that subtelomeric recombination is particularly common in noncoding sequences (Corcoran ; Rehmeyer ; Boothroyd ; Glover ), although it also occurs within coding sequences (Kraemer ; Fan ). For example, in S. cerevisiae, the most common subtelomeric recombination junctions are the Y′ elements, a family of helicase homologs of varying functional integrity (Louis and Haber 1990a; Louis ). In the malaria parasite Plasmodium falciparum, immune evasion is mediated by extensive recombination among subtelomeric var gene paralogs (Kraemer ; Kyes ). Recombination progresses through different mechanisms at subtelomeres including gene conversion (Morrison ; Claessens ) and break-induced replication (BIR) (Boothroyd ; Hovel-Miner ) (Supporting Information, Figure S1). However, the kinetics and stability of var switching events have been difficult to define (Bopp ) because selection against deleterious genotypes likely discards many subtelomeric recombination events before they become fixed to a detectable level in natural populations.The majority of subtelomeres include gene families that are important for adaptation to the ecological niches occupied by that organism (Celenza and Carlson 1985; Dujon ; Kyes ). For example, fluctuations frequently occur in S. cerevisiae subtelomeric metabolic gene families through selective pressures for efficient utilization of available carbon sources (Turakainen ; Brown ; Wenger ; Dunn ). Interestingly, natural S. cerevisiae populations produce new family member variants via recombination between previously established family members (Van Mulders ; Christiaens ). Consequently, recombination exploits existing paralog diversity to construct novel variant genes without requiring extensive mutation. Furthermore, gene noise and expression plasticity at subtelomeres is primarily due to stochastic transcriptional silencing (Gottschling ; Fischer ; Mondoux and Zakian 2007; Choi ; Halliwell ; Anderson ) and facilitates a bet-hedging mechanism in which new variant genotypes can be tested without committing all cells in a population to a given expression pattern.In Candida albicans, the telomere-associated (TLO) genes are the only widespread subtelomeric gene family (Van Het Hoog ), and the 13 subtelomeric paralogs are found at all but three chromosome arms (Chr2R, 6R, and 7L) (Van Het Hoog ; Anderson ). TLOs underwent a recent expansion from 1 copy in most Candida species to 14 paralogs in C. albicans (Butler ). All TLO genes are oriented similarly in the subtelomere, with transcription proceeding toward the centromere (Figure 1C). Sequence homology comparisons distinguish three TLO clades, α, β, and γ, which also exhibit different transcriptional levels and cellular localization patterns (Van Het Hoog ; Anderson ). Indels distinguish the lengths of TLO genes in a clade-specific manner: the seven TLOγ clade genes are shortest (∼525 bp), the five TLOα clade genes are longer, (675–750 bp), and the single TLOβ gene is the longest at 822 bp. Within a clade, TLOs are ∼97% identical and, between clades, they are 82% identical (when excluding indels). All TLOs encode a conserved N-terminal Med2 domain followed by a gene-specific repetitive region of variable length and a C-terminal region that distinguishes the three clades (Anderson ; Zhang ). Investigation of the subtelomeric TLO gene family composition uncovered differences between the original SC5314 reference genome (Jones ) and subsequently sequenced SC5314 isolates, including homologous chromosomes encoding two different TLO genes at the same position (Hirakawa ) and a previously unidentified TLO (Anderson ).
Figure 1
Evolution and sequencing of C. albicans subtelomeres. (A) Strains were evolved by passaging to new rich media agar plates every 2 days for 30 months using cells from the thick part of the streak to avoid bottlenecks during propagation. Passaging was performed either without stress, as described above, or with a 1hr cold shock (4°) followed by a 1hr mild heat shock (37°) immediately after being restruck every third passage. (B) TLO genes were amplified from each chromosome arm using primers represented as half arrows. Amplicons were sequenced from chromosome arm-specific unique sequence to the start codon of the TLO genes (pan-TLO primer). (C) Location of TLO genes in the SC5314 strain prior to passaging is shown. TLO clade is denoted by color. (D) Amplicons composed of mixed sequences as indicated in the chromatograph were separated using the chromosome arm sequence as a template to identify the second sequence. Once separated, these sequences were used to identify the second donor TLO gene.
Evolution and sequencing of C. albicans subtelomeres. (A) Strains were evolved by passaging to new rich media agar plates every 2 days for 30 months using cells from the thick part of the streak to avoid bottlenecks during propagation. Passaging was performed either without stress, as described above, or with a 1hr cold shock (4°) followed by a 1hr mild heat shock (37°) immediately after being restruck every third passage. (B) TLO genes were amplified from each chromosome arm using primers represented as half arrows. Amplicons were sequenced from chromosome arm-specific unique sequence to the start codon of the TLO genes (pan-TLO primer). (C) Location of TLO genes in the SC5314 strain prior to passaging is shown. TLO clade is denoted by color. (D) Amplicons composed of mixed sequences as indicated in the chromatograph were separated using the chromosome arm sequence as a template to identify the second sequence. Once separated, these sequences were used to identify the second donor TLO gene.All TLOs encode a Med2 domain-containing protein, such that the 14 paralogs (13 subtelomeric and one internal on Chr1L) are thought to encode interchangeable components of the Mediator transcriptional regulation complex (Zhang ). In C. albicans, MED3, which associates with TLOs, affects the stability of white–opaque switching (Zhang ), which is necessary for mating and important for formation of some types of biofilms (Park ; Jones ). In the closely related species C. dubliniensis, which has only two TLO genes, deletion analysis revealed that these TLOs contribute to filamentous growth forms, biofilm formation, and sugar metabolism (Jackson ).Here, we describe the genome changes that occurred to members of the TLO subtelomeric gene family during the course of laboratory passaging. Sequencing of subtelomeres in passaged isolates revealed 16 changes in the TLO gene repertoire where one TLO gene had replaced another one on a different chromosome arm. These TLO "movement" events appear to involve a nonreciprocal recombination mechanism that occurs frequently, suggesting that TLO repertoires change frequently and most likely provide a biologically relevant mechanism for generating genetic diversity in C. albicans.
Materials and Methods
Strain construction and passaging
Yeast cells were passaged on solid rich medium (YPAD) agar plates at 30° under standard conditions. Two sets of nine strains were restruck to fresh plates every 2 days from the thick part of the streak, which minimizes bottlenecking of the population. One set, termed “stress passaged,” was exposed immediately after being restruck to 1 hr at 4° and then one hour at 37° prior to incubation at 30° every three passages.The number of generations traversed between passages was determined by cell counts before and after 2 days of growth on YPAD agar plates. Cells from two progenitor strains were struck onto YPAD plates identical to the passaged isolates. Cell counts of a streak through the struck population immediately after plating and following 2 days of growth were performed using a hemocytometer to allow for calculation of generations per passage, Pf = Pi × 2, where Pf and Pi are the final and initial cell counts. Slightly elevated but not significant numbers of generations were found among end-point-passaged strains.
PCR amplification of gene sequences
SC5314 DNA was collected as previously described (Hoffman and Winston 1987). PCR of TLO genes was performed using arm-specific primers for all chromosomes together with a pan-TLO primer. The arm-specific primers were designed against the closest region of unique sequence centromere-proximal to the TLO or of similar homology on chromosome arms lacking a TLO gene. Attempts to sequence beyond the TLO gene were complicated by sequences not matching the expected chromosomal locations from the genome reference sequence.Amplification of the core centromere sequence was performed for each of the eight C. albicans chromosomes. Amplification of ENO1, LIP4, SAP1, and SAP7 was performed independently. Primers are listed in Table S1. All amplicons were amplified and sequenced twice independently using nested internal primers.
Analysis of TLO sequences
Sequenced amplicons were aligned to the reference C. albicans SC5314 genome (http://candidagenome.org/) using BLAST. Sequences were also independently aligned to the SC5314 TLO sequences using the multiple sequence comparison by log-expectation (MUSCLE) algorithm at http://www.ebi.ac.uk/Tools/muscle/index.html (Edgar 2004). Assignment of TLO encoded at specific arms was determined by TLO- and/or clade- specific SNPs (Table S2) and indels and comparison to the amplicon generated from the progenitor strain. Chromatographs were viewed using Sequencher (Gene Codes Corporation, Ann Arbor, MI) and Snapgene Viewer (GSL Biotech, Chicago, IL).
Mapping chromosome heterozygosity
Chromosome heterozygosity was determined using single nucleotide polymorphism–restriction fragment length polymorphism (SNP–RFLP) analysis of previously described SNP positions (Forche ).
Competitive growth assay
Competitive growth assays were performed for YJB9929 evolved strains relative to an RM1000 ENO1–GFP strain (CAY12638) as described in Thompson . Briefly, 5 ×106 cells/ml of each evolved and reference strain were mixed and seeded into 3 ml of YPAD. The ratio of the two competitors was quantified at both the initial and final (72 hr) time points by microscopic visualization of 10 fields of view using a Zeiss Inverted Microscope (Axio Observer Z1) fitted with an AxioCam HR. The selection coefficients were calculated as s = [ln (Ef/Rf) − ln (Ei / Ri)]/T, where E and R are the numbers of evolved and reference cells, subscripts are the final and initial populations, and T is the number of generations that reference cells have grown during the competition.
Statistical analysis
Statistics were performed using R (R Development Core Team 2013) or SPSS Statistics for Windows (IBM Corp. 2013).
Results
Sequence discrepancies between the SC5314 reference genome and SC5314 isolates suggested that the TLO gene family may be continuing to evolve. To determine the time scale of subtelomere evolution, we investigated TLO gene organization in strains that had been passaged continuously for >30 months (457 passages, ∼4570 divisions) under laboratory conditions. Two duplicate sets of nine SC5314-derived lines were passaged every 2 days on solid, rich-media agar plates. Both sets (nonstressed and stressed) were propagated on YPD medium at 30°; additionally, the stressed set was exposed to a cold shock (1 hr at 4°) followed by a heat shock (1 hr at 37°) every three passages (Figure 1A). After 30 months of passaging, we PCR amplified the 16 subtelomeric regions including TLOs from the populations of 18 independently evolved lines, using a chromosome arm-specific primer anchored in centromere-proximal, unique sequences together with a single, pan-TLO primer that bound to a sequence immediately downstream of the start codon and that is conserved among all TLO homologs (Figure 1B). Therefore, only chromosome arms encoding TLOs could produce amplicons that were captured for analysis. The subtelomeric composition of the progenitor strains was identical; 13 chromosome arms encoded a TLO gene (Figure 1C). Only the chromosome arms encoding TLOs in the parental strain generated PCR products for analysis in the passaged isolates, suggesting that no TLO had moved to a previously unoccupied chromosome arm.To determine the resident TLO on all amplified chromosome arms, we sequenced the PCR products and analyzed the sequence for homology among the TLO paralogs. Most subtelomeric amplicons were composed of a single nucleotide sequence, indicated by single, uniform peaks throughout the sequence trace. However, some sequence traces began as a single uniform sequence in the centromere-proximal end of the amplicon and transitioned to a heterogeneous sequence composed of double peaks as the sequence extended toward the telomere, suggesting that each homolog on that particular chromosome arm encoded distinct TLO sequences. The new TLO sequence was inferred by subtracting out the previously established TLO sequence for that chromosome arm from the double peaks (Figure 1D). This process was performed for the 13 chromosome arms with TLOs in 18 passaged strains for a total of 234 unique data points.
Timing of TLO recombination events
Fourteen major subtelomeric rearrangement events were evident following 30 months of strain passage (Figure 2A and Table 1). This was defined as any detectable alteration in the TLO repertoire resulting from chromosome homozygosis and/or “movement” of specific TLO gene family members between subtelomeres during passage.
Figure 2
Evolution of subtelomeric TLOs during passaging. (A) The timing of identified recombination events was determined by screening archived isolates. The most narrow time frame for each event is displayed visually as a solid bar for each strain. The strain used and the chromosome arm affected by recombination are indicated on the y-axis. Black or purple bars delimit the time frames during which the recombination events became detectable in the different strains. Black bars indicate a 1-month interval and purple bars indicate longer intervals because isolates were archived less frequently over these intervals. Bottom: moving average for recombination events occurring at 15 passage intervals during the experiment. (B) Sequential recombination events on the right arm of chromosome 4 resulted in homozygosis of Chr4R in strain YJB9929.
Table 1
TLO recombination events identified in passaged strains
Strain
Chr arm
Premovement
Postmovement
Recombination point
Recombination relative to CDS stop codon
YJB8648
4L
TLOα9/TLOα9
TLOα12–α9/TLOα12–α9
Chr4: 1612–1524
CDS 3′ end (−135, −47)
YJB8648s
4R
TLOα9/TLOα10
TLOα12/TLOα12
Chr4: 1597170–1597206
137–174 bp centromeric
YJB9907
4R
TLOα9/TLOα10
TLOα12/TLOα12–α10
Chr4: 1597488–1597612
CDS 3′ end (−265, −151)
YJB9915
4L
TLOα9/TLOα9
TLOα9/TLOα12–α9
Chr4: 1669–1519
9 nt centromeric
YJB9916s
3L
TLOγ7/TLOγ7
TLOγ11/TLOγ11
Chr3: 14321–14089
56 nt centromeric of the unspliced form
4L
TLOα9/TLOα9
TLOα12–α9/TLOα12–α9
Chr4: 1614–1527
CDS 3′ end (−133, −45)
YJB9929
RL
TLOα1/TLOα1
TLOα1/TLOγ13
Centromeric
Centromeric
1L
TLOα3/TLOα3
TLOα3/TLOα1
Chr1: 11547–11543
55–60 bp centromeric
4L
TLOα9/TLOα9
TLOα9/TLOα12
Chr4: 1666–1516
6 nt 3′ to −144bp into 3′ of CDS
4R
TLOα9/TLOα10
TLOα10/TLOα10
Centromeric
Centromeric
4R
TLOα10/TLOα10
TLOα10/TLOγ8
Chr4: 1597146–1597167
191–170 bp centromeric
4R
TLOα10/TLOγ8
TLOα10/TLOα10
Centromeric
Centromeric
YJB9929s
4L
TLOα9/TLOα9
TLOα12–α9/TLOα12-α9
Chr4: 1614–1527
CDS 3′ end (−133, −45)
YJB10777s
4L
TLOα9/TLOα9
TLOα12–α9/TLOα12–α9
Chr4: 1614–1527
CDS 3′ end (−133, −45)
YJB10779
4R
TLOα9/TLOα10
TLOα9/TLOα9
Chr4: 1597327–1597336
1–7 nt centromeric
YJB10780s
1L
TLOα3/TLOα3
TLOα3/TLOγ11
Chr1: 11695–11669
180 bp centromeric
Evolution of subtelomeric TLOs during passaging. (A) The timing of identified recombination events was determined by screening archived isolates. The most narrow time frame for each event is displayed visually as a solid bar for each strain. The strain used and the chromosome arm affected by recombination are indicated on the y-axis. Black or purple bars delimit the time frames during which the recombination events became detectable in the different strains. Black bars indicate a 1-month interval and purple bars indicate longer intervals because isolates were archived less frequently over these intervals. Bottom: moving average for recombination events occurring at 15 passage intervals during the experiment. (B) Sequential recombination events on the right arm of chromosome 4 resulted in homozygosis of Chr4R in strain YJB9929.To further refine the timing of the TLO recombination events, we assayed isolates at intermediate time points during passaging for either the ancestral or evolved TLO organization. Rearrangements occurred throughout the time course from the first month to the final month of passaging (Figure 2A). Recombination occurred randomly throughout passaging (P = 1.0, one-way ANOVA). Furthermore, no clear environmental perturbations or alterations in the passaging protocol coincided with these time points. Additionally, sequencing the subtelomeres at the earliest time points following recombination failed to identify co-occurrence of other subtelomeric recombination events with the exception of events at both arms of chromosome 4 (Chr4L and 4R) in YJB9929. Thus, TLO gene family changes occurred at random times throughout the passaging experiment.Most studies of genome evolution compare static representations of a continuous and ongoing process. The experimentally practical time scales of TLO recombination allowed transitionary phases to be captured by experimental evolution. For example, in strain YJB9929, the ancestral TLOα9/TLOα10 configuration on Chr4R had homozygosed to two copies of TLOα10 by the end of passaging (Table 1). Interestingly, this involved two transitionary states of Chr4R prior to the final TLOα10/TLOα10 arrangement (Figure 2B). The first transition, seen in passage 137, was loss of TLOa9 by homozygosis of the Chr4R TLOα10 allele; the second transition, seen between passages 168 and 184, was introduction of one copy of TLOγ8 from Chr3R and loss of a TLOα10 allele; and the final transition, between passage 372 and 403, was homozygosis of TLOα10. Identification of these different transition states during passaging suggests that additional recombination events are often missed when comparing static representatives of genome evolution.Capture of these transitional recombination events also provided a more accurate measure of the rate of TLO evolution. Strains underwent ∼4570 divisions during the course of the experiment based on estimates of ∼10 cell divisions between passages (data not shown). Therefore, we can estimate TLO movement to occur in ∼1:5142 cell divisions or 1:82,260 chromosome arms per division. These rates are orders of magnitude greater than either genome average per-base-pair mutation rates or insertion/deletion rates per cell division (∼10−9) (Lang and Murray 2008; Lynch ). Thus, subtelomeric recombination provides an opportunity for high-frequency gene evolution.Population-level strain passage requires that the detected recombination events rose to significant prominence in the passaged population. To determine the requirement and kinetics of event fixation, we sequenced the recombination site of 24 single colonies from strain YJB9929 from passage 411, immediately following recombination (TLOα10/TLOγ8 → TLOα10/TLOα10). Twenty-three of 24 colonies had undergone recombination that homozygosed TLOα10 at this time point (Figure S2), demonstrating that the detected recombination events rapidly dominate the population but are not completely fixed at the earliest time points.
Types of TLO recombination events
All of the evolved strains originated from the same genetic background with little time to accumulate unique, independent mutations prior to passaging. Thus, we assume that each strain and each TLO had an equal probability of recombination. Based on this assumption, there was no obvious effect of stress on either the number of strains that underwent TLO recombination (5 stressed vs. 5 unstressed) or the number of recombination events that occurred over the course of the passaging experiment (6 stressed vs. 10 unstressed). The distribution of recombination events among strains was random with the exception of a single strain (YJB9929) that underwent six separate events (Figure S3 and Table 1).The distribution of TLOs involved in recombination events suggested specific patterns of recombination among the TLO genes. Recombination events were significantly clustered at certain chromosome arms, χ2 ((N = 32) = 56.3; P = 0.0005; Monte Carlo). Interestingly, recombination was much more prevalent in α-clade TLOs than in TLOs of the β- and γ-clades.We separated TLO genes that moved into two groups, the donor gene, which was copied, and the recipient gene, which was lost (Figure 3). In all cases, a copy of the donor gene remained unaltered at its native locus. TLO recombination occurred extensively between some TLOs/chromosome arms (TLOα9, TLOα10, and TLOα12) and was excluded from others (TLOβ2, TLOγ4, TLOγ5, and TLOγ16). Some loci were overrepresented as gained or as lost during TLO recombination (gained, χ2 (N = 16) = 57.1; P < 0.0005; Monte Carlo); lost, χ2 ((N = 16) = 62.0; P < 0.0005; Monte Carlo). For example, TLOα12 always increased in copy number by expanding to other chromosome arms such as Chr4L, where TLOα9 consequently reduced in copy number.
Figure 3
Inventory of TLO events. The TLO family members affected by subtelomeric recombination were separated into those lost (replaced by another paralog) and those gained (copied to other subtelomeric loci). Some TLOs were significantly overrepresented in the identified recombination events.
Inventory of TLO events. The TLO family members affected by subtelomeric recombination were separated into those lost (replaced by another paralog) and those gained (copied to other subtelomeric loci). Some TLOs were significantly overrepresented in the identified recombination events.Two general types of recombination mechanisms were evident from TLO sequencing. The majority was ectopic recombination events; no recipient-specific sequence was seen telomere-proximal to the initial conversion tract. Thus, these were most likely due to BIR or gene conversion that extended to the end of the chromosome (Figure S1). No reciprocal genetic exchange was detected. The second set of TLO movements occurred due to loss of heterozygosity (LOH) of a previously heterozygous TLO locus following ectopic recombination for seven events (Table 1 and Table 2). The recipient TLO loci were completely lost in these seven events because the donor TLO was homozygosed at the recipient subtelomere, resulting in a net increase of two copies of the donor TLO. TLOα10 was lost in two independent isolates, suggesting that it may be either particularly susceptible to invasion or more dispensable under the passaging conditions. Complete loss of the native TLO in the strain following homozygosis of a donor TLO was significantly more prevalent in stress-passaged than unstressed strains (83.3% vs. 14.3%, χ2, d.f. = 1; P = 0.024; Fischer’s exact test).
Table 2
Subtelomeric recombination mechanisms
Recombination mechanism
Movement
Crossover
YJB9907–4R
YJB9929–4L
YJB10780s–1L
Crossover, then LOH
YJB8648–4L
YJB9916s–4L
YJB9929s–4L
YJB10777s–4L
Gene conversion
YJB9915–4L
YJB9929–RL
YJB9929–1L
YJB9929–4R, 186 passages
Gene conversion, then LOH
YJB8648s–4R
YJB9916s–3L
LOH
YJB9929–4R, 124 passages
YJB9929–4R, 388 passages
YJB10779–4R
Gene loss following invasion and LOH can alter the TLO repertoire both at the gene and clade level. The majority of the recombination events (11/16) were between TLOα genes, one was between TLOγ genes, and four were interclade recombinations. Of the four interclade events, including the LOH events on Chr4R in YJB9929 (Figure 2B), TLOα and TLOγ were equally likely to be the donor or the recipient.
Sequence elements associated with subtelomeric recombination events
The prevalence of intraclade recombination events suggests that specific DNA regions may facilitate sequence-dependent recombination (Myers ). To ask if recombination sites clustered at specific subtelomeric positions, we mapped the recombination site for each event using the junction sequence between the donor and recipient chromosome arm sequences (Figure 4A). Six of 14 recombination events occurred in the 3′ end of the TLO open reading frame (ORF), and two additional recombination events occurred immediately centromere-proximal of the 3′ UTR. All of these events were products of intraclade recombination, likely due to clade-specific sequence homology at the 3′ end of the TLO ORF. Three recombination events occurred within a highly conserved 50-bp sequence element found at all TLO-containing chromosome ends except for TLOβ2 and TLOγ16 and absent from chromosome arms lacking TLOs (Figure 4B and Figure S4). Previous attempts to assemble the subtelomeric regions from SC5314 whole-genome sequencing were problematic specifically due to the presence of this same sequence element that we termed the Bermuda Triangle sequence (BTS). Importantly, two of these events involved members of different TLO clades. Furthermore, the parental SC5314 background encodes a six nucleotide tandem duplication within the BTS that appeared at the same position that the TLOα10 sequence on Chr4R became heterozygous for TLOα9/TLOα10 (Figure S5, homolog 2 sequence). This suggests that recombination in the BTS produced the TLOα9/TLOα10 heterozygous organization on Chr4R.
Figure 4
Recombination clusters at defined subtelomeric regions. (A) Conversion tracts were mapped onto a representation of the C. albicans subtelomere and enriched regions of recombination were identified in the C-terminal portion of the TLO coding sequence and the centromeric BTS. Arrows indicate specific sites of recombination and horizontal lines indicate a region within which a single recombination event occurred. TLO replacements resulting from LOH were not included in the map. (B) Alignment of the 11 subtelomeric BTS elements reveals strong sequence conservation. (C) Recombination between the TLOα12 and TLOα9 ORFs produced a hybrid TLOα12–α9 sequence. Recombination occurred within a region of strong homology that is demarked by gene-specific SNPs and did not alter the Tloα12 protein sequence. The recombination region is shaded intermediate to either TLO gene. (D) Recombination between the TLOα12 and TLOα10 ORFs produced a hybrid TLOα12–α10 sequence with a unique protein sequence. The novel Tloα12–α10 sequence is characterized by a 111-nt deletion at the site of recombination and the incorporation of nonsynonymous SNPs between the TLO genes.
Recombination clusters at defined subtelomeric regions. (A) Conversion tracts were mapped onto a representation of the C. albicans subtelomere and enriched regions of recombination were identified in the C-terminal portion of the TLO coding sequence and the centromeric BTS. Arrows indicate specific sites of recombination and horizontal lines indicate a region within which a single recombination event occurred. TLO replacements resulting from LOH were not included in the map. (B) Alignment of the 11 subtelomeric BTS elements reveals strong sequence conservation. (C) Recombination between the TLOα12 and TLOα9 ORFs produced a hybrid TLOα12–α9 sequence. Recombination occurred within a region of strong homology that is demarked by gene-specific SNPs and did not alter the Tloα12 protein sequence. The recombination region is shaded intermediate to either TLO gene. (D) Recombination between the TLOα12 and TLOα10 ORFs produced a hybrid TLOα12–α10 sequence with a unique protein sequence. The novel Tloα12–α10 sequence is characterized by a 111-nt deletion at the site of recombination and the incorporation of nonsynonymous SNPs between the TLO genes.Sequencing of the subtelomeric region surrounding the TLO genes failed to identify a recombination site for 3 of the 16 gene movement events. To determine if recombination occurred centromere-proximal to the sequenced subtelomere, the heterozygosity of the altered chromosome was interrogated by SNP–RFLP using loci on each chromosome arm. In all three instances (YJB9929–RL, YJB9929–4R recombination 1, and YJB9929–4R recombination 3), heterozygosity was maintained across the SNP markers on both arms of the chromosome (Figure S6). Therefore, it is most likely that recombination occurred centromere-proximal to the subtelomere but distal to the SNP–RFLP markers.Recombination within gene family ORFs can produce novel recombinant proteins that may have unique properties. In one case, a crossover within the 3′ ends of TLOα9 and TLOα12 produced a hybrid gene on Chr4L based on acquisition of a single SNP (T → C). This synonymous mutation altered the codon usage for Asp238 and the 3′ noncoding region of the gene was different (Figure 4C). Interestingly, five isolates underwent a very similar recombination event (Figure 4C, Table 1, and Table 2). Subsequently, the recombinant TLO was homozygosed in four of the strains, such that the TLOα12–α9 hybrid became the only TLO at Chr4L (Table 1 and Table 2).The other case of inter-TLO recombination produced a TLOα12–TLOα10 hybrid in strain YJB9907. In this case, a single crossover between codon 164 (265 nucleotides (nt) upstream of the stop codon) in TLOα12 and codon 168 (151 nt upstream of the stop codon) in TLOα10 resulted in a protein that differed from TLOα12 by the loss of 37 amino acids in the gene-specific region of variable length and six new SNPs in the clade-specific, conserved C terminus, including two nonsynonymous codon changes (P203H and I209N) (Table S3 and Figure 4D). Thus, the recombination event produced a novel TLOα variant protein, TLOα12–10.To ask if specific chromosome arms were biased toward specific recombination sites, we categorized recombination border locations for each subtelomere. We divided the chromosome arm into defined regions with different sequence properties represented by the TLO gene (∼500–700 bp), the UTR immediately downstream of the TLO gene (UTR, ∼300 bp), the BTS (50 bp), and any sequence centromere-proximal to the BTS, and inferred the frequency of recombination in each region within the passaged strains. All four regions underwent recombination with different distributions dependent on the chromosome arm (Figure 5). For example, five of the six recombination events on Chr4L occurred within the 3′ region of the TLO gene whereas recombination on Chr4R was more evenly divided among the four categories. This suggests that the types of subtelomeric regions prone to recombination differ depending on the chromosome arm.
Figure 5
Association of recombination site and chromosome arm. The region of recombination was defined for all recipient chromosome arms as either within the TLO, between the TLO and BTS (Mid), within the BTS, or centromeric of the BTS (Cen). Binning of the number of recombination events within each region for a given chromosome arm revealed bias in the recombination site for certain chromosome arms.
Association of recombination site and chromosome arm. The region of recombination was defined for all recipient chromosome arms as either within the TLO, between the TLO and BTS (Mid), within the BTS, or centromeric of the BTS (Cen). Binning of the number of recombination events within each region for a given chromosome arm revealed bias in the recombination site for certain chromosome arms.
TLO sequence evolution during passaging
Elevated levels of polymorphism in subtelomeres, in addition to recombination, contribute to gene evolution at chromosome ends (Cuomo ; Dreszer ). To investigate the potential for nucleotide polymorphisms to contribute to TLO gene evolution during passaging, we analyzed the TLO sequences for acquisition of new SNPs during passage. TLOα and TLOγ clade members accumulated 12 and 19 novel SNPs within the ORFs, respectively. In contrast, no new SNPs were identified in the single TLOβ family member, TLOβ2. The total number of polymorphisms and nonsynonymous changes in individual genes varied considerably (Figure 6A and Table S4); overall, there were similar numbers of total nonsynonymous polymorphisms within genes of the TLOα and the TLOγ clades (7 vs. 9, respectively). Thus, in contrast to the frequency of recombination (Figure 3), the frequency of SNP accumulation was similar between the two clades.
Figure 6
SNP accumulation during passaging. (A) SNPs arising during strain propagation were identified by comparison to the progenitor strain. The number of SNPs per 100 bp for each subtelomeric TLO (5 for TLOα, 1 for TLOβ, and 7 for TLOγ), eight centromere core regions (CEN), ENO1, LIP4, SAP1, and SAP7 were plotted as individual data points for each locus. (B) Identified SNPs were grouped by clade of origin and plotted along the length of the TLO coding sequence. TLOγ clade SNPs clustered toward the Med2 domain at the 5′ end and TLOα SNPs arose primarily in the clade-specific region at the 3′ end of the gene.
SNP accumulation during passaging. (A) SNPs arising during strain propagation were identified by comparison to the progenitor strain. The number of SNPs per 100 bp for each subtelomeric TLO (5 for TLOα, 1 for TLOβ, and 7 for TLOγ), eight centromere core regions (CEN), ENO1, LIP4, SAP1, and SAP7 were plotted as individual data points for each locus. (B) Identified SNPs were grouped by clade of origin and plotted along the length of the TLO coding sequence. TLOγ clade SNPs clustered toward the Med2 domain at the 5′ end and TLOα SNPs arose primarily in the clade-specific region at the 3′ end of the gene.The TLO gene can be divided into three distinct regions with specific sequence properties. The N-terminal 300 nt encode the Med2 domain that provides the basis for assembly into the Mediator complex. This is followed by a short repetitive region of variable length that is gene specific and a C-terminal region of ∼250–350 nt that is clade specific. In TLOα genes, new SNPs were primarily distributed across the clade-specific region evenly (Figure 6B); in the TLOγ genes, new SNPs were clustered at different points along the gene and were enriched in the Med2 domain. Therefore, there may be clade-specific selective pressures exerted on the TLO genes during laboratory passaging.Previous investigation of sequence evolution among Candida species determined that centromeres evolve more rapidly than other loci (Padmanabhan ). Thus, we compared the rate of evolution at TLOs relative to centromeres by interrogating an 800-bp segment of the central core of all eight C. albicans centromeres for SNPs that arose during passaging. Eleven SNPs accumulated across the eight centromere core regions in the nine isolates (total of 11 SNPs/12,800 bp (800 bp × 8 centromeres × 2 alleles). The distribution of mutations among the centromeres was highly skewed; a single centromere, CEN2, accumulated eight new mutations, CENR accumulated two mutations, and CEN5 acquired a single mutation (Figure 6A and Table S5); in contrast, TLO genes acquired 31 novel mutations in 13 ORFs analyzed across a total of 15,970 bp (average of 614 bp × 13 telomeres × 2 alleles).As a control for the evolution of expanded gene family members (Butler ), we also analyzed 1600 bp (800 bp × 1 gene per strain × 2 alleles) within LIP4, a lipase within the C. albicans expanded LIP family, and two members of the expanded serine aspartyl protease (SAP) family, SAP1 and SAP7. Sequencing detected a single SNP (C877A) within LIP4 and two SNPs in each of the SAP ORFs (Figure 6A and Table S6). Additionally, the ENO1 ORF, which encodes a metabolic enolase, did not acquire any novel SNPs following passaging of these strains (Figure 6A). Thus, the TLO genes as a group accumulated more mutations per locus and per base pair during the 457 passages (2.58 per TLO and 19.4/10 kbp) compared to the centromere core region (1.13 per centromere and 8.59/10 kbp) and other genes with multiple family members (1 per LIP4 and 6.25/10 kbp and 2 per SAP and 12.5/10 kbp), indicating that telomeric genes have the potential to evolve at least as rapidly as the quickly evolving centromere regions and other expanded gene families that are acted on by host selection.
Phenotypic consequences of recombination
We next asked if the evolved strains that underwent subtelomeric recombination had acquired a fitness advantage under the passaging conditions using competitive fitness assays. We mixed equal cell concentrations of a related GFP-marked reference strain and one of several YJB9929 isolates from different passaging time points. We then allowed the cell mixtures to grow in liquid YPAD medium at 30° for 72 hr (∼30 generations). Comparison of final and initial frequencies of the two strains identified a significant increase in the competitive index of the evolved isolates after 1000 or more generations (Figure 7).
Figure 7
Increased growth rates during strain passaging. Passaged isolates from several time points associated with subtelomeric recombination were assayed for fitness using competition assays against a GFP-marked reference strain. The specific recombination events that arose in this passaged strain are illustrated for each chromosome arm by an arrow that is aligned with the time points on the x-axis below. As expected, relative fitness, as measured by the selection coefficient, increased with continued passaging.
Increased growth rates during strain passaging. Passaged isolates from several time points associated with subtelomeric recombination were assayed for fitness using competition assays against a GFP-marked reference strain. The specific recombination events that arose in this passaged strain are illustrated for each chromosome arm by an arrow that is aligned with the time points on the x-axis below. As expected, relative fitness, as measured by the selection coefficient, increased with continued passaging.
Discussion
Evolutionary histories of a wide range of species often point to subtelomeres as highly dynamic regions with rapid gene turnover (Kraemer ; Linardopoulou ; Anderson ; Carreto ; Dunn ). The degree of change and the dynamics with which it occurs have been largely underappreciated, due to the complexity of repeats at subtelomeres, which obviates detailed analysis by next-generation sequencing. Here, we followed the dynamic evolution of subtelomeres over relatively short time scales in nine related isolates evolved with or without a mild temperature stress. We found that the genes in subtelomeric regions evolve rapidly, undergoing ectopic recombination events and acquiring 31 point mutations (1 per 515 bp) in the course of over ∼450 passages and ∼4500 cell divisions. Furthermore, some types of changes were recurrent and transient (e.g., Chr4R changes in strain YJB9929; Figure 2 and Table 1), such that they likely would be missed by more static analyses.Recombination resulting in the duplication of some TLO family genes and the loss of others was frequent, occurring once per ∼5000 divisions. Specific TLOs preferentially increased or decreased in copy number during recombination. Not surprisingly, exchanges between genes within the same clade were more frequent (75%) than interclade recombination events. Recombination was most frequent within the C-terminal clade-specific regions of TLO ORFs or within the highly similar BTS sequence element (88% similarity across 11 subtelomeres) and occurred via either LOH or ectopic crossover events. Point mutations accumulated across the TLO coding sequences and were clustered toward the N-terminal Med2 domain in TLOγ clade genes and toward the 3′ clade-specific region in TLOα clade genes (Figure 4).The rates of subtelomeric recombination measured here are likely underestimates of recombination frequency at C. albicans subtelomeres. Recombination events that arise at low frequency are not captured when screening at the population level. Thus, only fixed or high-frequency recombination events could be used to calculate subtelomeric recombination rates. Furthermore, rare recombination via translocation might occur between subtelomeres and chromosome-internal positions (i.e., TLOα34) and would not have been detected in these experiments (Fan ).Subtelomeric regions typically evolve at faster rates (Dreszer ) and are the sites of more frequent gene duplication events (Ames ) than other genomic regions. Accordingly, subtelomeres have been proposed to be “hotbeds” for genomic evolution (Brown ). C. albicans subtelomeres appear to undergo a similar process of sampling evolutionary space through recombination, gene duplication, and mutation. TLOs are predicted to encode the same Med2 protein that is incorporated, as a monomer, into the Mediator tail and thus to regulate transcription of downstream gene sets (Ansari ). Expansion of some TLO paralogs relative to others allows drift to operate on that gene and can result in the establishment of new interacting partners without disrupting the molecular role of the original gene. Novel TLO genes, produced either by mutation or recombination, result in the same molecular outcome but are likely under heavier selective pressure to be retained. We suggest that together these novel functions may modify Mediator interactions that significantly alter transcriptional responses and the phenotypes that they proscribe (Haran ).Selection appears to favor some TLO family genes over others. For example, two TLOs, TLOγ7 and TLOα10, were completely lost from the genome during passaging of some strains, while TLOα12 was gained eight times during recombination and was subsequently homozygosed in six independent isolates. Interestingly, the copy number of TLOα9 was reduced in eight separate events, yet it was never lost: all strains encoded at least one copy of TLOα9 at the end of the passaging experiment. Therefore, the presence of some TLOα9 might provide a selective role in passaged strains.Most of the observed recombination events occurred between TLOα clade members, which fractionate as a single stoichiometric component of the Mediator complex in C. albicans (Zhang ). Thus, the recombination events observed likely alter the relative availability of different TLOα paralogs for incorporation into Mediator. While paralogous Mediator subunits have distinct transcriptional patterns and interacting proteins in other organisms (Tsutsui ; Fukasawa ), the results here suggest that some TLO Med2 copies are dispensable, while others are favored for C. albicans growth, at least under the in vitro passaging conditions used here.A trend toward fixation of the recombination events identified here occurred rapidly. There was no strong evidence in the sequence traces for mixed populations composed of an ancestral and recently invaded subtelomeric population. Furthermore, targeted sequencing of individual colonies following recombination found only a single instance of the ancestral subtelomeric configuration (Figure S2). Additionally, we detected homozygosis of seven recombination events within 160 generations (1 month) of the initial invasion. Consistent with a model of constant recombination between homologous chromosome ends, most new mutations became homozygous following continued passaging (data not shown). Recombination rates are elevated near telomere across many eukaryotes (Linardopoulou ; Rudd ) and homozygosity may be a by-product of constant subtelomeric recombination between sister homologs. However, it is unclear if this phenomenon is a common feature in other organisms, as all previously published experiments studied tractable subtelomeric evolution in haploid organisms (Bopp ).In S. cerevisiae, different classes of subtelomeric genes are amplified in strains grown on different sugar sources (e.g., maltose, sucrose, or melibiose) (Naumov ; Dujon ; Naumova ; Brown ). In this study C. albicans was grown under standard laboratory conditions (30°, rich medium) with and without occasional mild temperature stress. While it is possible that this passaging regime selected for genome evolution and TLO recombination, the C. albicans subtelomeres studied here were under selective pressures very different from, and likely less stressful than, those they encounter in their mammalian hosts. Therefore, the rates of recombination and mutation acting on C. albicans subtelomeres may be different in the mammalian host relative to those in vitro, but the relative amount of recombination that occurs at telomeres is likely higher than at other loci. Recombination frequency differences between specific subtelomeres may be influenced by the presence of recombination hotspots (Blitzblau ) and sequence conservation among subsets of chromosome arms.TLO recombination was most frequent within the C-terminal third of the TLO ORFs, a domain that includes clade-specific sequences. This is consistent with the higher frequency of recombination events among TLO paralogs with greater sequence homology (intraclade vs. interclade recombination). The second most frequent site for recombination between different TLOs was the BTS, which is 3′ (centromere-proximal) to the TLO ORFs and was named the Bermuda Triangle sequence because this sequence is very similar on 11 chromosomes arms and complicates the determination of the chromosome arm location of a given TLO gene (Figure 4B). Neither of these two sites of TLO recombination overlaps with recombination-prone features, such as transposable elements or tandemly repeated sequences, that can promote recombination at other loci (Rehmeyer ; Boothroyd ; Gemayel ).BTS recombination events should result in copying of one gene to a new chromosome arm, without affecting the TLO coding sequence. The benefit of moving a TLO to another chromosome arm is not clear, nor is it clear to what degree different subtelomeric positions affect the level of expression of a given TLO gene. In fact, genes expressed at different TLO loci appear to be affected more by Sir2p-mediated silencing and telomere-associated gene expression noise than by being located at a specific telomere (Anderson ). Importantly, recombination in the BTS accounted for two of three interclade recombination events and may contribute more heavily to major alterations in the TLO repertoire over time compared to recombination at other points. Furthermore, the BTS is specific to C. albicans, which may be connected to the mechanism of TLO amplification in C. albicans. While we did not detect any amplification events or introduction of TLOs to chromosome arms that did not have TLOs or a BTS previously, we presume that such nonhomologous events are relatively rare.In C. albicans, little is known about the subtelomeric recombination sites. Recombination within subtelomeric gene ORFs has been observed for Y′ genes in S. cerevisiae (Louis and Haber 1990a,b) and for the surface antigen ORFs in parasitic protozoa (Jiang ; Bopp ). However, in protozoa, 70-bp sequences flanking subtelomeric surface antigen genes promote intergenic recombination (Boothroyd ). Accordingly, we propose that the 50-bp BTS element may be analogous to the 70-bp motif that promotes homologous recombination between different subtelomeres in Trypanosoma brucei (Boothroyd ).A major mechanism for the construction of novel genes is recombination, as inferred from phylogenetic reconstruction of intergenic recombination events, like those that produced the TLOα9/α12 chimeras (Kraemer ; Kyes ; Sander ). Both of the major sites of TLO recombination encode physically unstable (TAA) repeats, which can promote break-induced replication after DNA replication stalling (Ohshima ; Barry and Mcculloch 2001). The gain and loss of TLO genes and the sequence conservation between chromosome arms at the site of the crossover event strongly suggests that ectopic recombination (Haber 2000) produced most TLO recombination events. The absence of reciprocal exchange and the loss of nucleotide information from the recipient TLO suggest that break-induced replication/crossover or gene conversion is likely the major mechanisms responsible for TLO recombination. Consistent with this, BIR between subtelomeric Y′ elements is a common ALT mechanism in telomere replication and repair in yeasts (Lundblad and Blackburn 1993; Teng ) and leads to elevated mutation rates through error-prone DNA synthesis machinery (Deem ). Although close proximity to the poorly characterized chromosome end makes it difficult to differentiate between gene conversion and BIR/XO, both mechanisms would result in the same general phenomenon: the apparent movement of TLO genes via loss of one TLO gene copy and expansion of another.C. albicans and C. dubliniensis diverged ∼20 MYA and C. dubliniensis has only two TLO genes. Thus, the TLO gene family expansion appears to have been a recent event in C. albicans. The expansion of gene families at chromosome ends is associated with a high prevalence of pseudogene fragments scattered across subtelomeric regions (Louis and Haber 1992; Gardner ; Marcello and Barry 2007). C. albicans has only a single TLO pseudogene (Anderson ), which appears to have been produced through insertion of a long terminal repeat retrotransposon, rather than by recombination or cumulative mutation (Anderson ). The lack of extensive, degenerate TLO copies is consistent with the recent origin of the TLO expansion but could also be due to the acquisition of nonredundant functions among TLO paralogs. It is tempting to speculate that the bias in expansion and contraction of some family members reflects specialized roles for individual TLO genes in shaping the Mediator activity within the cell. This view is consistent with previous work that subtelomeric evolution facilitates adaptation to new environmental conditions in other eukaryotes, including other yeasts (Carreto ; Chuma ; Haran ). It also suggests that analysis of TLO changes in strains isolated from different host niches may offer insights into the functional specialization of specific TLO genes.
Authors: Matthew P Hirakawa; Diego A Martinez; Sharadha Sakthikumar; Matthew Z Anderson; Aaron Berlin; Sharvari Gujja; Qiandong Zeng; Ethan Zisson; Joshua M Wang; Joshua M Greenberg; Judith Berman; Richard J Bennett; Christina A Cuomo Journal: Genome Res Date: 2014-12-11 Impact factor: 9.043
Authors: John Haran; Hannah Boyle; Karsten Hokamp; Tim Yeomans; Zhongle Liu; Michael Church; Alastair B Fleming; Matthew Z Anderson; Judith Berman; Lawrence C Myers; Derek J Sullivan; Gary P Moran Journal: PLoS Genet Date: 2014-10-30 Impact factor: 5.917
Authors: Jia-Xing Yue; Jing Li; Louise Aigrain; Johan Hallin; Karl Persson; Karen Oliver; Anders Bergström; Paul Coupland; Jonas Warringer; Marco Cosentino Lagomarsino; Gilles Fischer; Richard Durbin; Gianni Liti Journal: Nat Genet Date: 2017-04-17 Impact factor: 38.330
Authors: Peter R Flanagan; Jessica Fletcher; Hannah Boyle; Razvan Sulea; Gary P Moran; Derek J Sullivan Journal: PLoS One Date: 2018-07-20 Impact factor: 3.240
Authors: Iuliana V Ene; Rhys A Farrer; Matthew P Hirakawa; Kennedy Agwamba; Christina A Cuomo; Richard J Bennett Journal: Proc Natl Acad Sci U S A Date: 2018-08-27 Impact factor: 11.205
Authors: Nick Brouwers; Arthur R Gorter de Vries; Marcel van den Broek; Susan M Weening; Tom D Elink Schuurman; Niels G A Kuijpers; Jack T Pronk; Jean-Marc G Daran Journal: PLoS Genet Date: 2019-04-04 Impact factor: 5.917
Authors: Matthew J Dunn; Griffin M Kinney; Pamela M Washington; Judith Berman; Matthew Z Anderson Journal: PLoS Genet Date: 2018-04-09 Impact factor: 5.917