Literature DB >> 35042790

Rapid radiation in a highly diverse marine environment.

Kosmas Hench1, Martin Helmkampf2, W Owen McMillan3, Oscar Puebla1,3,4,5.   

Abstract

Rapid diversification is often observed when founding species invade isolated or newly formed habitats that provide ecological opportunity for adaptive radiation. However, most of the Earth's diversity arose in diverse environments where ecological opportunities appear to be more constrained. Here, we present a striking example of a rapid radiation in a highly diverse marine habitat. The hamlets, a group of reef fishes from the wider Caribbean, have radiated into a stunning diversity of color patterns but show low divergence across other ecological axes. Although the hamlet lineage is ∼26 My old, the radiation appears to have occurred within the last 10,000 generations in a burst of diversification that ranks among the fastest in fishes. As such, the hamlets provide a compelling backdrop to uncover the genomic elements associated with phenotypic diversification and an excellent opportunity to build a broader comparative framework for understanding the drivers of adaptive radiation. The analysis of 170 genomes suggests that color pattern diversity is generated by different combinations of alleles at a few large-effect loci. Such a modular genomic architecture of diversification has been documented before in Heliconius butterflies, capuchino finches, and munia finches, three other tropical radiations that took place in highly diverse and complex environments. The hamlet radiation also occurred in a context of high effective population size, which is typical of marine populations. This allows for the accumulation of new variants through mutation and the retention of ancestral genetic variation, both of which appear to be important in this radiation.
Copyright © 2022 the Author(s). Published by PNAS.

Entities:  

Keywords:  Hypoplectrus; adaptive radiation; genomic architecture; marine; reef fishes

Mesh:

Year:  2022        PMID: 35042790      PMCID: PMC8794831          DOI: 10.1073/pnas.2020457119

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   12.779


Adaptive radiation is driven by ecological opportunity, whereby newly accessible niches provide potential for diversification (1). This process often takes place in geographically isolated and/or newly formed habitats such as lakes or oceanic islands, where founding species are exposed to depauperate environments in which competition is relaxed. This is, for example, the case in exemplary adaptive radiations, like Darwin’s finches (2), East African cichlids (3, 4), and Caribbean Anolis lizards (5). Nevertheless, these radiations represent extremes of an evolutionary process that may also operate in diverse and complex habitats, where the relationship between biotic and abiotic drivers and rapid speciation is often more subtle. Unfortunately, with the notable exception of Heliconius butterflies (6), these radiations remain poorly explored despite the fact that they occur in environments that contain most of the diversity on Earth. As a result, we lack a broader comparative framework for identifying and generally understanding the main drivers of adaptive radiation. The hamlets (Hypoplectrus spp., Serranidae), a group of reef fishes from the wider Caribbean, present an excellent opportunity to investigate the genomic basis of radiation in diverse and complex environments where most niches are already occupied. Typical of marine species (7), most hamlets are characterized by extensive geographic ranges, large population sizes, high fecundity, and high potential for dispersal through a 3-wk pelagic larval stage (8–10). These characteristics are expected to shape their evolutionary potential in complex ways, possibly reducing the opportunity for speciation (11, 12). Against this backdrop, the genus diversified into at least 18 species that are highly sympatric (9, 13) and very similar in terms of habitat and diet (14, 15). They differ essentially in color pattern, a trait that is thought to be ecologically relevant through crypsis and mimicry (16–20). According to the aggressive mimicry hypothesis, some hamlet species (the mimics) achieve higher preying success by resembling other fishes from different families (the models) that are harmless to the hamlets’ prey. In agreement with this hypothesis, behavioral differences in resource acquisition have been documented between hamlet species (19, 20). Yet, resemblance between models and mimics is imperfect, and some species appear to be neither cryptic nor mimetic. Color pattern is an important cue for mate choice, and hamlets show strong assortative mating both in the field and in the laboratory (10, 13, 18, 19, 21). In addition, hamlets are simultaneously hermaphroditic, and mate choice is mutual (8, 22). This particular mating system results in complex pairing dynamics among individuals that can contribute to diversification by generating strong sexual selection (13, 23). Despite strong assortative mating, interspecific spawnings are occasionally observed in natural populations (10, 13, 18, 19, 21). Fertilization is external, and eggs are planktonic. The available evidence suggests that there are no barriers to fertilization among species (24) and that gene flow is ongoing (25). Genetically, the Caribbean hamlets are very closely related. Similar to East African cichlids from Lake Victoria (4), they show generally low levels of genetic divergence (13, 21, 25–27) and do not sort into distinct mitochondrial haplogroups (28–30). A chromosome-resolution reference genome is now available for the group (25), which provides the foundation to more fully explore the genomic basis of this radiation. Here, we examine the genomes of 170 individuals from 28 pairs of sympatric species to 1) uncover the genomic architecture of the radiation, 2) identify genomic regions associated with major phenotypic differences, and 3) ask how the hamlet radiation compares with other rapid radiations.

Results

Rapid Radiation, Old Lineage

We started by considering the hamlet radiation in the context of the phylogeny of the subfamily Serraninae and the Fish Tree of Life (FToL) estimates of evolutionary rates (31). This broad phylogenetic perspective indicates that the hamlets exhibit speciation rates that are among the highest in fishes (mean clade-specific speciation rate: 2.44) (). Within the Serraninae, a single burst of diversification was identified at the base of the hamlet radiation (Fig. 1 and ). No other serranine lineage radiated rapidly (subfamily background speciation rate excluding hamlets: 0.11). Notably, this includes the chalk bass (Serranus tortugarum) and the tobaccofish (Serranus tabacarius), two species that are closely related to the hamlets and have a similar mating system (33). The two clades that are simultaneously hermaphroditic (marked by a star in Fig. 1 and ) do not present an increase in diversification rate either. Finally, the time-calibrated FToL phylogeny illustrates that although the extant hamlet species appear to be very young, the Hypoplectrus lineage last shared a common ancestor with other species in the subfamily ∼26 Mya ().
Fig. 1.

Phylogenetic context and population genetic patterns. (A) Maximum likelihood phylogeny of the Serraninae subfamily based on 23 nuclear and mitochondrial genes. The hamlet radiation is highlighted in gray, and species considered in this study are marked with an asterisk. Gene sequences for the other species were obtained from the FToL project (31). Stars denote clades that are simultaneously hermaphroditic following ref. 32 (note that the mating system of Serranus accraensis is not known). Ch., Chelidoperca; Cp., Centropristis; D., Diplectrum; E., Epinephelus; H., Hypoplectrus; Pa., Paralabrax; Pl., Plectranthias; Sc., Schultzea; Se., Serranus. (B) Genetic differentiation (F) between pairs of sympatric hamlets. The 28 pairs are numbered and presented in order of increasing genome-wide differentiation, and they are color coded with respect to location (red, Belize; blue, Honduras; green, Panama). The pairs that are significantly differentiated at the α = 0.05 level are highlighted with filled bars. (C–E) Principal component analysis (PCA) for each location. The percentage of variation explained by each principal component (PC) is shown. The PCAs were repeated with the highly diverged genomic regions excluded and produced very similar results (), indicating that the clustering patterns are not driven by these regions.

Phylogenetic context and population genetic patterns. (A) Maximum likelihood phylogeny of the Serraninae subfamily based on 23 nuclear and mitochondrial genes. The hamlet radiation is highlighted in gray, and species considered in this study are marked with an asterisk. Gene sequences for the other species were obtained from the FToL project (31). Stars denote clades that are simultaneously hermaphroditic following ref. 32 (note that the mating system of Serranus accraensis is not known). Ch., Chelidoperca; Cp., Centropristis; D., Diplectrum; E., Epinephelus; H., Hypoplectrus; Pa., Paralabrax; Pl., Plectranthias; Sc., Schultzea; Se., Serranus. (B) Genetic differentiation (F) between pairs of sympatric hamlets. The 28 pairs are numbered and presented in order of increasing genome-wide differentiation, and they are color coded with respect to location (red, Belize; blue, Honduras; green, Panama). The pairs that are significantly differentiated at the α = 0.05 level are highlighted with filled bars. (C–E) Principal component analysis (PCA) for each location. The percentage of variation explained by each principal component (PC) is shown. The PCAs were repeated with the highly diverged genomic regions excluded and produced very similar results (), indicating that the clustering patterns are not driven by these regions.

Population Genetic Patterns

Given the low levels of genetic divergence among hamlets, we first used our genomic data from Belize, Honduras, and Panama to examine to what extent sympatric hamlets represent genetic clusters. Principal component analysis indicates that this is generally the case, albeit with varying degrees of overlap between species (Fig. 1 ). We next grouped samples from each species and location together to examine patterns of genome-wide differentiation (F) between pairs of sympatric species. This analysis reveals a continuous range of differentiation, from ¡0.003 to 0.1 (Fig. 1 and ). It is important to stress that this differentiation continuum does not constitute a temporal sequence of speciation, as it includes different species from different locations. Moreover, the barred, black, and butter hamlets are overrepresented in this dataset since they were sampled at the three locations. Nevertheless, levels of genetic differentiation among populations within species are within the range of differentiation between sympatric species (). From a population genetic perspective, populations of the same species can, therefore, be as distinct as different species (34, 35).

Genomic Architecture of Radiation

We then divided the genome into 50-kb windows to reveal how the genomic architecture of the hamlet radiation unfolds across the differentiation continuum. The radiation is characterized by a small number of sharp peaks [so-called “islands” (36)] of differentiation that do not expand with increasing genome-wide differentiation (Fig. 2 and ). Their number does not increase substantially either, with 6 peaks with F > 0.7 per species pair throughout the entire continuum. It is also noteworthy that at the lower end of the differentiation continuum, differentiation is largely independent from recombination rate (, row 1). In this respect, our results contrast with previous studies conducted across higher levels of differentiation that report a marked effect of recombination on differentiation (37–41). We nonetheless capture the onset of the effect of recombination, with differentiation accumulating disproportionately in regions of low recombination as genome-wide differentiation increases (). This effect is particularly strong in a large section of linkage group (LG) (chromosome) 8, which we had previously identified as a low-recombining region (25, 42) (Fig. 2). In contrast to genetic differentiation (F, which captures allele frequency differences), the genomic architecture of divergence (d which captures sequence divergence) is similar across all species pairs (). Divergence is generally elevated in the chromosome peripheries where recombination rate tends to be higher (). This is likely an effect of the recombination landscape that shaped ancestral variation, resulting in higher diversity in regions of high recombination (). In order to filter out the effect of ancestral diversity from the divergence signal, we considered the range of divergence among species pairs, which we defined as among the 28 pairs. This statistic varied markedly along the genome (), paralleling in part patterns of genetic differentiation (Fig. 2) and indicating heterogeneous selection among species at these loci. Divergence between species is also strongly correlated with nucleotide diversity (π) within species (), which is in line with the theoretical expectation that dapproaches ancestral diversity when divergence is very recent (43).
Fig. 2.

Whole-genome patterns. The alternating white and gray blocks represent the 24 LGs (chromosomes). All statistics are calculated over 50-kb sliding windows with 5-kb increments unless stated otherwise. (A) Joint differentiation (F) among the 14 groups of samples (species within locations). The red vertical lines highlight regions above the 99.8th F percentile (). Letters A, B, and C correspond to the three genomic regions that are highlighted in Fig. 5. (B and C) Topology weighting for Belize and Honduras, respectively, along nonoverlapping 200 SNP windows. The different colors correspond to different topologies, and the white horizontal lines indicate the null weighting (i.e., all topologies equally likely). (D–F) G × P association for bars, dark saddle on the caudal peduncle, and spot on the snout, respectively. An expanded version of this figure is presented in .

Whole-genome patterns. The alternating white and gray blocks represent the 24 LGs (chromosomes). All statistics are calculated over 50-kb sliding windows with 5-kb increments unless stated otherwise. (A) Joint differentiation (F) among the 14 groups of samples (species within locations). The red vertical lines highlight regions above the 99.8th F percentile (). Letters A, B, and C correspond to the three genomic regions that are highlighted in Fig. 5. (B and C) Topology weighting for Belize and Honduras, respectively, along nonoverlapping 200 SNP windows. The different colors correspond to different topologies, and the white horizontal lines indicate the null weighting (i.e., all topologies equally likely). (D–F) G × P association for bars, dark saddle on the caudal peduncle, and spot on the snout, respectively. An expanded version of this figure is presented in .
Fig. 5.

Close-up views of three genomic regions of interest. A–C correspond to the A, B, and C regions, respectively, highlighted in Fig. 2. The x axis shows the position on the respective LG (in megabases), with regions above the 99.8th F percentile highlighted in light gray. The first five subpanels correspond (from top to bottom) to the gene annotation, the log-transformed P value of the G × P association, genetic differentiation (F), genetic divergence (d, and [ among the 28 pairs of sympatric species] color coded by genome-wide F. All these statistics are calculated in 50-kb sliding windows with 5-kb increments. Note that the y scale varies between panels for G × P association. The next three subpanels show the topology weighting for Belize, with particular sets of topologies highlighted. These correspond to the topologies in which the two blue species (H. indigo and H. maya) are sister species (blue), in which the two species with vertical bars (H. indigo and H. puella) are grouped together (red), and in which H. unicolor is sister to all other species (yellow). The last three subpanels show population-level phylogenetic trees of the regions above the 99.8th F percentile highlighted in light gray. The first three letters of each sample indicate the species (Hypoplectrus aberrans, abe; H. floridae, flo; H. gummigutta, gum; H. indigo, ind; H. maya, may; Hypoplectrus nigricans, nig; H. puella, pue; H. randallorum, ran; and H. unicolor, uni), and the last three letters the location (Belize, bel; Honduras, hon; Panama, pan; and Florida, flo).

Recent Radiation

In order to estimate the age of the hamlet radiation, we inferred the demographic histories of all species in all locations using the multiple sequentially Markovian coalescent (MSMC2) (44). The cross-coalescence patterns suggest that hamlets diverged within the last 10,000 generations (Fig. 3). These results are consistent with previous analyses using a different approach that does not rely on phasing (45). Alternatively, gene flow among sympatric species may have obliterated the signature of older demographic events, which could be complex and involve several cycles of “fission–fusion–fission” (46). Nevertheless, reconstructing such events is challenging at the low levels of divergence that characterize the Caribbean hamlets. The MSMC2 analyses also indicate that ancestral effective population sizes remained high, in the order of (Fig. 3).
Fig. 3.

Demographic inference. (A) Inferred history of effective population size. Each line is based on three to four genomes per group (species within locations). (B) Cross-coalescence rates for the 28 pairs of sympatric species color coded by genome-wide F. Each line represents an independent run based on two genomes from two sympatric species (total of four). A cross-coalescence rate of one indicates completely shared ancestry, and a rate of zero indicates no shared ancestry. All estimates are scaled with a per site mutation rate . The most ancient and the two most recent time segments are omitted due to unreliable inference at these extremes.

Demographic inference. (A) Inferred history of effective population size. Each line is based on three to four genomes per group (species within locations). (B) Cross-coalescence rates for the 28 pairs of sympatric species color coded by genome-wide F. Each line represents an independent run based on two genomes from two sympatric species (total of four). A cross-coalescence rate of one indicates completely shared ancestry, and a rate of zero indicates no shared ancestry. All estimates are scaled with a per site mutation rate . The most ancient and the two most recent time segments are omitted due to unreliable inference at these extremes.

Ongoing Gene Flow and Introgression

The overall landscape of genomic differentiation suggests that it is shaped in part by gene flow and introgression. In order to test for ongoing gene flow, we searched for genetic hybrids and backcrosses in our dataset. A total of 12 high-probability hybrids and backcrosses were identified (), representing 7% of the individuals analyzed. This proportion is likely an underestimate since our sampling design explicitly excluded individuals with intermediate color patterns. It is also substantially higher than the 2% interspecific spawnings that were observed at the three locations (13, 19). However, behavioral observations indicate that hybridization rate can vary dramatically in the hamlets depending on the social context of individuals at the time of spawning (13). In order to test for past introgression events, we calculated D statistics for all possible trios of species/populations using Hypoplectrus floridae—which belongs to the Gulf of Mexico hamlet clade (47)—as the out-group. In the most treelike configuration (BBAA), a total of 69 trios show evidence of introgression, representing 19% of the 364 trios that we tested. This proportion is slightly higher than the 14% reported in East African cichlids from Lake Victoria (4). Importantly, all species/populations and 44% of all pairs show evidence of introgression (), indicating that it is pervasive across the radiation.

Weak Background Phylogenetic Signal

The low levels of divergence, ongoing gene flow, and history of extensive introgression observed in the hamlets are expected to obscure—possibly even rewrite—the phylogenetic relationships among species and populations. As a result, the genome-wide phylogeny of the radiation is not well resolved, particularly with respect to the deeper nodes of the tree (Fig. 4). The reconstruction only recovers the most differentiated species (Hypoplectrus indigo, Hypoplectrus gummigutta, and Hypoplectrus maya) as monophyletic. High levels of discordance among genomic regions suggest that lack of divergence and nontree-like evolutionary processes have prevented the buildup of sufficient phylogenetic signal to resolve the relationships among the other species. While most deeper splits thus remain ambiguous, the analysis implies that several species are polyphyletic, including Hypoplectrus randallorum and the widely distributed barred (Hypoplectrus puella) and butter (Hypoplectrus unicolor) hamlets. These patterns are also reflected in a network representation of the genomic data based on fragments of identity by descent (IBD) (Fig. 4 and ). Here, phylogenetically well-resolved clades form tightly linked clusters, whereas unsupported groups appear diffuse and weakly connected. Several subnetworks across species boundaries also emerge from the network, consistent with a history of rapid divergence and introgression.
Fig. 4.

Genome-wide phylogeny and IBD network. (A) Coalescent-based species tree of the Caribbean hamlet samples as inferred from 5,000 local trees of 5-kb windows each randomly distributed throughout the genome. Internal branches are measured in coalescent units, thus reflecting concordance among local trees. Note that terminal branch lengths were set to an arbitrary constant. Support values at the nodes are given as local posterior probabilities. The tree is rooted with H. floridae from Florida, a member of the Gulf of Mexico clade. A version rooted with Serranus is presented in . (B) IBD network considering IBD fragments larger than 5 and 10 × 103 consecutive SNPs for one and two shared haplotypes, respectively, and filtered for a minimum length of 0.2 cM. The distribution of IBD fragments along the genome and the networks for longer IBD fragments are presented in . Both analyses (phylogeny and IBD network) confirm that the most differentiated species (H. indigo, H. gummigutta, and H. maya) form tight genetic clusters, but they also indicate that low levels of divergence, ongoing gene flow, and pervasive introgression obscure the phylogenetic relationships among the other clades (note the low support values and short branches in the deeper nodes of the tree).

Genome-wide phylogeny and IBD network. (A) Coalescent-based species tree of the Caribbean hamlet samples as inferred from 5,000 local trees of 5-kb windows each randomly distributed throughout the genome. Internal branches are measured in coalescent units, thus reflecting concordance among local trees. Note that terminal branch lengths were set to an arbitrary constant. Support values at the nodes are given as local posterior probabilities. The tree is rooted with H. floridae from Florida, a member of the Gulf of Mexico clade. A version rooted with Serranus is presented in . (B) IBD network considering IBD fragments larger than 5 and 10 × 103 consecutive SNPs for one and two shared haplotypes, respectively, and filtered for a minimum length of 0.2 cM. The distribution of IBD fragments along the genome and the networks for longer IBD fragments are presented in . Both analyses (phylogeny and IBD network) confirm that the most differentiated species (H. indigo, H. gummigutta, and H. maya) form tight genetic clusters, but they also indicate that low levels of divergence, ongoing gene flow, and pervasive introgression obscure the phylogenetic relationships among the other clades (note the low support values and short branches in the deeper nodes of the tree).

Genetic Basis of Phenotypic Diversification

The weak phylogenetic signal in the hamlets provides the context to localize regions of the genome that are likely responsible for phenotypic differences among species. The underlying logic is that functionally important regions should group individuals by phenotype. We used topology weighting by iterative sampling of subtrees [Twisst (48)] to dissect the phylogenetic signal along the genome. Briefly, this method considers sliding-windows phylogenies along the genome and weights the contribution of each possible taxon topology to the full tree. As expected given the weak phylogenetic signal in the hamlets and the small size of the windows considered (200 single-nucleotide polymorphism [SNPs]), this approach failed to identify a leading taxon topology at most windows throughout the genome. Nevertheless, it revealed a number of sharp topology weighting peaks (Fig. 2 ). For example, in Belize, the topologies in which H. indigo and H. maya—the two blue hamlet species—are sister species dominate a narrow region on LG 4 (Fig. 5), while two other regions on LG 12 are dominated by topologies grouping the two hamlet species that display vertical bars (H. indigo and H. puella) (Fig. 5 ). These patterns are also visible in phylogenies based on the entire contiguous sequence of these regions (). Close-up views of three genomic regions of interest. A–C correspond to the A, B, and C regions, respectively, highlighted in Fig. 2. The x axis shows the position on the respective LG (in megabases), with regions above the 99.8th F percentile highlighted in light gray. The first five subpanels correspond (from top to bottom) to the gene annotation, the log-transformed P value of the G × P association, genetic differentiation (F), genetic divergence (d, and [ among the 28 pairs of sympatric species] color coded by genome-wide F. All these statistics are calculated in 50-kb sliding windows with 5-kb increments. Note that the y scale varies between panels for G × P association. The next three subpanels show the topology weighting for Belize, with particular sets of topologies highlighted. These correspond to the topologies in which the two blue species (H. indigo and H. maya) are sister species (blue), in which the two species with vertical bars (H. indigo and H. puella) are grouped together (red), and in which H. unicolor is sister to all other species (yellow). The last three subpanels show population-level phylogenetic trees of the regions above the 99.8th F percentile highlighted in light gray. The first three letters of each sample indicate the species (Hypoplectrus aberrans, abe; H. floridae, flo; H. gummigutta, gum; H. indigo, ind; H. maya, may; Hypoplectrus nigricans, nig; H. puella, pue; H. randallorum, ran; and H. unicolor, uni), and the last three letters the location (Belize, bel; Honduras, hon; Panama, pan; and Florida, flo). In order to further explore the association between genetic variation and specific components of color pattern, we scored all fishes for the presence or absence of vertical bars, saddle mark on the caudal peduncle, and spot on the snout (). These traits were chosen because they are polymorphic and can be scored unambiguously. Genotype × phenotype (G × P) association analysis revealed a strong association between the presence or absence of vertical bars and genetic variation in a narrow genomic interval on LG 12 (Fig. 2). Associations with the other two traits were more diffuse, but here again, association peaks emerged, notably on LG 12 for the saddle mark and on LG 4 for the snout spot (Fig. 2 ). Clustering patterns at these genomic regions () are consistent with the topology weighting, phylogenetic, and G × P analyses. The genomic regions identified by topology weighting and G × P analyses also match regions of high differentiation (F) among species (Figs. 2 and 5). Altogether, these analyses allow us to start dissecting the genetic variation associated with phenotypic diversity in the hamlets. The strongest signal was observed in a narrow region of LG 12 that shows a strong association with the presence or absence of vertical bars (letter B in Fig. 2). The 13 pairs of sympatric species that include one species with vertical bars and one without present high differentiation at this locus, while the 15 species pairs that include two species with bars or two species without bars do not (Fig. 5). In line with this pattern, the region is dominated by topologies in which the two hamlets with vertical bars are sister species. This locus, which we had previously identified (25), is centered on casz1 (Fig. 5). This gene encodes a castor zinc finger transcription factor that is involved in a number of processes through development. Our data also provide insights into the hoxca gene cluster on LG 12. In line with our preliminary analyses (25), we observe a strong association between variation at the hoxc13a locus and the presence or absence of a saddle mark on the caudal peduncle (Fig. 5), which is characteristic of the butter hamlet (H. unicolor). The pairs of sympatric species that are most differentiated at this locus include H. unicolor, and the region is dominated by topologies that single out the butter hamlet. In addition, we observe a secondary association with vertical bars between hoxc8a and hoxc11a. Hox genes play an important role in patterning tissues along the body axis and have been shown to be involved in color pattern development in insects (49, 50) and vertebrates (51). They are arranged and expressed in a sequence that follows the body axis, with 3 genes expressed anteriorly and 5 genes posteriorly (52). This pattern is consistent with our results as hoxc13a is the most 5 gene of the hoxca cluster, the saddle on the caudal peduncle is the most posterior mark in the hamlets, and hoxc13a is known to be expressed in the caudal peduncle and at the pigment appearance stage in fishes (53, 54). Vertical bars, on the other hand, are anterior to the saddle mark just as the hoxc8-11a gene is on the 3 side of hoxc13a. A list of the genes found in the 18 genomic regions above the 99.8th F percentile is presented in and includes a number of genes that are involved in pigmentation/skin development (e.g., tmem79, mafb, kitlg) and vision/photoreceptor development (e.g., grk7a, rab8a, slc12a5). Functional analyses are needed to test the role played by these candidate loci in vision and pigmentation in the hamlets.

Origin of Genomic Variants

The old age (Fig. 1) and high effective population size of the Hypoplectrus lineage (Fig. 3) suggest that ancestral variation may have played an important role in the hamlet radiation. In order to test this hypothesis, we estimated the age of the genomic variants within our three regions of interest. We used a nonparametric approach that combines probability distributions of the time to the most recent common ancestor of a large number of haplotypes, allowing us to estimate the age of genomic variants over a continuous timescale and without relying on a priori assumptions about demography or selection (55). We reasoned that the SNPs that are most strongly associated with the three phenotypic traits that we scored (spot on the snout, vertical bars, and saddle on the caudal peduncle) are the ones that are most likely to have a functional role and compared the strength of G × P association with the estimated age of the derived allele for all the SNPs within our three candidate genomic regions. The results indicate that the majority of genomic variants that are strongly associated with each trait predate the radiation (i.e., are older than 10,000 generations), although a smaller number of younger variants also show strong associations (Fig. 6).
Fig. 6.

Relationship between G × P association and derived allele age in the three candidate regions. The three columns correspond to the three traits considered (vertical bars, spot on the snout, and saddle on the caudal peduncle), and the three rows correspond to the three candidate regions identified in Fig. 2. The highlighted panels correspond to the candidate interval for each trait. The color scale indicates the degree of overplotting of SNPs. Note the logarithmic scale on both axes and the different scales on the y axis. The other candidate regions are presented in .

Relationship between G × P association and derived allele age in the three candidate regions. The three columns correspond to the three traits considered (vertical bars, spot on the snout, and saddle on the caudal peduncle), and the three rows correspond to the three candidate regions identified in Fig. 2. The highlighted panels correspond to the candidate interval for each trait. The color scale indicates the degree of overplotting of SNPs. Note the logarithmic scale on both axes and the different scales on the y axis. The other candidate regions are presented in .

Discussion

Hamlets live on coral reefs, a highly diverse and complex environment. They feed on small invertebrates and fishes (14, 56) but are not particularly specialized, and their 3-wk pelagic larval phase (10) provides potential for long-distance dispersal. Within this ecological backdrop, the hamlets show an extraordinary burst of speciation that is on par with radiations occurring in depauperate and geographically isolated environments. Moreover, our genomic data suggest that diversification happened very recently and in a backdrop of substantial gene flow and introgression, resulting in a genetic architecture that is characterized by sharp peaks of differentiation. These peaks contain genes that are known to be involved in vision and pigmentation in other groups and suggest a modular architecture of color pattern variation in the group. Here, the term modular [or combinatorial (46)] refers to the observation that major patterning elements (e.g., vertical bars) and at least one color [black (25)] are associated with one or a few genomic regions and that different species present different combinations of alleles at these loci. There is much more to be explored about the relationship between genomic and phenotypic variation in the hamlets, but such a modular architecture is similar to that observed in Heliconius butterflies (57), capuchino finches (58), and munia finches (59). These radiations also took place in diverse and complex environments and together with hamlets, provide the context for a more general understanding of the drivers of rapid speciation. Color pattern is an important ecological trait that is under strong selection in tropical butterflies and finches (57–59). In the hamlets, the levels of hybridization and gene flow revealed by behavioral (10, 13, 18, 19, 21) and genetic (this study) analyses suggest that the phenotypic differences that define species should break down unless other factors prevent their homogenization. They do not, and hamlet species maintain their genetic and phenotypic integrity in sympatry. The role that particular color patterns play in survival and reproductive success remains a focus of study. The hamlets compete with a number of other predatory fishes on the reef (17) and are themselves prey to larger visual predators, such as groupers. In this context, any variation in color pattern that contributes to improving their hunting efficiency or survival is expected to be strongly selected for. Several hamlets have been suggested to be aggressive mimics (16, 17, 19, 20). From this perspective, ecological opportunity for radiation is not provided by a depauperate environment but on the contrary, is provided by strong competition and a high diversity of potential model species for aggressive mimicry. In addition, some pattern elements may play a role in camouflage, allowing individuals to both get closer to prey and avoid predation. For example, the vertical bars of the barred and indigo hamlets have been suggested to be cryptic in specific reef environments (17, 18). Cryptic color patterns are probably ancestral in the hamlets since these patterns are also observed in their closest relatives and in the sea basses (the Serranidae family) more generally. However, some hamlets have “almost gaudy” (17) color patterns that appear neither mimetic nor cryptic and might be more the result of sexual than natural selection. In this respect, the simultaneously hermaphroditic mating system of the hamlets can provide a strong source of sexual selection that is expected to facilitate the evolution of assortative mating among color morphs (13). Nevertheless, mating system alone cannot explain the hamlet radiation since no other serranine lineage that is also simultaneously hermaphroditic radiated explosively (Fig. 1). In this light, our data also suggest that the hamlet radiation may have been further catalyzed by a tight coupling of loci involved in patterning and vision. The strongest and clearest signal in our data is the association between the presence/absence of vertical bars and genetic variation at the casz1 locus. This transcription factor has been shown to be involved in the development of photoreceptors in mice (60, 61), and a role in vision is also likely in the hamlets since casz1 is strongly and consistently expressed in the retinal tissue (25). The strong association with vertical bars that we report here suggests that casz1, or a locus in close proximity, might also be involved in patterning. This is similar to what has been observed in Heliconius butterflies, a system where strong natural and sexual selection acts on wing color patterns (62). The genetic basis of pattern variation is well characterized in this group, and recent work is demonstrating a tight physical linkage between loci responsible for color pattern variation and preferences for this variation (63). The possibility of physical linkage between patterning and sensory loci in hamlets is significant as it would both facilitate the evolution of reproductive isolation through visually-based assortative mating and help maintain it in the face of ongoing gene flow. In any event, the occurrence of genes that are known to be involved in vision and pigmentation in peaks of differentiation is broadly in line with the genic view of speciation (64, 65), whereby differentiation between species is initially restricted to genes that are involved in reproductive isolation. Nevertheless, these highly differentiated regions do not expand with increasing genome-wide differentiation as predicted by this framework. This parallels what has been observed in Heliconius butterflies (66) and Ficedula flycatchers (38) and indicates that divergence hitchhiking does not play the prominent role in the buildup of genomic differences that is implied by the genic view of speciation in these groups. In the hamlets, this finding is consistent with the rapid decay of linkage disequilibrium along chromosomes (25, 45). While marine systems are generally more open than terrestrial and freshwater habitats due to the scarcity of geographic barriers and the high mobility of many species (7), the hamlets remain nevertheless geographically constrained to the wider Caribbean. Within this region, they are essentially restricted to coral reefs, a highly discrete and patchy habitat. Their pelagic larval duration of 3 wk (10) is relatively short for a reef fish, and larval dispersal has been shown to be more restricted than previously thought in reef fishes generally (67) and in the hamlets in particular (68, 69). This explains the occurrence of local evolutionary processes (e.g., different levels of genetic clustering, differentiation, and hybridization) at our three study sites that are separated by just a few hundred kilometers. The hamlet lineage is also characterized by a history of high effective population size, which is typical of marine populations (11, 12, 70). This allows for both the accumulation of new genetic variants through mutation and the retention of ancestral genetic variants. While both ancestral and new variants are strongly associated with color pattern components, the majority of these variants predate the variation, pointing to an important role of ancestral variation. We posit that the rapid hamlet radiation is driven by strong selection on color pattern, a modular genetic architecture for this trait, and a mating system that is conducive to the evolution of reproductive isolation among color morphs. Here, the historically high effective population size of hamlets provides a rich genomic substrate from which hybridization can rapidly assemble new phenotypic variation. This attribute is emerging as the common denominator to a variety of radiations on land and in the sea.

Materials and Methods

Software Versions, Parameter Settings, and Scripts

Software versions and parameter settings were omitted from the text for readability; software versions are listed in . Data analysis was managed using nextflow (71). The workflows used to produce our results from raw data to figures are provided in the accompanying repository [accessible from the accompanying repository (72) and the documentation therein; hereafter git].

Sequencing

This study is based on a total of 170 genomes obtained from 167 hamlets and three out-group samples (2×S. tortugarum and 1× S. tabacarius). Fifty genomes are new to this study, 110 are from ref. 25, and 10 are from ref. 45. All new tissue samples were available from previous studies (13, 19), except for sample #28393, which was collected in 2017 in Bocas del Toro (Panama) under the Smithsonian Tropical Research Institute Institutional Animal Care and Use Committee protocol 2017-0101-2020-2, the Panamanian Ministry of Environment permits SC/A-53-16 and SEX/A-35-17, and the Access and Benefit-Sharing Clearing-House identifier ABSCH-IRCC-PA-241203-1. Genomic DNA was extracted from gill tissue using Qiagen MagAttract High Molecular Weight kits. Libraries were prepared and sequenced by Novogene and the Institute of Clinical Molecular Biology (Kiel University) on an Illumina HiSeq 4000 (PE; 2 × 151) to a mean postfiltering sequencing depth of 17×.

Variant Calling

All the samples considered in this study were genotyped jointly and anew. The variant calling procedure was adapted from the best practice recommendations for the Genome Analysis Toolkit (GATK) workflow (73) provided by the Broad Institute (74, 75). The general workflow is presented below, and the exact parameters used for each step are provided in git 1.1 to 1.17 and 2.1 to 2.7. GATK was used to transform the sequences from fq to uBAM format, assign read groups, and mark adapters (git 1.2 to 1.4). The sequences were then back transformed to fq format using GATK, mapped to the hamlet reference genome using BWA (76), and merged with the uBAM files containing the read group information with GATK (git 1.5). Duplicated reads were removed (git 1.6), and genotype likelihoods were called for each individual (git 1.9) and then merged for all samples (git 1.10). All individuals were then genotyped jointly on the basis of the genotype likelihoods from all samples. This step was duplicated to create two versions of the dataset (git 1.11 and 2.4): a lightweight version with variant sites only (SNPs only; git 1.10) and a full version including every callable site—even invariant ones—to calculate π and d(all base pairs [BP]; git 2.4). SNPs were extracted from the raw genotypes and hard filtered with respect to quality and missing data (git 1.14 and 2.6 to 2.7). The SNPs only dataset was also filtered for a minor allele count 2 and reduced to biallelic SNPs only using VCFtools (77) (git 1.14). In preparation of the phasing, the SNPs only dataset was subset by LG, and phase-informative reads were extracted based on the original alignments and the SNPs (git 1.15). Finally, genotypes were phased with SHAPEIT (78) (git 1.16 to 1.17). Bioinformatic phasing is notoriously difficult when it relies on population genetic data only (without, e.g., parent–offspring trios or linked reads). In order to mitigate this issue, we used the read-aware phasing approach implemented in SHAPEIT, which takes the phase information contained within the raw sequencing data into account. Demographic inference (see below) is the only analysis that relies on phase information.

Serraninae Phylogeny and Speciation Rates

To reconstruct the phylogenetic position of the hamlets within the Serraninae subfamily, we searched the H. puella reference genome for the 27 genes considered in the FToL project (31) with Basic Local Alignment Search Tool (79) (23 of which were found; git 19.1 to 19.2). The corresponding regions were extracted from an unfiltered precursor genotype dataset of all BP, keeping only one high-coverage individual per species. Genotypes were converted to continuous sequence (Fasta format) using a custom Perl script and reverse complemented if necessary, and individual genes were aligned with MAFFT (80) to their FToL homologs (git 19.3 to 19.6). Regarding the latter, only species in the Serraninae were retained, with species considered in the present study replacing original sequences. Ambiguously aligned positions were automatically removed with GBlocks (81), and minor adjustments were made by hand to finalize the mitochondrial gene alignments (we also dropped two poorly aligned genes in the two Serranus species altogether instead of manually editing them). Maximum likelihood reconstruction was performed with IQ-TREE (99) based on a concatenation approach with edge-linked partition model and 1,000 ultrafast bootstrap replicates (git 19.7). For divergence time estimates and evolutionary rate analysis, we used the time-calibrated FToL phylogeny and Bayesian Analysis of Macroevolutionary Mixtures-estimated rates from the project’s data repository (83). Tip-specific speciation rates (λ) were plotted for all ray-finned fish species included in the original data. These were then subset to the Serraninae subfamily, and mean speciation rates along the phylogeny were extracted using the R package BAMMtools. Clade-specific rates for hamlets and Serraninae excluding hamlets, the 95% credible set of rate shift configurations, and macroevolutionary rate cohorts were estimated following the BAMM documentation (git 20.8).

Population Genetic Statistics

Throughout the study, all windowed statistics were computed over 50-kb sliding windows with 5-kb increments unless stated otherwise.

Genetic differentiation was computed from the SNPs only dataset with VCFtools following Weir and Cockerham (84) and using the weighted mean. It was calculated within 50-kb windows for each species pair within each location (git 3.9) and among all groups (individual species within locations) jointly (git 3.7). The genome-wide weighted mean F between populations of the same species was also calculated (git 3.21 to 3.24).

Permutation Test

To test for significance, F permutation tests were conducted for all pairs of sympatric species as well as for all pairs of allopatric populations of the same species. This was done genome wide for the SNPs only dataset as well as for a subset that excludes the regions above the 99.8th F percentile (, git 12.3 to 12.4), both filtered for a minor allele count of three (git 12.6). For a total of 105 iterations for each pair, the group assignment of the samples was permuted, and F was computed (git 12.10 to 12.13). For each pair, the observed F was then compared with the F distribution derived from the permutations to compute an empirical P value. To account for multiple testing, the α levels used as significance thresholds were adjusted following the modified false discovery rate procedure (85) ().

Principal Component Analysis

Principal component analyses were conducted for each location using the R package SNPRelate (git 7.8 to 7.9). These were based on the SNPs only dataset filtered for minor allele count (git 7.7). Each principal component analysis was repeated on the whole genome as well as a subset that excludes the regions above the 99.8th F percentile (, git 7.4). Genetic divergence (86) was computed from the all BP data within 50-kb windows. The data were reformatted to a custom genotype format, and divergence was computed using popgenWindows.py (87) (git 4.3 to 4.9). This statistic, computed within 50-kb windows, was defined as the range of divergence among pairs of sympatric species: among the 28 pairs. Nucleotide diversity was calculated for each hamlet group (species/ population) within 50-kb windows (git 4.19) with popgenWindows.py using the all BP dataset. G × P associations were based on the SNPs only dataset and estimated using a linear model with GEMMA (88). This approach takes population structure into account by considering a matrix of relatedness among individuals. The dataset was transformed to the plink format using VCFtools and plink (89) (git 3.11 to 3.12). G × P association was calculated on an SNP basis for the presence/absence of three phenotypic traits: vertical bars, saddle mark on the caudal peduncle, and spot on the snout (git 3.17). Phenotyping was based on photographs of all but five samples for which photographs were not available. A Wald test was conducted using GEMMA to determine the association between the phenotype and the genotype on a per SNP basis. The results were averaged over 50-kb windows (10 and 50 kb; git 3.20). Note that Wald test P values were transformed before averaging, so is reported for every window. GEMMA was also run under the linear mixed model, which provided similar results (). Population recombination rate (ρ) was estimated using the R package FastEPRR (90). Briefly, this approach uses boosting (a machine learning approach) to select the best regression model between recombination rate and a set of summary statistics. The analysis was based on the SNPs only dataset considering all samples (except out-groups) and calculated within nonoverlapping windows of 50 kb using 250 parallel jobs (git 6.4 to 6.10). These results were also used to explore the correlation between ρ, F, and π with linear regression ().

and Cross-Coalescence Rate

Demographic history was inferred using the multiple sequentially Markovian coalescent method implemented in MSMC2 (44). This analysis was based on the phased SNPs only dataset, which was prepared following the MSMC2 authors recommendations (91) as detailed in ref. 45. This included masking the data on the basis of mappability to the reference genome and the occurrence of indels. The data were also filtered with respect to coverage for each individual (between and twice the individual mean coverage; git 8.10). Individuals from each species and location were randomly grouped into sets of three or four, with each individual included in only one set (git 8.12), and individual masks were combined to create the MSMC2 input files (git 8.16 to 8.21). Individuals were also grouped for the cross-coalescence rate analysis, with each group containing two individuals of each species for all pairs of sympatric species. Here, each individual was assigned to only one group for each species pair but reused across species pairs. All MSMC2 analyses were run with a time segmentation pattern of and the average of Watterson’s estimator across input datasets (). The mutation rate was set to based on the closest relative for which we could find a reliable mutation rate estimate (91, 92) (git 8.18 and 8.23).

Identification of Putative Hybrids and Backcrosses

We used the approach implemented in NewHybrids (93) to evaluate ongoing gene flow among sympatric species. This method is based on patterns of Mendelian inheritance at highly differentiated loci. This analysis was based on small subsets of the SNPs only dataset. First, the 800 most differentiated SNPs were selected for each pair of sympatric species (git 9.5 to 9.6). These were then filtered for a minimum physical distance of 5 kb to reduce linkage among them using VCFtools (git 9.7). From this SNP set, 80 SNPs were randomly chosen using bash scripting and converted to the NewHybrids input format using PGDSpider (94). The assignment to hybrid classes with NewHybrids was implemented in the R package parallelnewhybrid (95), which was run with a burn-in of 106 iterations and 107 sweeps (git 9.8). Individuals with assignment to one hybrid class with a posterior probability ¿0.99 were considered high-probability hybrids or backcrosses. The NewHybrids approach has the advantage of not requiring the a priori identification of pure individuals, but we note that it considers only one species pair at a time, ignoring the effect of other species on each pairwise analysis.

D Statistics

Levels of introgression among species/populations were assessed by applying Patterson’s D (96) and related statistics. After removing all Serranus samples, the SNPs only dataset was filtered to exclude sites with linkage disequilibrium coefficients greater than 0.5 within 50-kb windows using the BCFtools (97) plug-in prune (git 17.1 to 17.4). Using the resulting 4,902,398 sites, D statistics were then calculated for all 364 population/species trios with Dsuite’s Dtrios (98) using default parameters and H. floridae as the out-group. P values were adjusted for multiple testing using the Bonferroni procedure (git 17.5 to 17.8).

Genome-Wide Hamlet Phylogeny

To reconstruct the phylogenetic relationships among hamlets, we divided the genome into nonoverlapping windows of 5 kb. A total of 5,000 were selected randomly from those windows that contained less than 20% missing sites and at least 50 SNPs within hamlets in the all BP genotype dataset (69% of all windows met these criteria; git 13.1 to 13.11). The selected windows were extracted using VCFtools, and Serranus samples were removed in one copy of the dataset. After conversion to continuous sequence (Fasta format) using a custom Perl script, windows were individually realigned with MAFFT (80) (git 13.12 to 13.17). Maximum likelihood phylogenies were calculated for each window in IQ-TREE (82) based on the automatically selected best-fit model and default parameters (git 13.18). A coalescent-based species tree was then estimated from all local trees with ASTRAL-III (99), with internal branch length measured in coalescent units and support values given as local posterior probabilities (100) (git 13.19). Two species trees were produced that way, one rooted with H. floridae from the Gulf of Mexico clade and another one rooted with the three Serranus samples.

Region-Specific Phylogenies

The three genomic regions of interest were extracted from the all BP data (i.e., including invariant sites) set using VCFtools. All Serranus and hybrid samples identified by NewHybrids were removed alongside indel positions (git 14.1 to 14.3) before conversion to Fasta format and per site allele frequencies for each group using the cflib library. Region-specific phylogenies were then inferred at the level of individual samples with RAxML-NG (101) based on the GTR + G model, 10 each of random and parsimony starting trees, and 100 bootstrap replicates (git 14.5 to 14.6). Group-level maximum likelihood trees for each region were also estimated with IQ-TREE (82) using a polymorphism-aware model (102) and 100 nonparametric bootstrap replicates (git 14.4).

Topology Weighting

Topology weighting was conducted for Belize and Honduras independently (the number of taxa in Panama is too small to conduct this analysis). The SNPs only dataset was subset to include only hamlets from the respective location and filtered to include only SNPs with a minor allele count greater than or equal to three (git 5.12). The data were then split by LG and converted to a custom genotype format (git 5.14). Using phyml_sliding_windows.py (87), we applied PhyML (103) to build phylogenies from nonoverlapping sliding windows of 200 SNPs each along all LGs (git 5.17). Topology weighting was conducted on the resulting phylogenies using Twisst (48) (git 5.18).

IBD Fragments

Fragments of IBD were identified using truffle (104). Out-group samples were removed from the SNPs only dataset with VCFtools (git 16.2), and truffle was run with three different minimum sequence length thresholds for IBD1 and IBD2 (25/20, 15/7.5, and 10/5 consecutive SNPs; git 16.3 and 16.5). IBD fragment lengths were then converted from base pairs to centimorgans based on two available linkage maps for Hypoplectrus (42) (git 16.6). IBD networks were then constructed from the pairwise genome-wide average IBD [computed as ] as edge weights in a force-directed graph (105). In order to limit the effect of low recombination and stabilizing selection on the detection of IBD fragments, the IBD fragments that overlapped with regions of the genome above the 95th IBD score percentile (git 16.4 to 16.5) and that were smaller than 0.2 cM were also filtered out.

Admixture Analysis

We used admixture (106) to analyze the population genetic patterns at three genomic regions of interest. For this, the out-group species were removed from the dataset, and the genotypes were subset to the respective candidate regions and converted to plink format (git 10.4). Admixture was run for all k values between 2 and 15 for each region (git 10.6).

Allele Age Estimation

This was done using Genealogical Estimation of Variant Age, a nonparametric method that combines probability distributions of the time to the most recent common ancestor of a large number of haplotypes based on empirically constructed hidden Markov models to estimate the age of genomic variants over a continuous timescale and without relying on a priori assumptions about demography or selection (55). The ancestral state of all SNPs was determined using the Serranus out-groups and allele frequency (git 11.3). For SNPs that where invariant in both S. tabacarius and S. tortugarum, the Serranus allele was set as ancestral. For the SNPs that were variant within Serranus, the major allele was set as ancestral. The information about the ancestral state was added as an annotation to the vcf file (git 11.3) using vcf-annotate, and the vcf file was recoded using a custom java script in combination with Jvarkit (107) (git 11.4). GEVA was looped over the entire dataset in batches of 250 SNPs, with a mutation rate of and an estimated effective population size of . Recombination rate was fixed at the average of each LG and based on the FastEPRR estimation, ranging from to (git 11.6). The results were then compiled for each LG (git 11.7).

Visualization

All results were plotted using R (git 20). The details of the visualization are provided in the R scripts and their documentation (git docs/index.html, file within the git repository). Other than the scripts within the GitHub repository, the visualization relied on the three custom R packages (hypogen, hypoimg, and GenomicOriginsScripts), which can also be accessed via GitHub (108). The R packages used are listed and referenced in . Package versions were managed using the R package renv; thus, the precise R configuration used for this study can be restored based on the provided lock file (git renv.lock, file within the git repository).
  80 in total

1.  Lack of mitochondrial genetic structure in hamlets (Hypoplectrus spp.): recent speciation or ongoing hybridization?

Authors:  Marina L Ramon; Phillip S Lobel; Michael D Sorenson
Journal:  Mol Ecol       Date:  2003-11       Impact factor: 6.185

2.  Improvement of phylogenies after removing divergent and ambiguously aligned blocks from protein sequence alignments.

Authors:  Gerard Talavera; Jose Castresana
Journal:  Syst Biol       Date:  2007-08       Impact factor: 15.683

3.  Recombination in the eggs and sperm in a simultaneously hermaphroditic vertebrate.

Authors:  L Theodosiou; W O McMillan; O Puebla
Journal:  Proc Biol Sci       Date:  2016-12-14       Impact factor: 5.349

4.  Evolution of Darwin's finches and their beaks revealed by genome sequencing.

Authors:  Sangeet Lamichhaney; Jonas Berglund; Markus Sällman Almén; Khurram Maqbool; Manfred Grabherr; Alvaro Martinez-Barrio; Marta Promerová; Carl-Johan Rubin; Chao Wang; Neda Zamani; B Rosemary Grant; Peter R Grant; Matthew T Webster; Leif Andersson
Journal:  Nature       Date:  2015-02-11       Impact factor: 49.962

5.  Involvement of the conserved Hox gene Antennapedia in the development and evolution of a novel trait.

Authors:  Suzanne V Saenko; Marta Sp Marialva; Patrícia Beldade
Journal:  Evodevo       Date:  2011-04-19       Impact factor: 2.250

6.  Estimating dispersal from genetic isolation by distance in a coral reef fish (Hypoplectrus puella).

Authors:  Oscar Puebla; Eldredge Bermingham; Frédéric Guichard
Journal:  Ecology       Date:  2009-11       Impact factor: 5.499

7.  Transcriptomics of colour patterning and coloration shifts in crows.

Authors:  J W Poelstra; N Vijay; M P Hoeppner; J B W Wolf
Journal:  Mol Ecol       Date:  2015-09-10       Impact factor: 6.185

8.  A framework for variation discovery and genotyping using next-generation DNA sequencing data.

Authors:  Mark A DePristo; Eric Banks; Ryan Poplin; Kiran V Garimella; Jared R Maguire; Christopher Hartl; Anthony A Philippakis; Guillermo del Angel; Manuel A Rivas; Matt Hanna; Aaron McKenna; Tim J Fennell; Andrew M Kernytsky; Andrey Y Sivachenko; Kristian Cibulskis; Stacey B Gabriel; David Altshuler; Mark J Daly
Journal:  Nat Genet       Date:  2011-04-10       Impact factor: 38.330

9.  RAxML-NG: a fast, scalable and user-friendly tool for maximum likelihood phylogenetic inference.

Authors:  Alexey M Kozlov; Diego Darriba; Tomáš Flouri; Benoit Morel; Alexandros Stamatakis
Journal:  Bioinformatics       Date:  2019-11-01       Impact factor: 6.937

10.  Fast and accurate short read alignment with Burrows-Wheeler transform.

Authors:  Heng Li; Richard Durbin
Journal:  Bioinformatics       Date:  2009-05-18       Impact factor: 6.937

View more
  1 in total

1.  Colour patterns influence symbiosis and competition in the anemonefish-host anemone symbiosis system.

Authors:  Kina Hayashi; Katsunori Tachihara; James Davis Reimer; Vincent Laudet
Journal:  Proc Biol Sci       Date:  2022-10-05       Impact factor: 5.530

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.