| Literature DB >> 35646309 |
Stephen J Sabatino1,2,3, Paulo Pereira1,2,3, Miguel Carneiro1,2,3, Jolita Dilytė1,2, John Patrick Archer1,2, Antonio Munoz1,2, Francesco Nonnis-Marzano3,4, Antonio Murias1,2,3.
Abstract
Studying the genetics of phenotypic convergence can yield important insights into adaptive evolution. Here, we conducted a comparative genomic study of four lineages (species and subspecies) of anadromous shad (Alosa) that have independently evolved life cycles entirely completed in freshwater. Three naturally diverged (A. fallax lacustris, A. f. killarnensis, and A. macedonica), and the fourth (A. alosa) was artificially landlocked during the last century. To conduct this analysis, we assembled and annotated a draft of the A. alosa genome and generated whole-genome sequencing for 16 anadromous and freshwater populations of shad. Widespread evidence for parallel genetic changes in freshwater populations within lineages was found. In freshwater A. alosa, which have only been diverging for tens of generations, this shows that parallel adaptive evolution can rapidly occur. However, parallel genetic changes across lineages were comparatively rare. The degree of genetic parallelism was not strongly related to the number of shared polymorphisms between lineages, thus suggesting that other factors such as divergence among ancestral populations or environmental variation may influence genetic parallelism across these lineages. These overall patterns were exemplified by genetic differentiation involving a paralog of ATPase-α1 that appears to be under selection in just two of the more distantly related lineages studied, A. f. lacustris and A. alosa. Our findings provide insights into the genetic architecture of adaptation and parallel evolution along a continuum of population divergence.Entities:
Keywords: adaptation; genomics; parallel evolution; phenotypic convergence; whole‐genome pool‐seq
Year: 2022 PMID: 35646309 PMCID: PMC9130566 DOI: 10.1002/ece3.8908
Source DB: PubMed Journal: Ecol Evol ISSN: 2045-7758 Impact factor: 3.167
FIGURE 1(a) Map of sampling locations for the four lineages studied: A. alosa—Atlantic (navy); A. f. lacustris—Italy (orange); A. f. killarnensis—Ireland (green); and A. macedonica—Black Sea (black). The name of each location sampled is as follows: 1—Garonne River; 2—Mondego River; 3—Aguieira Reservoir; 4—Castelo de Bode Reservoir; 5—Alqueva Reservoir; 6—Tavignano River; 7—Po River; 8—Lake Maggiore; 9—Lake Como; 10—Lake Garda; 11—Mondego River; 12—Lima River; 13—Lough Leane; 14—Danube River, Tulcea; 15—Danube River, Iron Gate; and 16—Lake Volvi. The locations marked by a square on the map are anadromous populations, and those with a circle are freshwater. (b) Phylogenetic tree of the populations studied based on genetic distance (F ST). (c) The map identification number (ID), species, sample size, and population history of each sample. (d) A schematic of the experimental design used in this study
Statistics from the ALLPATHS genome assembly for A. alosa
| Estimated genome size (bp) | 889,386,068 |
| Assembly size (bp) | 876,687,730 |
| Scaffold N50 (bp) | 2,036,090 |
| Contig N50 (bp) | 9,900 |
| Total contig length (bp) | 621,125,889 |
| No. of scaffolds >1000 bp | 15,886 |
| Repeat content | 36% |
| % GC content | 43.4% |
| Heterozygosity | 1/230 |
| Protein‐coding gene count | 32,983 |
FIGURE 2Manhattan plots showing the significance of ΔAF between anadromous and freshwater populations of the four lineages studied: A. alosa—Atlantic (navy), A. f. killarnensis—Ireland (green); A. f. lacustris—Italy (orange); and A. macedonica—Black Sea (black). On the x‐axis, the SNPs are ordered based on their position in each scaffold from our A. alosa genome assembly, with the largest scaffolds on the left and the smallest on the right. The y‐axis shows ‐log(10) p‐values calculated using 1000 permutations of the data per lineage. The horizontal lines indicate the 99.9th (black) and 99th (white) percentiles
Numbers of outlier windows (20 consecutive SNPs) and regions (outlier windows within 20 kilobases of each other) per lineage and population(s) studied
| Lineage | Population | 99th percentile | 99.9th percentile | ||
|---|---|---|---|---|---|
| Windows | Regions | Windows | Regions | ||
|
| All | 1394 | 397 | 140 | 36 |
| Aguieira Reservoir | 398 | 37 | |||
| Castelo de Bode Reservoir | 412 | 57 | |||
| Alqueva Reservoir | 431 | 31 | |||
|
| All | 1568 | 554 | 157 | 85 |
| Lake Maggiore | 609 | 83 | |||
| Lake Como | 556 | 77 | |||
| Lake Garda | 581 | 79 | |||
|
| Lough Leane | 2173 | 552 | 218 | 70 |
|
| Lake Volvi | 1428 | 491 | 143 | 70 |
FIGURE 3Percentage of polymorphism shared compared with the percentage of outlier regions (99th percentile) in common for pairs and sets of freshwater populations studied. The colored circles with numbers refer to populations as shown in Figure 1
FIGURE 4Heat map of allele frequencies for nonsynonymous mutations found in ATPase‐α1.1b and Axin1‐like. The number atop each column is the location of the SNP in scaffold_411 in our A. alosa genome assembly. At the end of each row is the name of the population and a symbol indicating if it is anadromous (squares) or freshwater (circles). Each cell is shaded gray based on the frequency of the “freshwater” allele
FIGURE 53D‐model of Na,K‐ATPase and subsections of an amino acid alignment of ATPase‐α1 from several fish species. In the alignment, the alternative alleles for the three protein‐coding changes found in freshwater populations of A. alosa are shown. In the protein model, the domains where these three mutations occur are highlighted in yellow (nucleotide‐binding) and blue (transmembrane). The rest of ATPase‐α is colored white, while ATPase‐β and ATPase‐γ are both dark gray. The numbering of amino acid positions corresponds to that used in Shinoda et al. (2009) for the spiny dogfish