| Literature DB >> 26172569 |
Shosei Kubota1, Takaya Iwasaki2, Kousuke Hanada3, Atsushi J Nagano4, Asao Fujiyama5, Atsushi Toyoda5, Sumio Sugano6, Yutaka Suzuki6, Kouki Hikosaka7, Motomi Ito2, Shin-Ichi Morinaga8.
Abstract
Adaptive divergence at the microgeographic scale has been generally disregarded because high gene flow is expected to disrupt local adaptation. Yet, growing number of studies reporting adaptive divergence at a small spatial scale highlight the importance of this process in evolutionary biology. To investigate the genetic basis of microgeographic local adaptation, we conducted a genome-wide scan among sets of continuously distributed populations of Arabidopsis halleri subsp. gemmifera that show altitudinal phenotypic divergence despite gene flow. Genomic comparisons were independently conducted in two distinct mountains where similar highland ecotypes are observed, presumably as a result of convergent evolution. Here, we established a de novo reference genome and employed an individual-based resequencing for a total of 56 individuals. Among 527,225 reliable SNP loci, we focused on those showing a unidirectional allele frequency shift across altitudes. Statistical tests on the screened genes showed that our microgeographic population genomic approach successfully retrieve genes with functional annotations that are in line with the known phenotypic and environmental differences between altitudes. Furthermore, comparison between the two distinct mountains enabled us to screen out those genes that are neutral or adaptive only in either mountain, and identify the genes involved in the convergent evolution. Our study demonstrates that the genomic comparison among a set of genetically connected populations, instead of the commonly-performed comparison between two isolated populations, can also offer an effective screening for the genetic basis of local adaptation.Entities:
Mesh:
Year: 2015 PMID: 26172569 PMCID: PMC4501782 DOI: 10.1371/journal.pgen.1005361
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Fig 1Location and of the studied populations.
(A) Geographic locations of the two mountains (the main study sites) and the four low-altitude reference populations. Altitude is indicated by the numbers in the population names. See S1 Table for coordinates. (B) Locations of the four altitude-specific populations on each mountain.
Summary statistics of genetic diversity and differentiation among the altitude-specific populations.
| Heterozygostiy statistics | Pairwise | |||||
|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
| IB0380 | 0.162 | 0.051 | 0.047 | |||
| IB0600 | 0.162 | 0.050 | 0.046 | 0.027 | ||
| IB1000 | 0.224 | 0.064 | 0.053 | 0.046 | 0.043 | |
| IB1250 | 0.209 | 0.061 | 0.056 | 0.048 | 0.046 | 0.034 |
|
|
|
|
|
|
|
|
| FJ0200 | 0.224 | 0.066 | 0.055 | |||
| FJ0400 | 0.204 | 0.060 | 0.049 | 0.034 | ||
| FJ0700 | 0.228 | 0.064 | 0.056 | 0.036 | 0.034 | |
| FJ1100 | 0.196 | 0.061 | 0.042 | 0.043 | 0.041 | 0.037 |
P, proportion of polymorphic loci
H e, mean expected heterozygosity
H o, mean observed heterozygosity.
Fig 2Genetic structures of the populations.
(A) structure analysis with a K of 2 to 8 using all 56 individuals from the 12 populations. The result for each K is based on the simulation that provided the best LnP(D) value (the log probability value) among 20 independent runs. Each bar represents an individual and the estimated membership in a particular genetic cluster. (B) Plotting of the mean LnP(D) values from the structure analysis (blue dots) and Evanno’s ΔK (red dots). Error bars indicate the standard deviation of LnP(D) values from the 20 independent runs. Both the maximum value of LnP(D) and the peak position of Evanno’s ΔK are found at K = 6. (C) Maximum likelihood tree for the 12 populations obtained from TreeMix. The bootstrap supports for the nodes were calculated from 100 replicates. The scale bar represents 10 times the average standard error of the entries in the covariance matrix. Horizontal branch lengths are proportional to the amount of genetic drift. See S2 Fig for additional analysis within each mountain.
Fig 3Genome-wide frequency distribution of the three indices.
Histograms show the frequency distribution of (A, B), (C, D), and (E, F) estimated for all 527,225 SNPs in Mt. Ibuki (A, C, E) and Mt. Fujiwara (B, D, F). Box plots for each index are shown above the histogram. Whiskers of the box plot indicates the 1.5 times the IQR (interquartile range). Spikes below the histograms show the SNPs that fulfilled the threshold for each criterion.
Fig 4Overlap of screened SNPs among the three criteria.
Venn diagram shows the overlaps of screened SNPs among the three criteria. The number of SNPs that fulfilled all three criteria were 5,523 (1.05% of the 527,225 SNPs) in Mt. Ibuki (A) and 5,407 (1.03% of the 527,225 SNPs) in Mt. Fujiwara (B).
Fig 5Enrichment analyses of the selected Gene Ontology terms.
The histograms show the fold enrichment of a given GO term within each dataset for the two mountains. Vertical red line indicate the expected ratio of SNPs or genes associated with a specific GO term under the null hypothesis. Significant enrichment was accepted and denoted with asterisks if the corresponding false discovery rate (FDR) q-value was below 0.05. Here, we show only a subset of the tested GO terms. See S2 Table for the full list of GO terms.
Genes within the top 20 genomic islands from each mountain.
|
| |||
|
|
|
|
|
| 1 |
|
| gluconeogenesis; cytoskeleton organization; embryo sac development |
| 2 | AT3G44713 | ||
| 3 | AT2G43160 | ||
| 4* |
|
| actin filament-based movement; Golgi localization; mitochondrion localization |
| 5* | AT4G00310 |
| seed dormancy process; leaf development; response to freezing |
| 6 | AT2G48060 | ||
| 7* (2) | AT4G31300 |
| hyperosmotic response; response to temperature stimulus; response to cadmium ion |
| 8 |
| protein import into nucleus | |
| 9* | AT1G80930 | translation | |
| 10* |
|
| response to cold; detection of biotic stimulus; photosynthesis |
| 11* | AT2G33820 |
| mitochondrial transport |
| 12* | AT4G04972 | ||
| 13 | AT1G28240 | ||
| 14* | AT2G41500 |
| meristem structural organization; seed dormancy process; response to freezing |
| 15* | AT4G27010 |
| embryo sac egg cell differentiation; regulation of flower development; maintenance of meristem identity |
| 16 | AT1G33410 |
| response to auxin; regulation of flower development; maintenance of meristem identity |
| 17* | AT1G79560 |
| chloroplast organization; embryo development ending in seed dormancy; ovule development |
| 18 (1) | AT2G36850 |
| meristem initiation; trichome morphogenesis; telomere maintenance in response to DNA damage |
| 19 |
| vegetative to reproductive phase transition of meristem; protein desumoylation; hydrogen peroxide biosynthetic process | |
| 20* |
|
| cytokinesis by cell plate formation; regulation of transcription, DNA-templated |
|
| |||
|
|
|
|
|
| 1 (18) |
|
| meristem initiation; trichome morphogenesis; telomere maintenance in response to DNA damage |
| 2* (7) | AT4G31300 |
| hyperosmotic response; response to temperature stimulus; response to cadmium ion |
| 3 | AT2G41225 | ||
| 4 | AT4G30990 | ||
| 5 | AT2G40270 | response to bacterium; response to insect; regulation of plant-type hypersensitive response | |
| 6 | AT2G45880 |
| vernalization response; regulation of shoot system development |
| 7* | AT5G63190 | auxin-activated signaling pathway; response to sucrose; response to fructose | |
| 8* | AT1G06040 |
| hyperosmotic response; response to temperature stimulus; response to light stimulus |
| 9 | AT1G29400 |
| fatty acid beta-oxidation; positive regulation of meiosis; positive regulation of growth |
| 10* | AT3G03340 |
| positive regulation of cell proliferation; double fertilization forming a zygote and endosperm |
| 11 | AT1G63440 |
| response to zinc ion; detoxification of copper ion; response to copper ion |
| 12* | AT2G38823 | ||
| 13* | AT1G15690 |
| response to water deprivation; response to salt stress; leaf development |
| 14* | AT1G25510 | proteolysis | |
| 15 | AT3G15300 |
| |
| 16* |
|
| intracellular protein transport |
| 17 | AT1G32750 |
| RNA splicing, via endonucleolytic cleavage and ligation; transcription from RNA polymerase II promoter; DNA mediated transformation |
| 18 | AT4G25530 |
| trichome morphogenesis; photoperiodism, flowering; cell wall organization |
| 19 | AT1G52830 |
| de-etiolation; response to auxin |
| 20 | AT2G46430 |
| |
Numbers in parenthesis indicate rank in the other mountain. Asterisks indicate the presence of other genes that are located in the same genomic island. AGI codes in bold indicates genes with nonsynonymous SNPs that were highly differentiated (G’ST > 0.4) between the lowest and highest populations. See S3 Table for the extended list.
Fig 6Local signature of a unidirectional allele frequency shift across altitudes.
(A–D) The four colour-coded line graphs in each panel correspond to the allele frequency difference of the altitude-specific populations compared to the lowest population. Each dot of the line graph is an average of allele frequency differences 2 kbp down- and upstream from its genomic position (4 kbp window size). Arrows indicate the mapped exons of A. thaliana genes and small black dots represent the observed SNP positions. Continuous trend of unidirectional allele frequency shift was considered a footprint of natural selection and the proximal gene was accepted as candidates. (A, B) Example for a mountain-specific candidate gene. A steep allele frequency cline is found in 3 to 13 kbp regions of scaffold 11982 in Mt. Ibuki with a peak located near the 5’ UTR of EDA8 (A). Conversely, no such trend is observed in the same genomic region in Mt. Fujiwara (B). (C, D) Example for a ‘shared’ candidate gene. Unidirectional allele frequency shift is detected from the 36 kbp and 50 kbp region (and most likely further) of scaffold 14751in both Mt. Ibuki (C) and Mt. Fujiwara (D). The region overlaps the exons of the GSL8 gene. See Table 2 for other genes screened in our analysis.