| Literature DB >> 28120863 |
João P Marques1,2, Liliana Farelo1, Joana Vilela1, Dan Vanderpool3, Paulo C Alves1,2, Jeffrey M Good3, Pierre Boursot4, José Melo-Ferreira1,2.
Abstract
Introgressive hybridization is an important and widespread evolutionary process, but the relative roles of neutral demography and natural selection in promoting massive introgression are difficult to assess and an important matter of debate. Hares from the Iberian Peninsula provide an appropriate system to study this question. In its northern range, the Iberian hare, Lepus granatensis, shows a northwards gradient of increasing mitochondrial DNA (mtDNA) introgression from the arctic/boreal L. timidus, which it presumably replaced after the last glacial maximum. Here, we asked whether a south-north expansion wave of L. granatensis into L. timidus territory could underlie mtDNA introgression, and whether nuclear genes interacting with mitochondria ("mitonuc" genes) were affected. We extended previous RNA-sequencing and produced a comprehensive annotated transcriptome assembly for L. granatensis. We then genotyped 100 discovered nuclear SNPs in 317 specimens spanning the species range. The distribution of allele frequencies across populations suggests a northwards range expansion, particularly in the region of mtDNA introgression. We found no correlation between variants at 39 mitonuc genes and mtDNA introgression frequency. Whether the nuclear and mitochondrial genomes coevolved will need a thorough investigation of the hundreds of mitonuc genes, but range expansion and species replacement likely promoted massive mtDNA introgression.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28120863 PMCID: PMC5264399 DOI: 10.1038/srep40788
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Geographic distribution of four hare species in the Iberian Peninsula (a,c) and Western Europe (b) (approximate distributions were based on Mitchel-Jones et al.57). (a) Localities sampled for L. granatensis (see Supplementary Table S1 for a detailed description); numbers indicate sample size, pie charts the proportion of STRUCTURE assignment to each of 3 clusters using the LOC prior and the complete SNP dataset (100 loci); the “X” marks the inferred origin of the range expansion; grey letters indicate specimens for which RNA was sequenced and used to build the transcriptome (“d-n” from Cahais et al.22 and Gayral et al.23, and “o” from this work). (b) Localities sampled for L. timidus, indicating the sample sizes; two sampling localities are not shown on this map (RUS – Russia; and FER – Far East Russia). (c) Proportion of the mitochondrial DNA lineages, native L. granatensis or introgressed from L. timidus, in the genotyped samples. Maps were generated in vectorial format using Inkscape v0.91 (https://inkscape.org).
Summary statistics of L. granatensis de novo transcriptome assembly, considering three filtering levels: raw assembly, removing redundancy and retaining contigs with ORF and/or reciprocal best blast hit onto rabbit transcripts and peptides.
| Raw | Non-redundant | ORF and/or blast hit | |
|---|---|---|---|
| Number of contigs | 54,838 | 50,580 | 24,608 |
| Average contig length (bp) | 800 | 761 | 1,063 |
| Total length (bp) | 43,877,813 | 38,480,389 | 26,161,714 |
| Maximum contig length (bp) | 12,481 | 12,481 | 12,481 |
| Minimum contig length (bp) | 224 | 224 | 224 |
| N50 (bp) | 1,334 | 1,247 | 1,724 |
| Number of contigs > 1 kb | 13,340 | 11,299 | 9,361 |
| Proportion of contigs > 1 kb (%) | 24.3 | 22.3 | 38.0 |
| Reference Proteins with blast hit (%) | 51 | 51 | 51 |
| Reference coverage (%) | 32 | 32 | 32 |
Figure 2Organization of genetic diversity in Lepus granatensis from the analysis of 100 SNP loci (see Supplementary Figs S2–S5 for a complete description of the results obtained with several subsets of the dataset): (a) Structure plots with individual assignment to 3 clusters, as determined using Evanno’s delta K method, and using the sample locations as priors of the admixture model; population codes as in Fig. 1; (b) coordinates of samples on the first two axes of variation determined with a Principal Component Analysis (PCA) (stars correspond to specimens from Northwestern Iberian population GAL; see Fig. 1); (c) Correlation between the first two PCA axes of variation and geographical coordinates of sample localities (Spearman rank correlation, p = 0.00 for both analyses; dashed line indicates a linear regression trendline); (d) correlation between genetic differentiation and geographic distance among pairs of populations (Spearman rank correlation, p = 0.00; dashed line indicates a linear regression trendline).