| Literature DB >> 26590214 |
Dan Bar-Yaacov1, Zena Hadjivasiliou2, Liron Levin1, Gilad Barshad1, Raz Zarivach1, Amos Bouskila1, Dan Mishmar3.
Abstract
Compatibility between the nuclear (nDNA) and mitochondrial (mtDNA) genomes is important for organismal health. However, its significance for major evolutionary processes such as speciation is unclear, especially in vertebrates. We previously identified a sharp mtDNA-specific sequence divergence between morphologically indistinguishable chameleon populations (Chamaeleo chamaeleon recticrista) across an ancient Israeli marine barrier (Jezreel Valley). Because mtDNA introgression and gender-based dispersal were ruled out, we hypothesized that mtDNA spatial division was maintained by mito-nuclear functional compensation. Here, we studied RNA-seq generated from each of ten chameleons representing the north and south populations and identified candidate nonsynonymous substitutions (NSSs) matching the mtDNA spatial distribution. The most prominent NSS occurred in 14 nDNA-encoded mitochondrial proteins. Increased chameleon sample size (N = 70) confirmed the geographic differentiation in POLRMT, NDUFA5, ACO1, LYRM4, MARS2, and ACAD9. Structural and functionality evaluation of these NSSs revealed high functionality. Mathematical modeling suggested that this mito-nuclear spatial divergence is consistent with hybrid breakdown. We conclude that our presented evidence and mathematical model underline mito-nuclear interactions as a likely role player in incipient speciation in vertebrates.Entities:
Keywords: RNA-seq; mito-nuclear coevolution; mitochondrial DNA; nuclear DNA; speciation; transcriptome
Mesh:
Year: 2015 PMID: 26590214 PMCID: PMC4700957 DOI: 10.1093/gbe/evv226
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
FMap of sampling sites. Sampling sites marked in red and green represent chameleons with the south and north mtDNA types, respectively. Exact coordinates are specified in supplementary table S1 and file S1, Supplementary Material online. RNA-seq was performed on the following samples: D1, D2, D4, D6, D7, D8, D10, D12, D13, and D16. Black line indicates the geographic divergence boundary between chameleons with north and south mtDNA types. The figure was generated using Google earth program.
NSS in nDNA-Encoded Mitochondrial Proteins with Correlated Genotype Distribution to the mtDNA Geographic Pattern
| Gene Name | Sample Size | Number of NSS | Bonferroni | Fst | Human Protein Ref-seq Accession Number | |
|---|---|---|---|---|---|---|
| 70 | 2 | 0.000010 | 0.00014 | 0.18 | NP_005026.3 | |
| 69 | 1 | 0.000025 | 0.00035 | 0.29 | NP_002188.1 | |
| 70 | 1 | 0.000044 | 0.00062 | 0.25 | NP_004991.1 | |
| 70 | 1 | 0.001700 | 0.02380 | 0.14 | NP_065141.3 | |
| 70 | 3 | 0.002160 | 0.03024 | 0.14 | NP_612404.1 | |
| 61 | 1 | 0.003300 | 0.04620 | 0.11 | NP_054768.2 | |
| 69 | 1 | 0.006300 | >0.05 | — | NP_065796.1 | |
| 70 | 1 | 0.008500 | >0.05 | — | NP_000117.1 | |
| 70 | 1 | 0.014000 | >0.05 | — | NP_660213.1 | |
| 70 | 1 | >0.05 | >0.05 | — | NP_001203.1 | |
| 70 | 1 | >0.05 | >0.05 | — | NP_006010.2 | |
| 53 | 1 | >0.05 | >0.05 | — | NP_002992.1 | |
| Not included in the final analysis | — | NP_000584.2 | ||||
| Not included in the final analysis | — | NP_006205.1 | ||||
aSample size, total number of genotyped chameleons; number of NSS, number of NSS per gene; P value, chi-square P value; Bonferroni P value, P value after Bonferroni correction; Fst, calculated Fst value for each gene using Genepop (Rousset 2008).
Functionality Assessment of the NSS
| Protein Name | Human Position | AA North | AA South | PANTH | SIFT | CI | AA Distribution | Number of Species | Species used |
|---|---|---|---|---|---|---|---|---|---|
| POLRMT | 1090 | Q | L | 0.26 | 0.51 | 1 | V (32), | 78 | 46 mammals, 16 birds, 2 reptiles, 2 amphibians, 12 fish |
| Q>L | |||||||||
| POLRMT | 1218 | N | D | — | 0.38 | 78 | 46 mammals, 16 birds, 2 reptiles, 2 amphibians, 12 fish | ||
| D>N | |||||||||
| ACO1 | 601 | H | R | 0.2 | 1 | 106 | 71 mammals, 16 birds, 3 reptiles, 2 amphibians, 14 fish | ||
| NDUFA5 | 95 | A | V | 0.11 | 0.19 | 1 | I (30.1), L (26.2), | 103 | 69 mammals, 17 birds, 2 reptiles, 2 amphibians, 13 fish |
| R > A | |||||||||
| 0.57 | |||||||||
| R > V | |||||||||
| LYRM4 | 51 | A | E | 0.21 | 5 | 85 | 57 mammals, 13 birds, 2 reptiles, 2 amphibians, 11 fish | ||
| E > A | |||||||||
| MARS2 | 298 | L | F | 0.26 | 0.2 | 1 | K (49), G (15.3), E (10.2), R (6.1), N (5.1), S (4.1), Q (3.1), | 98 | 63 mammals, 16 birds, 3 reptiles, 2 amphibians, 14 fish |
| K > L | |||||||||
| MARS2 | 345 | G | R | 98 | 63 mammals, 16 birds, 3 reptiles, 2 amphibians, 14 fish | ||||
| MARS2 | 498 | E | K | 0.31 | 0.09 | 1 | 98 | 63 mammals, 16 birds, 3 reptiles, 2 amphibians, 14 fish | |
| E > K | |||||||||
| ACAD9 | 503 | M | I | 0.4 | 5 | E (77.6), Q (20.4), N (1), L (1) | 98 | 65 mammals, 15 birds, 3 reptiles, 2 amphibians, 13 fish | |
aHuman position, position in the human protein ortholog; AA north, the prominent amino acid form in the northern population; AA south, the prominent amino acid form in the southern population; PANTH, PANTHER score; SIFT, SIFT score in the vertebrate MSA; CI, conservation index score extracted from the MSA; AA distribution, amino acid distribution in the vertebrate MSA (percentage in parenthesis). Bold in the SIFT, PANTH, and CI columns—values that passed the threshold of significance. Bold in the column entitled “AA Distribution”—amino acid variants found in chameleons but also in other species.
FMaximum likelihood tree based on 21 whole mtDNA sequences. Bootstrap scores of 1,000 replicates are indicated near the branches. Sequences from samples marked with a “D” were generated in this study, while the rest are from our previous work (Bar-Yaacov, Arbel-Thau, et al. 2012). GenBank accession numbers are the following: Portugal EF222198, Cyprus EF222200, Turkey 1 EF222201, and Turkey 2 EF222202. The tree is drawn to scale, with branch lengths calculated using the average pathway method. Fst between the two populations based on the whole mtDNA sequences was 0.885. For accession numbers, see Materials and Methods section. For sample collection sites, see figure 1 and supplementary table S1, Supplementary Material online.
Distribution of Each NSS in the Six analyzed nDNA-Encoded Mitochondrial Genes
| Protein, Amino Acid Position, and mtDNA Type | Homozygote 1 | Heterozygote | Homozygote 2 | Total Samples Analyzed |
|---|---|---|---|---|
| POLRMT 1090 | A (Q) | A/T (Q/L) | T (L) | 70 |
| North | 16 (0.457) | 17 (0.485) | 2 (0.057) | 35 |
| South | 2 (0.057) | 17 (0.485) | 16 (0.457) | 35 |
| POLRMT 1218 | A (N) | A/G (N/D) | G (D) | 70 |
| North | 4 (0.114) | 21 (0.6) | 10 (0.286) | 35 |
| South | 0 (0) | 7 (0.2) | 28 (0.8) | 35 |
| ACO1 601 | A (H) | A/G (H/R) | G (R) | 69 |
| North | 7 (0.2) | 16 (0.457) | 11 (0.314) | 35 |
| South | 0 (0) | 5 (0.147) | 30 (0.882) | 34 |
| NDUFA5 95 | C (A) | C/T (A/V) | T (V) | 70 |
| North | 30 (0.857) | 5 (0.147) | 0 | 35 |
| South | 12 (0.343) | 18 (0.514) | 5 (0.147) | 35 |
| LYRM4 51 | A (A) | A/C (A/E) | C (E) | 70 |
| North | 8 (0.228) | 20 (0.571) | 7 (0.2) | 35 |
| South | 23 (0.657) | 9 (257) | 3 (0.857) | 35 |
| MARS2 298 | C (L) | C/T (L/F) | T (F) | 67 |
| North | 30 (0.857) | 4 (0.114) | 1 (0.028) | 35 |
| South | 14 (0.437) | 13 (0.406) | 5 (0.156) | 32 |
| MARS2 345 | G (G) | G/A (G/R) | A (R) | 70 |
| North | 30 (0.857) | 4 (0.114) | 1 (0.028) | 35 |
| South | 19 (0.543) | 12 (0.343) | 4 (0.114) | 35 |
| MARS2 498 | G (E) | G/A (E/K) | A (K) | 70 |
| North | 30 (0.857) | 4 (0.114) | 1 (0.028) | 35 |
| South | 19 (0.543) | 12 (0.343) | 4 (0.114) | 35 |
| ACAD9 503 | G (M) | G/A (M/I) | A (I) | 61 |
| North | 29 (0.966) | 1 (0.033) | 0 (0) | 30 |
| South | 21 (0.677) | 10 (0.323) | 0 (0) | 31 |
Note.—Amino acid alleles and allele frequencies are given in in parentheses.
FPosition 95 in NDUFA5 is positively selected in vertebrates. Notably, only position 95 gained the highest score for positive selection while analyzing multiple sequence alignment of 100 vertebrates. Orange and purple correspond to the highest positively and negatively selected scores, respectively.
FStructural modeling of POLRMT NSS in chameleons. Left, chameleon POLRMT model (blue) superimposed on the promoter(red)-bound T7 RNA polymerase (yellow, 1CEZ). The NSS in position 1090 is shown to be within the specificity loop of POLRMT and near the promoter region.
FThe expected fitness curves of different hybrid crosses as a function of the number of diverging nDNA loci interacting with the mtDNA. Values were computed using equations (1)–(4) (see Methods). Orange: F1 individuals; black: F1 males backcrossed to either parental population or F1 females backcrossed to their maternal population; red: F2 hybrids; blue: F1 females backcrossed to their paternal population. These curves were calculated under the assumption that the cost for heterozygotes (s1) equals to 0.05 and the cost for a homozygote against the wrong mitochondrial background equals to 0.5. Variation in these values affects the steepness of the fitness curves but their order is maintained.