| Literature DB >> 30850829 |
Zhenyong Du1, Hiroki Hasegawa2, John R Cooley3, Chris Simon3, Jin Yoshimura4,5,6, Wanzhi Cai1, Teiji Sota2, Hu Li1.
Abstract
The mass application of whole mitogenome (MG) sequencing has great potential for resolving complex phylogeographic patterns that cannot be resolved by partial mitogenomic sequences or nuclear markers. North American periodical cicadas (Magicicada) are well known for their periodical mass emergence at 17- and 13-year intervals in the north and south, respectively. Magicicada comprises three species groups, each containing one 17-year species and one or two 13-year species. Within each life cycle, single-aged cohorts, called broods, of periodical cicadas emerge in different years, and most broods contain members of all three species groups. There are 12 and three extant broods of 17- and 13-year cicadas, respectively. The phylogeographic relationships among the populations and broods within the species groups have not been clearly resolved. We analyzed 125 whole MG sequences from all broods and seven species within three species groups to ascertain the divergence history of the geographic and allochronic populations and their life cycles. Our mitogenomic phylogeny analysis clearly revealed that each of the three species groups had largely similar phylogeographic subdivisions (east, middle, and west) and demographic histories (rapid population expansion after the last glacial period). The mitogenomic phylogeny also partly resolved the brood diversification process, which could be explained by hypothetical temporary life cycle shifts, and showed that none of the 13- and 17-year species within the species groups was monophyletic, possibly due to gene flow between them. Our findings clearly reveal phylogeographic structures in the three Magicicada species groups, demonstrating the advantage of whole MG sequence data in phylogeographic studies.Entities:
Keywords: zzm321990 Magicicadazzm321990 ; demographic history; life cycle polymorphism; mitogenome; parallel evolution; phylogeography
Mesh:
Year: 2019 PMID: 30850829 PMCID: PMC6526903 DOI: 10.1093/molbev/msz051
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 16.240
Taxonomy, Life Cycles, Spatial/Temporal Distributions in the Eastern United States, Mitochondrial and Nuclear DNA Lineages, and Mitochondrial Phylogeographic Subdivisions of Periodical Cicadas (Magicicada spp.).
| Phenotype | Spatial/Temporal Distribution | Molecular Phylogeny | |||||
|---|---|---|---|---|---|---|---|
| Species Group | Species | Life Cycle Length | Regions in Eastern US | Emergence Year (Brood) | mt/nu DNA Major Lineage | Mitochondrial Phylogeographic Subdivision | |
| Decim |
| } | 17-Year | North | I–X, XIII–XIV | A | Ae, Am, Aw |
|
| 13-Year | Midwest | XIX, XXIII | A | Aw | ||
|
| 13-Year | South | XIX, XXII–XXIII | B | B | ||
| Cassini |
| } | 17-Year | North | I–V, VIII–X, XIII–XIV | C | Ce, Cm, Cw |
|
| 13-Year | South | XIX, XXII–XXIII | C | Cw | ||
| Decula |
| } | 17-Year | North | I–VI, VIII–X, XIII–XIV | D | De, Dm, Dw |
|
| 13-Year | South | XIX, XXII–XXIII | D | (De), Dm, Dw | ||
Note.—“}” indicates a pair of species distinguishable only by different life cycle lengths. Recent emergence year of each brood: Brood I, 2012; Brood II, 2013; Brood III, 2014; Brood IV, 2015; Brood V, 2016; Brood VI, 2017; Brood VII, 2018; Brood VIII, 2019; Brood IX, 2020; Brood X, 2021; Brood XIII, 2024; Brood XIV, 2025 (17-year cicadas); Brood XIX, 2011; Brood XXII, 2014; Brood XXIII, 2015 (13-year cicadas).
Alexander and Moore (1962) and Marshall and Cooley (2000).
Simon (1988).
Sota et al. (2013) and present study.
. 1.Geographic distribution of samples and median-joining network of mitogenomic haplotypes in each species group. Different colors of the circles represent phylogeographic divisions. Brood numbers are indicated by Roman numerals next to the circles. Broken line indicates the boundary between 13- and 17-year cicadas. For the networks (right), colored circles represent haplotypes, and black circles represent missing haplotypes that were not observed. Solid lines between each of two linked haplotypes correspond to one mutation. Twenty mutations or less are represented by solid lines with small black dots. More than 20 mutations are indicated by dashed lines with numerals. The area of the circle is proportional to the number of haplotypes.
Genetic Diversity and Neutrality Test of the Four Major Lineages and Nine Phylogeographic Subdivisions of Periodical Cicadas.
| Group |
|
|
| Hd |
| Fu' | Tajima’s |
|---|---|---|---|---|---|---|---|
| Whole samples | 125 | 1,895 | 115 | 0.997 | 0.0534 | 0.96 | 3.98 |
| Decim group | 48 | 324 | 48 | 1.000 | 0.0062 | –10.68 | 0.79 |
| Cassini group | 39 | 178 | 37 | 0.996 | 0.0033 | –7.67 | 0.47 |
| Decula group | 38 | 93 | 30 | 0.976 | 0.0013 | –6.56 | –0.61 |
| Decim A group (13/17-y) | 38 | 126 | 38 | 1.000 | 0.0014 | –22.69 | –1.23 |
|
| 10 | 28 | 10 | 1.000 | 0.0006 | –4.06 | –0.78 |
|
| 30 | 105 | 30 | 1.000 | 0.0013 | –15.70 | –1.08 |
|
| 8 | 27 | 8 | 1.000 | 0.0006 | –2.70 | –1.28 |
|
| 29 | 156 | 27 | 0.993 | 0.0035 | –3.16 | 1.01 |
|
| 10 | 27 | 10 | 1.000 | 0.0007 | –3.69 | –0.12 |
|
| 26 | 73 | 21 | 0.975 | 0.0013 | –3.02 | –0.09 |
|
| 12 | 49 | 11 | 0.985 | 0.0012 | –1.25 | 0.45 |
| Aw group (13/17-y) | 14 | 39 | 14 | 1.000 | 0.0005 | –8.69 | –1.94 |
| Am group (17-y) | 12 | 30 | 12 | 1.000 | 0.0005 | –6.61 | –1.43 |
| Ae group (17-y) | 12 | 41 | 12 | 1.000 | 0.0006 | –5.35 | –1.70 |
| Cw group (13/17-y) | 18 | 52 | 18 | 1.000 | 0.0007 | –10.71 | –1.54 |
| Cm group (17-y) | 12 | 22 | 10 | 0.955 | 0.0003 | –4.60 | –2.03 |
| Ce group (17-y) | 9 | 15 | 9 | 1.000 | 0.0003 | –5.72 | –1.33 |
| Dw group (13/17-y) | 12 | 12 | 8 | 0.848 | 0.0001 | –4.02 | –2.09 |
| Dm group (13/17-y) | 18 | 37 | 14 | 0.954 | 0.0004 | –5.06 | –1.79 |
| De group (17-y) | 8 | 16 | 8 | 1.000 | 0.0003 | –4.61 | –1.81 |
Note.—N, sample size; S, number of polymorphic sites; H, number of haplotypes; π, nucleotide diversity; Hd, haplotype diversity.
P < 0.05. **P < 0.02. ***P < 0.001.
Sequence Divergence between Species Groups and Species.
| Comparison |
|
|
|---|---|---|
| Decim vs. Cassini | 0.082221 | 0.9492 |
| Cassini vs. Decula | 0.058879 | 0.9652 |
| Decula vs. Decim | 0.078477 | 0.9555 |
| Decim A vs. Decim B ( | 0.014650 | 0.9237 |
|
| 0.015066 | 0.9646 |
|
| 0.000682 | 0.3556 |
|
| 0.014670 | 0.9289 |
|
| 0.001390 | 0.3168 |
|
| 0.000006 | 0.0033 |
NS P > 0.05. *P < 0.00001.
. 2.Phylogenetic relationships of major nodes and divergence time based on the protein coding gene (PCG) data set. Bars in purple show 95% highest posterior density intervals. The four node supports shown are posterior probabilities of Bayesian inference and bootstrap percentages in maximum-likelihood (ML) analysis based on the PCG and mitogenome (MG) data sets. The OTU labels describe the sample number, species, brood number, and state of collection (supplementary table S1, Supplementary Material online). Phylogeographic subdivisions and life cycles are also shown.
. 3.Phylogenetic relationships in Decim. The two node supports shown are bootstrap percentages inferred from ML analysis (shown when > 70%) based on the PCG and MG data sets.
. 4.Phylogenetic relationships in (A) Cassini and (B) Decula. The two node supports shown are bootstrap percentages inferred from ML analysis (shown when > 70%) based on the PCG and MG data sets.
Test for Heterogeneity in Selection Regime within Each Species Group.
| Group |
|
| |
|---|---|---|---|
| One-ratio Model | Two-ratio Model ( | ||
| Decim | 0.114 | 0.081/0.149 | 0.082 |
| Decim A | 0.149 | 0.101/0.183 | 0.153 |
| Cassini | 0.075 | 0.098/0.049 | 0.078 |
| Decula | 0.118 | 0.110/0.123 | 0.819 |
Note.—B/F represents background branches with foreground branches.
. 5.Mismatch distributions (left) and Bayesian skyline plots (right). (A) Decim A, (B) Decim B (Magicicada tredecim), (C) Cassini, and (D) Decula were each calculated based on the PCG data set. The observed mismatch distribution is denoted by vertical bars, and the expected distribution under the population expansion model is represented by red lines. Harpending’s raggedness indices are shown. For Bayesian skyline plots, the median estimated effective population sizes (middle lines) are enclosed within the 95% highest posterior density intervals (shaded areas).