| Literature DB >> 27489210 |
Alejandro Sanchez-Flores1, Fernando Peñaloza1, Javier Carpinteyro-Ponce2, Nestor Nazario-Yepiz3, Cei Abreu-Goodger3, Carlos A Machado4, Therese Ann Markow5.
Abstract
We report genomes of two species of cactophilic Drosophila: Drosophila arizonae and D. navojoa These two are the closest relatives of D. mojavensis, forming the D. mojavensis cluster. D. mojavensis and D. arizonae diverged from D. navojoa ∼5.8 Mya, while the split between D. arizonae and D. mojavensis is more recent, at 1.5 Mya. Together the three genomes provide opportunities to examine genomic changes associated with speciation and host shifts in this ecologically defined group of flies. The three species are also separated by fixed inversion differences in three of their six chromosomes. While the levels of nucleotide divergence in the colinear chromosomes are significantly lower than in the inverted chromosomes, consistent with a past role of the inversions in preventing gene flow, the patterns differ among the inverted chromosomes when the locations of nucleotides inside or outside of the inversions are considered. For Muller element E, there is greater divergence external to the inversion breakpoints. For Muller A, the divergence is slightly higher inside the inversions, while for Muller B, the breakpoints and hence the difference in substitutions in relation to the inversions could not be determined. The differences among the inverted chromosomes, especially once the breakpoints are clearly established, could aid in dating the origins of the inversions.Entities:
Keywords: Drosophila; cactophilic; evolution; genome; inversions
Mesh:
Year: 2016 PMID: 27489210 PMCID: PMC5068933 DOI: 10.1534/g3.116.033779
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Figure 1Fixed Inversion Differences among the three species in the Drosophila mojavensis cluster and indication of Muller elements that are colinear (C, D, and F) or that have fixed chromosomal inversions between species (after Wasserman 1962, 1992). Colinear chromosomes are in black, while red, blue, and green are fixed inversion differences among the three species.
Sequencing statistics and quality control, and assembly statistics
| Sequencing statistics and quality control | ||
| Read length (bases) | 2 × 100 | 2 × 100 |
| PE reads | 97,359,954 | 158,820,658 |
| PE insert size (bases) | 145 ± 60 | 257 ± 60 |
| MP reads | 37,994,866 | 42,353,652 |
| MP insert size (bases) | 1717 ± 796 | 2345 ± 489 |
| Estimated genome size (Mb) | 142.06 | 148.68 |
| Estimated genome coverage | 52× | 81× |
| GC content (%) | 42 | 39 |
| Repetitiveness (%) | 10 | 17 |
| Assembly statistics | ||
| Bases in the assembly (Mb) | 141.37 | 115.88 |
| Bases in Muller elements (Mb) | 132.58 | 95.27 |
| Total no. of scaffolds | 3179 | 8054 |
| Average scaffold size (kb) | 44.47 | 14.39 |
| Shortest scaffold length (bases) | 886 | 867 |
| N50 (Mbases / no. scaffolds) | 2.65 / 3 | 2.18 / 3 |
| N90 (kbases / no. scaffolds) | 71.37 / 7 | 4.31 / 929 |
| CEGMA complete / partial (%) | 92.42 / 97.58 | 78.23 / 84.68 |
| Muller element coverage (%) | 92.35 | 82.21 |
Statistic including scaffolds ordered and linked into Muller elements.
Protein-coding gene models and amino acid sequence statistics
| Predicted protein-coding genes | 12,129 | 10,695 | 15,015 |
| Average gene length (bases) | 2176.23 | 2029.55 | 2138.78 |
| Average protein length (aa) | 724.409 | 675.517 | 711.928 |
| Average transcript length (bases) | 2638.08 | 2499.65 | 2677.92 |
| Average exon length (bases) | 385.915 | 371.348 | 374.89 |
| Average exon number per gene | 5.63913 | 5.46536 | 5.70509 |
Average synonymous (dS) and nonsynonymous (dN) substitutions per site and dN/dS ratios for genes located in each Muller element in D. arizonae (ar), D. mojavensis (mo), and D. navojoa (na)
| dN | dS | dN/dS | |||||||
|---|---|---|---|---|---|---|---|---|---|
| Element | ar-mo | ar-na | mo-na | ar-mo | ar-na | mo-na | ar-mo | ar-na | mo-na |
| Muller A | 0.010 ↑ | 0.030 ↑ | 0.031 ↑ | 0.055 | 0.162 ↑ | 0.165 ↑ | 0.165 | 0.162 | 0.164 |
| Muller B | 0.016 ↑ | 0.038 ↑ | 0.035 ↑ | 0.080 ↑ | 0.198 ↑ | 0.199 ↑ | 0.161 | 0.150 | 0.147 |
| Muller E | 0.011↑ | 0.022 | 0.023 | 0.069 ↑ | 0.158 | 0.158 | 0.147 | 0.128 | 0.129 |
| Muller C | 0.008 | 0.022 | 0.022 | 0.055 | 0.151 | 0.148 | 0.163 | 0.118 | 0.145 |
| Muller D | 0.009 | 0.022 | 0.023 | 0.058 | 0.151 | 0.152 | 0.159 | 0.139 | 0.141 |
↑ indicates chromosomes with significantly higher divergence levels (Mann–Whitney U-test; P < 0.05); comparisons were performed between each Muller element and the combined inverted or colinear chromosomes.
Chromosomes with fixed inversion differences between D. arizonae and D. mojavensis.
Figure 3Divergence times within the D. repleta group with D. virilis as an external group. The node dates (in red) were estimated under an uncorrelated log-normal relaxed clock. The 95% highest posterior density intervals (in blue) are shown for each node.
Figure 2Distribution of nonsynonymous (dN) and synonymous (dS) substitution rates per site and dN/dS ratios for ortholog genes in each Muller element among pairwise comparisons of Drosophila mojavensis, D. arizonae, and D. navojoa. Muller elements A, B, and E (gray bars) have fixed inversion differences among species. Muller elements C and D (white bars) are colinear in the three species. The number of genes used in each comparison are shown in Table S4.
Average synonymous (dS) and nonsynonymous (dN) substitutions per site and dN/dS ratios for genes located inside (IR) or outside (CR) the fixed inverted region in Muller elements A and E in D. arizonae (ar), D. mojavensis (mo), and D. navojoa (na)
| dN | dS | dN/dS | |||||||
|---|---|---|---|---|---|---|---|---|---|
| ar-mo | ar-na | mo-na | ar-mo | ar-na | mo-na | ar-mo | ar-na | mo-na | |
| Muller A | |||||||||
| Genes in IR | 346 | 387 | 387 | 346 | 387 | 387 | 346 | 387 | 387 |
| Genes in CR | 666 | 760 | 759 | 666 | 760 | 759 | 666 | 760 | 759 |
| Median for IR | 0.005 | 0.016 | 0.016 | 0.049 | 0.149 | 0.149 | 0.108 | 0.110 | 0.118 |
| Median for CR | 0.004 | 0.014 | 0.014 | 0.048 | 0.149 | 0.148 | 0.109 | 0.104 | 0.108 |
| 0.234 | 0.382 | 0.268 | 0.094 | 0.679 | 0.966 | 0.720 | 0.379 | 0.308 | |
| Muller E | |||||||||
| Genes in IR | 859 | 930 | 946 | 859 | 930 | 946 | 859 | 930 | 946 |
| Genes in CR | 277 | 310 | 312 | 277 | 310 | 312 | 277 | 310 | 312 |
| Median for IR | 0.005 | 0.012 | 0.012 | 0.0616 | 0.1487 | 0.148 | 0.098 | 0.09 | 0.092 |
| Median for CR | 0.005 | 0.013 | 0.014 | 0.0561 | 0.1628 | 0.1614 | 0.103 | 0.092 | 0.088 |
| 0.591 | 0.420 | 0.309 | 0.0002 | 0.0001 | 0.0005 | 0.622 | 0.501 | 0.903 | |
Significantly different (P < 0.05).
Figure 4The number of putatively selected genes in each lineage is shown on each branch. The table on the right shows the numbers separated by Muller element, highlighting in red the chromosomes with fixed inversion differences between D. mojavensis and D. arizonae.