| Literature DB >> 22439699 |
Qi Zhou1, Hong-mei Zhu, Quan-fei Huang, Li Zhao, Guo-jie Zhang, Scott W Roy, Beatriz Vicoso, Zhao-lin Xuan, Jue Ruan, Yue Zhang, Ruo-ping Zhao, Chen Ye, Xiu-qing Zhang, Jun Wang, Wen Wang, Doris Bachtrog.
Abstract
BACKGROUND: Drosophila albomicans is a unique model organism for studying both sex chromosome and B chromosome evolution. A pair of its autosomes comprising roughly 40% of the whole genome has fused to the ancient X and Y chromosomes only about 0.12 million years ago, thereby creating the youngest and most gene-rich neo-sex system reported to date. This species also possesses recently derived B chromosomes that show non-Mendelian inheritance and significantly influence fertility.Entities:
Mesh:
Year: 2012 PMID: 22439699 PMCID: PMC3353239 DOI: 10.1186/1471-2164-13-109
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Karyotype of Drosophila albomicans. (A) Drosophila albomicans male (left) and female (right). Males have a silver patch on the frons. (B) Schematic karyotype of a male D. albomicans. The third autosome pair has fused to the ancestral sex chromosome pair and forms a neo-sex chromosome. Colors represent different ancestral Drosophila chromosome arms (Muller's elements). (C) B chromosomes of D. albomicans. The blue arrow marks a B chromosome.
Summary of the sequencing data and assembly statistics
| Assembly* | Reads Used | Total Length | Scaffold N50# |
|---|---|---|---|
| male reads | 24.9 Gb (124.5-fold) | 234 Mb | 9.9 kb |
| female reads | 5.7 Gb (28.5-fold) | 170 Mb | 12.6 kb |
| reference | 14.0 Gb (98.9-fold) | 183 Mb | 31.9 kb → 49.0 kb |
* There are three versions of initial assemblies: one produced using only male reads, one using only female reads, and both of these assemblies were used to extract candidate B- or neo-Y linked sequences. The reference assembly was produced using all the female reads and part of the male reads, using parameters to maximize the N50 statistics. The reference assembly was subjected to further optimization (designated by '→' in Table 1) by comparative analysis with other Drosophila genomes and gene annotations. # N50: the length L where 50% of all nucleotides in the assembly are contained in contigs/scaffolds of size ≥ L.
Figure 2Patterns of heterozygosity across chromosomes. SNP densities along each chromosome were calculated within a 1 Mb window with a step size of 100 kb. (A) SNP densities calculated based on mapping of male reads. (B) SNP densities calculated based on mapping results of female reads.
Comparison of neo-sex chromosomes with other chromosomes
| Chromosome | chrX (Muller A) | neo-sex (Muller C + D) | chr2 (Muller B + E) | chr4 (Muller F)† |
|---|---|---|---|---|
| exonic SNP density *(♂/♀) | 2.04/1.01(2.02) | 4.61/0.97(4.75) | 4.35/3.33(1.30) | 0.18/0.01(18) |
| intronic SNPs density *(♂/♀) | 2.58/0.77(3.35) | 5.57/0.86(6.47) | 5.84/2.75(2.12) | 0.99/0.03(33) |
| putative pseudogenes No./total# | 11/2055 | 80/4751 | 54/5056 | 1/87 |
| 1.03/0.42 | 2.02/0.58 | 1.89/1.26 | 0.73/0.16 | |
| 9.48/4.56 | 19.56/6.82 | 18.11/14.3 | 29.13/10.87 | |
| Tandem Duplications (%)|| | 13.8/5.6 | 20.7/5.5 | 15.3/3.79 | 14.9/5.8 |
* SNP density (sites/kb) was calculated using exon or intron regions of scaffolds mapping to the respective chromosomes, and shown as SNP density in male/SNP density in female, and their ratio in parentheses.
# Male-specific putative pseudogenes were identified by premature stop codons, frameshifts or mutations at start codons. Shown are the pseudogene number/total mapped genes on each chromosome.
§ Shown are nonsynonymous substitutions calculated in male/female. Sd.: standard deviation.
|| Shown are proportions of tandem duplications among all the SV events for each chromosome identified in male/female.
†Due to the low number of genes on chr4, inferences on this chromosome may be strongly affected by sampling effects.
Figure 3Structural variation in the . (A) The male to female ratio of different structural variants (insertions, deletions, tandem duplication, dispersed duplications) on each chromosome. (B) The proportion of tandem duplications overlapping with protein-coding regions on each chromosome for males and females (no tandem duplications on the fourth chromosome overlapping coding regions were observed in females).
Figure 4B-specific sequences of . (A) Validation of the identified B-linked probe with Southern Blotting. Up panel: Genomic DNAs digested with HaeII. Lanes 3-5,10 show hybridization results with genomic DNA from D. albomicans strains without B's. Lane 11 is the negative control lane using genomic DNA of D. melanogaster. Other lanes show hybridizations with DNA from D. albomicans strains containing B's. Arrows indicate differential bands between B and no-B strains. Down panel: Genomic DNAs digested with MspI. Lanes 3-6 show D. albomicans strains without B's. Lane 8 is the negative control and other lanes are D. albomicans strains with B's. (B) PCR assays using B-chromosome derived primers. Lane 2-5 show PCR products obtained from D. albomicans strains without B's, and lanes 6-9 are from strains with B's. Lane 1 is the DL2000 marker.