| Literature DB >> 29617732 |
Shenglong Li1, Masahiro Ajimura1, Zhiwei Chen1, Jianqiu Liu1, Enxiang Chen1, Huizhen Guo1, Vidya Tadapatri2, Chilakala Gangi Reddy2, Jiwei Zhang1, Hirohisa Kishino3, Hiroaki Abe3, Qingyou Xia1, Kallare P Arunkumar2, Kazuei Mita1.
Abstract
Notwithstanding the rapid developments in sequencing techniques, Y and W sex chromosomes have still been mostly excluded from whole genome sequencing projects due to their high repetitive DNA content. Therefore, Y and W chromosomes are poorly described in most species despite their biological importance. Several methods were developed for identifying Y or W-linked sequences among unmapped scaffolds. However, it is not enough to discover functional regions from short unmapped scaffolds. Here, we provide a new and simple strategy based on k-mer comparison for comprehensive analysis of the W chromosome in Bombyx mori. Using this novel method, we effectively assembled de novo 1281 W-derived genome contigs (totaling 1.9 Mbp), and identified 156 W-linked transcript RNAs and 345 W-linked small RNAs. This method will help in the elucidation of mechanisms of sexual development and exploration of W chromosome biological functions, and provide insights into the evolution of sex chromosomes. Moreover, we showed this method can be employed in identifying heterogametic sex chromosomes (W and Y chromosomes) in many other species where genomic information is still scarce.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29617732 PMCID: PMC6105102 DOI: 10.1093/dnares/dsy010
Source DB: PubMed Journal: DNA Res ISSN: 1340-2838 Impact factor: 4.458
Figure 1Distribution of calculated KQs. We counted the KQ value for each 15-mer and constructed a distribution histogram for B. mori, D. melanogaster and An. gambiaIt is apparent that most autosome-derived, Z specific or W unique 15-mers have distinctive KQs. Autosome derived 15-mers have a KQ value around 1 (mainly from 0.3 to 1.8), Z or X derived 15-mers have a KQ value around 2 (mainly from 1.8 to 2.2) and W or Y derived 15-mers have a KQ value around 0. Thus, we can easily identify W specific 15-mers by the KQ value.
Figure 2Identification of Y-linked sequences in D. melanogaster. (A) We applied KQ method in genome sequences of D. melanogaster and counted length and AYK value for each scaffold. For the vast majority of Autosome (green point) and X-derived (blue point) scaffolds, their AYK values are less than 10 (usually close to 0); Y-derived scaffolds (red point) were with AYK value more than 10. Unmapped scaffolds (grey point) have dispersive AYK values, we speculated that scaffolds with AYK value more than 10 are possible Y-linked. (B) Compared with the CQ and YGS methods, KQ method could identify more precise Y-derived scaffolds in D. melanogaster. Based on the analysis of scaffold length and number, the results of three methods were similar when the length of scaffold is more than 5 kb, whereas KQ method (red pillar) could identify more precisely Y-linked scaffolds when the length of scaffold is less than 5 kb.
Figure 3Female-specific PCR amplification of de novo assembled contigs (A) and BAC end sequences (B). (A) PCR performed with three female and three male genomic DNA samples shows female-specific amplification of 10 W-contigs. Chr 2 sequence was amplified as a control in both female and male genomic DNA confirming the integrity of the genomic DNA samples. (B) PCR carried out with three female and three male genomic DNA samples shows female-specific amplification of 10 BAC end sequences.
Figure 4W-derived transcripts, expression profiles and genomic PCR amplification. (A) Hierarchical cluster analysis of W-derived transcripts. Expression profiles of transcripts in mixed embryos (0–24 hpo) were clustered and visualized in a heatmap. (B) PCR amplification of W-derived transcripts in cDNA (Left panel) and gDNA (Right panel). RT-PCR (cDNA) showed expression as cluster A (female embryos specific) or cluster B (in both female and male embryos). However, a PCR test of genomic DNA showed both transcript RNAs were female-specific, indicating that they are W-derived, but cluster A RNAs are zygotically expressed in female embryos whereas cluster B RNAs are probably maternal RNAs deposited into eggs. (C) qPCR of a cluster B RNA (DN42314_c0) from 0 to 36 hpo exhibited its sex-biased expression in embyos.
Figure 5Length distribution graph of W-linked small RNAs and female-specific RT-PCR amplification. (A) The lengths of most W-derived small RNAs were concentrated on around 27–29 nt and their first bases from the 5’ end were preferentially T (U), which indicated most W-derived small RNAs are piRNAs. (B) Female-specific RT-PCR amplifications indicated that all five small RNAs were indeed W-derived.