| Literature DB >> 25806526 |
Tingcai Cheng1, Bohua Fu1, Yuqian Wu1, Renwen Long1, Chun Liu1, Qingyou Xia1.
Abstract
The wild silkworm Bombyx mandarina is widely believed to be an ancestor of the domesticated silkworm, Bombyx mori. Silkworms are often used as a model for studying the mechanism of species domestication. Here, we performed transcriptome sequencing of the wild silkworm using an Illumina HiSeq2000 platform. We produced 100,004,078 high-quality reads and assembled them into 50,773 contigs with an N50 length of 1764 bp and a mean length of 941.62 bp. A total of 33,759 unigenes were identified, with 12,805 annotated in the Nr database, 8273 in the Pfam database, and 9093 in the Swiss-Prot database. Expression profile analysis found significant differential expression of 1308 unigenes between the middle silk gland (MSG) and posterior silk gland (PSG). Three sericin genes (sericin 1, sericin 2, and sericin 3) were expressed specifically in the MSG and three fibroin genes (fibroin-H, fibroin-L, and fibroin/P25) were expressed specifically in the PSG. In addition, 32,297 Single-nucleotide polymorphisms (SNPs) and 361 insertion-deletions (INDELs) were detected. Comparison with the domesticated silkworm p50/Dazao identified 5,295 orthologous genes, among which 400 might have experienced or to be experiencing positive selection by Ka/Ks analysis. These data and analyses presented here provide insights into silkworm domestication and an invaluable resource for wild silkworm genomics research.Entities:
Mesh:
Year: 2015 PMID: 25806526 PMCID: PMC4373670 DOI: 10.1371/journal.pone.0122837
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Summary of transcripts for Bombyx mandarina.
| Number (percentage) | ||
|---|---|---|
| Length(bp) | Transcript (bp) | Unigene (bp) |
|
| 14,829(29.21%) | 12,428(36.81%) |
|
| 11,338(22.33%) | 8,488(25.14%) |
|
| 9,588(18.88%) | 5,694(16.87%) |
|
| 8,332(16.41%) | 4,117(12.20%) |
|
| 6,686(13.17%) | 3,032(8.98%) |
|
| 47,808,990 | 25,730,955 |
|
| 50,773 | 33,759 |
|
| 39.91% | 39.99% |
|
| 478 | 374 |
|
| 941.62 | 762.20 |
|
| 1,764 | 1,437 |
Fig 1Gene ontology (GO) annotation.
Fig 2Clusters of orthologous group (COG) annotation.
Summary of expression related to silk proteins between MSG and PSG.
| WBm_unigene | length | FPKM_MSG | FPKM_PSG | logFC | PValue | FDR | Bm_CDS | Ka/Ks | Putative function |
|---|---|---|---|---|---|---|---|---|---|
| comp4655_c0_seq1 | 562 | 21.07 | 1580.95 | 6.035 | 1.29E-13 | 9.94E-12 | Fibroin-H | ||
| comp11395_c0_seq1 | 1362 | 1298.87 | 137786.13 | 6.534 | 3.26E-15 | 3.24E-13 | BGIBMGA009393 | 0.6473 | fibroin-L |
| comp8545_c1_seq1 | 1353 | 182.40 | 31454.66 | 7.235 | 2.73E-17 | 3.76E-15 | BGIBMGA001347 | 0.1938 | fibroin/P25 |
| comp14614_c0_seq5 | 2173 | 8339.87 | 6.45 | -10.532 | 4.54E-27 | 4.99E-24 | sericin 1 | ||
| comp14333_c2_seq1 | 1846 | 7606.27 | 6.60 | -10.366 | 1.51E-26 | 1.03E-23 | BGIBMGA011901 | 1.2050 | sericin 2 |
| comp17080_c0_seq1 | 480 | 5175.15 | 6.27 | -9.877 | 1.03E-24 | 4.19E-22 | BGIBMGA012002 | 1.8625 | sericin 3 |
| comp13620_c0_seq1 | 3130 | 65.03 | 408.04 | 2.453 | 3.71E-04 | 4.75E-03 | BGIBMGA006216 | alanine-tRNA ligase | |
| comp4656_c0_seq1 | 2624 | 121.56 | 986.25 | 2.825 | 5.35E-05 | 9.04E-04 | BGIBMGA007637 | 0.0198 | glycine-tRNA ligase |
| comp17114_c0_seq1 | 1334 | 530.01 | 0.43 | -10.453 | 3.77E-25 | 1.74E-22 | BGIBMGA009261 | fibrohexamerin | |
| comp5260_c0_seq1 | 1067 | 3.18 | 0.05 | -5.883 | 1.63E-06 | 3.95E-05 | BGIBMGA009261 | fibrohexamerin | |
| comp7638_c0_seq1 | 1509 | 1118.49 | 0.75 | -10.727 | 1.15E-26 | 8.65E-24 | BGIBMGA009261 | 1.6506 | fibrohexamerin |
| comp8883_c0_seq1 | 954 | 24544.0 | 15.54 | -10.819 | 6.09E-28 | 1.46E-24 | BGIBMGA009261 | fibrohexamerin |
Summary of SNPs and Indels for Bombyx mandarina.
| Length (bp) | SNP (num) | SNP_density (bp/num) | Indel (num) | Indel_density (bp/num) | |
|---|---|---|---|---|---|
|
| 25,730,955 | 32,297 | 797 | 361 | 71,277 |
|
| 11,584,893 | 19,783/16,074/3,709 | 586 | 14 | 827,492 |
|
| 4,051,613 | 4,974 | 815 | 179 | 22,635 |
|
| 1,640,684 | 2,024 | 811 | 54 | 30,383 |
*Total number/ synonymous site number / nonsynonymous number
Fig 3Distribution of Ka and Ks values.
Above the black line, orthologous pairs with Ka/Ks ratio >1; between black and gray lines, pairs with Ka/Ks ratio 0.5–1.
Fig 4Distribution of KEGG biological categories of positively selected and all unigenes in pathways.
(A) Distribution of five classes of positively selected and all unigenes in pathways; (B) Metabolism shows distribution of six classes. (a) Distribution of positively selected unigenes in pathways, (b) Distribution of all unigenes in pathways.
KEGG pathway enrichment analysis of positive selection.
| KEGG pathway | Unigene | Bm_orthologs | Ka/Ks | Putative function |
|---|---|---|---|---|
| Glycosaminoglycan biosynthesis—chondroitin sulfate / dermatan sulfate (ko00532) | comp12639_c0_seq1 | BGIBMGA007765 | 0.5734 | beta-1,3-galactosyltransferase |
| comp1835_c0_seq1 | BGIBMGA003101 | 0.5113 | carbohydrate sulfotransferase | |
| comp35605_c0_seq1 | BGIBMGA012390 | 0.5474 | chondroitin sulfate synthase | |
| Circadian rhythm—fly (ko04711) | comp15415_c0_seq2 | BGIBMGA007304 | 0.9945 | double-time protein |
| comp18252_c0_seq1 | BGIBMGA000486 | 0.5633 | period circadian protein (PER) | |
| comp6926_c0_seq1 | BGIBMGA000498 | 0.7571 | circadian locomoter output cycles kaput protein (COLCK) | |
| RIG-I-like receptor signaling pathway (ko04622) | comp11847_c0_seq1 | BGIBMGA003954 | 0.5546 | autophagy-related protein Atg12 |
| comp12369_c0_seq1 | BGIBMGA000198 | 0.5607 | TANK-binding kinase | |
| comp6686_c1_seq1 | BGIBMGA000679 | 0.9135 | rotamase Pin1 |