| Literature DB >> 23496955 |
Guoqing Pan1, Jinshan Xu, Tian Li, Qingyou Xia, Shao-Lun Liu, Guojie Zhang, Songgang Li, Chunfeng Li, Handeng Liu, Liu Yang, Tie Liu, Xi Zhang, Zhengli Wu, Wei Fan, Xiaoqun Dang, Heng Xiang, Meilin Tao, Yanhong Li, Junhua Hu, Zhi Li, Lipeng Lin, Jie Luo, Lina Geng, LinLing Wang, Mengxian Long, Yongji Wan, Ningjia He, Ze Zhang, Cheng Lu, Patrick J Keeling, Jun Wang, Zhonghuai Xiang, Zeyang Zhou.
Abstract
BACKGROUND: Microsporidian Nosema bombycis has received much attention because the pébrine disease of domesticated silkworms results in great economic losses in the silkworm industry. So far, no effective treatment could be found for pébrine. Compared to other known Nosema parasites, N. bombycis can unusually parasitize a broad range of hosts. To gain some insights into the underlying genetic mechanism of pathological ability and host range expansion in this parasite, a comparative genomic approach is conducted. The genome of two Nosema parasites, N. bombycis and N. antheraeae (an obligatory parasite to undomesticated silkworms Antheraea pernyi), were sequenced and compared with their distantly related species, N. ceranae (an obligatory parasite to honey bees).Entities:
Mesh:
Substances:
Year: 2013 PMID: 23496955 PMCID: PMC3614468 DOI: 10.1186/1471-2164-14-186
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
A comparison of genome features among three species (, ) and two species (and )
| Chromosomes(bands) | 18 | ~12 | ND | 11 | 6 |
| Assembly(Mbp) | 15.7 | 6.6 | 7.9 | 2.9 | 3.86 |
| Genomic coverage | 100% | ~89% | 90% | 86% | ~64% |
| Scaffold Num | 1,605 | 6,215 | 5,465 | 11 | 1,646 |
| N50(bp) | 57,394 | 1,883 | 2,902 | ND | 2,349 |
| Largest scaffold length(bp) | 571,060 | 53,183 | 65,607 | 209,983 | 204,069 |
| G + C content (%) | 31 | 28 | 27 | 48 | 26 |
| No .of CDS | 4,458 | 3,413 | 2,614 | 1,997 | 3,632 |
| Mean CDS length (bp) | 741 | 775 | 904 | 1,077 | 995 |
| GenBank No. | NA30919 | NA183977 | NA32971 | NA155 | NA21011 |
Figure 1Venn diagram showing the number of homologous genes and lineage-specific genes amongst three species, , , and . The arabic numbers followed by characters represent the number of homologous genes in each Nosema species (‘a’ denotes N. antheraeae, ‘b’ denotes N. bombycis , and ‘c’ denotes N. ceranae). For instance, 2455b:1746a:1459c means that 2455 genes of N. bombycis, 1746 genes of N. antheraeae, and 1459 genes of N. ceranae are homologous to each other.
Classification of repetitive families in genome
| DNA | hAT | 1,011,459 | 6.45 |
| | Merlin | 470,573 | 3.00 |
| | PiggyBac | 441,876 | 2.82 |
| | TcMar | 786,733 | 5.02 |
| | MuDR | 109,681 | 0.70 |
| | others | 58,866 | 0.38 |
| LTR | Gypsy | 577,653 | 3.68 |
| | others | 33,635 | 0.21 |
| LINE | Dong-R4 | 162,622 | 1.04 |
| | others | 59,703 | 0.38 |
| Rolling-circle | Helitron | 102,334 | 0.65 |
| SINE | — | 28,669 | 0.18 |
| Unknown | — | 2,204,497 | 14.06 |
| Total | — | 6,048,301 | 38.57 |
Figure 2A maximum-likelihood phylogenetic tree of host-derived transposase sequences. Arrows show the putative recent horizontal gene transfer (HGT) events of host-derived transposable elements. Several transposons are closely related to those from insects. Black boxes indicate elements from the two silkworm-infecting Nosema species, while white boxes indicate elements from the domesticated silkworm B. mori. Numbers in parentheses indicate the total copy numbers for each transposable element.
Figure 3Horizontal gene transfers of protein-coding genes in . (A) Venn diagram showing the numbers of HGT genes between two different dataset that were identified using two different methods, the Darkhorse method and the phylogenetic method. The total number of the union of HGT genes between two dataset is 55. (B) The diagram showing the origination of those 55 HGT genes. All of them originated from prokaryotes.
Figure 4Gene duplications and the dS distribution of paralogs and orthologs among three species. Abbreviation: Na, N. antheraeae; Nb, N. bombycis; Nc, N. ceranae. (A) A circos map showing the comparative genomics among three different Nosema species based on all available scaffolds. Each line represents the homologous syntenic regions between any two species or between any given two chromosome positions of single species. Many lines across different scaffolds of N. bombysis indicates higher rate of segmental syntenic duplications. (B) The dS distribution of segmental paralogs of Nb and the orthologs between Nb and Na showing a higher dS values in orthologs in general. Notably, a higher peak (arrow) seen in Nb suggests the possibility of a burst of paralogs recently over the Nb evolution after the separation of Na and Nb. (C) An example of syntenic comparisons among three Nosema species showing a cluster of tandem paralogs. The corresponding genetic position and names of identified element are provided in Additional file 8. The number of all identified tandem paralogs for each Nosema genome is summarized on the right side. (D) The dS distribution of tandem paralogs of Nb and orthologs between Na and Nb showing that majorities of tandem paralogs arose after the separation of Na and Nb because the dS values of those tandem paralogs are smaller than that of orthologs.
Figure 5An example from the syntenic analysis showing that often consist of two homologous regions, instead of one such as . When summarized the number of syntenic regions in both N. antheraeae and N. bombycis, the number of paralogous syntenic region of N. bombycis is often twice more than that of orthologous syntenic region between N. antheraeae and N. bombycis, indicating that large segmental duplication events have occurred over the evolution of N. bombycis.
Site test of adaptive evolution for paralogous genes
| Hypothetical protein | CPG199 | 4 | DD | 17.30*** | 17.68*** | 17.07*** | 6.55472 |
| Hypothetical protein | CPG293 | 4 | SD,DD | 7.34* | 7.58* | 6.68* | 67.43545 |
| Hypothetical protein | CPG446 | 3 | TD | 15.34*** | 16.15*** | 16.52*** | 96.64162 |
| Hypothetical protein | CPG767 | 3 | TD,DD | 13.80** | 13.92*** | 13.26** | 12.30571 |
| Surface adhesion protein | CPG844 | 3 | SD, DD | 21.04*** | 21.03*** | 22.81*** | 6.72603 |
| Serine protease inhibitor 106 | CPG945 | 3 | TD,DD | 5.85 | 5.92 | 5.8 | 19.13308 |
| Serine protease inhibitor 106 | CPG1974 | 2 | DD | 15.81*** | 16.31** | 15.81*** | 52.1588 |
| LPXTG-motif cell wall anchor domain protein+ | CPG1776 | 2 | SD | 14.38*** | 14.39** | 14.38*** | 20.02006 |
| DnaJ homolog subfamily C member 9 | CPG1792 | 2 | SD | 13.76** | 13.85*** | 19.87*** | 57.65414 |
| NIK and IKK(beta) binding protein | CPG1878 | 2 | SD | 12.22** | 12.59** | 14.57*** | 36.70677 |
| Replication-associated protein | CPG2140 | 2 | DD | 18.67*** | 18.76*** | 21.14*** | 15.15205 |
| Glucan endo-1,6-beta-glucosidase | CPG1640 | 2 | DD | 5.72 | 5.72 | 9.64** | 61.53906 |
| Integrator complex subunit 4 | CPG2120 | 2 | SD | 8.46* | 8.59* | 8.22* | 14.58649 |
| Hypothetical protein | CPG961 | 3 | DD | 21.04*** | 22.99*** | 19.99*** | 9.5982 |
| Hypothetical protein | CPG1520 | 2 | SD | 15.28*** | 15.37*** | 15.13*** | 8.84271 |
| Hypothetical protein | CPG1637 | 2 | SD | 18.44*** | 18.75*** | 20.57*** | 14.62068 |
| Hypothetical protein | CPG987 | 3 | DD | 13.65** | 13.37** | 13.11** | 39.08442 |
| Hypothetical protein | CPG1865 | 2 | SD | 10.90** | 10.94** | 10.70** | 23.75418 |
| Hypothetical protein | CPG1967 | 2 | SD | 6.06* | 6.81* | 5.48 | 29.25563 |
| Hypothetical protein | CPG2026 | 2 | DD | 8.28* | 8.41* | 15.98*** | 15.9724 |
| Hypothetical protein | CPG2083 | 2 | DD | 29.01*** | 29.00*** | 31.16*** | 61.60344 |
| hypothetical protein | CPG775 | 3 | SD | 14.05*** | 14.05*** | 12.37** | 5.3171 |
| Hypothetical protein | CPG2128 | 2 | SD | 17.79*** | 17.85*** | 19.27*** | 45.49562 |
| Hypothetical protein | CPG869 | 3 | SD, DD | 17.45*** | 17.85*** | 17.31*** | 8.16665 |
Note: SD, segmental; TD, Tandem; DD, Disperse; M1 vs. M2, LRT statistic for model M1 versus M2; M7 vs. M8, LRT for model M7 versus M8; * Significance with P < 0.05; ** Significance with P < 0.01; *** Significance with P < 0.001.
Figure 6A hypothetical model showing how the SPN protein of suppresses the serine protease cascade of the melanization pathway of the host . After the suppression of the serine protease cascade, the defensive response, the subsequent formation of melanization will be inhibited in the hosts. Abbreviation: PPO, prophenoloxidase; β-GRP, β-glucan recognition protein.