| Literature DB >> 27653464 |
Shuai Wang1, Sen Wang2,3, Yingfeng Luo2, Lihua Xiao4, Xuenong Luo1, Shenghan Gao2, Yongxi Dou1, Huangkai Zhang2,3, Aijiang Guo1, Qingshu Meng2, Junling Hou1, Bing Zhang5, Shaohua Zhang1, Meng Yang2, Xuelian Meng1, Hailiang Mei2, Hui Li1, Zilong He2, Xueliang Zhu1, Xinyu Tan2, Xing-Quan Zhu1, Jun Yu2, Jianping Cai1, Guan Zhu6, Songnian Hu2, Xuepeng Cai1.
Abstract
Taenia saginata, Taenia solium and Taenia asiatica (beef, pork and Asian tapeworms, respectively) are parasitic flatworms of major public health and food safety importance. Among them, T. asiatica is a newly recognized species that split from T. saginata via an intermediate host switch ∼1.14 Myr ago. Here we report the 169- and 168-Mb draft genomes of T. saginata and T. asiatica. Comparative analysis reveals that high rates of gene duplications and functional diversifications might have partially driven the divergence between T. asiatica and T. saginata. We observe accelerated evolutionary rates, adaptive evolutions in homeostasis regulation, tegument maintenance and lipid uptakes, and differential/specialized gene family expansions in T. asiatica that may favour its hepatotropism in the new intermediate host. We also identify potential targets for developing diagnostic or intervention tools against human tapeworms. These data provide new insights into the evolution of Taenia parasites, particularly the recent speciation of T. asiatica.Entities:
Year: 2016 PMID: 27653464 PMCID: PMC5036155 DOI: 10.1038/ncomms12845
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Genomic features of T. saginata and T. asiatica in comparison with other worms.
| Assembly size (Mb) | 169 | 168 | 131 | 114 | 141 | 365 | 100 |
| GC content (%) | 43.2 | 43.2 | 43.5 | 42.2 | 36.0 | 35.2 | 35.4 |
| Coding genes number | 13,161 | 13,323 | 11,902 | 10,506 | 10,141 | 10,772 | 20,469 |
| Average gene length (Kb) | 6.0 | 5.9 | 4.6 | 5.4 | 6.1 | 15.4 | 3.1 |
| Protein length (aa) | 464 | 466 | 444 | 505 | 490 | 477 | 453 |
| Gene density (genes per Mb) | 77.9 | 79.3 | 90.9 | 92.2 | 71.9 | 29.5 | 201.0 |
| Number of exons per gene | 6.2 | 6.2 | 6.6 | 6.8 | 6.4 | 6.5 | 6.4 |
| Mean length of exons (bp) | 237 | 244 | 237 | 220 | 229 | 226 | 212 |
| Number of introns per gene | 5.2 | 5.2 | 5.6 | 5.8 | 5.4 | 5.5 | 5.4 |
| Mean length of introns (bp) | 864 | 831 | 775 | 684 | 862 | 2,460 | 354 |
| GC content of exon (%) | 49.7 | 49.6 | 50.2 | 50.0 | 44.3 | 36.0 | 42.6 |
| GC content of intron (%) | 41.5 | 41.2 | 40.8 | 39.9 | 34.6 | 34.7 | 32.5 |
| Repeat content (%) | 10.4 | 10.9 | 18.1 | 10.9 | 7.6 | 40.0 | 17.0 |
| tRNA number | 339 | 353 | 162 | 856 | 44 | 153 | 966 |
*T. solium v2 genome (China isolate) properties and gene models.
Figure 1Unique intron features in the tapeworm genomes.
(a) Bimodal length distributions of short introns in tapeworms (T. asiatica, T. saginata, T. solium and E. multilocularis) and monogenean G. salaris in comparison with the unimodal distributions in the fluke S. mansoni and the free-living flatworm S. mediterranea. (b) Preferential distributions of peak-1 and peak-2 introns toward 3′-end and middle regions of genes, respectively. The curves show the relationship between intron densities and relative positions from the 5′-ends of genes. (c) Peak-2 intron-containing genes are highly enriched to certain functional groups (shown for T. asiatica, by two-sided Fisher's exact test). (d) Length preference of introns flanking small exons (<400 bp; shown for T. asiatica). The minimal mean length of (5′ or 3′) introns flanking small exons that have a specific length (<400 bp) is 370 bp in T. asiatica.
Figure 2Homologues and synteny between T. asiatica and related tapeworms.
(a) Homologous genes shared between T. asiatica and other tapeworms (that is, T. saginata, T. solium, E. multicularis and H. microstoma). (b) Gene block linkages between T. asiatica and T. saginata. The collinear gene blocks determined by MCScan between genome scaffolds (>1 Mb) represent 7,212 and 7,201 genes for T. asiatica and T. saginata, respectively.
Figure 3Gene duplications revealed by Ks analysis.
(a) The percentage of paralogous gene pairs of duplicated genes along with Ks values are typically L shaped in the T. saginata (Tsa) and T. asiatica (Tas) genomes, indicating the occurrence of continuous gene duplication events and the losses of duplicated genes over the time. (b) Distribution of Ks values in orthologous genes with peaks indicating the splits between various flatworms T. asiatica (Tas), T. saginata (Tsa), T. solium (Tso), E. granulosus (Egr), E. multilocularis (Emu), S. mansoni (Sma) and S. japanicum (Sja). The grey bar indicates the divergence point between the Taenia and Echinococcus lineages. (c) Extensive duplications of diagnostic antigen gp50 genes in the tapeworm lineage (Taenia, Echinococcus and Hymenolepis). (d) Sequence logo shows the conserved and distinct sequence characteristics of the gp50 sequences from the tapeworms. The sequence logo was generated from 183 sequences aligned at the conserved blocks selected by Gblocks with a less stringent selection (http://molevol.cmima.csic.es/castresana/Gblocks_server.html).
Figure 4Evolution of gene families in the flatworms and roundworms.
(a) The dynamics of gene family sizes in the genomes of T. saginata, T. asiatica, T. solium, E. granulosus, E. multilocularis, H. microstoma, S. japonicum and S. mansoni. Numbers above and below the branches indicate gene family gains/losses (red) and the expansions/contractions (blue), respectively. (b) Phylogenetic reconstruction clustered low-density lipoprotein receptor (LDLR) genes from T. saginata, T. asiatica and T. solium into four groups, in which group-4 LDLR genes were expanded only in the T. asiatica genome (Supplementary Methods, section 8.8). (c) The expansion of fatty acid desaturases (FADS) in the T. asiatica genome, compared with that in the T. saginata genome (Supplementary Methods, section 8.8).