| Literature DB >> 26829753 |
Vicky L Hunt1, Isheng J Tsai2,3, Avril Coghlan4, Adam J Reid4, Nancy Holroyd4, Bernardo J Foth4, Alan Tracey4, James A Cotton4, Eleanor J Stanley4, Helen Beasley4, Hayley M Bennett4, Karen Brooks4, Bhavana Harsha4, Rei Kajitani5, Arpita Kulkarni6, Dorothee Harbecke6, Eiji Nagayasu3, Sarah Nichol4, Yoshitoshi Ogura7, Michael A Quail4, Nadine Randle8, Dong Xia8, Norbert W Brattig9, Hanns Soblik9, Diogo M Ribeiro4, Alejandro Sanchez-Flores4,10, Tetsuya Hayashi7, Takehiko Itoh5, Dee R Denver11, Warwick Grant12, Jonathan D Stoltzfus13, James B Lok13, Haruhiko Murayama3, Jonathan Wastling8,14, Adrian Streit6, Taisei Kikuchi3, Mark Viney1, Matthew Berriman4.
Abstract
Soil-transmitted nematodes, including the Strongyloides genus, cause one of the most prevalent neglected tropical diseases. Here we compare the genomes of four Strongyloides species, including the human pathogen Strongyloides stercoralis, and their close relatives that are facultatively parasitic (Parastrongyloides trichosuri) and free-living (Rhabditophanes sp. KR3021). A significant paralogous expansion of key gene families--families encoding astacin-like and SCP/TAPS proteins--is associated with the evolution of parasitism in this clade. Exploiting the unique Strongyloides life cycle, we compare the transcriptomes of the parasitic and free-living stages and find that these same gene families are upregulated in the parasitic stages, underscoring their role in nematode parasitism.Entities:
Mesh:
Year: 2016 PMID: 26829753 PMCID: PMC4948059 DOI: 10.1038/ng.3495
Source DB: PubMed Journal: Nat Genet ISSN: 1061-4036 Impact factor: 38.330
Fig. 1Evolution and comparative genomics of Strongyloides and relatives
The life cycles of six clade IV nematodes showing the transition from a free-living lifestyle (in Rhabditophanes), through facultative parasitism (P. trichosuri), to obligate parasitism (Strongyloides spp.), and the phylogeny of these species (maximum-likelihood phylogeny based on a concatenated alignment of 841,529 amino acid sites from 4,437 conserved single-copy orthologous genes). Values on nodes (all 100) are the number of bootstrap replicate trees showing the split induced by the node, out of 100 bootstrap replicates. The phylogeny is annotated with the numbers of gene families appearing along each branch of the phylogeny (+values on each branch) and histograms show the number of duplications (blue) and losses (red) for individual genes (dark blue or red) and foor families (light blue or red); the number of gene origins and gene losses in 18 astacin families (upper numbers in boxes) and ten SCP/TAPS families (lower numbers in boxes) as estimated by the Ensembl Compara pipeline is also shown. The pie charts summarize the evolutionary history of the genome of each species, defining genes shared among all six species, the five parasitic species (Strongyloididae, which includes all except Rhabditophanes), the four Strongyloides species, and species-specific genes. The host species of the parasites are shown: for P. trichosuri the brushtail possum, for S. ratti and S. venezuelensis the rat, for S. stercoralis humans, and for S. papillosus sheep.
Properties of genome assemblies
Genome statistics based on scaffolds, excluding scaffolds less than 1000 bp. N50 is the size above which 50% of the assembled bases are distributed; N50 (number) is the number of scaffolds in which 50% of assembled bases exist.
| S. ratti | S. stercoralis | S. papillosus | S. venezuelensis | P. trichosuri | Rhabditophanes | C. elegans | |
|---|---|---|---|---|---|---|---|
| Clade | IV | IV | IV | IV | IV | IV | V |
| Number of chromosomes | 3[ | 3[ | 2[ | 2[ | 3[ | 5 | 6[ |
| Assembly version | V5.0.4 | V2.0.4 | V2.1.4 | V2.0.4 | V2.0.4 | V2.0.4 | WS244 |
| Assembly size (Mb) | 43.1 | 42.6 | 60.2 | 52.1 | 42.2 | 47.2 | 100.2 |
| Number of scaffolds | 115 | 675 | 4,353 | 520 | 1,391 | 380 | 6 |
| N50 of scaffolds (kb) | 11,700 | 431 | 86 | 715 | 837 | 537 | 17,500 |
| N50 (number) | 2 | 16 | 129 | 16 | 12 | 22 | 3 |
| Maximum scaffold length (Mb) | 16.8 | 5.0 | 1.7 | 5.9 | 6.2 | 7.3 | 20.9 |
| G+C content (%) | 21 | 22 | 26 | 25 | 31 | 32 | 36 |
| Number of genes | 12,451 | 13,098 | 18,457 | 16,904 | 15,010 | 13,496 | 23,629 |
| Number of exons | 33,796 | 34,366 | 40,821 | 40,619 | 35,049 | 37,987 | 145,275 |
| Exons, combined length (Mb) | 17.5 | 17.9 | 22.4 | 20.3 | 20.8 | 17.8 | 30.1 |
| Median exon length (bp) | 263 | 265 | 304 | 261 | 348 | 276 | 146 |
| Number of introns | 21,345 | 21,268 | 22,364 | 23,715 | 20,039 | 24,491 | 169,506 |
See Supplementary Figure 7
12 scaffolds, covering 93% of the genome, are assigned to chromosomes; 103 scaffolds are not assigned to a chromosome.
Fig. 2Nuclear genomic synteny and mitochondrial genomes of four Strongyloides spp., P. trichosuri and Rhabditophanes sp
(a) The S. ratti genome, our best assembled genome, is used as the reference sequence; synteny is based on sequence matches. Graduation of color across the S. ratti chromosomes represents position along the chromosome for chromosome I (yellow-red), chromosome II (blue-purple) and chromosome X (green). Black boxes represent scaffolds >1Mb; scaffolds <1Mb are grouped together and shown in grey. (b) The mitochondrial gene order and phylogeny for our six species and seven outgroup species that encompass four nematode clades. Our eighth outgroup species, Meloidogyne hapla, was excluded due to insufficient mitochondrial genome data. Inverted sequences are shown by gene boxes with inverted text. The maximum likelihood tree (left) was constructed using 12 mitochondrial proteins. Amino acid sequences were aligned before concatenation (Supplementary Note).
Fig. 3The parasitic female, free-living female and infective third-stage larvae transcriptomes of Strongyloides spp
The progeny of the parasitic female pass out of the host (as larvae for S. stercoralis, or eggs and larvae for S. ratti) where infective third stage larvae (iL3s) can develop directly, or free-living males and females develop, whose progeny develop into iL3s; iL3s then infect hosts. The human parasite, S. stercoralis, can undergo internal auto-infection (grey dashed line) where iL3s develop and internally reinfect the same host. The transcriptome of the parasitic females, free-living females and iL3s were compared for S. ratti and S. stercoralis. Representative GO terms that were significantly enriched (left-hand side area of box) and Ensembl Compara gene families significantly upregulated (right-hand side of box) for each of these three stages of the lifecycle is summarized. The pie charts show the proportion of the GO terms common to S. ratti and S. stercoralis, or unique to either. Numbers in the right-hand side of boxes represent the number of genes upregulated in each gene family for S. ratti and S. stercoralis.
Fig. 4Strongyloides-specific expansions and chromosomal clustering of gene families
(a) Astacin-like and (b) SCP/TAPS are the two major Strongyloides ratti gene families upregulated in the transcriptome of parasitic females. Left shows the phylogeny of each of these for S. ratti and our eight outgroup species and the crayfish Astacus astacus S. ratti genes are in light blue. Right shows the distribution of these genes in the genome, plotted as clusters of physically adjacent genes in the genome. Numbers above the peaks are the number of genes in a cluster of physically neighboring genes; ticks below the axis denote scaffold boundaries for chromosome X. The transcriptomic expression of these genes (in RPKM, reads per kilobase per million mapped reads) for parasitic females, free-living females and iL3s are shown on a grey scale, and the results of pairwise edgeR analysis of the gene expression among these lifecycle stages is shown in red or blue where a gene is upregulated. The color representing upregulation (red or blue) in a given stage of the life cycle relates to the color of the name of that stage for each pairwise comparison (fold change > 2, FDR < 0.01); no differential expression is shown as a white block.