| Literature DB >> 32745123 |
Eerik Aunin1, Ulrike Böhme1, Theo Sanderson2, Noah D Simons3, Tony L Goldberg4, Nelson Ting3, Colin A Chapman5,6,7, Chris I Newbold1,8, Matthew Berriman1, Adam J Reid1.
Abstract
Hepatocystis is a genus of single-celled parasites infecting, amongst other hosts, monkeys, bats and squirrels. Although thought to have descended from malaria parasites (Plasmodium spp.), Hepatocystis spp. are thought not to undergo replication in the blood-the part of the Plasmodium life cycle which causes the symptoms of malaria. Furthermore, Hepatocystis is transmitted by biting midges, not mosquitoes. Comparative genomics of Hepatocystis and Plasmodium species therefore presents an opportunity to better understand some of the most important aspects of malaria parasite biology. We were able to generate a draft genome for Hepatocystis sp. using DNA sequencing reads from the blood of a naturally infected red colobus monkey. We provide robust phylogenetic support for Hepatocystis sp. as a sister group to Plasmodium parasites infecting rodents. We show transcriptomic support for a lack of replication in the blood and genomic support for a complete loss of a family of genes involved in red blood cell invasion. Our analyses highlight the rapid evolution of genes involved in parasite vector stages, revealing genes that may be critical for interactions between malaria parasites and mosquitoes.Entities:
Mesh:
Year: 2020 PMID: 32745123 PMCID: PMC7425995 DOI: 10.1371/journal.ppat.1008717
Source DB: PubMed Journal: PLoS Pathog ISSN: 1553-7366 Impact factor: 6.823
Fig 1An assembly of genomic sequencing reads from a red colobus blood sample contained significant amounts of sequence from the parasite Hepatocystis spp.
(A) Contigs from the red colobus (Piliocolobus tephrosceles) assembly had a bimodal distribution of AT-content and sequence similarity to Plasmodium spp. (B). A phylogenetic tree of cytochrome b indicated that the closest match for the apicomplexan parasite sequenced from red colobus blood is a Hepatocystis isolate from a monkey host. Parasite cytochrome b sequences derived from RNA-seq assemblies from red colobus blood samples are almost entirely identical to the cytochrome b sequence assembled from Hepatocystis DNA reads from a single monkey. Branches of the tree have been coloured by bootstrap support values from 15 (red) to 100 (green). Some bootstrap support values are also shown next to the nodes as text. Red arrows highlight the Hepatocystis samples from the current study. Blue place names indicate the African continent, green Australia, orange Asia.
Features of the Hepatocystis sp. ex. Piliocolobus tephrosceles assembly compared to P. falciparum 3D7, P. vivax P01 and P. berghei ANKA.
| Genome size (Mb) | 19.95 | 23.3 | 29.0 | 18.7 |
| G+C content (%) | 22.05 | 19.34 | 39.8 | 22.05 |
| Gaps within scaffolds | 979 | 0 | 431 | 0 |
| No. of scaffolds | 2439 | 14 | 240 | 19 |
| No. of chromosomes | ND | 14 | 14 | 14 |
| No. of genes | 5,341 | 5,441 | 6,650 | 5,049 |
| No. of pseudogenes | 28 | 158 | 158 | 129 |
| No. of partial genes | 1,475 | 0 | 196 | 8 |
| No. of ncRNA | 19 | 103 | 35 | 47 |
| No. of tRNAs | 41 | 45 | 45 | 45 |
| No. of telomeres | 0 | 26 | 1 | 12 |
| No. of centromeres | 5 | 13 | 14 | 14 |
| Genome size (bp) | 6,595 | 5,967 | 5,989 | 5,957 |
| G+C content (%) | 30.99 | 31.6 | 30.5 | 30.9 |
| No. of genes | 3 | 3 | 3 | 3 |
| Genome size (kb) | 27.0 | 34.3 | 29.6 | 34.3 |
| G+C content (%) | 13.29 | 14.22 | 13.3 | 15.1 |
| No. of genes | 28 | 30 | 30 | 30 |
| CEGMA—complete | 63.31% | 69.35% | 68.15% | 70.16% |
| as least partial | 69.35% | 71.77% | 71.77% | 73.39% |
* including pseudogenes, duplications and partial genes, excluding non-coding RNA genes
** two small contigs have telomeric repeats (scaffold_2410–5 telomeric repeats, scaffold_2364–9 telomeric repeats)
Fig 2Whole genome phylogeny and key features of the Hepatocystis genome.
A whole-genome phylogenetic tree is combined with a graphical overview of key features of Hepatocystis and Plasmodium species (genome versions from August 2019). The maximum likelihood phylogenetic tree of Hepatocystis, Plasmodium and Haemoproteus species is based on an amino acid alignment of 2673 single copy orthologs encoded by the nuclear genome. Bootstrap support values of all nodes were 100, except for one node where the value was 79. The rooting of the tree at Hae. tartakovskyi is based on previously published Plasmodium phylogenetic trees [11,45]. TRAP—thrombospondin-related anonymous protein. RBP protein—reticulocyte binding protein.
Fig 3Hepatocystis sp. in vivo RNA-seq data supports a lack of erythrocytic schizogony and a variable sex ratio.
(A) Distributions of SNPs per 100 kb in each Hepatocystis sp. RNA-seq sample, relative to the genome assembly reference, highlight consistently low genetic diversity. Samples SAMN07757853, SAMN07757863, SAMN07757870 and SAMN07757873 have been excluded from the figure due to their low expression of Hepatocystis genes. (B) Deconvolution of RNA-seq samples to identify parasite stage composition shows no evidence for blood schizonts. Ring and trophozoite cells are assumed to relate to early stages of gametocyte development, which are not distinguishable from asexual rings and trophozoites. (C) Proportions of early blood stages (ring and trophozoite) are negatively correlated with mature female gametocytes, however male and female gametocyte ratios are poorly correlated, suggesting that sex ratios vary among samples.
Size of known and novel gene families in Hepatocystis sp. in comparison to its relatives.
| ApiAP2 transcription factors | 27 | 27 | 26 | 27 | 20 |
| 16 (includes 4 pseudogenes) | 0 | 0 | 0 | 0 | |
| 10 (includes 4 pseudogenes) | 0 | 0 | 0 | 0 | |
| 8 | 9 | 9 | 9 | 8 | |
| 6-cysteine proteins | 7 | 14 | 13 | 15 (includes 1 pseudogene) | 22 |
| 6 | 6 | 6 | 6 | 6 | |
| Thrombospondin-Related Anonymous Protein ( | 6 | 1 | 1 | 1 | 1 |
| 5 (includes 1 pseudogene) | 0 | 218 (includes 83 pseudogenes) | 1212 (includes 109 pseudogenes) | 0 | |
| Serine repeat antigen (SERA) | 5 (1 pseudogene) | 9 | 5 | 13 | 2 |
| early transcribed membrane protein ( | 4 | 15 | 7 | 10 | 1 |
| exported protein 1 ( | 4 (includes 2 pseudogenes) | 1 | 1 | 1 | 0 |
| Tryptophan-rich antigen | 4 | 4 (includes 1 pseudogene) | 11 (includes 4 pseudogenes) | 40 | 0 |
| Lysophospholipase | 2–4 | 6 (includes 1 pseudogene) | 5 (includes 2 pseudogenes) | 7 (includes 1 pseudogene) | 1 |
| Phist domain-containing | 2 | 81 (includes 19 pseudogenes) | 3 | 82 (includes 6 pseudogenes) | 3 |
| 1 | 1 | 74 (includes 26 pseudogenes) | 1 | 1 (partial) | |
| 0 | 7 | 15 | 10 | 0 |
Fig 4Hepatocystis sp. orthologues of genes which are highly expressed in late schizogony in P. berghei tend to be missing from the genome.
P. berghei gene expression clusters from the Malaria Cell Atlas were used to determine whether orthologous genes, conserved with P. vivax and P. ovale, but absent in Hepatocystis, tended to be expressed in particular parts of the life cycle. The only significant cluster was cluster 10, which includes genes most highly expressed in late schizont stages (highlighted by a red box). The top panels show the log2 observed/expected ratios for orthologous genes in each cluster which are shared between P. berghei and Hepatocystis and the same ratio for those which are shared between P. berghei, P. vivax and P. ovale, but not Hepatocystis sp. Cluster 1, 18, 19 and 20 were not tested because they contained fewer than 2 expected counts for either ratio. The asterisk indicates a Fisher’s exact test false discovery rate of < = 0.05.
Top 15 genes with functional annotations ranked by Hepatocystis sp. dN in comparison of Hepatocystis sp., P. berghei ANKA and P. ovale curtisi.
Genes with completely unknown function and genes with very little information on their possible functions have been left out from this table. The rank column indicates the Hepatocystis sp. dN rank of each gene in the complete table (with 4009 genes) that includes genes with unknown function (S4 Table).
| Gene id | Annotations | Rank | Putative function | |||
|---|---|---|---|---|---|---|
| HEP_00146800 | 1.08 | 0.21 | 0.42 | Oocyst rupture protein 2 (ORP2) | 3 | Sporozoite egress from the oocyst [ |
| HEP_00446500 | 0.71 | 0.19 | 0.34 | liver specific protein 2 (LISP2) | 19 | Involved in liver stage development [ |
| HEP_00295100 | 0.71 | 0.31 | 0.63 | AP2-O | 20 | Essential for morphogenesis in ookinete stage in |
| HEP_00035800 | 0.69 | 0.27 | 0.16 | 6-cysteine protein (p38) | 24 | |
| HEP_00213800 | 0.67 | 0.45 | 0.63 | AP2 domain transcription factor AP2-SP2 | 28 | Required for sporozoite production in |
| HEP_00337100 | 0.62 | 0.36 | 0.42 | AP2 domain transcription factor ApiAP2 | 40 | Involved in blood stage replication [ |
| HEP_00456700 | 0.62 | 0.61 | 0.38 | Merozoite TRAP-like protein (MTRAP) | 42 | Essential for gamete egress from erythrocytes [ |
| HEP_00304800 | 0.62 | 0.32 | 0.26 | Secreted ookinete protein (PSOP7) | 43 | Secreted ookinete proteins are necessary for invasion of the mosquito midgut [ |
| HEP_00254800 | 0.59 | 0.26 | 0.26 | Microgamete surface protein MiGS | 54 | Plays a critical role in male gametocyte osmiophilic body formation and exflagellation [ |
| HEP_00115700 | 0.55 | 0.41 | 0.15 | Merozoite surface protein 4 (MSP4) | 62 | Merozoite surface proteins are involved in red blood cell invasion [ |
| HEP_00166600 | 0.54 | 0.36 | 0.2 | Repetitive organellar protein (ROPE) | 64 | Localised to the apical end of merozoites, possibly involved in red blood cell invasion [ |
| HEP_00195400 | 0.54 | 0.24 | 0.26 | Osmiophilic body protein (G377) | 67 | Female‐specific protein, affects the size of the osmiophilic body and female gamete egress efficiency [ |
| HEP_00130400 | 0.53 | 0.17 | 0.11 | Thioredoxin 2 (TRX2) | 75 | Part of a protein complex in parasitophorous vacuolar membrane, required for pathogenic protein secretion into host [ |
| HEP_00391300 | 0.52 | 0.49 | 0.27 | Autophagy-related protein 11 (ATG11) | 76 | Predicted to be involved in cargo selection in selective autophagy [ |
| HEP_00155500 | 0.52 | 0.18 | 0.17 | Metacaspase-2 | 80 | Protease with caspase-like activity [ |