| Literature DB >> 25359557 |
Thomas D Otto, Ulrike Böhme, Andrew P Jackson, Martin Hunt, Blandine Franke-Fayard, Wieteke A M Hoeijmakers, Agnieszka A Religa, Lauren Robertson, Mandy Sanders, Solabomi A Ogun, Deirdre Cunningham, Annette Erhart, Oliver Billker, Shahid M Khan, Hendrik G Stunnenberg, Jean Langhorne, Anthony A Holder, Andrew P Waters, Chris I Newbold, Arnab Pain, Matthew Berriman, Chris J Janse.
Abstract
BACKGROUND: Rodent malaria parasites (RMP) are used extensively as models of human malaria. Draft RMP genomes have been published for Plasmodium yoelii, P. berghei ANKA (PbA) and P. chabaudi AS (PcAS). Although availability of these genomes made a significant impact on recent malaria research, these genomes were highly fragmented and were annotated with little manual curation. The fragmented nature of the genomes has hampered genome wide analysis of Plasmodium gene regulation and function.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25359557 PMCID: PMC4242472 DOI: 10.1186/s12915-014-0086-0
Source DB: PubMed Journal: BMC Biol ISSN: 1741-7007 Impact factor: 7.431
Features of the reference genomes of ANKA, AS and YM
|
|
|
|
|
| ||
|---|---|---|---|---|---|---|
|
|
|
|
|
| ||
|
| ||||||
| Genome size (Mb) | 18.0 | 18.5 | 16.9 | 18.8 | 21.9 | 23.3 |
| G + C content (%) | - | 22.1 | - | 23.6 | 21.1 | 19.4 |
| Chromosomes | 14 | 14 | 14 | 14 | 14 | 14 |
| Synteny breaksb | - | - | 1 | 1 | 0 | ND |
| Contigs | 7497 | 220 | 10,679 | 40 | 195 | 14 |
| Sequence coverage | 4x | 237x | 4x | 109x | 627x | - |
| Genesc | 5,864 | 4,979 | 5,698 | 5,139 | 5,675 | 5,419 |
| Genes with functional annotationd | - | 2,781 (56%) | - | 2,927 (57%) | 3,485 (61%) | 3,234 (60%) |
| Novel genes (see Additional file | - | 487 | - | 409 | - | - |
|
| ||||||
| Genome size (bp) | - | 5,957 | - | 5,949 | 6,512 | 5,967 |
| G + C content (%) | - | 30.9 | - | 30.9 | 30.7 | 31.6 |
| Number of genes | - | 3 | - | 3 | 3 | 3 |
|
| ||||||
| Genome size (bp) | - | 30,302 | - | 29,468 | 29,736 | 29,430 |
| G + C content (%) | - | 13.5 | - | 13.7 | 14.1 | 13.1 |
| Genes | - | 30 | - | 30 | 30 | 30 |
aGenome version: 1.5.2013; Apicoplast genome from accession numbers: X95275, X95276; bcompared to the PbA genome; cin new versions, this includes pseudogenes and partial genes, but does not include non-coding RNA genes; dfigures include all genes except those annotated as ‘hypothetical’, ‘conserved Plasmodium protein, unknown function’, ‘conserved protein, unknown function’, ‘conserved rodent malaria protein, unknown function’ or ‘Plasmodium exported protein, unknown function’.
Different (subtelomeric) multigene families in the RMP genomes
|
|
|
| ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
| ||||||||
|
|
|
|
|
|
|
|
|
| ||
|
|
| 100 | 88 | 12 | 194 | 3 | 4 | 583 | 40 | 172 |
|
|
| 23 | 16 | 3 | 132 | 2 | 0 | 94 | 8 | 11 |
|
|
| 34 | 1 | 5 | 26 | 0 | 0 | 48 | 2 | 4 |
|
| PYSTC | 6 | 0 | - | 10 | 0 | - | 22 | 0 | - |
|
|
| 1 | 0 | - | 17 | 4 | - | 5 | 0 | - |
|
|
| 7 | - | - | 13 | - | - | 12 | - | - |
|
|
| 6 | - | 8 | 8 | - | 0 | 11 | - | 3 |
|
| ||||||||||
|
| 4 | 1 | - | 28 | 0 | - | 11 | 1 | - | |
|
|
| 1 | 0 | 0 | 13 | 1 | 1 | 1 | 0 | 0 |
|
| 1 | - | - | 9 | - | - | 1 | - | - | |
|
| 46 | - | - | 67 | - | - | 46 | - | - | |
See Additional file 4 for details of all genes of P. berghei ANKA (PbA), P. c. chabaudi AS (PcAS) and P. y. yoelii YM (PyYM). CG: complete gene; FG: fragment; PSG: pseudogene.
Figure 1Organization of subtelomeric regions of RMP chromosomes. A) Organization of subtelomeric regions of chromosomes 13 of PyYM (left region), 5 of PcAS (left region) and 11 of PbA (left region). The order and orientation of the genes are shown, including genes belonging to the pir, fam-a, fam-b and fam-c gene families. Exons are shown in coloured boxes with introns as linking lines. As a comparison, a subtelomeric region of P. falciparum 3D7 chromosome 9 is shown. The shaded/grey areas mark the start of the conserved, syntenic regions. Black angular lines represent gaps. B) Artemis view showing a copy of the PbA-specific 2.3 kb repeat element containing a fragmented and ‘pseudogenised’ bir gene (PBANKA_000720). Also shown is the location of three 27 bp telomeric repeat units. The HMM Pfam match ‘Cir_Bir_Yir PF06022’ (grey box) spans amino acid 6 to 239 with an e-value of 2.5e-77. RPM, rodent malaria parasites.
Sequence diversity and number of members of multigene families from different RMP isolates/lines
|
|
|
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|---|
|
| - | - | - | 18.56 | 220 | 200 | 42 | 40 | 6 | 1 |
|
| 437x | 294 | 21 | 18.45 | 313 | 174 | 67 | 46 | 6 | 1 |
|
| 161x | 127 | 14 | 18.47 | 310 | 191 | 54 | 47 | 5 | 1 |
|
| 268x | 95 | 4 | 18.44 | 333 | 182 | 54 | 45 | 7 | 1 |
|
| 401x | 2,098 | 345 | 18.32 | 275 | 170 | 58 | 50 | 6 | 1 |
|
| 262x | 2,759 | 469 | 18.72 | 123 | 224 | 54 | 41 | 7 | 1 |
|
| - | - | - | 22.03 | 195 | 795 | 113 | 54 | 22 | 5 |
|
| 289x | 740 | 8 | 22.75 | 154 | 980 | 157 | 78 | 26 | 5 |
|
| - | - | - | 18.83 | 40 | 201 | 134 | 26 | 10 | 21 |
|
| 79x | 144,148 | 4,321 | 18.98 | 246 | 276 | 160 | 31 | 23 | 21 |
|
| 127x | 144,281 | 4,326 | 18.89 | 240 | 277 | 161 | 28 | 18 | 19 |
|
| 98x | 251,984 | 4,514 | 19.63 | 272 | 371 | 215 | 37 | 33 | 29 |
|
| 248x | 274,877 | 4,509 | 19.41 | 275 | 398 | 193 | 41 | 32 | 25 |
aReference genomes (that is, PbA, PcAS and PyYM) to which sequence data from other isolates were mapped and analysed.; bexcluded from the analysis: all subtelomerically located genes (as mentioned in Additional file 4) and repetitive and low complexity regions of genes. Only single nucleotide polymorphisms (SNPs) were counted with at least 10 high quality mapped reads, 90% allele and 20% calls on each strand (see Additional file 8 for details of the SNPs in individual genes); cincluding pseudogenes and fragments. RMP, rodent malaria parasites.
Figure 2Gene expression (RNAseq) in multiple RMP life cycle stages. A) Spearman correlation of FPKM values of orthologous genes between life cycle stages of PbA, PcAS and PyYM. PbA: ring (RI), trophozoite (Tr), schizont (Sch), gametocyte (Gct) and 16 and 24 hour ookinetes (Ook). PcAS: trophozoites (trophozoites of blood (Pc-bl)-and vector-transmitted (Pc-vec) PcAS; PyYM; blood stages (2 lines PyYM_WT and PyYM_MUT). B) Heat maps of expression (FPKM normalized by gene) of PbA genes in different life cycle stages. Left panel, all PbA genes ordered based on P. berghei expression pattern (FPKM values >21; in total 4,733 genes). Right panel, 2,236 PbA genes with orthologs in P. falciparum and FPKM values >63, ordered according to the temporal expression levels (in asexual blood stages) of their P. falciparum orthologs as shown in [34]. FPKM, fragments per kilo base of exon per million fragments mapped; RMP, rodent malaria parasites.
Figure 3Expression of members of three large RMP multigene families in different life cycle stages. Temporal expression patterns of members of the three largest PbA multigene familes (pir, fam-a and fam-b) in different life cycle stages as visualized by heat maps of RNA-seq data. The expression (FKPM) values of genes over the life-cycle stages (yellow-red) are normalised per gene. The min/max column values are the log minimal and log maximal FPKM values for each gene. Only genes with an FPKM above 21, in all conditions, were included. FKPM, fragments per kilo base of exon per million fragments mapped; RMP, rodent malaria parasites.
Figure 4Features of the RMP multigene family A) Phylogenetic tree of RMP s showing the different clades (S1 to 8, L1 to 4) and separation of the ‘long’ (L) and ‘short’ form (S) s. B) Features of the different RMP pir clades. For each clade we show the total number of RMP pirs followed by their distribution (pie charts) in the three species and the distribution of gene lengths (box plots). In addition, pie charts show the relative abundance of clades in the different isolates/lines of P. chabaudi and P. yoelii. The expression bar plots (red bars) visualise the expression of the pirs of different clades in the different life cycle stages (except for the ookinete stage since expression/FPKM values are below the cut of level of 21). A pir is assigned to a life cycle stage based on the highest FPKM value. The height of the expression bar represents the percentage of all pirs in that clade. FKPM, fragments per kilo base of exon per million fragments mapped; RMP, rodent malaria parasites.