| Literature DB >> 31777939 |
Elie Hammam1,2,3,4, Guruprasad Ananda5, Ameya Sinha6,7, Christine Scheidig-Benatar1,2,3, Mylene Bohec8, Peter R Preiser6,7, Peter C Dedon6,9, Artur Scherf1,2,3, Shruthi S Vembar1,2,3.
Abstract
DNA cytosine modifications are key epigenetic regulators of cellular processes in mammalian cells, with their misregulation leading to varied disease states. In the human malaria parasite Plasmodium falciparum, a unicellular eukaryotic pathogen, little is known about the predominant cytosine modifications, cytosine methylation (5mC) and hydroxymethylation (5hmC). Here, we report the first identification of a hydroxymethylcytosine-like (5hmC-like) modification in P. falciparum asexual blood stages using a suite of biochemical methods. In contrast to mammalian cells, we report 5hmC-like levels in the P. falciparum genome of 0.2-0.4%, which are significantly higher than the methylated cytosine (mC) levels of 0.01-0.05%. Immunoprecipitation of hydroxymethylated DNA followed by next generation sequencing (hmeDIP-seq) revealed that 5hmC-like modifications are enriched in gene bodies with minimal dynamic changes during asexual development. Moreover, levels of the 5hmC-like base in gene bodies positively correlated to transcript levels, with more than 2000 genes stably marked with this modification throughout asexual development. Our work highlights the existence of a new predominant cytosine DNA modification pathway in P. falciparum and opens up exciting avenues for gene regulation research and the development of antimalarials.Entities:
Mesh:
Substances:
Year: 2020 PMID: 31777939 PMCID: PMC6943133 DOI: 10.1093/nar/gkz1093
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.A 5hmC-like modification is the predominant cytosine modification in the Plasmodium falciparum genome. (A) Southwestern blotting analysis of genomic DNA prepared from Ring (R), Trophozoite (T) or Schizont (S) stage P. falciparum 3D7-G7 parasites using antibodies against 5mC (Abnova) or 5hmC (Activemotif). Unmethylated, methylated or hydroxymethylated DNA standards (Activemotif) served as controls. The total amount of DNA used is indicated in ng. (B) Immunostaining of Ring (R) or Schizont (S) stage P. falciparum 3D7-G7 parasites with anti-5hmC (top) or anti-5mC (bottom) antibodies. Anti-histone H3 antibodies were used as a positive control for nuclear staining. BF = Brightfield. (C) ELISA-based quantification of 5hmC (blue) or 5mC (gray) levels in genomic DNA prepared from Ring (R) or Schizont (S) stage parasites of the NF54 and 3D7-G7 strains. The positive controls were the methylated (5mC) and hydroxymethylated (5hmC) DNA standards used in part 1A. Data were calculated as a percentage and represent the means of three independent experiments + standard errors of means (S.E.M.).
Figure 2.BS- and oxBS-seq profiling confirms that a majority of modified cytosines in Plasmodium falciparum are 5hmC-like in nature. (A) Schematic representation of the BS- and oxBS-seq methodology. Genomic DNA prepared from synchronized P. falciparum parasites was treated with sodium bisulfite alone (BS-seq; top panel) or with an oxidizing agent prior to sodium bisulfite treatment (oxBS-seq; bottom panel). The resulting DNA was processed for Next Generation Sequencing and a subtraction of oxBS-seq signals from BS-seq signals yielded 5hmC distribution at single nucleotide resolution. See ‘Materials and Methods’ section for additional details. (B) Measured 5hmC (blue) or 5mC (grey) levels (relative to total cytosines in the P. falciparum genome, at 5 dpge and 10% methylation proportion and represented as a percentage) in Ring (R) or Schizont (S) stage BS- and oxBS-seq samples for the NF54 and 3D7-G7 strains. Data represent the means of three independent experiments + S.E.M. (C) Pearson correlation analysis of the 5hmC-like profiles (calculated for bigwig files normalized over 500 nt bins) for the different BS- and oxBS-seq replicates for genomic DNA extracted from Ring (R) or Schizont (S) stage parasites of the NF54 and 3D7-G7 strains. The colour scale indicates the value of the Pearson Correlation Coefficient R from −1 to 1. (D) Overlap in the number of 5hmC-like loci identified in BS- and oxBS-seq replicates of the indicates samples of the NF54 and 3D7-G7 strains. (E) The percentage of 5hmC-like loci found in the CpG (black), CHG (yellow) or CHH (white) context was calculated for the indicated parasite strain and growth stage: R = Ring and S = Schizont, from the BS- and oxBS-seq data. Data represent the means of three independent experiments + S.E.M.
Figure 3.hmeDIP-seq performs robustly for the AT-rich Plasmodium falciparum genome. (A) Schematic representation of the hmeDIP-seq methodology used to measure 5hmC (or 5hmC-like) distribution in P. falciparum asexual stages. Sheared P. falciparum genomic DNA was immunoprecipitated using anti-5hmC antibodies and the resulting DNA processed for Next Generation Sequencing (NGS). MACS2 was used for 5hmC-like peak-calling. See Materials and Methods for additional details. (B) Correlation between hmeDIP-seq biological replicates (calculated for bam alignment files derived from Illumina sequencing data) for the Ring (R), Trophozoite (Troph or T) and Schizont (Schiz or S) stages of 3D7-G7 was determined using a Pearson correlation analysis. The colour scale indicates values of the Pearson correlation coefficient R from 0 to 1. Ctrl = Control IP for biological replicate 1. A and B are technical replicates for the same genomic DNA sample. (C) Correlation of MACS2 fold enrichment (FE) profiles of hmeDIP-seq samples relative to Input DNA for the Ring (R), Trophozoite (T) and Schizont (S) stages of 3D7-G7 was compared to the IgG control IPs relative to Input DNA. For each stage, data were normalized across all replicates to generate a single FE profile (described in ‘Materials and Methods’ section). The colour scale indicates values of the Pearson correlation coefficient R from 0 to 1. Ctrl = Control IP for biological replicate 1. (D) Principal Component Analysis of the different MACS2-derived FE profiles of part C was performed using the plotPCA function of DeepTools on a multibigwigsummary file. The eigenvalues of the top two principal components PC1 (54.3% of variance explained) and PC2 (31.3% of variance explained) are shown and meaningful clustering of samples indicated.
Figure 4.5hmC-like bases occur predominantly in genic regions of the Plasmodium falciparum genome and are stably maintained during intra-erythrocytic development. (A) The number of hmeDIP-enriched peaks found in genic (red bars; 1 kb upstream, green bars; gene body and blue bars; 1 kb downstream) and intergenic (black bars) regions was calculated for the indicated growth stage from the hmeDIP-seq data: Troph = Trophozoite and S = Schizont. (B) Genes containing 5hmC-like loci either in the gene body or within their 1 kb upstream or downstream flanks were identified for the Ring, Trophozoite (Troph) and Schizont (Schiz) stages of the 3D7-G7 strain. A Venn diagram was used to represent the overlap in 5hmC-like-associated genes for the different sub-domains of a genic region. (C) A Venn diagram was used to represent the overlap in 5hmC-like-associated genes for the indicated sub-region across different P. falciparum growth stages: Troph = Trophozoite and S = Schizont. (D) The hmeDIP-seq and MACS2-derived fold enrichment (FE) pofiles for a 35 kb region of chromosome 4 (dark blue) are shown for the Ring, Trophozoite (Troph) and Schizont (Schiz) stages. MACS2-derived peaks are shown underneath the FE profiles. The coverage of bam files for hmeDIP-enriched DNA (red), DNA enriched in the control IP (green), or Input DNA (black) is shown and was calculated as reads per million over bins of 1000 nt. The GC plot (light blue) represents the G+C content of the P, falciparum genome normalized across bins of 1000 nt.
Figure 5.5hmC-like distribution within gene bodies is positively correlated to mRNA levels. (A) The expression of genes with (green) or without (red) 5hmC-like peaks in the gene body is displayed as a box and whisker plot for one replicate of the Ring (Rep 1), Trophozoite (Troph; Rep 1) and Schizont (Schiz; Rep 1) stages of the 3D7-G7 strain. For each box and whisker plot, the mean is indicated as a dashed line and the median as a solid line within the box. Y-axis: FPKM (fragments per kilobase of transcript per million mapped reads) measured using strand-specific RNA-seq. For each comparison, P-values were calculated using the non-parametric Mann–Whitney U-test. * indicates a P-value < 0.001. Refer to Supplementary Figure S5 and Table SS5A for the other replicates (B) Left panel: The intersection of 5hmC-like-associated genes (peaks within gene body) identified in the Ring, Trophozoite (Troph) and Schizont (Schiz) stages of the 3D7-G7 strain are represented using a Venn diagram. Right panel: The table summarizes the expression (Average RPKM) of 2178 overlapping genes relative to the remaining genes in replicate 1 of the indicated growth stage. The P-value was calculated using the non-parametric Mann–Whitney U-test. Refer to Supplementary Table S5D for the remaining replicates. (C) Gene Ontology (GO) analysis of genes that contain 5hmC-like loci within the gene body in all three stages of the 3D7-G7 strain was performed using https://plasmodborg. Background refers to the number of genes annotated to the indicated GO term in the entire background set, which in this case is comprised of all annotated P. falciparum genes. MF = Molecular Function; BP = Biological Process; CC = Cellular Component. Refer to Supplementary Table S6 for additional details.
Figure 6.hm5dC is not detectable in Plasmodium falciparum DNA by mass spectrometry but there is evidence for another type of modified nucleoside. LC-MS/MS spectra captured using Multiple Reaction Monitoring (MRM) mode are shown for two transitions, (A) 258.1>142.1 and (B) 258.1>124.1, for two replicates (Rep 1 and Rep 2) of P. falciparum genomic DNA. Calf Thymus DNA and a synthetic standard purchased from Cayman Chemicals served as positive controls, and showed hm5dC (i.e. 5hmC) elution at 2.00 min at room temperature. ? signifies an unknown peak that elutes at approximately 2.75 min for both transitions. The no DNA ‘Matrix’ control included enzymes, inhibitors and other buffers. Note that the y-axis scales of the various samples are not linked.