| Literature DB >> 29783949 |
Joan Simam1, Martin Rono2,3,4, Joyce Ngoi2, Mary Nyonda5, Sachel Mok6, Kevin Marsh3, Zbynek Bozdech7, Margaret Mackinnon8.
Abstract
BACKGROUND: Gene copy number variants (CNVs), which consist of deletions and amplifications of single or sets of contiguous genes, contribute to the great diversity in the Plasmodium falciparum genome. In vitro studies in the laboratory have revealed their important role in parasite fitness phenotypes such as red cell invasion, transmissibility and cytoadherence. Studies of natural parasite populations indicate that CNVs are also common in the field and thus may facilitate adaptation of the parasite to its local environment.Entities:
Keywords: Adaptation; Copy number variation; Plasmodium falciparum
Mesh:
Year: 2018 PMID: 29783949 PMCID: PMC5963192 DOI: 10.1186/s12864-018-4689-7
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Characteristics of the four study populations
| Population | Kisumu | Kilifi pre malaria decline | Kilifi post malaria decline | Sudan |
|---|---|---|---|---|
| Malaria transmission intensity | High | Medium-high | Medium-low | Low |
| Number of samples | 49 | 33 | 49 | 52 |
| Year of sample collection | 2008 | 1994–1996 | 2010 | 2007 |
| Median age in months | 36 | 30 | 53.5 | 84 |
| (range) | (6–72) | (11–37) | (14–147) | (12–612) |
| log10 median parasitaemia | 5.3 | 4.7 | 5.1 | 5.1 |
| (par/μl) (range) | (4.9–5.8) | (4.0–5.8) | (2.5–6.1) | (4.4–5.8) |
| Median hemoglobin (g/dl) | 9.9 | 9.4 | 10.6 | 9.6 |
| (range) | (5.2–15.2) | (5.3–13.2) | (3.4–12.1) | (3.2–12.9) |
| Median number of clones | 2 | 2 | 2 | 2 |
| (range) | (1–6) | (1–7) | (1–5) | (1–5) |
| Monoclonal infections (percentage) | 10.2 | 33.3 | 18.4 | 26.9 |
Fig. 1Location and properties of CNVs in the P. falciparum genome. a Chromosomal location of the 94 CNVs in the 14 nuclear chromosomes of the P. falciparum genome (deletions in blue and amplifications in red). White vertical bars represent regions not targeted by the microarray probes. Black vertical bars are locations of centromeres. Distributions of length of CNVs (b), number of genes per CNV (c) and number of CNVs per sample (d) split by study population (horizontal line, median number; top and bottom boundaries, 75th and 25th percentiles; whiskers, minimum and maximum)
Fig. 2Systematic effects of host, parasite and gene factors on overall CNV prevalence. a Effects of host status, infection status and population on population prevalence of all CNVs. b Effects of gene properties on genomic prevalence of amplification CNVs. c As for b but for deletion CNVs. Points show least-squares means for each level of the factors (x-axis) adjusted for other factors in the model (separate panels). Vertical lines show upper and lower 95% confidence intervals. Significance of each factor is indicated at the top of each panel. *, P < 0.05; ***, P < 0.001
Functional gene categories showing significant enrichment for CNVs
| Functional gene set | No. of genes in CNVs | Total no. of genes in gene set | Nominal | Adjusted |
|---|---|---|---|---|
| Deletions | ||||
| Exported proteins - PHISTs | 8 | 48 | < 0.001 | < 0.001 |
| Characteristics of | 5 | 35 | < 0.001 | 0.043 |
| Glycolysis | 4 | 27 | 0.001 | 0.10 |
| Exported proteins - Unique | 6 | 72 | 0.0025 | 0.10 |
| Utilization of phospholipids | 4 | 40 | 0.005 | 0.26 |
| Amplifications | ||||
| Rab and other proteins involved in intracellular traffic | 3 | 14 | 0.009 | 1 |
| RNA binding genes | 12 | 173 | 0.01 | 1 |
aBy hypergeometric test without adjustment for testing of multiple gene sets
bAdjusted for multiple-testing by the Benjamini-Hochberg method
Fig. 3Population differentiation of CNVs. a Overlap of CNVs between populations. b Distribution of transmission intensity-related frequency clines (z-score) among the CNVs (filled bars, red for amplifications, green for deletions) vs. the expected distribution based on permuted data (black line). Vertical solid lines indicate the upper and lower 2.5% probability thresholds of the latter. Vertical dashed lines indicate the equivalent thresholds after Benjamini-Hochberg adjustment for multiple testing. c Distributions of population pairwise FST estimates for CNVs (dots, individual CNVs; horizontal line, median number; top and bottom boundaries, 75th and 25th percentiles, whiskers, minimum and maximum)
CNVs showing evidence of population-specific adaptation. Only those CNVs that either had FST greater than 0.30 or which showed significant transmission intensity-related clines in population frequency are shown
| CNV name | CNV type | FSTa | Clineb | Annotationsc |
|---|---|---|---|---|
| cnv5_101 | Amplification | 0.64 | - *** | 6-cysteine protein (P38) SET domain protein, putative (SET9) |
| cnv12_413 | Amplification | 0.64 | -* | |
| cnv9_242 | Amplification | 0.44 | P1 nuclease, putative | |
| cnv13_478 | Deletion | 0.46 | - † | E1-E2 ATPase, putative |
| cnv9_254 | Deletion | 0.41 | protein phosphatase-beta thioredoxin-like protein 2 (TLP2) zinc binding protein (Yippee), putative histone deacetylase 1 (HDAC1) | |
| cnv3_051 | Amplification | 0.34 | circumsporozoite- and TRAP-related protein (CTRP) | |
| cnv3_036 | Amplification | 0.32 | -† | Plasmodium exported protein (hyp1), unknown function (GEXP21) |
| Plasmodium exported protein, unknown function | ||||
| Plasmodium exported protein, unknown function | ||||
| cnv6_129 | Deletion | 0.30 | ubiquitin-conjugating enzyme E2, putative polypyrimidine tract binding protein, putative | |
| cnv4_078 | Amplification | 0.30 | -† | |
| cnv11_354 | Deletion | 0.27 | Plasmodium exported protein (PHISTc), unknown function (GEXP12) | |
| Plasmodium exported protein (hyp11), unknown function | ||||
| Plasmodium exported protein, unknown function | ||||
| cnv11_355 | Deletion | 0.18 | -*** | antigen 332, DBL-like protein (Pf332) |
| cnv9_269 | Deletion | 0.27 | -*** | gametocyte development protein 1 (GDV1) |
| Plasmodium exported protein, unknown function (GEXP22) | ||||
| gametocytogenesis-implicated protein (GIG) | ||||
| Plasmodium exported protein, unknown function | ||||
| cytoadherence linked asexual protein 9 (CLAG9) | ||||
| ring-exported protein 1 (REX1) | ||||
| ring-exported protein 2 (REX2) | ||||
| early transcribed membrane protein (ETRAMP9) | ||||
| ring-exported protein 4 (REX4) | ||||
| virulence-associated protein 1 (VAP1) | ||||
| Plasmodium exported protein (PHISTc), unknown function | ||||
| (GEXP05) | ||||
| lysophospholipase, putative | ||||
| Plasmodium exported protein (PHISTc), unknown function | ||||
| Plasmodium exported protein (PHISTb), unknown function | ||||
| Plasmodium exported protein, unknown function | ||||
| lysophospholipase, putative | ||||
| cnv1_007 | Amplification | 0.26 | -** | tubulin-specific chaperone a, putative N-acetyltransferase, putative |
| cnv14_549 | Amplification | 0.21 | -*** | thioredoxin peroxidase 1 (Trx-Px1) copper transporter |
| cnv11_355 | Deletion | 0.18 | - *** | antigen 332-DBL-like protein (Pf332) |
| cnv2_023 | Deletion | 0.14 | -* | conserved Plasmodium membrane protein, unknown function |
| cnv12_368 | Deletion | 0.07 | -* | glycerol-3-phosphate dehydrogenase, putative |
aMaximum of pairwise population FST values
bSignificance of transmission intensity cline in CNV frequency after Benjamini-Hochberg correction for multiple testing. (†, P < 0.10;*, P < 0.05;**, P < 0.01;***, P < 0.001; +, positive association; −, negative association). Four further CNVs had clines with adjusted P < 0.10 but FST < 0.3 and so are not shown (three amplifications, cnv9_262, − †; cnv5_109, + †; cnv14_573, − †, and one deletion, cnv7_193, − †)
cExcluding genes with no known or putative function
Fig. 4Associations between CNVs and sexual stage function. a pairwise linkage disequilibrium between CNV alleles. Heatmap colours indicate the strength and direction of the correlation between isolates across populations (r-value, white indicates the same CNV). Colour bars on left indicate the type of CNV and the strength and direction of its transmission intensity related frequency clines. CNVs were clustered (top dendrogram) by similarity in correlation profiles. CNVs with low linkage disequilibria are excluded. b Ratio of sexual to asexual stage expression (y-axis) for individual inside the genes in (a) grouped by linkage disequilibrium block and CNV type (Amp., amplification; Del., deletion)