| Literature DB >> 30337594 |
Ernest Diez Benavente1, Damilola R Oresegun2, Paola Florez de Sessions3, Eloise M Walker1, Cally Roper1, Jamille G Dombrowski4, Rodrigo M de Souza4,5, Claudio R F Marinho4, Colin J Sutherland1, Martin L Hibberd1,3, Fady Mohareb2, David A Baker1, Taane G Clark6,7, Susana Campino8.
Abstract
Malaria infection during pregnancy, caused by the sequestering of Plasmodium falciparum parasites in the placenta, leads to high infant mortality and maternal morbidity. The parasite-placenta adherence mechanism is mediated by the VAR2CSA protein, a target for natural occurring immunity. Currently, vaccine development is based on its ID1-DBL2Xb domain however little is known about the global genetic diversity of the encoding var2csa gene, which could influence vaccine efficacy. In a comprehensive analysis of the var2csa gene in >2,000 P. falciparum field isolates across 23 countries, we found that var2csa is duplicated in high prevalence (>25%), African and Oceanian populations harbour a much higher diversity than other regions, and that insertions/deletions are abundant leading to an underestimation of the diversity of the locus. Further, ID1-DBL2Xb haplotypes associated with adverse birth outcomes are present globally, and African-specific haplotypes exist, which should be incorporated into vaccine design.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30337594 PMCID: PMC6193930 DOI: 10.1038/s41598-018-33767-3
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Summary of the Plasmodium falciparum var2csa assembled 7Kb fragment and its diversity by country using the field isolates with little evidence of mixed infections (n = 1,249).
| Population | n | Number sites without missing data | Number of Haplotypes | Haplotype diversity ( | Nucleotide diversity (π) | Average no. of nucleotide differences (pairwise comparison) |
|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| Bangladesh | 11 | 6830 | 10 | 0.982 | 0.0896 | 612 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| PNG | 3 | — | — | — | — | — |
| Brazil | 3 | — | — | — | — | — |
| Colombia | 10 | 7045 | 6 | 0.889 | 0.0775 | 546 |
| Peru | 6 | — | — | — | — | — |
bolded are African countries, italicised are South East Asian countries, DRC = Democratic Republic of Congo; PNG = Papua New Guinea.
Figure 1Plasmodium falciparum var2csa diversity across a 7 kb region covering the five DBL domains, and 1,249 field isolates. (A) Schematic structure of the var2csa gene including the N-terminal sequence (NTS, blue), 5 Duffy binding like-Domains (DBL, green), 3 Inter-Domains (ID, red) and the Cysteine-Rich Inter-Domain (CIDR, yellow); the lengths in amino acids of the 3D7 reference are presented in parentheses. (B) Accumulation of unique insertions and deletions (InDels) across the var2csa gene. (C) Distribution of nucleotide diversity (π) across the gene and by population. Regions of abnormal (“flat”) nucleotide diversity are highlighted in green (B and C).
Diversity statistics across the different domains of the var2csa gene in 1,249 isolates of P. falciparum.
| Domain | Length in amino acids in 3D7 reference | Region in alignment | % of InDel positions | Mean length | Standard Deviation (length) | Positions without missing data | Variant Sites | Variant Sites (%) | No. Haplotypes (n = 1,249) | Haplotype Diversity | Nucleotide diversity |
|---|---|---|---|---|---|---|---|---|---|---|---|
| NTS | 50 | 1–153 | 3 | 151.2 | 1.3 | 132 | 48 | 36 | 285 | 0.977 | 0.105 |
| DBL1X | 429 | 154–1852 | 36 | 1300.0 | 7.8 | 856 | 391 | 46 | 436 | 0.986 | 0.115 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| DBL3X | 358 | 6671–8253 | 30 | 1102.2 | 21.6 | 890 | 172 | 19 | 449 | 0.986 | 0.044 |
| DBL4X | 372 | 8254–9568 | 27 | 1103.9 | 14.9 | 808 | 121 | 15 | 450 | 0.986 | 0.032 |
| ID2 | 40 | 9569–9691 | 0 | 122.0 | 0 | 122 | 25 | 20 | 133 | 0.958 | 0.045 |
| DBL5X | 298 | 9692–10799 | 32 | 885.3 | 37.6 | 668 | 202 | 30 | 414 | 0.986 | 0.063 |
| ID3 | 20 | 10800–10863 | 13 | 58.8 | 3.3 | 56 | 32 | 57 | 56 | 0.919 | 0.193 |
InDel Insertions and deletions; Bolded is the CSA minal binding-domain.
Figure 2Population structure using the ID1-DBL2Xb protein sequences (A) Four distinct clades are identified, with some overlap to the clades found in[18], where Clade 1 is 3D7-like and Clade 2 is FCR3-like. The PCA analysis (B and C) supports the separation of these clades and reveals the proximity of Clades 1 and 2. (D) The distribution of clades across the different regions and previous studies, with three of the clades present across all the populations (Clades 1, 2 and 4); 3D7-like clade associated with adverse outcome in pregnancy is in West Africa (41.2%), East Africa (27.5%), South East Asia (23.5%) and South America (20%); Clade 3 is present in African parasite populations and Clade 4 is predominantly in African populations.