| Literature DB >> 32660484 |
Hui-Ying Huang1,2, Xue-Yan Liang1,2, Li-Yun Lin1, Jiang-Tao Chen3,4, Carlos Salas Ehapo5, Urbano Monsuy Eyi5, Jian Li6, Ting-Ting Jiang6, Yu-Zhong Zheng1, Guang-Cai Zha1, Dong-De Xie4, Jin-Quan He4, Wei-Zhong Chen7, Xiang-Zhi Liu7, Huan-Tong Mo7, Xin-Yao Chen7, Min Lin8,9.
Abstract
BACKGROUND: Plasmodium falciparum circumsporozoite protein (PfCSP) is a potential malaria vaccine candidate, but various polymorphisms of the pfcsp gene among global P. falciparum population become the major barrier to the effectiveness of vaccines. This study aimed to investigate the genetic polymorphisms and natural selection of pfcsp in Bioko and the comparison among global P. falciparum population.Entities:
Keywords: Bioko Island; Circumsporozoite protein; Genetic polymorphism; Malaria; Plasmodium falciparum
Mesh:
Substances:
Year: 2020 PMID: 32660484 PMCID: PMC7359586 DOI: 10.1186/s12936-020-03315-4
Source DB: PubMed Journal: Malar J ISSN: 1475-2875 Impact factor: 2.979
Genetic diversity and natural selection test and recombination analysis of global PfCSP C-terminus
| S | H | Hd ± SD | dN-dS | Tajima’D | Fu and Li’s D | Fu and Li’s F | Ra | Rb | Rm | |
|---|---|---|---|---|---|---|---|---|---|---|
| Bioko island | 33 | 34 | 0.962 ± 0.008 | 0.0166 | − 0.68556 | − 1.23926 | − 1.22255 | 0.0888 | 32.1 | 8 |
| Cameroon | 12 | 7 | 0.944 ± 0.70 | 0.037 | − 0.84239 | − 0.68964 | − 0.81319 | 0.0919 | 15.9 | 2 |
| Gambia | 23 | 21 | 0.921 ± 0.022 | 0.0182 | 0.13689 | 0.31057 | 0.29579 | 0.057 | 21.5 | 6 |
| Ghana | 37 | 89 | 0.960 ± 0.004 | 0.0194 | − 0.48482 | 0.03757 | − 0.23777 | 0.0897 | 33.8 | 7 |
| Senegal | 31 | 42 | 0.954 ± 0.008 | 0.0193 | − 0.15853 | 0.44100 | 0.23957 | 0.0992 | 37.4 | 6 |
| Tanzania | 21 | 29 | 0.966 ± 0.009 | 0.059 | 0.60634 | 0.80724 | 0.875 | 0.1838 | 31.8 | 6 |
| Congo | 22 | 25 | 0.947 ± 0.011 | 0.0175 | 0.18825 | 0.17445 | 0.21407 | 0.0666 | 25.1 | 6 |
| Nigeria | 7 | 4 | 0.900 ± 0.161 | − 0.194 | 0.13160 | 0.78960 | 0.69173 | 0.1954 | 33.8 | 0 |
| Mali | 23 | 26 | 0.947 ± 0.014 | 0.0192 | 0.16275 | 0.90715 | 0.76259 | 0.1069 | 40.3 | 5 |
| Malawi | 28 | 51 | 0.965 ± 0.004 | 0.0189 | 0.26223 | 0.56701 | 0.52963 | 0.0883 | 33.3 | 9 |
| Guinea | 23 | 26 | 0.955 ± 0.011 | 0.0185 | 0.08885 | 1.17263 | 0.93102 | 0.1271 | 47.9 | 5 |
| India | 2 | 2 | 0.389 ± 0.164 | 0.0057 | 0.19590 | 1.063 | 0.94854 | 0.0006 | 0.1 | 0 |
| Iran | 2 | 2 | 0.467 ± 0.075 | 0.0068 | 1.56386 | 0.85807 | 1.20551 | 0.0127 | 2.2 | 0 |
| Myanmar | 7 | 7 | 0.342 ± 0.080 | 0.0021 | − 1.06145 | 0.43133 | − 0.06134 | 0 | 0.001 | 1 |
| Philippines | 18 | 9 | 0.821 ± 0.041 | 0.073 | 1.01981 | 1.64995* | 1.70088* | 0.0977 | 16.9 | 1 |
| Thailand | 13 | 9 | 0.535 ± 0.048 | 0.0066 | − 0.16926 | 0.24690 | 0.11276 | 0 | 0.001 | 4 |
| Vietnam | 18 | 11 | 0.757 ± 0.034 | 0.0122 | − 0.03642 | 0.81913 | 0.60286 | 0.009 | 3.4 | 3 |
| Bangladesh | 20 | 14 | 0.890 ± 0.036 | 0.0134 | − 0.53740 | − 0.45410 | − 0.56526 | 0.0549 | 20.7 | 5 |
| Cambodia | 22 | 22 | 0.812 ± 0.009 | 0.0115 | 0.10655 | 1.81010* | 1.33256 | 0.0231 | 8.7 | 6 |
| Laos | 18 | 10 | 0.813 ± 0.030 | 0.0108 | − 0.13493 | 1.27356 | 0.91834 | 0.0154 | 5.8 | 3 |
| Brazil | 8 | 3 | 0.459 ± 0.080 | 0.032 | 0.72190 | 1.30884 | 1.31767 | 0 | 0.001 | 0 |
| Venezuela | 14 | 7 | 0.911 ± 0.077 | 0.023 | 0.64552 | 0.79924 | 0.85743 | 0.167 | 28.9 | 2 |
| PNG | 5 | 4 | 0.532 ± 0.034 | 0.02 | 0.41918 | − 0.01266 | 0.15105 | 0.0283 | 4.9 | 1 |
| Solomon | 4 | 3 | 0.405 ± 0.069 | 0.031 | − 0.00704 | 1.00249 | 0.80842 | 0 | 0.004 | 0 |
| Vanuatu | 2 | 2 | 0.276 ± 0.046 | 0.016 | 0.70558 | 0.67407 | 0.80111 | 0 | 0.001 | 0 |
n: number of samples; S: segregating sites; H: number of haplotypes; Hd: haplotype diversity; dN: the number of synonymous substitutions per site; dS: the number of nun-synonymous substitutions per site; Ra: estimate of recombination between adjacent sites; Rb: estimate of recombination per gene; Rm: minimum number of recombination events; *p < 0.05
Fig. 1Global sequence polymorphism of pfcsp N-terminus and central repeat region. a Mutation type and frequency of N-terminal region of global pfcsp. Mutations with frequency < 10% were not marked. b Frequencies composition of NANP/NVDP repeat numbers in central repeat region among global pfcsp
Fig. 2Global nucleotide diversity (π) of C-terminal (311–363) region. The values of nucleotide diversity (π) were calculated using DNASP version 6.12.01 with the sliding window length of 10 bp and step size of 5 bp
Fig. 3Non-synonymous mutation in C-terminus (311–363) of global pfcsp. Each logo consists of stacks of symbols, one stack for each position in the sequence. The height of the amino acid abbreviation indicates its relative frequency at that position. As for the pattern of 3D7 isolate, all positions were marked in coloured; as for global pfcsp patterns, mutation sites are marked in colour, while the conserved sites are in gray
Fig. 4Mutations distribution and T cell epitopes map of pfcsp (3D7 isolate). Capital letters in black are amino acid sequences of 3D7 isolate; The red capital letters under the black ones are for mutants. Sequences with black solid line below indicated CD8+ T cell epitopes, sequences with blue dotted line above indicated CD4+ T cell epitopes. Repeat region is in gray shadow; Th2R region is in orange shadow; Th3R region is in green shadow
Global PfCSP C-terminal Mutant types and effect prediction
| Mutant | Frequency in global | Frequency in Bioko | SIFT | Humdiv | ΔΔG |
|---|---|---|---|---|---|
| K314Q | 21.78% | 47.92% | 0.13 | 0.997A | − 0.373664 |
| K317E | 84.32% | 93.75% | 0.43 | 0.146C | 1.07621 |
| K317T | 6.97% | 1.04% | 0.2 | 0.995A | 1.20284 |
| K317A | 0.17% | 0 | 0.08 | 0.988A | 1.31391 |
| E318K | 23.52% | 56.25% | 0.08 | 0.04C | 1.29697 |
| E318Q | 59.67% | 34.38% | 0.06 | 0.02C | 0.981937 |
| N321K | 84.76% | 79.17% | 0.84 | 0.043C | − 1.74619 |
| N321Q | 1.83% | 6.25% | 0.54 | 0.954A | − 0.111155 |
| N321T | 2.92% | 0 | 0.51 | 0.082C | − 1.89854 |
| K322E | 1.61% | 8.33% | 0.09 | 0.186C | 0.882041 |
| K322I | 5.97% | 5.21% | 0.02* | 0.353C | − 0.667251 |
| K322R | 4.44% | 10.42% | 0.16 | 0.521B | 0.504493 |
| K322T | 16.9% | 43.75% | 0.06 | 0.186C | 1.11661 |
| K322N | 6.45% | 0 | 0.06 | 0.186C | 1.04753 |
| Q324K | 19.29% | 39.58% | 0.29 | 0.043C | − 0.294208 |
| N325Y | 7.58% | 0 | 0.02* | 0.186C | 0.241653 |
| S326A | 0.09% | 0 | 0.04* | 0.94B | − 0.00308024 |
| L327I | 11.06% | 10.42% | 0.82 | 0.994A | 1.56456 |
| G349D | 0.13% | 0 | 0.83 | 0.999A | − 0.608967 |
| N352G | 8.36% | 9.38% | 0.14 | 0.994A | 0.117207 |
| N352D | 18.73% | 6.25% | 0.08 | 0.523B | 0.0203338 |
| P354S | 4.66% | 1.04% | 0.18 | 0.988A | 1.85106 |
| D356N | 3.79% | 9.38% | 0.1 | 0.995A | − 0.222874 |
| D356G | 0.09% | 2.08% | 0.09 | 0.988A | 0.0325801 |
| E357Q | 45.78% | 76.04% | 0.13 | 0.723B | − 0.334979 |
| D359N | 8.84% | 14.58% | 0.2 | 0.288C | 0.433524 |
| A361E | 72.43% | 46.88% | 0.79 | 0.186C | 0.273329 |
| A361I | 0.96% | 1.04% | 0.46 | 0.973A | 0.206936 |
* SIFT < 0.05, predicted to be deleterious; HumdivA > = 0.953, probably damaging; 0.953 > HumdivB > = 0.432, possibly damaging; 0.432 > HumdivC > = 0.0024, benign
Fig. 5Haplotype network of C-terminal region among global pfcsp. Isolates from four continents and Bioko Island were marked in five different series colours, blue series for Africa, red series for Asia, khaki series for Oceania, green series for America, and yellow for Bioko Island
Population pairwise fixation index (Fst) result
| Africa | America | Asia | Oceania | |
|---|---|---|---|---|
| America | 0.19194* | |||
| Asia | 0.24167* | 0.21294* | ||
| Oceania | 0.25712* | 0.42132* | 0.06564* | |
| Bioko | 0.00878* | 0.24014* | 0.31435* | 0.42889* |
* p < 0.05