| Literature DB >> 26920563 |
Deodatus Vincent Maliti1,2,3, C D Marsden4, B J Main5, N J Govella6, Y Yamasaki7, T C Collier8, K Kreppel9, J C Chiu10, G C Lanzaro11, H M Ferguson12, Y Lee13.
Abstract
BACKGROUND: There is growing evidence that the widespread use of Long-Lasting Insecticidal Nets (LLINs) is prompting malaria vectors to shift their biting towards times and places where people are not protected, such as earlier in the evening and/or outdoors. It is uncertain whether these behavioural shifts are due to phenotypic plasticity and/or ecological changes within vector communities that favour more exophilic species, or involve genetic factors within vector species to limit their contact with LLINs. Possibly variation in the time and location of mosquito biting has a genetic basis, but as yet this phenomenon has received little investigation. Here we used a candidate gene approach to investigate whether polymorphisms in selected circadian clock genes could explain variation in the time and location of feeding (indoors versus outside) within a natural population of the major African malaria vector Anopheles arabiensis.Entities:
Mesh:
Year: 2016 PMID: 26920563 PMCID: PMC4769569 DOI: 10.1186/s13071-016-1394-8
Source DB: PubMed Journal: Parasit Vectors ISSN: 1756-3305 Impact factor: 3.876
Fig. 1A map of Tanzania showing collection sites. Lupiro and Sagamaganga villages are approximately 40 km away from each other in the highly malaria endemic valley of Kilombero
Gene and loci identity with the forward and reverse primer sequences
| Gene | Gene ID | Loci ID | PCR product size(bp) | Forward primer | Reverse primer |
|---|---|---|---|---|---|
|
| AGAP005711 | CLK-E01-267 | 119 | GTAAAATACTCTCCCGGTA | GTAAAATACTCTCCCGGTG |
|
| AGAP005711 | CLK-E01-192 | 113 | GCTTCGTTCGAGAGAAAGGAA | GCTTCGTTCGAGAGAAAGGAG |
|
| AGAP005711 | CLK-E01-087 | 106 | CTTGCGCACGGTCGACTTGTCCATC | CTTGCGCACGGTCGACTTGTCCATT |
|
| AGAP005711 | CLK-E01-240 | 119 | TTCCCGATGATGAACCCGTCC | TTCCCGATGATGAACCCGTCT |
|
| AGAP001958 | CRY1-E04-206 | 120 | TCGACGGCGCAGCACGGA | TCGACGGCGCAGCACGGT |
|
| AGAP001958 | CRY1-E04-097 | 99 | CGCACGTCCATCGTTC | CGCACGTCCATCGTTT |
|
| AGAP001958 | CRY1-E04-240 | 120 | CTACCACCAGCAGCTGTCCA | CTACCACCAGCAGCTGTCCG |
|
| AGAP001958 | CRY1-E04-252 | 102 | CGACCTTGACCGACAGTTC | CGACCTTGACCGACAGTTT |
|
| AGAP004261 | CRY2-E05-378 | 113 | CCACTGCCATTGCCACCA | CCACTGCCATTGCCACCG |
|
| AGAP004261 | CRY2-E05-561 | 98 | GGCGCAGTCGCAGGAAAAC | GGCGCAGTCGCAGGAAAAT |
|
| AGAP004261 | CRY2-E05-501 | 100 | TGAGAATGCTGCAGCTGTGAC | TGAGAATGCTGCAGCTGTGAT |
|
| AGAP004261 | CRY2-E05-407 | 113 | GCCTTGTTTGGTGTCGTCAGGCA | GCCTTGTTTGGTGTCGTCAGGCG |
|
| AGAP004261 | CRY2-E05-045 | 85 | TCCGCTGCCGATGGTC | TCCGCTGCCGATGGTT |
|
| AGAP004261 | CRY2-E05-125 | 82 | CCCCAATACCGCACACCGAA | CCCCAATACCGCACACCGAG |
|
| AGAP004261 | CRY2-E05-351 | 102 | TATCGTGGGTCCGGGCCGCTA | TATCGTGGGTCCGGGCCGCTG |
|
| AGAP004261 | CRY2-E05-051 | 85 | GCGGGAAGCAATCGCA | GCGGGAAGCAATCGCG |
|
| AGAP005655 | CYC-E01-034 | 117 | ATTGCTGTTGGAGGGTTTA | ATTGCTGTTGGAGGGTTTG |
|
| AGAP005655 | CYC-E01-217 | 118 | CCACTCGTTACACCCTGAGGG | CCACTCGTTACACCCTGAGGT |
|
| AGAP005655 | CYC-E01-093 | 96 | GGCAGCGTCCGATTTAAGCCCA | GGCAGCGTCCGATTTAAGCCCG |
|
| AGAP005655 | CYC-E01-072 | 96 | GGGTAAAGTGAAGGAGCAACTC | GGGTAAAGTGAAGGAGCAACTG |
|
| AGAP005655 | CYC-E01-268 | 118 | ACTTTGCACTTCATCCGA | ACTTTGCACTTCATCCGG |
|
| AGAP005655 | CYC-E01-250 | 118 | TGGAAGAAGGAACGGCGC | TGGAAGAAGGAACGGCGA |
|
| AGAP005655 | CYC-E01-021 | 98 | TTGATCTTCTTGGGCAGAGC | TTGATCTTCTTGGGCAGAGT |
|
| AGAP006376 | PDP1-E02-110 | 116 | ATCGTCGCGGGACCGCTTC | ATCGTCGCGGGACCGCTTT |
|
| AGAP001856 | PER-PAS-082 | 101 | CGGCTTCCCCAAGGAC | CGGCTTCCCCAAGGAT |
|
| AGAP001856 | PER-PAS-355 | 100 | AGAAGGCGGAGATCATGAGCGGC | AGAAGGCGGAGATCATGAGCGGT |
|
| AGAP001856 | PER-PAS-202 | 114 | GGGGAAAGAGCGGCCAGAAGGAC | GGGGAAAGAGCGGCCAGAAGGAT |
|
| AGAP001856 | PER-PAS-370 | 100 | GGTGGCCGAGATGATC | GGTGGCCGAGATGATG |
|
| AGAP007801 | TIM-E05-075 | 111 | GCCCCGTTGACGCTGTCC | GCCCCGTTGACGCTGTCG |
|
| AGAP007801 | TIM-E05-087 | 111 | GTATCTGCGTTCCGATGTCG | GTATCTGCGTTCCGATGTCT |
|
| AGAP007801 | TIM-E05-189 | 98 | GTCCGCTACGACACAC | GTCCGCTACGACACAT |
|
| AGAP007801 | TIM-E05-495 | 101 | CCTACGCTGATTGCCTGGCTA | CCTACGCTGATTGCCTGGCTG |
|
| AGAP007801 | VRI-E02-427 | 118 | CCCCGATAAGGATGCGGCCACC | CCCCGATAAGGATGCGGCCACT |
|
| AGAP007801 | VRI-E02-355 | 108 | AAGTTGGCGTGCTCGTGA | AAGTTGGCGTGCTCGTGG |
PCR products sizes are in base pairs (bp)
Loci identity showing reference genes, chromosomes of origin, mutated nucleotide and variant codons
| SNP ID | Reference gene ID ( | Chromosome | SNP type | Reference ( | Variant codon | Reference amino acid | Variant amino acid | Mutation type |
|---|---|---|---|---|---|---|---|---|
| PER-PAS-202 | AGAP001856 | 3R | T/C | GAC | GAU | D | D | S |
| PER-PAS-082 | AGAP001856 | 3R | C/T | GAU | GAC | D | D | S |
| PER-PAS-301 | AGAP001856 | 3R | G/A | CGG | CGA | R | R | S |
| PER-PAS-355 | AGAP001856 | 3R | T/C | GGC | GGU | G | G | S |
| PER-PAS-370 | AGAP001856 | 3R | G/C | CUC | CUG | L | L | S |
| CRY1_E04-240 | AGAP001958 | 2R | T/C | CGG | UGG | R | W | NS |
| CRY1_E04-206 | AGAP001958 | 2R | T/A | UCA | UCU | S | S | S |
| CRY1_E04-097 | AGAP001958 | 2R | A/G | GAA | AAA | E | K | NS |
| CRY1_E04-252 | AGAP001958 | 2R | T/C | UUU | UUC | F | F | S |
| CRY2_E05-351 | AGAP004261 | 2R | C/T | CUU | CUC | L | L | S |
| CRY2_E05-378 | AGAP004261 | 2R | C/T | CUC | CUU | L | L | S |
| CRY2_E05-407 | AGAP004261 | 2R | G/C | AGG | AGC | R | S | NS |
| CRY2_E05-501 | AGAP004261 | 2R | A/G | GCA | ACA | A | T | NS |
| CRY2_E05-561 | AGAP004261 | 2R | T/C | AAC | AAU | N | N | S |
| CRY2_E05-045 | AGAP004261 | 2R | T/C | GUU | GUC | V | V | S |
| CRY2_E05-125 | AGAP004261 | 2R | G/A | GCG | ACG | A | T | NS |
| CYC_E01-268 | AGAP005655 | 2L | T/C | UGC | UGU | C | C | S |
| CYC_E01-250 | AGAP005655 | 2L | G/T | UCG | GCG | S | A | NS |
| CYC_E01-217 | AGAP005655 | 2L | C/A | ACC | CCC | T | P | NS |
| CYC_E01-093 | AGAP005655 | 2L | G/A | ACG | GCG | T | A | NS |
| CYC_E01-072 | AGAP005655 | 2L | G/C | CUC | CUG | L | L | S |
| CYC_E01-021 | AGAP005655 | 2L | T/C | AGU | AGC | S | S | S |
| CYC_E01-034 | AGAP005655 | 2L | T/C | UUC | UUU | F | F | S |
| Clk_E01-087 | AGAP005711 | 2L | G/A | GAU | AAU | D | N | NS |
| Clk_E01-192 | AGAP005711 | 2L | C/T | CUC | UUC | L | F | NS |
| Clk_E01-240 | AGAP005711 | 2L | G/A | CUG | CUA | L | L | S |
| Clk_E01-267 | AGAP005711 | 2L | C/T | CAC | UAC | H | Y | NS |
| PDP1_E02-110 | AGAP006376 | 2L | G/A | GCG | GCA | A | A | S |
| Vri_E02-355 | AGAP007801 | 3R | C/T | AUC | AUU | I | I | S |
| Vri_E02-427 | AGAP007801 | 3R | C/T | ACC | ACU | T | T | S |
| TIM_E05-087 | AGAP007801 | 3R | C/A | CCC | CCA | P | P | S |
| TIM_E05-189 | AGAP007801 | 3R | A/G | AUG | GUG | M | V | NS |
| TIM_E05-075 | AGAP001856 | 3R | G/C | GAC | CAC | D | H | NS |
| TIM_05-195 | AGAP007801 | 3R | C/T | GUC | GUU | V | V | NS |
The loci were selected from the 8 clock genes; PER period, CRY1 cryptochrome1, CRY2 cryptochrome2, CYC cycle, CLK clock, Pdp1 par domain protein 1, VRI vrille and TIM timeless. Mutation type S means synonymous mutation and NS means non-synonymous mutation
Tajima’s D statistics including synonymous and non-synonymous mutations and nucleotide diversity in 8 clock genes and among early and late feeding phenotypes of An. arabiensis
| Chra | Gene IDb | Genec | Popd | Ne | n | ∏(%)g | Dh | μ | μNS j | μNCS k | SNPdl | LE:LLm | SE:SLn | LE:SEo | LL:SLp | Codq pos |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2L | AGAP005711 | Clk E018627 | LE | 30 | 12 | 0.0057 | −1.6620 | 7 | 4 | 7 | 24.31 | 6 | 4 | 7 | 4 | 1 |
| LL | 26 | 7 | 0.0034 | −1.2588 | 7 | 0 | 7 | |||||||||
| SE | 30 | 8 | 0.0032 | −1.5161 | 8 | 0 | 8 | |||||||||
| SL | 24 | 4 | 0.0032 | −0.1632 | 4 | 0 | 4 | |||||||||
| 2R | AGAP001958 | Cry1 E041560 | LE | 22 | 8 | 0.0051 | −0.9904 | 1 | 7 | 1 | 27.36 | 7 | 6 | 7 | 6 | 2 |
| LL | 18 | 10 | 0.0050 | −1.7391 | 1 | 9 | 1 | |||||||||
| SE | 28 | 10 | 0.0061 | −0.9119 | 1 | 9 | 1 | |||||||||
| SL | 20 | 6 | 0.0054 | −0.1223 | 1 | 5 | 1 | |||||||||
| 2R | AGAP004261 | Cry2 E051011 | LE | 28 | 16 | 0.0057 | −0.5965 | 0 | 16 | NA | 31.31 | 12 | 13 | 15 | 13 | 1 |
| LL | 20 | 15 | 0.0054 | −0.9156 | 0 | 15 | 0 | |||||||||
| SE | 36 | 16 | 0.0050 | −0.7496 | 0 | 16 | 0 | |||||||||
| SL | 22 | 15 | 0.0054 | −0.7795 | 0 | 15 | 0 | |||||||||
| 2L | AGAP005655 | Cyc E018454In5 | LE | 18 | 16 | 0.0138 | −0.1541 | 5 | 10 | 5 | 9.53 | 13 | 25 | 16 | 23 | 1 |
| LL | 18 | 32 | 0.0301 | −0.8217 | 7 | 21 | 7 | |||||||||
| SE | 20 | 26 | 0.0277 | 0.5346 | 6 | 16 | 6 | |||||||||
| SL | 16 | 23 | 0.0198 | −0.7512 | 6 | 16 | 6 | |||||||||
| 2L | AGAP006376 | Pdp1 E021381 | LE | 28 | 3 | 0.0021 | −1.3214 | 3 | 0 | 3 | 56.33 | 0 | 0 | 0 | 0 | 1 |
| LL | 20 | 0 | NA | NA | NA | NA | NA | |||||||||
| SE | 26 | 0 | NA | NA | NA | NA | NA | |||||||||
| SL | 26 | 2 | 0.0009 | −1.5131 | 2 | 0 | 2 | |||||||||
| 3R | AGAP001856 | Per PAS | LE | 26 | 5 | 0.0033 | 0.0343 | 0 | 5 | 0 | 68 | 4 | 3 | 4 | 4 | 1 |
| LL | 18 | 5 | 0.0037 | 0.1080 | 0 | 5 | 0 | |||||||||
| SE | 26 | 4 | 0.0036 | 0.1083 | 0 | 4 | 0 | |||||||||
| SL | 20 | 4 | 0.0028 | 0.0781 | 0 | 4 | 0 | |||||||||
| 3R | AGAP001856 | Tim E052569 | LE | 42 | 17 | 0.0073 | 0.4200 | 15 | 2 | 15 | 31.82 | 12 | 11 | 11 | 11 | 1 |
| LL | 32 | 12 | 0.0076 | 1.1916 | 10 | 2 | 10 | |||||||||
| SE | 20 | 11 | 0.0057 | 1.6610 | 10 | 1 | 10 | |||||||||
| SL | 22 | 14 | 0.0071 | 0.3339 | 13 | 1 | 13 | |||||||||
| 3R | AGAP007801 | Vri E027461b | LE | 28 | 4 | 0.0018 | −0.4212 | 0 | 4 | 0 | 125.75 | 2 | 3 | 2 | 1 | 3 |
| LL | 18 | 2 | 0.0010 | 0.9062 | 0 | 2 | 0 | |||||||||
| SE | 30 | 3 | 0.0014 | 0.2328 | 0 | 3 | 0 | |||||||||
| SL | 18 | 3 | 0.0014 | −0.2589 | 0 | 3 | 0 |
aChromosome, b An. gambiae reference gene ID, cGene, eNumber of haplotype sequences, fNumber of segregating sites, gNucleotide diversity, hTajima’s D, iNumber of synonymous mutations, jNumber of non-synonymous mutations, kNumber of silent mutations, lSNP density i.e. number of SNPs found in a given number of nucleotide base-pairs, mshared polymorphisms between early and late in Sagamaganga, nshared polymorphisms between early and late in Sagamaganga, oshared polymorphisms in early biting between Lupiro and Sagamaganga, pshared polymorphisms in late biting between Lupiro and Sagamaganga, and qCodon at which the SNP occurs
STRUCTURE assignment of allele frequencies of clock gene SNPs
| Locus | SNP | Cluster 1 | Cluster 2 | Locus | SNP | Cluster 1 | Cluster 2 | Locus | SNP | Cluster 1 | Cluster 2 |
|---|---|---|---|---|---|---|---|---|---|---|---|
| clk-e01-087 | G | 0.957 | 0.952 | cyc-e01-072 | C | 0.099 | 0.094 | cry2-e05-378 | C | 0.883 | 0.889 |
| A | 0.043 | 0.048 | G | 0.901 | 0.906 | T | 0.117 | 0.111 | |||
| clk-e01-192 | C | 0.865 | 0.890 | cyc-e01-093 | G | 0.461 | 0.503 | cry2-e05-407 | T | 0.106 | 0.082 |
| T | 0.135 | 0.110 | A | 0.539 | 0.497 | C | 0.894 | 0.918 | |||
| clk-e01-267 | T | 0.847 | 0.900 | cyc-e01-034 | T | 0.239 | 0.183 | cry2-e05-501 | G | 0.088 | 0.088 |
| C | 0.153 | 0.100 | C | 0.761 | 0.817 | A | 0.912 | 0.912 | |||
| clk-e01-240 | G | 0.933 | 0.933 | cyc-e01-217 | C | 0.931 | 0.893 | cry2-e05-561 | C | 0.975 | 0.971 |
| A | 0.067 | 0.067 | A | 0.069 | 0.107 | T | 0.025 | 0.029 | |||
| cry1-e04-097 | C | 0.938 | 0.946 | cyc-e01-021 | T | 0.567 | 0.630 | cry2-e05-051 | T | 0.913 | 0.911 |
| T | 0.062 | 0.054 | C | 0.433 | 0.370 | C | 0.087 | 0.089 | |||
| cry1-e04-206 | G | 0.912 | 0.917 | cyc-e021-250 | G | 0.119 | 0.096 | cry2-e05-045 | C | 0.421 | 0.407 |
| A | 0.088 | 0.083 | T | 0.881 | 0.904 | T | 0.579 | 0.593 | |||
| cry1-e04-240 | G | 0.969 | 0.968 | cyc-e01-268 | T | 0.481 | 0.471 | cry2-e05-125 | G | 0.859 | 0.888 |
| A | 0.031 | 0.032 | - | C | 0.519 | 0.529 | A | 0.141 | 0.112 | ||
| cry1-e04-252 | G | 0.771 | 0.764 | - | - | - | - | cry2-e05-351 | T | 0.533 | 0.528 |
| A | 0.229 | 0.236 | - | - | - | - | C | 0.467 | 0.472 | ||
| per-pas-082 | C | 0.929 | 0.931 | tim-e05-189 | G | 0.513 | 0.022 | pdp1-e02-110 | G | 0.967 | 0.977 |
| T | 0.071 | 0.069 | A | 0.487 | 0.978 | - | A | 0.033 | 0.023 | ||
| per-pas-202 | C | 0.643 | 0.624 | tim-e05-495 | G | 0.764 | 0.991 | - | - | - | - |
| T | 0.357 | 0.376 | A | 0.236 | 0.009 | - | - | - | - | ||
| per-pas-355 | C | 0.799 | 0.829 | tim-e05-075 | G | 0.570 | 0.994 | vri-e02-355 | C | 0.978 | 0.987 |
| T | 0.201 | 0.171 | C | 0.430 | 0.006 | T | 0.022 | 0.013 | |||
| per-pas-370 | C | 0.646 | 0.569 | tim-e05-087 | C | 0.187 | 0.489 | vri-e02-427 | C | 0.974 | 0.963 |
| G | 0.354 | 0.431 | A | 0.813 | 0.511 | T | 0.026 | 0.037 |
The Timeless gene was associated with the binary clustering into clusters 1 and 2
Fig. 2STRUCTURE clustering results with parameter K = 2. All 34 loci icluded in (a), and in (b) only the 4 TIM loci included. LEI=Lupiro early indoors, LLI=Lupiro late indoors, SEI=Sagamaganga early indoors, SLI=Sagamaganga late indoors, LEO=Lupiro early outdoors, LLO=Lupiro late outdoors, SEO=Sagamaganga early outdoors, SLO=Sagamaganga late outdoors. There appears to be two population sub-divisions (blue and yellow) across samples with different feeding behaviours
Significance test for Fisher Exact test between cluster 1 and 2 as assigned by STRUCTURE
| Gene | Number of loci | Number of fisher exact comparisons performed between cluster 1 and 2 |
|
|---|---|---|---|
|
| 4 | 8 | 0.123 |
|
| 4 | 8 | 0.125 |
|
| 4 | 8 | 0.122 |
|
| 7 | 14 | 0.071 |
|
| 4 | 8 | <0.0001* |
|
| 8 | 16 | 0.063 |
|
| 1 | 2 | 0.500 |
|
| 2 | 4 | 0.333 |
P-values are for the significance test for the association of the frequency of SNPs between cluster 1 and 2 following Bonferroni correction
Fig. 3Principal Component Analysis based on the genetic distances generated by STRUCTURE at K = 2. All 8 phenotypes were combined in one analysis and separate analysis for each of the 8 feeding phenotypes. Coordinate 1 and 2 represent the first and second principal components, respectively. PCoA analysis included a total of 730 samples from 8 feeding phenotypes from Lupiro and Sagamaganga: LEI (Lupiro early indoors), LEO (Lupiro early outdoors), LLI (Lupiro late indoors), LLO (Lupiro late outdoors), SEI (Sagamaganga early indoors), SEO (Sagamaganga early outdoors), SLI (Sagamaganga late indoors), SLO (Sagamaganga late outdoors). All 34 loci were included in the PCoA analysis
Fig. 4Unrooted tree based on Jukes-Cantor Genetic distance model with bootstrap using complete mitochondrial sequences extracted from [60]. Three distinct groups were identified: Sagamaganga group which included samples from Sagamaganga and Lupiro, Minepa group with samples from Minepa and Lupiro and the Cameroon group