| Literature DB >> 28320313 |
Gloria I Giraldo-Calderón1,2, Michael J Zanis3,4, Catherine A Hill5,6.
Abstract
BACKGROUND: Opsins are light sensitive receptors associated with visual processes. Insects typically possess opsins that are stimulated by ultraviolet, short and long wavelength (LW) radiation. Six putative LW-sensitive opsins predicted in the yellow fever mosquito, Aedes aegypti and malaria mosquito, Anopheles gambiae, and eight in the southern house mosquito, Culex quinquefasciatus, suggest gene expansion in the Family Culicidae (mosquitoes) relative to other insects. Here we report the first detailed molecular and evolutionary analyses of LW opsins in three mosquito vectors, with a goal to understanding the molecular basis of opsin-mediated visual processes that could be exploited for mosquito control.Entities:
Keywords: Conserved non-coding sequences; Differential expression; Gene duplication; Long-wavelength; Mosquito; Opsin; Positive selection; Vision
Mesh:
Substances:
Year: 2017 PMID: 28320313 PMCID: PMC5359912 DOI: 10.1186/s12862-017-0910-6
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Summary of Ae. aegypti, An. gambiae and Cx. quinquefasciatus opsin genes
| Genea | NCBI accession no. | VectorBase gene ID | Supercontig or Scaffold | Base pair range on Supercontig/Scaffold | No. amino acids | No. exons | Expression evidence (transcript, peptide)b | Predicted wavelength sensitivity range |
|---|---|---|---|---|---|---|---|---|
|
| ||||||||
|
| XM_001651947.2 |
| 1.208 | 1,715,269–1,716,390 | 373 | 1 | t, - | Long |
|
| XM_001657569.2 |
| 1.197 | 948,069–949,187 | 372 | 1 | t, - | Long |
|
| XM_001651948.2 |
| 1.208 | 1,726,735–1,727,928 | 373 | 2 | t, - | Long |
|
| XM_001651116.2 |
| 1.165 | 1,775,297–1,776,430 | 377 | 1 | t, - | Long |
|
| XM_001651117.2 |
| 1.165 | 1,782,454–1,783,587 | 377 | 1 | t, - | Long |
|
| XM_001652675.2 |
| 1.252 | 485,115–486,311 | 379 | 2 | t, - | Long |
|
| XM_001653816.2 |
| 1.412 | 271,590–320,188 | 381 | 5 | t, - | Ultraviolet |
|
| XM_001662932.2 |
| 1.75 | 1,344,859–1,358,612 | 380 | 2 | t, - | Short |
|
| XM_011494686.1 |
| 1.9 | 1,914,223–1,914,456 | 77 | 1 | N.A. | N.A. |
|
| XM_001650694.1 |
| 1.151 | 1,763,069–1,943,586 | 502 | 3 | t, - | Rh7-like |
|
| XM_001650752.2 |
| 1.153 | 1,048,301–1,134,510 | 412 | 5 | t, - | Pteropsin |
|
| ||||||||
|
| XM_003435715.1 |
| AAAB01008987 | 15,485,448–15,486,560 | 370 | 1 | t, p | Long |
|
| XM_003435710.1 |
| AAAB01008987 | 15,549,902–15,551,014 | 370 | 1 | t, p | Long |
|
| XM_003435711.1 |
| AAAB01008987 | 15,548,576–15,547,464 | 370 | 1 | t, p | Long |
|
| XM_001238570.2 |
| AAAB01008987 | 15,552,322–15,553,524 | 378 | 2 | t, p | Long |
|
| XM_322000.3 |
| AAAB01008987 | 15,555,427–15,556,642 | 372 | 2 | t, p | Long |
|
| XM_312478.3 |
| AAAB01008859 | 9,719,568–9,718,344 | 374 | 2 | t, p | Long |
|
| XM_001688738.1 |
| AAAB01008960 | 13,070,327–13,075,141 | 379 | 5 | t, p | Ultraviolet |
|
| XM_319247.2 |
| AAAB01008980 | 12,661,959–12,660,706 | 367 | 3 | t, p | Short |
|
| XM_308329.4 |
| AAAB01008807 | 1,869,762–1,884,337 | 463 | 3 | t, p | Rh7-like |
|
| XM_312503.4 |
| AAAB01008859 | 10,064,831–10,075,172 | 460 | 5 | t, - | Pteropsin |
|
| XM_312502.2 |
| AAAB01008859 | 10,056,116–10,044,185 | 433 | 5 | t, p | Pteropsin |
|
| ||||||||
|
| XM_001845645.1 |
| 3.60 | 303,692–304,879 | 374 | 2 | - | Long |
|
| XM_001846589.1 |
| 3.82 | 144,908–146,098 | 378 | 2 | - | Short |
|
| XM_001851105.1 |
| 3.237 | 371,783–376,427 | 379 | 4 | t, - | Ultraviolet |
|
| XM_001861603.1 |
| 3.327 | 113,980–120,849 | 466 | 5 | - | Rh7-like |
|
| XM_001862130.1 |
| 3.358 | 153,562–154,668 | 368 | 1 | t, - | Long |
|
| XM_001862163.1 |
| 3.361 | 253,314–254,429 | 371 | 1 | t, - | Long |
|
| XM_001862165.1 |
| 3.361 | 260,306–261,487 | 373 | 2 | t, - | Long |
|
| XM_001862166.1 |
| 3.361 | 267,771–268,892 | 373 | 1 | t, - | Long |
|
| XM_001862168.1 |
| 3.361 | 285,932–287,053 | 373 | 1 | t, - | Long |
|
| XM_001863451.1 |
| 3.447 | 159,930–161,031 | 351 | 1 | - | Long |
|
| XM_001863511.1 |
| 3.452 | 5394–6590 | 377 | 2 | - | Short |
|
| XM_001864516.1 |
| 3.529 | 288,784–316,273 | 411 | 5 | - | Pteropsin |
|
| XM_001870251.1 |
| 3.2251 | 5950–7074 | 374 | 1 | t, - | Long |
aAnnotations from Hill et al. [25], Nene et al. [13] and Arensburger et al. [14]. Revised opsin annotations were produced using the AaegL1.2, AgamP3.5 and Cpip1.2 gene sets from VectorBase; bExpression evidence for Ae. aegypti, An. gambiae and Cx. quinquefasciatus obtained from PCR or RT-PCR data (Additional file 1: Table S3), ESTs (Additional file 1: Table S2), Microarrays [40–42], RNAseq [43, 53] and mass spectrometry [44–46]; p, peptide evidence; t, transcript evidence; -, absence of transcript or peptide evidence; cIndicates incomplete gene model; N.A. not available
Fig. 1Amino acid alignment of the predicted trans-membrane domains (TMDI-VII) of Ae. aegypti putative LW opsins. Highlighted amino acids show positions conserved in LW opsins. Black shading, identical residues; gray shading, similar residues, based on the similarity matrix BLOSUM62. The asterisk (*) indicates the residue identified by Fitmodel as under positive selection
Fig. 2Predicted divergence times of insect long-wavelength-sensitive (LW) opsins, based on r8s and BEAST analyses. Estimates of time of divergence are shown in million years MY with r8s (a) and BEAST (b) software. Color circles show the duplication events in the evolution of mosquito LW opsins: most recent common ancestor (MRCA) duplication event (red); An. gambiae duplication event (blue); culicinae duplication event (green); Ae. aegypti duplication event (purple); Cx. quinquefacitaus duplication event (yellow). Squares (dotted line) show the most recent and the more ancestral mosquito LW duplication events. The dagger shows the constrained nodes using fossils. Gray shading indicates opsins under positive selection. Species abbreviations: Acyrthosiphon pisum (ApL), Apis mellifera (AmL), Bombyx mori (BBmL), Calliphora vicina (Cv), Danaus plexippus (DpL), Drosophila melanogaster (DmRh), Pediculus humanus (PhL), and Tribolium castaneum (TcL). a Divergence times in MY were estimated using a penalized likelihood (PL) approach and calibrated with the sister group to Culicidae (Chaoboridae, 187 MY [112]). b Divergence times in MY were estimated using a relaxed clock log normal model and calibrated with the following fossil calibrations [117]: Westphalomerope maryvonneae 313.7 MY (Holometabola), Triassoxyela foveolata 226.4 MY (Hymenoptera), Parasabatinca aftimacrai 129.4 MY (Lepidoptera), Grauvogelia arzvilleriana 240.5 MY (Diptera). The drops and flower represent the first predicted appearance of blood and nectar sources, namely reptiles (345–280 MY), mammals (150 MY), birds (136–65 MY) and flowering plants (125–130 MY) [120, 121]
Residues in mosquito long wavelength opsins predicted under positive selection
| Gene | Amino acid residue | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| N-terminus (extracellular) | Trans-membrane domain III | C-terminus (intracellular) | |||||||
| Clade 4 | |||||||||
|
| G5* | M8* | V17 | A18 | S19* | M128 | Q357 | V366 | K373* |
|
| G5* | M8* | V17 | A18 | S19* | M128 | Q357 | V366 | K373* |
|
| A5* | A10* | A13 | V14 | A15* | M124 | Q340 | V349 | - |
|
| A5* | T8* | A16 | A17 | M18* | M127 | Q356 | I365 | I372* |
|
| A5* | N8 | A16 | M17 | V18* | M127 | Q356 | V365 | S372 |
|
| A5* | N8 | A16 | M17 | V18* | M127 | Q356 | V365 | S372 |
|
| Q10* | Q14 | A16 | A17 | T18* | M127 | Q356 | V365 | S372 |
|
| S10* | Q14 | V16 | V17 | S18 | M127 | Q356 | V365 | A372 |
| Clade 3 | |||||||||
|
| D10 | Q13 | G15 | N16 | G17 | C126* | D355* | S367 | - |
|
| D10 | Q13 | G15 | A16 | G17 | C126* | D355* | E364* | - |
|
| D10 | Q13 | A15 | G16 | G17 | C126* | N357* | T366 | - |
|
| D10 | - | S14 | S15 | G16 | M125 | N358* | T367 | - |
|
| D10 | T13 | S15* | G16 | G17 | M126 | G355* | Q364 | - |
|
| D10 | T13 | G15* | S16 | G17 | M126 | G355* | Q364 | - |
|
| D10 | T13 | S15* | G16 | G17 | M126 | G355* | Q364 | - |
The asterisk (*) indicates residues under positive selection (Fitmodel analysis). The phylogenetic relationship of genes under positive selection is shown in Fig. 2. The residue number was derived by numbering from the first amino acid of each sequence (note that the B. taurus and T. pacificus rhodopsins were not used as reference to number the residues); -, indicates a gap inserted in the alignment and thus, a residue is not available at this position
Fig. 3Conserved non-coding sequences (CNS) and transcription factor binding sites (TFBS) in AgGPRop3 and AgGPRop4. a Anopheles gambiae opsin genes showing 5′- and 3′-non coding region on chromosome (Chr) 2R. b Alignment between the 5′ region of the AgGPRop3 (2,612 bp) and of AgGPRop4 (10,000 bp) was submitted to mVISTA using the LAGAN algorithm to identify CNS. Conserved regions are shaded (red). The minimum value on the Y-axis is 50%, the minimum conservation identity is 70%, and the minimum length for a CNS is 100 bp (default values on the VISTA plot). Because VISTA calculates the percentage of conservation identity as the number of matches divided by the length of the reference sequence (not the length of the alignment), the length in bp of the CNS represents the reciprocal alignment value. c Sequences were submitted to rVISTA to identify conserved clusters of the 50 known TFBS in insects using TRANSFAC®. Vertical (green) lines indicate the position of the 25 conserved TFBS identified, which occur in the clusters 1 and 2