| Literature DB >> 33930146 |
Jiao Zhu1,2, Alessio Iannucci3, Francesca Romana Dani3, Wolfgang Knoll1, Paolo Pelosi1.
Abstract
Lipocalins represent one of the most successful superfamilies of proteins. Most of them are extracellular carriers for hydrophobic ligands across aqueous media, but other functions have been reported. They are present in most living organisms including bacteria. In animals they have been identified in mammals, molluscs, and arthropods; sequences have also been reported for plants. A subgroup of lipocalins, referred to as odorant-binding proteins (OBPs), mediate chemical communication in mammals by ferrying specific pheromones to the vomeronasal organ. So far, these proteins have not been reported as carriers of semiochemicals in other living organisms; instead chemical communication in arthropods is mediated by other protein families structurally unrelated to lipocalins. A search in the databases has revealed extensive duplication and differentiation of lipocalin genes in some species of insects, crustaceans, and chelicerates. Their large numbers, ranging from a handful to few dozens in the same species, their wide divergence, both within and between species, and their expression in chemosensory organs suggest that such expansion may have occurred under environmental pressure, thus supporting the hypothesis that lipocalins may be involved in chemical communication in arthropods.Entities:
Keywords: arthropods; chemical communication; insects; lipocalins; odorant-binding proteins; phylogenesis
Mesh:
Substances:
Year: 2021 PMID: 33930146 PMCID: PMC8214410 DOI: 10.1093/gbe/evab091
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Fig. 1.Number of genes encoding lipocalins (LCN) in selected species of insects. Orders and Classes are reported, but the tree does not reflect phylogenetic distances. Adapted from Giribet and Edgecombe (2012).
Fig. 2.Number of genes encoding lipocalins (LCN) in selected species of Crustacea and Chelicerata. Orders and Classes are reported, but the tree does not reflect phylogenetic distances. Adapted from Giribet and Edgecombe (2012). In addition to those reported, we only found one or two LCN genes in the NCBI databases relative to the following Chelicerata: Dinothrombium tinctorium; Euroglyphus maynei; Galendromus occidentalis; Ixodes scapularis; Leptotrombidium delicense; Rhipicephalus microplus; Sarcoptes scabiei; Tropilaelaps mercedesae. We could not find any LCN genes in the following species. Crustacea: Triops cancriformis; Procambarus virginalis; Lepidurus apus; Cherax destructor; Pandalus platyceros; Ligia exotica; Tisbe sp.; Pandalus platyceros; Apocyclops sp.; Semibalanus balanoides; Acartia tonsa; Eulimnadia texana; Calanus glacialis; Caridina multidentate; Oithona nana; Caligus rogercresseyi; Cherax destructor; Parhyale hawaiensis; Tigriopus japonicus; Calanus finmarchicus; Lepeophtheirus salmonis. Chelicerata: Acanthoscurria geniculate; Achipteria coleoptrata; Androctonus mauritanicus; Anelosimus studiosus; Brevipalpus yothersi; Cordylochernes scorpioides; Dermanyssus gallinae; Dermatophagoides farina; Dysdera sylvatica; Haemaphysalis longicornis; Hypochthonius rufulus; Ixodes ricinus; Latrodectus hesperus; Loxosceles reclusa; Mesobuthus martensii; Pardosa pseudoannulata; Platynothrus peltifer; Psoroptes ovis; Steganacarus magnus; Tachypleus tridentatus. Finally, no LCN genes were identified for Pycnogonida or Myriapoda.
Fig. 3.Neighbour-Joining tree of lipocalins identified in the genomes of selected species of Hexapoda. Apis: Acyrthosiphon pisum; Aaeg: Aedes aegypti; Agam: Anopheles gambiae; Acer: Apis cerana; Bmor: Bombyx mori; Cqui: Culex quinquefasciatus; Dple: Danaus plexippus; Dmel: Drosophila melanogaster; Fcan: Folsomia candida; Harm: Helicoverpa armigera; Ldec: Leptinotarsa decemlineata; Mrot: Megachile rotundata; Nvit: Nasonia vitripennis; Ocin: Orchesella cincta; Pxut: Papilio Xuthus; Prap: Pieris rapae; Tcas: Tribolium castaneum. The 44 sequences of Rhodnius prolixus are not included, because most of them segregate into a separate clade (see supplementary fig. S1, Supplementary Material online). Color code: Entognatha: magenta; Hemiptera: purple; Hymenoptera: green; Coleoptera: brown; Lepidoptera: blue; Diptera: red.
Genome Location of Bombyx mori (Bmor) and Helicoverpa armigera (Harm) Lipocalin Genes
| Protein | Location | Locus Tag |
|---|---|---|
|
| Chr 5 | LOC101742419 |
|
| Chr 5 | LOC101742419 |
|
| Chr 5 | LOC100286767 |
|
| Chr 25 | Chbp |
|
| Chr 25 | LOC692416 |
|
| Chr 25 | LOC101745459 |
|
| Chr 25 | LOC101745319 |
|
| Chr 25 | LOC101745175 |
|
| Chr 25 | LOC101745601 |
| Bmor_XP_004921596 | Chr 10 | LOC101739203 |
| Bmor_XP_004926879 | Chr 16 | LOC101742345 |
| Bmor_XP_004923537 | Chr 6 | LOC101739113 |
|
| NW_018395398.1 | B5X24_HaOG215900 |
|
| NW_018395398.1 | B5X24_HaOG215901 |
|
| NW_018395510.1 | B5X24_HaOG202918 |
|
| NW_018395510.1 | B5X24_HaOG202919 |
|
| NW_018395510.1 | B5X24_HaOG202921 |
| Harm_XP_021182455.1 | NW_018395414.1 | B5X24_HaOG207595 |
| Harm_XP_021185371.1 | NW_018395436.1 | B5X24_HaOG212519 |
| Harm_AFK64814.1 | Unannotated | Unannotated |
Note.—Genes in bold font are clustered (supplementary Fig. S2, Supplementary Material online). H. armigera genome is currently not chromosome anchored, however, genes are present in proximity on the same scaffolds. In particular, Harm_XP_021200319.1 and Harm_XP_021200364.1 are found in close proximity on the scaffold NW_018395398.1, whereas Harm_XP_021190587.1, Harm_XP_021190592.1, and Harm_XP_021190608.1 are found in close proximity on the scaffold NW_018395510.1 (supplementary fig. S2, Supplementary Material online). For this analysis, we used the reference genomes Harm1.0 (acc. no. PRJNA378437) and Bmor_2016v1.0 (acc. no. PRJDB4947).
Fig. 6.Expression of lipocalins in different tissues of H. armigera. The relative expression was evaluated based on the number of SRA found for each sequence in the transcriptomes of pheromone glands, antennae, salivary glands, and abdomen. Values were normalized with reference to the housekeeping gene NADH dehydrogenase (acc. no. AHJ91280.1) (Yang et al. 2014). Only SRAs with 100% identity to the query sequence were taken into account.