Literature DB >> 34899608

Identification and Expression Analysis of G Protein-Coupled Receptors in the Miridae Insect Apolygus lucorum.

Han Gao1, Yanxiao Li1, Miao Wang1, Xiaowen Song1, Jing Tang1, Fan Feng1, Bin Li1.   

Abstract

G protein-coupled receptors (GPCRs) are the largest and most versatile family of transmembrane receptors in the cell and they play a vital role in the regulation of multiple physiological processes. The family Miridae (Hemiptera: Heteroptera) is one of the most diverse families of insects. Until now, information on GPCRs has been lacking in Miridae. Apolygus lucorum, a representative species of the Miridae, is an omnivorous pest that occurs worldwide and is notorious for causing serious damage to various crops and substantial economic losses. By searching the genome, 133 GPCRs were identified in A. lucorum. Compared with other model insects, we have observed GPCR genes to be remarkably expanded in A. lucorum, especially focusing on biogenic amine receptors and neuropeptide receptors. Among these, there is a novel large clade duplicated from known FMRFamide receptors (FMRFaRs). Moreover, the temporal and spatial expression profiles of the 133 genes across developmental stages were determined by transcriptome analysis. Most GPCR genes showed a low expression level in the whole organism of A. lucorum. However, there were a few highly expressed GPCR genes. The highly expressed LW opsins in the head probably relate to nocturning of A. lucorum, and the expression of Cirl at different times and in different tissues indicated it may be involved in growth and development of A. lucorum. We also found C2 leucine-rich repeat-containing GPCRs (LGRs) were mainly distributed in Hemiptera and Phthiraptera among insects. Our study was the first investigation on GPCRs in A. lucorum and it provided a molecular target for the regulation and control of Miridae pests.
Copyright © 2021 Gao, Li, Wang, Song, Tang, Feng and Li.

Entities:  

Keywords:  Apolygus lucorum; GPCRs; expansion; identification; phylogenetic analysis

Mesh:

Substances:

Year:  2021        PMID: 34899608      PMCID: PMC8660763          DOI: 10.3389/fendo.2021.773669

Source DB:  PubMed          Journal:  Front Endocrinol (Lausanne)        ISSN: 1664-2392            Impact factor:   5.555


1 Introduction

G protein-coupled receptors (GPCRs) are in a large family of protein cell surface receptors that detect molecules outside the cell and activate cellular responses (1, 2). GPCRs are found only in eukaryotes, namely, yeast, choanoflagellates, and animals (3). Based on sequence homology and functional similarity, GPCRs can be grouped into six families (4, 5): Family-A (rhodopsin-like); Family-B (secretin receptor family); Family-C (metabotropic glutamate/pheromone); Family-D (fungal mating pheromone receptors); Family-E (cyclic AMP receptors); and Family-F (frizzled/smoothened). These receptors are involved in a wide variety of physiological processes (6), namely, visual sensation (7), taste (8), smell sensation (9), behavioral and mood regulation (10), regulation of immune system activity and inflammation (11, 12), and autonomic nervous system transmission (10, 13). Because of their crucial roles in the regulation of multiple physiological processes, GPCRs are an important drug target (14) and approximately 34% (15) of all Food and Drug Administration (FDA) approved drugs target 108 members of this family. With the continuous innovation of next-generation sequencing technology and bioinformatics, systematic identification research about GPCRs has been reported in several insects (16–21). Among Hemiptera, it has been reported in Acyrthosiphon pisum (22), Aphis craccivora (23), Cimex lectularius (24), Diaphorina citri (25), Nilaparvata lugens (26), and Rhodnius prolixus (27). The family Miridae (Hemiptera: Heteroptera), which includes plant bugs, leaf bugs, or grass bugs, is one of the most diverse families of insects, including over 11,000 species in more than 1,300 genera (4, 28). It is the largest family of true bugs belonging to Hemiptera and new members of Miridae are being described constantly. Mirids exhibit a wide range of food preferences and behaviors, including phytophagy, carnivory, and omnivory. Some mirids exhibit significant economic impacts and some are pests of food and fiber crops, whereas others are beneficial species used as biological control agents (29). Although Miridae is the largest family of Hemiptera and exhibits a complex habit, there has been little information reported on GPCRs. Apolygus lucorum (Miridae) is an omnivorous pest that occurs worldwide and is notorious for the serious damage it causes in various crops and its substantial economic losses (30, 31). Recently, the genome of A. lucorum had been reported, which provided convenient in-depth studies of this pest (32). In the present research, using bioinformatics analysis, we screened the genes encoding GPCRs from the genome of A. lucorum. The expression profiles of all GPCRs were also determined by using public transcriptome data. These results allowed us to make comparisons of GPCR systems in different insect species and to provide relevant information for further functional studies in A. lucorum. Our study was the first investigation of GPCRs in A. lucorum, which may become the basis for further investigation of the function of miridae GPCRs.

2 Materials and Methods

2.1 Identification of A. lucorum GPCRs

A. lucorum protein sequences were retrieved from the NCBI Genome database (https://www.ncbi.nlm.nih.gov/assembly/GCA_009739505.2/) (32). Based on previous studies and records in Flybase (http://www.flybase.org/) (33), the GPCRs of Drosophila melanogaster (34), A. pisum (22), Bombyx mori (17), Tribolium castaneum (16), and Pediculus humanus humanus (18) were collected. By using D. melanogaster GPCRs as references and A. lucorum protein sequences as queries, BLASTP searches (35) were performed with a cut-off e-value of 1e−5 to look for all GPCR candidates. Then, seven-transmembrane (7TM) domain and annotation information was adopted as the basic criteria for all GPCR candidates. The GPCR candidates in which the number of 7TM domains was more than four or the annotation information indicated it was a GPCR were retained. The remaining GPCR candidates were also confirmed by means of BLASTX analysis in the UniProtKB/Swiss-Prot database. Using all GPCRs that we collected, pre-phylogenetic analysis with the maximum likelihood method was the final criteria to remove non-GPCRs from candidate pools. If a candidate showed fewer genetic relationships with known GPCRs by phylogenetic analysis and the hit sequences in BLASTX analysis indicated they were not GPCRs, they were classified as a non-GPCR and removed from our analysis.

2.2 Structural Analyses, Annotation Information, and Gene Locations of GPCRs

The 7TM domains for all GPCR candidates were predicted with the server TMHMM (v2.0) (36) from the Centre for Biological Sequence Analysis (http://www.cbs.dtu.dk/services/TMHMM/). Functional annotations of the target proteins were done using InterProScan (37). In addition, the chromosomal location of each GPCR candidate was extracted from the genome annotation file of A. lucorum.

2.3 Phylogenetic Analysis

Partial GPCRs of R. prolixus and C. lectularius that also belonged to Heteroptera were also obtain based on previous study (24, 27). GPCRs from D. melanogaster, A. pisum, C. lectularius, and R. prolixus were assigned to a family/subfamily according to previous results (22, 24, 27, 34). Putative A. lucorum GPCRs were classified into different families/subfamilies according to the families to which their orthologous proteins were assigned. Amino acid sequences of the putative A. lucorum GPCRs in each family/subfamily were aligned with receptors of the same family/subfamily in D. melanogaster, A. pisum, C. lectularius, and R. prolixus using MAFFT v7 (38). Phylogeny tests were accomplished using the bootstrap method with 1,000 replications to reconstruct maximum likelihood (ML) trees using IQ-TREE (39) and the best-fit tree model was determined with ModelFinder (40). It should be noted that the GPCRs of R. prolixus and C. lectularius were uncompleted, which were composed of opsins, biogenic amine receptors, and neuropeptide GPCRs. For the Drosophila sequences, the name of the GPCRs were used, while for A. pisum and R. prolixus, the protein names were same as in previous work (22, 27), and for C. lectularius, the accession numbers in NCBI were used. The GPCRs of A. lucorum identified in this work were numbered according to their families.

2.4 Expression Analysis

To study the expression profiles of the GPCRs, a total of 39 transcriptome data of A. lucorum were downloaded from the genome project of A. lucorum (Accession: PRJNA526332) in the NCBI Sequence Read Archive (SRA) database (https://www.ncbi.nlm.nih.gov/sra/) (30, 41), which included egg and different tissues (leg, head, body, mouthpart, wing, and gut) of nymphs and adults. Each tissue contained three biological replicates. In detail, we downloaded the SRA data first and then we used an SRA-Toolkit to split the paired-end reads. Clean reads were obtained from the raw data using Trimmomatic (42) to remove reads with quality scores lower than 10 and adapter sequences. To analyze gene expression profiles, clean reads of each sample were mapped to A. lucorum gene sets using hisat2 (43), and then the TPM value (44) of each putative GPCR gene was calculated with featureCounts (45). These TPM expression values were scaled and served to generate a cross-sample normalized trimmed mean of the M-values (TMM) gene expression matrix (46). Finally, the heatmap was drawn in ITOL (https://itol.embl.de) (47) using the normalized matrix. The value used for each sample was the mean of three independent biological replicates.

2.5 Classification of Gene Duplication Types

MCScanX (48) was used to classified the duplication types of different duplicate GPCR genes. First, the homology with different genes in the genome of A. lucorum was determined by a whole-genome BLASTP analysis with a max target seqs of 5 and a cut-off e-value of 1e−5. Then, the homology with different genes and the chromosomal location were combined and all genes were classified into various types, including the segmental duplication, and tandem duplication. Finally, the duplication types of GPCRs were extracted based on these results. All visualized works were accomplished in TBtools (49).

3 Results

A total of 133 putative GPCRs were identified in A. lucorum. These GPCRs were classified into four families and included 98 family-A members, 21 family-B members, 10 family-C members, and four family-F members ( – and ). Based on the protein sequences, phylogenetic trees were reconstructed for each GPCR family/subfamily of A. lucorum, R. prolixus, A. pisum, and D. melanogaster. All GPCRs were quantified with the TPM values obtained from transcriptomic data. The expression profile of each GPCR across developmental stages was also present in the phylogenetic trees of each GPCR family/subfamily ( – , and ). The chromosomal locations of all GPCRs are shown in .
Table 1

Opsin and biogenic amine receptors in A. lucorum.

No.Accession numberPutative Endogenous ligandOrthologue of D. melanogaster Orthologue of A. pisum Predicted TMHsAnnotation by InterProScan Homology search in Swissport (blastp)
E-valueDescriptionSpecies
Opsin
A1KAF6206346.1OrphanRh6ACYPI009332Complete(IPR000276) GPCR, rhodopsin-like; (IPR001760) Opsin; (IPR001391) Opsin lateral eye type0Opsin-1 Schistocerca gregaria
A2KAF6206345.1OrphanRh6ACYPI009332Complete(IPR000276) GPCR, rhodopsin-like; (IPR001760) Opsin; (IPR001391) Opsin lateral eye type;0Opsin Sphodromantis sp.
A3KAF6207755.1OrphanRh3, Rh4ACYPI002544, ACYPI004442Complete(IPR000276) GPCR, rhodopsin-like; (IPR001760) Opsin; (IPR000856) Opsin RH3/RH41.00E−165UV-sensitive opsin Apis mellifera
A4KAF6207831.1OrphanRh7ACYPI001006, ACYPI005074Complete(IPR000276) GPCR, rhodopsin-like1.60E−77Opsin-2 Manduca sexta
A5KAF6207832.1OrphanRh7ACYPI001006, ACYPI005074Complete(IPR000276) GPCR, rhodopsin-like; (IPR001760) Opsin1.88E−73Opsin Rh3 D. melanogaster
A6KAF6205310.1Orphannana5(IPR000276) GPCR, rhodopsin-like4.83E−48Pinopsin Columba livia
A7KAF6208054.1Orphannana6(IPR000276) GPCR, rhodopsin-like; (IPR001760) Opsin2.01E−70GQ-rhodopsin Mizuhopecten yessoensis
Biogenic amine receptors
A8KAF6211999.1AcetylcholinemAChR-AACYPI005180Complete(IPR000276) GPCR, rhodopsin-like; (IPR000995) Muscarinic acetylcholine receptor family4.10E−160mAChR DM1 D. melanogaster
A9KAF6206451.1AcetylcholinemAChR-BACYPI0012556(IPR000276) GPCR, rhodopsin-like2.84E−57mAChR gar-2 Caenorhabditis elegans
A10KAF6206450.1AcetylcholinemAChR-BACYPI0012552(IPR000276) GPCR, rhodopsin-like1.28E−27mAChR gar-2 Caenorhabditis elegans
A11KAF6202800.1AcetylcholinemAChR-CnaComplete(IPR000276) GPCR, rhodopsin-like1.07E−23D(1B) DopR Rattus norvegicus
A12KAF6209068.1DopamineDop1R1ACYPI006935Complete(IPR000276) GPCR, rhodopsin-like; (IPR000929) Dopamine receptor family0Dop1R1 D. melanogaster
A13KAF6217029.1DopamineDop1R2ACYPI0092416(IPR000276) GPCR, rhodopsin-like4.60E−164Dop1R2 D. melanogaster
A14KAF6204820.1DopamineDop2RACYPI0074152(IPR000276) GPCR, rhodopsin-like; (IPR001671) Melanocortin/ACTH receptor2.03E−61Dop2R D. melanogaster
A15KAF6204823.1DopamineDop2RACYPI0074152(IPR000276) GPCR, rhodopsin-like2.20E−103Dop2R D. melanogaster
A16KAF6201362.1Dopamine, EcdysteroidsDopEcRACYPI005538Complete(IPR000276) GPCR, rhodopsin-like5.89E−24G-protein coupled receptor 52 Mus musculus
A17KAF6209377.1OctopamineOambACYPI0055785(IPR000276) GPCR, rhodopsin-like5.20E−109Oamb D. melanogaster
A18KAF6209376.1OctopamineOambACYPI0055783(IPR000276) GPCR, rhodopsin-like2.03E−29Oamb D. melanogaster
A19KAF6209232.1OctopamineOctbeta1RACYPI0073865(IPR000276) GPCR, rhodopsin-like2.90E−104Octbeta1R D. melanogaster
A20KAF6209392.1OctopamineOctbeta2RACYPI0046583(IPR000276) GPCR, rhodopsin-like4.18E−91Octbeta2R D. melanogaster
A21KAF6209465.1OctopamineOctalpha2RACYPI0101554(IPR000276) GPCR, rhodopsin-like4.37E−54Alpha-2C adrenoreceptor Danio rerio
A22KAF6209394.1OctopamineOctbeta2RACYPI0046584(IPR000276) GPCR, rhodopsin-like6.10E−76Octbeta2R D. melanogaster
A23KAF6209913.1OctopamineOctbeta3RACYPI0100255(IPR000276) GPCR, rhodopsin-like6.00E−100Octbeta3R D. melanogaster
A24KAF6209915.1OctopamineOctbeta3RACYPI0100253(IPR000276) GPCR, rhodopsin-like2.09E−44Octbeta3R D. melanogaster
A25KAF6199206.1Octopamine/TyramineOct-TyrRACYPI007379Complete(IPR000276) GPCR, rhodopsin-like; (IPR002002) Octopamine receptor0OctR Heliothis virescens
A26KAF6209650.1Serotonin5-HT1A, 5-HT1BXP_0019497254(IPR000276) GPCR, rhodopsin-like4.80E−1075-HT receptor Bombyx mori
A27KAF6210093.1Serotonin5-HT1A, 5-HT1BXP_0019497254(IPR000276) GPCR, rhodopsin-like8.60E−1035-HT receptor Heliothis virescens
A28KAF6208815.1Serotonin5-HT1A, 5-HT1BXP_0019497252(IPR000276) GPCR, rhodopsin-like1.71E−515-HT receptor Heliothis virescens
A29KAF6209646.1Serotonin5-HT1A, 5-HT1BXP_0019497252(IPR000276) GPCR, rhodopsin-like1.06E−415-HT receptor Bombyx mori
A30KAF6204179.1Serotonin5-HT2AACYPI0089692(IPR000276) GPCR, rhodopsin-like5.75E−295-HT-2C Canis lupus familiaris
A31KAF6204181.1Serotonin5-HT2AACYPI0089692(IPR000276) GPCR, rhodopsin-like4.89E−255-HT-2B Mus musculus
A32KAF6204453.1Serotonin5-HT2BACYPI507073(IPR000276) GPCR, rhodopsin-like2.15E−195-HT-2B Homo sapiens
A33KAF6204455.1Serotonin5-HT2BACYPI507073(IPR000276) GPCR, rhodopsin-like9.03E−245-HT-2B Mus musculus
A34KAF6204452.1OrphannanaComplete(IPR000276) GPCR, rhodopsin-like3.57E−145-HT-2A Macaca mulatta
A35KAF6205718.1Serotonin5-HT7XP_0032418355(IPR000276) GPCR, rhodopsin-like8.50E−1125-HT receptor D. melanogaster
A36KAF6205714.1Serotonin5-HT7XP_0032418352(IPR000276) GPCR, rhodopsin-like5.89E−535-HT receptor D. melanogaster
A37KAF6200010.1OrphanCG13579ACYPI0087775(IPR000276) GPCR, rhodopsin-like3.35E−12Trace amine-associated receptor 1 Mus musculus

na, not annotated or not applicable, Complete means there is a complete 7TM structure.

Table 4

The number of A. lucorum GPCRs of each family in comparison with the other four insects.

A. lucorum A. pisum D. melanogaster T. castaneum B. mori
Family-A9862736869
Opsin75726
Biogenic amine receptors3018212016
Neuropeptide and protein hormone receptors (contained the purine GPCRs)5839454647
Family-B2110262112
Family-C10710109
Family-F43543
Total1338211310393
Figure 1

Phylogenetic tree reconstruction of opsin GPCRs from D. melanogaster (green square), A. pisum (blue star), R. prolixus (orange right triangle), C. lectularius (purple left triangle), and A. lucorum (red circle) inferred from maximum likelihood (ML) analysis. Numbers at nodes on the tree were the bootstrap values (below 50 are not shown). The tree was rooted by the D. melanogaster biogenic amine receptor 5-HT1A. Expression profiles of A. lucorum GPCR genes from different tissues are shown in the corresponding branch side. The transcription level of each gene is represented by a square with a color that codes for the values of Lg (TPM+1). N, nymph; A, adult.

Figure 3

Phylogenetic tree reconstruction of Family-B GPCRs from D. melanogaster (green square), A. pisum (blue star), R. prolixus (orange right triangle), C. lectularius (purple left triangle), and A. lucorum (red circle) inferred from maximum likelihood (ML) analysis. Numbers at nodes on the tree were the bootstrap values (below 50 are not shown). The tree was rooted by the D. melanogaster opsin GPCR Rh6. Expression profiles of A. lucorum GPCR genes from different tissues are shown in the corresponding branch side. The transcription level of each gene is represented by a square with a color that codes for the values of Lg (TPM+1). N, nymph; A, adult.

Figure 4

Chromosomal locations and tandem duplicated gene pairs of the 133 putative GPCR genes. Each was mapped to the chromosome based on its physical location. The chromosome number (LG1–LG16, tig00472765) is indicated at the left. The tandem duplicated genes were outlined with red color.

Opsin and biogenic amine receptors in A. lucorum. na, not annotated or not applicable, Complete means there is a complete 7TM structure. Neuropeptide and protein hormone receptors and purine GPCRs in A. lucorum. na, not annotated or applicable, Complete means there is a complete 7TM structure. Family-B GPCRs of A. lucorum. na, not annotated or applicable, Complete means there is a complete 7TM structure. The number of A. lucorum GPCRs of each family in comparison with the other four insects. Phylogenetic tree reconstruction of opsin GPCRs from D. melanogaster (green square), A. pisum (blue star), R. prolixus (orange right triangle), C. lectularius (purple left triangle), and A. lucorum (red circle) inferred from maximum likelihood (ML) analysis. Numbers at nodes on the tree were the bootstrap values (below 50 are not shown). The tree was rooted by the D. melanogaster biogenic amine receptor 5-HT1A. Expression profiles of A. lucorum GPCR genes from different tissues are shown in the corresponding branch side. The transcription level of each gene is represented by a square with a color that codes for the values of Lg (TPM+1). N, nymph; A, adult. Phylogenetic tree reconstruction of neuropeptide and protein hormone receptors from D. melanogaster (green square), A. pisum (blue star), R. prolixus (orange right triangle), C. lectularius (purple left triangle), and A. lucorum (red circle) inferred from maximum likelihood (ML) analysis. Numbers at nodes on the tree were the bootstrap values (below 50 are not shown). The tree was rooted by the D. melanogaster opsin GPCR Rh6. Expression profiles of A. lucorum GPCR genes from different tissues are shown in the corresponding branch side. The transcription level of each gene is represented by a square with a color that codes for the values of Lg (TPM+1). N, nymph; A, adult. Phylogenetic tree reconstruction of Family-B GPCRs from D. melanogaster (green square), A. pisum (blue star), R. prolixus (orange right triangle), C. lectularius (purple left triangle), and A. lucorum (red circle) inferred from maximum likelihood (ML) analysis. Numbers at nodes on the tree were the bootstrap values (below 50 are not shown). The tree was rooted by the D. melanogaster opsin GPCR Rh6. Expression profiles of A. lucorum GPCR genes from different tissues are shown in the corresponding branch side. The transcription level of each gene is represented by a square with a color that codes for the values of Lg (TPM+1). N, nymph; A, adult. Chromosomal locations and tandem duplicated gene pairs of the 133 putative GPCR genes. Each was mapped to the chromosome based on its physical location. The chromosome number (LG1–LG16, tig00472765) is indicated at the left. The tandem duplicated genes were outlined with red color.

3.1 Family-A GPCRs

Insect family-A GPCRs include opsins, biogenic amine receptors, neuropeptide and protein hormone receptors, and purine GPCRs (17, 18, 22). In this study, 98 family-A GPCRs were identified in the genome of A. lucorum, and these receptors were composed of seven opsins, 30 biogenic amine receptors, 58 neuropeptide and protein hormone receptors, and three purine GPCRs ( , ).
Table 2

Neuropeptide and protein hormone receptors and purine GPCRs in A. lucorum.

No.Accession numberPutative endogenous ligandOrthologue of D. melanogaster Orthologue of A. pisum Predicted TMHsAnnotation by InterProScan Homology search in Swissport (blastp)
E-valueDescriptionSpecies
Neuropeptide and protein hormone receptors
A38KAF6210560.1AKH/corazonin-related peptidenana4(IPR000276) GPCR, rhodopsin-like; (IPR000405) Galanin receptor family4.74E−31GnRHR II Clarias gariepinus
A39KAF6216586.1Adipokinetic hormone  AkhR ACYPI002471Complete(IPR027417) P-loop containing nucleoside triphosphate hydrolase;5.09E−49GnRHR II Clarias gariepinus
A40KAF6198962.1Allatostatins-A AstA-R1, AstA-R2ACYPI0086234None predicted5.02E−41AstA-R Bombyx mori
A41KAF6198963.1Allatostatins-A AstA-R1, AstA-R2ACYPI0086234(IPR000276) GPCR, rhodopsin-like; (IPR005390) Neuromedin U receptor2.45E−79AstA-R Bombyx mori
A42KAF6206708.1Allatostatins-C AstC-R1, AstC-R2 ACYPI002528Complete(IPR000276) GPCR, rhodopsin-like; (IPR002131) Glycoprotein hormone receptor family; (IPR036055) LDL receptor-like superfamily; (IPR032675) Leucine-rich repeat domain superfamily1.27E−81Somatostatin receptor type 4 Rattus norvegicus
A43KAF6202370.1CAPA CapaR ACYPI007245Complete(IPR000276) GPCR, rhodopsin-like1.17E−85 CapaR  D. melanogaster
A44KAF6216164.1CAPA CapaR ACYPI007245Complete(IPR000276) GPCR, rhodopsin-like; (IPR019427) 7TM GPCR, serpentine receptor class w (Srw)6.22E−102Cap2bR D. melanogaster
A45KAF6210431.1CCHamide CCHa2-R ACYPI004781Complete(IPR000276) GPCR, rhodopsin-like7.23E−131CCHa1-R D. melanogaster
A46KAF6201633.1CNMamide CNMaR ACYPI0080272(IPR000276) GPCR, rhodopsin-like; (IPR000611) Neuropeptide Y receptor familyNANANA
A47KAF6201631.1CNMamide CNMaRACYPI008027Complete(IPR000276) GPCR, rhodopsin-like9.79E−14 FMRFaR  D. melanogaster
A48KAF6205178.1CorazoninCrzRACYPI002471Complete(IPR000276) GPCR, rhodopsin-like3.56E−59GnRHR Octopus vulgaris
A49KAF6198396.1Crustacean cardioactive peptideCCAP-RACYPI062442Complete(IPR000276) GPCR, rhodopsin-like; (IPR001634) Adenosine receptor3.69E−158CCAPR D. melanogaster
A50KAF6211684.1Crustacean cardioactive peptideCCAP-RACYPI0624426(IPR000276) GPCR, rhodopsin-like1.9E−156CCAPR D. melanogaster
A51KAF6213085.1Crustacean cardioactive peptideCCAP-RACYPI062442Complete(IPR008429) Cleft lip and palate transmembrane 11.73E−137CCAPR D. melanogaster
A52KAF6209917.1ETH ETHR BK008727Complete(IPR000276) GPCR, rhodopsin-like; (IPR000611) Neuropeptide Y receptor family2.19E−44TRH-R Gallus gallus
A53KAF6209916.1ETH ETHR BK0087273(IPR000276) GPCR, rhodopsin-like; (IPR000611) Neuropeptide Y receptor family; (IPR036241) NSFL1 cofactor p47, SEP domain superfamily4.21E−16TRH-R Bos taurus
A54KAF6209916.1FMRFamides FMRFaR ACYPI006053Complete(IPR000276) GPCR, rhodopsin-like; (IPR000611) Neuropeptide Y receptor family1.75E−130 FMRFaR  D. melanogaster
A55KAF6205268.1FMRFamides FMRFaR ACYPI0060535(IPR027417) P-loop containing nucleoside triphosphate hydrolase; (IPR042035) DEAH helicase, winged-helix domain; (IPR012340) Nucleic acid-binding, OB-fold5.96E−45 FMRFaR  D. melanogaster
A56KAF6200840.1FaRPnana6(IPR000276) GPCR, rhodopsin-like; (IPR019427) 7TM GPCR, serpentine receptor class w (Srw)4.53E−40 FMRFaR  D. melanogaster
A57KAF6215133.1FaRPnana5(IPR000276) GPCR, rhodopsin-like5.5E−45 FMRFaR  D. melanogaster
A58KAF6202705.1FaRPnanaComplete(IPR000276) GPCR, rhodopsin-like; (IPR032675) Leucine-rich repeat domain superfamily; (IPR008112) Relaxin receptor2.25E−31 FMRFaR  D. melanogaster
A59KAF6199396.1FaRPnana5(IPR000276) GPCR, rhodopsin-like9.69E−35 FMRFaR  D. melanogaster
A60KAF6212897.1FaRPnanaComplete(IPR000276) GPCR, rhodopsin-like1.13E−33 FMRFaR  D. melanogaster
A61KAF6203114.1FaRPnana6(IPR000276) GPCR, rhodopsin-like3.58E−26 FMRFaR  D. melanogaster
A62KAF6204912.1FaRPnanaComplete(IPR000276) GPCR, rhodopsin-like2.55E−29 FMRFaR  D. melanogaster
A63KAF6216165.1GPA2/GPB5 Lgr1 ACYPI004597Complete(IPR000276) GPCR, rhodopsin-like2.6E−103LH/CG-R Mus musculus
A64KAF6215698.1Bursicon rk ACYPI000221Complete(IPR008365) Prostanoid receptor7.46E−112LGR5 Rattus norvegicus
A65KAF6211436.1Insulin-like peptide 7 and 8Lgr3ACYPI0082915(IPR000276) GPCR, rhodopsin-like2.26E−116Relaxin receptor 2 Mus musculus
A66KAF6202512.1Insulin-like peptide 7 and 8Lgr4naComplete(IPR000276) GPCR, rhodopsin-like0GPCR GRL101 Lymnaea stagnalis
A67KAF6211616.1Leucokinin Lkr ACYPI010083, ACYPI0007625(IPR000276) GPCR, rhodopsin-like; (IPR001681) Neurokinin receptor2.25E−40 TkR99D  D. melanogaster
A68KAF6205586.1Leucokinin Lkr ACYPI010083, ACYPI0007625(IPR000276) GPCR, rhodopsin-like5.04E−34QRFP-like peptide receptor Branchiostoma floridae
A69KAF6212176.1Neuropeptide F NPFR ACYPI0076644(IPR000276) GPCR, rhodopsin-like1.66E−25NPYR type 2 Cavia porcellus
A70KAF6209720.1Neuropeptide F NPFR ACYPI0076644(IPR000276) GPCR, rhodopsin-like; (IPR000611) Neuropeptide Y receptor family4.36E−27DmNPFR1 D. melanogaster
A71KAF6212186.1Neuropeptide F NPFR ACYPI0076642(IPR000276) GPCR, rhodopsin-like; (IPR000611) Neuropeptide Y receptor family8.96E−16 NPFR  D. melanogaster
A72KAF6213141.1Pyrokinin-1 PK1-R ACYPI000735, ACYPI005805Complete(IPR000276) GPCR, rhodopsin-like; (IPR002120) Thyrotropin-releasing hormone receptor4.33E−117 PK1-R  D. melanogaster
A73KAF6213123.1Pyrokinin-1 PK1-R ACYPI000735, ACYPI0058053(IPR000276) GPCR, rhodopsin-like; (IPR000611) Neuropeptide Y receptor family2.91E−42 PK1-R  D. melanogaster
A74KAF6213143.1Pyrokinin-2 PK2-R2naComplete(IPR000276) GPCR, rhodopsin-like; (IPR019427) 7TM GPCR, serpentine receptor class w (Srw)1.14E−101 PK1-R  D. melanogaster
A75KAF6210859.1Proctolin Proc-R ACYPI30716Complete(IPR027417) P-loop containing nucleoside triphosphate hydrolase3.98E−25 FMRFaR  D. melanogaster
A76KAF6207650.1RYamide RYa-R ACYPI002886Complete(IPR000832) GPCR, family 2, secretin-like1.52E−122 RYa-R  D. melanogaster
A77KAF6211457.1SIFamide SIFaR ACYPI008341, BK008728Complete(IPR000276) GPCR, rhodopsin-like; (IPR002131) Glycoprotein hormone receptor family; (IPR032675) Leucine-rich repeat domain superfamily8.27E−107 SIFaR  D. melanogaster
A78KAF6213872.1SIFamide SIFaR ACYPI008341, BK0087283(IPR008429) Cleft lip and palate transmembrane 1; (IPR030434) Cleft lip and palate transmembrane protein 1-like protein4.69E−101 SIFaR  D. melanogaster
A79KAF6209074.1short neuropeptide F sNPF-R ACYPI005474Complete(IPR000276) GPCR, rhodopsin-like; (IPR005390) Neuromedin U receptor1.15E−62NPY2-R Homo sapiens
A80KAF6215350.1Allatostatin-CAstC-R ACYPI003290Complete(IPR000276) GPCR, rhodopsin-like; (IPR032675) Leucine-rich repeat domain superfamily; (IPR002131) Glycoprotein hormone receptor family3.65E−147 SPR  D. melanogaster
A81KAF6198907.1SulfakininCCKLR-17D1, CCKLR-17D3na3(IPR000276) GPCR, rhodopsin-like3.51E−29CCK-XLR Xenopus laevis
A82KAF6213654.1Tachykinin TkR86C ACYPI0011033(IPR000276) GPCR, rhodopsin-like4.57E−50 TkR86C  D. melanogaster
A83KAF6210188.1Tachykinin TkR99D ACYPI0029174(IPR000276) GPCR, rhodopsin-like2.1E−93 TkR99D  D. melanogaster
A84KAF6211953.1OrphanCG4313ACYPI0052345(IPR000276) GPCR, rhodopsin-like2.38E−55moody D. melanogaster
A85KAF6198896.1OrphanCG32547ACYPI0006713(IPR000276) GPCR, rhodopsin-like; (IPR001817) Vasopressin receptorNANA NA
A86KAF6211940.1Orphan moodyACYPI0062936(IPR000276) GPCR, rhodopsin-like5.19E−143moody D. melanogaster
A87KAF6198937.1OrphannaACYPI40167Complete(IPR000276) GPCR, rhodopsin-like1.52E−14NPY2-R Mus musculus
A88KAF6216499.1OrphannaACYPI38121Complete(IPR000276) GPCR, rhodopsin-like1.14E−10Melatonin receptor type 1A Gallus gallus
A89KAF6200805.1Orphannana6(IPR000276) GPCR, rhodopsin-like1.02E−12Melatonin receptor type 1A Gallus gallus
A90KAF6202756.1Orphannana2(IPR000276) GPCR, rhodopsin-like1.17E−13Somatostatin receptor type 5 Homo sapiens
A91KAF6205087.1OrphannanaComplete(IPR000276) GPCR, rhodopsin-like5.87E−12Prostaglandin E2 receptor EP4 subtype Bos taurus
A92KAF6208108.1OrphannanaComplete(IPR000276) GPCR, rhodopsin-like; (IPR001556) Bombesin receptor-like1.75E−41 RYa-R  D. melanogaster
A93KAF6215130.1AllatotropinnanaComplete(IPR000276) GPCR, rhodopsin-like; (IPR005390) Neuromedin U receptor8.144E−89Orexin receptor type 2 Mus musculus
A94KAF6215334.1Orphannana5(IPR000276) GPCR, rhodopsin-like1.53E−53TRHR Gallus gallus
A95KAF6215529.1OrphannanaComplete(IPR000276) GPCR, rhodopsin-like2.03E−13Cadherin EGF LAG seven-pass G-type receptor 1 Mus musculus
Purine receptor
A96KAF6207107.1Adenosine AdoR ACYPI247133(IPR000276) GPCR, rhodopsin-like; (IPR001817) Vasopressin receptor1.34E−29AdoR A2a Equus caballus
A97KAF6207108.1Adenosine AdoR ACYPI247136None predicted1.46E−57AdoR A2a Equus caballus
A98KAF6207109.1Adenosine AdoR ACYPI247133(IPR000276) GPCR, rhodopsin-like4.05E−21AdoR A2a Gallus gallus

na, not annotated or applicable, Complete means there is a complete 7TM structure.

3.1.1 Opsins

Color vision in insects is based on the expression of different opsins in photoreceptive cells. Opsins are members of the family-A GPCRs and are coupled to light-sensitive chromophores in animal photoreceptors (50). Three groups of opsins have been reported in D. melanogaster: one related to long-wavelength (LW) vision (including Rh1, Rh2, and Rh6), another group related to short-wavelength (SW) vision (Rh3, Rh4, and Rh5), and a third group including only Rh7 (34, 51). A fourth group of invertebrate opsins, named pteropsins, has been found in Apis mellifera (50) and R. prolixus (27), which was missing from the genome of D. melanogaster and A. pisum. In this study, seven putative opsins were identified in A. lucorum. The phylogenetic analysis suggested that A1 and A2 are related to the LW opsin, A3 is related to the SW opsin, A4 and A5 belong to a third group, and A6 and A7 are close to pteropsins ( ). Four groups of invertebrate opsins were also identified in the A. lucorum genome. According to the expression profile, opsins were expressed at the highest levels in the head and mouthpart tissue, which is corresponding to their biological function. Among the four types of opsins detected in A. lucorum, the A1 showed the highest expression in the head tissue of adults with a transcripts per kilobase of exon model per million mapped reads (TPM) of 28,787 ( ) followed by A3 with a TPM of 770.
Figure 5

Expression patterns of A1 and B10 in different tissues by transcriptome analysis. The vertical bars indicate standard errors of the mean (n = 3).

Expression patterns of A1 and B10 in different tissues by transcriptome analysis. The vertical bars indicate standard errors of the mean (n = 3).

3.1.2 Biogenic Amine Receptors

The known biogenic amines that act as ligands for GPCRs in insects contain acetylcholine, dopamine, serotonin, octopamine, and tyramine (27). Here, we identified 30 biogenic amine receptors in A. lucorum. Based on phylogenetic analysis and sequence similarity, A8–11 are receptors for acetylcholine; A12–16 are dopamine-like receptors; A17–24 are orthologs of the octopamine receptors; A26–33 and A35–36 were identified as the serotonin-like receptors; and A25 is the GPCR that could be stimulated by two structurally related endogenous ligands, octopamine and tyramine ( ). Additionally, A34 and A37 are orphan receptors of this subfamily in A. lucorum, and are orthologs of RPRC011175 and CG13579, respectively. However, two tyramine receptors (TyrR and TyrRII) are likely to be missing in all three heteropteran insects. A25 is the only tyramine receptor in A. lucorum. Compared with opsins, the expression level of biogenic amine receptors is much lower. A36 showed the highest expression in gut tissues of adults with a TPM of 11. In FlyBase (33), we found 5-HT7, the ortholog gene of A36 in D. melanogaster, was also expressed in the digestive system.

3.1.3 Neuropeptide and Protein Hormone Receptors

The rhodopsin-like neuropeptide and protein hormone receptors are the largest subfamily in the rhodopsin-like family (17, 22, 52). In this subfamily, 58 putative A. lucorum sequences were identified. Like other insects, A. lucorum rhodopsin-like neuropeptide and protein hormone receptors can be classified into 25 groups based on their ligands; i.e., adipokinetic hormone receptors (AKHR), AKH/corazonin-related peptide (ACP) receptors, allatotropin receptor (AT-R), allatostatin-A receptors (AstA-R), allatostatin-B receptors (AstB-R), allatostatin-C receptors (AstC-R), bursion receptor, corazonin receptors (CrzR), neuropeptide F receptors (NPFR), short neuropeptide F receptors (sNPFR), proctolin receptors (Proc-R), pyrokinin receptors (PK-R), leukokinin receptors (Lkr), cholecystokinin-like receptors (CCKLR), tachykinin receptors (TkR), CAPA receptors (CapaR), crustacean cardioactive peptide receptors (CCAP-R), CNMamide receptors (CNMaR), CCHamide receptors (CCHa-R), ecdysis triggering hormone receptors (ETHR), FMRFamide receptors (FMRFaR), GPA2/GPB5 receptors, SIFamide receptors (SIFaR), relaxin receptors, RYamide receptors (RYa-R), and several orphan GPCRs ( ). Most of these neuropeptide receptors displayed one-to-one orthologous relationships between A. lucorum, R. prolixus, C. lectularius, A. pisum, and D. melanogaster, and all subtypes of leucine-rich repeat-containing GPCRs (LGR) were observed in A. lucorum ( and ). However, several duplications and losses of neuropeptide receptor genes were also observed in A. lucorum. It is worth mentioning that as many as nine A. lucorum GPCRs (A54–62) displayed strong evidence of an evolutionary kinship with the FMRFaRs of R. prolixus, C. lectularius, A. pisum, and D. melanogaster, indicating that a large clade may have duplicated from FMRFaRs in A. lucorum ( and ). Duplications of eight neuropeptide receptor genes (CapaR, CCAP-R, CNMaR, ETHR, Lkr, NPFR, PK1-R, and SIFaR) were identified in A. lucorum, and duplications of Lkr, PK1-R, ETHR, CCAP-R, NPFR, and SIFaR were also observed in R. prolixus or C. lectularius. The trapped in endoderm 1 (Tre1) receptors, trissin receptors (TrissinR), myosuppressin receptors (MsR), and other six orphan receptors were not found in the genome of A. lucorum. Instead, we found six orphan receptors (A89–92 and A94–95) that have not been reported in A. lucorum. The expression levels of neuropeptide and protein hormone receptors were higher than in biogenic amine receptors. The expression of A65 (LGR) in the bodies of nymphs was the highest in this subfamily with a TPM of 41. Moreover, A50 (CCAP-R) and A86 (moody) showed high expression levels in multiple tissues (TPM >10 at least in five tissues).
Figure 2

Phylogenetic tree reconstruction of neuropeptide and protein hormone receptors from D. melanogaster (green square), A. pisum (blue star), R. prolixus (orange right triangle), C. lectularius (purple left triangle), and A. lucorum (red circle) inferred from maximum likelihood (ML) analysis. Numbers at nodes on the tree were the bootstrap values (below 50 are not shown). The tree was rooted by the D. melanogaster opsin GPCR Rh6. Expression profiles of A. lucorum GPCR genes from different tissues are shown in the corresponding branch side. The transcription level of each gene is represented by a square with a color that codes for the values of Lg (TPM+1). N, nymph; A, adult.

Figure 6

Phylogenetic tree and domains of different LGRs. Numbers at nodes on the tree were the bootstrap values (below 50 are not shown). The tree was rooted by the D. melanogaster opsin GPCR Rh6. Different LGR types were painted with different colors. Predicted domains of each sequence are shown in the corresponding branch side. Dc, D. citri; Al, A. lucorum; Ap, A. pisum; Dm, D. melanogaster; Tc, Tribolium castaneum; Bm, B. mori.

Figure 7

Phylogenetic tree reconstruction of FMRFaRs and MsRs from D. melanogaster (green square), A. pisum (blue star), R. prolixus (orange right triangle), C. lectularius (purple left triangle), A. lucorum (red circle), and other two model insects inferred from maximum likelihood (ML) analysis. Numbers at nodes on the tree were the bootstrap values. The tree was rooted by the D. melanogaster Proc-R.

Phylogenetic tree and domains of different LGRs. Numbers at nodes on the tree were the bootstrap values (below 50 are not shown). The tree was rooted by the D. melanogaster opsin GPCR Rh6. Different LGR types were painted with different colors. Predicted domains of each sequence are shown in the corresponding branch side. Dc, D. citri; Al, A. lucorum; Ap, A. pisum; Dm, D. melanogaster; Tc, Tribolium castaneum; Bm, B. mori. Phylogenetic tree reconstruction of FMRFaRs and MsRs from D. melanogaster (green square), A. pisum (blue star), R. prolixus (orange right triangle), C. lectularius (purple left triangle), A. lucorum (red circle), and other two model insects inferred from maximum likelihood (ML) analysis. Numbers at nodes on the tree were the bootstrap values. The tree was rooted by the D. melanogaster Proc-R.

3.1.4 Purine GPCRs

Only one receptor in this subfamily, adenosine receptor (AdoR), has been previously classified in this subfamily (17, 53). Here, three putative A. lucorum GPCRs (A96–98) were identified as AdoR, whereas there was only one member in D. melanogaster and A. pisum ( ). Purine GPCRs are activated by the binding of purine nucleotides or their derivatives (principally adenosine or ADP/ATP) (54, 55). Duplication of AdoR suggests that purinergic neural transmission may play a more important role in A. lucorum.

3.2 Family-B GPCRs

Family-B GPCRs play vital roles in many biological processes, including growth, development, and reproduction. They are characterized by long N-terminal domains, and they form a small group of receptors that are structurally and functionally divergent from other groups of GPCRs (56). Within this family, family-B GPCRs can be further subdivided into three subfamilies: B1–B3, which are greatly divergent in both function and structure. In total, 21 family-B GPCRs were identified from the genome of A. lucorum in this study, which consisted of nine B1 subfamily members, three B2 subfamily members, and nine B3 subfamily members ( and ).
Table 3

Family-B GPCRs of A. lucorum.

No.Accession numberPutative endogenous ligandOrthologue of D. melanogaster Orthologue of A. pisum Predicted TMHsAnnotation by InterProScan Homology search in Swissport (blastp)
E-valueDescriptionSpecies
SUBFAMILY B1
B1KAF6209407.1Diuretic hormone 31Dh31-R ACYPI007222, ACYPI001361Complete(IPR000832) GPCR, family 2, secretin-like; (IPR036445) GPCR family 2, extracellular hormone receptor domain superfamily4.8E−74Calcitonin gene-related peptide type 1 receptor Danio rerio
B2KAF6209957.1Diuretic hormone 44Dh44-R1, Dh44-R2ACYPI549245(IPR000832) GPCR, family 2, secretin-like; (IPR036445) GPCR family 2, extracellular hormone receptor domain superfamily; (IPR002001) GPCR, family 2, diuretic hormone receptor5.79E−68DH-R Acheta domesticus
B3KAF6209955.1Diuretic hormone 44Dh44-R1, Dh44-R2ACYPI549242(IPR000832) GPCR, family 2, secretin-like5.51E−46DH-R Acheta domesticus
B4KAF6198455.1Pigment-dispersing factorPdfr ACYPI464312(IPR000832) GPCR, family 2, secretin-like6.16E−49PDF receptor D. melanogaster
B5KAF6198460.1Pigment-dispersing factorPdfr ACYPI464312(IPR000832) GPCR, family 2, secretin-like6.82E−26PDF receptor D. melanogaster
B6KAF6210210.1Diuretic hormone 31hecACYPI0095694(IPR000832) GPCR, family 2, secretin-like; (IPR036445) GPCR family 2, extracellular hormone receptor domain superfamily1.12E−52Calcitonin receptor Oryctolagus cuniculus
B7KAF6210211.1Diuretic hormone 31hecACYPI0095693(IPR000832) GPCR, family 2, secretin-like6.2E−21Corticotropin-releasing factor receptor 1 Mus musculus
B8KAF6210212.1Diuretic hormone 31Dh31-R ACYPI007222, ACYPI001361Complete(IPR000832) GPCR, family 2, secretin-like; (IPR036445) GPCR family 2, extracellular hormone receptor domain superfamily4.18E−84Calcitonin gene-related peptide type 1 receptor Danio rerio
B9KAF6204739.1Parathyroid hormonenana5(IPR000832) GPCR, family 2, secretin-like; (IPR036445) GPCR family 2, extracellular hormone receptor domain superfamily6.86E−75PTH2 receptor Homo sapiens
SUBFAMILY B2
B10KAF6216792.1α-latrotoxinCirl ACYPI005705Complete(IPR000832) GPCR, family 2, secretin-like; (IPR036445) GPCR family 2, extracellular hormone receptor domain superfamily; (IPR043159) D-galactoside/L-rhamnose binding SUEL lectin domain superfamily; (IPR031234) Latrophilin-10Latrophilin Drosophila ananassae
B11KAF6198557.1Orphanstan ACYPI001529Complete(IPR000832) GPCR, family 2, secretin-like1.59E−80stan D. melanogaster
B12KAF6198871.1OrphanCG15744naComplete(IPR032675) Leucine-rich repeat domain superfamily; (IPR000832) GPCR, family 2, secretin-like; (IPR013783) Immunoglobulin-like fold; (IPR036445) GPCR family 2, extracellular hormone receptor domain superfamily; (IPR036179) Immunoglobulin-like domain superfamily2.6E−111Adhesion GPCR A3 Danio rerio
SUBFAMILY B3
B13KAF6217262.1Orphanmthl5 ACYPI003439CompleteNone predicted1.2E−128Mth-like 5 D. melanogaster
B14KAF6207251.1Orphanmthlna5(IPR000832) GPCR, family 2, secretin-like2.45E−12Probable Mth-like 4 D. melanogaster
B15KAF6215469.1OrphanmthlnaComplete(IPR000832) GPCR, family 2, secretin-like1.91E−50Mth2 Drosophila simulans
B16KAF6197298.1OrphanmthlnaComplete(IPR000832) GPCR, family 2, secretin-like3.21E−26Mth2 Drosophila yakuba
B17KAF6207182.1OrphanmthlnaComplete(IPR000832) GPCR, family 2, secretin-like6.91E−21Probable Mth-like 3 D. melanogaster
B18KAF6208177.1OrphanmthlnaComplete(IPR000832) GPCR, family 2, secretin-like4.42E−19Probable Mth-like 4 D. melanogaster
B19KAF6207243.1OrphanmthlnaComplete(IPR000832) GPCR, family 2, secretin-like5.5E−18Probable Mth-like 3 D. melanogaster
B20KAF6216028.1OrphanmthlnaComplete(IPR000832) GPCR, family 2, secretin-like; (IPR022343) GCR1-cAMP receptor7.79E−27Probable Mth-like 11 D. melanogaster
B21KAF6206877.1OrphanmthlnaComplete(IPR000832) GPCR, family 2, secretin-like2.6E−33Mth2 Drosophila yakuba

na, not annotated or applicable, Complete means there is a complete 7TM structure.

The B1 subfamily is made of largely classical hormone receptors. It comprises three types of hormone receptors in D. melanogaster: diuretic hormone 31 receptor (DH31-R/hector), CRF-like diuretic hormone 44 (DH44-R), and pigment dispersing factor receptor (Pdfr) (57–60). In our study, all three types of hormone receptors were identified. The parathyroid hormone receptor (PTHR), which is involved in the calcium and phosphate homeostasis and bone growth in vertebrates, is also a subfamily-B1 GPCR (58). There are two PTHRs in mammals, which are involved in calcium and phosphate homeostasis and bone growth (61, 62). In insects, PTHR-like (PTHRL) have been identified from T. castaneum, A. mellifera, P. h. humanus, and N. lugens (18, 26, 57), but its counter-parts in D. melanogaster, B. mori, A. pisum, and A. gambiae are not found (17, 22, 34, 63). T. castaneum has two distinct PTHRLs (57), N. lugens possesses a pair of homologous PTHRLs (26), and A. mellifera only has one PTHRL (57). In our study, we also identified one PTHRL, B9, which shared a low e-value (1e−104 and 1e−111) with two PTHRLs in N. lugens. These results showed that genes coding for PTHR are divergent among insects. The B2 subfamily is characterized by a long extracellular N-terminal domain and a GPCR proteolytic site (57, 58, 64). Based on phylogenetic analysis and sequence similarity, three receptors (B10–12) were classified in the B2 subfamily, which correspond to a calcium-independent receptor for α-latrotoxin (Cirl), starry night (stan), and CG15744, respectively. However, the orthologs for CG11318 and CG15556 were not identified in the genome of A. lucorum. There is only one group of receptors in the B3 subfamily; i.e., Methuselah (mth)/Methuselah-like (mthl) (57, 58). This gene family is involved in the modulation of life span and stress responses. No counterpart for the mth gene family has been identified in vertebrates (57). In insects, the number in the B3 subfamily is highly variable (18, 57). In D. melanogaster, this subfamily can be divided into two groups based on their structure, 12 mth ectodomain‐positive members (mth, mthl2, mthl3, mthl4, mthl6, mthl7, mthl8, mthl9, mthl10, mthl11, mthl12, and mthl13) and four mth ectodomain‐negative members (mthl1, mthl5, mthl14, and mthl15) (65). In our study, nine receptors (B13–21) were identified in this family. Based on phylogenetic analysis, B13 and B20 may belong to the mth ectodomain‐positive group, and others may be members of the mth ectodomain‐negative group. At the mRNA level, B13, B18, and B21 showed a higher expression level than other mthl members, which indicated these three members of the B3 subfamily may play a more important role in A. lucorum.

3.3 Family-C GPCRs

Family-C GPCRs possess a large ligand-binding extracellular domain and form constitutive dimers (34, 66–68). There are three types of GPCRs in family-C, the glutamate and γ-amino butyric acid (GABA-B) receptors, the bride of sevenless (boss-type) receptors, and the metabotropic glutamate (mGlu) receptors. Until now, nine family-C GPCRs from D. melanogaster and seven from A. pisum have been reported. By using these reference sequences, 10 family-C members ( nad ) of A. lucorum were identified here. Like A. pisum (22), there are two GABA-B receptors (C1 and C2) in A. lucorum. C1 shares a 78% sequence similarity with D. melanogaster GABA-B-R1, while C2 has a 44% sequence identity with D. melanogaster GABA-B-R2. The orthologous gene to D. melanogaster GABA-B-R3 has not been found in two Hemiptera insects. The boss-type receptor was first identified as a ligand for sevenless tyrosine kinase, which was involved in eye differentiation in D. melanogaster. Subsequently, boss has been implicated in the glucose-response (69, 70). It has been reported in D. melanogaster (34), A. gambiae (63), A. pisum (22), and T. castaneum (16), but not in B. mori (17), A. mellifera, N. vitripennis, and P. humanus corporis. Here, C3 was identified as the orthologous gene to boss. These results indicated that boss has been randomly lost in insects during their evolutionary process. Moreover, three mGlu receptors (C4, C5, and C6) were found in A. lucorum, whereas there are only two mGlu receptors in D. melanogaster (34) and one mGlu receptor in A. pisum (22). Small expansions of A. lucorum mGlu receptors have been observed. There are some unclassified receptors in this family. C7 was the orthologous gene to smog. C8 and C9 showed 53 and 38% sequence identities with their counterparts in D. melanogaster, respectively. As shown in , C9 showed a high expressed level in egg, adult head, and all nymph tissues, except leg, while the function of its orthologous gene is unclear. C10 was an orphan receptor that has not been reported in D. melanogaster and A. pisum (22, 34).

3.4 Family-F GPCRs

Family-F GPCRs comprise the frizzled gene family and the smoothened gene (34, 71). In this study, four putative A. lucorum GPCRs were identified in the frizzled/smoothened GPCR family, which are orthologous to D. melanogaster fz, fz2, fz3, and smo ( and ). Our results indicated that orthologs for D. melanogaster fz4 were missing in A. lucorum. Among family-F, F4 (smo) showed the highest expression in the egg with a TPM of 61.

4 Discussion

In this study, we systematically identified 133 GPCRs from A. lucorum. Compared with other model insects, we also found the GPCR genes remarkably expanded among the biogenic amine receptors, neuropeptide and protein hormone receptors, and the B1 subfamily ( ). Some of them had been reported in R. prolixus or C. lectularius (24, 27), such as the duplications of 5-HT7, Lkr, PK1-R, ETHR, CCAP-R, NPFR, SIFaR, and DH31-R. However, missing TyrR, Tre1, TrissinR, MsR, and some orphan receptors has also been observed in the genome of A. lucorum ( ). All these predicted GPCRs were quantified by transcriptome data. Although most GPCR genes showed a low expression level in A. lucorum, there were a few highly expressed GPCR genes, such as the long-wavelength opsin and Cirl. By comparative analysis, we also found C2 LGR types were widely distributed in Hemiptera. All these aspects will be discussed in detail below.

4.1 GPCRs Gene Expansion Occurred in A. lucorum

Compared with other well-studied insects, we noticed that the number of genes coding for GPCRs is obviously larger than for other insects, especially expanded among the biogenic amine receptors, neuropeptide and protein hormone receptors, and B1 subfamily ( ). There were 26 GPCRs duplicated in A. lucorum. Twenty-three of them were classified into the three subfamilies mentioned above. By MCScanX analysis, we found six tandem duplication events occurred among Rh6, Rh7, AstA-R, ETHR, FMRFaR, and AdoR, while most GPCR genes duplicated dispersedly. Considering the location of GPCR genes on chromosomes, it suggested that the duplication of GPCR genes mainly occurred as independent duplications and transitions ( ). The duplicate biogenic amine receptors in A. lucorum included 5-HT1A, 5-HT1B, 5-HT2A, 5-HT2B, 5-HT7, Dop2R, mAChR-B, Oamb, Octbeta2R, and Octbeta3R. These biogenic amine receptors can regulate many behaviors including flight and fight, learning and memory, sleep and wakefulness, feeding, and social and reproductive behaviors (72–74). For example, 5-HT1A was related to locomotor activity in B. mori. Injecting the antagonist of the Bm5-HT1A receptor into larvae caused slow or weak motility, and adults had lowered courtship vitality or moving speed (75). mAChRs have also been reported to be critical in regulation of locomotory behavior in Drosophila (76). In addition, 5-HT1B mediates hemocyte phagocytosis and serotonergic signaling performs critical modulatory functions in immune systems. Moreover, 5-HT7 and Dop2R were shown to be associated with learning ability (77, 78) and octopamine receptors were required for ovulation in D. melanogaster (79). The ligands of neuropeptide and protein hormone receptors and the B1 subfamily belong to neuropeptides, which also play an important role in the regulation of development, reproduction, feeding, courtship, aggression, olfaction, locomotor activity, circadian rhythm, and many other physiological processes in insects (21, 80, 81). The gene expansion of these three subfamilies indicated that A. lucorum had a more complex peptidergic signaling system. Expansions of genes associated with omnivorousness and mesophyll feeding, such as those related to digestion, chemosensory perception, and detoxification, were also observed in A. lucorum (32). Gustatory receptors (Grs) and odorant receptors (Ors) are thought to be the most important chemosensory receptors. Like GPCRs, Ors, and Grs are seven-transmembrane domain receptors but belong to the chemosensory 7tm receptor superfamily (82, 83). It has been suggested that the cause of gene expansion in GPCRs might be similar to that of chemosensory receptors, also to better adapt to the environment (32). A. lucorum is found in natural and agricultural ecosystems throughout the world (30), and many of them are generalists, exhibiting diverse feeding habits or preferences (e.g., feeding on leaf, stem, inflorescences, nectar, pollen, and fruit) (32). These results indicated the complex peptidergic signaling system is more favorable for A. lucorum to adapt to multiple living environments and multiple hosts.

4.2 A. lucorum Appears to Have Evolved From a Novel Large Clade of Known FMRFaRs

FMRFamide (FMRFa) is a cardioexcitatory peptide that was first isolated from the nervous system of the clam, Macrocallista nimbosa (84), and is active as a tetrapeptide only in mollusks and annelids. Since the discovery of FMRFa, peptides with extended length at the N-terminal portion have been reported, such as myosuppressin (Ms) (85). Here, nine receptors (A54–62) displayed a certainly evolutionary kinship with the FMRFaR of D. melanogaster, A. pisum, C. lectularius, and R. prolixus, while the MsR is missing in A. lucorum ( ). By reconstruction of the phylogenetic tree of MsR and FMRFaR with more species (17, 86), we found these receptors were closer to the known FMRFaRs ( ). Among these, A54 and A55 were clustered in a single clade with the FMRFaRs that had been identified in other insects, whereas the other seven receptors are clustered on the other single clade. The most similar proteins in UniProtKB/Swiss-Prot of these receptors are both D. melanogaster FMRFaRs (CG2114). Among these receptors, A54 was the ortholog to the insect FMRFaRs with the smallest e-value of 1.75E−130, and A55, which was adjacent located on chromosome 9, is a tandem duplication that occurred at the beginning of this receptor expansion. However, the other seven receptors (A56–62) were scattered across six chromosomes, which indicated these seven receptors might have arisen from transposition. By searching in the genome of other heteropteran insects, we found there are only one or two FMRFaRs in each heteropteran species. We suggest A54 and A55 should be classified as FMRFaRs, while the others (A56–62) were named as FMRFaR-like for the moment. This branch might be another unknown GPCR or even contain the MsR. Recent research had found that FMRFaR stimulates intracellular calcium signaling through the IP3R and helps maintain neuronal excitability in a subset of dopaminergic neurons for positive modulation of flight bout durations (87), and the ligand can reduce spontaneous muscle contractions, such as in the intestinal muscle and the heart rate, which also have an impact on movements (88, 89). A. lucorum has great flight capacity and its adults can fly 151.3 km within 48 h (90). A large FMRFaR-like branch evolved in A. lucorum may help it maintain strong flight capability.

4.3 Only a Few GPCR Genes Showed High Expression Levels in A. lucorum

In our research, only 23 GPCR genes (17.3% of all GPCR genes) were expressed highly (TPM >10) in at least one tissue, while most GPCR genes showed a low expression level in A. lucorum. This result is consistent with previous studies in which most GPCRs showed a low endogenous expression level, even at the mRNA level (91–93). Certainly, a low expression level of GPCRs does not necessarily equate to functional insignificance (94). Here, through transcriptome analysis, we found the expression levels of opsins were higher than in other subfamilies. Opsins that originated early in metazoan evolution mediate the response to visual stimuli primarily. When stimulated by light, opsins can activate a downstream signaling cascade by conformational change (95). Most opsin genes are expressed in photoreceptors, but there are opsins expressed in other tissues, suggesting some nonvisual functions (96, 97). In A. lucorum, opsin genes (A1–3) were expressed highly not only in head but also in leg, wing, and mouthpart, indicating these opsins may execute some nonvisual functions ( ). Among these, the LW opsin (A1) showed the highest expression levels in the head tissue of adults (TPM = 28,787, at least 20 times more than other GPCR; ). The peak absorbance of the LW opsin is 500–600 nm, which corresponds to yellow-green light. As night sets in, the natural ambient light is increasingly dominated by longer wavelengths (98, 99). The importance of LW opsin had been reported in many nocturnal insects (100, 101). The adults of the A. lucorum were mainly active from dusk to early morning (102). High expression levels of LW opsin may help the organism adapt to a low light environment. We found B10, orthologous to Cirl, is the most widely expressed GPCR gene, which can be tested in all tissues ( ). Cirl belongs to a unique branch of GPCRs and, specifically, is an adhesion GPCR (103, 104). The orthologs of Cirl have been discovered in almost all animals from invertebrates to vertebrates, including humans (105). There are three homologs of Cirl in most vertebrates (Cirl‐1, Cirl‐2, and Cirl‐3) and two in birds and worms, whereas there is only one homolog in insects—which is most homologous to vertebrate Cirl-2 (103, 106). The expression pattern of insects Cirl, which had been reported to be expressed in multiple tissues (103, 107), was also like vertebrate Cirl-2 (108). Although there is only one Cirl member in insect species, Cirl is still involved in multiple physiological processes, which can regulate sensory, developmental, reproductive, and immune functions in insects (104, 109). Here, B10 had been detected in all transcriptomic samples and its distribution range is wider than in T. castaneum and D. melanogaster (103, 106). This kind of expression pattern suggested B10 is crucial in the development of A. lucorum.

4.4 Type C2 LGRs Are Mainly Distributed in Hemiptera and Phthiraptera Insects

Within the neuropeptide and protein hormone receptor subfamily, LGRs are a distinct subgroup with important functions in development and reproduction (110). Three distinct types of LGRs have been defined based on their structural characteristics and they are distinguished by the number of leucine-rich repeat (LRR) motifs, the absence or presence of a low density lipoprotein receptor domain class A (LDLa) motif, and their type-specific hinge region. Generally, type B LGRs have about twice the number of LRRs compared to the other two types. An exclusive feature of the type C LGRs is the presence of at least one LDLa motif in the ectodomain. The more general type containing only one LDLa will be referred to as type C1, whereas type C2 contained more than one LDLa (111). Type C2 LGRs were first discovered in echinoderms, mollusks, and in one insect species (Pediculus humanis corporis). In our study, we found it existed in all hemipteran insects that we studied. Combining recent work, we mentioned that type C2 LGRs are reported in many hemipteran insects and P. h. humanus (18, 23, 25, 26), and are lost in other orders of insects (16, 17, 63, 86) ( ). Until now, the presence of type C2 LGRs have been found in all Hemiptera insects in which their GPCRs have been identified (22, 23, 25, 26). Among insects, Phthiraptera is one of the orders most closely related to Hemiptera (112). Type C2 LGRs may be present in the common ancestor of these two orders. To clarify the distribution of type C2 LGRs in insects, we checked all protein sequences in the non-redundant protein sequences database (nr) of NCBI. The result showed, except for Hemiptera and Phthiraptera, type C2 LGRs were also founded in Zootermopsis nevadensis of Blattodea. In terms of functionality, LGRs have important functions in development and reproduction. Type A LGRs and type B LGRs are stimulated by large dimeric protein hormones (110), regulating the adult eclosion of insects, and cuticle tanning (113, 114), while type C1 LGRs are the receptors of insulin-like peptide 7 and insulin-like peptide 8 and they coordinate organ growth in D. melanogaster (115–117). At present, the function of type C2 LGRs is undetermined. The function of type C2 LGRs and the existence of type C2 LGRs in Phthiraptera need to be explored in future work.

Data Availability Statement

The original contributions presented in the study are included in the article/ . Further inquiries can be directed to the corresponding author.

Author Contributions

The majority of the work described here was carried out by HG. This work was also assisted by YL, MW, XS, JT, and FF. BL designed the study and crucially revised the manuscript for important intellectual content and data analysis. All authors contributed to the article and approved the submitted version.

Funding

This work was supported by the National Natural Science Foundation of China (Nos. 31872970 & 3217030192), and the Graduate Research and Innovation Projects of Jiangsu Province (No. KYCX21_1361).

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
  115 in total

Review 1.  International Union of Pharmacology. XLVI. G protein-coupled receptor list.

Authors:  Steven M Foord; Tom I Bonner; Richard R Neubig; Edward M Rosser; Jean-Phillipe Pin; Anthony P Davenport; Michael Spedding; Anthony J Harmar
Journal:  Pharmacol Rev       Date:  2005-06       Impact factor: 25.468

2.  Identification and comparative analysis of G protein-coupled receptors in Pediculus humanus humanus.

Authors:  Chengjun Li; Xiaowen Song; Xuhong Chen; Xing Liu; Ming Sang; Wei Wu; Xiaopei Yun; Xingxing Hu; Bin Li
Journal:  Genomics       Date:  2014-06-19       Impact factor: 5.736

3.  Mechanism and function of Drosophila capa GPCR: a desiccation stress-responsive receptor with functional homology to human neuromedinU receptor.

Authors:  Selim Terhzaz; Pablo Cabrero; Joris H Robben; Jonathan C Radford; Brian D Hudson; Graeme Milligan; Julian A T Dow; Shireen-A Davies
Journal:  PLoS One       Date:  2012-01-11       Impact factor: 3.240

Review 4.  Family-B G-protein-coupled receptors.

Authors:  A J Harmar
Journal:  Genome Biol       Date:  2001-11-23       Impact factor: 13.583

5.  Seasonal alterations in host range and fidelity in the polyphagous mirid bug, Apolygus lucorum (Heteroptera: Miridae).

Authors:  Hongsheng Pan; Bing Liu; Yanhui Lu; Kris A G Wyckhuys
Journal:  PLoS One       Date:  2015-02-18       Impact factor: 3.240

6.  One-step generation of mice carrying a conditional allele together with an HA-tag insertion for the delta opioid receptor.

Authors:  Dongru Su; Min Wang; Chenli Ye; Jiahui Fang; Yanhui Duan; Zhenghong Zhang; Qiuhong Hua; Changjie Shi; Lihong Zhang; Ru Zhang; Xin Xie
Journal:  Sci Rep       Date:  2017-03-16       Impact factor: 4.379

7.  An effect of thermo-stimulation of lower abdomen on autonomic nervous system: An experimental study.

Authors:  Kyeong Han Kim; Jong Uk Kim; Bo Hyun Kim; Jin Hyeon Shin; Seong Jin Hong; Sang Ryong Lee; Tae-Han Yook
Journal:  J Pharmacopuncture       Date:  2018-06-30

8.  Detection and Quantification of GPCR mRNA: An Assessment and Implications of Data from High-Content Methods.

Authors:  Krishna Sriram; Shu Z Wiley; Kevin Moyung; Matthew W Gorr; Cristina Salmerón; Jordin Marucut; Randall P French; Andrew M Lowy; Paul A Insel
Journal:  ACS Omega       Date:  2019-09-30

9.  The evolution of opsin genes in five species of mirid bugs: duplication of long-wavelength opsins and loss of blue-sensitive opsins.

Authors:  Pengjun Xu; Bin Lu; Jiangtao Chao; Robert Holdbrook; Gemei Liang; Yanhui Lu
Journal:  BMC Ecol Evol       Date:  2021-04-26

10.  Identification of the genes involved in odorant reception and detection in the palm weevil Rhynchophorus ferrugineus, an important quarantine pest, by antennal transcriptome analysis.

Authors:  Binu Antony; Alan Soffan; Jernej Jakše; Mahmoud M Abdelazim; Saleh A Aldosari; Abdulrahman S Aldawood; Arnab Pain
Journal:  BMC Genomics       Date:  2016-01-22       Impact factor: 3.969

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.