Sergio Munoz1, Felix D Guerrero2, Anastasia Kellogg1, Andrew M Heekin2, Ming-Ying Leung1.
Abstract
The cattle tick of Australia, Rhipicephalus australis, is a vector for microbial parasites that cause serious bovine diseases. The Haller's organ, located in the tick's forelegs, is crucial for host detection and mating. To facilitate the development of new technologies for better control of this agricultural pest, we aimed to sequence and annotate the transcriptome of the R. australis forelegs and associated tissues, including the Haller's organ. As G protein-coupled receptors (GPCRs) are an important family of eukaryotic proteins studied as pharmaceutical targets in humans, we prioritized the identification and classification of the GPCRs expressed in the foreleg tissues. The two forelegs of adult R. australis were excised, and RNA was extracted and pyrosequenced with 454 technology. Reads were assembled into unigenes and annotated by sequence similarity. Python scripts were written to find open reading frames (ORFs) in each unigene. These ORFs were analyzed by different GPCR prediction approaches based on sequence alignments, support vector machines, hidden Markov models, and principal component analysis. GPCRs consistently predicted by multiple methods were further studied by phylogenetic analysis and 3D homology modeling. From 4,782 assembled unigenes, 40,907 possible ORFs were predicted. Using Blastp, Pfam, GPCRpred, TMHMM, and PCA-GPCR, a basic set of 46 GPCR candidates was compiled and a phylogenetic tree was constructed. With further screening of tertiary structures predicted by RaptorX, 6 likely GPCRs emerged, and the strongest candidate was classified by PCA-GPCR as a GABAB receptor.
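The ORF-finding step can be illustrated with a minimal Python sketch. The authors' scripts are not reproduced in this record, so the details below are assumptions: an ORF is taken to run from the first ATG to the next stop codon (or to the end of the unigene), the standard genetic code is used, the length in the identifier is counted in amino acids, and the 50-residue cutoff is borrowed from the Fig 2 caption. The function names are hypothetical; the identifier layout mimics the ORF names used in the tables.

```python
# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(seq):
    """Translate codon by codon; unknown codons (e.g. with N) become 'X'."""
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq) - 2, 3)
    )


def find_orfs(name, seq, min_aa=50):
    """Scan all six reading frames of one unigene (assumed ORF definition).

    Returns (orf_id, peptide) pairs. A segment that reaches the end of the
    sequence without a stop codon is kept as a possibly partial ORF.
    """
    seq = seq.upper()
    orfs = []
    for strand, dna in ((1, seq), (-1, reverse_complement(seq))):
        for frame in range(3):
            protein = translate(dna[frame:])
            for segment in protein.split("*"):
                start = segment.find("M")
                if start == -1:
                    continue  # no start codon in this stop-free segment
                peptide = segment[start:]
                if len(peptide) >= min_aa:
                    orf_id = (
                        f"{name}_{len(peptide)}(length)"
                        f"_{strand}(strand)_{frame}(frame)"
                    )
                    orfs.append((orf_id, peptide))
    return orfs
```

For example, `find_orfs("u1", "CCATGGCTGCTGCTTAA", min_aa=3)` returns a single ORF named `u1_4(length)_1(strand)_2(frame)` with peptide `MAAA`. A different ORF definition (for instance stop-to-stop segments) would change the number of ORFs reported per unigene.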
Year: 2017 PMID: 28231302 PMCID: PMC5322884 DOI: 10.1371/journal.pone.0172326
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1. Overall workflow of the study.
Visual representation of all steps of the study, from wet-lab procedures to bioinformatic analysis. Steps taken and results are represented by rectangles, methods by rhombuses and criteria by circles.
GPCR Synonyms (GO:0004930) and terms associated with GPCR activity.
| Term |
|---|
| G-protein coupled receptor activity, unknown ligand |
| Mas proto-oncogene receptor activity |
| Orphan G protein coupled receptor activity |
| Orphan GPCR activity |
| RDC1 receptor activity |
| Super conserved receptor expressed in brain receptor activity |
| Epstein-Barr Virus-induced receptor activity |
| SREB receptor |
| EBV-induced receptor |
| Orphan G-protein coupled receptor activity |
| Receptor activity, G-protein coupled |
| G protein coupled receptor activity |
| G protein linked receptor activity |
| GPCR activity |
| Ligand-dependent GPCR activity |
Fig 2. R. australis foreleg transcriptome annotation.
a) Top Blastx hits by species; b) Characterization of Pfam predictions for all ORFs with at least 50 amino acids. The chart represents those ORFs that had a significant hit in Pfam and belonged to a clan/superfamily. Clans/superfamilies containing fewer than 5 sequences were grouped under “others”.
ORFs predicted as GPCRs by any of the alignment-based tools.
| Foreleg ORF | Blastx vs Uniref100 | Blastp vs | Blastp vs Chelicerates | Blastp vs Syng | Blastp vs | Pfam |
|---|---|---|---|---|---|---|
| athaller3876_117(length)_1(strand)_1(frame) | ||||||
| athaller1224_421(length)_1(strand)_0(frame) | ||||||
| athaller3802_175(length)_1(strand)_2(frame) | ||||||
| athaller3787_156(length)_1(strand)_0(frame) | ||||||
| athaller1175_662(length)_1(strand)_1(frame) | ||||||
| athaller1230_465(length)_-1(strand)_0(frame) | ||||||
| athaller2715_233(length)_1(strand)_2(frame) | ||||||
| athaller2824_126(length)_1(strand)_1(frame) | ||||||
| athaller356_291(length)_1(strand)_1(frame) | ||||||
| athaller357_301(length)_1(strand)_2(frame) | ||||||
| athaller4258_129(length)_-1(strand)_2(frame) | ||||||
| athaller4697_102(length)_1(strand)_0(frame) | ||||||
| athaller675_193(length)_1(strand)_0(frame) | ||||||
| athaller2474_190(length)_1(strand)_1(frame) | ||||||
| athaller4147_161(length)_-1(strand)_2(frame) | ||||||
| athaller1305_187(length)_-1(strand)_1(frame) | ||||||
| athaller1897_360(length)_-1(strand)_2(frame) | ||||||
| athaller508_240(length)_-1(strand)_1(frame) | ||||||
Fig 3. Venn diagram showing the numbers of GPCRs predicted by 4 different approaches: alignment methods (Blast and Pfam), TMHMM, GPCRpred, and PCA-GPCR.
The total number of predictions by each approach is indicated in parentheses.
ORFs predicted to be GPCRs by at least 3 approaches.
| ORF | Predicted by | PCA-GPCR Class | PCA-GPCR Subfamily | GPCRpred Class | GPCRpred Subfamily |
|---|---|---|---|---|---|
| athaller1386_405(length)_1(strand)_0(frame) | TGP | A | (Rhod)opsin | A | Prostanoid |
| athaller1516_468(length)_-1(strand)_0(frame) | TGP | A | (Rhod)opsin | A | Amine |
| athaller2143_275(length)_1(strand)_0(frame) | TGP | A | (Rhod)opsin | A | Peptide |
| athaller2570_238(length)_-1(strand)_0(frame) | TGP | A | (Rhod)opsin | A | Peptide |
| athaller1702_381(length)_-1(strand)_2(frame) | TGP | A | Amine | A | Peptide |
| athaller1768_231(length)_-1(strand)_2(frame) | TGP | A | Amine | A | Peptide |
| athaller1154_282(length)_1(strand)_0(frame) | TGP | A | Melatonin | A | Peptide |
| athaller1479_188(length)_1(strand)_2(frame) | TGP | A | Melatonin | A | Peptide |
| athaller3002_157(length)_-1(strand)_1(frame) | TGP | A | Melatonin | A | Peptide |
| athaller1138_187(length)_1(strand)_2(frame) | TGP | A | NucleotideLike | A | Amine |
| athaller1414_324(length)_-1(strand)_2(frame) | TGP | A | NucleotideLike | A | Amine |
| athaller1735_320(length)_-1(strand)_1(frame) | TGP | A | NucleotideLike | A | Peptide |
| athaller1975_264(length)_1(strand)_0(frame) | TGP | A | NucleotideLike | A | Peptide |
| athaller2639_255(length)_1(strand)_2(frame) | TGP | A | NucleotideLike | A | Peptide |
| athaller1224_421(length)_1(strand)_0(frame) | AGP | A | Peptide | A | Peptide |
| athaller3668_184(length)_-1(strand)_1(frame) | TGP | A | Olfactory | A | Peptide |
| athaller3787_156(length)_1(strand)_0(frame) | TAP | A | Olfactory | NA | - |
| athaller1342_403(length)_-1(strand)_0(frame) | TGP | A | Orphan | A | Rhodopsin |
| athaller1823_379(length)_1(strand)_2(frame) | TGP | A | Orphan | A | Rhodopsin |
| athaller2698_229(length)_-1(strand)_1(frame) | TGP | A | Orphan | A | Rhodopsin |
| athaller356_291(length)_1(strand)_1(frame) | TAP | A | Orphan | NA | - |
| athaller454_222(length)_1(strand)_0(frame) | TGP | A | Orphan | A | Peptide |
| athaller1303_323(length)_-1(strand)_0(frame) | TGP | A | Peptide | A | Rhodopsin |
| athaller1377_284(length)_1(strand)_0(frame) | TGP | A | Peptide | A | Rhodopsin |
| athaller1927_354(length)_-1(strand)_2(frame) | TGP | A | Peptide | A | Nucleotide-like |
| athaller1230_465(length)_-1(strand)_0(frame) | AGP | B | Brain-specific angiogenesis inhibitor (BAI) | B | - |
| athaller552_430(length)_1(strand)_0(frame) | TGP | B | GPR133 | A | Amine |
| athaller1160_759(length)_1(strand)_1(frame) | TGP | C | GABAB | A | Amine |
Boldface: Sequences whose subfamilies were consistently predicted by PCA-GPCR and GPCRpred.
* Sequences that had +6 and +8 in the RaptorX 3D model score.
** This sequence was consistently predicted as a GPCR by all methods.
ǂ Possible olfactory GPCR, predicted to be a partial GPCR by TMHMM.
1 Tools that predicted the ORF to be a GPCR (T = TMHMM, A = alignment methods, G = GPCRpred, P = PCA-GPCR).
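The "at least 3 approaches" criterion and the T/A/G/P codes in the table can be sketched as a simple set-membership count. This is an illustrative reconstruction, not the authors' code: the function name, the input layout (one set of ORF identifiers per tool letter), and the `orf_two_tools_only` entry are hypothetical, while the three real identifiers and their codes are taken from the table above.

```python
def consensus_calls(predictions, min_tools=3):
    """Keep ORFs called a GPCR by at least `min_tools` approaches.

    `predictions` maps a one-letter tool code (T, A, G, P) to the set of
    ORF identifiers that the corresponding approach predicted as GPCRs.
    Returns {orf_id: code}, where the code lists the supporting tools in
    T, A, G, P order, as in the "Predicted by" column.
    """
    order = "TAGP"
    all_orfs = set().union(*predictions.values())
    calls = {}
    for orf in sorted(all_orfs):
        code = "".join(t for t in order if orf in predictions.get(t, set()))
        if len(code) >= min_tools:
            calls[orf] = code
    return calls


# Per-tool sets rebuilt from three table rows, plus one hypothetical ORF
# supported by only two tools (it should be filtered out).
example = {
    "T": {
        "athaller1386_405(length)_1(strand)_0(frame)",
        "athaller3787_156(length)_1(strand)_0(frame)",
        "orf_two_tools_only",
    },
    "A": {
        "athaller1224_421(length)_1(strand)_0(frame)",
        "athaller3787_156(length)_1(strand)_0(frame)",
    },
    "G": {
        "athaller1386_405(length)_1(strand)_0(frame)",
        "athaller1224_421(length)_1(strand)_0(frame)",
    },
    "P": {
        "athaller1386_405(length)_1(strand)_0(frame)",
        "athaller1224_421(length)_1(strand)_0(frame)",
        "athaller3787_156(length)_1(strand)_0(frame)",
        "orf_two_tools_only",
    },
}
calls = consensus_calls(example)
```

With this input, `calls` contains the three table ORFs with codes `TGP`, `AGP`, and `TAP`, and omits the two-tool entry. Raising `min_tools` to 4 would keep only sequences supported by every approach.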
Fig 4. Phylogenetic tree of the ORFs that were predicted by 3 or more approaches.
Clades containing clusters of ORFs from the same GPCR subfamilies are highlighted.
Fig 5. 3D structure model for ORF athaller1175_662.
Tertiary structure of a predicted GABAB receptor. This ORF was consistently predicted to be a GPCR by all the prediction approaches used.
GPCR-related search terms.
List of names and terms of proteins known to interact with GPCRs.
| Search term | Protein type |
|---|---|
| Adenylate Cyclase | |
| G-protein subunits | |
| GRK | |
| PKA | |
| GTPase | |
| GAP | |
| GAP | |
| GEF | |
| GEF | |
| GTPase and GAP | |
| GTPase | |
| GTPase and GAP |
Predicted GPCR-related proteins.
Cattle tick foreleg transcripts annotated as encoding proteins related to GPCR activity.
| Annotation | Transcript or ORF |
|---|---|
| cAMP-dependent protein kinase regulator | athaller128 |
| cAMP-dependent protein kinase catalytic subunit isoform 2 | athaller1899 |
| cAMP-dependent protein kinase regulator | athaller2981 |
| Guanine nucleotide-binding protein subunit beta-2-like 1 protein | athaller853 |
| Guanine nucleotide binding protein beta subunit | athaller2293 |
| Putative mitofusin 1 gtpase involved in mitochondrial biogenesis | athaller282 |
| Putative mitofusin 1 gtpase involved in mitochondrial biogenesis | athaller284 |
| Ras-like GTP-binding protein Rho1 | athaller398 |
| Ras-like GTP-binding protein Rho1 | athaller399 |
| Putative rac1 gtpase effector fhos | athaller500 |
| Putative rac1 gtpase effector fhos | athaller501 |
| Ypt/rab specific gtpase activating protein gyp6 | athaller785 |
| Ypt/rab specific gtpase activating protein gyp6 | athaller786 |
| Putative rhoa gtpase effector dia/diaphanous (Fragment) | athaller1192 |
| Ras-related protein Rap-1A | athaller1263 |
| Ras-related protein Ral-A | athaller1266 |
| Putative rab subfamily protein of small gtpase (Fragment) | athaller1833 |
| Rho GTPase-activating protein RICH2 (Fragment) | athaller2021 |
| Putative vesicle coat complex copii gtpase subunit sar1 | athaller2065 |
| GTP-binding nuclear protein | athaller2080 |
| Rhoa gtpase effector dia/diaphanous (Fragment) | athaller2113 |
| Putative ypt/rab-specific gtpase-activating protein gyp1 | athaller2150 |
| Putative rac1 gtpase effector fhos | athaller2477 |
| Putative ras-related protein rab-11a | athaller2532 |
| Ras-related protein Rab-18 | athaller2665 |
| Ras-related protein Rap-2C | athaller2666 |
| Putative gtpase rab14 small g protein superfamily | athaller2700 |
| Ras-related protein Rab-1A | athaller2702 |
| Rasgap sh3 binding protein rasputin | athaller2830 |
| Putative rac1 gtpase effector fhos | athaller3014 |
| Large subunit GTPase 1 (Fragment) | athaller3190 |
| Rho gtpase binding protein | athaller3535 |
| Putative gtpase rab2 small g protein superfamily | athaller3629 |
| Ras-related protein Rab-24 | athaller4154 |
| Rhoa gtpase effector dia/diaphanous | athaller4155 |
| Putative retinitis pigmentosa gtpase regulator b (Fragment) | athaller4427 |
| Adenylate cyclase terminal differentiation specific | athaller184 |
| Adenylate cyclase terminal differentiation specific | athaller185 |
| Adenylate cyclase terminal differentiation specific | athaller186 |
| Adenylate cyclase terminal differentiation specific | athaller187 |
| Beta-arrestin | athaller2087 |
| RhoGAP | athaller2021_271(length)_-1(strand)_2(frame) |
| RabGAP-TBC | athaller2150_313(length)_-1(strand)_2(frame) |
| RhoGAP | athaller3118_219(length)_1(strand)_2(frame) |
| RhoGEF | athaller2000_339(length)_1(strand)_1(frame) |
| Arrestin_C | athaller2087_258(length)_-1(strand)_1(frame) |
*Predicted by both BlastX and Pfam
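One way to screen annotation text for GPCR-related proteins is a case-insensitive keyword match, sketched below. How the search was actually performed is not described in this record, so this is only an assumed approach: the function name and the three search strings are hypothetical, and the four annotations are copied from the table above.

```python
def match_terms(annotations, terms):
    """Return {transcript: [matched terms]} by case-insensitive substring search."""
    hits = {}
    for transcript, description in annotations.items():
        lowered = description.lower()
        matched = [term for term in terms if term.lower() in lowered]
        if matched:
            hits[transcript] = matched
    return hits


# Annotations copied from the table; the search strings are illustrative.
annotations = {
    "athaller184": "Adenylate cyclase terminal differentiation specific",
    "athaller2087": "Beta-arrestin",
    "athaller398": "Ras-like GTP-binding protein Rho1",
    "athaller500": "Putative rac1 gtpase effector fhos",
}
terms = ["adenylate cyclase", "GTPase", "arrestin"]
hits = match_terms(annotations, terms)
```

Here `hits` contains athaller184, athaller2087, and athaller500 but not athaller398, because "GTP-binding" does not contain the string "gtpase". Since athaller398 does appear in the table, a plain substring filter of this kind would need broader terms or manual curation to reproduce the reported list.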