| Literature DB >> 36180835 |
Hanno Schmidt1,2, Katharina Mauer3, Manuel Glaser4, Bahram Sayyaf Dezfuli5, Sören Lukas Hellmann6,7, Ana Lúcia Silva Gomes8, Falk Butter9, Rebecca C Wade4,10, Thomas Hankeln6, Holger Herlyn11.
Abstract
BACKGROUND: With the expansion of animal production, parasitic helminths are gaining increasing economic importance. However, application of several established deworming agents can harm treated hosts and environment due to their low specificity. Furthermore, the number of parasite strains showing resistance is growing, while hardly any new anthelminthics are being developed. Here, we present a bioinformatics workflow designed to reduce the time and cost in the development of new strategies against parasites. The workflow includes quantitative transcriptomics and proteomics, 3D structure modeling, binding site prediction, and virtual ligand screening. Its use is demonstrated for Acanthocephala (thorny-headed worms) which are an emerging pest in fish aquaculture. We included three acanthocephalans (Pomphorhynchus laevis, Neoechinorhynchus agilis, Neoechinorhynchus buttnerae) from four fish species (common barbel, European eel, thinlip mullet, tambaqui).Entities:
Keywords: Active ingredients; Anthelmintics; Medical genomics; Parasites; Target molecule; Virtual ligand screening
Mesh:
Substances:
Year: 2022 PMID: 36180835 PMCID: PMC9523657 DOI: 10.1186/s12864-022-08882-1
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 4.547
Fig. 1Flowchart of the analysis workflow. Female and male worm specimens were collected from two different hosts and used for mRNA sequencing and mass spectrometry. In-depth analyses ensured target identification (the target sequence is present in acanthocephalans), specificity (the target sequence is absent or has little sequence similarity in non-acanthocephalan species), and availability and accessibility (the target is present as protein in the acanthocephalan body wall). Candidate target protein sequences that fulfill these criteria were passed on to protein modeling and ligand screening
Fig. 2Analysis of transcript abundance. Shown are differential expression values for genes with at least 50 transcript reads in every sample. Each dot represents one gene. Genes with significantly similar transcript abundances (log fold change < 1.5; adjusted p-value < 0.05) are labeled blue, the remaining ones red. Only genes with similar transcript abundances in at least two of the four comparisons were kept for downstream analyses. Up- and downward pointing triangles at the top and bottom margins of the plots represent data points outside of the range depicted. Only a few isolated data points to the right have been omitted for better display
Fig. 3Correlation transcript and protein abundances. Each dot represents one of ~ 2500 proteins quantified by mass spectrometry. Candidate target proteins are highlighted in pink. Protein and transcript (RNA) abundances are given as iBAQ values and mean read counts, respectively. The correlation between the two abundances was moderately positive for all proteins (0.51; p-value = 2.4 e-165; Student’s t-test) and strongly positive for the candidate target proteins (0.81; p-value = 4.4 e-13; Student’s t-test). Given the levels of p-values, correction for multiple testing would not have affected the determination of significance
Candidate target proteins in acanthocephalans and known drugs predicted to bind to them
| Protein identifier | PFAM motif | Subcellular localization | Ligand (Drugs-lib in MTiOpenScreen [ | Binding energy (kcal/mol) |
|---|---|---|---|---|
| 1609# | glycosyl transferase | Golgi apparatus, membrane | Pranazepide (Derquantel) | −10.9 |
| 4617 | troponin | nucleus, soluble | Derquantel | −9.0 |
| 5995# | amine lyase | cytoplasm, soluble | Tadalafil | −9.0 |
| 7137# | protein kinase | nucleus, soluble | Casopitant | −10.8 |
| 8627 | unknown function | cell membrane, membrane | Afacifenacin | −11.4 |
| 8750# | PIP5K | cytoplasm, soluble | Piketoprofen | −9.9 |
| 8763# | NAD binding | peroxisome, soluble | Bemcentinib | −11.8 |
| 9169# | phosphatidic acid phosphatase | cell membrane, membrane | Tadalafil | −9.2 |
| 9190# | dopamine beta-monooxygenase | cell membrane, membrane | Fluazuron | −12.9 |
| 9257 | RNA recognition motif | nucleus, soluble | Heliomycin | −9.1 |
| 9684# | – | cytoplasm, soluble | Etoposide | −9.4 |
Ligands predicted to bind strongest with a minimum free energy of −9 kcal/mol are shown. Parentheses give a case of second most strongly binding by derquantel. Hash signs mark proteins predicted to possess enzyme activity based on ECPred
PFAM Protein families
Standard InChI keys and 2D structures of the ligands are available in Supplementary Table S10
Properties of selected ligands for assessing their drug-likeness
| Agent | Molecular weight | nHA | nHD | nRot | nRing | MaxRing | nHet | fChar | nRig | Stereo centers | TPSA | logS | logP | logD | RO5 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 438.45 | 6 | 2 | 4 | 6 | 12 | 7 | 0 | 33 | 1 | 77.56 | −5.6 | 4.23 | 3.76 | Yes | |
| 479.6 | 7 | 2 | 0 | 8 | 14 | 7 | 0 | 34 | 6 | 74.27 | −4.97 | 4.4 | 2.6 | Yes | |
| 389.41 | 7 | 1 | 1 | 6 | 17 | 7 | 0 | 32 | 2 | 74.87 | −5.36 | 2.9 | 2.43 | Yes | |
| 616.62 | 6 | 0 | 9 | 4 | 6 | 13 | 0 | 26 | 3 | 47.1 | −5.18 | 5.48 | 4.32 | No | |
| 481.52 | 5 | 1 | 6 | 5 | 10 | 8 | 0 | 30 | 1 | 44.81 | −6.63 | 5.91 | 4.58 | No* | |
| 344.41 | 4 | 1 | 5 | 3 | 6 | 4 | 0 | 21 | 1 | 62.29 | −4.58 | 3.77 | 3.69 | Yes | |
| 506.66 | 8 | 3 | 3 | 7 | 15 | 8 | 0 | 41 | 1 | 101.74 | −3.45 | 4.73 | 3.92 | No* | |
| 506.21 | 6 | 2 | 8 | 3 | 6 | 13 | 0 | 20 | 0 | 80.32 | −7.63 | 5.26 | 4.16 | No | |
| 376.36 | 6 | 4 | 0 | 5 | 16 | 6 | 0 | 25 | 0 | 115.06 | −5.55 | 4.71 | 1.86 | Yes | |
| 588.56 | 13 | 3 | 5 | 7 | 16 | 13 | 0 | 37 | 10 | 160.83 | −3.57 | 1.69 | 1.79 | No* | |
Listed are the physicochemical properties of the ligands with highest binding affinity for each of the eleven candidate target proteins
nHA Number of hydrogen bond acceptors, nHD Number of hydrogen bond donors, nRot Number of rotatable bonds, nRing Number of rings, MaxRing Number of atoms in the biggest ring, nHet Number of heteroatoms, fChar Formal charge, nRig Number of rigid bonds, TPSA Topological polar surface area, logS Log of the aqueous solubility, logP Log of the octanol/water partition coefficient, logD LogP at physiological pH of 7.4, RO5 Rule of five
The rule of five scores and integrates values for molecular weight, nHA, nHD, and logP for a synopsis of orally active drug-likeness. Asterisks mark ligands that do not fulfill the RO5 but have been applied orally in clinical studies. Molecular weight and RO5 were retrieved from ChEMBL, the other properties were predicted by ADMETlab
Fig. 4Three-dimensional structure models of the eleven top candidate target proteins. Shown are de novo models of 3D structures (constructed using AlphaFold2) for eleven proteins which fulfilled all filter criteria. The proteins were each additionally predicted to bind a drug with a free energy of ≤ −9.0 kcal/mol in the virtual screening using AutoDock Vina. The proteins are shown as molecular surfaces colored by AlphaFold2 confidence score (pLDDT; with higher values having greater confidence). Gray markings indicate predicted binding sites (on the surface or within the protein). Values in parentheses below protein identifiers give the average pLDDT of the protein model followed by the percentage identity between this model and one from a second 3D structure prediction program, RoseTTAFold. Both values are on the scale 0–100