
Fragment-Based Screening of a Natural Product Library against 62 Potential Malaria Drug Targets Employing Native Mass Spectrometry.

Hoan Vu1, Liliana Pedro1, Tin Mak1, Brendan McCormick1, Jessica Rowley1, Miaomiao Liu1, Angela Di Capua1, Billy Williams-Noonan1, Ngoc B Pham1, Rebecca Pouwer1, Bao Nguyen1, Katherine T Andrews1, Tina Skinner-Adams1, Jessica Kim, Wim G J Hol, Raymond Hui2, Gregory J Crowther, Wesley C Van Voorhis, Ronald J Quinn1.   

Abstract

Natural products are well known for their biological relevance, high degree of three-dimensionality, and access to areas of largely unexplored chemical space. To shape our understanding of the interaction between natural products and protein targets in the postgenomic era, we have used native mass spectrometry to investigate 62 potential protein targets for malaria using a natural-product-based fragment library. We reveal here 96 low-molecular-weight natural products identified as binding partners of 32 of the putative malarial targets. Seventy-nine (79) fragments have direct growth inhibition on Plasmodium falciparum at concentrations that are promising for the development of fragment hits against these protein targets. This adds a fragment library to the published HTS active libraries in the public domain.

Keywords:  fragments; malaria; native mass spectrometry; natural products; target identification

Year:  2018        PMID: 29436819      PMCID: PMC5902791          DOI: 10.1021/acsinfecdis.7b00197

Source DB:  PubMed          Journal:  ACS Infect Dis        ISSN: 2373-8227            Impact factor:   5.084


In 2015, malaria infected an estimated 212 million people globally, leading to 429 000 deaths, most of them children aged 5 years and under in Africa.[1] Plasmodium falciparum is the most common species causing malaria in humans. Currently, four major classes of compounds are clinically used to treat malaria: (I) quinolines (e.g., chloroquine, quinine, mefloquine), (II) antifolates (e.g., sulfadoxine, pyrimethamine), (III) artemisinin derivatives (e.g., artesunate, artemether), and (IV) antimicrobials (e.g., doxycycline).[2] However, a combination of parasite drug resistance and vector resistance to insecticides remains a huge challenge in combating this disease.[3] New classes of antimalarial drugs are therefore urgently needed. Since 2008, almost 6 million compounds have been screened against the blood stages of P. falciparum, and approximately 0.5% of these compounds showed activity consistent with an EC50 of less than 1 μM.[4] Chemical structures and associated data have been made public from consortia led by GlaxoSmithKline (GSK), Novartis, and St. Jude Children’s Research Hospital, Memphis (St. Jude) to encourage further research into this disease.[5−7] In addition, a subset of 400 compounds has been assembled into the Medicines for Malaria Venture Malaria Box that has catalyzed research into the discovery of new antimalarial clinical candidates.[8−13] Finding a ligand for a disease-related protein remains one of the biggest hurdles of the postgenomic era. The identification of synthetic or natural ligands can help decrypt the biochemical functions of proteins with unknown functions.[14] Such efforts require a method to detect the interaction between small-molecule ligands and macromolecular targets. Here, we describe a fragment-based approach that uses native mass spectrometry to detect protein–ligand interactions and exploits proteins produced by structural genomics efforts.
The technique relies on nondenaturing electrospray ionization (ESI) to first recognize multiply charged proteins in their near-native states. Native screening using high-resolution electrospray ionization Fourier transform ion cyclotron resonance mass spectrometry (ESI-FT-ICR-MS) is a label-free, fast, accurate method that permits the direct observation of noncovalent protein–ligand complexes.[15−17] In the resulting mass spectrum, the difference between the mass-to-charge ratios (Δ(m/z)) of the protein–ligand complex and the unbound protein ions, multiplied by the charge state (z), directly affords the molecular weight of the bound ligand (MWligand = Δ(m/z) × z). This molecular weight allows the identification of the active ligand in a pooled library. The simple mix-and-measure, label-free nature of these experiments makes native ESI-FT-ICR-MS a practical technique for screening fragments. We previously reported the detection of protein–ligand interactions in complex natural product extracts.[18−20] We also demonstrated that ESI-FT-ICR-MS can directly observe weak, noncovalent protein binding of fragment-sized natural products.[21] Fragment-based drug discovery (FBDD) is an established alternative approach to high-throughput screening (HTS).
Because of the simpler structure and smaller size of fragments, screening of fragment libraries can sample chemical space more efficiently and give higher hit rates in comparison to standard nonfragment chemical libraries.[22,23] Although fragment hits generally have lower potency with initial affinities of up to 1 mM,[24,25] their high ligand efficiency can offer more efficient and fruitful optimization campaigns, thus reducing attrition rates at all stages.[26] FBDD has yielded two marketed drugs, vemurafenib[27,28] and venetoclax,[29] and there are many other compounds currently in clinical trials,[30,31] including verubecestat (MK-8931) for Alzheimer’s disease, where the “intractable” Beta-APP cleaving enzyme (BACE) protein pockets that are long and shallow have proven difficult for traditional drug screenings to identify a hit.[32] Natural products, enriched with biosynthetic intermediates and endogenous metabolites resulting from being exposed to a long selection process to interact with biological targets, are excellent resources to search for protein binding partners. Native mass spectrometry provides key advantages for weak binding detection as it has high sensitivity, low sample consumption, does not require modifications or labeling of the protein target, and provides a direct visualization of all species present in solution under binding equilibrium.[33,34] Using this assay, we screened a fragment-sized library consisting of low-molecular-weight natural products and identified a number of natural products that bind to putative malaria targets. The hits were further investigated in biological activity assays to identify compounds that inhibit P. falciparum and warrant hit-to-lead development. We have identified 96 fragments that bind to 32 potential targets and have demonstrated that 79 of these fragments can inhibit the growth of malaria parasites in vitro.
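The mass relation used in native MS screening (ligand mass equals the m/z shift between complex and free protein multiplied by the charge state) can be illustrated with a minimal Python sketch. The protein mass, charge state, fragment names, and the 0.05 Da matching tolerance below are hypothetical values chosen for illustration and are not taken from this study.

```python
PROTON_MASS = 1.00728  # Da, mass of a proton (charge carrier in positive-mode ESI)


def mz(neutral_mass: float, charge: int) -> float:
    """m/z of an ion carrying `charge` protons."""
    return (neutral_mass + charge * PROTON_MASS) / charge


def ligand_mass(mz_complex: float, mz_free: float, charge: int) -> float:
    """Ligand mass from two peaks at the same charge state: delta(m/z) * z."""
    if charge <= 0:
        raise ValueError("charge state must be a positive integer")
    return (mz_complex - mz_free) * charge


def match_ligands(observed_mass: float, library: dict, tol_da: float = 0.05) -> list:
    """Names of pooled-library members whose mass lies within tol_da of the observed mass."""
    return [name for name, mass in library.items() if abs(mass - observed_mass) <= tol_da]


# Hypothetical example: an 11 700 Da protein at charge state +7 and a pool of two fragments.
pool = {"fragment_A": 248.10, "fragment_B": 232.11}
free_peak = mz(11700.0, 7)
complex_peak = mz(11700.0 + pool["fragment_A"], 7)

observed = ligand_mass(complex_peak, free_peak, 7)
candidates = match_ligands(observed, pool)
```

Because the proton contribution cancels in the difference, the result depends only on the peak spacing and the charge state, which is why fragments of distinct mass can be pooled and still be identified.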

Results and Discussion

Protein Selection

The proteins were selected according to a variety of criteria from the analysis of ∼5400 predicted open reading frames in the P. falciparum genome. Initial analysis was restricted to proteins of less than 50 kDa as the 4.7 T FT-ICR-MS detects up to 50 kDa. This produced 2535 genes, for 67 of which there was strong evidence of essentiality in either the exoerythrocytic or intraerythrocytic stages or of high expression in gametocytes or liver stages. The evidence for essentiality was based mainly on genetic knockouts[35] but also involved pathway-based reasoning. For example, dihydroorotate dehydrogenase (DHODH) has been validated as a drug target,[36] so other pyrimidine biosynthesis enzymes were also included. Of these 67 genes, 56 met the criterion of having been validated in an exoerythrocytic stage and/or being highly expressed in gametocyte[37] and/or liver stages.[38] Proteins that were expected to be difficult to express and/or purify were removed from the analysis, leaving a total of 49 prioritized candidates. We attempted to clone and express all 49 proteins but succeeded in expressing and purifying only 13, broadly consistent with the low success rates previously reported for Plasmodium proteins.[39,40] The full list of these proteins is in Table S1. Another 66 proteins were identified from previous selection criteria[41] as likely being essential for parasite survival and were amenable to recombinant expression. All proteins were soluble and therefore have a high likelihood of chemotherapeutic accessibility. Seventy-nine Plasmodium proteins were subjected to mass spectrometry under optimized conditions, yielding protein signals at low charge states between +8 and +18, depending on the molecular weight of the protein.
Of the 79 proteins, 62 had a good signal-to-noise ratio (>100:1), indicating that native mass spectrometry is a powerful technology for detecting proteins in their native-like state (Table S2 and Figure S3).

Fragment Library Assembly and Comparative Analysis of Physicochemical Properties, Shape, and Structural Diversity

The physicochemical properties of most fragment libraries today are compliant with the rule of three (Ro3), initially proposed by Congreve et al. in 2003.[42] However, both the structural and shape diversity of a fragment library are important additional factors to consider, and fragment libraries obtained synthetically or purchased from commercial suppliers are predominantly composed of small, flat heterocyclic molecules.[31,43] Natural product fragments have different chemotypes and are more populated with three-dimensional (3D) structures, thus offering an opportunity to improve the structural and shape diversity of fragment libraries and consequently to search different regions of biologically relevant chemical space and find structurally different hits.[44] We compared natural product fragments present in the Dictionary of Natural Products (DNP, sdf version 211.9) with commercial and synthetic fragments present in six different screening libraries (Asinex, ChemBridge, InterBioscreen (IBS), Molecular Libraries Small Molecule Repository (MLSMR), Life Chemicals, and Otava) using the following filtering criteria: molecular weight (MW) ≤ 250 Da, octanol–water partition coefficient (ALogP) < 4, hydrogen bond donors (HBD) ≤ 4, hydrogen bond acceptors (HBA) ≤ 5, rotatable bonds (RB) ≤ 6, percent polar surface area (%PSA) < 45. After a cleaning process which included salt stripping, structure normalization and standardization, ionization at pH 7.4 and the selection of organic molecules with MW ≥ 100 Da or heavy atom count ≥7 and sulfur atom counts ≤3,[45] we found that only 6.7% of the fragments from DNP were considered identical (Tanimoto score of 1) to all of the unique fragments present in the six commercially sourced fragment libraries, as determined by pairwise similarity comparisons using two-dimensional (2D) extended-connectivity fingerprints ECFP_6. 
In agreement with Hert et al.,[46] who noted that 83% of natural product ring scaffolds were absent in commercially available molecules, our substructure search analysis revealed that on average 91% of the scaffolds from the DNP fragments were absent in each of the commercially sourced fragment libraries. Our natural product fragment library contains 643 molecules with fragment-like physicochemical properties. Of these molecules, 602 (94%) comply strictly with the physicochemical property criteria initially set to guide the selection of these fragments (MW ≤ 250 Da, ALogP < 4, HBD ≤ 4, HBA ≤ 5, RB ≤ 6, %PSA < 45), while the remaining 41 constitute derivatives or structures that despite slightly violating one criterion or another were deemed of interest to screening. For the sake of accurate comparison to other generated fragment libraries, these 41 fragments were left out of the analyses discussed in this section, unless otherwise stated. The mean distributions among the different fragment libraries for selected physicochemical properties (Figure S1) show that natural-product-based fragments (our 602 natural product fragments and fragments from DNP) have on average the same number of rings but a markedly smaller number of aromatic rings than commercially available fragments (fragments present in Asinex, ChemBridge, IBS, MLSMR, Life Chemicals, and Otava screening libraries). Our natural product fragment library presents itself with the highest average number of HBDs and the lowest average of  ALogP and number of RBs (Figure S1). To characterize the 3D shape of the fragments contained within each library, the fraction of sp3-hybridized carbon atoms (Fsp3) and principal moments of inertia (PMI) were determined for each molecule. 
The plot of the average of the calculated sum of the normalized PMI ratios (PMIsum) against the average Fsp3 clearly differentiates fragment libraries based on commercial and synthetic compounds from those based on natural products, which occupy a space where the average Fsp3 and average PMIsum are higher (Figure 1a). However, the benefits of populating a fragment library with more 3D-shaped fragments are still controversial.[47−52] On one hand, 3D-shaped fragments have been suggested to be capable of sampling a larger chemical space[50] and of exploring the pocket space of targets more extensively than 2D-shaped fragments,[48] being more likely to deliver biologically active and clinically relevant molecules.[51,52] On the other hand, 3D-shaped fragments have raised questions about the implications that their implicit higher molecular complexity can have for hit rates[53] and for ease of synthesis and derivatization.[54] It has also been shown that three-dimensionality can be introduced, to different degrees depending on the connectivity of the fragment scaffold, during fragment elaboration.[47−49] While it is still unclear how important 3D character is for the core scaffold of fragments, we believe that shape diversity should exist in a fragment library. The 643-member natural product fragment library also contains a good representation of flat scaffolds (Bemis-Murcko molecular frameworks including rings, linkers, exocyclic double bonds, and terminal double bonds attached to the linkers), despite containing a higher proportion of 3D-shaped structures than other commercially available fragment libraries.
Figure 1

Fragment libraries shape (a) and structural (b) diversity. (a) Average sum of normalized PMI ratios (PMIsum) as a function of the average fraction of sp3-hybridized carbon atoms (Fsp3) calculated for each fragment library. The three principal moments of inertia (PMI) were determined for each 3D minimized structure. The normalized PMI ratios (NPR 1 and 2) were calculated by dividing the two lowest PMIs by the highest. NPR1 was added to NPR2 to give the PMIsum. Flat molecules give a PMIsum of 1. A PMIsum higher than 1 indicates a deviation from planarity toward a spherelike shape. (b) Percentage of unique Bemis-Murcko scaffolds as a function of the average number of unique fingerprint features per fragment, calculated as the total number of unique fingerprint features determined using 2D extended-connectivity fingerprints (ECFP_6) divided by the number of fragments in the library.

To assess the structural diversity of each fragment library, the percentage of unique fragment scaffolds (Bemis-Murcko molecular frameworks including rings, linkers, exocyclic double bonds, and terminal double bonds attached to the linkers) present in each library and the average number of unique fingerprint features per fragment (using 2D extended-connectivity fingerprints ECFP_6) were calculated. Figure 1b shows the plot of the former against the latter, demonstrating the high structural diversity achieved with our natural product fragment library.

Prevalence of Fragment Hit Scaffolds among Existing Antimalarial HTS Hits

To accelerate research efforts for antimalarial drug discovery, the structures of the hit molecules that resulted from different high-throughput screens (HTS) have been deposited in the public domain (http://www.ebi.ac.uk/chemblntd). Herein, the structures provided by St. Jude Children’s Research Hospital,[6] Novartis,[55,56] GlaxoSmithKline,[5] Harvard Medical School,[57] and MMV[58] were used to evaluate the representation of the scaffolds derived from our fragment hits within this antimalarial chemical space. Our 96 fragment hits contain 72 unique fragment scaffolds. Through a substructure search, each HTS hit was checked for the presence of one or more of these fragment scaffolds. Twenty-six (36%) of the scaffolds were found among these libraries. While 8 of the fragment scaffolds are common to all antimalarial HTS hit libraries, some are unique to specific active antimalarial libraries (Figure S2). Table 1 gives examples of some fragment scaffolds represented among the antimalarial HTS hits as well as the originating fragment. The structures of the active drug-sized molecules clearly show that these could have been developed through FBDD by fragment linking, growing, or merging (Table 1). While this provides a validation of our fragment screening approach, it shows that, with 46 (64%) of the fragment hit scaffolds remaining to be mapped within an antimalarial HTS hit molecule, many opportunities exist for fragment elaboration into structurally new active drug-sized molecules. A PMI plot of the unique scaffolds present in our natural product fragment library (325 fragment scaffolds), where those present within fragment hits are discriminated from those absent, is shown in Figure 2. Within our fragment screening campaign, fragment hits were identified across all types of scaffold shapes.
Table 1

Examples of Fragment Hit Scaffolds Present among Antimalarial HTS Hits

Figure 2

PMI of the fragment scaffolds (Bemis-Murcko molecular frameworks) contained in our fragment hit library (96 fragment hits, 72 unique fragment scaffolds) and comparison with those not contained in any fragment hit (253 unique fragment scaffolds). NPR1 constitutes the X axis, while NPR2 constitutes the Y axis.

Fragment Screening by Native MS

We performed a native MS screen of the natural product library against each of the 62 proteins. The complexes formed were ranked as strong, medium, or weak according to the ratio between the protein–ligand complex signal intensity and the unbound protein signal intensity (strong, ratio >0.5; medium, ratio 0.15 to 0.5; weak, ratio 0.02 to <0.15) (Figure 3).
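The ranking rule above can be written as a small Python sketch. The thresholds are those stated in the text; treating ratios below 0.02 as "none" and reading a single intensity per species are assumptions made here for illustration, and the function names are hypothetical.

```python
def binding_ratio(complex_intensity: float, free_intensity: float) -> float:
    """Ratio of protein-ligand complex signal to unbound protein signal."""
    if free_intensity <= 0:
        raise ValueError("free protein intensity must be positive")
    return complex_intensity / free_intensity


def classify_binding(ratio: float) -> str:
    """Map an intensity ratio to the strong / medium / weak categories."""
    if ratio > 0.5:
        return "strong"
    if ratio >= 0.15:
        return "medium"
    if ratio >= 0.02:
        return "weak"
    return "none"  # assumption: below the weak threshold counts as no binding


# Hypothetical peak intensities (arbitrary units).
example = classify_binding(binding_ratio(complex_intensity=3.2e6, free_intensity=1.0e7))
```

Note that a ratio of exactly 0.5 falls in the medium class and a ratio of exactly 0.15 is also medium, following the ranges as written.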
Figure 3

Native proteins and protein–ligand complexes. (a) PF3D7_1457200 thioredoxin and its strong signal complex with 10. (b) PF3D7_1457200 thioredoxin I and its medium signal complex with 16. (c) PF3D7_1457200 thioredoxin I and its weak complex with 8.

This ratio does not provide an absolute measure of the binding affinity but can be used to obtain a relative affinity ranking and provide rough KD estimates. Assuming that the ionization, transmission, and detection efficiencies of the protein–ligand complex and free protein are the same and that relative equilibrium concentrations in solution are preserved upon transfer to the gas phase, the higher the intensity of the protein–ligand complex in the mass spectrum relative to the free protein, the higher the protein occupancy in solution and therefore the higher the ligand affinity for the protein. In our experience[17,20] and considering the range of protein and ligand concentrations used throughout this work (1–20 μM protein, 5–400 μM fragment), it is estimated that strong and medium binders have KD < 100 μM. The categorization into strong, medium, and weak binding is particularly useful, together with hit rates, in assessing target ligandability, the ability of a protein target to bind small molecules with high affinity.[24] Taking into account the ligandability scoring system proposed by Edfeldt and colleagues,[24] we have defined highly ligandable targets (with a high ligandability score) as those with ≥2 strong/medium binders and ≥7 hits (greater than 1% of the library); medium ligandability for proteins with at least 1 strong/medium binder and ≥2 hits; and low ligandability for proteins having at least 1 binder (Figure 4). The results clearly identify some targets that are more worthwhile to pursue on the basis of ligandability.
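Two ideas in the paragraph above lend themselves to a short sketch: a rough KD estimate from the intensity ratio, and the ligandability tiers. The KD function below is a textbook single-site (1:1) equilibrium calculation under the stated assumption that gas-phase intensities mirror solution concentrations; it is not necessarily the procedure used by the authors, and the concentrations in the example are hypothetical values within the quoted ranges. The tier function follows the criteria as written, with "none" added here as an assumption for proteins without hits.

```python
def estimate_kd_um(ratio: float, protein_total_um: float, ligand_total_um: float) -> float:
    """Rough KD (uM) for 1:1 binding, where ratio = [PL]/[P] read from the spectrum.

    [PL] = P_total * ratio / (1 + ratio);  [L] = L_total - [PL];  KD = [P][L]/[PL] = [L]/ratio.
    """
    if ratio <= 0:
        raise ValueError("ratio must be positive")
    bound = protein_total_um * ratio / (1.0 + ratio)
    free_ligand = ligand_total_um - bound
    if free_ligand <= 0:
        raise ValueError("ligand is depleted; inputs are inconsistent with 1:1 binding")
    return free_ligand / ratio


def ligandability(strong: int, medium: int, weak: int) -> str:
    """Ligandability tier from the number of strong, medium, and weak binders."""
    hits = strong + medium + weak
    strong_or_medium = strong + medium
    if strong_or_medium >= 2 and hits >= 7:
        return "high"
    if strong_or_medium >= 1 and hits >= 2:
        return "medium"
    if hits >= 1:
        return "low"
    return "none"  # assumption: label for proteins with no binders


# Hypothetical example: ratio 0.5 with 10 uM protein and 50 uM fragment.
kd_example = estimate_kd_um(ratio=0.5, protein_total_um=10.0, ligand_total_um=50.0)
tier_example = ligandability(strong=1, medium=1, weak=5)
```

In this illustrative case the estimate comes out near 93 μM, which shows how a ratio at the strong/medium boundary can correspond to a KD of the order quoted in the text for typical screening concentrations.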
Figure 4

Summary of 32 proteins that formed complexes with 96 natural products. Number of strong binders (ratio >0.5), medium (ratio 0.15 to 0.5), and weak (ratio 0.02 to <0.15) shown for each protein. Ligandability is based on high (≥2 strong/medium binders and ≥7 hits), medium (at least 1 strong/medium binder and ≥2 hits), and low (at least 1 hit).

Ninety-six compounds (Figure 5) formed complexes with 32 proteins (Figure 6). Of the 96 fragments found to have protein partners, 48 fragments bound 1 protein, 18 bound 2 proteins, 14 bound 3 proteins, 4 bound 4 proteins, 5 bound 5 proteins, and 7 bound 6 proteins. Compounds that formed complexes with more than 6 proteins were considered promiscuous and were excluded from further analysis. The retained hits are highly selective because they were screened against a total of 62 proteins: 48 fragments did not bind to 61 other proteins, 18 compounds did not bind to 60 other proteins, 14 compounds did not bind to 59 other proteins, 4 compounds did not bind to 58 other proteins, 5 compounds did not bind to 57 other proteins, and 7 compounds did not bind to 56 other proteins.
Figure 5

Structures of 96 low-molecular-weight natural product binders. * indicates a derivative of the indicated natural product. See Table S3 for details.

Figure 6

Heat map showing hits from the fragment library (643 members) against 32 proteins. Green indicates strong binding (protein–ligand complex/free protein ratio is greater than 50%). Blue indicates medium binding (protein–ligand complex/free protein ratio is greater than 15% and smaller than 50%). Orange indicates weak binding (protein–ligand complex/free protein ratio is from 2 to 15%). White indicates no binding. The proteins are shown in Table S1.

Although it is generally thought that molecules containing an electrophilic warhead (such as Michael acceptors) react indiscriminately and form covalent bonds with a variety of proteins, several studies have shown that such is not the case.[59,60] A molecule containing a weak to moderate electrophilic warhead will bind covalently to a nucleophilic protein residue only if it has first bound noncovalently to the target protein and if the reactive electrophile has been properly oriented within the target protein so that a bond can be formed with the nucleophile on the protein.[59,60] Many of our fragment hits contain electrophilic groups and are selective for a limited number of proteins (for example, see 15–20). Moreover, our experimental data show that not all PAINS-containing molecules (such as polyphenols, β-lactams, epoxides, and rhodanine-related compounds) are promiscuous (for example, see 27, 50, 68, and 71).[61,62] To better display the data obtained from our MS screening, we built a map that shows the relationship between fragments and proteins (Figure 7). Considering hits and proteins as nodes, we connected them to each other based on the binding interactions from Figure 6: the result is a network with 96 fragments. As mentioned above, 48 fragments bound a single protein, so the remaining 48 bound more than 1 protein.
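The fragment–protein map described above is a bipartite network, and its construction can be sketched with plain Python dictionaries. The authors used Cytoscape for visualization; the code below is only an illustrative data-structure sketch, and the fragment and protein identifiers are hypothetical placeholders rather than results from the screen.

```python
from collections import defaultdict


def build_network(hits):
    """Build two adjacency maps from (fragment, protein, strength) tuples."""
    by_fragment = defaultdict(dict)
    by_protein = defaultdict(dict)
    for fragment, protein, strength in hits:
        by_fragment[fragment][protein] = strength  # edge label: strong / medium / weak
        by_protein[protein][fragment] = strength
    return by_fragment, by_protein


def connecting_fragments(by_fragment):
    """Fragments bound to more than one protein, i.e. nodes that link proteins together."""
    return sorted(f for f, proteins in by_fragment.items() if len(proteins) > 1)


# Hypothetical hit list (identifiers are placeholders).
hits = [
    ("F1", "protein_A", "strong"),
    ("F2", "protein_A", "weak"),
    ("F2", "protein_B", "medium"),
    ("F3", "protein_B", "weak"),
]
by_fragment, by_protein = build_network(hits)
linkers = connecting_fragments(by_fragment)
selective = sorted(f for f, proteins in by_fragment.items() if len(proteins) == 1)
```

Keeping both adjacency maps makes it cheap to ask either question: which proteins a fragment binds (selectivity) and which fragments a protein binds (hit count per target).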
Figure 7

Network visualization of 96 fragment hits against 32 proteins. The network was visualized using the Cytoscape software.[63] Circle nodes represent fragment hits, and rectangular nodes represent proteins. The edge represents interaction between fragment hit and protein. The green edge indicates strong binding, blue indicates medium binding, and orange indicates weak binding, according to Figure 6.

Figure 8 shows a limited part of the complete map with 54 fragments and 6 proteins connected to each other through 9 fragments. Twenty-seven hits bind PF3D7_1127100 dUTPase, and six of these hits also bind four other proteins. Fragments 2, 7, and 54 bind PVX_099895 guanylate kinase, 32 binds PF3D7_1216200 glycerol-3-phosphate dehydrogenase, and 60 binds PF3D7_1204300 eukaryotic translation initiation factor 5A (eIF5a). Joint hits most likely identify a similar binding interaction in each protein given the principle that fragments identify single binding sites.
Figure 8

Enlarged view of Figure 7. Circle nodes represent fragment hits, and rectangular nodes represent proteins. The edge represents the interaction between fragment hit and protein. In this portion of Figure 7, nine fragments are the connecting points among six proteins.

The 96 natural product fragments have different chemotypes and are different from the known antimalarial aminoquinolines, quinolones, or diamidines. An analysis of hits by structural similarity using Tanimoto measurements with a cutoff at 0.75 showed different structure–activity relationships. The two major clusters consist of the sesquiterpene lactones (10, 15–20, 22–24, 27–29, 33, 36, 39–40), which share the same backbone carbon structure as the current most effective antimalarial drug artemisinin. This structural class is known to kill P. falciparum parasites at submicromolar concentration.[64] The third largest cluster is the 2,5-disubstituted oxazolones with seven fragments (61, 67–72). There are 6 securinine analogues (80, 81, 86–89) and 40 structural singletons. Seventy-nine (79) of the 96 fragments had in vitro activity against asexual intraerythrocytic blood stage P. falciparum 3D7 parasites at 100 μM, determined IC50 values, or reported activity in the literature. This corresponded to active compounds against 31 of the 32 proteins (Figure , Table S3). IC50 values could be calculated for 24 compounds, with 13 having IC50 < 45 μM. Compounds 5,[65] 6,[65] 7,[66] 9,[67] 27,[68] and 85[69] have previously been reported to have anti-plasmodial activity. Compound 9 had weak activity against P. falciparum 3D7 (61% inhibition at 100 μM) but had a previously reported IC50 of 9 μM against chloroquine-sensitive (D6, Sierra Leone) and chloroquine-resistant (W2) lines of P. falciparum.[67] The IC50 values of the other compounds correlated with previously reported values. We have previously reported that 80, 81, and 87–89 bind allosterically to recombinant P.
falciparum 2′-deoxyuridine 5′-triphosphate nucleotidohydrolase (PfdUTPase, PF3D7_1127100), enhancing enzyme activity and inhibiting the in vitro viability of both P. falciparum gametocyte (sexual) and blood (asexual) stage parasites.[21] We identified 31 malaria proteins that formed a complex with a fragment showing in vitro activity against P. falciparum. These proteins warrant further investigation as possible drug targets in Plasmodium. Only one protein had a binder that did not show in vitro activity against P. falciparum: adenylosuccinate synthetase (AdSS) (PF13_0287). Fragment-based drug discovery is based on the identification of low-affinity ligands. The starting point for vemurafenib[27,28] was an IC50 in the mM range with low affinity at ∼200 μM. Similarly, the starting point for venetoclax (ABT-199) was two fragment hits with KD = 400 and 2000 μM that were linked to initially give a compound with an IC50 of 6.9 μM on the way to the development of the approved drug.[26,29] In comparison, direct growth inhibition of P. falciparum at 100 μM is significant, and the fact that 14 compounds had IC50 values <45 μM is highly encouraging.

Conclusions

We demonstrated that enriching fragment screening libraries with natural-product-based fragments provides the means not only to increase the structural diversity of the library but also to improve the content of 3D-shaped structures. Increasing the structural diversity of fragment libraries with chemotypes usually absent in commercially available libraries has allowed searching different regions of biologically relevant chemical space and finding structurally different hits. We have presented a native mass spectrometry method that can recognize native proteins and find a binder concurrently. We have demonstrated the value of native MS screening in fragment-based drug discovery on cloned and purified proteins. We identified 31 proteins worthy of further investigation as anti-plasmodial targets having 79 low-molecular-weight natural product hits with in vitro antimalarial activity out of a total of 96 natural product hits using native MS. This adds a fragment library to the already-published HTS active libraries in the public domain and may facilitate fragment-based drug discovery for malaria.

Material and Methods

Cheminformatics Analysis

Fragment Generation

Fragments from DNP (sdf version 211.9; 166 379 molecules), Asinex (gold and platinum collection; 296 537 molecules), ChemBridge (express-pick collection stock; 459 091 molecules), InterBioscreen (IBS) (synthetic compounds library; 467 251 molecules), the Molecular Libraries Small Molecule Repository (MLSMR) (325 164 molecules), Life Chemicals (general HTS collection; 389 305 molecules), and Otava (collection for prompt delivery; 271 710 molecules) libraries were obtained using a series of filters available in Scitegic Pipeline Pilot (version 9.1.0.1, Accelrys Software, Inc.), which first cleaned up the structures (including salt stripping and selection of the largest fragment, structure normalization and standardization, ionization at pH 7.4, and selection of organic molecules with MW ≥ 100 Da or heavy atom count ≥7 and sulfur atom counts ≤3) and then selected those with fragment-like physicochemical properties (MW ≤ 250 Da, atomic-based prediction of the octanol–water partition coefficient (ALogP) < 4, HBD ≤ 4, HBA ≤ 5, rotatable bonds (RB) ≤ 6, percent polar surface area (%PSA) < 45). After the removal of duplicate molecules using canonical SMILES, 20 510 DNP fragments, 14 484 Asinex fragments, 24 291 ChemBridge fragments, 17 271 IBS fragments, 20 783 MLSMR fragments, 3682 Life Chemicals fragments, and 17 769 Otava fragments were obtained.
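The physicochemical selection step can be expressed as a simple predicate. The sketch below is in Python rather than Pipeline Pilot, assumes the six descriptors have already been computed by some other tool, and covers only the fragment-like property filter (not the structure clean-up). The class name, field names, and the two example molecules are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Descriptors:
    mw: float       # molecular weight, Da
    alogp: float    # predicted octanol-water partition coefficient
    hbd: int        # hydrogen bond donors
    hba: int        # hydrogen bond acceptors
    rb: int         # rotatable bonds
    pct_psa: float  # percent polar surface area


def is_fragment_like(d: Descriptors) -> bool:
    """True if all six criteria quoted in the text are met."""
    return (
        d.mw <= 250
        and d.alogp < 4
        and d.hbd <= 4
        and d.hba <= 5
        and d.rb <= 6
        and d.pct_psa < 45
    )


# Hypothetical descriptor values for two molecules.
candidates = {
    "mol_1": Descriptors(mw=217.3, alogp=1.8, hbd=1, hba=3, rb=2, pct_psa=22.0),
    "mol_2": Descriptors(mw=312.4, alogp=2.9, hbd=2, hba=4, rb=5, pct_psa=30.0),
}
selected = [name for name, d in candidates.items() if is_fragment_like(d)]
```

Mixing inclusive (≤) and strict (<) bounds mirrors the criteria exactly as listed, so a molecule of exactly 250 Da passes while one with ALogP of exactly 4 does not.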

Determination of the Fraction of sp3-Hybridized Carbon Atoms (Fsp3) and Principal Moments of Inertia (PMI)

The fraction of sp3-hybridized carbon atoms (Fsp3) of each fragment was determined with Scitegic Pipeline Pilot (version 9.1.0.1, Accelrys Software, Inc.). Principal moments of inertia (PMI) were determined in Maestro (version 10.4.017, Schrödinger Inc., NY) after 3D structure minimization in Scitegic Pipeline Pilot. The normalized PMI ratios (NPR1 and NPR2) were calculated by dividing each of the two lowest PMIs by the highest, as described by Sauer et al.[70] NPR1 was added to NPR2 to give the PMIsum.
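As a minimal sketch of the normalization step, the function below takes the three principal moments of one conformer (assumed to have been computed already, for example in Maestro) and returns NPR1, NPR2, and their sum. The function name is hypothetical.

```python
def normalized_pmi_ratios(pmis):
    """Return (NPR1, NPR2, PMIsum) from three principal moments of inertia."""
    i1, i2, i3 = sorted(pmis)      # i1 <= i2 <= i3
    if i3 <= 0:
        raise ValueError("the largest principal moment must be positive")
    npr1 = i1 / i3                 # lowest moment divided by the highest
    npr2 = i2 / i3                 # middle moment divided by the highest
    return npr1, npr2, npr1 + npr2
```

On the usual PMI triangle, an ideal rod sits at (0, 1), an ideal disc at (0.5, 0.5), and an ideal sphere at (1, 1), so the sum runs from 1 for flat or linear shapes up to 2 for fully spherical ones.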

Generation of Fragment Scaffolds

Within Scitegic Pipeline Pilot, the “generate scaffold tree” component was first used to generate hierarchical ring scaffolds according to the procedure described in ref (71). Then the first-level scaffolds, which correspond to the Bemis-Murcko molecular frameworks (rings, linkers, exocyclic double bonds, and terminal double bonds attached to the linkers), were selected.[72] After the removal of duplicate molecules using canonical SMILES, the following numbers of unique fragment scaffolds were obtained: 3958 DNP, 5163 ChemBridge, 3779 Asinex, 5319 IBS, 1374 Life Chemicals, 6824 MLSMR, 3704 Otava, and 325 and 72 for our global and hit fragment libraries, respectively. The percentage of unique fragment scaffolds was calculated by dividing the number of unique scaffolds by the number of fragments in the library.
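The percentage calculation can be reproduced directly from the counts reported in this section. The short sketch below uses the fragment and scaffold numbers given above for the seven external libraries; the in-house global and hit libraries are omitted because their fragment counts are not stated here. Variable and function names are hypothetical.

```python
# Fragment counts after duplicate removal, as reported under Fragment Generation.
fragments = {
    "DNP": 20510, "Asinex": 14484, "ChemBridge": 24291, "IBS": 17271,
    "MLSMR": 20783, "Life Chemicals": 3682, "Otava": 17769,
}

# Unique first-level (Bemis-Murcko) scaffold counts, as reported above.
scaffolds = {
    "DNP": 3958, "Asinex": 3779, "ChemBridge": 5163, "IBS": 5319,
    "MLSMR": 6824, "Life Chemicals": 1374, "Otava": 3704,
}


def percent_unique_scaffolds(n_scaffolds, n_fragments):
    """Unique scaffolds as a percentage of the fragments in a library."""
    if n_fragments <= 0:
        raise ValueError("library must contain at least one fragment")
    return 100.0 * n_scaffolds / n_fragments


percentages = {
    name: percent_unique_scaffolds(scaffolds[name], fragments[name])
    for name in fragments
}

for name, pct in sorted(percentages.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{name}: {pct:.1f}%")
```

For DNP, for example, this is 3958/20 510, or about 19%.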

Determination of the Average Number of Unique Fingerprint Features per Fragment

Using Scitegic Pipeline Pilot, each molecule was represented by its extended-connectivity fingerprint, ECFP_6, a 2D circular topological fingerprint where integer identifiers which account for different atom properties (atom type, mass, connection count, etc.) are first assigned to each heavy atom of the molecule and then are combined with identifiers of neighboring atoms in an iterative process until a diameter of six bonds is reached. This iteration process is based on the Morgan Algorithm[73] and converts the substructural features around each atom into integer values which constitute the extended-connectivity fingerprint. The average number of unique fingerprint features per fragment was determined by counting the total number of unique ECFP_6 fingerprint features and dividing it by the total number of molecules.
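The averaging step can be illustrated as follows, assuming each fragment's ECFP_6 fingerprint is already available as a set of integer feature identifiers (fingerprint generation itself is not reproduced here). The function name is hypothetical.

```python
def mean_unique_features(fingerprints):
    """Library-wide count of distinct features divided by the number of molecules."""
    fingerprints = list(fingerprints)
    if not fingerprints:
        raise ValueError("at least one fingerprint is required")
    unique_features = set().union(*fingerprints)   # distinct identifiers across the library
    return len(unique_features) / len(fingerprints)
```

Because a feature shared by many fragments is counted only once, a library built around a few recurring substructures scores lower than an equally sized library with more varied chemotypes.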

Cleanup of Active Antimalarial HTS Hits

St. Jude Children’s Research Hospital (1523 molecules),[6] Novartis (5615 molecules),[56] GlaxoSmithKline (GSK) (13 467 molecules),[5] Harvard Medical School (35 molecules),[57] and Medicines for Malaria Venture (MMV) (400 molecules)[58] data sets were processed according to the following workflow in Scitegic Pipeline Pilot: salt stripping, selection of the largest fragment in the molecule, structure normalization and standardization, ionization at pH 7.4, selection of organic molecules (molecules containing only organic atoms H, C, N, O, P, S, F, Cl, Br, or I), and removal of duplicate molecules using canonical SMILES. After cleanup, the St. Jude Children’s Research Hospital set contained 1523 “cleaned” molecules, Novartis 5464, GSK 13 458, Harvard Medical School 35, and MMV 400. These were the structures searched for the presence of fragment scaffolds derived from our fragment hit library.

Substructure Search

The fragment scaffolds were used as queries for the substructure search within the molecular structures of interest. Using Scitegic Pipeline Pilot, each query was mapped against all of the molecules to be searched. For each molecule successfully mapped, the number and name of queries mapped were retrieved.

Pairwise Similarity Comparisons

The similarity between two molecules was determined by calculating the similarity between their respective ECFP_6 fingerprints using the Tanimoto function: Tanimoto coefficient = c/(a + b − c), where a is the number of bits present in molecule A, b is the number of bits present in molecule B, and c is the number of bits present in both A and B.
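In set terms, the Tanimoto coefficient is the number of features shared by the two fingerprints divided by the number of features present in either one. A minimal sketch, assuming fingerprints are held as sets of feature identifiers (the function name and the convention for two empty fingerprints are assumptions):

```python
def tanimoto(fp_a, fp_b):
    """Tanimoto coefficient between two fingerprints given as feature sets."""
    a, b = set(fp_a), set(fp_b)
    in_either = len(a | b)           # features present in A, B, or both
    if in_either == 0:
        return 0.0                   # assumed convention when both are empty
    return len(a & b) / in_either    # shared features over features in either


# Two hypothetical fingerprints sharing two of four distinct features.
similarity = tanimoto({1, 2, 3}, {2, 3, 4})
```

The value ranges from 0 for fingerprints with no features in common to 1 for identical fingerprints.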

Protein Cloning, Purification, and Purity Analysis

Recombinant histidine-tagged enzymes from P. falciparum and P. vivax were expressed in Escherichia coli. In general, genes were cloned into expression vectors that enabled tagging of the corresponding proteins with an N-terminal 6-histidine tag, as previously described.[67,68] In some cases, truncated genes (missing some nucleotides at the 5′ and/or 3′ ends) were cloned to improve the chances of expression. Proteins were purified using a nickel column (immobilized-metal affinity chromatography, IMAC), flash frozen, stored at −80 °C, and shipped on dry ice. Residues 61–204 of P. falciparum Myosin A tail domain interacting protein (MTIP) were expressed and purified as published.[68]

Native MS Screening

Protein Preparation

The proteins were buffer-exchanged into a suitable volatile buffer (ammonium acetate or ammonium bicarbonate) under nearly physiological conditions (pH 7.0) using size exclusion chromatography prior to ESI–MS analysis. Depending on the protein, the buffer and its concentration were chosen to obtain the highest sensitivity in the mass spectrometer. Starting conditions were 10 μM protein in 10 mM volatile buffer.

Instrument Control and Acquisition

Experiments were performed on a Bruker Apex III 4.7T or a Bruker SolariX XR 12T FT-ICR mass spectrometer (Bruker Daltonics Inc., Billerica, MA). An Agilent electrospray tuning mix (G2431A) was used for instrument calibration. Instrument parameters were tuned to avoid the dissociation of noncovalent protein–ligand complexes and achieve the highest signal intensities. As a control, a bovine carbonic anhydrase II (bCAII) sulfanilamide complex was used. Mass spectra were recorded in positive ion and profile modes with a mass range from 50 to 6000 m/z. Each spectrum was a sum of 16 transients (scans) composed of 256 K (Bruker Apex III 4.7T) or 1 M (Bruker SolariX XR 12T) data points. All aspects of pulse sequence control and data acquisition were controlled by Xmass (Bruker Apex III 4.7T) or Solarix control (Bruker SolariX XR 12T) software in a Windows operating system. For fragment library screening, pools of eight fragments (1 μL of each fragment at 5 mM in MeOH) were dried, resuspended in 1 μL MeOH, and incubated with proteins (1–20 μM) for 1 h at room temperature and analyzed by ESI-FT-ICR-MS. Final fragment/protein molar ratios varied between 5:1 and 20:1. When a noncovalent complex was found, the molecular weight of the binding fragment was calculated from the spectrum using the following equation: MWligand = Δm/z × z. The binding of the individual active fragment was confirmed in a separate experiment (Figure ).
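The mass calculation MWligand = Δm/z × z can be sketched as below. The peak positions, charge state, pool contents, and matching tolerance are illustrative assumptions, not values from this study, and the function names are hypothetical.

```python
def ligand_mass(mz_free, mz_complex, charge):
    """MW of the bound ligand from two peaks at the same charge state."""
    if charge <= 0:
        raise ValueError("charge state must be a positive integer")
    delta_mz = mz_complex - mz_free   # shift between free protein and complex
    return delta_mz * charge


def match_pool(observed_mass, pool_masses, tol=0.5):
    """Names of pool members whose MW lies within tol Da of the observed mass."""
    return [
        name for name, mw in pool_masses.items()
        if abs(mw - observed_mass) <= tol
    ]


# Hypothetical example: a 10+ ion shifted by 17.22 m/z units on ligand binding.
mass = ligand_mass(mz_free=2901.00, mz_complex=2918.22, charge=10)
candidates = match_pool(mass, {"fragment_1": 172.2, "fragment_2": 210.3})
```

Matching the calculated mass against the eight members of a pool narrows the binder to a candidate fragment, which is then confirmed by screening that fragment on its own.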

Fragment Library

Approximately half of the compounds were previously isolated in our laboratory and are >90% pure. The other half were purchased. The cited papers (Table S3) are listed to provide easy access to the original publications that reported the compounds.

Asexual Blood Stage Assay

Drug-sensitive (3D7) and drug-resistant (Dd2) P. falciparum lines were tested using an established radiometric growth inhibition assay. Briefly, P. falciparum-infected erythrocytes (2.5% hematocrit, 0.5% parasitaemia) were incubated in 96-well plates containing test compounds or DMSO as a control. After 48 h, 3H-hypoxanthine (0.5 μCi/well) was added, and the plates were incubated for another 24 h, followed by an assessment of 3H incorporation.
