| Literature DB >> 18000556 |
Sanjay Kumar1, Kshitiz Chaudhary, Jeremy M Foster, Jacopo F Novelli, Yinhua Zhang, Shiliang Wang, David Spiro, Elodie Ghedin, Clotilde K S Carlow.
Abstract
We report results from the first genome-wide application of a rational drug target selection methodology to a metazoan pathogen genome, the completed draft sequence of Brugia malayi, a parasitic nematode responsible for human lymphatic filariasis. More than 1.5 billion people worldwide are at risk of contracting lymphatic filariasis and onchocerciasis, a related filarial disease. Drug treatments for filariasis have not changed significantly in over 20 years, and with the risk of resistance rising, there is an urgent need for the development of new anti-filarial drug therapies. The recent publication of the draft genomic sequence for B. malayi enables a genome-wide search for new drug targets. However, there is no functional genomics data in B. malayi to guide the selection of potential drug targets. To circumvent this problem, we have utilized the free-living model nematode Caenorhabditis elegans as a surrogate for B. malayi. Sequence comparisons between the two genomes allow us to map C. elegans orthologs to B. malayi genes. Using these orthology mappings and by incorporating the extensive genomic and functional genomic data, including genome-wide RNAi screens, that already exist for C. elegans, we identify potentially essential genes in B. malayi. Further incorporation of human host genome sequence data and a custom algorithm for prioritization enables us to collect and rank nearly 600 drug target candidates. Previously identified potential drug targets cluster near the top of our prioritized list, lending credibility to our methodology. Over-represented Gene Ontology terms, predicted InterPro domains, and RNAi phenotypes of C. elegans orthologs associated with the potential target pool are identified. By virtue of the selection procedure, the potential B. malayi drug targets highlight components of key processes in nematode biology such as central metabolism, molting and regulation of gene expression.Entities:
Mesh:
Substances:
Year: 2007 PMID: 18000556 PMCID: PMC2063515 DOI: 10.1371/journal.pone.0001189
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Selection methodology.
Venn diagrams summarizing the reduction in search space achieved by selection of “essential” B. malayi gene products (left), and subsequent refinement of targets (right). Overlapping areas (not to scale) represent orthologous sequences (Materials and Methods). Numbers indicate gene products.
Over-represented GO terms in the target pool.
| GO Hierarchy | GO Term ID | GO Term | Freq. in Geneome | Freq. in Targets |
|
| GO:0005737 | Cytoplasm | 349 | 41 |
| GO:0005739 | Mitochondria | 31 | 10 | |
| GO:0030054 | Cell junction | 13 | 7 | |
| GO:0005911 | Intracellular junction | 13 | 7 | |
| GO:0005921 | Gap junction | 13 | 7 | |
|
| GO:0005198 | Structural molecule activity | 192 | 24 |
| GO:0042302 | Structural constituent of cuticle | 46 | 17 | |
| GO:0015077 | Monovalent inorganic cation activity | 32 | 11 | |
| GO:0015078 | Hydrogen ion transporter activity | 32 | 11 | |
|
| GO:0006811 | Ion transport | 233 | 30 |
| GO:0006820 | Anion transport | 91 | 18 | |
| GO:0015698 | Inorganic anion transport | 85 | 18 | |
| GO:0006817 | Phosphate transport | 81 | 18 |
GO term over representation was calculated as described in Materials and Methods. A minimum significance of p<0.05 was required. The fractions indicate the frequency of the term in the entire predicted B. malayi proteome (of 11771) and the target pool (of 589), respectively.
Prioritization factors and relative weighting scheme.
| Criteria | Description | Weight | Observed Range |
| Homology and protein length ratio | Present in |
| 0…226 |
| Present in |
| −61…0 | |
| Essentiality | Severity and reproducibility of the RNAi phenotype of the |
| 0…230 |
| Stage specific expression | Presence of specific ESTs in all stages (microfilariae, L2, L3, L4 and adults) | +10 | 0…10 |
| Presence of ESTs in adults | +7 | ||
| Presence of ESTs in L4 | +5 | ||
| Presence of ESTs in L1 | +4 | ||
| Presence of ESTs in L3 | +3 | ||
| Presence of ESTs in L2 | +1 | ||
| Druggability | Presence of LR5 druggable domain | +50 | 0…50 |
| Presence of druggable E.C. number | +50 | 0…50 | |
| Expressability | GRAVY score measuring hydropathicity and expressability |
| −21…0 |
Total scores (range −52 … 275) for each target were obtained by summing the individual weights.
b, maximum bit score of the B. malayi : C. elegans protein alignment; b, maximum bit score of the B. malayi : H. sapiens protein alignment; l, number of amino acids in B. malayi and C. elegans homologs respectively.
r, number of instances an RNAi phenotype has been reported for the gene in wormbase; d degree of severity (0–100) assigned to a phenotype bin i; n, number of RNAi experiments reported for a particular gene.
maximum value, irrespective of expression data in other stages/categories (non-additive).
g, GRAVY score for the B. malayi protein.
Figure 2Frequencies of binned RNAi phenotypes in C. elegans orthologs of B. malayi targets.
Observed frequencies were all statistically over- or under-represented relative to expected frequency in the whole genome based on a hypergeometric distribution (P values<1e-3). The entire set of observed values was statistically different from the background (expected) values as measured by a G-test (two sided P value = 5.9e-21).
Ranked listing of the top 40 predicted drug targets.
| Score |
|
|
|
| 275 |
| PAN domain containing protein |
|
| 248 |
| PAN domain containing protein |
|
| 248 | Bm1_45135 | Conserved hypothetical protein, putative |
|
| 179 |
| chitin synthase 1, chs-1 |
|
| 172 | Bm1_36850 | hypothetical protein | C01B10.11 |
| 172 |
| RNA dependent RNA polymerase family protein |
|
| 157 | Bm1_15245 | RH17657p-related | C25H3.9 |
| 157 | Bm1_43465 | Temporarily assigned gene name protein 40, putative |
|
| 154 | Bm1_38120 | hypothetical protein | W04G3.8 |
| 151 | Bm1_35395 | Acyltransferase family protein | T14D7.2 |
| 143 | Bm1_36765 | SD01790p-related | Y41E3.1 |
| 141 | Bm1_25640 | hypothetical protein | ZC247.1 |
| 140 |
| hypothetical protein |
|
| 133 | Bm1_49915 | conserved hypothetical protein | K07A12.7 |
| 125 | Bm1_45670 | WH2 motif family protein | C34E10.11 |
| 123 |
| conserved hypothetical protein |
|
| 119 | Bm1_46940 | hypothetical protein | C52A11.2 |
| 116 | Bm1_38110 | hypothetical protein | W04G3.2 |
| 115 | Bm1_32730 | LBP/BPI/CETP family, C-terminal domain containing protein | C06G1.1 |
| 114 | Bm1_42470 | hypothetical protein | B0491.5 |
| 112 | Bm1_55705 | Conserved hypothetical protein, putative | B0205.11 |
| 110 | Bm1_38105 | hypothetical protein | W04G3.3 |
| 108 | Bm1_38425 | 3′-5′ exonuclease family protein | C10G6.1 |
| 107 | Bm1_43740 | conserved hypothetical protein | T14D7.2 |
| 107 | Bm1_19285 | Innexin family protein |
|
| 106 | Bm1_51995 | LBP/BPI/CETP family, C-terminal domain containing protein | F44A2.3 |
| 105 |
| Fatty acid desaturase family protein |
|
| 105 | Bm1_02135 | ribosomal protein L9 domain containing protein | B0205.11 |
| 103 | Bm1_03880 | hypothetical protein | Y71G12B.13 |
| 100 | Bm1_35075 | Innexin inx-3, putative |
|
| 99 | Bm1_31660 | hypothetical protein | C55C3.5 |
| 98 |
| hypothetical protein |
|
| 98 | Bm1_09270 | Skp1 related (ubiquitin ligase complex component) protein 18-like |
|
| 97 | Bm1_50630 | hypothetical protein | T19B10.2 |
| 96 |
|
|
|
| 96 | Bm1_39265 | GH05862p-related | F42G8.10 |
| 91 |
| amine oxidase, flavin-containing-related | H04M03.4 |
| 88 | Bm1_08915 | hypothetical protein |
|
| 84 | Bm1_16245 | symbol-related | ZK809.3 |
| 82 | Bm1_33575 | hypothetical protein | ZK899.2 |
Previously identified targets are shown with bold identifiers. Manually added annotations are shown in italics. B. malayi pub locus and descriptions are from Ghedin et al. [11]. C. elegans gene names and RNAi phenotypes are from Wormbase.
|
|
|
|---|---|
| larval/adult lethality/arrest | 100 |
| embryonic lethality | 90 |
| morphology defect | 80 |
| growth defect | 70 |
| movement defect | 60 |
| vulval/egg laying defect | 50 |
| other/unclassified | 10 |
| wild-type | 0 |