| Literature DB >> 19943957 |
Alexander G Holman1, Paul J Davis, Jeremy M Foster, Clotilde K S Carlow, Sanjay Kumar.
Abstract
BACKGROUND: Wolbachia (wBm) is an obligate endosymbiotic bacterium of Brugia malayi, a parasitic filarial nematode of humans and one of the causative agents of lymphatic filariasis. There is a pressing need for new drugs against filarial parasites, such as B. malayi. As wBm is required for B. malayi development and fertility, targeting wBm is a promising approach. However, the lifecycle of neither B. malayi nor wBm can be maintained in vitro. To facilitate selection of potential drug targets we computationally ranked the wBm genome based on confidence that a particular gene is essential for the survival of the bacterium.Entities:
Mesh:
Year: 2009 PMID: 19943957 PMCID: PMC2794283 DOI: 10.1186/1471-2180-9-243
Source DB: PubMed Journal: BMC Microbiol ISSN: 1471-2180 Impact factor: 3.605
DEG Members
| Organism Name | Taxon ID | Ess. Genes | Refseq Gene Count | % Ess. |
|---|---|---|---|---|
| 202950 | 499 | 3325 | 15% | |
| 224308 | 271 | 4105 | 7% | |
| 511145 | 712 | 4132 | 17% | |
| 401614 | 392 | 1719 | 23% | |
| 71421 | 642 | 1657 | 39% | |
| 85962 | 323 | 1576 | 20% | |
| 83332 | 614 | 3989 | 15% | |
| 243273 | 381 | 477 | 80% | |
| 272635 | 310 | 782 | 40% | |
| 208963 | 335 | 5892 | 6% | |
| 99287 | 230 | 4527 | 5% | |
| 158879 | 302 | 2619 | 12% | |
| 171101 | 133 | 2043 | 12% | |
| 170187 | 111 | 2105 | 12% | |
| 243277 | 5 | 3835 | 0% |
(γ): γ-proteobacteria, (B): bacilli, (ϵ): ϵ-proteobacteria, (A): actinobacteria, (M): mollicutes.
Figure 1Distribution of MHS values by rank in . The X-axis indicates the 805 protein coding genes in the wBm genome, ranked by MHS. The Y-axis shows the value of the MHS for each protein.
Figure 2E-values of the BLAST alignments producing the top 20 MHS. The black bars indicate the e-value of the best alignment to each organism within DEG. The y-axis is a linear scale of the negative log10 of the e-value, ranging from 1 to a maximal alignment of 200. The x-axis bins correspond to the 15 organisms contained within DEG.
Top 20 wBm genes ranked by MHS. Annotations taken from the Refseq release of the wBm proteome.
| Rank | MHS | GI | Annotation |
|---|---|---|---|
| 1 | 0.772 | 58584904 | DNA-directed RNA polymerase: RpoB/RpoC |
| 2 | 0.733 | 58584602 | Translation elongation factor GT-Pase: FusA |
| 3 | 0.656 | 58585021 | DNA gyrase, topoisomerase II, B sub-unit: GyrB |
| 4 | 0.585 | 58584662 | DNA gyrase subunit A |
| 5 | 0.550 | 58584524 | Translocase |
| 6 | 0.539 | 58584756 | DNA polymerase III alpha subunit |
| 7 | 0.497 | 58584618 | Alanyl-tRNA synthetase |
| 8 | 0.482 | 58584729 | Threonyl-tRNA synthetase |
| 9 | 0.425 | 58584862 | Leucyl-tRNA synthetase |
| 10 | 0.414 | 58584752 | Molecular chaperone: DnaK |
| 11 | 0.361 | 58584429 | CTP synthetase |
| 12 | 0.310 | 58584410 | ATP-dependent Zn protease: HflB |
| 13 | 0.276 | 58584946 | ATP synthase subunit B |
| 14 | 0.269 | 58584379 | Enolase |
| 15 | 0.267 | 58584441 | ATP-binding subunit of Clp protease and DnaK/DnaJ chaperones |
| 16 | 0.267 | 58584652 | 2-oxoglutarate dehydrogenase complex, E1 component |
| 17 | 0.258 | 58584572 | ATP synthase subunit A |
| 18 | 0.249 | 58584805 | NAD-dependent DNA ligase: Lig |
| 19 | 0.246 | 58584298 | Topoisomerase IA: TopA |
| 20 | 0.245 | 58584921 | Transketolase |
Figure 3Essential gene prediction by MHS was validated through a jackknife methodology. For each organism within DEG, the ability of the MHS to place experimentally validated essential genes at the top of a ranked genome was evaluated. All graphs correspond to the schematic found in the upper left. The X-axis represents the ranked genome of the organism, ranked from left to right as strongest to weakest prediction of essentiality. The Y-axis is the cumulative count of essential genes encountered moving left to right through the ranked genome. Line A is the ideal sorting, in which all essential genes are placed at the top of the ranking. Line B is the sorting by MHS. Lines C are 10 random assortments of the genome. Percent sorting achieved by MHS and the p-value for the difference between the MHS score ranking B and 1000 random assortments such as in C are shown in the lower right. Graphs are ordered by descending genome size of the organism. E. coli, F. novicida, and M. genitalium show 10, 2 and 2 fewer total essential genes, respectively, than shown in Table 1 because the corresponding DEG genes are not able to be resolved to genomic genes and are omitted from the jackknife analysis.
Genomes available within the order Rickettsiales
| Genus species Strain | Taxon ID |
|---|---|
| 234826 | |
| 212042 | |
| 320483 | |
| 335992 | |
| 269484 | |
| 205920 | |
| 302409 | |
| 254945 | |
| 254945 | |
| 357244 | |
| 334380 | |
| 222891 | |
| 293614 | |
| 391896 | |
| 336407 | |
| 293613 | |
| 272944 | |
| 315456 | |
| 416276 | |
| 272947 | |
| 452659 | |
| 392021 | |
| 257363 | |
| 163164 | |
| 66084 | |
| 570417 | |
| 292805 |
Figure 4Distribution of GCS in . The X-axis indicates the 805 protein coding genes in the wBm genome, ranked by GCS. The Y-axis shows the value of the GCS for each protein.
Figure 5Comparison of the prediction of . The X-axis shows normalized MHS on a log scale, while the Y-axis shows GCS. Grey lines indicate empirically determined thresholds for confidence in prediction of essentiality and are set at 7.3 × 10-3 for the MHS and 29 for the GCS. Therefore, the upper right quadrant contains genes with high confidence by both metrics. The upper left quadrant contains genes identified only by GCS, while the bottom right quadrant contains genes identified only by MHS. The numbers adjacent to the quadrant lines indicate gene counts in each quadrant. Red dots indicate Wolbachia genes which have significant protein sequence similarity to the targets of approved drugs and are predicted to be druggable.
Top 20 wBm genes ranked by GCS. Annotations taken from the Refseq release of the wBm proteome.
| Rank | GCS | GI | Annotation |
|---|---|---|---|
| 1 | 101 | 58584652 | 2-oxoglutarate dehydrogenase complex, E1 component |
| 2 | 101 | 58584298 | Topoisomerase IA: TopA |
| 3 | 101 | 58584469 | Pyruvate phosphate dikinase |
| 4 | 101 | 58584904 | DNA-directed RNA polymerase: RpoB/RpoC |
| 5 | 101 | 58584952 | Ribonucleotide-diphosphate reductase alpha subunit |
| 6 | 101 | 58584808 | ATP-dependent Lon protease |
| 7 | 101 | 58584662 | DNA gyrase subunit A |
| 8 | 101 | 58584705 | Succinate dehydrogenase |
| 9 | 101 | 58584602 | Translation elongation factor, GT-Pase: FusA |
| 10 | 101 | 58584729 | Threonyl-tRNA synthetase |
| 11 | 101 | 58584633 | NADH dehydrogenase gamma sub-unit |
| 12 | 101 | 58584752 | Molecular chaperone: DnaK |
| 13 | 101 | 58584862 | Leucyl-tRNA synthetase |
| 14 | 101 | 58584524 | Translocase |
| 15 | 100.994 | 58585021 | DNA gyrase, topoisomerase II, B sub-unit: GyrB |
| 16 | 100.989 | 58584924 | GTP-binding protein: LepA |
| 17 | 100.987 | 58584410 | ATP-dependent Zn protease: HflB |
| 18 | 100.986 | 58584731 | NADH:ubiquinone oxidoreductase, NADH-binding, chain F |
| 19 | 100.974 | 58584620 | Isoleucyl-tRNA synthetase |
| 20 | 100.974 | 58584756 | DNA polymerase III alpha subunit |
Figure 6Number of essential genes versus total number of Refseq genes. •-DEG organisms (V. cholerae omitted as an outlier). △-wBm essential gene prediction by MHS. ▽-wBm essential gene prediction by GCS score.
16S rRNA gene sequence sources
| Refseq ID | Taxon | Coordinates | Species name |
|---|---|---|---|
| NC_012026.1 | 320483 | 246283-247795 | |
| NC_004842.2 | 234826 | 247468-248989 | |
| NC_007797.1 | 212042 | 1057470-1058902 | |
| NC_007205.1 | 335992 | 511358-512831 | |
| NC_007354.1 | 269484 | 285955-287439 | |
| NC_007799.1 | 205920 | 942218-943726 | |
| NC_006831.1 | 302409 | 303748-305256 | |
| NC_006832.1 | 254945 | 306928-308437 | |
| NC_005295.2 | 254945 | 326964-328421 | |
| NC_007798.1 | 222891 | 36268-37765 | |
| NC_009488.1 | 357244 | 1322598-1324120 | |
| NC_010793.1 | 334380 | 379135-380647 | |
| NC_009881.1 | 293614 | 864179-865686 | |
| NC_009883.1 | 391896 | 1008161-1009668 | |
| NC_007940.1 | 336407 | 537796-539303 | |
| NC_009879.1 | 293613 | 385940-387447 | |
| NC_003103.1 | 272944 | 884601-886108 | |
| NC_007109.1 | 315456 | 456383-457890 | |
| NC_009900.1 | 416276 | 968391-969898 | |
| NC_000963.1 | 272947 | 772263-773769 | |
| NC_009882.1 | 392021 | 876489-877996 | |
| NC_010263.1 | 452659 | 887263-888750 | |
| NC_006142.1 | 257363 | 779669-781167 | |
| NC_010981.1 | 570417 | 1136001-1137446 | |
| NC_002978.6 | 163164 | 1167943-1169389 | |
| NC_006833.1 | 292805 | 634569-636083 | |
| NC_012416.1 | 66084 | 1289969-1291473 |