| Literature DB >> 21674801 |
Matthias T Ehebauer1, Matthias Wilmanns.
Abstract
Mycobacterium tuberculosis is a highly infectious pathogen that is still responsible for millions of deaths annually. Effectively treating this disease typically requires a course of antibiotics, most of which were developed decades ago. These drugs are, however, not effective against persistent tubercle bacilli and the emergence of drug-resistant stains threatens to make many of them obsolete. The identification of new drug targets, allowing the development of new potential drugs, is therefore imperative. Both proteomics and structural biology have important roles to play in this process, the former as a means of identifying promising drug targets and the latter allowing understanding of protein function and protein-drug interactions at atomic resolution. The determination of M. tuberculosis protein structures has been a goal of the scientific community for the last decade, who have aimed to supply a large amount of structural data that can be used in structure-based approaches for drug discovery and design. Only since the genome sequence of M. tuberculosis has been available has the determination of large numbers of tuberculosis protein structures been possible. Currently, the molecular structures of 8.5% of all the pathogen's protein-encoding ORFs have been determined. In this review, we look at the progress made in determining the M. tuberculosis structural proteome and the impact this has had on the development of potential new drugs, as well as the discovery of the function of crucial mycobaterial proteins.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21674801 PMCID: PMC3345573 DOI: 10.1002/pmic.201000787
Source DB: PubMed Journal: Proteomics ISSN: 1615-9853 Impact factor: 3.984
Figure 1Analysis of M. tuberculosis protein structures deposited in the PDB. Shown is the proportion, in percentage of the total number of currently deposited structures (898), that were determined by X-ray crystallography or NMR, the proportion of structures that are ligand-bound, the proportion of structures determined by structural genomics initiatives and the proportion of crystal structures (873) in the different given resolution ranges.
Figure 2The number of M. tuberculosis structures deposited in the PDB to date (December 2010). The first structure was determined in 1994. In 2000 a total of 11 structures had been determined, by 2005 a total of 203 and by 2010 almost 900 structures had been deposited in the PDB. The first publication of the M. tuberculosis H37Rv genome sequence is indicated. The structures are those of proteins that are targeted by front-line anti-TB drugs and the proteasome. The year of their deposition in the PDB is indicated with arrows on the time-line. The isoniazide and ethionamide target is inhA (1ENY) 34. Isoniazide is a pro-drug and needs to be activated by KatG (1SJ2) 35. The fluoroquinolones target gyrA (3ILW) and pyrazinamide pncA (3GBC). Rifampin and streptomycin target the ribosome, of which no TB structure has been determined.
Functional classification of M. tuberculosis proteins and the available structural data for each class
| Class | Function | ORFs | Structures | %Structures/ORFs | Folds | Structures/fold |
|---|---|---|---|---|---|---|
| 0 | Virulence, detoxification, adaptation | 99 | 20 | 20.2 | 15 | 1.3 |
| 1 | Lipid metabolism | 233 | 31 | 13.3 | 9 | 3.4 |
| 2 | Information pathways | 229 | 25 | 10.9 | 9 | 2.8 |
| 3 | Cell-wall and cell processes | 708 | 31 | 4.4 | 11 | 2.8 |
| 6 | PE and PPE proteins | 170 | 2 | 1.2 | 1 | 2 |
| 7 | Intermediate metabolism and respiration | 894 | 143 | 16.0 | 35 | 4.1 |
| 8 | Proteins of unknown function | 272 | 0 | – | – | – |
| 9 | Regulatory proteins | 189 | 32 | 16.9 | 11 | 2.9 |
| 10 | Conserved hypothetical proteins | 1051 | 43 | 4.1 | 15 | 2.9 |
| Total protein encoding ORFs | 3845 | 327 | 8.5 |
Definition of classes and the number of protein-encoding ORFs in the M. tuberculosis are as given in Camus et al. 12. Class 4 are RNA-encoding genes and class 5 are insertion sequences and phages and were not included in this analysis.
The number of different folds in each functional class are based on the SCOP (http://scop.mrc-lmb.cam.ac.uk/scop/) annotation of the structures given in the Supporting Information Table S1. Only those structures, in total 112, that are annotated in SCOP were considered, which represents 34% of all the distinct protein structures.