| Literature DB >> 17501996 |
Abstract
BACKGROUND: Tuberculosis remains a leading infectious disease with global public health threat. Its control and management have been complicated by multi-drug resistance and latent infection, which prompts scientists to find new and more effective drugs. With the completion of the genome sequence of the etiologic bacterium, Mycobacterium tuberculosis, it is now feasible to search for new drug targets by sieving through a large number of gene products and conduct genome-scale experiments based on microarray technology. However, the full potential of genome-wide microarray analysis in configuring interrelationships among all genes in M. tuberculosis has yet to be realized. To date, it is only possible to assign a function to 52% of proteins predicted in the genome.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17501996 PMCID: PMC1884158 DOI: 10.1186/1471-2180-7-37
Source DB: PubMed Journal: BMC Microbiol ISSN: 1471-2180 Impact factor: 3.605
The top 100 most expressed genes of M. tuberculosis in log-phase growth. For other active genes, refer to Additional file 4.
| ORF | Gene | Mean | S.D. | ORF | Gene | Mean | S.D. |
| Rv1641 | 11397 | 1612 | Rv0709 | 4436 | 859 | ||
| Rv1872c | 10798 | 1943 | Rv0170 | 4383 | 723 | ||
| Rv1398c | - | 10543 | 2353 | Rv3219 | 4380 | 467 | |
| Rv1038c | 10524 | 1911 | Rv1094 | 4331 | 359 | ||
| Rv1037c | 9677 | 2110 | Rv3053c | 4315 | 430 | ||
| Rv3874 | 9539 | 2143 | Rv2007c | 4217 | 1604 | ||
| Rv3614c | - | 8992 | 3003 | Rv1388 | 4191 | 611 | |
| Rv2031c | 8926 | 2861 | Rv0704 | 4140 | 917 | ||
| Rv3615c | - | 8718 | 2869 | Rv1980c | 4063 | 1067 | |
| Rv3648c | 8362 | 1199 | Rv3459c | 4044 | 393 | ||
| Rv3131 | - | 8102 | 3991 | Rv0682 | 4034 | 393 | |
| Rv2348c | - | 7796 | 2033 | Rv0715 | 4004 | 773 | |
| Rv3407 | - | 7792 | 1011 | Rv3130c | - | 3991 | 1822 |
| Rv3616c | - | 7388 | 2263 | Rv2442c | 3988 | 432 | |
| Rv3583c | - | 7366 | 851 | Rv0718 | 3982 | 799 | |
| Rv0288 | 7332 | 1684 | Rv3412 | - | 3952 | 331 | |
| Rv3841 | 7057 | 1518 | Rv1306 | 3857 | 609 | ||
| Rv1871c | - | 7039 | 840 | Rv0655 | 3727 | 408 | |
| Rv3408 | - | 6953 | 884 | Rv1174c | 3721 | 321 | |
| Rv1397c | - | 6931 | 1091 | Rv0708 | 3658 | 825 | |
| Rv3804c | 6875 | 410 | Rv0440 | 3650 | 935 | ||
| Rv0703 | 6378 | 651 | Rv0668 | 3637 | 661 | ||
| Rv3461c | 6349 | 379 | Rv3849 | - | 3615 | 650 | |
| Rv1298 | 6160 | 765 | Rv1211 | - | 3606 | 386 | |
| Rv3418c | 6017 | 1376 | Rv2204c | - | 3605 | 594 | |
| Rv1177 | 6004 | 569 | Rv1310 | 3604 | 694 | ||
| Rv0685 | 5895 | 438 | Rv3281 | - | 3579 | 602 | |
| Rv1297 | 5609 | 950 | Rv3051c | 3490 | 405 | ||
| Rv3460c | 5602 | 552 | Rv0009 | 3479 | 519 | ||
| Rv0824c | 5569 | 698 | Rv2161c | - | 3452 | 934 | |
| Rv2094c | 5552 | 866 | Rv1308 | 3439 | 467 | ||
| Rv3462c | 5324 | 291 | Rv0167 | 3439 | 620 | ||
| Rv0700 | 5315 | 464 | Rv3457c | 3396 | 841 | ||
| Rv1072 | - | 5244 | 450 | Rv0174 | 3393 | 604 | |
| Rv0144 | - | 5177 | 653 | Rv0641 | 3376 | 464 | |
| Rv0287 | 5135 | 1345 | Rv2457c | 3371 | 337 | ||
| Rv2986c | 5127 | 953 | Rv1109c | - | 3366 | 335 | |
| Rv2137c | - | 5070 | 962 | Rv2785c | 3361 | 365 | |
| Rv3127 | - | 4963 | 1788 | Rv3052c | 3356 | 295 | |
| Rv2840c | - | 4959 | 277 | Rv0710 | 3323 | 715 | |
| Rv3679 | - | 4952 | 995 | Rv0639 | 3320 | 749 | |
| Rv0706 | 4943 | 950 | Rv0569 | - | 3283 | 1757 | |
| Rv1738 | - | 4921 | 2210 | Rv1827 | 3214 | 212 | |
| Rv0702 | 4868 | 415 | Rv2159c | - | 3197 | 832 | |
| Rv0701 | 4822 | 429 | Rv0640 | 3169 | 256 | ||
| Rv1642 | 4816 | 562 | Rv2392 | 3147 | 663 | ||
| Rv2244 | 4650 | 1053 | Rv2196 | 3137 | 578 | ||
| Rv1884c | 4642 | 615 | Rv2391 | 3110 | 822 | ||
| Rv1305 | 4488 | 679 | Rv0289 | - | 3088 | 583 | |
| Rv0705 | 4476 | 908 | Rv3153 | 3066 | 747 |
S.D.: Standard deviation.
Figure 1The gene expression map of genes involved in the log-phase growth of Mycobacterium tuberculosis. The map was generated by Eisen's cluster-analysis program called CLUSTER and viewed by the TREEVIEW program. Several clusters representing aggregations of functionally related genes are visible in the dendrogram (on the left of the image) showing how genes are grouped. The detailed image is available at our web site [see Additional file 2] annotated with gene names alongside of the image strip [see Additional file 3].
| 1 | (CYSH Rv1478 GGTB NIRA Rv2425C ACCD4 Rv3541C Rv0175 Rv2393 MURX Rv3701C) | Intermediate and lipid metabolism |
| 2 | (DRRB Rv1251C EFPA Rv2054 MURC Rv1632C Rv2395 Rv1378C QCRA CTAE RHO PARB Rv3321C ATPH Rv3921C ATPD Rv3805C Rv2901C PARA Rv2949C CTAC Rv1576C ATPB Rv0546C Rv2781C Rv0526 Rv3104C POLA PYRH Rv1870C Rv1711 Rv3672C Rv0514 DAPF Rv2554C Rv1869C NUOE NUOC Rv1178 Rv0528 Rv1481 Rv2791C Rv2610C Rv3856C Rv1565C Rv3212 Rv1043C TSNR Rv1324 PGK Rv0525 RUVC KSGA Rv2989 LPRE PURK Rv0412C RODA Rv3725 HEME Rv1339 Rv1797 NRP UREC Rv2852C Rv3781 TRPA Rv2956 TRPB Rv2808 Rv2128 Rv1695 NUOI Rv1312 NUOL NUOD Rv3806C Rv2759C Rv2966C FOLK Rv2879C Rv1780 Rv1271C Rv3693 PLCB DRRC Rv2600 NUOM NUOH Rv0875C Rv3220C Rv3885C MMPL7 Rv2475C LTP1 Rv0236C NUOJ NUOB NUON MMPL9 Rv2752C Rv0177 Rv0176 HEMB PRCA Rv2553C Rv2367C RPLT GID CMK Rv3122 EMBB Rv1303 Rv1907C Rv2792C) | Energy production and respiration |
| 3 | (FTSZ WAG31 Rv0902C HEMK NARL Rv2147C Rv3267 Rv1477 LPPW SIGC Rv2864C RECA Rv2826C Rv1697 LEUC LEUD Rv3909 DNAQ Rv1465 Rv0486 Rv3910 PIRG Rv2574 Rv2360C Rv3816C AROB Rv2827C FTSK Rv3587C FOLE FTSQ Rv3647C Rv3376 RHLE) | Information pathways (replication, transcription, and translation) |
| 4 | (SIGE Rv0516C Rv0846C Rv0991C Rv3334 Rv2628 Rv2020C Rv0968 Rv2517C NARK2 FBPC LPQS Rv2662 Rv1772 Rv0967 Rv0465C Rv1813C Rv2016 HSP Rv1847 Rv0190 Rv1774 RPST ALD Rv0080 Rv2699C Rv2629 Rv0571C Rv2623 Rv0572C Rv2005C CTPF Rv3133C Rv2004C Rv2626C Rv2625C Rv2627C Rv2032 PANB Rv2466C Rv2035 Rv3134C Rv2962C Rv0081 Rv2630) | Cell wall, cell processes, and metabolism |
The whole-genome gene expression image (Figure 1) was divided into four zones, each annotated with major gene clusters therein and the functional classes of representative genes. The list of genes in each zone is available [see Additional file 5].
The distribution (percentage %) of functional categories for genes in the major clusters of each zone on the gene expression map.
| Clusters in | Vil | Lipid | Info | Cell-W-P | Ins-S-P | Meta-Res | ? | Reg | Hypo |
| Zone-1 | 5 | 20 | 0 | 15 | 0 | 30 | 0 | 0 | 30 |
| Zone-2 | 0 | 2 | 5 | 32 | 3 | 34 | 0 | 2 | 22 |
| Zone-3 | 3 | 0 | 38 | 16 | 0 | 22 | 0 | 4 | 17 |
| Zone-4 | 6 | 8 | 5 | 20 | 2 | 21 | 1 | 7 | 30 |
Vil: virulence, detoxification, adaptation. Lipid: lipid metabolism. Info: information pathways. Cell-W-P: cell wall and cell processes. Ins-S-P: insertion sequences and phages. Meta-Res: intermediate metabolism and respiration. ?: unknown function. Reg: regulatory function. Hypo: conserved hypothetical proteins.
Transcriptional regulators associated with major gene clusters based on microarray analysis.
| Gene/ORF | Gene Product | Assocated Gene Cluster |
| Rv2989 | Probable transcriptional regulatory protein | Respiration and energy production |
| Rv2258c | Possible transcriptional regulatory protein | Information pathways |
| Rv3334 | Probable transcriptional regulatory protein probably MerR-family | Cell wall, cell processes, and metabolism |
| Rv0465c | Probable transcriptional regulatory protein | Ditto |
| Rv0081 | Probable transcriptional regulatory protein | Ditto |
| Rv3681c | Probable transcriptional regulatory protein | Ditto |
| Rv0653c | Possible transcriptional regulatory protein probably TetR-family | Ditto |
| Rv0165c | Possible transcriptional regulatory protein probably GntR-family | Ditto |
| Rv1931c | Probable transcriptional regulatory protein | Ditto |
| Rv1151c | Probable transcriptional regulatory protein | Ditto |
| Rv1956 | Possible transcriptional regulatory protein | Ditto |
Conserved hypothetical proteins associated with the gene clusters of each zone on the gene expression map.
| Clusters in | Conserved Hypothetical Proteins |
| Zone-1 (Intermediate and lipid Metabolism) | (Rv2425c Rv3541c Rv2393 Rv3701c Rv2052c Rv1728c) |
| Zone-2 (Energy and Respiration) | (Rv1251c Rv2054 Rv1378c Rv3321c Rv2901c Rv2949c Rv0546c Rv1870c Rv1711 Rv3672c Rv2554c Rv3856c Rv3212 Rv1043c Rv0525 Rv1339 Rv2956 Rv2759c Rv2879c Rv1780 Rv2475c Rv2752c Rv0177 Rv2367c) |
| Zone-3 (Information Pathways) | (Rv2147c Rv3267 Rv1697 Rv3909 Rv2574 Rv3376 Rv2125 Rv1546 Rv1099c Rv2908c Rv1073 Rv0636 Rv0637 Rv0277c) |
| Zone-4 (Cell Wall, Cell Processes, and metabolism) | (Rv0516c Rv0991c Rv2020c Rv0968 Rv0967 Rv1813c Rv1847 Rv0190 Rv0080 Rv2699c Rv2629 Rv0571c Rv2623 Rv2005c Rv2004c Rv2626c Rv2627c Rv2032 Rv2466c Rv2035 Rv3134c Rv0941c Rv2478c Rv2184c Rv0502 Rv1352 Rv1893 Rv1413 Rv0121c Rv3654c Rv2044c Rv2670c Rv3860 Rv0269c Rv2722 Rv0695 Rv3633 Rv3616c Rv3399 Rv2638 Rv3615c Rv3614c Rv3733c Rv2632c Rv1868 Rv2598 Rv1871c Rv0387c Rv2024c Rv0657c Rv2137c Rv2311 Rv2205c Rv1398c Rv2472 Rv0767c Rv0258c Rv1425 Rv1978 Rv1885c Rv0760c PRA Rv0192) |