| Literature DB >> 18325090 |
Caroline Deshayes1, Emmanuel Perrodou, Daniel Euphrasie, Eric Frapy, Olivier Poch, Pablo Bifani, Odile Lecompte, Jean-Marc Reyrat.
Abstract
BACKGROUND: Computer-assisted analyses have shown that all bacterial genomes contain a small percentage of open reading frames with a frameshift or in-frame stop codon We report here a comparative analysis of these interrupted coding sequences (ICDSs) in six isolates of M. tuberculosis, two of M. bovis and one of M. africanum and question their phenotypic impact and evolutionary significance.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18325090 PMCID: PMC2277376 DOI: 10.1186/1471-2148-8-78
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Figure 1A- Schematic representation of the ICDSs common to M. tuberculosis H37Rv, CDC1551 and M. bovis AF2122/97 or specific to one of these strains. The total number of ICDSs is indicated. B- Schematic representation of the ICDSs of M. bovis BCG 1173P2 compared to the other analyzed strains.
List of the 81 ICDSs common to M. tuberculosis H37Rv, CDC1551, M. bovis AF2122/97 and M. africanum GM041182.
| Putative function | Functional classification | |||||
|---|---|---|---|---|---|---|
| 0002 (Rv0151c | 0004 | 0006 | 0109 | ICDS | PE family protein | PE/PPE |
| 0003 (Rv0152c | 0005 | 0007 | 0004 | ICDS | PE family protein | PE/PPE |
| 0007 (Rv0366c | 0011 | 0010 | 0110 | ICDS | Conserved hypothetical | Unknown |
| 0009 (Rv0393 | 0012 | 0011 | 0008 | ICDS | Conserved hypothetical | Unknown |
| 0010 (Rv0520 | 0014 | 0013 | 0012 | ICDS | Dimethylglycine N-methyltransferase | Intermediary metabolism |
| 0012 (Rv0601c | 0017 | 0018 | 0016 | ICDS° | Two-component sensor kinase | Regulation |
| 0014 (Rv0635 | 0019 | 0020 | 0111 | ICDS | Conserved hypothetical | Unknown |
| 0015 (Rv0636 | 0020 | 0021 | 0112 | ICDS | Conserved hypothetical | Unknown |
| 0017 (Rv0724A | 0022 | 0023 | 0020 | ICDS | Conserved hypothetical | Unknown |
| 0020 (Rv0865 | 0026 | 0085 | 0113 | ICDS | Molybdopterin biosynthesis protein | Intermediary metabolism |
| 0021 (Rv0890c | 0027 | 0025 | 0114 | ICDS | Transcriptional regulator | Regulation |
| 0023 (Rv1034c | 0030 | 0030 | 0031 | ICDS | Transposase | IS/phage |
| 0024 (Rv1035c | 0031 | 0031 | 0032 | ICDS | Transposase | IS/phage |
| 0025 (Rv1041c | 0032 | 0086 | 0034 | ICDS | Transposase | IS/phage |
| 0026 (Rv1104 | 0034 | 0087 | 0115 | ICDS | Esterase | Intermediary metabolism |
| 0027 (Rv1104 | 0035 | 0088 | 0037 | ICDS | Esterase | Intermediary metabolism |
| 0028 (Rv1105 | 0036 | 0033 | 0036 | ICDS | Para-nitrobenzyl esterase | Intermediary metabolism |
| 0029 (Rv1119c | 0037 | 0034 | 0039 | ICDS | Conserved hypothetical | Unknown |
| 0030 (Rv1136 | 0039 | 0089 | 0040 | ICDS | Enoyl-CoA | Lipid metabolism |
| 0032 (Rv1149 | 0041 | 0090 | 0041 | ICDS | Transposase | IS/phage |
| 0033 (Rv1163 | 0042 | 0035 | 0116 | ICDS | Nitrate reductase NarI-J | Intermediary metabolism |
| 0035 (Rv1203c | 0044 | 0036 | 0043 | ICDS | Conserved hypothetical | Unknown |
| 0036 (Rv1413 | 0046 | 0041 | 0047 | ICDS | Conserved hypothetical | Unknown |
| 0040 (Rv1662 | 0053 | 0043 | 0117 | ICDS | Polyketide synthase Pks8/17 | Lipid metabolism |
| 0041 (Rv1687c | 0054 | 0132 | 0128 | ICDS° | ATP binding protein, ABC transporter | Cell wall, process |
| 0043 (Rv1735c | 0056 | 0112 | 0050 | ICDS | Malic acid transport protein | Cell wall, process |
| 0046 (Rv1878 | 0061 | 0052 | 0118 | ICDS° | Glutamine synthetase GlnA3 | Intermediary metabolism |
| 0047 (Rv1888A | 0062 | 0053 | 0119 | ICDS | Conserved hypothetical | Unknown |
| 0048 (Rv1931c | 0064 | 0054 | 0058 | ICDS | Conserved hypothetical | Unknown |
| 0049 (Rv1949c | 0065 | 0055 | 0059 | ICDS | Conserved hypothetical | Unknown |
| 0050 (Rv2013 | 0066 | 0056 | 0061 | ICDS° | Transposase | IS/phage |
| 0051 (Rv2086 | 0067 | 0093 | 0062 | ICDS | Conserved hypothetical | Unknown |
| 0052 (Rv2086 | 0068 | 0058 | 0063 | ICDS | Conserved hypothetical | Unknown |
| 0053 (Rv2087 | 0069 | 0094 | 0064 | ICDS | Conserved hypothetical | Unknown |
| 0054 (Rv2095c | 0070 | 0133 | 0129 | ICDS | Conserved hypothetical | Unknown |
| 0058 (Rv2321 | 0074 | 0060 | 0067 | ICDS | Ornithine aminotransferase RocD1 | Intermediary metabolism |
| 0059 (Rv2325 | 0075 | 0096 | 0120 | ICDS | Conserved hypothetical | Unknown |
| 0060 (Rv2331 | 0076 | 0134 | 0130 | ICDS | Hypothetical | Unknown |
| 0061 (Rv2337 | 0077 | 0097 | 0068 | ICDS | Conserved hypothetical | Unknown |
| 0062 | 0081 | 0065 | 0076 | ICDS | Hydrogenase nickel incorporation protein HypB | Intermediary metabolism |
| 0063 (Rv2877c | 0082 | 0099 | 0077 | ICDS° | Conserved hypothetical | Unknown |
| 0065 (Rv2943A | 0083 | 0066 | 0079 | ICDS | Transposase | IS/phage |
| 0068 (Rv3128c | 0088 | 0068 | 0082 | ICDS | Conserved hypothetical | Unknown |
| 0069 (Rv3152 | 0089 | 0100 | 0121 | ICDS | NADH dehydrogenase I | Intermediary metabolism |
| 0070 (Rv3172c | 0090 | 0069 | 0083 | ICDS | Conserved hypothetical | Unknown |
| 0071 (Rv3200c | 0091 | 0101 | 0122 | ICDS | Hypothetical | Unknown |
| 0075 (Rv3349c | 0100 | 0102 | 0085 | ICDS° | Transposase | IS/phage |
| 0076 (Rv3351c | 0101 | 0071 | 0088 | ICDS | Oxidoreductase | Intermediary metabolism |
| 0077 (Rv3352c | 0102 | 0072 | 0089 | ICDS | Oxidoreductase | Intermediary metabolism |
| 0079 (Rv3419c | 0104 | 0135 | 0131 | ICDS | O-sialoglycoprotein endopeptidase | Intermediary metabolism |
| 0080 (Rv3420c | 0105 | 0104 | 0092 | ICDS° | Conserved hypothetical | Unknown |
| 0083 | 0108 | 0107 | 0095 | ICDS | Transposase | IS/phage |
| 0084 (Rv3636 | 0111 | 0076 | 0097 | ICDS | Transposase | IS/phage |
| 0087 (Rv3741c | 0114 | 0078 | 0099 | ICDS° | Aromatic-ring hydroxylase | Intermediary metabolism |
| 0088 (Rv3770A | 0115 | 0079 | 0100 | ICDS° | Transposase | IS/phage |
| 0089 (Rv 3844 | 0116 | 0136 | 0132 | ICDS | Transposase | IS/phage |
| 0090 (Rv3866 | 0117 | 0109 | 0123 | ICDS | Conserved hypothetical | Unknown |
| 0091 (Rv3880c | 0118 | 0137 | 0133 | ICDS | Conserved hypothetical | Unknown |
| 0095 (Rv3900c | 0122 | 0083 | 0124 | ICDS | Conserved hypothetical | Unknown |
| 0097 (Rv3913 | 0123 | 0111 | 0001 | ICDS | Thioredoxin reductase | Intermediary metabolism |
| 0098 | 0124 | 0032 | 0035 | ICDS | Conserved hypothetical | Unknown |
| 0099 (Rv3386 | 0125 | 0103 | 0090 | ICDS | Transposase | IS/phage |
| 0100 (Rv0342 | 0126 | 0009 | 0125 | ICDS | Isoniazid inductible gene protein | Cell wall, process |
| 0101 (Rv0763c | 0127 | 0084 | 0126 | ICDS | Cytochrome P450 | Intermediary metabolism |
| 0102 (Rv1858 | 0128 | 0092 | 0127 | ICDS | Molybdenum transport ABC transporter | Cell wall, process |
| 0103 (Rv0449c | 0129 | 0113 | 0010 | ICDS | Conserved hypothetical | Unknown |
| 0104 (Rv0471c | 0130 | 0114 | 0011 | ICDS | Hypothetical | Unknown |
| 0105 (Rv0859 | 0131 | 0115 | 0024 | ICDS° | Acyl-CoA thiolase FadA and dehydrogenase FadB | Lipid metabolism |
| 0106 (Rv0880 | 0132 | 0116 | 0025 | ICDS° | Transcriptional regulator | Information pathway |
| 0107 (Rv0997 | 0133 | 0117 | 0029 | ICDS | Hypothetical | Unknown |
| 0108 (Rv1041c | 0134 | 0118 | 0033 | ICDS° | Transposase | IS/phage |
| 0109 (Rv1104 | 0135 | 0119 | 0038 | ICDS | Para-nitrobenzyl esterase | Intermediary metabolism |
| 0110 (Rv1221 | 0136 | 0120 | 0044 | ICDS | Alternative sigma factor SigE | Information pathway |
| 0111 (Rv1752c | 0137 | 0121 | 0051 | ICDS | Conserved hypothetical | Unknown |
| 0112 (Rv1961 | 0138 | 0122 | 0060 | ICDS | Hypothetical | Unknown |
| 0113 (Rv2309c | 0139 | 0123 | 0066 | ICDS | Integrase | Information pathway |
| 0114 (Rv2420c | 0140 | 0124 | 0070 | ICDS | nicotinate-nucleotide adenylyltransferase NadD | Intermediary metabolism |
| 0115 (Rv2732c | 0141 | 0125 | 0073 | ICDS | Conserved hypothetical | Unknown |
| 0116 (Rv2922A | 0142 | 0126 | 0078 | ICDS | Acylphosphatase AcyP | Intermediary metabolism |
| 0117 (Rv3774 | 0143 | 0127 | 0101 | ICDS° | Enoyl-CoA hydratase EchA21 and lipase LipE | Lipid metabolism |
| 0119 (Rv2599 | 0144 | 0098 | 0134 | ICDS° | Conserved hypothetical | Unknown |
ICDS number (variable, according to the strain), the size of the predicted protein and its putative function are indicated. The corresponding ORF numbers in M. tuberculosis H37Rv are indicated in brackets. "°" indicates ICDSs containing additional mutations with respect to M. tuberculosis H37Rv, CDC1551 and M. bovis AF2122/97.
| M. bovis AF2122/97 | Putative function | Functional classification | ||||
|---|---|---|---|---|---|---|
| 0001 (Rv0095c | 0003 | Mb0098c | ICDS | Not Found | Conserved hypothetical | Unknown |
| 0005 (Rv0325 | 0010 | Mb0333 | FL | FL | Hypothetical | Unknown |
| 0011 (Rv0590 | 0016 | Mb0605 | FL | FL | MCE-family protein | Virulence, detox, adapt |
| 0013 (Rv0618 | 0018 | Mb0635 | ICDS | FL | Galactose-1-phosphate uridylyltransferase | Intermediary metabolism |
| 0022 (Rv0924c | 0028 | Mb0948c | ICDS | ICDS | Manganese transport protein MntH | Cell wall, process |
| 0031 (Rv1145 | 0040 | Mb1177 | FL | FL | Transmembrane transport protein MmpL13 | Cell wall, process |
| 0037 (Rv1503c | 0048 | Mb1542c | ICDS | FL | Conserved hypothetical | Unknown |
| 0038 (Rv1549 | 0051 | Mb1576 | ICDS | FL | Fatty-acid-coA ligase FadD11 | Lipid metabolism |
| 0039 (Rv1553-Rv1554 | 0052 | Mb1579 | ICDS° | ICDS° | Fumarate reductase | Intermediary metabolism |
| 0045 (Rv1792 | 0058 | Mb1820 | ICDS | FL | ESAT-6-like protein EsxM | Cell wall, process |
| 0055 (Rv2227 | 0072 | Mb2252 | ICDS | FL | Conserved hypothetical | Unknown |
| 0066 (Rv2946c | 0084 | Mb2971c | FL | FL | Polyketide synthase Pks15/1 | Lipid metabolism |
| 0067 (Rv2974c | 0085 | Mb2999c | FL | FL | Conserved hypothetical | Unknown |
| 0072 (Rv3233c | 0092 | Mb3262c | FL | FL | Conserved hypothetical | Unknown |
| 0073 (Rv3337 | 0094 | Mb3370 | ICDS | FL | Conserved hypothetical | Unknown |
| 0078 (Rv3373 | 0103 | Mb3408 | ICDS | FL | Enoyl-CoA hydratase EchA18 | Lipid metabolism |
| 0085 (Rv3725 | 0112 | Mb3752 | FL | FL | Oxidoreductase | Intermediary metabolism |
| 0086 (Rv3738c | 0113 | Not determined | ICDS | ICDS | PPE family protein | PE/PPE |
| 0094 (Rv3897c | 0121 | Mb3927c | FL | FL | Conserved hypothetical | Unknown |
| 0118 (Rv3784 | 0128 | MT3893 | NT | ICDS | NAD-dependent epimerase/dehydratase | Information pathway |
| 0057 (MT1806 | 0046 | Rv1759c | NT | NT | PE_PGRS family protein | PE/PPE |
List of the 19 ICDSs common to M. tuberculosis H37Rv and CDC1551, the ICDSs common to M. tuberculosis H37Rv and M. bovis AF2122/97 and the ICDSs common to M. tuberculosis CDC1551 and M. bovis AF2122/97. ICDS number (variable, according to strain), the size of the predicted protein and its putative function are indicated. The genes that do not contain a frameshift in either M. tuberculosis strain 210 and in M. africanum and that correspond to a full-length ORF are noted "FL". "NT", not tested.
Analysis in 21 W-Beijing isolates of the 8 ICDSs of H37Rv strain corresponding to full-length ORFs in W-Beijing strain 210.
| Finger print | Tracking Number | ICDS 0005 | ICDS 0011 | ICDS 0031 | ICDS 0066 | ICDS 0067 | ICDS 0072 | ICDS 0085 | ICDS 0094 | |
|---|---|---|---|---|---|---|---|---|---|---|
| FL | FL | FL | FL | FL | FL | NT | FL | |||
| FL | FL | FL | FL | FL | FL | FL* | FL | |||
| FL | FL | FL | FL | FL | FL | FL* | FL | |||
| FL | FL | FL | FL | FL | FL | FL* | FL | |||
| FL | FL | FL | FL | FL | FL | FL* | FL | |||
| FL | FL | FL | FL | FL | FL | FL* | FL | |||
| FL | FL | FL | FL | FL | FL | FL* | FL | |||
| FL | FL | FL | FL | FL | FL | FL* | FL | |||
| FL | FL | FL | FL | FL | FL* | FL* | FL | |||
| FL | FL | FL | FL | FL | FL | NT | FL | |||
| FL | FL | FL | FL | FL | FL | FL* | FL | |||
| FL | FL | FL | FL | FL | FL | FL* | FL | |||
| FL | FL | FL | FL | FL | FL | FL* | FL | |||
| FL | FL | FL | FL | FL | FL | FL* | FL | |||
| FL | FL | FL | FL | FL | FL | FL* | FL | |||
| FL | FL | FL | FL | FL | FL | FL* | FL | |||
| FL | FL | FL | FL | FL | FL | FL* | FL | |||
| FL | FL | FL | FL | FL | FL | FL* | FL | |||
| FL | FL | FL | FL | FL | FL | FL* | FL | |||
| FL | FL | FL | FL | FL | FL | FL* | FL | |||
| FL | FL | NT | FL | FL | FL | FL* | FL | |||
| ICDS | ICDS | ICDS | ICDS | ICDS | ICDS | ICDS | ICDS | |||
| FL | FL | FL | FL | FL | FL | FL | FL | |||
M. tuberculosis isolates from various lineages for which chromosomal DNA was used as a template for PCR amplification of the selected locus. "FL" indicates the presence of a full-length ORF identical to that in the M. tuberculosis 210 strain, "*" indicates an additional mutation acquired in these isolates with respect to M. tuberculosis H37Rv, "NT", not tested.
Figure 2Hypothetical phylogenetic links assessed by comparative analyses of ICDSs. In this schematic representation, the common ancestor gave rise to several branches of strains of the TB complex. Eighty-one frameshifts were acquired during the common evolution of M. bovis and M. tuberculosis. Since the separation of these species, M. bovis has acquired 51 frameshifts, while the branch leading to M. tuberculosis isolates has acquired 19 new frameshifts. Since separation of the isolates, M. tuberculosis H37Rv has acquired 12 new frameshifts and CDC1551 36 new frameshifts. Common and unique ICDSs are shown in dark and light gray, respectively. "*" these 8 ICDSs correspond to full-length ORF in M. tuberculosis 210 and in M. africanum GM041182. "**" 7 out of these 11 ICDSs correspond to full-length ORF in M. africanum GM041182 (Table 2).