| Literature DB >> 17259624 |
Stuart C G Rison1, Jens Mattow2, Peter R Jungblut2, Neil G Stoker1.
Abstract
Identification of protein translation start sites is largely a bioinformatics exercise, with relatively few confirmed by N-terminal sequencing. Translation start site determination is critical for defining both the protein sequence and the upstream DNA which may contain regulatory motifs. It is demonstrated here that translation start sites can be determined during routine protein identification, using MALDI-MS and MS/MS data to select the correct N-terminal sequence from a list of alternatives generated in silico. Applying the method to 13 proteins from Mycobacterium tuberculosis, 11 predicted translational start sites were confirmed, and two reassigned. The authors suggest that these data (be they confirmation or reassignments) are important for the annotation of both this genome and those of organisms with related genes. It was also shown that N-acetylation, reported to be rare in prokaryotes, was present in three of the 13 proteins (23 %), suggesting that in the mycobacteria this modification may be common, and an important regulator of protein function, although more proteins need to be analysed. This method can be performed with little or no additional experimental work during proteomics investigations.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17259624 PMCID: PMC2884946 DOI: 10.1099/mic.0.2006/001537-0
Source DB: PubMed Journal: Microbiology (Reading) ISSN: 1350-0872 Impact factor: 2.777
Fig. 1.General strategy for identifying alternative translation start positions and tryptic digest peptides: application to GlpX (Rv1099c). The six alternative translation starts for GlpX are indicated by the labelled arrows. The minus (m)1, m2, m3 and m4 starts are located upstream of the original (p0) translation start indicated in the M. tuberculosis genome annotation. The plus (p)1 and p2 starts are downstream of the original translation start prediction. The putative trypsin cleavage sites are indicated by the ‘T’-labelled double-tailed bars. The resulting tryptic peptides are shown by the horizontal boxes. Tryptic fragments identified by MS/MS are shown in white; those not detected are shown in black. The figure shows that the detected fragments could only have resulted from the digest of the m2 variant. For reference, the M. tuberculosis sequence was aligned with the N-terminal region of six other actinomycete GlpX homologues, as well as with the E. coli homologue. Asterisks indicate perfectly conserved residues; colons indicate conserved substitutions (start codon methionines are ignored). The C. glutamicum homologue TSS (at position −14) has been confirmed experimentally (Rittmann ). Organisms are E. coli (Eco), Streptomyces coelicolor (Sco), Corynebacterium diptheriae (Cdi), C. glutamicum (Cgl), Mycobacterium smegmatis (Msm), Mycobacterium leprae (Mlp), Mycobacterium avium (Mav) and M. tuberculosis (Mtb).
MS analyses of N-terminal peptides from M. tuberculosis proteins
MetOx, oxidized methionine.
| 5_19 | CP | 1781.93 | (L)STTAELAELHDLVGGLR | p0 | fMet cleaved | ||
| 5_18 | CP | 1733.68 | (V)GDYGPFGFDPDEFDR | p0 | fMet cleaved | ||
| 5_154 | CP | 984.544 | (M)AVGVSPGELR | p0 | fMet cleaved | ||
| 3_350 | CP | 1130.51 (2 MetOx) | [V]MoxDPLMoxAHQR|| | p0 | MS/MS confirmed. 2MetOx detected; thus fMet deformylated but not cleaved. GTG start codon | ||
| 5_65 | CP | 1816.74 | (M)TDTQVTWLTQESHDR | p0 | fMet cleaved | ||
| 1_249/1_216 | CP | 2408.1 | (M) Ac-TAEGSGSSTAAVASHDPSHTRPSR¶ | Ac-m2/m2 | MS/MS confirmed. fMet cleaved. Both acetylated and non-acetylated forms found | ||
| 3_443# | CP | 834.62 | (M)ATAPYGVR | p0 | fMet cleaved | ||
| 5_97 | CP | 1842.94 (MetOx) | [V]MoxKGGAGVPDLPSLDASGVR|| | m1 | MS/MS confirmed. fMet deformylated but not cleaved. GTG start codon | ||
| 6_33 | CP/CSN | 1674.79 | (M)PLSGEYAPSPLDWSR | p0 | fMet cleaved | ||
| 1_386/2_17 | CP | 1155.52 (MetOx) | (M) Ac-TGASTTTATMoxR** | Ac-p0/p0 | fMet cleaved. Both acetylated and non-acetylated forms found. Internal Met oxidized | ||
| 3_325/3_372 | CSN | 1630.74 (MetOx) | (V) Ac-SFPYFISPEQAMoxR†† | Ac-p0/p0 | fMet cleaved | ||
| 6_19‡‡ | CP | 1393.76 | (M)SAVVVDAVEHLVR | p0 | fMet cleaved | ||
| 5_149 | CP | 846.42 | (V)PDSSTALR | p0 | fMet cleaved |
*All from M. tuberculosis unless otherwise mentioned.
†CP, cellular protein; CSN, culture supernatant.
‡Parentheses indicate a missing residue, presumed to be a cleaved fMet, and the amino acid predicted from the DNA sequence.
§Identity of start peptide. p0 indicates that the annotation in the published genome sequence is correct. m1/m2 indicates that the first or second upstream start codon is used. ‘Ac’ indicates that the peptide is N-terminally acetylated.
||Brackets indicate the codon present where an uncleaved methionine was identified (M=ATG, V=GTG, L=TTG).
¶Non-acetylated peptide also found, mass 2366.1.
#Spot from Mycobacterium bovis BCG Chicago.
**Non-acetylated peptide also found, mass 1113.52.
††Non-acetylated peptide also found, mass 1588.74.
‡‡Spot from M. tuberculosis Erdman.
Fig. 2.MS of GlpX. (a) SDS-PAGE indicating two forms of GlpX, acetylated (1) and non-acetylated (2). (b) Identification of a peptide spanning the annotated start (NLAMELVR=945.52; MS/MS-detected b ions: b2, b3, b4, b5, b6; detected y ions: y1, y2, y3, y5. (c) Identification of an acetylated N-terminal peptide (Ac-TAEGSGSSTAAVASHDPSHTRPSR=2408.1; MS/MS: y4, y5, y8, y9, y10, y21). (d) Identification of a non-acetylated N-terminal peptide (TAEGSGSSTAAVASHDPSHTRPSR=2366.1; MS/MS: y1, y4, y8, y9, y21).
Fig. 3.TSS reassignment for ribH (Rv1416). There is a 14 bp gap between the stop codon of ribA2 and the M. tuberculosis annotation predicted start codon (p0) of ribH. No peak corresponding to this start was seen; however, peaks corresponding to the m1 start variant (dashed box) were identified. Thus, the corrected TSS overlaps the upstream ribA2 (Rv1415) gene by 4 bp, an arrangement often observed in prokaryotic genomes for functionally related genes in a common operon.