| Literature DB >> 19597502 |
Julio Ng1, Nuno Bandeira, Wei-Ting Liu, Majid Ghassemian, Thomas L Simmons, William H Gerwick, Roger Linington, Pieter C Dorrestein, Pavel A Pevzner.
Abstract
Nonribosomal peptides (NRPs) are of great pharmacological importance, but there is currently no technology for high-throughput NRP 'dereplication' and sequencing. We used multistage mass spectrometry followed by spectral alignment algorithms for sequencing of cyclic NRPs. We also developed an algorithm for comparative NRP dereplication that establishes similarities between newly isolated and previously identified similar but nonidentical NRPs, substantially reducing dereplication efforts.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19597502 PMCID: PMC2754211 DOI: 10.1038/nmeth.1350
Source DB: PubMed Journal: Nat Methods ISSN: 1548-7091 Impact factor: 28.547
Figure 1Experimental and theoretical spectra of seglitide Cyclic(N-methyl-Ala, Tyr, D-Trp, Lys, Val, Phe (a) Cyclic diagram of seglitide. A+14 denotes a methylated alanine; the integer residue masses are 85, 163, 186, 128, 99 and 147. (b) MS2 fragmentation of seglitide generates up to 6 linear peptides representing different linearized variants of the same cyclic peptide. (c) The theoretical spectrum for seglitide is a superposition of the fragment masses from the linearized peptide. (d) Experimental spectrum of seglitide (the peaks corresponding to prefix masses are shown in red). (e) The auto-convolution of the spectrum in insert d has prominent peaks for offsets corresponding to masses of amino acids (shown in red). The peak at 0 is truncated. (f) Generation of a gapped peptide from a theoretical spectrum of seglitide. The theoretical spectrum is colored to highlight various linear peptides. For illustration purposes only 3 linearized (A+14YWKV (blue), FA+14YWKV (red) and VFA+14YWK (green)) versions of the cyclic peptide are shown. The frequent 2-amino-acid tag YW is observed in 3 different locations in the spectrum. Additionally, the offsets between 3 consecutive locations of tag YW reveal the masses of amino acids F and V. (g) The gapped peptide constructed from f combines YW (derived from a frequent tag) with VF (derived from the inter distances between tag locations). A+14 and K are inferred from the flanking masses of YW and VF. The complete sequence A+14YWKVF is recovered, but gaps may be generated.
NRP-Tagging results. The reconstructed NRPs are represented as sequences of masses. For the sake of brevity, masses are rounded to integers. Actual sequenced masses are float point numbers. Composite masses (2 or more amino acids) are enclosed in square brackets. For example, [114+57] in cyanopeptide × means that NRP-Tagging returned 171 as the mass of an amino acid instead of the correct masses 114 and 57 (Hmp and Gly). Incorrect masses are enclosed in curly brackets and expressed in terms of their offsets from correct masses. For example,{97+1} in H3526 means that NRP-Tagging returned 98 while the correct mass is 97 (Pro). In this case the isotopic peak (rather than a b-ion) was chosen as the best spectral interpretation. Lastly, cases in which the algorithm splits a mass are enclosed in angle brackets with the correct mass followed by the masses returned by the algorithm. A single mass 286 in cyclomarin A is split as 129, 157. A single mass 222-18 (water loss) in compound 879 is split into 100 and 104. The reconstructions given in the table represent a complete reconstruction of the compound, or a reconstruction with composite masses and/or masses with a known offset. The “Best reconstruction" column presents the high-scoring peptide with a specified rank (“Rank column") that is selected from the list of all top-scoring peptides as the most similar to the correct peptide.
| Compound | Best reconstruction | Rank |
|---|---|---|
| Tyrocidine A | 99, 114, 113, 147, 97, 147, 147, 114, 128, 163 | 3 |
| Tyrocidine A1 | 99, 128, 113, 147, 97, 147, 147, 114, 128, 163 | 16 |
| Tyrocidine B | 99, 114, 113, 147, 97, 186, 147, 114, 128, 163 | 4 |
| Tyrocidine B1 | 99, 128, 113, 147, 97, 186, 147, 114, 128, 163 | 1 |
| Tyrocidine C | 99, 114, 113, 147, 97, 186, 186,114, 128, 163 | 4 |
| Tyrocidine C1 | 99, 128, 113, 147, 97, 186, 186, 114, 128, 163 | 1 |
| Seglitide | 85, 163, 186, 128, 99, 147 | 1 |
| Cyanopeptide X | 57, 113, 161, 141, 71, 113, [114+57], 127 | 1 |
| BQ123 | 113, 186, 115, 97, 99 | 2 |
| Destruxin A | 113, 113, 85, 71, [98+97] | 2 |
| H3526 | 97, 97, 163, 99, {97+1}, 113, {113−1}, 113 | 10 |
| H8405 | 129, 71, 113, 113, 186 | 2 |
| Microcystin LR | {[83+71]+1}, {113−1}, {129−1}, {156+1}, 313, 129 | 27 |
| Compound 879 | 113, 113, <222−18 : 100,104>, {147+18}, 71, 141, 71 | 7 |
| Cyclomarin A | 127, 139, <286 : 129,157 >, 143, 71, [177+99] | 10 |
| Dehydrocyclomarin A | 127, 139, 268, 143, 71, 177, 99 | 27 |
| Cyclomarin C | 127, 139, 270, {143+32}, {[71+177] −32}, 99 | >40 |
| Dehydrocyclomarin C | Not generated | - |