| Literature DB >> 21272379 |
Kenichiro Imai1, Naoya Fujita, M Michael Gromiha, Paul Horton.
Abstract
BACKGROUND: The outer membranes of mitochondria are thought to be homologous to the outer membranes of Gram negative bacteria, which contain 100's of distinct families of β-barrel membrane proteins (BOMPs) often forming channels for transport of nutrients or drugs. However, only four families of mitochondrial BOMPs (MBOMPs) have been confirmed to date. Although estimates as high as 100 have been made in the past, the number of yet undiscovered MBOMPs is an open question. Fortunately, the recent discovery of a membrane integration signal (the β-signal) for MBOMPs gave us an opportunity to look for undiscovered MBOMPs.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21272379 PMCID: PMC3045335 DOI: 10.1186/1471-2164-12-79
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Sequence logos of proposed . The upper figure is computed from all homologs in Uniprot and the bottom from the subset of those with confirmed expression.
Figure 2Conserved . In the top track, H, E, and C represented α-helix, β-strand and coil, as predicted by PSIPRED or from the experimentally determined structure in the case of VDAC (PDB:2K4T).
The frequency of amino acid groups of β-signal motif sequences from 70 MBOMP homologs.
| Motif position | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| Large hydrophobic | 0.32 | 0.03 | 0.00 | 0.00 | 0.16 | ||||
| Small hydrophobic | 0.08 | 0.07 | 0.04 | 0.00 | 0.11 | 0.00 | 0.11 | 0.00 | |
| Glycine | 0.07 | 0.16 | 0.00 | 0.00 | 0.24 | 0.00 | 0.00 | 0.00 | |
| Non-negatively charged polar | 0.39 | 0.00 | 0.00 | 0.01 | 0.50 | 0.00 | 0.61 | 0.00 | |
| Negatively charged | 0.09 | 0.00 | 0.00 | 0.00 | 0.00 | 0.14 | 0.00 | 0.11 | 0.00 |
| Proline | 0.05 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
The numbers in parentheses are frequencies of each amino acid group in each position of the β-signal in 70 MBOMP sequences. "Background" is the overall frequency of each amino acid group in the entire set of sequences.
Comparison of the frequency of β-signal motif matches in the C-terminal 40 residues compared with the rest of the sequence of MBOMPs
| Motif pattern | VDAC | Tom40 | Sam50 | Mdm10 |
|---|---|---|---|---|
| Po x G x x HyxHy | (4/4, 0.20, 1.6 × 10-3) | (5/5, 0.12, 2.5 × 10-5) | (5/5, 0.14, 5.4 × 10-5) | (6/8, 0.17, 4.9 × 10-4) |
| Po x Ghy x HyxHy | (4/4, 0.09, 6.6 × 10-5) | (5/5, 0.09, 5.9 × 10-6) | (5/5, 0.09, 5.9 × 10-6) | (6/8, 0.07, 2.9 × 10-6) |
| Po HyGhy x HyxHy | (4/4, 0.07, 2.4 × 10-5) | (4/5, 0.07, 1.1 × 10-4) | (5/5, 0.02, 3.2 × 10-9) | (4/8, 0.04, 1.6 × 10-4) |
| Po Hy xhy x HyxHy | (4/4, 0.25, 3.9 × 10-3) | (5/5, 0.17, 1.4 × 10-4) | (5/5, 0.12, 2.5 × 10-5) | (5/8, 0.12, 1.0 × 10-3) |
| (4/4, 0.07, 2.4 × 10-5) | (4/5, 0.07, 1.1 × 10-4) | (5/5, 0.00, 0.0) | (4/8, 0.04, 1.6 × 10-4) | |
| (4/4, 0.21, 1.9 × 10-3) | (5/5, 0.17, 1.4 × 10-4) | (5/5, 0.08, 3.3 × 10-6) | (5/8, 0.12, 1.0 × 10-3) | |
| (4/4, 0.11, 1.5 × 10-4) | (4/5, 0.07, 1.1 × 10-4) | (5/5, 0.02, 3.2 × 10-9) | (7/8, 0.04, 1.3 × 10-9) | |
| (4/4, 0.37, 1.9 × 10-2) | (5/5, 0.23, 6.4 × 10-4) | (5/5, 0.22, 5.2 × 10-4) | (8/8, 0.18, 1.1 × 10-6) | |
| (4/4, 0.11, 1.5 × 10-4) | (4/5, 0.07, 1.1 × 10-4) | (5/5, 0.00, 0.0) | (7/8, 0.04, 1.3 × 10-9) | |
| (4/4, 0.33, 1.2 × 10-2) | (5/5, 0.23, 6.4 × 10-4) | (5/5, 0.20, 3.2 × 10-4) | (8/8, 0.16, 4.3 × 10-7) |
Statistics on the frequency of matches in the C-terminal versus non-C-terminal part of MBOMPs families is shown for the 4 known MBOMPs and 10 variations of the β-signal motif. Each cell holds a triple: the fraction of homologs with C-termini which match the motif, the fraction of non-C-terminal length 40 substrings of the MBOMP sequences which match the motif, and a p-value. The p-value is computed with a binomial test, in which the non-C-terminal frequency (e.g. 0.2) is the probability of success and the C-terminal frequency (e.g 4/4) is the observed data.
Figure 3Informatics pipeline for searching for novel MBOMP candidates.
List of identified protein clusters with conserved β -signal.
| Representative protein | Number of cluster member | Organism | Protein length | Conserved motif position | Subcellular Localization | Family and domain |
|---|---|---|---|---|---|---|
| SAM50-like protein CG7639 (Q9V784) | 12 | 443 | 435-442 | Mito OM (G) | SAM50/omp85 family (U), Bacterial surface antigen (I) | |
| SAM complex subunit of the mitochondrial outer membrane, putative (B9WEF8) | 6 | 499 | 492-498 | OM (G) | Bacterial surface antigen (I) | |
| SAM50-like protein (A3LZ83) | 6 | 489 | 473-480 | - | - | |
| SAM50 (P53969) | 5 | 484 | 476-483 | Mito OM (U, G) | SAM50/omp85 family (U), Bacterial surface antigen (I) | |
| SAM50-like protein SpAC17C9.06 (Q10478) | 2 | 475 | 467-474 | Mito OM (G) | SAM50/omp85 family (U), Bacterial surface antigen (I) | |
| SAM50-like protein gop-3 (P46576) | 2 | 434 | 426-433 | Mito OM (G) | SAM50/omp85 family (U), Bacterial surface antigen (I) | |
| KLLA0E02223p (Q6CPU1) | 2 | 480 | 472-479 | OM (G) | Bacterial surface antigen (I) | |
| Predicted cell surface protein homologous to bacterial outer membrane proteins (ISS) (Q017Y3) | 2 | 521 | 508-515 | OM (G) | Bacterial surface antigen (I) | |
| TOM40 (P23644) | 36 | 387 | 352-359 | Mito OM (U, G) | Tom40 family (U), Porin, eukaryotic type (I), Mitochondrial outer membrane translocase complex, subunit Tom40 (I) | |
| GA18230 (Q293I2) | 8 | 321 | 290-297 | Mito OM (G) | Porin, eukaryotic type (I) | |
| Outer mitochondrial membrane protein porin (B8NA08) | 5 | 346 | 337-344 | Mito OM (G) | Porin, eukaryotic type (I) | |
| Voltage-dependent anion-selective channel protein (A8I528) | 2 | 276 | 268-275 | OM (G) | Porin, eukaryotic type (I) | |
| Predicted protein (B8BQH4) | 2 | 268 | 261-268 | OM (G) | Porin, eukaryotic type (I) | |
| cDNA FLJ52528, highly similar to Protein TOMM40-like (B7Z4T8) | 2 | 210 | 202-209 | OM (G) | Porin, eukaryotic type (I) | |
| Mdm10 (P18409) | 5 | 493 | 484-491 | Mito OM (U, G) | MDM10 family., Protein of unknown function DUF3722 (I) | |
| Mdm10 (A5DUG6) | 5 | 523 | 496-503 | Mito OM (U, G) | MDM10 family., Protein of unknown function DUF3722 (I) | |
| Probable mitochondrial import receptor sub-unit tom40 homolog (Q7RE39) | 10 | 396 | 331-338 | Mito OM (U, G) | Porin, eukaryotic type (I) | |
| Putative uncharacterized protein (Q4CQ17) | 5 | 479 | 417-424 | OM (G) | Bacterial surface antigen (I) | |
U, G and I in "Subcellular localization" and "Family and domain" represent the source of annotation from Uniprot, Gene ontology and InterPro, respectively
Figure 4. Protein clusters which match our automatic criteria of a conserved β-signal, but with the matches outside of the final predicted β-signal are shown. The top track indicates predicted β -strand (E), coil (C), or α-helix (H), by PSIPRED. The colored residues occur in full or partial matches to the β-signal motif.
Figure 5Examples which don't match the . (a) Putative MBOMP homologs, which do not match the β-signal motif, and (b) BBOMPs sorted to the mitochondrial outer membrane when expressed in yeast are shown. The C-terminus is indicated with an asterisk.
Figure 6Overview of features used for MBOMP prediction by SVM.
List of identified proteins in our Arabidopsis proteome analysis.
| Uniprot AC | Identification | Length | Highest score segment (score) | Domain and family in predicted region |
|---|---|---|---|---|
| Q9SRH5 | VDAC1 | 276 | Whole sequence (0.989) | Porin, eukaryotic |
| Q9SMX3 | VDAC2 | 274 | Whole sequence (1.000) | Porin, eukaryotic |
| Q9FJX3 | - | 276 | Whole sequence (0.989) | Porin, eukaryotic |
| Q9FKM2 | - | 274 | Whole sequence (0.977) | Porin, eukaryotic |
| Q9M2W6 | - | 226 | Whole sequence (0.998) | Porin, eukaryotic |
| Q9FHQ9 | - | 163 | Whole sequence (0.998) | Porin, eukaryotic |
| Q8LGE2 | - | 425 | C-terminal 300 (0.569) | Porin, eukaryotic |
| Q9LHE5 | Tom40 homolog 1 | 309 | Whole sequence (0.991) | Porin, eukaryotic |
| Q9SX55 | Tom40 homolog 2 | 310 | C-terminal 300 (0.999) | Porin, eukaryotic |
| Q8LEH7 | - | 524 | C-terminal 300 (0.986) | Bacterial surface antigen |
| Q9SRL6 | - | 520 | C-terminal 300 (0.995) | Bacterial surface antigen |
| Q9LXP7 | - | 435 | Whole sequence (0.645) | Bacterial surface antigen |
| Q5PP51 | - | 362 | C-terminal 150 (0.900) | Bacterial surface antigen |
| Q9C5J8 | Toc75-V/OEP80 | 732 | C-terminal 300 (0.746) | Bacterial surface antigen |
| O80565 | OEP37 | 343 | N-terminal 300 (0.863) | - |
| Q3EBH0 | OEP37 homolog | 333 | C-terminal 300 (0.891) | - |
| Q3EBG9 | OEP37 homolog | 280 | Whole sequence (0.645) | - |
| Q1H5C9 | OEP24 | 213 | Whole sequence (0.562) | - |
| A8MR28 | OEP24 homolog | 167 | Whole sequence (0.666) | - |
| Q9FPG2 | OEP21 | 167 | Whole sequence (0.925) | - |
| Q9LM70 | OEP21 homolog | 203 | C-terminal 150 (0.691) | - |
| Q6ID99 | OEP21 homolog | 167 | N-terminal 150 (0.875) | - |
| Q8VYB6 | - | 491 | Whole sequence (0.535) | Protein of unknown function (DUF1005) |
| Q9LPM5 | - | 460 | C-terminal 450 (0.855) | Protein of unknown function (DUF1005) |
| Q9M0F0 | - | 424 | N-terminal 300 (0.528) | Protein of unknown function (DUF1005) |
| Q9LEU1 | - | 389 | N-terminal 300 (0.971) | Plant protein of unknown function (DUF868) |
| Q9SIS2 | - | 354 | N-terminal 300 (0.655) | Plant protein of unknown function (DUF868) |
| Q9M903 | - | 479 | C-terminal 150 (0.993) | Protein of unknown function (DUF3769) |
| O80503 | - | 451 | C-terminal 300 (0.648) | Protein of unknown function (DUF3769) |
| Q3EAC5 | - | 134 | Whole sequence (0.704) | Domain of unknown function (DUF3406) |
| | - | 424 | C-terminal 300 (0.652) | - |
| Q9LH72 | - | 483 | C-terminal 300 (0.994) | - |
| Q9LPG1 | - | 468 | C-terminal 300 (0.816) | - |
| Q8W4R2 | - | 271 | Whole sequence (0.729) | - |
| O48573 | - | 1170 | C-terminal 150 (0.625) | - |
| Q8VY85 | - | 188 | C-terminal 150 (0.608) | - |
| Q9LZB6 | - | 177 | C-terminal 150 (0.506) | - |
| Q9M238 | - | 163 | C-terminal 150 (0.748) | - |
Figure 7Signal Peptide Model. The h-region is defined as the region of 20 residues around the position of the first peak of the hydrophobicity plot, and the n-region and c-region as the N-terminal and C-terminal flanking 5 residues, respectively, as shown at the top of this figure. The h-region is aligned to the first peak of the hydrophobicity plot (details in main text).
Prediction performance of our SVM MBOMP predictor.
| Fold | TP | FP | FN | TN | Precision | Recall | Specificity | F-measure |
|---|---|---|---|---|---|---|---|---|
| 1 | 15 | 1 | 4 | 527 | 0.938 | 0.789 | 0.998 | 0.857 |
| 2 | 16 | 0 | 3 | 528 | 1.000 | 0.789 | 1.000 | 0.914 |
| 3 | 15 | 0 | 4 | 528 | 1.000 | 0.789 | 1.000 | 0.882 |
| 4 | 16 | 0 | 3 | 528 | 1.000 | 0.842 | 1.000 | 0.914 |
| 5 | 17 | 0 | 1 | 528 | 1.000 | 0.944 | 1.000 | 0.971 |
| Mean | - | - | - | - | 0.988 | 0.842 | 1.000 | 0.908 |