| Literature DB >> 18831753 |
Sivakumar Kannan1, Gertraud Burger.
Abstract
BACKGROUND: In a previous study, we conducted a large-scale similarity-free function prediction of mitochondrion-encoded hypothetical proteins, by which the hypothetical gene murf1 (maxicircle unidentified reading frame 1) was assigned as nad2, encoding subunit 2 of NADH dehydrogenase (Complex I of the respiratory chain). This hypothetical gene occurs in the mitochondrial genome of kinetoplastids, a group of unicellular eukaryotes including the causative agents of African sleeping sickness and leishmaniasis. In the present study, we test this assignment by using bioinformatics methods that are highly sensitive in identifying remote homologs and confront the prediction with available biological knowledge.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18831753 PMCID: PMC2572627 DOI: 10.1186/1471-2164-9-455
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
List of FASTA hits for P. serpens MURF1 searched against UniProt
| Q9XKY50 | MURF1 | 5.3e-148 | 100.0% | |
| Q33559 | MURF1 | 2.7e-109 | 90.9% | |
| Q8HE85 | MURF1 | 3.1e-17 | 87.6% | |
| Q33547 | MURF1 | 1.2e-16 | 86.5% | |
| Q33552 | MURF1 | 6e-16 | 88.4% | |
| Q33556 | MURF1 | 5.1e-13 | 85.0% | |
| Q34937 | MURF2 | 2e-09 | 60.4% | |
| Q34096 | MURF2 | 3.2e-09 | 56.3% | |
| Q34192 | NAD5 | 3.8e-09 | 54.5% | |
| P48903 | NAD2 | 5.4e-07 | 57.9% | |
| Q5LRX2 | Putative membrane protein | 1.2e-06 | 58.0% | |
| Q6E773 | NAD2 | 1.5e-06 | 53.8% | |
| Q6SKY5 | NAD5 | 2.3e-06 | 55.2% | |
| Q5AG49 | Hypothetical protein | 3.3e-06 | 67.7% | |
| Q5AGI5 | Hypothetical protein | 7.1e-06 | 67.9% | |
| Q8SKS6 | NAD4 | 7.4e-06 | 57.3% | |
| Q85TH7 | NAD4 | 7.7e-06 | 58.1% | |
| Q33575 | NAD4 | 8.7e-06 | 57.3% | |
| P24499 | ATP6 | 1.1e-05 | 55.4% | |
| Q70NW4 | NAD4 | 1.2e-05 | 56.7% | |
| Q33570 | ATP6 | 1.5e-05 | 56.9% | |
| Q5CV17 | Hypothetical protein | 1.5e-05 | 61.5% | |
| Q057W5 | NADH dehydrogenase I chain L | 1.9e-05 | 54.9% | |
| Q8IBJ6 | Hypothetical protein | 2.9e-05 | 58.4% |
Best informative hits for the MURF1 profile HMM when searched against profile HMMs from various databases
| Pfam | NADH-Ubiquinone/plastoquinone (Complex I), various subunits | 1.6e-08 | 21% | 96.80 |
| PANTHER | NADH dehydrogenase | 4.3e-09 | 16% | 99.20 |
| COG | NADH:Ubiquinone oxidoreductase subunit 2 | 3.8e-03 | 19% | 39.65 |
| TIGR | NDH_I_N Proton-translocating NADH-Quinone oxidoreductase | 91 | 19% | 75.95 |
Best hits for the MURF1 profile HMM when searched against the profile HMMs of all NADH dehydrogenase subunits using HHsearch. The hits are ranked based on E-valuesa
| 1 | NAD2_0.45 | 96.6 | 2.70E-15 | 26 | 75.7 |
| 2 | NAD2_0.4 | 96.6 | 3.20E-15 | 25 | 75.3 |
| 3 | NAD2_0.5 | 96.6 | 1.70E-14 | 23 | 72.1 |
| 4 | NAD2_0.55 | 96.5 | 1.50E-12 | 34 | 63.1 |
| 5 | NAD2_0.6 | 96.3 | 6.30E-12 | 23 | 60.4 |
| 6 | NAD2_0.65 | 96.2 | 1.00E-11 | 26 | 59.4 |
| 7 | NAD4_0.4 | 96 | 2.10E-11 | 21 | 58 |
| 8 | NAD4_0.55 | 95.9 | 2.90E-11 | 23 | 57.4 |
| 9 | NAD4_0.6 | 95.2 | 1.40E-10 | 28 | 54.2 |
| 10 | NAD2_0.7 | 95.1 | 1.90E-10 | 27 | 53.6 |
| 11 | NAD4_0.7 | 95 | 2.40E-10 | 28 | 53.2 |
| 12 | NAD4_0.5 | 94.3 | 6.00E-10 | 28 | 51.4 |
| 13 | NAD2_0.75 | 93.8 | 1.20E-09 | 24 | 50 |
| 14 | NAD4_0.45 | 93.2 | 2.00E-09 | 27 | 49 |
| 15 | NAD4_0.75 | 93.1 | 2.30E-09 | 28 | 48.7 |
| 16 | NAD6_0.4 | 91.8 | 6.40E-09 | 24 | 46.7 |
| 17 | NAD6_0.45 | 90.1 | 1.80E-08 | 26 | 44.7 |
| 18 | NAD5_0.4 | 89.8 | 2.10E-08 | 21 | 44.4 |
| 19 | NAD1_0.55 | 88.9 | 3.30E-08 | 18 | 43.5 |
| 20 | NAD6_0.5 | 88.9 | 3.30E-08 | 23 | 43.5 |
| 21 | NAD5_0.5 | 88.4 | 4.00E-08 | 28 | 43.1 |
| 22 | NAD1_0.6 | 86.7 | 8.10E-08 | 17 | 41.7 |
| 23 | NAD1_0.5 | 86 | 1.10E-07 | 18 | 41.2 |
| 24 | NAD6_0.55 | 85.7 | 1.20E-07 | 25 | 41 |
| 25 | NAD5_0.55 | 85.7 | 1.20E-07 | 25 | 40.9 |
| 26 | NAD1_0.65 | 84.8 | 1.60E-07 | 20 | 40.4 |
| 27 | NAD1_0.4 | 84.3 | 1.80E-07 | 24 | 40.1 |
| 28 | NAD1_0.45 | 84.1 | 2.00E-07 | 22 | 40 |
| 29 | NAD4_0.65 | 83.5 | 2.40E-07 | 26 | 39.6 |
| 30 | NAD1_0.7 | 83.2 | 2.60E-07 | 21 | 39.4 |
| 31 | NAD5_0.45 | 25 | 2.90E-07 | 25 | 39.2 |
| 32 | NAD5_0.6 | 80.4 | 5.50E-07 | 21 | 37.9 |
| 33 | NAD6_0.65 | 80 | 6.10E-07 | 28 | 37.7 |
| 34 | NAD1_0.75 | 79.5 | 6.90E-07 | 18 | 37.5 |
| 35 | NAD5_0.65 | 77.3 | 1.10E-06 | 18 | 36.5 |
| 36 | NAD5_0.75 | 76.9 | 1.20E-06 | 21 | 36.3 |
| 37 | NAD5_0.7 | 76.7 | 1.30E-06 | 18 | 36.2 |
| 38 | NAD6_0.6 | 73.6 | 2.40E-06 | 25 | 35 |
| 39 | NAD6_0.7 | 69.8 | 4.70E-06 | 25 | 33.7 |
| 40 | NAD6_0.75 | 69.2 | 5.20E-06 | 20 | 33.5 |
| 41 | NAD3_0.4 | 62.1 | 1.50E-05 | 30 | 31.4 |
| 42 | NAD3_0.45 | 55.9 | 3.50E-05 | 27 | 29.8 |
| 43 | NAD3_0.55 | 48.2 | 8.80E-05 | 25 | 27.9 |
| 44 | NAD3_0.6 | 46.9 | 0.0001 | 24 | 27.6 |
| 45 | NAD3_0.65 | 45.8 | 0.00012 | 23 | 27.4 |
| 46 | NAD3_0.5 | 43.9 | 0.00014 | 21 | 27 |
| 47 | NAD4L_0.4 | 34.2 | 0.00044 | 25 | 24.8 |
| 48 | NAD4L_0.45 | 31.2 | 0.00062 | 18 | 24.1 |
| 49 | NAD4L_0.55 | 26.8 | 0.0011 | 15 | 23 |
| 50 | NAD3_0.7 | 26.7 | 0.0011 | 26 | 23 |
| 51 | NAD4L_0.6 | 26.5 | 0.0011 | 30 | 22.9 |
| 52 | NAD4L_0.7 | 23.3 | 0.0016 | 27 | 22.1 |
| 53 | NAD4L_0.5 | 21.1 | 0.0022 | 18 | 21.6 |
a Probability, e-value, identity and score for each hit were reported by HHSearch
b The number following the subunit name is the sequence identity threshold used for clustering the sequences from which we generate the profile HMM. For example, NAD2_0.45 profile HMM is generated by clustering all known NAD2 sequences at 45% sequence identity threshold using CD-HIT.
Figure 1Multiple sequence alignment of kinetoplastid MURF1 sequences with NAD2 sequences from other eukaryotes. The top five sequences are kinetoplastid proteins. Only the most conserved region of the protein is depicted. The range of amino acid positions included in the alignment is indicated by the numbers following the species name. Dashes specify alignment gaps.
List of kinetoplastid MURF1 sequences with GenBank Accession Numbers