| Literature DB >> 25503745 |
Fumi Suomi1, Katja E Menger1, Geoffray Monteuuis1, Uta Naumann1, V A Samuli Kursu1, Antonina Shvetsova1, Alexander J Kastaniotis1.
Abstract
The Saccharomyces cerevisiae genome encodes two sequence related acetyl-CoA carboxylases, the cytosolic Acc1p and the mitochondrial Hfa1p, required for respiratory function. Several aspects of expression of the HFA1 gene and its evolutionary origin have remained unclear. Here, we determined the HFA1 transcription initiation sites by 5' RACE analysis. Using a novel "Stop codon scanning" approach, we mapped the location of the HFA1 translation initiation site to an upstream AUU codon at position -372 relative to the annotated start codon. This upstream initiation leads to production of a mitochondrial targeting sequence preceding the ACC domains of the protein. In silico analyses of fungal ACC genes revealed conserved "cryptic" upstream mitochondrial targeting sequences in yeast species that have not undergone a whole genome duplication. Our Δhfa1 baker's yeast mutant phenotype rescue studies using the protoploid Kluyveromyces lactis ACC confirmed functionality of the cryptic upstream mitochondrial targeting signal. These results lend strong experimental support to the hypothesis that the mitochondrial and cytosolic acetyl-CoA carboxylases in S. cerevisiae have evolved from a single gene encoding both the mitochondrial and cytosolic isoforms. Leaning on a cursory survey of a group of genes of our interest, we propose that cryptic 5' upstream mitochondrial targeting sequences may be more abundant in eukaryotes than anticipated thus far.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25503745 PMCID: PMC4263661 DOI: 10.1371/journal.pone.0114738
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Yeast strains used for this study.
| Strain | Genotype | Reference |
| W1536 5B |
| H. Zou and R. Rothstein |
| W1536 8B |
| H. Zou and R. Rothstein |
| BY4741 Δ |
| Euroscarf |
| BY4742 Δ |
| Euroscarf |
| W1536 8B Δ |
| This study |
| W1536 8B Δ |
| This study |
| W1536 8B Δ |
| This study |
Plasmids used for the acetyl-CoA carboxylases study in yeasts.
| Plasmid | Description | Reference |
| YCp | Single copy vector, empty |
|
| YCp33 | Full length | This study |
| pET23a+ | Novagen (Merck Chemicals, Nottingham, United Kingdom) | |
| pET23a+ | Promoter region of | This study |
| YEp | Single copy vector, empty |
|
| pBS KS + | Fermentas (Helsinki, Finland) | |
| pBS KS N- | This study | |
| pTSV30A | Multi copy vector, empty |
|
| pTSV30 | Full length | This study |
| YCp33 | Generated through site directed mutagenesis from YCp33 | This study |
| YCp33 | Generated through site directed mutagenesis from YCp33 | This study |
| YCp33 | Generated through site directed mutagenesis from YCp33 | This study |
| YCp33 | Generated through site directed mutagenesis from YCp33 | This study |
| YCp33 | Generated through site directed mutagenesis from YCp33 | This study |
| YCp33 | Generated through site directed mutagenesis from YCp33 | This study |
| YCp33 | Generated through site directed mutagenesis from YCp33 | This study |
| YCp33 | Generated through site directed mutagenesis from YCp33 | This study |
| YCp33 | Full length | This study |
| Ycp33 | ADH promoter cloned into | This study |
| YCp33 | Full length | This study |
| YCp33 | Full length | This study |
*these plasmids are unstable in E. coli and have to be maintained in S. cerevisiae frozen stocks
Oligonucleotides used for the acetyl-CoA carboxylases study in yeasts.
| Name | Sequence |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| Hfa1p transcr. Start 3′ seq |
|
| Hfa1p 5′RACE Sp1 |
|
| Hfa1p 5′ RACE Sp2 |
|
| Hfa1p 5′ RACE Sp3 |
|
|
|
|
| HA-His tag coding |
|
| HA-His tag complementing |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| HindIII |
|
|
|
|
|
|
|
| ADH promoter klac-minusMTS ACC SpeI- F |
|
|
|
|
|
|
|
|
|
|
Figure 1Identification of the HFA1 transcription initiation site by 5′RACE: The HFA1 specific primer SP1 (Table 4) was used for the reverse transcription reaction to synthesize a first strand cDNA.
a) Major 5′ RACE product (500 bp), b) Minor 5′RACE product (650 bp) and c) third band produced only occasionally.
Figure 2Schematic depiction of location of the introduced stop codons for stop –codon scanning assay and RNAse protection assay results.
All nucleotide numbers are given respective to the annotated start codon at +1. Position −450 is the predicted transcribed but not translated region of HFA1. The underlined region up to position −216 region shows the putative minimum mitochondrial import sequence and upstream position −141 shows the end of the sequence similarity to ACC1. The ORFof HFA1 annotated in the Saccharomyces Genome Database starts from +1. The stop codon found to lead to a respiratory deficient phenotype in the screen performed by Kursu et al. 2013 is located at −273 and 8 more stop codons at −282, −312, −360, −363, −372 −375 −378 and −381 were introduced upstream in the promoter region of HFA1.
Figure 3The W1536 8B Δhfa1 strain carrying plasmids with inserted stop codon mutation at position downstream of −372 resulted in unchanged lactate deficiency.
Only stop codon mutations relevant to define the putative translation initiation site and controls are shown. The yeast cells were grown on media containing Glucose (SCD) or lactate (Lactate) as the sole carbon source at 33°C. Strains used for this study are W1536 8B, W1536 8B Δhfa1 or W1536 8B Δhtd2 (respiratory deficient control) and the plasmids carried by the strains are indicated at the left side of the panels. YCp33: empty plasmid; HFA1: YCp33 HFA1; −381: YCp33 HFA1 −381; −372: YCp33 HFA1 −372; −363: YCp33HFA1 −363. Only stop codon mutations relevant to define the putative translation initiation site and controls are shown. The results for other mutants shown in Fig. 2 can be found as supplementary data.
Figure 4K.lactis ACC complemented the respiratory deficiency when appended with the putative 5′ non-AUG initiated MTS, but not without this sequence.
Strains were grown on media containing glucose (SCD-URA), glycerol (SCG, non-fermentable) or lactate (SCL, non-fermentable) as the sole carbon source at 30°C and 33°C. 1) W1536 8B Δhfa1 +YCp33 HFA1. 2) W1536 8B Δhfa1+YCp33 3) W1536 8B Δhfa1. 4) W1536 8B (wild type). 5) W1536 8B Δhfa1+K.lactis ACC without 5′- encoded putative cryptic MTS sequence. 6) W1536 8B Δhfa1+K.lactis ACC with 5′ -encoded putative cryptic 5′ MTS.
Figure 5Acetyl-CoA carboxylases from yeast split into two subgroups with the exception of the S. cerevisiae HFA1.
The phylogenetic tree of representative members of the acetyl-CoA carboxylases from fungi species was constructed using Clustal W2 Phylogeny and NJPlot as described in Material and methods. Numbers at branch points are bootstrap values and were calculated with Clustal X2.
Acetyl-CoA carboxylases in yeasts and MTS probabilities.
| Organism | Accession number | Group number | MTS probability(MitoProtII)/cleavage site/AA sequence length | Upstream MTS probability (MitoProtII)/cleavage site/AA sequence lengthb |
|
| EEQ43196.1 | 1 | 0.8764/NP/2271 | NC |
|
| XP_002421671.1 | 1 | 0.0403/NP/2228 | NC |
|
| XP_002546225.1 | 1 | 0.9935/40/2275 | NC |
|
| XP_457211.1 | 1 | 0.9693/NP/2297 | NC |
|
| FN392319 | 1 | 0.0793/NP/2215 | 0.9050/31/2245 |
|
| XP_501721.1 | 1 | 0.981621/2266 | NC |
|
| BK006946 | 2 | 0.4993/NP/2123 | 0.7297/80/2248c |
|
| Kwal_6157 (KEGG entry) | 2 | 0.0195/NP/2230 | 0.9923/19/2281 |
|
| XP_455355 | 2 | 0.0201/NP/2231 | 0.9999/90/2317 |
|
| NP_982612 | 2 | 0.0321/NP/2231 | 0.9902/52/2298 |
|
| CR380958 | 2 | 0.0090/NP/2233 | 0.5589/NP/2292 |
|
| ACC1(Locus Tag) | 2 | 0.0279/NP/2233 | 0.0452/NP/2252 |
: The whole amino acid sequence of each protein was analyzed with MitoProt II. b: The whole sequence with the additional 5′ upstream region of the ORF of each protein was analyzed up to the first in-frame stop codon. c: The whole amino acid sequence from the 5′ end upstream start codon identified in this study was analyzed. NP: not predicted. NC: not calculated
Mitochondrial targeting prediction of selected proteins in S. cerevisiae.
| Gene name | MTS probability annotated ORF(MitoProt II)/cleavage site/sequence length | Upstream MTS probability/cleavage site/sequence length | Upstream MTS prediction/Target P | |||
| mTP | SP | Loc | RC | |||
| BPL1 | 0.0101/not (690) | 0.9329/51 (760) | 0.190 | 0.447 | S | 4 |
| FAA2 | 0.2275/not (744) | 0.8722/21 (764) | 0.283 | 0.074 | - | 4 |
| SAM2 | 0.0373/not (384) | 0.6337/not (431) | 0.363 | 0.516 | S | 5 |
| HFA1 | 0.4993/not (2123) | 0.7297/80 (2248) this study | 0.305 | 0.129 | - | 4 |
| ACP1 | 0.9996/37 (122) | NC | 0.960 | 0.009 | M | 1 |
The whole amino acid sequence and the whole amino acid sequence with the 5′ upstream region of the ORF of each protein was analyzed up to the first in-frame stop codon with MitoProt II. In case of the Hfa1p, the identified upstream start codon with the addition of the methionine for the convenience of the calculation with MitoProt II was used. The whole amino acid sequence with the 5′ upstream region of the ORF of each protein was analyzed up to the first in-frame stop codon with Target P. In case of Acp1p, the whole amino acid sequence was used for the calculation with Target P. mTP, SP: Final NN scores on which the final prediction is based. Note that the scores are not really probabilities, and they do not necessarily add to one. However, the location with the highest score is the most likely according to TargetP, and the relationship between the scores (the reliability class, see below) may be an indication of how certain the prediction is. Loc: S: Secretory pathway, i.e. the sequence contains SP, a signal peptide M: Mitochondrion, i.e. the sequence contains mTP, a mitochondrial targeting peptide; -: Any other location; RC: Reliability class, from 1 to 5, where 1 indicates the strongest prediction. RC is a measure of the size of the difference ('diff') between the highest (winning) and the second highest output scores. There are 5 reliability classes, defined as follows: 1: diff>0.800, 2: 0.800>diff>0.600, 3: 0.600> diff>0.400, 4: 0.400> diff>0.200, 5: 0.200> diff HFA1 and ACP1 are listed as controls for the prediction calculation of MitoProt II and target P.
Mitochondrial targeting prediction of fungal homologues of proteins in Table 5.
| Gene name | Organism | Entry | Definition | MTS probability/cleavage site/AA sequence length | Upstream MTS probability/cleavage site/AA sequence lengthb |
|
|
| YDL141W | Biotin:apoprotein ligase | 0.0101/NP/690 | 0.9329/51/760 |
|
| KLLA0F05049g | hypothetical protein | 0.0477/NP/693 | 0.9870/30/732 | |
|
| AGOS_ADR085W | hypothetical protein | 0.0314/NP/679 | 0.6205/NP/720 | |
|
| CD36_35420 | biotin: apoprotein ligase | 0.0108/NP/665 | 0.6348/NP/689 | |
|
| ZYRO0F05940g | hypothetical protein | 0.0219/NP/690 | 0.8872/30/722 | |
|
| KLTH0H02024g | biotin−protein ligase | 0.042/NP/660 | 0.4203/53/7110 | |
|
| KAFR_0B01050 | hypothetical protein | 0.0219/NP/690 | 0.8872/30/722 | |
|
|
| YER015W | medium-chain fatty acid-CoA ligase FAA2 | 0.1550/NP/744 | 0.8807/21/765 |
|
| KLLA0B11572g | long-chain fatty acyl-CoA synthetase | 0.1983/NP/745 | 0.2603/NP/749 | |
|
| CAGL0H09460g | hypothetical protein | 0.3174/NP/741 | 0.4203/NP/752 | |
|
| TDEL_0H02790 | hypothetical protein | 0.2323/NP/741 | 0.8615/NP/750 | |
|
|
| YDR502c | S-adenosylmethionine synthetase | 0.0373/NP/384 | 0.6377/NP/430 |
|
| KLLA0C01782g | hypothetical protein | 0.0159/NP/384 | 0.1565/NP/392 | |
|
| AGOS_AFR692C | AFR692Cp | 0.3994/14/382 | 0.1639/NP/387 | |
|
| ZYRO0F17248g | hypothetical protein | 0.0769/NP/382 | 0.3633/NP/412 | |
|
| PAS_chr3_0876 | S-adenosylmethionine synthetase | 0.0139/NP/384 | 0.5093/NP/423*1 | |
|
| CAGL0J08415g | hypothetical protein | 0.0634/15/383 | 0.2737/29/397*2 | |
|
| CAGL0B01122g | hypothetical protein | 0.0727/NP/382 | 0.0308/NP/394 | |
|
| CD36_10740 | S-adenosylmethionine synthetase, putative | 0.0280/NP/385 | 0.9093/58/447 |
: The whole amino acid sequence of each protein was analyzed with MitoProt II. b: The 5′ upstream region of the ORF of each protein was analyzed up to the first in-frame stop codon. *1The P. pastoris SAM homolog is preceded by a 39 aa sequence with high probability of being an MTS and starting with a bona fide ATG that apparently has been omitted in the annotation, probably because this sequence does not show homology with the S. cerevisiae SAM1 or SAM2 genes. *2 Translation from the first likely initiation codon (ATA, Ile) results in a probability of 0.4938.