| Literature DB >> 26257763 |
Daniel Köhler1, Dirk Dobritzsch1, Wolfgang Hoehenwarter2, Stefan Helm1, Jürgen M Steiner3, Sacha Baginsky1.
Abstract
Glaucophyta, rhodophyta, and chloroplastida represent the three main evolutionary lineages that diverged from a common ancestor after primary endosymbiosis. Comparative analyses between members of these three lineages are a rich source of information on ancestral plastid features. We analyzed the composition and the cleavage site of cyanelle transit peptides from the glaucophyte Cyanophora paradoxa by terminal amine labeling of substrates (TAILS), and compared their characteristics to those of representatives of the chloroplastida. Our data show that transit peptide architecture is similar between members of these two lineages. This entails a comparable modular structure, an overrepresentation of serine or alanine and similarities in the amino acid composition around the processing peptidase cleavage site. The most distinctive difference is the overrepresentation of phenylalanine in the N-terminal 1-10 amino acids of cyanelle transit peptides. A quantitative proteome analysis with periplasm-free cyanelles identified 42 out of 262 proteins without the N-terminal phenylalanine, suggesting that the requirement for phenylalanine in the N-terminal region is not absolute. Proteins in this set are on average of low abundance, suggesting that either alternative import pathways are operating specifically for low abundance proteins or that the gene model annotation is incorrect for proteins with fewer EST sequences. We discuss these two possibilities and provide examples for both interpretations.Entities:
Keywords: TAILS; cyanelle; evolution; quantitative proteomics; transit peptide
Year: 2015 PMID: 26257763 PMCID: PMC4510345 DOI: 10.3389/fpls.2015.00559
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Figure 1(A) Minimal peptide position of peptides identified from nucleus encoded cyanelle proteins (black bars) and non-cyanelle proteins (gray bars). Peptides are sorted into bins representing the amino acid position number of the peptide start. (B) Numbers of identified nucleus encoded cyanelle proteins. Assignment to the cyanelle was based on the detection here, a comparison with a recent cyanelle proteome study and a BLAST search against the Arabidopsis chloroplast reference proteome.
List of all nucleus encoded cyanelle proteins that match the minimal starting range 11–110 as identified by TAILS.
| 10247_cds:3280 | AEGPAPNVNPLTGAPR | MA | 75 | − | − | + |
| 10778_cds:3661 | AGTAAGASAAVAEAR | MRRVAALLSA | 14 | − | − | + |
| 10872_cds:3470 | AGKQITVGVIGAGR | MAA | 79 | − | − | + |
| 12207_cds:4390 | TGGASEPGPGAYTIDVGK | MGEGTAAHGG | 59 | − | − | + |
| 12588_cds:4370 | SINVPAVMSDVNAAAK | MRDTTDSDTG | 66 | − | − | + |
| 12656_cds:4591 | AGEEASMAHYAAPAVGALR | MVQEGKLAPE | 64 | − | − | + |
| 37200_cds:4999 | EEVSAAKKELAER | MQRLYRI | 84 | − | − | + |
| 37396_cds:7195 | LVNDSSPDPSLQR | MQP | 11 | − | − | + |
| 38265_cds:8081 | TARWRASSVASEANGQR | MQLPPGAAPA | 90 | − | − | + |
| 38986_cds:8738 | FAGDALVAFVAAGELEGGGR | MLRLSSTLNA | 29 | − | − | + |
| 39427_cds:8972 | AGAEGGLEPDR | MA | 81 | − | − | + |
| 52758_cds:10110 | AELVPLDYGAIR | MAALPVVGQI | 16 | − | − | + |
| 52931_cds:10073 | VNAATGDAAANRAR | MAVGAGSLNT | 82 | − | − | + |
| 54348_cds:11201 | EPAQPSPEELAAREAER | MAEAGEASPA | 91 | − | − | + |
| 6638_cds:99/60 | GLEMIASENFTSR | MSAPRRQLPM | 37 | − | − | + |
| 6977_cds:523/2 | AADSGAGAGGAAAGPR | MAMQRDGSLP | 40 | − | − | + |
| 7258_cds:763/1 | EAATSPGPSASAAPAAGGR | MDTA | 65 | − | − | + |
| 7675_cds:1094 | AGSSSAAPSPAK | MDRD | 63 | − | − | + |
| 8688_cds:1951 | AAAPLEEDYTATPSGLR | MIPAAA | 84 | − | − | + |
| 9509_cds:2612 | ALDATKDAVGSAAR | MAPA | 84 | − | − | + |
| 25545_cds:5400 | GVLKVFLENVIR | MSGRGKGGKG | 57 | + | − | − |
| 37212_cds:6941 | KSGEATGLDVLIVR | MNA | 106 | + | − | − |
| 39194_cds:8857 | AVPLLAYKPKTTNTR | MA | 51 | + | − | + |
| 54421_cds:11004 | SRVPVELIFDQGFGDVFVTR | MSPAVSPKSS | 62 | + | − | − |
| 56196_cds:11903 | NSAAGASDGSFNPYR | MAYISSLPGS | 70 | + | − | + |
| 8154_cds:1523 | SGPAGDPRFPVMSGEVPQDLSVYDKDGVAR | MTAVLPTTQT | 110 | + | − | − |
| 8403_cds:1901 | SGGITKSEPTFGGIETPAGAEVEIIR | MAQQITGVNP | 12 | + | − | − |
| 8517_cds:2013 | SIPIIEFAPKTTNTR | MAA | 52 | + | − | + |
| 8620_cds:2109 | KKGSGAGRPTGAPGGSMPPMPSQAAAGDLPVIYIFAR | M | 58 | + | − | − |
| 8899_cds:2361 | DTSGSAIKRITNPSSSPIAGDNR | MAS | 43 | + | − | + |
| 38654_cds:7924 | TDHPDYLGSLTQASVVR | MSSAETVL | 99 | − | − | + |
| 54708_cds:11657 | AKTADAPAYAR | MPPKRSSKRG | 26 | − | − | + |
| 25596_cds:5692 | NAEADKLTEATAR | MA | 61 | + | − | + |
| 37768_cds:7858 | VATDKITPLPMKGNAPLSDADPEVFDLIER | MAA | 64 | + | − | + |
| 38084_cds:7872 | AAQLGVALTNGNAR | MVSPVVKRPI | 106 | + | − | − |
| 39122_cds:8823 | GVDLEAGGR | MWPLGGYGPA | 29 | + | − | − |
| 8923_cds:2292 | SGFGREESIIADMSLADYGR | MALQLNLKAK | 18 | + | − | − |
| 12743_cds:4543 | ASAKPDTGPSQGQFSR | MLRAAAAARR | 31 | − | − | + |
| 40999_cds:9485 | ATNAVATPAYSELSSVPLYQQEKKIER | MSATMLRLAR | 31 | + | − | − |
| 7974_cds:1883 | SSTSQPLASVVTGRK | MLSLRRLAPS | 22 | + | − | − |
| 9664_cds:3056 | GATVVDTGAPIMIPVGR | MSTPAKAAPA | 79 | + | − | − |
| 10978_cds:3594 | EAPAESSDAEATDR | MESC | 71 | − | + | + |
| 11467_cds:3786 | ASASAPKLEVENVVIIGSGPAGLTSAIYTAR | MLG | 95 | − | + | − |
| 13287_cds:4604 | ATATAAQGVLPR | MAAVGTSLLR | 14 | − | + | − |
| 26296_cds:6567 | SLANDSSAGNDQLDIRR | MA | 101 | − | + | + |
| 37389_cds:7124 | EGLTYDQLNSANYLKLKGTGISNTCPTLSTSAR | MA | 85 | − | + | + |
| 38529_cds:8386 | SSSAAAADAGPSQR | MPS | 55 | − | + | + |
| 38915_cds:8582 | SSQQVKDASSTESQSATAR | M | 44 | − | + | − |
| 39038_cds:8649 | SANNLPPTYTGVSR | MA | 75 | − | + | + |
| 39838_cds:8861 | GLGNSFGHLFR | M | 38 | − | + | − |
| 53054_cds:9835 | QFSEVYVEVAKPLGVTFEEGPDGR | MA | 69 | − | + | − |
| 53171_cds:10566 | ATQAVASSASICASAAPR | MA | 14 | − | + | + |
| 53311_cds:10649 | SAASPSSEDILIVGAGVLGSR | M | 70 | − | + | − |
| 7674_cds:1262 | AIQPAAVDLGMPLDVLSR | MAA | 62 | − | + | − |
| 8074_cds:1451 | SLIEQQEEAEKAAR | META | 73 | − | + | + |
| 8340_cds:1819 | SVEQLQKVLEEQKALPR | MQWLRAAATA | 97 | − | + | − |
| 8437_cds:1947 | EAGGGSSPASVSPLDGAEDDGLAVRR | M | 57 | − | + | − |
| 10670_cds:3614 | EAAPAQQKGIEEHVATDVVPDSFLR | MESA | 69 | + | + | − |
| 11368_cds:3893 | ANPAVASAACSSVAPARAIR | MA | 16 | + | + | − |
| 25517_cds:5494 | AVEIAAATVKELR | MA | 15 | + | + | + |
| 25643_cds:5691 | ATAAMTPVDDK | MSA | 99 | + | + | + |
| 37061_cds:6844 | SAASDDVPDMGKR | MA | 61 | + | + | + |
| 37854_cds:7838 | EEEEAAPAEKVEKKAAAPKPFSVPTLNLNAPTPIFGGSTGGLLR | MNA | 54 | + | + | − |
| 38681_cds:8352 | SSVAAESSEQTSQPATTSSKTER | MA | 74 | + | + | − |
| 40175_cds:9516 | RMSAITAEPVGTPETLEYR | MEVA | 73 | + | + | + |
| 52898_cds:9949 | QAEDATAEIADTR | MAAAPAVVRA | 11 | + | + | + |
| 52975_cds:10182 | EEQSSPSAAGKSDAESDASGVSR | MQAD | 59 | + | + | − |
| 53396_cds:10332 | ASTEVVTGDDMKGETFTR | MA | 77 | + | + | + |
| 53411_cds:10720 | AKDAVKPFYDDAFIGHLSTPISNSSAVNGLLANLPAYR | MA | 55 | + | + | − |
| 53536_cds:10786 | GGLDLGSLSPAKGSNR | MA | 69 | + | + | + |
| 54611_cds:11381 | SATGFAAAIADKSVLDFDSKIFTKEVIQLADTQESIVR | MSSA | 93 | + | + | − |
| 7394_cds:1013 | ANPADAPPPPFGAGPKEDALEGFVSSIPGSLR | MQTENSTAPA | 77 | + | + | + |
| 7699_cds:1095 | SASGSAASKDAVIDQFTSNVMTTYAR | MESVAA | 90 | + | + | − |
| 8081_cds:1633 | SFSTAKSDEIFKR | MDTA | 61 | + | + | + |
| 8139_cds:1758 | EDEQPKPSVPEPDVSGNALGLGQLQNVFTR | MAA | 74 | + | + | − |
| 8721_cds:2147 | STSIVTTTKHTEIVELLVPESQR | MA | 76 | + | + | − |
| 8725_cds:2214 | SADSGSLLDNIPFR | MAA | 71 | + | + | − |
The proteins are represented by their CDS-Identifier and the identified peptide representing the minimal start identified for the protein. Phenylalanine (F) within the first 10 amino acids is highlighted and underlined. The data underlying the assignment of proteins as cyanelle proteins are indicated with a plus (+).
“published”: Cyanelle proteins previously identified by proteomics (Facchinelli et al., .
“BLAST”: C. paradoxa proteins identified as cyanelle proteins by BLAST searches against an A. thaliana chloroplast proteome reference table (van Wijk and Baginsky, .
“This study”: Proteins identified by mass spectrometry in samples of purified cyanelles in this study.
Figure 2(A) Amino acid composition of transit peptides identified from Arabidopsis and Cyanophora. For every amino acid, the average occurrence in transit peptides is provided along with the standard deviation. (B) Topology heat map of amino acid distribution in transit peptides of Arabidopsis thaliana and Cyanophora paradoxa. The number of amino acids in every transit peptide was set to 100% and the relative position of amino acids in the transit peptide was calculated on this basis. The darker the color the higher the frequency of occurrence.
Figure 3Tripartite structure of transit peptides from . The average hydropathy trend of all detected transit peptides is shown. The relative position in the transit peptides is given in the top panel as percentage. According to the hydropathy trend, transit peptides can be divided into three modules as indicated in the panel below.
Figure 4Characteristics of the transit peptide cleavage site in Arabidopsis and Cyanophora (A) Frequency of amino acid occurrence in the -1 position C-terminal of the cleavage site. (B) Alignment of the identified N-terminus with 10 amino acids up- and downstream of the anticipated cleavage site. First panel, all identified transit peptide, second panel transit peptides with a C-terminal arginine, and third panel transit peptides with a C-terminal alanine. The alignment was performed with WebLogo 3 (Crooks et al., 2004).
Figure 5Frequency of occurrence of amino acids at the most N-terminal position in the mature protein.