Literature DB >> 29333696

The Plantain Proteome, a Focus on Allele Specific Proteins Obtained from Plantain Fruits.

Nádia A Campos1, Rony Swennen1,2,3, Sebastien C Carpentier1,3,4.   

Abstract

Proteomics has been applied with great potential to elucidate molecular mechanisms in plants. This is especially valid in the case of non-model crops of which their genome has not been sequenced yet, or is not well annotated. Plantains are a kind of cooking bananas that are economically very important in Africa, India, and Latin America. The aim of this work was to characterize the fruit proteome of common dessert bananas and plantains and to identify proteins that are only encoded by the plantain genome. We present the first plantain fruit proteome. All data are available via ProteomeXchange with identifier PXD005589. Using our in-house workflow, we found 37 alleles to be unique for plantain covered by 59 peptides. Although we do not have access (yet) to whole-genome sequencing data from triploid banana cultivars, we show that proteomics is an easily accessible complementary alternative to detect different allele specific SNPs/SAAPs. These unique alleles might contribute toward the differences in the metabolism between dessert bananas and plantains. This dataset will stimulate further analysis by the scientific community, boost plantain research, and facilitate plantain breeding.
© 2018 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

Entities:  

Keywords:  allele; amino acid polymorphisms; non-model proteomics; plantain

Mesh:

Substances:

Year:  2018        PMID: 29333696      PMCID: PMC6084372          DOI: 10.1002/pmic.201700227

Source DB:  PubMed          Journal:  Proteomics        ISSN: 1615-9853            Impact factor:   3.984


Studies in plant biology through proteomics have increased considerably in the recent years. The main cause is that proteomics provides an insight into the metabolism, and is thus complementary to genomics results.1 It is known that the correlation between mRNA and protein at the same moment of extraction is often low.2, 3 High throughput proteomics for non‐model plants has been used to bypass this problem and to generate more applicable results.4, 5 Polyploidy and allopolyploidy considerably complicate the proteome analysis of crops. Bananas and plantains are polyploid crops originated from two wild diploid species: Musa acuminata (AA), which is highly polymorphous, with spindly plants that grow in clumps, and Musa balbisiana (BB), a more homogeneous hardy plant with a massive pseudo‐trunk. There are nowadays diploid, triploid, or tetraploid genome groups.6, 7 The main genome groups are AA, AB, AAA, AAB, and ABB. Most dessert banana cultivars are AAB or AAA. The Cavendish subgroup, that is sold on the export market8 has an AAA genome constitution while plantains are AAB. Plantains are sweet acid starchy bananas with typically long fruits and are mostly consumed after frying or boiling. Plantains are an important staple crop in West and Central Africa, India, and Latin America.6 Both dessert bananas and plantains are considered a non‐model crop and the complexity of their genomes makes it challenging to analyze the transcriptome and the proteome.9 We used here an easy and reproducible protocol for protein extraction and identification and we present the first proteome of plantain fruits (AAB). We created our own workflow to tackle the difficulties of working with a triploid non‐model species without an available database. The mass spectrometry proteomics data have been deposited at the ProteomeXchange Consortium Partner Repository10 via PRIDE11 with the dataset identifier PXD005589. These results will stimulate further analysis by the scientific community and will boost plantain research and facilitate breeding. Plantains fruits and Cavendish fruits were bought in the local supermarket in Leuven, Belgium. Five biological replicates (fruits) of each cultivar were selected based on their phenotypic characteristics and the same green peel color. All fruits were kept separately, cleaned, peeled, their pulp was cut into thin slices and immersed immediately in liquid nitrogen. All ten samples where lyophilized to a water content of 2.5%. After drying, the samples were hermetically sealed and stored at room temperature until the proteomics analysis was performed. Banana tissues are considered difficult for protein extraction due to the presence of many interfering compounds which makes the extraction process more difficult.12 Lyophilization provides easier material for manipulation without losses in protein content and is an easy and safe way to transport the samples.12 Protein extractions were performed according to the phenol extraction/ammonium acetate precipitation method we published13 and adapted for gel free proteomics.14 Twenty μg of proteins were digested with trypsin (Trypsin Protease, MS Grade Thermo Scientific) and purified by Pierce C18 Spin Columns (Thermo Scientific). The digested samples (0.5 μg/5 μL) were separated in an Ultimate 3000 (Thermo Scientific) UPLC system and then in a Q Exactive Orbitrap mass spectrometer (Thermo Scientific) as described.15 The Q Exactive Orbitrap mass spectrometer (Thermo Scientific, USA) was operated in positive ion mode with a nano spray voltage of 1.5 kV and a source temperature of 250 °C. Proteo Mass LTQ/FT‐Hybrid ESI Pos. Mode Cal Mix (MS CAL5‐1EASUPELCO, Sigma‐Aldrich) was used as an external calibrant and the lock mass 445.12003 as an internal calibrant. The instrument was operated in data‐dependent acquisition (DDA) mode with a survey MS scan at a resolution of 70 000 (fw hm at m/z 200) for the mass range of m/z 400–1600 for precursor ions, followed by MS/MS scans of the top ten most intense peaks with +2, +3, +4, and +5 charged ions above a threshold ion count of 16 000 at 17 500 resolution using normalized collision energy (NCE) of 25 eV with an isolation window of 3.0 m/z and dynamic exclusion of 10 s. All data were acquired with Xcalibur 3.0.63 software (Thermo Scientific). For protein identification, we used MASCOT version 2.2.06 (Matrix Science) against our in house Musa A‐B database containing acuminata AA proteins (dh PahangV1), the non‐redundant unique balbisiana BB proteins (PKW) (http://banana-genome-hub.southgreen.fr/) and the usual contaminants for mass spectrometry (76 220 proteins). The parameters used to search were: parent mass tolerance of 10 PPM, fragment tolerance of 0.02 Da, oxidation of M as variable modification, carbamidomethyl C as fixed modification and up to one missed cleavage was allowed for trypsin. Results from MASCOT were imported to Scaffold version 3.6.5. In Scaffold, the threshold was set to minimum one peptide identified with 95% confidence and the false discovery rate (FDR) was automatically calculated based on default parameters from the software. Using our Musa A‐B database we identified in total 2144 different proteins with 0.2% FDR (Supporting Information, Table 1). Taking into account only the proteins identified in at least two biological replicates reduces this number to 1731, of which 1344 proteins were identified in Cavendish fruits and 1363 in plantain fruits (Supporting Information, Table 1). Esteve et al.16 utilized the proteominer beads to identify the proteome of Cavendish fruits. The authors were able to identify 1131 proteins using a cross species approach (Musa EST database and Uniprot Viridiplantae Database). The three most abundant annotation categories were oxidation‐reduction, ATP binding, and nucleotide binding. In our work, we used a merged database derived from two diploid species (AA and BB). We identified and annotated, 4 years later, more proteins due to the availability of more powerful mass spectrometry and more genetic resources. In the category Molecular Function, 525 different gene ontologies could be retrieved. The five most represented ontologies were GO:0016491 oxidoreductase activity (168 proteins); GO:0016787 hydrolase activity (122 proteins); GO:0000166 nucleotide binding (112 proteins); GO:0003824 catalytic activity (103 proteins); and GO:0005524 ATP binding (99 proteins) (Supporting Information, Table 1). The aim of our study was to characterize the proteome of plantain fruits and compare it to Cavendish fruits to identify important allele specific proteins in a cultivar that is not sequenced, plantain. The main contrasting characteristics between plantain and Cavendish are undoubtedly related to unique alleles that can explain together with epigenetic regulations the different phenotypes.17 To find allele specific peptides in plantain fruits, we used a basic but very useful principle: spectral counting (Scaffold). Potential plantain allele specific peptides were filtered using the following conditions. Maximum spectral count in Cavendish = 0, which means the peptide was never identified in Cavendish; median spectral count in plantain ≠ 0, being identified at least in three biological replicates. To detect single amino acid polymorphisms (SAAPs) in acuminata (A) and balbisiana alleles (B), the identified plantain unique peptides were filtered further. Only peptide sequences that were exclusively identified in a B derived protein accession were accepted. Their allelic acuminata homolog was searched using the Greenphyl homolog function (http://www.greenphyl.org/cgi-bin/get_homologs.cgi) to determine the SAAP. Only plantain specific proteins where the acuminata homolog was successfully identified were accepted (Supporting Information, Table 2). This allowed us to allocate a protein as an A and B allele version. Further annotations of the proteins were retrieved from Uniprot software (http://www.uniprot.org/uploadlists/). Analysis of gene functions from the allelic specific proteins were made through GO enrichment annotations via our in house software (https://labtrop.shinyapps.io/UniGO/). Following our workflow, we identified 37 interesting loci spread over all 11 chromosomes (Table 1). We appointed 59 peptides as B allele specific and 47 peptides as A allele specific. The introduction of M. balbisiana genes is said to be correlated to hardiness, drought tolerance, a changed nutritional value, increased starchiness, and different maturation process.18, 19, 20 To check which pathways are affected by mutations/polymorphisms, we performed a GO annotation for the 37 loci. GO:0004134 (4‐alpha‐glucanotransferase activity) and GO:0004133 (glycogen debranching enzyme activity) are the two most significant GOs for Molecular Function (p‐value 3.4e‐06 and 2.0e‐05, respectively) (Supporting Information, Table 3). One single amino acid change can drastically affect the function of proteins.21, 22, 23 Through evolution, mutations in the coding region of a gene are likely to have a different biological function, especially if the mutations occur in the protein domain, since they are generally considered as the basic units of protein folding, evolution, and function.24
Table 1

Overview of the 59 plantain allele specific peptides and their allelic variant

Uniprot entry and protein annotationBB accession numbera AA accession numbera B alleleb, c A alleleb, c, d
M0S0K5 Protein disulfide‐isomeraseKMMuB_chr1_G01477GSMUA_Achr1T16970_001AASILSKNDPPVVLAKAASVLSKNDPPVVLAK
EADGIVEYLKKEADGIIEYLK
LHEVAENYKGKLHETAENYKGK
M0S1P5 Uncharacterized proteinKMMuB_chr1_G01832GSMUA_Achr1T20870_001AGVENMFGVVGIPVTAVATRAGVQHMFGVVGIPVTAVATR
ATHITQIPRNo ID
GYLAGTPEELKSALSESFSARGYLVGTPEELKSALSESFSAR
LNWLLHFGQPPRLNWLLHFGEPPR
SHPWVEAISKKNo ID
M0RFU7 D‐3‐phosphoglycerate dehydrogenaseKMMuB_chr10_G29495GSMUA_Achr10T06200_001GLGMHVISHDPYAPADRGLGMHVIAHDPYAPADR
M0RKL2 Uncharacterized proteinKMMuB_chr10_G30906GSMUA_Achr10T22870_001LVLPGELAKNo ID
M0RPM3 PectinesteraseKMMuB_chr11_G32183GSMUA_Achr11T05430_001SNTNLMFMGDGIGKNo ID
M0RQL6 Uncharacterized proteinKMMuB_chr11_G32507GSMUA_Achr11T08860_001IVQDQSVLQDEKRIVHDQSVLQDEKR
M0RS60 Uncharacterized proteinKMMuB_chr11_G33244GSMUA_Achr11T14300_001YGVKPDAETLDILNTIARYGVKPDAETLDILNTVAR
M0RTB8 Uncharacterized proteinKMMuB_chr11_G33678GSMUA_Achr11T18380_001KTIEDLSSSHEKKIIEDLSSSHEK
SRDLGLDTSTLSKNo ID
M0RTW1 Uncharacterized proteinKMMuB_chr11_G33846GSMUA_Achr11T20310_001LVPVGYGIKLVAVGYGIK
TCISGDQISKDDVK TYISGDQISKDDVR
WYDSVFGILAPRFPGKWYESVSGILALRFPGK
M0RUN0 Ubiquitin carboxyl‐terminalhydrolaseKMMuB_chr11_G34086GSMUA_Achr11T23000_001FVEESFLDRFYKFIEESFLDRFYK
M0S5Q5 Uncharacterized proteinKMMuB_chr2_G03699GSMUA_Achr2T06640_001SYITGYQASKDDIAVYSALATSPSADYVNVARSYITGYQASKDDISVYSALATSPSADYVNVAR
M0S5R0 Uncharacterized proteinKMMuB_chr2_G03706GSMUA_Achr2T06690_001LSDAESMIALKDFLNKLSDAESLMALKDFLNK
M0SA45 4‐alpha‐glucanotransferaseKMMuB_chr2_G05054GSMUA_Achr2T22040_001TGDDLPVDYDTRFPSVDPTRNo ID
M0SAT8 Formate dehydrogenase, mitochondrialKMMuB_chr3_G05265GSMUA_Achr3T01200_001AAAESGLTVAEVTGSNVVSVAEDELMRAAAEAGLTVAEVTGSNVVSVAEDELMR
LKPFNCNLLYHDRLKPFNCNLLYYDR
M0SB08 Sucrose synthaseKMMuB_chr3_G05330GSMUA_Achr3T01900_001SVPLAADGEAAFNSAKNo ID
VVHGIDVFDPKFNIVSPGADLTIYFPYTEKVVHGIDVFDPKFNIVSPGADMTIYFPYTEK
M0SC42 PectinesteraseKMMuB_chr3_G05670GSMUA_Achr3T05740_001LPRPGQINTITAQGRLPSPGQINTITAQGR
M0SCZ5 Uncharacterized proteinKMMuB_chr3_G05965GSMUA_Achr3T08780_001SYPVNETNASSSEKSDPVNETNASSSEK
TIIKDMVLSSERNo ID
M0SIC8 Uncharacterized proteinKMMuB_chr3_G07974GSMUA_Achr3T27620_001ALVTELKKNo ID
M0SR05 Malate dehydrogenaseKMMuB_chr4_G10674GSMUA_Achr4T21920_001NAIIWGNHSSTQYPDVCHATVKNVIIWGNHSSTQYPDVSHATVK
M0SRM5 Uncharacterized proteinKMMuB_chr4_G10876GSMUA_Achr4T24140_001AFDSYEAVLKDPGVDAAYVPLPTSLHLRAFGSYEAVLEDPGVDAVYVPLPTSLHLR
AIGLAPNSVIVAVGSRAIGLAPNAVIVAVGSR
HLLLEKPTALCAADLDRHLLLEKPTALCAAELDR
WAVAAAECGKWAVAAAER
M0SVV8 Uncharacterized proteinKMMuB_chr5_G12124GSMUA_Achr5T05290_001GYYIQPTIFSDVEDKMKGYYIQPTVFSDVEDKMK
M0SWS3 Protein transport Sec61 subunit betaKMMuB_chr5_G12409GSMUA_Achr5T08440_001ARGSSQSQTTASAGGGARPAGAVPRARGSSQSQTAASAGGGARPAGAVPR
M0SZK9 Nucleoside diphosphate kinaseKMMuB_chr5_G13683GSMUA_Achr5T18300_001GLVGEIINRFEKGLVGEIISR
NVIHGSDSIEGASKNVIHGSDSIEGAR
M0T1H5 Uncharacterized proteinKMMuB_chr5_G14355GSMUA_Achr5T25000_001HVTITAFSKHVTVTAFSK
M0SHD1 Importin subunit alphaKMMuB_chr5_G14553GSMUA_Achr3T24150_001SPPIEEVIQAGVVPRNo ID
M0TA69 Uncharacterized proteinKMMuB_chr6_G17518GSMUA_Achr6T25730_001EASEKHHHHLF EEAEEASGKHHHHLF
M0TB62 Methylthioribose‐1‐phosphate isomeraseKMMuB_chr6_G17823GSMUA_Achr6T29170_001LTAFELVHDRIPATLIADSAVAFLMKLTAFELVHDRIPATLIADSAVAALMK
M0TCA9 Uncharacterized proteinKMMuB_chr6_G18191GSMUA_Achr6T33160_001IELVPVDLLNRPAWYKEKIELVPVDLLNRPAWYKDK
M0TDU3 Uncharacterized proteinKMMuB_chr7_G18687GSMUA_Achr7T01530_001ALADQKDEAFFLANAAAQASRALADQKDEAFFSANAAAQASR
CVKPPIIFGDVSRPKCVKPPIIYGDVSRPK
EGVKYGAGIGPGVYDIHSPRNo ID
M0TEY6 Uncharacterized proteinKMMuB_chr7_G19042GSMUA_Achr7T05460_001ALDEAALVEYIKALDEASLVEYIK
IVGVAHVEDFESISDETKRIVGVAHVEDFESISDVTKR
YMTNLFHDALGFGAAKYMTDLFHDALGFGAAK
M0TIX4 Uncharacterized proteinKMMuB_chr7_G20737GSMUA_Achr7T19370_001IATLYSDVLAATILDAEQcKELKIATLYSDVLTATILDAEQCKELK
RLGLVGLGSSSSTVAAYRRLGLVGLGSSSSTAAAYR
TVDIIGFGSGTVVDQKTVDVIGFGSGTVVDQK
M0TJC2 Uncharacterized proteinKMMuB_chr7_G20876GSMUA_Achr7T20850_001QFNSIPGIMEGNAKPDYATCVKQFNSIPGLMEGTAKPDYATCVK
M0TRR4 Uncharacterized proteinKMMuB_chr8_G23675GSMUA_Achr8T19110_001IMYAEDAPDFGAASDGDGDRNMILGRIMYSEDAPDFGAASDGDGDRNMILGR
M0TS86 Uncharacterized proteinKMMuB_chr8_G23861GSMUA_Achr8T20830_001MKEIAEAYLGSVIKMKEIAEAYLGSVVK
M0U2G0 Uncharacterized proteinKMMuB_chr9_G27509GSMUA_Achr9T22010_001GLLSCGTGVGVSIFANKFPRGLLSCGTGVGVSMFANKFPR
M0U7I3 Uncharacterized proteinKMMuB_chrUn_random_G35868GSMUA_AchrUn_randomT08730_001NKLEDHDELLGADIVQKNo ID
M0UCP7 Uncharacterized proteinKMMuB_chrUn_random_G39488GSMUA_AchrUn_randomT26890_001ALQWTIDNLLDIGETLFVIHVLRPKALQWTIDNLLDIGETLIVIHVLRPK

Chr in the name of the accessions refers to the chromosome number of the locus.

All peptides have been identified with a probability > 99%.

SAAPs are indicated in bold.

No ID: the allelic variant peptide was not confidently identified in our experiment.

Overview of the 59 plantain allele specific peptides and their allelic variant Chr in the name of the accessions refers to the chromosome number of the locus. All peptides have been identified with a probability > 99%. SAAPs are indicated in bold. No ID: the allelic variant peptide was not confidently identified in our experiment. Ramu et al.25 highlighted some possible deleterious mutations in domesticated cassava using whole genomic screening experiments of wild ancestors and cultivars. Like banana, cassava cultivars are clonally propagated and this genomic screening study suggests that many deleterious mutations have not been crossed out. We expect a similar situation in banana. Advanced whole genomic screening experiments enable the identification and interpretation of mutations at the genome level.24, 25 Although we do not have access (yet) to whole‐genome sequencing data from triploid banana cultivars, we show that proteomics is an easily accessible complementary alternative to detect the different allele specific SNPs/SAAPs. To our knowledge, this is the first proteomic investigation in plantain fruits, and the most extensive fruit proteomic study in the genus Musa. This public release of the plantain fruit proteome is an important step for plantain varietal selection and breeding. data dependent acquisition expressed sequence tag false discovery rate gene ontology mass spectrometry normalized collision energy parts per million single amino acid polymorphism single nucleotide polymorphism

Conflict of Interest

The authors have declared no conflict of interest. Supporting Information Click here for additional data file. Supporting Information Click here for additional data file. Supporting Information Click here for additional data file. Supporting Information Click here for additional data file.
  21 in total

1.  Errors in genomics and proteomics.

Authors:  Ritsert C Jansen; Jan-Peter Nap; Ludmila Mlynárová
Journal:  Nat Biotechnol       Date:  2002-01       Impact factor: 54.908

2.  Lyophilization, a practical way to store and transport tissues prior to protein extraction for 2DE analysis?

Authors:  Sebastien C Carpentier; Koen Dens; Ines Van den houwe; Rony Swennen; Bart Panis
Journal:  Proteomics       Date:  2007-09       Impact factor: 3.984

3.  Proteome analysis of orphan plant species, fact or fiction?

Authors:  Sebastien C Carpentier; Twan America
Journal:  Methods Mol Biol       Date:  2014

4.  Exhaustive enumeration of protein domain families.

Authors:  Andreas Heger; Liisa Holm
Journal:  J Mol Biol       Date:  2003-05-02       Impact factor: 5.469

5.  Preparation of protein extracts from recalcitrant plant tissues: an evaluation of different methods for two-dimensional gel electrophoresis analysis.

Authors:  Sebastien Christian Carpentier; Erwin Witters; Kris Laukens; Peter Deckers; Rony Swennen; Bart Panis
Journal:  Proteomics       Date:  2005-07       Impact factor: 3.984

Review 6.  From crossbreeding to biotechnology-facilitated improvement of banana and plantain.

Authors:  Rodomiro Ortiz; Rony Swennen
Journal:  Biotechnol Adv       Date:  2013-10-01       Impact factor: 14.227

7.  The PRoteomics IDEntifications (PRIDE) database and associated tools: status in 2013.

Authors:  Juan Antonio Vizcaíno; Richard G Côté; Attila Csordas; José A Dianes; Antonio Fabregat; Joseph M Foster; Johannes Griss; Emanuele Alpi; Melih Birim; Javier Contell; Gavin O'Kelly; Andreas Schoenegger; David Ovelleiro; Yasset Pérez-Riverol; Florian Reisinger; Daniel Ríos; Rui Wang; Henning Hermjakob
Journal:  Nucleic Acids Res       Date:  2012-11-29       Impact factor: 16.971

8.  Data for the characterization of the HSP70 family during osmotic stress in banana, a non-model crop.

Authors:  Anne-Catherine Vanhove; Wesley Vermaelen; Alberto Cenci; Rony Swennen; Sebastien C Carpentier
Journal:  Data Brief       Date:  2015-02-13

9.  ProteomeXchange provides globally coordinated proteomics data submission and dissemination.

Authors:  Juan A Vizcaíno; Eric W Deutsch; Rui Wang; Attila Csordas; Florian Reisinger; Daniel Ríos; José A Dianes; Zhi Sun; Terry Farrah; Nuno Bandeira; Pierre-Alain Binz; Ioannis Xenarios; Martin Eisenacher; Gerhard Mayer; Laurent Gatto; Alex Campos; Robert J Chalkley; Hans-Joachim Kraus; Juan Pablo Albar; Salvador Martinez-Bartolomé; Rolf Apweiler; Gilbert S Omenn; Lennart Martens; Andrew R Jones; Henning Hermjakob
Journal:  Nat Biotechnol       Date:  2014-03       Impact factor: 54.908

10.  Evolutionary dynamics and biogeography of Musaceae reveal a correlation between the diversification of the banana family and the geological and climatic history of Southeast Asia.

Authors:  Steven B Janssens; Filip Vandelook; Edmond De Langhe; Brecht Verstraete; Erik Smets; Ines Vandenhouwe; Rony Swennen
Journal:  New Phytol       Date:  2016-02-01       Impact factor: 10.151

View more
  3 in total

1.  Suppressive Effect of Huzhentongfeng on Experimental Gouty Arthritis: An In Vivo and In Vitro Study.

Authors:  Zi-Cong Wu; Qiang Xue; Zhen-Ling Zhao; Peng-Jun Zhou; Qun Zhou; Zhen Zhang; Jian-Ping Deng; Ke Yang; Hua Fan; Yi-Fei Wang; Zhi-Ping Wang
Journal:  Evid Based Complement Alternat Med       Date:  2019-12-04       Impact factor: 2.629

2.  From fruit growth to ripening in plantain: a careful balance between carbohydrate synthesis and breakdown.

Authors:  Nadia A Campos; Sophie Colombié; Annick Moing; Cedric Cassan; Delphine Amah; Rony Swennen; Yves Gibon; Sebastien C Carpentier
Journal:  J Exp Bot       Date:  2022-08-11       Impact factor: 7.298

3.  The Plantain Proteome, a Focus on Allele Specific Proteins Obtained from Plantain Fruits.

Authors:  Nádia A Campos; Rony Swennen; Sebastien C Carpentier
Journal:  Proteomics       Date:  2018-02       Impact factor: 3.984

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.