Literature DB >> 28480323

Mass spectrometry data from label-free quantitative proteomic analysis of harmless and pathogenic strains of infectious microalgae, Prototheca spp.

Jayaseelan Murugaiyan1, Murat Eravci2, Christoph Weise2, Uwe Roesler1.   

Abstract

Here, we provide the dataset associated with our research article 'label-free quantitative proteomic analysis of harmless and pathogenic strains of infectious microalgae, Prototheca spp.' (Murugaiyan et al., 2017) [1]. This dataset describes liquid chromatography-mass spectrometry (LC-MS)-based protein identification and quantification of a non-infectious strain, Prototheca zopfii genotype 1 and two strains associated with severe and mild infections, respectively, P. zopfii genotype 2 and Prototheca blaschkeae. Protein identification and label-free quantification was carried out by analysing MS raw data using the MaxQuant-Andromeda software suit. The expressional level differences of the identified proteins among the strains were computed using Perseus software and the results were presented in [1]. This DiB provides the MaxQuant output file and raw data deposited in the PRIDE repository with the dataset identifier PXD005305.

Entities:  

Year:  2017        PMID: 28480323      PMCID: PMC5407497          DOI: 10.1016/j.dib.2017.04.006

Source DB:  PubMed          Journal:  Data Brief        ISSN: 2352-3409


Specifications Table Cell culture, harvest and protein isolation In-solution trypsin digestion and mass spectrometry analysis Protein identification and quantitative proteomic analysis Value of the data The data further validate the protein identification presented in Murugaiyan et al. [1]. Data from the LC–MS analysis will provide researchers with detailed information on proteins associated with non-infectious, mildly and severely infectious strains of Prototheca spp. Prototheca spp. represents an “orphan species” whose genome sequence has not yet been sequenced, therefore, this raw data is useful for quick analysis once the genome sequence has become available.

Data

This mass spectrometry data-in-brief is associated with the research article aimed towards identification of differentially expressed proteins among three different strains of Prototheca spp., Prototheca zopfii genotype 1 (GT1), genotype 2 (GT2) and Prototheca blaschkeae [1]. The dataset comprises raw data, results of protein identification using MaxQuant-Andromeda software suit and a list of proteins identified as differentially expressed between non-infectious, infectious and mildly infectious strains of Prototheca spp. The raw data can be downloaded from the PRIDE repository (identifier PXD005305), a compilation of the identified proteins is presented in Supplementary table 1 and the differentially expressed proteins are listed in Table 1.
Table 1

List of proteins identified as differentially expressed.

S.NoUniProt Acc. No.Protein name−Log2(fold change)
P. zopfii GT2 vs P. zopfii GT1P. blaschkeae vs P. zopfii GT1P. zopfii GT2 vs P. blaschkeae
1E1ZQV2Heat shock protein 70−1.0−0.4−0.6
2E1ZLA8Acetyl-coenzyme A synthetase−6.8−6.80.0
3A0A087SCT6Citrate synthase−3.6−3.60.0
4E1ZL24Putative uncharacterized protein−4.6−4.60.0
5A0A087SSM0Actin−0.6+0.1−0.7
6A0A087SFG0Cysteine synthase, chloroplastic/chromoplastic−3.9+1.7−5.6
7A0A087SP16FK506-binding protein 1−1.4−0.1−1.3
8E1ZK88Ubiquitin−1.1+0.3−1.4
9A0A087SJV3Aldehyde dehydrogenase family 2 member B4+0.5−0.5+1.0
10E1ZG37Putative uncharacterized protein+0.6−3.8+4.4
11A0A087SS91Aconitate hydratase, mitochondrial (Aconitase)+0.6−7.3+8.0
12E1ZTB0Fructose-bisphosphate aldolase+8.3+8.8−0.6
13E1ZCI5Putative uncharacterized protein+0.5+0.7−0.3
14E1ZT42V-type H+ ATPase subunit A+0.5+0.4+0.1
15A0A087SJM740S ribosomal protein S10+6.90.0+6.9
16E1ZQY440S ribosomal protein S5+3.30.0+3.3
17A0A087SBU85-methyltetrahydropteroyltriglutamate-homocysteine methyltransferase+6.40.0+6.4
18A0A087SNV160S ribosomal protein L12-1+6.70.0+6.7
19A0A087SKG660S ribosomal protein L6+4.40.0+4.4
20A0A087SN436-phosphogluconate dehydrogenase, decarboxylating (EC 1.1.1.44)+4.5+0.7+3.8
21A0A087SJX6Argininosuccinate synthase+3.60.0+3.6
22A0A087SPA9Carbamoyl-phosphate synthase large chain+4.6+1.1+3.4
23A0A087SHS8Eukaryotic initiation factor 4A-10+0.4−0.2+0.6
24E1ZFZ5Glutamate dehydrogenase+3.10.0+3.1
25A0A087SQ68Phosphate carrier protein, mitochondrial+3.10.0+3.1
26E1ZGA340S ribosomal protein S27+3.3+1.2+2.1
27E1Z7R4Heat shock protein 70+5.3+2.2+3.1
28E1ZSM6Putative uncharacterized protein+3.3+1.2+2.1
29A0A087SF19Adenosylhomocysteinase+1.7−2.4+4.2
30A0A087SK74Elongation factor 1-alpha+0.2−0.6+0.8
31E1Z5R3Putative uncharacterized protein−1.6−5.3+3.8
32E1ZJM1Tubulin beta chain0.0−0.6+0.6
33A0A087SE71Elongation factor Tu−1.5−4.3+2.8
34A0A087SG29Glucose-6-phosphate isomerase−3.2−5.3+2.1
35A0A087SSF2Nucleoside diphosphate kinase 1−2.0−4.5+2.5
36A0A087SL21Ubiquitin-60S ribosomal protein L40-2−3.7−8.2+4.5
37A0A087SI38Acetyl-coenzyme A synthetase0.0+4.6−4.6
38A0A087SBN0ATP synthase subunit beta (Delta-aminolevulinic acid dehydratase)0.0+0.5−0.5
39A0A087SQR3Chaperonin CPN60, mitochondrial+0.2+0.9−0.7
40A0A087SBQ6Glyceraldehyde-3-phosphate dehydrogenase, cytosolic0.0+6.8−6.8
41A0A087SND2Heat shock 70 kDa protein, mitochondrial−0.1+0.6−0.7
42A0A087ST26Phosphoglycerate kinase0.0+5.5−5.5
43A0A087SNN6Stress-induced-phosphoprotein 10.0+3.7−3.7
44A0A087SIY9Succinyl-CoA ligase [ADP-forming] subunit alpha-1, mitochondrial0.0+4.7−4.7
45A0A087S9W3Histone H40.0+2.9−2.9
46E1ZRV3Putative uncharacterized protein+0.7+4.3−3.6
47E1ZMD2Putative uncharacterized protein0.0+2.4−2.4
48A0A087SAK4Chaperone protein ClpB1−0.8+2.0−2.8
49A0A087S9L8Enolase−3.7+1.7−5.4
50A0A087SI84GTP-binding nuclear protein−0.6+0.4−1.0
51E1ZD41Putative uncharacterized protein+3.3−0.7+4.0

(+) indicates upregulated and (−) indicates downregulated.

Statistical significance was calculated using two-way Student-t test and error correction (p value <0.05) using the method of Benjamini–Hochberg [2].

Experimental design

The dataset presented here was obtained from using the label-free proteomic analysis of three different strains of Prototheca species, P. zopfii genotype 1, genotype 2 and P. blaschkeae representing non-infectious, infectious and moderately infectious strains, respectively. In total 17 samples representing six independent cultures for each (only five in P. zopfii genotype 2) were used to generate the dataset (experimental design is shown in Fig. 1). A Student-t test, p-value <0.05% and 1% false discovery rate (FDR) was applied for identification of differentially expressed proteins between (a) P. zopfii genotype 2 and P. zopfii genotype 1; (b) P. blaschkeae and P. zopfii genotype 1; and (c) P. zopfii genotype 2 and P. blaschkeae.
Fig. 1

Schematic overview of the overall analysis workflow.

Materials and methods

Prototheca strains

The following three strains from the culture collection of the Institute of Animal Hygiene and Environmental Health, Freie Universität Berlin, Germany were utilized for this study [3]. P. zopfii genotype 1 (SAG 2063T), non-infectious environmental strain. P. zopfii genotype 2 (SAG 2021T), clinical strain. P. blaschkeae (SAG 2064T), clinical strain.

Cell culture and protein extraction

Following the retrieval from the culture collection, the strains were first streaked in Sabouraud dextrose solid media, incubated at 37 °C until the appearance of visible colonies. The species and genotypes were reconfirmed using MALDI profiling as described [4]. The cell culture and protein extraction was carried out as described [1].

Mass spectrometry analysis

The proteins were subjected to in-solution trypsin digested as described [1]. The resultant peptides were purified using solid phase extraction procedure [5], separated by nanoscale C18 reverse-phase liquid chromatography using the Dionex Ultimate 3000 nanoLC (Dionex/Thermo Fisher Scientific, Idstein, Germany) and directly ionised by electrospray ionization and measured after transfer into an LTQ Orbitrap Velos mass spectrometer (Thermo Fisher Scientific, Bremen, Germany). MS survey scan (m/z 300–1700, resolution 60,000) was acquired in the Orbitrap and the 20 most intensive precursor ions were fragmented.

Data analysis

Data from MS/MS spectra was searched using MaxQuant-Andromeda software suit [6], [7], [8] against the Uniprot FASTA dataset of Chlorella variabilis and Auxenochlorella protothecoides proteome with the parameters settings as described in [1]. Table 2 shows the experimental design and sample file naming format and the dataset associated to the MaxQuant analysis is shown in Supplementary table 2.
Table 2

Experimental design and raw data file naming format.

S. NoSample nameStrain designationReplicatesraw data file designation
1P. zopfii genotype 1SAG 2063T1I_3_01
22I_3_02
33I_3_03
44I_3_04
55I_3_05
66I_3_06











7P. blaschkeaeSAG 2064T1III_3_01
82III_3_02
93III_3_03
104III_3_04
115III_3_05
126III_3_06











13P. zopfii genotype 2SAG 2021T1LZ5_01
142LZ5_02
153sample lost during transit
164LZ5_04
175LZ5_05
186LZ5_06
The statistical analysis was carried out using Perseus 1.4.1.3 (Available online: http://141.61.102.17/ perseus_doku/doku.php?id=start) as described [1]. The differences in protein expression computed in three different ways i) mildly infectious vs environmental strain, ii) severe infection-associated vs environmental strain and iii) severely infectious vs mildly infectious strain were presented in Murugaiyan et al. [1].

Mass Spectrometry dataset deposit

The mass spectrometry data was deposited at the ProteomeXchange (PX) Consortium [9], [10], [11] via the PRIDE (PRoteomics IDEntifications) partner repository at the European Bioinformatics Institute (http://www.ebi.ac.uk/pride/) and is now accessible with the dataset identifier PXD005305.
Subject areaBiology
More specific subject areaLabel-free quantitative proteomics, Bovine mastitis-associated infectious microalgae, Prototheca. spp.
Type of dataRaw data, table and Excel output files
How data was acquiredLC-MS using an UltiMate 3000 HPLC system (Dionex) connected online to an LTQ-Orbitrap Velos (Thermo Scientific)
Data formatRaw, processed
Experimental factors

Cell culture, harvest and protein isolation

In-solution trypsin digestion and mass spectrometry analysis

Protein identification and quantitative proteomic analysis

Experimental featuresWhole cell proteins were extracted from Prototheca cultured strains cultured until mid-logarithmic phase of growth.For each sample protein concentrations were determined using the Bradford assay (Bio-Rad). Proteins were reduced, alkylated and digested with trypsin in solution. Following LC–MS analysis, protein identification and quantification was performed with MaxQuant software, the label-free quantitation was carried out using Perseus software.
Data source locationBerlin, Germany
Data accessibilityData available at PRIDE:PXD005305.
  10 in total

1.  Stop and go extraction tips for matrix-assisted laser desorption/ionization, nanoelectrospray, and LC/MS sample pretreatment in proteomics.

Authors:  Juri Rappsilber; Yasushi Ishihama; Matthias Mann
Journal:  Anal Chem       Date:  2003-02-01       Impact factor: 6.986

2.  Establishment of a matrix-assisted laser desorption ionization time-of-flight mass spectrometry database for rapid identification of infectious achlorophyllous green micro-algae of the genus Prototheca.

Authors:  J Murugaiyan; J Ahrholdt; V Kowbel; U Roesler
Journal:  Clin Microbiol Infect       Date:  2011-07-22       Impact factor: 8.067

3.  Andromeda: a peptide search engine integrated into the MaxQuant environment.

Authors:  Jürgen Cox; Nadin Neuhauser; Annette Michalski; Richard A Scheltema; Jesper V Olsen; Matthias Mann
Journal:  J Proteome Res       Date:  2011-02-22       Impact factor: 4.466

4.  The MaxQuant computational platform for mass spectrometry-based shotgun proteomics.

Authors:  Stefka Tyanova; Tikira Temu; Juergen Cox
Journal:  Nat Protoc       Date:  2016-10-27       Impact factor: 13.491

5.  Diversity within the current algal species Prototheca zopfii: a proposal for two Prototheca zopfii genotypes and description of a novel species, Prototheca blaschkeae sp. nov.

Authors:  Uwe Roesler; Asia Möller; Andreas Hensel; Daniela Baumann; Uwe Truyen
Journal:  Int J Syst Evol Microbiol       Date:  2006-06       Impact factor: 2.747

6.  Label-Free Quantitative Proteomic Analysis of Harmless and Pathogenic Strains of Infectious Microalgae, Prototheca spp.

Authors:  Jayaseelan Murugaiyan; Murat Eravci; Christoph Weise; Uwe Roesler
Journal:  Int J Mol Sci       Date:  2016-12-29       Impact factor: 5.923

7.  The PRoteomics IDEntifications (PRIDE) database and associated tools: status in 2013.

Authors:  Juan Antonio Vizcaíno; Richard G Côté; Attila Csordas; José A Dianes; Antonio Fabregat; Joseph M Foster; Johannes Griss; Emanuele Alpi; Melih Birim; Javier Contell; Gavin O'Kelly; Andreas Schoenegger; David Ovelleiro; Yasset Pérez-Riverol; Florian Reisinger; Daniel Ríos; Rui Wang; Henning Hermjakob
Journal:  Nucleic Acids Res       Date:  2012-11-29       Impact factor: 16.971

8.  ProteomeXchange provides globally coordinated proteomics data submission and dissemination.

Authors:  Juan A Vizcaíno; Eric W Deutsch; Rui Wang; Attila Csordas; Florian Reisinger; Daniel Ríos; José A Dianes; Zhi Sun; Terry Farrah; Nuno Bandeira; Pierre-Alain Binz; Ioannis Xenarios; Martin Eisenacher; Gerhard Mayer; Laurent Gatto; Alex Campos; Robert J Chalkley; Hans-Joachim Kraus; Juan Pablo Albar; Salvador Martinez-Bartolomé; Rolf Apweiler; Gilbert S Omenn; Lennart Martens; Andrew R Jones; Henning Hermjakob
Journal:  Nat Biotechnol       Date:  2014-03       Impact factor: 54.908

9.  Accurate proteome-wide label-free quantification by delayed normalization and maximal peptide ratio extraction, termed MaxLFQ.

Authors:  Jürgen Cox; Marco Y Hein; Christian A Luber; Igor Paron; Nagarjuna Nagaraj; Matthias Mann
Journal:  Mol Cell Proteomics       Date:  2014-06-17       Impact factor: 5.911

10.  2016 update of the PRIDE database and its related tools.

Authors:  Juan Antonio Vizcaíno; Attila Csordas; Noemi del-Toro; José A Dianes; Johannes Griss; Ilias Lavidas; Gerhard Mayer; Yasset Perez-Riverol; Florian Reisinger; Tobias Ternent; Qing-Wei Xu; Rui Wang; Henning Hermjakob
Journal:  Nucleic Acids Res       Date:  2015-11-02       Impact factor: 16.971

  10 in total
  1 in total

1.  Equine Herpesvirus Type 1 Modulates Cytokine and Chemokine Profiles of Mononuclear Cells for Efficient Dissemination to Target Organs.

Authors:  Selvaraj Pavulraj; Mohamed Kamel; Heike Stephanowitz; Fan Liu; Johanna Plendl; Nikolaus Osterrieder; Walid Azab
Journal:  Viruses       Date:  2020-09-08       Impact factor: 5.048

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.