Literature DB >> 29900324

Proteome data of Anopheles stephensi hemolymph using high resolution mass spectrometry.

Gourav Dey1,2,3, Ajeet Kumar Mohanty4, Manish Kumar2,3, Sreelakshmi K Sreenivasamurthy2,3, Ashwani Kumar4, T S Keshava Prasad1,2.   

Abstract

The article provides insights into the protein expression in Anopheles stephensi hemolymph. We carried out data acquisition using a high-resolution LTQ-Orbitrap Velos mass spectrometer to identify the hemolymph proteins of An. stephensi. Experimentally derived mass spectrometry data was analyzed using Proteome Discoverer 2.1 software using two different search algorithms SEQUEST and MASCOT. A total of 1091 proteins were identified from the hemolymph. The identified proteins were categorized for their role in biological processes and molecular functions. The interactions between these proteins were predicted using STRING online tool. Relation can be drawn between the data provided in this study to the already published article "Integrating transcriptomics and proteomics data for accurate assembly and annotation of genomes" (Prasad et al., 2017) [1].

Entities:  

Year:  2018        PMID: 29900324      PMCID: PMC5997892          DOI: 10.1016/j.dib.2018.04.031

Source DB:  PubMed          Journal:  Data Brief        ISSN: 2352-3409


Specifications Table Value of the data The data provides a list of proteins identified to be expressed in the adult An. stephensi female hemolymph. The data provides an insight into the class of proteins identified in the hemolymph and their associated biological processes. The data provides a protein-protein interaction map for An. stephensi hemolymph proteins. The data provides a platform for further experiments to understand the molecular mechanism involved in the hemolymph of An. stephensi.

Data

To identify the proteins expressed in the mosquito hemolymph, we carried out proteomic analysis of hemolymph from sugar fed female An. stephensi (Liston strain) mosquitoes using high-resolution mass spectrometer. Proteins extracted from the samples were fractionated at protein level using 10% SDS-PAGE. Each fraction was then analyzed separately, which, resulted in the identification of 55,204 peptide-spectrum matches (PSMs) corresponding to 8999 peptide groups and 1091 proteins, of which, multiple PSMs were observed for 1057 proteins. The experimental workflow is depicted in (Fig. 1) and the complete list of identified proteins and their corresponding peptides have been provided in Supplementary Table S1 and S2.
Fig. 1

Graphical representation of mosquito insectary conditions (light-dark cycle, temperature and humidity), sample processing steps, fractionation method used, and workflow of data analysis undertaken in the study.

Graphical representation of mosquito insectary conditions (light-dark cycle, temperature and humidity), sample processing steps, fractionation method used, and workflow of data analysis undertaken in the study.

Experimental design, materials and methods

Maintenance of mosquito colony

Female An. stephensi mosquitoes were obtained from the insectary at National Institute of Malaria Research, Field Station at Goa, where a continuous cyclic colony of the species is being maintained at 27 ± 2 °C, 75% relative humidity, a cycle of 12 h in light and 12 h in darkness. The adult mosquitoes were maintained on 10% glucose solution soaked in a cotton pad. From these cyclic colonies, female mosquitoes were used for the experiments and hemolymph was extracted as described [1].

Protein extraction

Mosquito heads were decapitated, and the thorax gently pressed to discharge out the hemolymph which, was then collected and stored at − 80 °C until further use. Samples were homogenized in 2% SDS using a probe sonicator. The tissue lysate was centrifuged at 14,000 rpm for 10 min at 4 °C and the supernatant collected. The extracted proteins were quantified according to modified Lowry's method (Bio-Rad DC Protein assay) and normalized on 10% SDS-PAGE.

Fractionation

For in-gel digestion, 300 µg of protein was resolved on a 10% SDS-PAGE gel as discussed previously [1], [2], [3]. Gel bands were excised out of the gel based on their intensity and then destained, reduced and alkylated priory to trypsin digestion. Trypsin digestion was carried out at 37 °C overnight using Promega sequencing grade trypsin. Digested peptides were extracted out using 40% acetonitrile solution in 0.1% of formic acid. Extracted peptides were desalted using stage-tip C18 columns and vacuum dried prior to acquisition on mass spectrometer.

Mass spectrometry analysis

The fractions were analyzed on LTQ-Orbitrap Velos mass spectrometer (Thermo Scientific, Bremen, Germany) interfaced with Easy-nLCII (Thermo Scientific, Bremen, Germany). Peptides were initially enriched on a reversed phase liquid chromatography (RPLC) pre-column (2 cm, 5 μ–100 Ǻ), followed by separation on an analytical column (11 cm, 3 μ–100 Ǻ) packed in-house with magic AQ C18 material (Michrom Bioresources, Inc, Auburn, CA). The solvent system used included 0.1% aqueous formic acid as solvent A and 95% acetonitrile, 0.1% formic acid as solvent B. The peptides were loaded on the trap column using solvent A, followed by resolution on the analytical column using a gradient of 10–35% solvent B for 75 min at a constant flow rate of 0.25 μL/min. The spray voltage and heated capillary temperature were set to 2.0 kV and 220 °C, respectively and data was acquired in a data dependent manner. Fifteen most intense precursor ions were selected for fragmentation from each MS scan. MS and MS/MS scans were acquired in an Orbitrap mass analyzer with mass resolution of 30,000 and 15,000 at 400 m/z, respectively. The peptides were fragmented by higher energy collision dissociation with normalized collision energy of 39%. The automatic gain control (AGC) for full FTMS was set to 1 million ions and for FT MS/MS was set to 0.1 million ions with maximum accumulation time of 100 ms and 200 ms, respectively.

Gene Ontology categorization and generation of interaction map

Categorization of the identified proteins was performed by fetching information provided in the Panther database [4]. As Panther database has identifiers for the much-studied An. gambiae but not the An. stephensi proteins. We used Biomart (version 0.7) [5] tool provided through VectorBase [6] to fetch the corresponding An. gambiae orthologs for the An. stephensi proteins (Supplementary Table S3). These An. gambiae identifiers were then used to fetch the Gene Ontology related information. To generate a protein-protein interaction map of the identified proteins, we used STRING (Search Tool for the Retrieval Interacting Genes/Proteins) online tool (version 10.5).plugin (version 1.1.0) [7].

Data analysis

The data was searched against the protein database of An. stephensi proteins using Proteome Discoverer software, version 2.1 (Thermo Fischer Scientific, Bremen, Germany). The processing workflow consisted of spectrum selector and percolator besides SEQUEST and Mascot search nodes. Searches were initiated with trypsin as enzyme, allowing a single missed cleavage with a minimum peptide length of 6 amino acids. Other parameters include carbamidomethylation of cysteine as static modifications and oxidation of methionine as variable modification. A false discovery rate (FDR) of 1% was applied to the results. The proteins identified from these searches were analyzed further for functional categorization based on biological processes, molecular function, protein class and cellular component using Gene Ontology annotations provided in Panther database. Majority of the identified proteins were found to be associated with metabolism (46.4%), cellular processes (36.8%) and localization (12.7%). The detailed Gene Ontology categorization of the identified proteins is provided in Fig. 2.
Fig. 2

Functional analysis of the identified protein in hemolymph was performed using PANTHER database. Pie chart representation of A) Biological processes, B) Molecular function, C) Protein Class and Bar graph representation of D) Sub-cellular localization. Major processes and functions identified were further analyzed for their sub-class categorization.

Functional analysis of the identified protein in hemolymph was performed using PANTHER database. Pie chart representation of A) Biological processes, B) Molecular function, C) Protein Class and Bar graph representation of D) Sub-cellular localization. Major processes and functions identified were further analyzed for their sub-class categorization. The total identified proteins were analyzed using online STRING tool to generate an interacting map for the hemolymph proteins (Fig. 3, Supplementary Table S4). We also, compared the list of total identified proteins with the list of immune related proteins provided in ImmunoDB. A total of 40 proteins identified in the hemolymph were found to be associated with immune related functions (Supplementary Table S5) of which, 20 proteins were predicted to interact with each other (Fig. 4, Supplementary Table S6).
Fig. 3

Predicted protein-protein interaction map of proteins with few distinct clusters, identified in the An. stephensi hemolymph. The interaction map was generated using STRING online tool with default parameters.

Fig. 4

Predicted protein-protein interaction map of proteins identified in hemolymph and having a potential role in immunity (predicted by mapping to ImmunoDB database). Online STRING tool with default parameters was used to generate the represented interaction map.

Predicted protein-protein interaction map of proteins with few distinct clusters, identified in the An. stephensi hemolymph. The interaction map was generated using STRING online tool with default parameters. Predicted protein-protein interaction map of proteins identified in hemolymph and having a potential role in immunity (predicted by mapping to ImmunoDB database). Online STRING tool with default parameters was used to generate the represented interaction map.
Subject areaBiology
More specific subject areaVector biology
Type of dataExcel files, figures
How data was acquiredLTQ-Orbitrap Velos ETD mass spectrometer (Thermo Scientific, Bremen, Germany)
Proteome Discoverer 2.1 and MASCOT search engine (Matrix Science, London, UK; version 2.2)
Protein database Anopheles stephensi Indian strain (www.VectorBase.org, release February 25, 2014)
Data formatAnalyzed
Experimental factorsHemolymph were collected from sugar fed mosquitoes and proteins extracted.
Experimental featuresProteome profiling of Anopheles stephensi hemolymph
Data source locationGoa and Bangalore, India
Data accessibilityRaw mass spectrometric data is available via a web application (ProteomeXchange) Consortium (http://proteomecentral.proteomexchange.org) via the PRIDE partner repository with the dataset identifier PXD001128.
Analyzed data is provided along with this article as excel sheets.
  7 in total

1.  Annotation of the zebrafish genome through an integrated transcriptomic and proteomic analysis.

Authors:  Dhanashree S Kelkar; Elayne Provost; Raghothama Chaerkady; Babylakshmi Muthusamy; Srikanth S Manda; Tejaswini Subbannayya; Lakshmi Dhevi N Selvan; Chieh-Huei Wang; Keshava K Datta; Sunghee Woo; Sutopa B Dwivedi; Santosh Renuse; Derese Getnet; Tai-Chung Huang; Min-Sik Kim; Sneha M Pinto; Christopher J Mitchell; Anil K Madugundu; Praveen Kumar; Jyoti Sharma; Jayshree Advani; Gourav Dey; Lavanya Balakrishnan; Nazia Syed; Vishalakshi Nanjappa; Yashwanth Subbannayya; Renu Goel; T S Keshava Prasad; Vineet Bafna; Ravi Sirdeshmukh; Harsha Gowda; Charles Wang; Steven D Leach; Akhilesh Pandey
Journal:  Mol Cell Proteomics       Date:  2014-07-24       Impact factor: 5.911

2.  Brain proteomics of Anopheles gambiae.

Authors:  Sutopa B Dwivedi; Babylakshmi Muthusamy; Praveen Kumar; Min-Sik Kim; Raja Sekhar Nirujogi; Derese Getnet; Priscilla Ahiakonu; Gourav De; Bipin Nair; Harsha Gowda; T S Keshava Prasad; Nirbhay Kumar; Akhilesh Pandey; Mobolaji Okulate
Journal:  OMICS       Date:  2014-06-17

3.  Large-scale gene function analysis with the PANTHER classification system.

Authors:  Huaiyu Mi; Anushya Muruganujan; John T Casagrande; Paul D Thomas
Journal:  Nat Protoc       Date:  2013-07-18       Impact factor: 13.491

4.  The BioMart community portal: an innovative alternative to large, centralized data repositories.

Authors:  Damian Smedley; Syed Haider; Steffen Durinck; Luca Pandini; Paolo Provero; James Allen; Olivier Arnaiz; Mohammad Hamza Awedh; Richard Baldock; Giulia Barbiera; Philippe Bardou; Tim Beck; Andrew Blake; Merideth Bonierbale; Anthony J Brookes; Gabriele Bucci; Iwan Buetti; Sarah Burge; Cédric Cabau; Joseph W Carlson; Claude Chelala; Charalambos Chrysostomou; Davide Cittaro; Olivier Collin; Raul Cordova; Rosalind J Cutts; Erik Dassi; Alex Di Genova; Anis Djari; Anthony Esposito; Heather Estrella; Eduardo Eyras; Julio Fernandez-Banet; Simon Forbes; Robert C Free; Takatomo Fujisawa; Emanuela Gadaleta; Jose M Garcia-Manteiga; David Goodstein; Kristian Gray; José Afonso Guerra-Assunção; Bernard Haggarty; Dong-Jin Han; Byung Woo Han; Todd Harris; Jayson Harshbarger; Robert K Hastings; Richard D Hayes; Claire Hoede; Shen Hu; Zhi-Liang Hu; Lucie Hutchins; Zhengyan Kan; Hideya Kawaji; Aminah Keliet; Arnaud Kerhornou; Sunghoon Kim; Rhoda Kinsella; Christophe Klopp; Lei Kong; Daniel Lawson; Dejan Lazarevic; Ji-Hyun Lee; Thomas Letellier; Chuan-Yun Li; Pietro Lio; Chu-Jun Liu; Jie Luo; Alejandro Maass; Jerome Mariette; Thomas Maurel; Stefania Merella; Azza Mostafa Mohamed; Francois Moreews; Ibounyamine Nabihoudine; Nelson Ndegwa; Céline Noirot; Cristian Perez-Llamas; Michael Primig; Alessandro Quattrone; Hadi Quesneville; Davide Rambaldi; James Reecy; Michela Riba; Steven Rosanoff; Amna Ali Saddiq; Elisa Salas; Olivier Sallou; Rebecca Shepherd; Reinhard Simon; Linda Sperling; William Spooner; Daniel M Staines; Delphine Steinbach; Kevin Stone; Elia Stupka; Jon W Teague; Abu Z Dayem Ullah; Jun Wang; Doreen Ware; Marie Wong-Erasmus; Ken Youens-Clark; Amonida Zadissa; Shi-Jian Zhang; Arek Kasprzyk
Journal:  Nucleic Acids Res       Date:  2015-04-20       Impact factor: 16.971

5.  VectorBase: an updated bioinformatics resource for invertebrate vectors and other organisms related with human diseases.

Authors:  Gloria I Giraldo-Calderón; Scott J Emrich; Robert M MacCallum; Gareth Maslen; Emmanuel Dialynas; Pantelis Topalis; Nicholas Ho; Sandra Gesing; Gregory Madey; Frank H Collins; Daniel Lawson
Journal:  Nucleic Acids Res       Date:  2014-12-15       Impact factor: 16.971

6.  Integrating transcriptomic and proteomic data for accurate assembly and annotation of genomes.

Authors:  T S Keshava Prasad; Ajeet Kumar Mohanty; Manish Kumar; Sreelakshmi K Sreenivasamurthy; Gourav Dey; Raja Sekhar Nirujogi; Sneha M Pinto; Anil K Madugundu; Arun H Patil; Jayshree Advani; Srikanth S Manda; Manoj Kumar Gupta; Sutopa B Dwivedi; Dhanashree S Kelkar; Brantley Hall; Xiaofang Jiang; Ashley Peery; Pavithra Rajagopalan; Soujanya D Yelamanchi; Hitendra S Solanki; Remya Raja; Gajanan J Sathe; Sandip Chavan; Renu Verma; Krishna M Patel; Ankit P Jain; Nazia Syed; Keshava K Datta; Aafaque Ahmed Khan; Manjunath Dammalli; Savita Jayaram; Aneesha Radhakrishnan; Christopher J Mitchell; Chan-Hyun Na; Nirbhay Kumar; Photini Sinnis; Igor V Sharakhov; Charles Wang; Harsha Gowda; Zhijian Tu; Ashwani Kumar; Akhilesh Pandey
Journal:  Genome Res       Date:  2016-11-15       Impact factor: 9.043

7.  The STRING database in 2017: quality-controlled protein-protein association networks, made broadly accessible.

Authors:  Damian Szklarczyk; John H Morris; Helen Cook; Michael Kuhn; Stefan Wyder; Milan Simonovic; Alberto Santos; Nadezhda T Doncheva; Alexander Roth; Peer Bork; Lars J Jensen; Christian von Mering
Journal:  Nucleic Acids Res       Date:  2016-10-18       Impact factor: 16.971

  7 in total
  1 in total

1.  Proteome data of Anopheles stephensi ovary using high-resolution mass spectrometry.

Authors:  Gourav Dey; Ajeet Kumar Mohanty; Manish Kumar; Sreelakshmi K Sreenivasamurthy; Arun H Patil; T S Keshava Prasad; Ashwani Kumar
Journal:  Data Brief       Date:  2018-08-29
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.