Literature DB >> 30211266

Proteome data of Anopheles stephensi ovary using high-resolution mass spectrometry.

Gourav Dey1,2,3, Ajeet Kumar Mohanty4, Manish Kumar2,3, Sreelakshmi K Sreenivasamurthy2,3, Arun H Patil1,2,5, T S Keshava Prasad1,2, Ashwani Kumar4.   

Abstract

This article contains data on the proteins expressed in the ovaries of Anopheles stephensi, a major vector of malaria in India. Data acquisition was performed using a high-resolution Orbitrap-Velos mass spectrometer. The acquired MS/MS data was searched against An. stephensi protein database comprising of 11,789 sequences. Overall, 4407 proteins were identified, functional analysis was performed for the identified proteins and a protein-protein interaction map predicted. The data provided here is also related to a published article - "Integrating transcriptomics and proteomics data for accurate assembly and annotation of genomes" (Prasad et al., 2017) [1].

Entities:  

Year:  2018        PMID: 30211266      PMCID: PMC6129740          DOI: 10.1016/j.dib.2018.08.066

Source DB:  PubMed          Journal:  Data Brief        ISSN: 2352-3409


Specifications Table Value of the data The data tabulates proteins identified in the adult An. stephensi ovaries, which, plays an essential role in determining the fecundity of this important malaria vector. The data provides better understanding of the underlying physiology of An. stephensi females that can be manipulated to prevent the spread of malaria. The data provides potential protein-protein interactions among the proteins identified in the ovaries. Further study of these proteins can provide deeper insights in to the underlying physiological processes in female An. stephensi. The data provides a baseline platform for further studies on vector-pathogen interactions in female An. stephensi mosquitoes.

Data

To identify proteins expressed in the mosquito ovaries, we carried out high-resolution mass spectrometry-based proteomic analysis of ovaries dissected from sugar fed female An. stephensi Liston (Fig. 1). Extracted proteins were digested using trypsin followed by fractionation at protein level using SDS-PAGE. In addition, we also performed basic reverse phase liquid chromatograhy (bRPLC) fractionation at the peptide level. Each of these fractions were then analyzed on a high-resolution LTQ-Orbitrap Velos mass spectrometer and the acquired data searched against An. stephensi protein database. A total of 4407 were identified of which, 4284 proteins were identified with multiple PSMs. The complete list of proteins identified, and their corresponding peptides are provided in Supplementary Table S1 and S2.
Fig. 1

Pictorial representation of mosquito insectary conditions (light-dark cycle, temperature and humidity), sample processing steps, fractionation method used, and workflow of data analysis undertaken in the study.

Pictorial representation of mosquito insectary conditions (light-dark cycle, temperature and humidity), sample processing steps, fractionation method used, and workflow of data analysis undertaken in the study.

Experimental design, materials and methods

Maintenance of mosquito colony

Continuous cyclic colony of An. stephensi mosquitoes were obtained from the insectary at ICMR-National Institute of Malaria Research, Field Station Goa. Colonies were maintained in a cycle of 12 hours in light and 12 hours in darkness, at 27±2 °C, 75% relative humidity. Adult mosquitoes were fed on 10% glucose solution.

Protein extraction

Altogether, 3,500 pairs of ovaries were dissected in 0.65% normal saline under dissecting microscope and homogenized in 200 µl of 4% SDS solution using a probe sonicator. Supernatant was collected after centrifugation of tissue lysate at 14,000 rpm for 10 minutes at 4 °C. Quantification of the extracted proteins were carried out according to Lowry׳s method (Bio-Rad DC Protein assay).

Fractionation

Proteins extracted from ovaries were fractionated as discussed in our previous studies [1], [2], [3]. For fractionation on gel, 300 µg of protein was resolved on a 10% SDS-PAGE, and 24 bands were excised. Excised bands were destained, reduced and alkylated followed by overnight digestion at 37°C using sequencing grade trypsin (Promega). Digested peptides were extracted and stored at -20 °C, until further use. For fractionation at peptide level, 500 µg of protein were reduced and alkylated prior to trypsin digestion at 37 °C for 16 hours. Digested peptides were cleaned using Sepak C18 column and lyophilized prior to bRPLC fractionation. The lyophilized samples were reconstituted in bRPLC solvent A (10 mM triethyl ammonium bicarbonate (TEABC) in water at, pH 9.5), loaded on XBridge C18, 5 µm 250×4.6 mm column (Waters, Milford, MA) connected to Agilent 1100 series HPLC system. The peptide digest was resolved using a gradient of 5% to 100% solvent B (10 mM TEABC in Acetonitrile, pH 9.5) in 70 minutes. The total fractionation time was 60 minutes. Eluting peptides were collected in a 96 well plate and then concatenated into 36 fractions. Fractions were dried and reconstituted in 20 μl of 0.1% formic acid before mass spectrometric analysis [4].

Mass spectrometry analysis

The fractionated and cleaned peptides were analyzed on LTQ-Orbitrap Velos mass spectrometer (Thermo Scientific, Bremen, Germany) interfaced with Easy-nLCII (Thermo Scienific, Bremen, Germany). Peptides were enriched on a pre-column (2 cm, 5μ–100Ǻ), followed by separation on an analytical column (11 cm, 3μ–100Ǻ) packed in-house with magic AQ C18 material (Michrom Bioresources, Inc, Auburn, CA). The solvents used contained 0.1% aqueous formic acid as solvent A and 95% acetonitrile, 0.1% formic acid as solvent B. Solvent A was used to load peptides on to the trap column, followed by resolution on the analytical column using a gradient of 10–35% solvent B for 75 min at a constant flow rate of 0.25 μl/min. The spray voltage and heated capillary temperature were set to 2.0 kV and 220 °C. Data acquisition was carried out in a data dependent manner with a full scan in the range of m/z 350 to 2000. Orbitrap mass analyzer was used at both MS and MS/MS scans. Full MS scans were measured at a resolution 30,000 at m/z 400 and fifteen most intense precursors were selected for MS/MS fragmentation. Fragmentation was carried using higher-energy collisional dissociation (HCD) method and detected at a mass resolution of 15,000 at m/z 400. For full FTMS automatic gain control (AGC) was set to 1 million ions while FT MS/MS was set to 0.1 million ions with maximum accumulation time of 100 ms and 200 ms, respectively.

Functional categorization and prediction of interaction map

Categorization of the Gene Ontology terms for the identified proteins was performed by fetching information provided in the Panther database [5]. As Panther database has identifiers only for An. gambiae, which, is an ortholog of An. stephensi, we fetched the An. gambiae orthologs for the identified An. stephensi proteins using Biomart (version 0.7) [6] tool provided through VectorBase [7] (Supplementary Table S3). These An. gambiae identifiers were then used to fetch the Gene Ontology information. A predicted protein-protein interaction map of the identified proteins was then generated using STRING (Search Tool for the Retrieval Interacting Genes/Proteins) online tool (version 10.5) [8].

Data analysis

The mass spectrometry-derived data was searched against a database containing An. stephensi proteins along with known contaminants using Proteome Discoverer software, version 2.1 (Thermo Fischer Scientific, Bremen, Germany). The workflow consisted of Spectrum selector and SEQUEST and MASCOT search nodes. Trypsin was used as an enzyme allowing a single missed cleavage and 6 amino acids as the minimum peptide length. Search parameters also included carbamidomethylation of cysteine as static modifications and oxidation of methionine as a variable modification. Results were retrieved using a 1% false discovery rate (FDR) at the peptide level. The proteins identified were annotated for their role in biological processes, molecular functions and their localization using Gene Ontology. Most of the identified proteins belonged to the protein class of nucleic acid binding proteins (23.1%), hydrolases (11.8%), transferases (10%) and oxidoreductases (7.5%). Functional annotations categorized the identified proteins to be associated with biological processes such as metabolism (33.6%), cellular processes (32%) and biogenesis (9.8%) (Fig. 2). We also generated a predictive protein-protein interaction map for the proteins identified in ovaries (Fig. 3, Supplementary Table S4). The interaction map showed distinctive clusters associated with ribosomal pathway, regulation of gene expression, RNA processing and proteasome pathways.
Fig. 2

Functional analysis of the identified protein in An. stephensi ovaries. Gene Ontology information was fetched from PANTHER database. A) Biological processes, B) Molecular function, are represented in bar graph and major process and functions are further categorized into their sub-classes and represented as pie charts; C) Bar graph representation of Protein Class; D) Pie chart of Sub-cellular localization of the identified proteins.

Fig. 3

Predicted protein-protein interaction map of proteins identified in the ovary. Few of the identified proteins were found to cluster together into distinct clusters with associated roles in ribosomal pathway, regulation of gene expression, RNA processing and proteasomal pathway.

Functional analysis of the identified protein in An. stephensi ovaries. Gene Ontology information was fetched from PANTHER database. A) Biological processes, B) Molecular function, are represented in bar graph and major process and functions are further categorized into their sub-classes and represented as pie charts; C) Bar graph representation of Protein Class; D) Pie chart of Sub-cellular localization of the identified proteins. Predicted protein-protein interaction map of proteins identified in the ovary. Few of the identified proteins were found to cluster together into distinct clusters with associated roles in ribosomal pathway, regulation of gene expression, RNA processing and proteasomal pathway. The potential role of certain genes in mosquitoes as agonist or antagonist to the development of Plasmodium parasite in the mosquito midgut and salivary glands has been studied through gene knockdown experiments [9]. We observed 30 of such proteins to be expressed in the An. stephensi ovaries with potential role in xenobiotics and drug metabolism that involves Cytochrome P450 (Fig. 4, Supplementary Table S5). However, these proteins were not restricted only to the ovaries and were also observed to be expressed in midgut, fatbody and salivary glands [1], [10], [11], [12]. The complete list of proteins identified in ovaries that mapped to the list of vector proteins with experimentally proven roles in parasite development is provided in (Supplementary Table S6). To further evaluate for related functions of the proteins identified in the ovaries, we mapped our data against Immunodb, which is a resource that provides a list of immune-related gene families. Seventy of the identified proteins were observed to have a potential role in mosquito vector immune responses. The list of proteins mapping to Immunodb is provided in (Supplementary Table S7). To evaluate the interaction network of the proteins mapping to Immunodb data, we generated a putative interacting map of the 70 proteins using STRING online tool (Fig. 5, Supplementary Table S8). Most of these proteins were found to be associated with peroxisome, regulation of autophagy, gamete generation and spliceosome.
Fig. 4

Predicted protein-protein interaction map of proteins identified in the An. stephensi ovary for which, their An. gambiae orthologs were found to have a role in vector-pathogen interactions.

Fig. 5

Predicted protein-protein interaction map of proteins and having a potential role in immunity (predicted by mapping to ImmunoDB database). Proteins associated with regulation of autophagy, spliceosome machinery, gamete generation and peroxisomes were predicted to be interacting among each other.

Predicted protein-protein interaction map of proteins identified in the An. stephensi ovary for which, their An. gambiae orthologs were found to have a role in vector-pathogen interactions. Predicted protein-protein interaction map of proteins and having a potential role in immunity (predicted by mapping to ImmunoDB database). Proteins associated with regulation of autophagy, spliceosome machinery, gamete generation and peroxisomes were predicted to be interacting among each other. The data presented here, thus facilitates further analysis and studies on An. stephensi by providing a baseline data.
Subject areaBiology
More specific subject areaVector biology
Type of dataExcel files, figures
How data was acquiredLTQ-Orbitrap Velos ETD mass spectrometer (Thermo Scientific, Bremen, Germany)
Proteome Discoverer 2.1 and MASCOT search engine (Matrix Science, London, UK; version 2.2)
Protein database Anopheles stephensi Liston (Indian strain) (www.VectorBase.org, release February 25, 2014)
Data formatAnalyzed
Experimental factorsOvaries were dissected from sugar fed mosquitoes and proteins extracted.
Experimental featuresProteome profiling of Anopheles stephensi ovaries
Data source locationGoa and Bangalore, India
Data accessibilityRaw mass spectrometric data is available from a web application (ProteomeXchange) Consortium
(http://proteomecentral.proteomexchange.org/cgi/GetDataset?ID=PXD001128).
Analyzed data is provided along with this article as excel sheets.
Related research articlePrasad et al. [1].
  12 in total

1.  Mosquito-Borne Diseases and Omics: Tissue-Restricted Expression and Alternative Splicing Revealed by Transcriptome Profiling of Anopheles stephensi.

Authors:  Sreelakshmi K Sreenivasamurthy; Anil K Madugundu; Arun H Patil; Gourav Dey; Ajeet Kumar Mohanty; Manish Kumar; Krishna Patel; Charles Wang; Ashwani Kumar; Akhilesh Pandey; Thottethodi Subrahmanya Keshava Prasad
Journal:  OMICS       Date:  2017-07-14

2.  Annotation of the zebrafish genome through an integrated transcriptomic and proteomic analysis.

Authors:  Dhanashree S Kelkar; Elayne Provost; Raghothama Chaerkady; Babylakshmi Muthusamy; Srikanth S Manda; Tejaswini Subbannayya; Lakshmi Dhevi N Selvan; Chieh-Huei Wang; Keshava K Datta; Sunghee Woo; Sutopa B Dwivedi; Santosh Renuse; Derese Getnet; Tai-Chung Huang; Min-Sik Kim; Sneha M Pinto; Christopher J Mitchell; Anil K Madugundu; Praveen Kumar; Jyoti Sharma; Jayshree Advani; Gourav Dey; Lavanya Balakrishnan; Nazia Syed; Vishalakshi Nanjappa; Yashwanth Subbannayya; Renu Goel; T S Keshava Prasad; Vineet Bafna; Ravi Sirdeshmukh; Harsha Gowda; Charles Wang; Steven D Leach; Akhilesh Pandey
Journal:  Mol Cell Proteomics       Date:  2014-07-24       Impact factor: 5.911

3.  Response to Blood Meal in the Fat Body of Anopheles stephensi Using Quantitative Proteomics: Toward New Vector Control Strategies Against Malaria.

Authors:  Manish Kumar; Ajeet Kumar Mohanty; Sreelakshmi K Sreenivasamurthy; Gourav Dey; Jayshree Advani; Sneha M Pinto; Ashwani Kumar; Thottethodi Subrahmanya Keshava Prasad
Journal:  OMICS       Date:  2017-09-05

4.  Large-scale gene function analysis with the PANTHER classification system.

Authors:  Huaiyu Mi; Anushya Muruganujan; John T Casagrande; Paul D Thomas
Journal:  Nat Protoc       Date:  2013-07-18       Impact factor: 13.491

5.  The BioMart community portal: an innovative alternative to large, centralized data repositories.

Authors:  Damian Smedley; Syed Haider; Steffen Durinck; Luca Pandini; Paolo Provero; James Allen; Olivier Arnaiz; Mohammad Hamza Awedh; Richard Baldock; Giulia Barbiera; Philippe Bardou; Tim Beck; Andrew Blake; Merideth Bonierbale; Anthony J Brookes; Gabriele Bucci; Iwan Buetti; Sarah Burge; Cédric Cabau; Joseph W Carlson; Claude Chelala; Charalambos Chrysostomou; Davide Cittaro; Olivier Collin; Raul Cordova; Rosalind J Cutts; Erik Dassi; Alex Di Genova; Anis Djari; Anthony Esposito; Heather Estrella; Eduardo Eyras; Julio Fernandez-Banet; Simon Forbes; Robert C Free; Takatomo Fujisawa; Emanuela Gadaleta; Jose M Garcia-Manteiga; David Goodstein; Kristian Gray; José Afonso Guerra-Assunção; Bernard Haggarty; Dong-Jin Han; Byung Woo Han; Todd Harris; Jayson Harshbarger; Robert K Hastings; Richard D Hayes; Claire Hoede; Shen Hu; Zhi-Liang Hu; Lucie Hutchins; Zhengyan Kan; Hideya Kawaji; Aminah Keliet; Arnaud Kerhornou; Sunghoon Kim; Rhoda Kinsella; Christophe Klopp; Lei Kong; Daniel Lawson; Dejan Lazarevic; Ji-Hyun Lee; Thomas Letellier; Chuan-Yun Li; Pietro Lio; Chu-Jun Liu; Jie Luo; Alejandro Maass; Jerome Mariette; Thomas Maurel; Stefania Merella; Azza Mostafa Mohamed; Francois Moreews; Ibounyamine Nabihoudine; Nelson Ndegwa; Céline Noirot; Cristian Perez-Llamas; Michael Primig; Alessandro Quattrone; Hadi Quesneville; Davide Rambaldi; James Reecy; Michela Riba; Steven Rosanoff; Amna Ali Saddiq; Elisa Salas; Olivier Sallou; Rebecca Shepherd; Reinhard Simon; Linda Sperling; William Spooner; Daniel M Staines; Delphine Steinbach; Kevin Stone; Elia Stupka; Jon W Teague; Abu Z Dayem Ullah; Jun Wang; Doreen Ware; Marie Wong-Erasmus; Ken Youens-Clark; Amonida Zadissa; Shi-Jian Zhang; Arek Kasprzyk
Journal:  Nucleic Acids Res       Date:  2015-04-20       Impact factor: 16.971

Review 6.  A compendium of molecules involved in vector-pathogen interactions pertaining to malaria.

Authors:  Sreelakshmi K Sreenivasamurthy; Gourav Dey; Manjula Ramu; Manish Kumar; Manoj K Gupta; Ajeet K Mohanty; H C Harsha; Pushkar Sharma; Nirbhay Kumar; Akhilesh Pandey; Ashwani Kumar; T S Keshava Prasad
Journal:  Malar J       Date:  2013-06-26       Impact factor: 2.979

7.  VectorBase: an updated bioinformatics resource for invertebrate vectors and other organisms related with human diseases.

Authors:  Gloria I Giraldo-Calderón; Scott J Emrich; Robert M MacCallum; Gareth Maslen; Emmanuel Dialynas; Pantelis Topalis; Nicholas Ho; Sandra Gesing; Gregory Madey; Frank H Collins; Daniel Lawson
Journal:  Nucleic Acids Res       Date:  2014-12-15       Impact factor: 16.971

8.  Integrating transcriptomic and proteomic data for accurate assembly and annotation of genomes.

Authors:  T S Keshava Prasad; Ajeet Kumar Mohanty; Manish Kumar; Sreelakshmi K Sreenivasamurthy; Gourav Dey; Raja Sekhar Nirujogi; Sneha M Pinto; Anil K Madugundu; Arun H Patil; Jayshree Advani; Srikanth S Manda; Manoj Kumar Gupta; Sutopa B Dwivedi; Dhanashree S Kelkar; Brantley Hall; Xiaofang Jiang; Ashley Peery; Pavithra Rajagopalan; Soujanya D Yelamanchi; Hitendra S Solanki; Remya Raja; Gajanan J Sathe; Sandip Chavan; Renu Verma; Krishna M Patel; Ankit P Jain; Nazia Syed; Keshava K Datta; Aafaque Ahmed Khan; Manjunath Dammalli; Savita Jayaram; Aneesha Radhakrishnan; Christopher J Mitchell; Chan-Hyun Na; Nirbhay Kumar; Photini Sinnis; Igor V Sharakhov; Charles Wang; Harsha Gowda; Zhijian Tu; Ashwani Kumar; Akhilesh Pandey
Journal:  Genome Res       Date:  2016-11-15       Impact factor: 9.043

9.  The STRING database in 2017: quality-controlled protein-protein association networks, made broadly accessible.

Authors:  Damian Szklarczyk; John H Morris; Helen Cook; Michael Kuhn; Stefan Wyder; Milan Simonovic; Alberto Santos; Nadezhda T Doncheva; Alexander Roth; Peer Bork; Lars J Jensen; Christian von Mering
Journal:  Nucleic Acids Res       Date:  2016-10-18       Impact factor: 16.971

10.  Mapping Anopheles stephensi midgut proteome using high-resolution mass spectrometry.

Authors:  Ajeet Kumar Mohanty; Gourav Dey; Manish Kumar; Sreelakshmi K Sreenivasamurthy; Sandeep Garg; T S Keshava Prasad; Ashwani Kumar
Journal:  Data Brief       Date:  2018-02-21
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.