| Literature DB >> 32835036 |
Bernardina Rivera1, Alejandro Leyva1, María Magdalena Portela1,2, Gonzalo Moratorio3,4, Pilar Moreno3,4, Rosario Durán1, Analía Lima1.
Abstract
Since January 2020, the world is facing the COVID-19 pandemic caused by SARS-CoV-2. In a big effort to cope with this outbreak, two Uruguayan institutions, Institut Pasteur de Montevideo and Universidad de la República, have developed and implemented a diagnosis pipeline based on qRT-PCR using entirely local resources. In this context, we performed comparative quantitative proteomic analysis from oro- and naso-pharyngeal swabs used for diagnosis. Tryptic peptides obtained from five positive and five negative samples were analysed by nano-LC-MS/MS using a Q-Exactive Plus mass spectrometer. Data analysis was performed using PatternLab for Proteomics software. From all SARS-CoV-2 positive swabs we were able to detect peptides of the SARS-CoV-2 nucleoprotein that encapsulates and protect the RNA genome. Additionally, we detected an average of 1100 human proteins from each sample. The most abundant proteins exclusively detected in positive swabs were "Guanylate-binding protein 1", "Tapasin" and "HLA class II histocompatibility antigen DR beta chain". The biological processes overrepresented in infected host cells were "SRP-dependent cotranslational protein targeting to membrane", "nuclear-transcribed mRNA catabolic process, nonsense-mediated decay", "viral transcription" and "translational initiation". Data is available via ProteomeXchange with identifier PXD020394. We expect that this data can contribute to the future development of mass spectrometry based approaches for COVID-19 diagnosis. Also, we share this preliminary proteomic characterization concerning the host response to infection for its reuse in basic investigation.Entities:
Keywords: COVID-19; Nucleoprotein; Quantitative proteomics; SARS-CoV-2; Shotgun proteomics
Year: 2020 PMID: 32835036 PMCID: PMC7405904 DOI: 10.1016/j.dib.2020.106121
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
Fig. 1Venn diagram showing the overlapping of proteins identified in positive and negative samples (proteins identified in at least 3 replicates of each condition were considered). The number of proteins statistically detected as uniquely present in positive samples (light gray) and negative samples (dark gray) are shown. The number of proteins in the intersection represents proteins common to both conditions.
Fig. 2Volcano plot indicating proteins present in both sample sets but with statistical difference in relative abundance according to spectrum counts. Each spot represents one protein. In the y-axis Log2(p-value) is indicated. In the x-axis Log2(fold-change) is shown (negative values indicate proteins overrepresented in negative samples and positive values show proteins overrepresented in positive samples). Blue dots correspond to proteins that show a statistical difference in relative abundance between conditions.
Biological processes overrepresented in positive samples.
| GO Biological Process complete | ||||||
|---|---|---|---|---|---|---|
| Input (proteins overrepresented in positive samples) | ||||||
| # | # | Expected | Fold enrichment | +/- | Raw | |
| SRP-dependent cotranslational protein targeting to membrane | 96 | 17 | 0.46 | 37.3 | + | 2.75E-17 |
| nuclear-transcribed mRNA catabolic process, nonsense-mediated decay | 120 | 17 | 0.57 | 29.84 | + | 8.36E-16 |
| viral transcription | 115 | 16 | 0.55 | 29.30 | + | 1.41E-14 |
| translation initiation | 143 | 16 | 0.68 | 23.57 | + | 3.43E-13 |
| aerobic respiration | 78 | 8 | 0.37 | 21.60 | + | 6.61E-05 |
| antigen processing and presentation of exogenous peptide antigen | 176 | 10 | 0.84 | 11.97 | + | 1.70E-04 |
| rRNA processing | 253 | 11 | 1.20 | 9.16 | + | 4.43E-04 |
| electron transport chain | 175 | 9 | 0.83 | 10.83 | + | 2.03E-03 |
| interferon-gamma-mediated signaling pathway | 72 | 6 | 0.34 | 17.55 | + | 1.69E-02 |
| oxidative phosphorylation | 119 | 7 | 0.57 | 12.39 | + | 2.05E-02 |
All genes from Homo sapiens were used as comparative dataset.
The number of proteins found in the input dataset that were assigned to the corresponding biological process.
This column represents the expected number of H. sapiens genes according to the size of dataset used as an input. The input dataset was all proteins incremented in positive samples when compared to negative samples. Fisher exact was used as statistical analysis with Bonferroni correction for multiple testing. Biological processes are ordered according to the p-value.
Reactome pathway overrepresented in positive samples.
| Reactome pathway | ||||||
|---|---|---|---|---|---|---|
| Input (proteins overrepresented in positive samples) | ||||||
| # | # | Expected | Fold enrichment | +/- | Raw | |
| SRP-dependent cotranslational protein targeting to membrane | 112 | 17 | 0.53 | 31.97 | + | 6.62E-17 |
| Viral mRNA Translation | 89 | 15 | 0.42 | 35.50 | + | 3.04E-15 |
| Peptide chain elongation | 89 | 15 | 0.42 | 35.50 | + | 3.04E-15 |
| Nonsense Mediated Decay (NMD) enhanced by the Exon Junction Complex (EJC) | 115 | 16 | 0.55 | 29.30 | + | 3.21E-15 |
| Selenocysteine synthesis | 93 | 15 | 0.44 | 33.97 | + | 5.94E-15 |
| Eukaryotic Translation Termination | 93 | 15 | 0.44 | 33.97 | + | 5.94E-15 |
| Nonsense Mediated Decay (NMD) independent of the Exon Junction Complex (EJC) | 95 | 15 | 0.45 | 33.97 | + | 7.31E-15 |
| Formation of a pool of free 40S subunits | 101 | 15 | 0.48 | 31.28 | + | 1.67E-14 |
| Response of EIF2AK4 (GCN2) to amino acid deficiency | 101 | 15 | 0.48 | 31.28 | + | 1.67E-14 |
| Regulation of expression of SLITs and ROBOs | 169 | 17 | 0.80 | 21.19 | + | 3.82E-14 |
| L13a-mediated translational silencing of Ceruloplasmin expression | 111 | 15 | 0.53 | 28.46 | + | 6.03E-14 |
| GTP hydrolysis and joining of the 60S ribosomal subunit | 112 | 15 | 0.53 | 28.21 | + | 6.81E-14 |
| Major pathway of rRNA processing in the nucleolus and cytosol | 182 | 15 | 0.86 | 17.36 | + | 5.18E-11 |
| Formation of the ternary complex, and subsequently, the 43S complex | 51 | 7 | 0.24 | 28.91 | + | 2.17E-05 |
| Ribosomal scanning and start codon recognition | 58 | 7 | 0.28 | 25.42 | + | 4.88E-05 |
| Translation initiation complex formation | 58 | 7 | 0.28 | 25.42 | + | 4.88E-05 |
| Mitochondrialproteinimport | 65 | 6 | 0.31 | 19.44 | + | 2.21E-03 |
| Interferon gamma signaling | 91 | 6 | 0.43 | 13.89 | + | 1.38E-02 |
| Downstream TCR signaling | 102 | 6 | 0.48 | 12.39 | + | 2.56E-02 |
All genes from H. sapiens were used as comparative dataset.
The number of proteins found in the input dataset that were assigned to the corresponding biological process.
This column represents the expected number of H. sapiens genes according to the size of dataset used as an input. The input dataset was all proteins incremented in positive samples when compared to negative samples. Fisher exact was used as statistical analysis with Bonferroni correction for multiple testing. Biological processes are ordered according to the p-value. Pathways with less than 5 proteins found in the input dataset were not considered.
| Subject | Biochemistry, Genetics and Molecular Biology (General). |
| Specific subject area | Label-free quantitative proteomics, virology, SARS-CoV-2. |
| Type of data | Tables. |
| How data were acquired | Swabs for clinical diagnosis of COVID-19 were biological inactivated with 2% SDS. |
| Data format | RAW data (ProteomeXchange Consortium via the PRIDE partner repository with the dataset identifier PXD020394) and supplementary excel files with data analysis output, figures and tables. |
| Parameters for data collection | Five SARS-CoV-2 positive and five negatives oro- and naso-pharyngeal swabs were obtained from the COVID-19 diagnostic laboratory installed at Institut Pasteur de Montevideo. They were selected in a random way. Data from swabs samples origin was confidential and thus totally unknown. |
| Description of data collection | Sample was collected by health professionals from local public hospitals. |
| Data source location | Institut Pasteur de Montevideo |
| Data accessibility | Repository name: ProteomeXchange Consortium via the PRIDE partner |