| Literature DB >> 34977300 |
Anna Mamaeva1, Andrey Knyazev1, Anna Glushkevich1, Igor Fesenko1.
Abstract
Small open reading frames (<100 codons) that are located on long noncoding RNAs (lncRNAs) can encode functional microproteins. These microproteins are shown to play important roles in different cellular processes, such as cell proliferation, development and disease response [1], [2], [3], [4], [5], [6]. However, there are only a few known lncRNA-encoded functional microproteins in plants. One such microprotein that was named PSEP3, was identified in the moss Physcomitrium patens by mass-spectrometry analysis. 57-aa PSEP3 contains Low Complexity Region (LCR) enriched with proline. We have previously shown that PSEP3 is translated in protonemata and gametophores of P. patens, and its knockout (KO line) or overexpression (OE line) affects protonemata growth [7]. We performed a quantitative proteomic analysis of the mutant lines with PSEP3 knockout and overexpression. 7-days old protonemata of wild type (WT line) and both mutant lines (KO and OE) were collected and used for iTRAQ-based proteomic experiments. LC-MS/MS data were processed using PEAKS Studio v.8 software with protein identification based on a Phytozome protein database. More analysis of PSEP3 effects on plant growth can be obtained in the paper published in Nucleic Acid Research [8].Entities:
Keywords: Long noncoding RNA-encoded peptide; Physcomitrium patens; Proteomics; iTRAQ
Year: 2021 PMID: 34977300 PMCID: PMC8688553 DOI: 10.1016/j.dib.2021.107715
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
Fig. 1Overview of the experimental workflow.
Characteristics of the proteomic datasets.
| PSEP3 KO dataset | PSEP3 OE dataset | |
|---|---|---|
| Number of MS/MS scans | 45815 | 54029 |
| Peptide-Spectrum Matches | 17784 | 27862 |
| Peptide sequences | 11170 | 16724 |
| Protein groups | 2710 | 2873 |
| Proteins | 3461 | 3511 |
Fig. 2Quality control metrics of the proteomic datasets. (A) The length distributions of peptides; (B) Protein sequence coverage distributions; (C) Protein mass distributions and (D) Distributions of the unique peptide numbers.
Fig. 3Principal component analysis (PCA) of wild type with PSEP3 KO (A) and PSEP3 OE (B) mutant samples. Wild type shown by orange and mutants shown by blue. The PCA analysis included all protein groups and was performed in the Python library sklearn.
| Subject | Omics: Proteomics |
| Specific subject area | Plant quantitative proteomics |
| Type of data | Table |
| How data were acquired | Raw data were acquired with mass spectrometry using Q Exactive HF benchtop Orbitrap mass spectrometer (Thermo Fisher Scientific) and iTRAQ kit, analysis was performed using PEAKS Software 8.0 |
| Data format | Raw and analyzed data |
| Description of data collection | The protonemata of WT, PSEP3 KO and PSEP3 OE mutant lines were grown in 200 ml liquid BCD medium supplemented with 5 mM ammonium tartrate (BCDAT) during a 16 h photoperiod at 25 |
| Data source location | Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry of the Russian Academy of Sciences |
| Data accessibility | Data identification number: |
| Related research article | I. Fesenko, S.A. Shabalina, A. Mamaeva, A. Knyazev, A. Glushkevich, I. Lyapina, R. Ziganshin, S. Kovalchuk, D. Kharlampieva, V. Lazarev, M. Taliansky, E.V. Koonin, A vast pool of lineage-specific microproteins encoded by long non-coding RNAs in plants, Nucleic Acids Research, 49, (2021) 10328–10346, |