| Literature DB >> 31757973 |
Huoming Zhang1, Pei Liu2, Tiannan Guo3, Huayan Zhao2, Dalila Bensaddek4, Ruedi Aebersold3,5, Liming Xiong2,6.
Abstract
Arabidopsis is an important model organism and the first plant with its genome completely sequenced. Knowledge from studying this species has either direct or indirect applications for agriculture and human health. Quantitative proteomics by data-independent acquisition mass spectrometry (SWATH/DIA-MS) was recently developed and is considered as a high-throughput, massively parallel targeted approach for accurate proteome quantification. In this approach, a high-quality and comprehensive spectral library is a prerequisite. Here, we generated an expression atlas of 10 organs of Arabidopsis and created a library consisting of 15,514 protein groups, 187,265 unique peptide sequences, and 278,278 precursors. The identified protein groups correspond to ~56.5% of the predicted proteome. Further proteogenomics analysis identified 28 novel proteins. We applied DIA-MS using this library to quantify the effect of abscisic acid on Arabidopsis. We were able to recover 8,793 protein groups of which 1,787 were differentially expressed. MS data are available via ProteomeXchange with identifier PXD012708 and PXD012710 for data-dependent acquisition and PXD014032 for DIA analyses.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31757973 PMCID: PMC6874543 DOI: 10.1038/s41597-019-0294-0
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Fig. 1Schematic diagram of experimental workflow. (a) Detailed sample information of the 10 Arabidopsis organs used for generation of spectral library; (b) Optimal protein extraction and peptide purification procedures for in-depth coverage of Arabidopsis spectral library; (c) The samples were analyzed using Orbitrap Fusion and TripleTOF mass spectrometer platforms for data dependent acquisition to construct of the spectral library. The DDA data were analyzed following by protein identification by Maxquant and generation of comprehensive library using Spectronaut Pulsar.
Number of proteins identified from each organ using either TripleTOF 5600+ or Orbitrap Fusion Lumos.
| Organs | No. of Proteins identified | |
|---|---|---|
| TripleTOF 5600 | Orbitrap Fusion | |
| Cotyledones | 7660 | 9308 |
| Rosettes | 6034 | 7889 |
| Caulines | 6592 | 8851 |
| Stems | 6596 | 8946 |
| Pollen grains | 8591 | 11321 |
| Flowers with fertiization | 8737 | 11594 |
| Siliques | 7950 | 10252 |
| Seeds | 4608 | 6739 |
| Roots | 8665 | 10325 |
| Root cell suppension | 8835 | 10640 |
The list of novel proteins identified using proteogenomics approach. These proteins were not present in TAIR proteome database.
| NO. | 6ORF ID | Genomics Locus sequence (DNA) | TAIR annotation | Total Spectrum Count | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| at1 | at2 | at3 | at4 | at5 | at6 | at7 | at8 | at9 | at10 | ||||
| 1 | AtChr1@30100671@30103941 | AT1G80020.1 | Transposable_element_gene | 38 | 25 | 22 | 24 | 72 | 51 | 31 | 46 | 79 | 13 |
| 2 | AtChr3@4967609@4969613 | AT3G14800.2 | Transposable_element_gene | 14 | 2 | 9 | 4 | 28 | 29 | 12 | 12 | 19 | |
| 3 | AtChr2@2859228@2863482 | AT2G06950.1 | Transposable_element_gene | 6 | 5 | 2 | 25 | 21 | 6 | 10 | 1 | ||
| 4 | AtChr3@1247686@1249978 | AT3G04605.1 | Transposable_element_gene | 5 | 1 | 3 | 1 | 14 | 11 | 6 | 1 | 23 | |
| 5 | AtChr3@22515146@22515965 | AT3G60930.1 | Transposable_element_gene | 4 | 1 | 1 | 1 | 9 | 8 | 1 | 2 | 5 | 4 |
| 6 | AtChr5@19854556@19856566 | AT5G48965.1 | Transposable_element_gene | 2 | 2 | 8 | 8 | 3 | 3 | 17 | |||
| 7 | AtChr5@7082385@7083282 | AT5G20880.1 | Transposable_element_gene | 3 | 1 | 3 | 2 | 5 | 9 | 2 | 4 | 3 | 1 |
| 8 | AtChr3@22515868@22516741 | AT3G60930.2 | Transposable_element_gene | 1 | 1 | 3 | 5 | 3 | 1 | 7 | |||
| 9 | AtChr1@26346412@26348515 | AT1G69950.1 | Transposable_element_gene | 4 | 5 | 4 | |||||||
| 10 | AtChr3@15403874@15404801 | AT3G43510.1 | Transposable_element_gene | 1 | 5 | 2 | |||||||
| 11 | AtChr5@13104627@13107087 | AT5G34853.1 | Transposable_element_gene | 2 | 1 | 5 | |||||||
| 12 | AtChr5@13103632@13104208 | AT5G34853.1 | Transposable_element_gene | 3 | 2 | 5 | |||||||
| 13 | AtChr5@19854197@19854566 | AT5G48965.1 | Transposable_element_gene | 1 | 2 | ||||||||
| 14 | AtChr3@18791524@18795901 | AT3G50625.1 | Transposable_element_gene | 4 | |||||||||
| 15 | AtChr4@7689781@7690192@0 | AT4G06020.1 | Novel_transcribed_region | 5 | 9 | ||||||||
| 16 | AtChr5@10277623@10277968 | AT5G00495.1 | Novel_transcribed_region | 4 | 3 | ||||||||
| 17 | AtChr1@10428741@10429146 | AT1G29785.1 | Antisense_long_noncoding_rna | 2 | 2 | 1 | 3 | 5 | 1 | 1 | |||
| 18 | AtChr4@9188288@9188534 | AT4G06385.1 | Long_noncoding_rna | 5 | 1 | ||||||||
| 19 | AtChr3@3253541@3253958 | AT3G02345.1 | Long_noncoding_rna | 3 | |||||||||
| 20 | AtChr5@26518786@26519188 | AT5G66400.1 | Arabidopsis thaliana drought-induced 8 | 8 | |||||||||
| 21 | AtChr2@17714928@17715315 | AT2G42560.1 | Late embryogenesis abundant 25 | 4 | |||||||||
| 22 | AtChr2@2056255@2056639 | AT2G05580.1 | Glycine-rich protein family | 2 | |||||||||
| 23 | AtChr4@6693384@6693678 | AT4G10895.1 | Plant self-incompatibility protein S1 family | 0 | 0 | 0 | 0 | 6 | 7 | 0 | 0 | 0 | 0 |
| 24 | AtChr5@7534553@7534787 | AT5G22650.1 | Arabidopsis histone deacetylase 2 | 5 | 2 | 1 | 1 | ||||||
| 25 | AtChr1@20390129@20390324 | AT1G54630.1 | Acyl carrier protein 3 | 2 | |||||||||
| 26 | AtChr1@18512092@18512548 | AT1G49980.9 | DNA/RNA polymerases superfamily protein | 5 | 5 | 6 | |||||||
| 27 | AtChr2@15278911@15279358 | AT2G36410.1 | Transcriptional activator (DUF662) | 1 | 2 | 1 | 3 | 1 | |||||
| 28 | AtChr5@21376666@21377527 | AT5G52710.1 | DNA/RNA polymerases superfamily protein | 7 | |||||||||
The list of novel proteins identified using proteogenomics approach. These proteins/sequences were either not present in TAIR10 or incomplete, but present in the latest TAIR11 proteome database.
| No | 6ORF ID | Genomics Locus sequence (DNA) | TAIR annotation | Notes | Total Spectral Count | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| at1 | at2 | at3 | at4 | at5 | at6 | at7 | at8 | at9 | at10 | |||||
| 1 | Atchr1@19256097@19256652 | AT1G51850.2 | Leucine-rich repeat protein kinase family protein | New isoform/entry in Tair11 | 2 | |||||||||
| 2 | AtChr5@18590218@18590728 | AT5G45830.4 | Delay of gemination 1 | New isoform/entry in Tair11 | 3 | 31 | ||||||||
| 3 | AtChr3@1743539@1745543 | AT3G05850.1 | MuDR family transposase | New entry in Tair11 | 7 | 1 | 2 | 1 | 14 | 13 | 15 | 6 | 29 | 1 |
| 4 | AtChr3@1745505@1745964 | AT3G05850.1 | MuDR family transposase | New entry in Tair11 | 1 | 2 | 1 | 2 | ||||||
| 5 | AtChr5@20952547@20953303 | AT5G51585.1 | Transmembrane protein | New entry in Tair11 | 2 | |||||||||
| 6 | AtChr3@6174451@6174928 | AT3G18040.4 | MAP kinase 9 | New entry in Tair11 | 1 | 1 | 1 | |||||||
| 7 | AtChr4@9187917@9188217 | AT4G16233.1 | GDSL-like lipase/acylhydrolase superfamily protein | New entry in Tair11 | 4 | 4 | ||||||||
| 8 | AtChr3@3252605@3253004 | AT3G10455.1 | Plant self-incompatibility protein S1 family protein | New entry in Tair11 | 3 | 6 | ||||||||
| 9 | AtChr5@4376549@4378589 | AT5G13590.1 | Unknown protein | Point correction | 2 | 1 | 2 | 1 | 1 | |||||
| 10 | AtChr4@15093731@15095657 | AT4G30990.3 | ARM repeat superfamily protein | Sequence addition in Tair11 | 2 | |||||||||
| 11 | AtChr4@13558438@13559653 | AT4G27010.2 | Ribosome 60S biogenesis amino-terminal protein | Sequence addition in Tair11 | 2 | |||||||||
| 12 | AtChr4@7455224@7455494 | AT4G12610.1 | Transcription initiation factor IIF subunit alpha RAP74 | Sequence addition in Tair11 | 1 | 2 | 5 | 4 | 3 | 6 | 4 | |||
| 13 | AtChr4@1344112@1344550 | AT4G03050.1 | 2-oxoglutarate-dependent dioxygenase | Sequence addition in Tair11 | 40 | 19 | ||||||||
| 14 | AtChr5@15331919@15332135 | AT5G38360.1 | Alpha/beta-Hydrolases superfamily protein | Sequence addition in Tair11 | 3 | 1 | 3 | 1 | 1 | 1 | ||||
Arabidopsis spectral libraries constructed from Orbitrap Fusion and TripleTOF 5600+.
| Platform | Precursors | Modified peptides | Peptides | Proteoypci Peptides | Protein Groups | Proteins | Single hits | Fragments | Serach Engine | Created by |
|---|---|---|---|---|---|---|---|---|---|---|
| Fusion Lumos | 278,278 | 219,883 | 187,265 | 120,750 | 15,514 | 19,375 | 1,352 | 1,635,725 | Pulsar | Spectronaut v12.0 |
| TripleTOF 5600 | 118,675 | 89,149 | 80,492 | 50,366 | 10,915 | 14,420 | 1,352 | 705,063 |
Fig. 2Peptide properties in the Arabidopsis spectral library: peptide length distribution (a), precursor mass-to-charge ratio distribution (b), peptide charge (c) and peptide modification (d).
Fig. 3Heatmap shows the clustering of 3 technical replicates under 6 experimental conditions. Runs within the same condition cluster nicely as illustrated by the condition-based color code in the bottom of the heatmap and the x-axis dendrogram.
Fig. 4Gene ontology enrichment analysis (biological process: BP) of differential expressed proteins in response to ABA treatment. Blue bar indicates down regulation of biological processes at 2 h (a), 24 h (b), and 72 h post-treatment (c) respectively, whereas the orange bar indicates upregulation of biological process at 2 h (d), 24 h (e), and 72 h post-treatment (f) respectively. Only BP terms with a Benjamini-Hochberg corrected p-value of less than 0.05 were included.
| Measurement(s) | Proteome |
| Technology Type(s) | mass spectrometry assay • computational modeling technique |
| Sample Characteristic - Organism | Arabidopsis thaliana |