| Literature DB >> 28587253 |
Barbara Höllbacher1, Armin O Schmitt2, Heidi Hofer3, Fatima Ferreira4, Peter Lackner5.
Abstract
Pollen is one of the most common causes of allergy worldwide, making the study of their molecular composition crucial for the advancement of allergy research. Despite substantial efforts in this field, it is not yet clear why some plant pollens strongly provoke allergies while others do not. However, proteases and protease inhibitors from allergen sources are known to play an important role in the development of pollen allergies. In this study, we aim to uncover differences in the transcriptional pattern of proteases and protease inhibitors in Betula verrucosa and Pinus sylvestris pollen as models for high and low allergenic potential, respectively. We applied RNA sequencing to Betula verrucosa and Pinus sylvestris pollen. After de-novo assembly we derived general functional profiles of the protein coding transcripts. By utilization of domain based functional annotation we identified potential proteases and protease inhibitors and compared their expression in the two types of pollen. Functional profiles are highly similar between Betula verrucosa and Pinus sylvestris pollen. Both pollen contain proteases and inhibitors from 53 and 7 Pfam families, respectively. Some of the members comprised within those families are implicated in facilitating allergen entry, while others are known allergens themselves. Our work revealed several candidate proteins which, with further investigation, represent exciting new leads in elucidating the process behind allergic sensitization.Entities:
Keywords: allergenicity; pollen transcriptome; protease; protease inhibitor
Mesh:
Substances:
Year: 2017 PMID: 28587253 PMCID: PMC5486022 DOI: 10.3390/ijms18061199
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1Analysis workflow. Two strategies were implemented: (i) De-novo assembly and mapping the transcript to SwissProt sequences split into Pfam domains. (ii) Direct read mapping to the Arabidopsis thaliana reference proteome with subsequent Pfam based assessment. An example of a multi-domain protease is given by the cartoon of Der p 1, which includes a protease domain (green) and a propeptide domain (blue).
RNA-Seq and de-novo assembly statistics.
| Attribute | ||
|---|---|---|
| Total paired reads | 223 M | 221 M |
| GC content (%) | 47 | 46 |
| Read length range | 35–151 | 35–151 |
| Read length median | 150 | 150 |
| Paired reads after filtering: | 218 M | 218 M |
| Assembled Trinity transcripts: | 59,581 | 85,151 |
| Assembled Trinity genes: | 25,263 | 38,064 |
Spearman rank correlation between functional profiles.
| COG | GO-BP | GO-MF | GO-CC | |||||
|---|---|---|---|---|---|---|---|---|
| Bet | Pin | Bet | Pin | Bet | Pin | Bet | Pin | |
| Lil | 0.80 | 0.82 | 0.52 | 0.52 | 0.43 | 0.43 | 0.87 | 0.87 |
| Bet | 0.98 | 1.00 | 0.99 | 0.98 | ||||
COG: COG Categories, GO-BP: GO Biological Processes, GO-MF: GO Molecular Functions, GO-CC: GO Cellular Components, Bet: Betula verrucosa, Pin: Pinus sylvestris. Lil: Lilium longiflorum.
List of 53 protease families identified in the Pinus sylvestris and Betula verrucosa transcriptomes. Sums over transcripts per million transcripts (TPM) of all Trinity transcripts assigned to a certain Pfam family are given.
| Pfam ID | Name | ∑ TPM Total | ∑ TPM | ∑ TPM |
|---|---|---|---|---|
| PF00082 | Subtilase family | 28,858.7 | 21,782.0 | 7076.6 |
| PF00450 | Serine carboxypeptidase | 6259.0 | 4190.4 | 2068.6 |
| PF14543 | Xylanase inhibitor N-terminal | 4466.0 | 1185.8 | 3280.2 |
| PF00112 | Papain family cysteine protease | 4439.9 | 3076.1 | 1363.7 |
| PF00227 | Proteasome subunit | 1451.6 | 319.9 | 1131.7 |
| PF14541 | Xylanase inhibitor C-terminal | 1147.3 | 880.4 | 266.9 |
| PF12146 | Serine aminopeptidase, S33 | 871.5 | 427.1 | 444.4 |
| PF00443 | Ubiquitin carboxyl-terminal hydrolase | 699.6 | 266.3 | 433.3 |
| PF00026 | Eukaryotic aspartyl protease | 554.0 | 240.7 | 313.3 |
| PF00557 | Metallopeptidase family M24 | 547.5 | 183.9 | 363.6 |
| PF01694 | Rhomboid family | 518.0 | 328.8 | 189.2 |
| PF04258 | Signal peptide peptidase | 469.7 | 267.2 | 202.5 |
| PF01434 | Peptidase family M41 | 341.0 | 199.0 | 142.0 |
| PF02338 | OTU-like cysteine protease | 253.4 | 91.8 | 161.6 |
| PF05903 | PPPDE putative peptidase domain | 253.1 | 106.6 | 146.6 |
| PF01433 | Peptidase family M1 | 230.4 | 80.7 | 149.7 |
| PF00574 | Clp protease | 228.3 | 48.9 | 179.5 |
| PF02127 | Aminopeptidase I zinc metalloprotease (M18) | 179.6 | 137.3 | 42.4 |
| PF00717 | Peptidase S24-like | 174.8 | 66.9 | 107.9 |
| PF01088 | Ubiquitin carboxyl-terminal hydrolase, family 1 | 166.1 | 35.3 | 130.8 |
| PF00326 | Prolyl oligopeptidase family | 146.6 | 60.9 | 85.7 |
| PF01965 | DJ-1/PfpI family | 123.3 | 13.7 | 109.6 |
| PF05577 | Serine carboxypeptidase S28 | 106.6 | 21.7 | 84.8 |
| PF10502 | Signal peptidase, peptidase S26 | 103.5 | 66.5 | 37.0 |
| PF02902 | Ulp1 protease family, C-terminal catalytic domain | 99.7 | 20.9 | 78.8 |
| PF02099 | Josephin | 99.5 | 74.6 | 24.9 |
| PF04389 | Peptidase family M28 | 95.9 | 11.2 | 84.8 |
| PF01435 | Peptidase family M48 | 93.2 | 23.0 | 70.2 |
| PF13365 | Trypsin-like peptidase domain | 80.8 | 39.4 | 41.3 |
| PF10275 | Peptidase C65 Otubain | 79.0 | 14.6 | 64.5 |
| PF00883 | Cytosol aminopeptidase family, catalytic domain | 73.5 | 15.6 | 57.9 |
| PF00675 | Insulinase (Peptidase family M16) | 64.7 | 20.1 | 44.6 |
| PF00089 | Trypsin | 60.1 | 13.7 | 46.4 |
| PF01470 | Pyroglutamyl peptidase | 54.5 | 2.0 | 52.5 |
| PF01432 | Peptidase family M3 | 52.8 | 26.0 | 26.8 |
| PF08325 | WLM domain | 52.7 | 39.7 | 13.0 |
| PF03416 | Peptidase family C54 | 45.6 | 5.8 | 39.7 |
| PF09668 | Aspartyl protease | 39.6 | 1.8 | 37.9 |
| PF09768 | Peptidase M76 family | 36.6 | 28.8 | 7.7 |
| PF02163 | Peptidase family M50 | 31.9 | 4.9 | 27.0 |
| PF00246 | Zinc carboxypeptidase | 29.9 | 20.6 | 9.4 |
| PF05362 | Lon protease (S16) C-terminal proteolytic domain | 27.0 | 2.8 | 24.2 |
| PF14538 | Raptor N-terminal CASPase like domain | 26.2 | 16.9 | 9.3 |
| PF07910 | Peptidase family C78 | 18.8 | 3.1 | 15.7 |
| PF03572 | Peptidase family S41 | 12.2 | 5.0 | 7.2 |
| PF03571 | Peptidase family M49 | 11.6 | 5.3 | 6.3 |
| PF02586 | SOS response associated peptidase (SRAP) | 11.6 | 5.9 | 5.6 |
| PF00814 | Glycoprotease family | 10.3 | 3.7 | 6.5 |
| PF01457 | Leishmanolysin | 8.0 | 0.6 | 7.4 |
| PF00648 | Calpain family cysteine protease | 5.5 | 1.8 | 3.7 |
| PF00413 | Matrixin | 5.2 | 5.2 | 0.0 |
| PF01551 | Peptidase family M23 | 4.4 | 0.0 | 4.4 |
| PF06480 | FtsH Extracellular | 1.3 | 0.6 | 0.6 |
Protease inhibitor families. Legend see Table 3.
| Pfam ID | Name | ∑ TPM Total | ∑ TPM | ∑ TPM |
|---|---|---|---|---|
| PF00188 | Cysteine-rich secretory protein family | 19,707.4 | 19,611.2 | 96.3 |
| PF05922 | Peptidase inhibitor I9 | 4382.9 | 2866.2 | 1516.7 |
| PF16845 | Aspartic acid proteinase inhibitor | 1328.2 | 1328.2 | 0.0 |
| PF00031 | Cystatin domain | 820.3 | 0.0 | 820.3 |
| PF00280 | Potato inhibitor I family | 154.1 | 121.2 | 32.9 |
| PF00079 | Serpin (serine protease inhibitor) | 84.4 | 31.7 | 52.8 |
| PF00197 | Trypsin and protease inhibitor | 6.5 | 6.5 | 0.0 |