| Literature DB >> 22520048 |
Holger Schuhmann1, Pitter F Huesgen, Iwona Adamska.
Abstract
BACKGROUND: The Deg/HtrA family of ATP-independent serine endopeptidases is present in nearly all organisms from bacteria to human and vascular plants. In recent years, multiple deg/htrA protease genes were identified in various plant genomes. During genome annotations most proteases were named according to the order of discovery, hence the same names were sometimes given to different types of Deg/HtrA enzymes in different plant species. This can easily lead to false inference of individual protease functions based solely on a shared name. Therefore, the existing names and classification of these proteolytic enzymes does not meet our current needs and a phylogeny-based standardized nomenclature is required.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22520048 PMCID: PMC3473262 DOI: 10.1186/1471-2229-12-52
Source DB: PubMed Journal: BMC Plant Biol ISSN: 1471-2229 Impact factor: 4.215
The family of Deg/HtrA proteases in
| At3g27925 | DEG 1 | O22609 | 439 | PD-PDZ | Cre02.g088400 Cre14.g630550 Cre12.g498500 Os05g0568900 Pp1s160_79V6 Pp1s198_100V6 POPTR_0001s34960 | AtDeg1 |
| At2g47940 | DEG 2 | O82261 | 607 | PD-PDZ-PDZ | Cre19.g752200 Os05g0147500 Pp1s8_140V6 POPTR_0014s12970 POPTR_0020s00220 | AtDeg2 |
| At1g65630 | DEG 3 | Q9SHZ1 | 559 | PD-PDZ-PDZ | Deg 10 Subgroup | AtDeg3 |
| At1g65640 | DEG 4 | Q9SHZ0 | 518 | PD-PDZ-PDZ | Deg10 Subgroup | AtDeg4 |
| At4g18370 | DEG 5 | Q9SEL7 | 323 | PD | Cre02.g110600 Os12g0616600 Pp1s63_95V6 POPTR_0011s02330 | AtDeg5 |
| At1g51150 | DEG 6 | Q9C691 | 219 | PDia | n.a. | AtDeg6 |
| At3g03380 | DEG 7 | Q8RY22 | 1097 | PD-PDZ-PDZ- PDia-PDZ-PDZ | Cre03.g180650 Os02g0712000 Pp1s237_5V6 Pp1s21_327V6 POPTR_0017s03050 POPTR_0004s08740 POPTR_0004s08720 | AtDeg7 |
| At5g39830 | DEG 8 | Q9LU10 | 448 | PD-PDZ | Cre01.g028350 Os04g0459900 Pp1s31_50V6 POPTR_0004s13440 | AtDeg8 |
| At5g40200 | DEG9 | Q9FL12 | 592 | PD-PDZ-PDZ | Cre19.g752200 Os02g0742500 Os06g0234100 Pp1s176_87V6 Pp1s1_203V6 POPTR_0015s08440 POPTR_0004s13440 | AtDeg9 |
| At5g36950 | DEG10 | Q9FIV6 | 586 | PD-PDZ-PDZ | Cre14.g617600 Cre01.g013300 Os05g0417100 Pp1s55_7V5.1 POPTR_0008s07940 | AtDeg10 |
| At3g16540 | DEG11 | Q9LK71 | 555 | PD-PDZ-PDZ | Deg10 Subgroup | AtDeg11 |
| At3g16550 | DEG12 | Q9LK70 | 499 | PD-PDZ-PDZ | Deg10 Subgroup | AtDeg12 |
| At5g40560 | DEG13 | Q9FM41 | 486 | PD-PDZ-PDZ | Deg10 Subgroup | AtDeg13 |
| At5g27660 | DEG14 | Q3E6S8 | 429 | PD-PDZ | Os11g0246600 Pp1s180_15V6 POPTR_0013s01900 | AtDeg14 |
| At1g28320 | DEG15 | Q8VZD4 | 709 | NT-PD | Cre12.g548200 Os05g0497700 Pp1s196_28V6 POPTR_0004s04650 POPTR_0011s05510 | AtDeg15 |
| At5g54745 | DEG16 | Q3E8B4 | 198 | PDia | n.a. | AtDeg16 |
a According to TAIR database. b According to [14]. c If more than one protein entry was present, the different versions were analyzed by the HHPred platform (http://toolkit.tuebingen.mpg.de/hhpred/), and the one with intact protease domain and (if present) PDZ domain(s) was considered here. Sequences used in this study are supplied as Supplementary material (Additional file 1). d According to the HHPred platform. Abbreviations: aa, amino acids; n.a., not available; NT, elongated N-terminus; PD, potentially active protease domain; PDia, inactive protease domain (i.e. at least one residue of the catalytic triad is mutated or missing); PDZ, PDZ domain.
The family of Deg/HtrA proteases in
| POPTR_0001s34960 Pt706718 | PtDeg1 | A9PI52 | 429 | PD-PDZ | At3g27925 Cre02.g088400 Cre14.g630550 Cre12.g498500 Os05g0568900 Pp1s160_79V6 Pp1s198_100V6 | |
| POPTR_0014s12970 Pt572750 | PtDeg2.1 | B9I9X1 | 592 | PD-PDZ-PDZ | At2g47940 Cre19.g752200 Os05g0147500 Pp1s8_140V6 | |
| POPTR_0020s00220 Pt775566 | PtDeg2.2 | B9IBU0 | 624 | PD-PDZ-PDZ | At2g47940 Cre19.g752200 Os05g0147500 Pp1s8_140V6 | |
| POPTR_0011s02330 Pt771291 | PtDeg5.1 | B9HYW4 | 316 | PD | At4g18370 Cre02.g110600 Os12g0616600 Pp1s63_95V6 | |
| POPTR_0017s03050 Pt816849 | PtDeg7.1 | B9GV35 | 1128 | PD-PDZ-PDZ- PDia-PDZ-PDZ | At3g03380 Cre03.g180650 Os02g0712000 Pp1s237_5V6 Pp1s21_327V6 | |
| POPTR_0004s08740 Pt555951 | PtDeg7.2 | B9H390 | 1080 | PD-PDZ-PDZ- PDia-PDZ-PDZ | At3g03380 Cre03.g180650 Os02g0712000 Pp1s237_5V6 Pp1s21_327V6 | |
| POPTR_0004s08720 Pt714140 | PtDeg7.3 | B9H391 | 1117 | PD-PDZ-PDZ- PDia-PDZ-PDZ | At3g03380 Cre03.g180650 Os02g0712000 Pp1s237_5V6 Pp1s21_327V6 | |
| POPTR_0004s13440 Pt199267 | PtDeg8 | B9H3X7 | 465 | PD-PDZ | At5g39830 Cre01.g028350 Os04g0459900 Pp1s31_50V6 | |
| POPTR_0015s08440 Pt251989 | PtDeg9.1 | B9IEN8 | 556 | PD-PDZ-PDZ | At5g40200 Cre19.g752200 Os02g0742500 Os06g0234100 Pp1s176_87V6 Pp1s1_203V6 | |
| POPTR_0012s07930 Pt728836/Pt823359 | PtDeg9.2 | B9I375 | 559 | PD-PDZ-PDZ | At5g40200 Cre19.g752200 Os02g0742500 Os06g0234100 Pp1s176_87V6 Pp1s1_203V6 | |
| POPTR_0008s07940 | | B9HI10 | 587 | PD-PDZ-PDZ | At5g36950 Cre01.g013300 Cre14.g617600 Os05g0417100 Pp1s55_7V5.1 | |
| POPTR_0013s01900 Pt662713/Pt662714 | PtDeg14.1 PtDeg14.2 | B9I7J6 (partial) | 422 | PD-PDZ | At5g27660 Os11g0246600 Pp1s180_15V6 | |
| POPTR_0004s04650 Pt555773 | PtDeg15.1 | B9H2S3 | 752 | NT-PD | At1g28320 Cre12.g548200 Os05g0497700 Pp1s196_28V6 | |
| POPTR_0011s05510 Pt266544 | PtDeg15.2 | B9N3H9 | 729 | NT-PD | At1g28320 Cre12.g548200 Os05g0497700 Pp1s196_28V6 | |
| POPTR_0018s04140 Pt787034 | PtDeg17.1 | B9NA38 | 356 | PDia-PDZ | n.a. | |
| POPTR_0394s00220 Pt586371 | PtDeg17.2 | B9NA39 (fragment) | 298 | PDia-PDZ | n.a. | |
| POPTR_0018s04150 Pt577788 | PtDeg17.3 | B9INA2 | 364 | PDia-PDZ | n.a. |
a First model identifier is from Phytozome v7.0 (http://www.phytozome.net), the second identifier is the corresponding identifier according to [16]. Discrepancies between the suggested gene model and the UniprotKB entry were solved by analyzing the EST data (if present) and analysis of the genomic sequence for the presence of ORFs yielding aa sequences similar to ortholog or paralog proteins, with respect to potential splicing sites.b According to [16]c If more than one protein entry was present, the different versions were analyzed by the HHPred platform (http://toolkit.tuebingen.mpg.de/hhpred/), and the one with intact protease domain and (if present) PDZ domain(s) was considered here. Sequences used in this study are supplied as Supplementary material (Additional file 1). d According to the HHPred platform. Abbreviations: aa, amino acids; n.a., not available; NT, elongated N-terminus; PD, potentially active protease domain; PD(1/2), truncated protease domain, probably proteolytically inactive; PDia, inactive protease domain (i.e. at least one residue of the catalytic triad is mutated, or protease domain is incomplete); PDZ, PDZ domain.
The family of Deg/HtrA proteases in
| Os01g0278600 LOC_Os01g17070 | Os01g0278600 OsDegP1 | Q5NBK7 | 470 | PDia-PDZ | n.a. | |
| Os02g0712000 LOC_Os02g48180 | Os02g0712000 OsDegP2 | Q6ZIR2/B9F2C1 | 1092e | PD-PDZ-PDZ- PDia-PDZ-PDZ | At3g03380 Cre03.g180650 Pp1s237_5V6 Pp1s21_327V6 POPTR_0017s03050 POPTR_0004s08740 POPTR_0004s08720 | |
| Os02g0742500 LOC_Os02g50880 | Os02g0742500 OsDegP3 | Q6Z806 | 567 | PD-PDZ-PDZ | At5g40200 Cre19.g752200 Pp1s176_87V6 Pp1s1_203V6 POPTR_0015s08440 POPTR_0004s13440 | |
| -LOC_Os03g62900 | -OsDegP4 | Q84SQ1 | 299 | PD | n.a. – not a Deg? | |
| Os04g0459900 LOC_Os04g38640 | Os04g0459900 OsDegP5 | B7EBF9 | 445 | PD-PDZ | At5g39830 Cre01.g028350 Pp1s31_50V6 POPTR_0004s13440 | |
| Os05g0147500 LOC_Os05g05480 | Os05g0147500 OsDegP6 | Q6ASR0 | 596 | PD-PDZ-PDZ | At2g47940 Cre19.g752200 Pp1s8_140V6 POPTR_0014s12970 POPTR_0020s00220 | |
| Os05g0417100 LOC_Os05g34460 | Os05g0417100 OsDegP7 | Q6AT72 | 614 | PD-PDZ-PDZ | At5g36950 Cre01.g013300 Cre14.g617600 Pp1s55_7V5.1 POPTR_0008s07940 | |
| Os05g0497700 LOC_Os05g41810 | Os05g0497700 OsDegP8 | Q0DH14 | 722e | NT-PD | At1g28320 Cre12.g548200 Pp1s196_28V6 POPTR_0004s04650 POPTR_0011s05510 | |
| Os05g0568900 LOC_Os05g49380 | Os05g0568900 OsDegP9 | Q6AUN5 | 437 | PD-PDZ | At3g27925 Cre02.g088400 Cre14.g630550 Cre12.g498500 Pp1s160_79V6 Pp1s198_100V6 POPTR_0001s34960 | |
| Os06g0234100 LOC_Os06g12780 | Os06g0234100 OsDegP10 | Q67VA4 | 628 | PD-PDZ-PDZ | At5g40200 Cre19.g752200 Pp1s176_87V6 Pp1s1_203V6 POPTR_0015s08440 POPTR_0004s13440 | |
| Os08g0144400 LOC_Os08g04920 | Os08g0144400 OsDegP11 | Q7EYD8 | 496 | NT-PDia-PDZf | n.a. | |
| Os11g0246600 LOC_Os11g14170 | Os11g0246600 OsDegP12 | Q0ITK5 | 472e | PD-PDZ | At5g27660 Pp1s180_15V6 POPTR_0013s01900 | |
| Os12g0141500 LOC_Os12g04740 | Os12g0141500 OsDegP13 | Q2QXV8 | 228 | PD | n.a. – not a Deg? | |
| Os12g0141600 LOC_Os12g04750 | Os12g0141600 OsDegP14 | Q2QXV6 | 593 | PDia-PDia | n.a. | |
| Os12g0616600 LOC_Os12g42210 | Os12g0616600 OsDegP15 | Q2QM57 | 313 | PD | At4g18370 Cre02.g110600 Pp1s63_95V6 POPTR_0011s02330 | |
| Os03g0608600 LOC_Os03g41170 | Os03g0608600 expr. protein | Q75HK9 | 271 | PD | n.a. – not a Deg? |
a First model identifier from the Rice Annotation Project (Build5), second identifier according to the TIGR/MSU nomenclature (Osa1 Release 6.1). b First name according to GenBank/UnitProtKB, second identifier according to the TIGR/MSU nomenclature. c If more than one protein entry was present, the different versions were analyzed by the HHPred platform (http://toolkit.tuebingen.mpg.de/hhpred/), and the one with intact protease domain and (if present) PDZ domain(s) was considered here. Sequences used in this study are supplied as Supplementary material (Additional file 1). d According to the HHPred platform. Abbreviations: aa, amino acids; n.a., not available; NT, elongated N-terminus; PD, potentially active protease domain; PDia: inactive protease domain (i.e. at least one residue of the catalytic triad is mutated); PDZ, PDZ domain. e Sequence was modified based on the EST data (http://compbio.dfci.harvard.edu/tgi/plant.html) and comparison with orthologs from other species. f The HHPred platform detects secondary structures similar to RNA polymerase II large subunit from Saccharomyces cerevisiae in the N-terminal part of the protein – this is an indication that the predicted transcription start is incorrectly annotated.
The family of Deg/HtrA proteases in
| Pp1s160_79V6 | A9T3R3 | 500 | PD-PDZ | At3g27925 Cre02.g088400 Cre14.g630550 Cre12.g498500 Os05g0568900 POPTR_0001s34960 | |
| Pp1s198_100V6 | A9TBD2 | 475 | PD-PDZ | At3g27925 Cre02.g088400 Cre14.g630550 Cre12.g498500 Os05g0568900 POPTR_0001s34960 | |
| Pp1s79_92V6 | A9SHE2 | 501 | PD-PDZ | At3g27925 Cre02.g088400 Cre14.g630550 Cre12.g498500 Os05g0568900 POPTR_0001s34960 | |
| Pp1s21_138V6 | A9RQ01 | 486 | PD-PDZ | At3g27925 Cre02.g088400 Cre14.g630550 Cre12.g498500 Os05g0568900 POPTR_0001s34960 | |
| Pp1s8_140V6 | A9RGN6 | 618 | PD-PDZ-PDZ | At2g47940 Cre19.g752200 Os05g0147500 POPTR_0014s12970 POPTR_0020s00220 | |
| Pp1s63_95V6 | A9SBN1 | 362 | PD | At4g18370 Cre02.g110600 Os12g0616600 POPTR_0011s02330 | |
| Pp1s237_5V6 | A9TIB2 | 1076 | PD-PDZ-PDZ- PDia-PDZ-PDZ | At3g03380 Cre03.g180650 Os02g0712000 POPTR_0017s03050 POPTR_0004s08740 POPTR_0004s08720 | |
| Pp1s21_327V6 | A9RQ61 | 1072 | PD-PDZ-PDZ- PDia-PDZ-PDZ | At3g03380 Cre03.g180650 Os02g0712000 POPTR_0017s03050 POPTR_0004s08740 POPTR_0004s08720 | |
| Pp1s31_50V6 | A9RVV4 | 493 | PD-PDZ | At5g39830 Cre01.g028350 Os04g0459900 POPTR_0004s13440 | |
| Pp1s176_87V6 | A9T734 | 612 | PD-PDZ-PDZ | At5g40200 Cre19.g752200 Os02g0742500 Os06g0234100 POPTR_0015s08440 POPTR_0004s13440 | |
| Pp1s1_203V6 | A9RB23 | 540 | PD-PDZ | At5g40200 Cre19.g752200 Os02g0742500 Os06g0234100 POPTR_0015s08440 POPTR_0004s13440 | |
| Pp1s55_7V5 | | 651 | PD-PDZ-PDZ | At5g36950 Cre01.g013300 Cre14.g617600 Os05g0417100 POPTR_0008s07940 | |
| Pp1s180_15V6 | A9T7W1 | 473 | PD-PDZ | At5g27660 Os11g0246600 POPTR_0013s01900 | |
| Pp1s196_28V6 | A9TAV2 | 784 | NT-PD | At1g28320 Cre12.g548200 Os05g0497700 POPTR_0004s04650 POPTR_0011s05510 | |
| Pp1s152_166V5.1 | | 339d | PD-PDZ | Group 1a | |
| Pp1s176_111V6 | | 527 | PDia-PDZ | n.a. | |
| Pp1s67_44V6 | A9SD45 | 408 | PDia-PDZ | n.a. |
a Model identifier according to Phytozome v7.0 (http://www.phytozome.net). Discrepancies between the suggested gene model and the UniprotKB entry were solved by analyzing EST data (if present) and analysis of the genomic sequence for the presence of ORFs yielding aa sequences similar to ortholog and paralog proteins, with respect to potential splicing sites. b If more than one protein entry was present, the different versions were analyzed by the HHPred platform (http://toolkit.tuebingen.mpg.de/hhpred/), and the one with intact protease domain and (if present) PDZ domain(s) was considered here. Sequences used in this study are supplied as Supplementary material (Additional file 1). c According to the HHPred platform. Abbreviations: aa, amino acids; n.a., not available; NT, elongated N-terminus; PD, potentially active protease domain; PD(1/2), truncated protease domain, probably proteolytically inactive; PDia, inactive protease domain (i.e. at least one residue of the catalytic triad is mutated, or protease domain is incomplete); PDZ, PDZ domain. d Fragment extended based on the EST data (asmbl_4603.p5physco4 from Phytozome 5.0., TC42496 in DCFI http://compbio.dfci.harvard.edu/cgi-bin/tgi/tc_report.pl?tc=TC42496&species=moss).
The family of Deg/HtrA proteases in
| Cre02.g088400 | Deg1 Deg1A | A8I8X2 | 530 | PD-PDZ | At3g27925 Os05g0568900 Pp1s160_80V2.1 Pp1s198_95V2.1 POPTR_0001s34960 | |
| Cre14.g630550 | Deg13 - | - | 555 | PD-PDZ | At3g27925 Os05g0568900 Pp1s160_80V2.1 Pp1s198_95V2.1 POPTR_0001s34960 | |
| Cre12.g498500 | Deg11 - | - | 462 | PD-PDZ | At3g27925 Os05g0568900 Pp1s160_80V2.1 Pp1s198_95V2.1 POPTR_0001s34960 | |
| Cre02.g092000 | Deg2 Deg2 | A8I9B8 | 656 | PD-PDZ-PDZ | Deg2 Group | |
| Cre02.g110600 | Deg5 Deg5 | A8I3D5 | 356 | PD | At4g18370 Os12g0616600 Pp1s63_93V2.1 POPTR_0011s02330 | |
| Cre03.g180650 | Deg7 Deg7 | A8JH35 | 1108 | PD-PDZ-PDZ- PDia-PDZ-PDZ | At3g03380 Os02g0712000 Pp1s237_5V2.1 Pp1s21_312V2.1 POPTR_0017s03050 POPTR_0004s08740 POPTR_0004s08720 | |
| Cre01.g028350 | Deg8 Deg8 | A8HQB3 | 436 | PD-PDZ | At5g39830 Os04g0459900 Pp1s31_48V2.1 POPTR_0004s13440 | |
| Cre19.g752200e | - - | A8JBP6 | 1353 | PD-betaglycan- hydrolase | At5g40200 Os02g0742500 Os06g0234100 Pp1s176_79V2.1 Pp1s1_200V2.1 POPTR_0015s08440 POPTR_0004s13440 At2g47940 Os05g0147500 Pp1s8_145V2.1 POPTR_0014s12970 POPTR_0020s00220 | |
| Cre14.g617600 | Deg9 Deg9 | A8HNV3 | 619 | PD-PDZ-PDZ | At5g36950 Os05g0417100 Pp1s55_7V5.1 POPTR_0008s07940 | |
| Cre01.g013300 | Deg10 - | - | 739 | PD-PDZ-PDZ | At5g36950 Os05g0417100 Pp1s55_7V5.1 POPTR_0008s07940 | |
| Cre12.g548200 | - - | A8IYE3 (fragment) | 1249 | NT-PD | At1g28320 Os05g0497700 Pp1s196_28V2.1 POPTR_0004s04650 POPTR_0011s05510 | |
| Cre07.g332050 | - - | A8IGX3 (fragment) | 284 | PD | n.a. – not a Deg? | |
| Cre13.g579900 | - - | - | 415 | PDia-PDZ-PDZ | n.a. | |
| Cre03.g203730 | - CrDegO | A8IXF5 | 789 | PDia-PDZ | n.a. | |
| Cre38.g785300 | - | A8JG98 | 319 | PDia | n.a. |
a According to the Phytozome v7.0 database (http://www.phytozome.net/). b First name according the Phytozome v7.0 database, second name according to UniprotKB (http://www.uniprot.org/). c If more than one protein entry was present, the different versions were analyzed by the HHPred platform (http://toolkit.tuebingen.mpg.de/hhpred/), and the one with intact protease domain and (if present) PDZ domain(s) was considered here. Sequences used in this study are supplied as Supplementary material (Additional file 1). d According to the HHPred platform. Abbreviations: aa, amino acids; n.a., not available; NT, elongated N-terminus; PD, potantially active protease domain; PD(1/2), truncated protease domain, probably proteolytically inactive; PDia, inactive protease domain (i.e. at least one residue of the catalytic triad is mutated, or protease domain is incomplete); PDZ, PDZ domain. e Model is probably not correct, not supported by EST, repetetive stretches of single amino acids.
Figure 1Maximum likelihood phylogenetic tree of Deg/HtrA proteases in selected plant species. Following plant species were investigated: Arabidopsis thaliana, Oryza sativa, Populus trichocarpa, Physcomitrella patens, Chlamydomonas reinhardtii, and the cyanobacterium Synechocystis sp. PCC6803. Phylogenetic tree labeled labeled with the new names as suggested by this study. Filled circles indicated a bootstrap support (100 replicates) of > 90%, empty circles indicate a bootstrap support of > 70%. Additionally, the domain arrangement representative for proteases from each group is indicated. Deg/HtrA proteases from clade I contain one protease domain (oval shapes) and one PDZ domain (diamonds), with the exception of Deg5 proteases, which possess a protease domain only. Proteases from clade II contain an additional PDZ domain, clade III gathers proteases with one active (oval shape) and one inactive (discontinous oval shape) protease domain and four PDZ domains, whereas enzymes from clade IV contain a single protease domain, which is shifted toward the C-terminus.
Conservation of Deg/HtrA family members among photosynthetic organisms
| + | + | + | 1.1, 1.2, 1.3, 1.4 | 1.1, 1.2, 1.3 | |
| + | 2.1, 2.2 | + | + | + | |
| Deg3 | + | - | - | - | - |
| Deg4 | + | - | - | - | - |
| + | + | + | + | + | |
| Deg6 | + | - | - | - | - |
| + | 7.1, 7.2, 7.3 | + | 7.1, 7.2 | + | |
| + | + | + | + | + | |
| + | 9.1, 9.2 | 9.1, 9.2 | 9.1, 9.2 | 9.1, 9.2 | |
| + | + | + | + | + | |
| Deg11 | + | - | - | - | - |
| Deg12 | + | - | - | - | - |
| Deg13 | + | - | - | - | - |
| Deg14 | + | + | + | + | - |
| + | 15.1, 15.2 | + | + | + | |
| Deg16 | + | - | - | - | - |
| Deg17 | - | 17.1, 17.2, 17.3 | - | - | - |
The presence of a protease in a particular organism is indicated by +, its absence by -. If more than one isoform is present, the names are given. Proteases of the “core set” are depicted in bold. At, Arabidopsis thaliana; Cr, Chlamydomonas reinhardtii; Os, Oryza sativa; Pp, Physcomitrella patens; Pt, Populus trichocarpa.