| Literature DB >> 32326640 |
Nina Gubina1, Dominique Leboeuf2, Konstantin Piatkov3, Maxim Pyatkov4.
Abstract
Caspases are proteases conserved throughout Metazoans and responsible for initiating and executing the apoptotic program. Currently, there are over 1800 known apoptotic caspase substrates, many of them known regulators of cell proliferation and death, which makes them attractive therapeutic targets. However, most caspase substrates are by-standers, and identifying novel apoptotic mediators amongst all caspase substrates remains an unmet need. Here, we conducted an in silico search for significant apoptotic caspase targets across different species within the Vertebrata subphylum, using different criteria of conservation combined with structural features of cleavage sites. We observed that P1 aspartate is highly conserved while the cleavage sites are extensively variable and found that cleavage sites are located primarily in coiled regions composed of hydrophilic amino acids. Using the combination of these criteria, we determined the final list of the 107 most relevant caspase substrates including 30 novel targets previously unknown for their role in apoptosis and cancer. These newly identified substrates can be potential regulators of apoptosis and candidates for anti-tumor therapy.Entities:
Keywords: N-degron pathway; apoptosis; caspases; cleavage site; conservation; evolution; regulation
Mesh:
Substances:
Year: 2020 PMID: 32326640 PMCID: PMC7225963 DOI: 10.3390/biom10040612
Source DB: PubMed Journal: Biomolecules ISSN: 2218-273X
Clarification note for Table S1 1.
| Uniprot ID | Original Gene Names | Gene Name in |
|---|---|---|
| Q99666 |
|
|
| Q13748 |
|
|
| P62805 |
|
|
1 There are 3 Uniprot IDs with excessive numbers of corresponding genes. We decided to leave one gene name for each of the Uniprot IDs.
Figure 1Selecting orthologous targets and representative proteomes. (a) Scheme of the 60 amino acid query sequences for the pBLAST search to identify human apoptotic caspase cleavage site orthologs. P4–P4′: positions of amino acids within the cleavage site. D: key aspartate in the P1 position. Red lightning: scissile bond between P1 and P1′ amino acids. Green elements: caspase cleavage site. (b) Distribution of species by number of sequenced proteins in a non-redundant database and number of matches with human caspase cleavage sites. Each point corresponds to one species. The group above the red dashed line embraces species with well characterized proteomes. The red dashed line represents the threshold for the number of annotated proteins used to separate species with a well characterized proteome (log10(total N of proteins) > 3.9, or total N of proteins > 8000).
Column names and descriptions for Table S2.
| Column Name | Description |
|---|---|
| h uniprot id | Human Uniprot ID |
| h protein symbol | Human gene symbol (Uniprot) |
| h gene symbol | Human protein symbol (Uniprot) |
| v gene name | Vertebrate gene name |
| v accession | NCBI accession number of the vertebrate protein sequence |
| class | Class |
| order | Order |
| family | Family |
| genus | Genus |
| species | Species |
| n matches with humans | Number of vertebrate caspase targets in a species’s proteome |
| total n proteins | Total number of proteins in a species’s proteome |
| h 60 aa | Human 60-amino acid sequence |
| v 60 aa | Vertebrate 60-amino acid sequence |
| v pblast evalue | pBLAST e-value for the vertebrate 60-amino acid sequences |
| v pblast score | pBLAST score for the vertebrate 60-amino acid sequences |
| v pblast identity | pBLAST identity for the vertebrate 60-amino acid sequences |
| h cleavage site | Human cleavage site |
| v cleavage site | Vertebrate cleavage site |
| v p1 aa | Vertebrate P1 amino acid |
| hamdist | Hamming distance estimate between human and vertebrate cleavage sites |
| h p1 pr | P1′ amino acid in the human cleavage site |
| h p1 pr effect | Effect of P1′ amino acid in the human cleavage site |
| v p1 pr | P1′ amino acid in the vertebrate cleavage site |
| v p1 pr effect | Effect of P1′ amino acid in the vertebrate cleavage site |
| v hydr index 20 | Sum of hydrophobicity estimates for the central 20 amino acids in vertebrate 60-amino acid sequences |
| v hydr prev | Hydrophobicity prevalence values in vertebrate 60-amino acid sequences |
Distribution of amino acids in the P1 position.
| P1 Amino Acid | Number of Hits | % of Total |
|---|---|---|
| D | 562,772 | 92.19 |
| E | 28,423 | 4.66 |
| N | 5402 | 0.88 |
| G | 4771 | 0.78 |
| A | 1836 | 0.30 |
| S | 1772 | 0.29 |
| Gap (-) | 1331 | 0.22 |
| V | 698 | 0.11 |
| H | 683 | 0.11 |
| T | 632 | 0.10 |
| K | 564 | 0.09 |
| Q | 535 | 0.09 |
| Y | 221 | 0.04 |
| P | 186 | 0.03 |
| R | 137 | 0.02 |
| I | 126 | 0.02 |
| C | 121 | 0.02 |
| L | 103 | 0.02 |
| Unknown | 45 | 0.01 |
| F | 33 | 0.01 |
| M | 33 | 0.01 |
| D or N | 2 | 0.00 |
| W | 1 | 0.00 |
Distribution of orthologous caspase targets by cleavage site conservation and presence of aspartate in the P1 position.
| P1 Amino Acid | Hamming Distance Estimates 1 | Numberof Hits | % of Total |
|---|---|---|---|
| 0 | 346,816 | 56.82 | |
| 1 | 111,584 | 18.28 | |
| 2 | 57,620 | 9.44 | |
| Aspartate | 3 | 27,071 | 4.43 |
| 4 | 12,265 | 2.01 | |
| 5 | 5380 | 0.88 | |
| 6 | 1750 | 0.29 | |
| 7 | 286 | 0.05 | |
| 1 | 8894 | 1.46 | |
| 2 | 11,338 | 1.86 | |
| 3 | 10,704 | 1.75 | |
| Not | 4 | 7816 | 1.28 |
| aspartate | 5 | 5074 | 0.83 |
| 6 | 2650 | 0.43 | |
| 7 | 978 | 0.16 | |
| 8 | 201 | 0.03 |
1 Indication of conservation: 0 = sequences are identical; 8 = all eight amino acids are different.
Figure 2Secondary structure of the human caspase cleavage site surroundings. (a) Weblogo representation of the frequency of secondary structure elements surrounding the cleavage site (positions P4-P4′). One stack of letters corresponds to one amino acid. Secondary structure elements are abbreviated as follows: C: coils, H: helices, E: beta-strands. (b) Distribution of coil prevalence values of human caspase cleavage sites, calculated as described in the text.
Figure 3Hydrophobic properties of vertebrate cleavage sites. (a) Distribution of the sums of hydrophobicity indices (HI) for the 20 amino acids surrounding P1 aspartate in all caspase substrates. PolyR: polyarginine-20 (HI = −298.4), hydrophilic limit of the scale. PolyI: polyisoleucine-20 (HI = 98.4), hydrophobic limit of the scale. PolyG: polyglycine-20 used as a reference to designate a borderline between hydrophobic and hydrophilic peptides. It was chosen because its HI (0.94) is the closest to the median HI among all 20 amino acids (0.4). (b) Distribution of hydrophobicity prevalence values in all caspase substrates, calculated as described in the text.
Distribution of amino acids in the P1′ position and the nature of the amino acid according to the Arg/N-degron pathway.
| P1′ Amino Acid | Nature of aa 1 | Number of Hits in Humans | % in Humans | Number of Hits in Vertebrates | % in Vertebrates |
|---|---|---|---|---|---|
| G | stab | 1080 | 33 | 171,377 | 30 |
| S | stab | 862 | 26 | 131,638 | 23 |
| A | stab | 505 | 15 | 80,871 | 14 |
| T | stab | 92 | 3 | 20,011 | 4 |
| L | destab | 91 | 3 | 18,102 | 3 |
| V | stab | 83 | 3 | 17,729 | 3 |
| F | destab | 80 | 2 | 13,377 | 2 |
| Y | destab | 74 | 2 | 14,295 | 3 |
| N | destab | 65 | 2 | 18,225 | 3 |
| D | destab | 53 | 2 | 11,274 | 2 |
| M | stab | 50 | 2 | 8552 | 2 |
| E | destab | 46 | 1 | 9115 | 2 |
| K | destab | 41 | 1 | 7301 | 1 |
| I | destab | 40 | 1 | 7507 | 1 |
| C | destab | 36 | 1 | 6986 | 1 |
| H | destab | 31 | 1 | 6596 | 1 |
| P | stab | 28 | 1 | 9298 | 2 |
| Q | destab | 22 | 1 | 3929 | 1 |
| R | destab | 22 | 1 | 6320 | 1 |
| W | destab | 12 | 0 | 2754 | 0 |
1 stab: stabilizing after caspase cleavage, destab: destabilizing effect, according to the Arg/N-degron pathway.
Validation of conservation criteria based on the set of reference caspase targets with proven proapoptotic activity 1.
| Criteria | ||||
|---|---|---|---|---|
| Median Hamming Distance < 2 | Coil Prevalence > 1 | Hydrophobicity Prevalence < 1 | P1′ Destabilizing in > 50% of Orthologs | |
| Number of human targets selected by threshold | 2726 | 1454 | 1457 | 554 |
| Number of reference targets selected by threshold | 19 | 17 | 16 | 4 |
| Percentage of human targets selected by threshold | 91.48 | 48.79 | 48.89 | 18.59 |
| Percentage of reference targets selected by threshold | 79.17 | 70.83 | 66.67 | 16.67 |
1 The set of reference targets included 24 cleavage sites within 11 proteins: CASP2, CASP3, CAPS6, CASP7, CASP8, CASP9, PARP1, PARP2, RIPK1, TRAF1, CAD (Table S1).
Figure 4Pathway enrichment of the 107 highly conserved caspase targets. Pathway analysis using the DAVID free online software, with default parameters and an EASE threshold of <0.05 was followed by annotation using two databases: Gene Ontology [32] and Uniprot [22]. Proteins were classified in terms of Gene Ontology—Cellular Component (red), Gene Ontology—Molecular Function (green), and Uniprot Keywords (blue). The p-value for annotation selection was adjusted by Benjamini–Hochberg correction. Annotations with a corrected p-value < 0.05 were considered significant and were presented in the plots.
Novel apoptotic mediators not previously related to cancer and apoptosis.
| Human Gene Name | Human Protein Symbol | Human Uniprot ID | Human Cleavage Site | Nature of P1′ 1 |
|---|---|---|---|---|
| actin, beta like 2( | ACTBL | Q562R1 | ELPDGQVI | Stab |
| actin gamma 1( | ACTG | P63261 | ELPDGQVI | Stab |
| ataxin 2 like ( | ATX2L | Q8WWM7 | LESDMSNG | Stab |
| chaperonin containing TCP1 subunit 5 ( | TCPE | P48643 | VDKDGDVT | Stab |
| EH domain containing 4 ( | EHD4 | Q9H223 | CDCDGMLD | Stab |
| enolase 1 ( | ENOA | P06733 | YGKDATNV | Stab |
| enolase 3 ( | ENOB | P13929 | YGKDATNV | Stab |
| DiGeorge syndrome critical region gene 14 ( | ESS2 | Q96DF8 | VGPDGKEL | Stab |
| G elongation factor mitochondrial 2 ( | RRF2M | Q969S9 | TVTDFMAQ | Destab |
| heterogeneous nuclear ribonucleoprotein A3 ( | ROA3 | P51991 | SREDSVKP | Stab |
| heat shock protein family D (Hsp60) member 1 ( | CH60 | P10809 | VGYDAMAG | Stab |
| microtubule associated protein 1S ( | MAP1S | Q66K74 | DRVDAVLV | Stab |
| N(alpha)-acetyltransferase 15, NatA auxiliary subunit ( | NAA15 | Q9BXJ9 | HEADTANM | Stab |
| asparaginyl-tRNA synthetase ( | SYNC | O43776 | KKEDGTFY | Stab |
| 3′-phosphoadenosine 5′-phosphosulfate synthase 1 ( | PAPS1 | O43252 | TGIDSEYE | Stab |
| phosphatase and actin regulator 2 ( | PHAR2 | O75167 | DSPDYDRR | Destab |
| POTE ankyrin domain family member F ( | POTEF | A5A3E0 | ELPDGQVI | Stab |
| proteasome 26S subunit, ATPase 3 ( | PRS6A | P17980 | QEEDGANI | Stab |
| proteasome 26S subunit, ATPase 3 ( | PRS6A | P17980 | DILDPALL | Stab |
| glutamine rich 1 ( | QRIC1 | Q2TAL8 | LTVDSAHL | Stab |
| RNA binding motif protein 22 ( | RBM22 | Q9NW64 | SNSDGTRP | Stab |
| RNA binding motif protein 39 ( | RBM39 | Q14498 | ERTDASSA | Stab |
| replication factor C subunit 1( | RFC1 | P35251 | DEVDGMAG | Stab |
| splicing factor 3a subunit 1 ( | SF3A1 | Q15459 | VTWDGHSG | Stab |
| signal recognition particle 54 ( | SRP54 | P61011 | QELDSTDG | Stab |
| transducin beta like 1X-linked ( | TBL1X | O60907 | TVFDGRPI | Stab |
| transducin beta like 1, Y-linked ( | TBL1Y | Q9BQ87 | MEIDGDVE | Stab |
| tubulin alpha 3c ( | TBA3C | P0DPH7 | VKCDPRHG | Stab |
| tubulin alpha 4a ( | TBA4A | P68366 | IQPDGQMP | Stab |
| zinc finger CCCH-type containing 15 ( | ZC3HF | Q8WU90 | VYIDARDE | Stab |
1 stab: stabilizing after caspase cleavage, destab: destabilizing effect, according to the Arg/N-degron pathway.