Joseph L Johnson, Emily Chambers, Keerthi Jayasundera.
Abstract
BACE1, a membrane-bound aspartyl protease implicated in Alzheimer's disease, is the first protease to cut the amyloid precursor protein, a step that leads to the generation of amyloid-β and its aggregation into senile plaques, a hallmark feature of the disease. Few other native BACE1 substrates have been identified despite its relatively loose substrate specificity. We report a bioinformatics approach that identifies several putative BACE1 substrates. Our algorithm correctly predicted cleavage sites for 70% of known BACE1 substrates, and validation against substrates identified in a recent BACE1 proteomics study gave a similar 70% success rate. Having validated the approach with known substrates, we report putative cleavage recognition sequences within 962 proteins, which can be explored using in vivo methods. Approximately 900 of these proteins have not previously been identified or implicated as BACE1 substrates. Gene ontology cluster analysis of the putative substrates showed enrichment in proteins involved in immune system processes and in cell surface protein-protein interactions.
Keywords: Alzheimer’s disease; BACE1; bioinformatics; protease; protease substrates
Year: 2013 PMID: 25288897 PMCID: PMC4147752 DOI: 10.4137/BECB.S8383
Source DB: PubMed Journal: Biomed Eng Comput Biol ISSN: 1179-5972
Predicted BACE1 cut sites for known substrates.
| UniProt ID | Protein | Topology | Predicted site | Predicted recognition sequence | Score |
|---|---|---|---|---|---|
| P05067 | APP | Type I | 13 | LVFFAEDV | 8.44E-03 |
| | | | 33 | EVKMDAEF | 1.02E-03 |
| | | | 41 | NIKTEEIS | 6.04E-05 |
| Q06481 | APLP2 | Type I | 9 | REDFSLSS | 1.20E-03 |
| | | | 30 | MIFNAERV | 7.23E-05 |
| | | | 44 | DENMVIDE | 3.55E-03 |
| P27930 | IL-1R-2 | Type I | 16 | TLSFQTLR | 1.02E-03 |
| Q02297 | NRG1 | Type I | 11 | QEKAEELY | 6.14E-05 |
| P56975 | NRG3 | Type I | 11 | FMESEEVY | 2.07E-05 |
| | | | 13 | IEFMESEE | 2.52E-04 |
| | | | 14 | GIEFMESE | 4.50E-02 |
| Q8IWT1 | VGSCβ4 | Type I | 15 | TIFLQVVD | 3.58E-01 |
| Q9NY72 | VGSCβ3 | Type I | 44 | EFEFEAHR | 1.09E-05 |
| Q07699 | VGSCβ1 | Type I | 21 | EHNTSVVK | 1.03E-04 |
| | | | 28 | LLFFENYE | 1.09E-05 |
| | | | 29 | RLLFFENY | 1.73E-05 |
| Q14242 | PSGL-1 | Type I | 21 | ASNLSVNY | 8.30E-05 |
| O60939 | VGSCβ2 | Type I | None | | |
| Q9H7Z7 | mPGES-2 | Type I | None | | |
| P51693 | APLP1 | Type I | None | | |
| Q07954 | LRP1 | Type I | None | | |
| P15907 | ST6Gal I | Type II | None | | |
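
Several substrates in the table above carry more than one predicted site. The following is a minimal sketch, not the authors' code, of how such rows might be reduced to one top-scoring site per substrate. The tuple layout and the function name `best_site_per_substrate` are assumptions made for illustration, and treating a higher score as a stronger prediction is also an assumption (suggested only by the descending score order of the novel-substrate table). The values are copied from the table.

```python
# Rows copied from the known-substrate table:
# (UniProt ID, protein, site, recognition sequence, score).
ROWS = [
    ("P05067", "APP", 13, "LVFFAEDV", 8.44e-03),
    ("P05067", "APP", 33, "EVKMDAEF", 1.02e-03),
    ("P05067", "APP", 41, "NIKTEEIS", 6.04e-05),
    ("P56975", "NRG3", 11, "FMESEEVY", 2.07e-05),
    ("P56975", "NRG3", 13, "IEFMESEE", 2.52e-04),
    ("P56975", "NRG3", 14, "GIEFMESE", 4.50e-02),
    ("Q07699", "VGSCβ1", 21, "EHNTSVVK", 1.03e-04),
    ("Q07699", "VGSCβ1", 28, "LLFFENYE", 1.09e-05),
    ("Q07699", "VGSCβ1", 29, "RLLFFENY", 1.73e-05),
]


def best_site_per_substrate(rows):
    """Return {UniProt ID: (protein, site, sequence, score)} for the
    highest-scoring row of each substrate (hypothetical helper)."""
    best = {}
    for uid, protein, site, seq, score in rows:
        # Keep the row only if it beats the best score seen so far.
        if uid not in best or score > best[uid][3]:
            best[uid] = (protein, site, seq, score)
    return best


if __name__ == "__main__":
    for uid, (protein, site, seq, score) in best_site_per_substrate(ROWS).items():
        print(f"{uid}\t{protein}\t{site}\t{seq}\t{score:.2E}")
```

Keeping the result keyed by UniProt ID makes it straightforward to join against other annotation sources later.
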
Predicted cut sites and scores for novel putative BACE1 substrates from the human proteome.
| UniProt ID | Protein | Predicted site | Predicted recognition sequence | Score |
|---|---|---|---|---|
| Q495A1 | T cell immunoreceptor with Ig and ITIM domains | 21 | RIFLEVLE | 1.03E+00 |
| O95470 | Sphingosine-1-phosphate lyase 1 | 28 | EPYLEILE | 8.08E-01 |
| P12314 | High affinity immunoglobulin gamma Fc receptor I | 14 | ELELQVLG | 5.50E-01 |
| Q9BZM6 | NKG2D ligand 1 | 22 | EEFLMYWE | 4.48E-01 |
| Q9NP60 | X-linked interleukin-1 receptor accessory protein-like 2 | 46 | EVELALIF | 2.07E-01 |
| Q13445 | Transmembrane emp24 domain-containing protein 1 | 49 | EEMLDVKM | 1.58E-01 |
| Q5T7P8 | Synaptotagmin-6 | 46 | QEALAVLA | 1.16E-01 |
| Q6ZRP7 | Sulfhydryl oxidase 2 | 8 | GVDFSSLD | 1.09E-01 |
| A0PJX4 | Protein shisa-3 homolog | 50 | PEDFDTLD | 9.03E-02 |
| Q96A26 | Protein FAM162A | 17 | TVSLEMLD | 7.63E-02 |
| | UDP-glucuronosyltransferase 1 family (combined) | 37 | PLDLAVFW | 7.42E-02 |
| P60509 | HERV-R(b)_3p24.3 provirus ancestral env polyprotein | 40 | NISLALED | 7.41E-02 |
| Q4ADV7 | Protein RIC1 homolog | 35 | DENFSTLS | 6.68E-02 |
| Q3SXP7 | Uncharacterized protein KIAA1644 | 26 | ETEFQAVM | 6.15E-02 |
| O95140 | Mitofusin-2 | 32 | QEEFMVSM | 6.07E-02 |
| Q96FB5 | UPF0431 protein C1orf66 | 16 | PLNLAALQ | 6.01E-02 |
| O75578 | Integrin alpha-10 | 15 | ESLLEVVQ | 5.55E-02 |
| Q15363 | Transmembrane emp24 domain-containing protein 2 | 21 | QEYMEVRE | 4.86E-02 |
| Q5DX21 | Immunoglobulin superfamily member 11 | 19 | LLDLQVIS | 4.74E-02 |
| O43699 | Sialic acid-binding Ig-like lectin 6 | 17 | QISLSLFV | 4.58E-02 |
| O95971 | CD160 antigen | 35 | GHFFSILF | 4.32E-02 |
| O60499 | Syntaxin-10 | 37 | GIMLDAFA | 4.31E-02 |
| Q6ZNB6 | NF-X1-type zinc finger protein NFXL1 | 35 | QAELEAFE | 3.98E-02 |
| O95866 | Protein G6b | 48 | ELLLSAGD | 3.68E-02 |
| Q86UW2 | Organic solute transporter subunit beta | 16 | QELLEEML | 3.62E-02 |
| P26006 | Integrin alpha-3 | 15 | DIDSELVE | 3.44E-02 |
| Q9Y639 | Neuroplastin | 36 | IVNLQITE | 3.32E-02 |
| Q6UWI2 | Prostate androgen-regulated mucin-like protein 1 | 25 | LIDMETTT | 3.01E-02 |
| A2A2Y4 | FERM domain-containing protein 3 | 45 | FEDLEADE | 3.00E-02 |
| Q6P7N7 | Transmembrane protein 81 | 21 | EVNLDSYS | 2.88E-02 |
| A6NFR6 | Putative uncharacterized protein C5orf60 | 24 | AVDMDILF | 2.81E-02 |
| Q8N386 | Leucine-rich repeat-containing protein 25 | 20 | QHNLSAFL | 2.76E-02 |
| Q9HBW1 | Leucine-rich repeat-containing protein 4 | 12 | QTSLDEVM | 2.68E-02 |
| Q9Y5Y7 | Lymphatic vessel endothelial hyaluronic acid receptor 1 | 32 | EVFMETST | 2.65E-02 |
| P0C6S8 | Leucine-rich repeat neuronal protein 2 | 40 | DTYFATLT | 2.56E-02 |
| Q6NUS6 | Tectonic-3 | 43 | EVSLTTLV | 2.56E-02 |
| Q8IYS5 | Osteoclast-associated immunoglobulin-like receptor | 48 | EFFLEEVT | 2.47E-02 |
| Q9H5V8 | CUB domain-containing protein 1 | 16 | DLLFSVTL | 2.34E-02 |
| Q15399 | Toll-like receptor 1 | 41 | QVSSEVLE | 2.29E-02 |
| Q9Y2C9 | Toll-like receptor 6 | 41 | QVSSEVLE | 2.29E-02 |
| Q13651 | Interleukin-10 receptor subunit alpha | 47 | HENFSLLT | 2.28E-02 |
| Q9Y5I0 | Protocadherin alpha-13 | 34 | TVLLSLVE | 2.09E-02 |
| Q68DV7 | RING finger protein 43 | 28 | EKLMEFVY | 2.08E-02 |
| Q6UX41 | Butyrophilin-like protein 8 | 47 | EISLTVQE | 1.86E-02 |
| Q15262 | Receptor-type tyrosine-protein phosphatase kappa | 45 | NIYFQAMS | 1.85E-02 |
| Q5TH69 | Brefeldin A-inhibited guanine nucleotide-exchange protein 3 | 14 | DLLFELLR | 1.76E-02 |
| Q9Y5F3 | Protocadherin beta-1 | 21 | EPYLQFQD | 1.63E-02 |
| P29376 | Leukocyte tyrosine kinase receptor | 34 | QAELQLAE | 1.60E-02 |
| Q86XX4 | Extracellular matrix protein FRAS1 | 17 | NLEMQELA | 1.56E-02 |
| P60507 | HERV-F(c)1_Xq21.33 provirus ancestral Env polyprotein | 34 | ETSLLTLD | 1.40E-02 |
| Q5SWX8 | Protein odr-4 homolog | 47 | IEDLEIAE | 1.37E-02 |
| Q9H4D0 | Calsyntenin-2 | 49 | EFNLEVSI | 1.35E-02 |
| Q9P246 | Stromal interaction molecule 2 | 44 | EPSFMISQ | 1.27E-02 |
| A6BM72 | Multiple epidermal growth factor-like domains protein 11 | 25 | QAALMMEE | 1.22E-02 |
| Q9UQV4 | Lysosome-associated membrane glycoprotein 3 | 23 | DVQLQAFD | 1.17E-02 |
| Q6IEE7 | Transmembrane protein 132E | 8 | LTDLEIGM | 1.13E-02 |
| Q96KV6 | Butyrophilin subfamily 2 member A3 | 50 | DSLFMVTT | 1.11E-02 |
| Q96MU8 | Kremen protein 1 | 48 | QANLSVSA | 1.08E-02 |
| P13598 | Intercellular adhesion molecule 2 | 15 | PKMLEIYE | 1.06E-02 |
| Q13421 | Mesothelin | 31 | QDDLDTLG | 1.05E-02 |
| Q01638 | Interleukin-1 receptor-like 1 | 34 | EEDLLLQY | 1.04E-02 |
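
Some putative substrates in the table above share an identical recognition sequence and score (for example, Toll-like receptors 1 and 6). The sketch below shows one way such shared sequences might be detected; it is an illustration under assumed names (`HITS`, `shared_sequences`), not part of the published algorithm. The rows are copied from the table.

```python
from collections import defaultdict

# Excerpt copied from the table above:
# (UniProt ID, protein, recognition sequence, score).
HITS = [
    ("Q15399", "Toll-like receptor 1", "QVSSEVLE", 2.29e-02),
    ("Q9Y2C9", "Toll-like receptor 6", "QVSSEVLE", 2.29e-02),
    ("Q13651", "Interleukin-10 receptor subunit alpha", "HENFSLLT", 2.28e-02),
    ("Q9Y5I0", "Protocadherin alpha-13", "TVLLSLVE", 2.09e-02),
]


def shared_sequences(hits):
    """Map each recognition sequence found in more than one protein
    to the list of UniProt IDs carrying it (hypothetical helper)."""
    by_seq = defaultdict(list)
    for uid, _protein, seq, _score in hits:
        by_seq[seq].append(uid)
    # Keep only sequences that occur in at least two proteins.
    return {seq: ids for seq, ids in by_seq.items() if len(ids) > 1}


if __name__ == "__main__":
    print(shared_sequences(HITS))
```

Grouping by sequence in this way can help flag paralogs that would be counted as separate hits even though they rest on a single predicted motif.
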
Gene ontology cluster analysis of putative BACE1 substrates from the bioinformatics analysis.
| Enrichment score | Annotation cluster terms |
|---|---|
| 85.1 | Immunoglobulin domain (230) |
| 72.8 | Receptor (302), signal transducer (314) |
| 61.5 | Cell adhesion (209), cadherin (73), cation binding (186) |
| 35.6 | Fibronectin type III (76) |
| 24.2 | Immune response (108), immune system process (145), response to stimulus (232) |
| 13.8 | Integrin mediated signaling (27), regulation of actin cytoskeleton (31) |
| 13.3 | Cytokine binding (35), cytokine-cytokine receptor interactions (48), growth factor binding (32) |
| 11.7 | Leucine-rich repeat (51) |
Predicted BACE1 cut sites for substrates identified in the proteomics study by Hemming et al. [32].
| UniProt ID | Protein | Topology | Predicted site | Predicted recognition sequence | Score |
|---|---|---|---|---|---|
| P05067 | APP | Type I | 13 | LVFFAEDV | 8.44E-03 |
| | | | 33 | EVKMDAEF | 1.02E-03 |
| | | | 41 | NIKTEEIS | 6.04E-05 |
| Q06481 | APLP2 | Type I | 9 | REDFSLSS | 1.20E-03 |
| | | | 30 | MIFNAERV | 7.23E-05 |
| | | | 44 | DENMVIDE | 3.55E-03 |
| P40189 | Interleukin-6 receptor beta chain | Type I | 17 | GPEFTFTT | 9.00E-05 |
| | | | 35 | DTLYMVRM | 2.17E-03 |
| P08581 | Hepatocyte growth factor receptor | Type I | 29 | NSELNIEW | 1.22E-05 |
| O75976 | Carboxypeptidase D | Type I | 22 | DAASSVVI | 4.17E-05 |
| P29317 | Ephrin type A receptor 2 | Type I | 15 | VHEFQTLS | 2.32E-03 |
| | | | 28 | QALTQEGQ | 1.43E-04 |
| P54764 | Ephrin type A receptor 4 | Type I | 44 | NPLTSYVF | 6.06E-05 |
| Q15375 | Ephrin type A receptor 7 | Type I | 16 | GKMFEATA | 5.55E-03 |
| | | | 25 | DVATLEEA | 2.89E-05 |
| | | | 40 | RAFTAAGY | 2.89E-05 |
| P54760 | Receptor protein tyrosine kinase variant EPHB4V1 | Type I | 16 | QTQLDESE | 6.70E-04 |
| | | | 41 | GASYLVQV | 1.20E-05 |
| Q92823 | Neuronal cell adhesion molecule 1 | Type I | 14 | GPAMASRQ | 2.46E-05 |
| P32004 | Neuronal cell adhesion molecule L1 | Type I | 24 | RHQMAVKT | 5.75E-05 |
| | | | 38 | DTDYEIHL | 2.83E-04 |
| | | | 40 | QPDTDYEI | 2.96E-04 |
| Q9NPR2 | Semaphorin-4B | Type I | 39 | GVADQTDE | 7.20E-05 |
| Q9C0C4 | Semaphorin-4C | Type I | 25 | EGYLVAVV | 1.17E-05 |
| Q9H2E6 | Semaphorin-6A | Type I | 31 | DPLGAVSS | 2.07E-05 |
| Q96JA1 | Leucine-rich repeats and immunoglobulin-like domains protein 1 | Type I | 51 | TPDNQLLV | 5.72E-05 |
| O94898 | Leucine-rich repeats and immunoglobulin-like domains protein 2 | Type I | 28 | HIYLNVIS | 1.28E-04 |
| Q6UXM1 | Leucine-rich repeats and immunoglobulin-like domains protein 3 | Type I | 51 | IVDSDVSD | 7.11E-05 |
| Q9Y6N7 | Roundabout homolog 1 | Type I | 9 | QISDVVKQ | 2.36E-05 |
| | | | 15 | QVSLAQQI | 4.11E-04 |
| | | | 47 | EVAASTGA | 1.99E-05 |
| Q9HCK4 | Roundabout homolog 2 | Type I | 47 | EVAASTSA | 1.75E-05 |
| Q7Z5N4 | Sidekick-1 | Type I | 17 | NPSTAVSA | 3.82E-05 |
| Q58EX2 | Sidekick-2 | Type I | 38 | GVSYDFRV | 3.74E-04 |
| | | | 52 | EVSSYTFS | 3.77E-05 |
| P15151 | Poliovirus receptor | Type I | 23 | QAELTVQV | 5.00E-04 |
| Q92673 | Sortilin-related receptor | Type I | 14 | GADASATQ | 2.07E-05 |
| | | | 22 | LLYDELGS | 1.02E-05 |
| | | | 23 | ILLYDELG | 1.89E-04 |
| | | | 46 | GHNYTFTV | 8.20E-05 |
| Q96JP9 | Protocadherin 21 (cadherin-related family member 1) | Type I | 15 | MAAFLIQT | 6.23E-05 |
| | | | 17 | SPMAAFLI | 1.45E-05 |
| | | | 26 | ITDAETLS | 2.20E-05 |
| | | | 39 | SPSFSTTA | 5.71E-05 |
| Q9Y5H2 | Protocadherin gamma A11 | Type I | 11 | LANSETSD | 3.08E-05 |
| | | | 20 | LADLGSLE | 3.89E-05 |
| | | | 22 | EVLADLGS | 9.31E-05 |
| | | | 40 | PPLSATVT | 1.54E-05 |
| Q9Y5G8 | Protocadherin gamma A5 | Type I | 8 | PEDLDLTL | 1.03E-02 |
| | | | 22 | DILADLGS | 7.29E-05 |
| Q9Y5G5 | Protocadherin gamma A8 | Type I | 9 | DPNDSSLT | 6.06E-05 |
| | | | 22 | EVLTELGS | 1.67E-03 |
| | | | 40 | PPLSATVT | 1.54E-05 |
| Q9UN70 | Protocadherin gamma C3 | Type I | 40 | EPSLSTTA | 3.88E-03 |
| Q86VZ4 | Low-density lipoprotein receptor-related protein 11 | Type I | 23 | EESYIFES | 3.20E-05 |
| O75096 | Low-density lipoprotein receptor-related protein 4 | Type I | 37 | RTSLEEVE | 9.63E-03 |
| | | | 47 | TTLYSSTT | 1.08E-05 |
| P31431 | Syndecan-4 | Type I | 43 | PKKLEENE | 1.67E-05 |
| MULTIPLE | HLA class I histocompatibility antigen (Combined) | Type I | 9 | EPSSQSTV | 3.00E-05 |
| Q13332 | Receptor-type tyrosine protein phosphatase S | Type I | 8 | IVDGEEGL | 2.82E-05 |
| Q13740 | CD166 antigen | Type I | 19 | DEADEISD | 1.29E-04 |
| Q12907 | Vesicular integral-membrane protein VIP36 | Type I | 52 | MKLFQLMV | 1.20E-03 |
| Q5VU97 | Cache domain containing 1 | Type I | 19 | DDMGAIGD | 2.22E-05 |
| Q9BYH1 | Seizure 6-like protein 2 | Type I | 12 | EAAAETSL | 1.25E-05 |
| | | | 19 | EHALEVAE | 5.97E-02 |
| | | | 51 | ELMGEVTI | 3.82E-03 |
| Q92859 | Neogenin | Type I | 45 | MPNDQASG | 1.60E-05 |
| Q6UVK1 | Chondroitin sulfate proteoglycan 4 | Type I | 9 | LSFLEANM | 3.03E-04 |
| | | | 12 | GGFLSFLE | 9.84E-05 |
| Q24JP5 | Transmembrane protein 132A | Type I | 8 | VTELELGM | 4.24E-04 |
| Q13145 | BMP and activin membrane-bound inhibitor homolog | Type I | 14 | QELTSSKE | 1.42E-04 |
| Q14126 | Desmoglein 2 | Type I | 10 | QHDSYVGL | 9.29E-05 |
| | | | 46 | EIQFLISD | 2.81E-03 |
| Q9NZV1 | Cysteine-rich motor neuron 1 protein | Type I | 45 | EVDLEVPL | 1.12E-03 |
| Q92896 | Golgi apparatus protein 1 | Type I | 13 | DLAMQVMT | 4.21E-03 |
| | | | 15 | FSDLAMQV | 1.88E-04 |
| Q9NR96 | Toll-like receptor 9 | Type I | 47 | DFLLEVQA | 1.55E-03 |
| | | | 48 | MDFLLEVQ | 8.73E-05 |
| | | | 49 | FMDFLLEV | 1.41E-04 |
| | | | 51 | AAFMDFLL | 3.58E-04 |
| O75509 | Tumor necrosis factor receptor superfamily member 21 | Type I | 37 | LPSMEATG | 3.14E-04 |
| P51654 | Glypican-3 | GPI | 31 | AYDLDVDD | 2.48E-05 |
| | | | 33 | ELAYDLDV | 1.30E-03 |
| | | | 35 | LAELAYDL | 3.35E-04 |
| P51693 | APLP1 | Type I | None | | |
| Q99523 | Sortilin | Type I | None | | |
| Q5ZPR3 | CD276 antigen | Type I | None | | |
| P19021 | Peptidyl-glycine alpha-amidating monooxygenase | Type I | None | | |
| Q6UX71 | Plexin domain-containing protein 2 | Type I | None | | |
| P35613 | Basigin | Type I | None | | |
| O95185 | Netrin receptor UNC5C | Type I | None | | |
| Q8TB96 | T-cell immunomodulatory protein | Type I | None | | |
| O14672 | Disintegrin and metalloproteinase domain-containing protein 10 | Type I | None | | |
| O43291 | Kunitz-type protease inhibitor 2 | Type I | None | | |
| O43493 | Trans-golgi network integral membrane protein 2 | Type I | None | | |
| Q12860 | Contactin-1 | GPI | None | | |
| Q8NFY4 | Semaphorin-6D | Type I | None | | |
| O00592 | Podocalyxin-like protein 1 | Type I | None | | |
| P56817 | Beta-secretase 1 | Type I | None | | |
| Q2VWP7 | Protogenin | Type I | None | | |
| P78504 | Jagged-1 | Type I | None | | |
| P11717 | Cation-independent mannose-6-phosphate receptor | Type I | None | | |
| Q86YC3 | Leucine-rich repeat-containing protein 33 | Type I | None | | |
| P52803 | Ephrin-A5 | GPI | None | | |
| O00461 | Golgi phosphoprotein 4 | Type II | None | | |
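
The success rate quoted in the abstract can be read as the fraction of listed substrates that receive at least one predicted site. The sketch below illustrates that calculation on a small excerpt of the table above. The function name `success_rate` and the dictionary layout are assumptions for illustration, and this definition of "success" is an interpretation rather than a statement of the authors' exact procedure.

```python
def success_rate(site_counts):
    """Fraction of substrates with at least one predicted site.

    site_counts maps a substrate ID to its number of predicted sites
    (0 where the table lists "None"). Hypothetical helper.
    """
    if not site_counts:
        raise ValueError("no substrates given")
    hits = sum(1 for n in site_counts.values() if n > 0)
    return hits / len(site_counts)


# Excerpt from the table above: number of predicted sites per entry.
EXCERPT = {
    "P05067": 3,  # APP
    "Q06481": 3,  # APLP2
    "P40189": 2,  # Interleukin-6 receptor beta chain
    "P51693": 0,  # APLP1
    "Q99523": 0,  # Sortilin
}

if __name__ == "__main__":
    print(success_rate(EXCERPT))  # 3 of 5 entries have a site
```

Counting the rows of the table as printed here, 47 of the 68 listed entries have at least one predicted site, which is about 69% and in line with the roughly 70% success rate reported in the abstract.
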