| Literature DB >> 27457357 |
Lu An1, Paul M Harrison2.
Abstract
BACKGROUND: Prions are proteinaceous particles that propagate alternative protein conformations/states to further copies of the same proteins, and are transmitted from cell-to-cell, and organism-to-organism. Prions are usually made of the beta-sheet rich assemblies termed amyloid. The original prion protein PrP causes devastating neurodegenerative disorders in humans and other mammals. In the yeast Saccharomyces cerevisiae, many prion-forming proteins have been observed; a prominent feature of these proteins is an intrinsically disordered domain rich in glutamine (Q) and asparagine (N) residues. Several human proteins that are yeast-prion-like, in particular those with poly-glutamine (poly-Q) expansions, have been experimentally implicated in human neurodegenerative diseases.Entities:
Keywords: Amyloid; Asparagine; Disease; Evolution; Glutamine; Human; Neurodegenerative; Poly-Q expansion; Prion; Yeast
Mesh:
Substances:
Year: 2016 PMID: 27457357 PMCID: PMC4960796 DOI: 10.1186/s13062-016-0134-5
Source DB: PubMed Journal: Biol Direct ISSN: 1745-6150 Impact factor: 4.540
Summary of the distribution of yeast-prion-like proteins in the eukaryotic domain
| Species short name | Species binomial name | TOTALS OF YEAST-PRION-LIKE PROTEINSa | CONSERVATION OF PRION-LIKE PROTEINS FROM YEASTb | CONSERVATION OF YEAST-PRION-LIKE PROTEINS FROM HUMANb | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| (1) TOTAL # of N/Q-rich proteins (NQPs) | (2) TOTAL # of prion predictions (yeast background) | (3) TOTAL # of prion predictions (organism’s background) | (4) # of YEAST proteins conserved as orthologs (total 5879) | (5) | (6) | (7) # of HUMAN proteins conserved as orthologs (total 101,933) | (8) | (9) | ||
| YEAST | Saccharomyces cerevisiae |
|
|
| -- | -- | -- | 4076 (4.0 %) | 9 (0.7 %) | 4 |
| TRICHOPLAX | Trichoplax adhaerens | 220 (1.9 %) | 308 (2.7 %) |
| 2223 (38 %) | 3 (1.0 %) | 5 | 9992 (9.8 %) | 24 (1.9 %) | 24 |
| SPONGE | Amphimedon queenslandica | 192 (1.4 %) | 217 (1.6 %) | 166 (1.2 %) | 2230 (38 %) | 1 (0.4 %) | 5 | 9565 (9.4 %) | 16 (1.3 %) | 27 |
|
| ||||||||||
| DROSOPHILA | Drosophila melanogaster |
|
|
| 2176 (37 %) | 26 (9.1 %) | 14 | 10138 (10 %) | 117 (9.2 %) | 72 |
| DAPHNIA | Daphnia pulex | 461 (1.5 %) | 509 (1.7 %) | 307 (1.0 %) | 2209 (38 %) | 5 (1.8 %) | 4 | 10609 (10 %) | 44 (3.5 %) | 30 |
| C_ELEGANS | Caenorhabditis elegans | 827 (2.6 %) |
| 732 (2.3 %) | 2044 (35 %) | 11 (3.9 %) | 14 | 8670 (8.5 %) | 41 (3.2 %) | 27 |
| OYSTER | Crassostrea gigas | 1059 (2.3 %) | 1414 (3.1 %) | 988 (2.1 %) | 2333 (40 %) | 7 (2.5 %) | 14 | 12934 (13 %) | 126 (9.9 %) | 89 |
| APLYSIA | Aplysia californica |
|
|
| 2310 (39 %) | 14 (4.9 %) | 12 | 11017 (11 %) | 35 (2.8 %) | 30 |
| LEECH | Helobdella robusta |
|
|
| 2159 (37 %) | 13 (4.6 %) | 7 | 10834 (11 %) | 65 (5.1 %) | 30 |
|
| ||||||||||
| SEA_URCHIN | Strongylocentrotus purpuratus | 1007 (2.8 %) | 909 (2.5 %) | 581 (1.6 %) | 2355 (40 %) | 12 (4.2 %) | 13 | 13596 (13 %) | 117 (9.2 %) | 59 |
| ZEBRAFISH | Danio rerio | 671 (1.6 %) | 805 (1.9 %) | 617 (1.5 %) | 2389 (41 %) | 5 (1.8 %) | 6 | 20283 (20 %) | 213 (17 %) | 143 |
| FUGU | Takifugu rubripes | 940 (2.0 %) | 912 (1.9 %) | 719 (1.5 %) | 2314 (39 %) | 4 (1.4 %) | 4 | 20340 (20 %) | 205 (17 %) | 129 |
| CHICKEN | Gallus gallus | 276 (1.7 %) | 227 (1.4 %) | 178 (1.1 %) | 2176 (37 %) | 8 (2.8 %) | 3 | 18441 (18 %) | 247 (19 %) | 144 |
| ANOLIS | Anolis caroliniensis | 798 (2.5 %) | 624 (1.9 %) | 468 (1.5 %) | 2311 (39 %) | 5 (1.8 %) | 3 | 22209 (22 %) | 320 (25 %) | 197 |
| MOUSE | Mus musculus | 1001 (1.8 %) | 675 (1.2 %) | 497 (0.9 %) | 2397 (41 %) | 6 (2 %) | 5 | 30987 (30 %) | 514 (41 %) | 314 |
| HUMAN | Homo sapiens | 1269 (1.2 %) | 1012 (1.0 %) | 766 (0.7 %) | 2399 (41 %) | 5 (1.8 %) | 3 | -- | -- | -- |
| OPOSSUM | Monodelphis domestica | 382 (1.7 %) | 298 (1.4 %) | 225 (1.0 %) | 2329 (40 %) | 7 (2.5 %) | 3 | 22205 (22 %) | 290 (23 %) | 197 |
| XENOPUS | Xenopus laevis | 348 (1.5 %) | 376 (1.7 %) | 317 (1.4 %) | 2299 (39 %) | 9 (3.2 %) | 4 | 19444 (19 %) | 226 (18 %) | 131 |
| LATIMERIA | Latimeria calumnae | 407 (1.7 %) | 415 (1.8 %) | 330 (1.4 %) | 2276 (39 %) | 7 (2.5 %) | 2 | 19756 (19 %) | 273 (22 %) | 145 |
| CIONA | Ciona intestinalis | 283 (1.8 %) | 491 (3.2 %) |
| 2213 (38 %) | 4 (1.4 %) | 5 | 11547 (11 %) | 72 (5.7 %) | 67 |
| HYDRA | Hydra vulgaris | 712 (3.2 %) | 489 (2.2 %) | 431 (1.9 %) | 2235 (38 %) | 14 (4.9 %) | 11 | 10605 (10 %) | 41 (3.2 %) | 36 |
| CHOANAFLAGELLATE | Monosiga brevicollis | 273 (3.0 %) | 134 (1.5 %) | 85 (0.9 %) | 2015 (34 %) | 4 (1.4 %) | 5 | 6632 (6.5 %) | 12 (0.9 %) | 7 |
| DICTYOSTELIUM | Dictyostelium discoideum |
|
|
| 2199 (37 %) | 65 (23 %) | 34 | 5839 (5.8 %) | 37 (2.9 %) | 31 |
| PLASMODIUM | Plasmodium falciparum |
|
|
| 1351 (23 %) | 23 (8.1 %) | 20 | 3207 (3.1 %) | 30 (2.4 %) | 25 |
| CHONDRUS | Chondrus crispus | 49 (0.5 %) | 93 (0.9 %) | 62 (0.6 %) | 1716 (29 %) | 6 (2 %) | 7 | 3790 (3.7 %) | 2 (0.2 %) | 5 |
|
| ||||||||||
| PHYSCOMITRELLA | Physcomitrella patens | 169 (0.5 %) | 323 (0.9 %) | 246 (0.7 %) | 2279 (39 %) | 5 (1.8 %) | 3 | 6450 (6.3 %) | 6 (0.5 %) | 5 |
| SELAGINELLA | Selaginella moellendorffii | 285 (0.8 %) | 243 (0.7 %) | 164 (0.5 %) | 2247 (38 %) | 1 (0.4 %) | 1 | 6370 (6.3 %) | 10 (0.8 %) | 4 |
| AMBORELLA | Amborella trichopoda | 171 (0.8 %) | 251 (1.2 %) | 164 (0.8 %) | 2288 (39 %) | 4 (1.4 %) | 4 | 6079 (6.0 %) | 13 (1.0 %) | 4 |
| BANANA | Musa acuminata | 421 (1.0 %) | 403 (1.0 %) | 295 (0.7 %) | 2281 (39 %) | 7 (2.5 %) | 4 | 6328 (6.2 %) | 5 (0.4 %) | 4 |
| RICE | Oryza sativa | 1118 (1.6 %) | 928 (1.4 %) | 540 (0.8 %) | 2264 (39 %) | 6 (2 %) | 8 | 6144 (6.0 %) | 9 (0.7 %) | 8 |
| ARABIDOPSIS | Arabidopsis thaliana | 342 (1.0 %) | 570 (1.7 %) | 412 (1.2 %) | 2300 (39 %) | 7 (2.5 %) | 7 | 6235 (6.2 %) | 11 (0.9 %) | 9 |
| CHLAMYDOMONAS | Chlamydomonas reinhardtii | 441 (3.0 %) | 180 (1.2 %) | 142 (1.0 %) | 1914 (33 %) | 1 (0.4 %) | 0 | 5235 (5.2 %) | 7 (0.6 %) | 2 |
| TRYPANOSOMA | Trypanosoma cruzi | 359 (1.8 %) | 225 (1.1 %) | 172 (0.9 %) | 1539 (26 %) | 8 (2.8 %) | 3 | 4032 (3.9 %) | 9 (0.7 %) | 13 |
| GUILLARDIA | Guillardia theta | 318 (1.3 %) | 238 (1.0 %) | 163 (0.7 %) | 2115 (36 %) | 4 (1.4 %) | 4 | 6409 (6.3 %) | 6 (0.5 %) | 5 |
aThe organisms are listed in the same order that they appear in the phylogenetic tree in Fig. 1. Organisms with percentages of NQPs in their proteomes >10 % are in bold underline text; those otherwise with >4 % NQPs in their proteomes are in bold italics. Where other organisms reach the level of the % prion predictions of these organisms, their totals of prion predictions are also given in bold italics. ‘Yeast background’ is the setting in the PLAAC program wherein the background composition in the yeast proteome is used for prion predictions, similarly for ‘Organism’s background’
bColumns (5), (6), (8) and (9) contain the following data
(5): # of YEAST N/Q-rich proteins conserved as N/Q-rich orthologs (total in YEAST =285), with the percentages this is of total YEAST NQPs in brackets
(6): # of YEAST prion predictions conserved as prion predictions (total in YEAST =186) (yeast background)
(8): # of HUMAN N/Q-rich proteins conserved as N/Q-rich orthologs (total in HUMAN =1269), with the percentages this is of total HUMAN NQPs in brackets
(9): # of HUMAN prion predictions conserved as prion predictions (total in HUMAN =1012) (yeast background)
Fig. 1Human yeast-prion-like proteins have limited conservation outside of deuterostomes. The first colour scale (ranging from magenta to cyan) indicates the percentage of N/Q-rich proteins (as determined using the LPS program, see Methods). The second colour scale (from green to red) indicates a weighted ratio that shows the degree of conservation of the human yeast-prion-like proteins. The legends for the colour scales are on the bottom of the figure. The formula for the weighted ratio is as follows: . The columns are those from Table 1. The value 1.2 is the percentage of NQPs in the human proteome
Orthologous proteins that are yeast-prion-like in both Saccharomyces cerevisiae and Human
| NQ-rich proteins (NQPs) | ||||
| Yeast gene ID | Yeast gene name | Descriptiona | Human orthologsb | # of species with NQP orthologsc |
| YLL013C | PUF3 | Protein of the mitochondrial outer surface; links the Arp2/3 complex with the mitochore during anterograde mitochondrial movement; also functions in mRNA degradation | PUM1 (ENSP00000257075, ENSP00000400141), PUM2 (ENSP00000409905) | 16 (30) |
| YGL237C | HAP2 | Subunit of the Hap2p/3p/4p/5p CCAAT-binding complex; complex is heme-activated and glucose-repressed; complex is a transcriptional activator and global regulator of respiratory gene expression | NFYA (ENSP00000229418, ENSP00000345702) | 12 (24) |
| YOR194C | TOA1 | TFIIA large subunit; involved in transcriptional activation, acts as antirepressor or as coactivator; required, along with Toa2p, for ribosomal protein gene transcription in vivo | GTF2A1 (ENSP00000409492, ENSP00000452454) | 10 (26) |
| YFL024C | EPL1 | Subunit of NuA4, an essential histone H4/H2A acetyltransferase complex; conserved region at N-terminus is essential for interaction with the NPC (nucleosome core particle); required for autophagy | EPC2 (ENSP00000258484) | 9 (17) |
| YNL251C | NRD1 | RNA-binding subunit of Nrd1 complex; complex interacts with exosome to mediate 3′-end formation of some mRNAs, snRNAs, snoRNAs, and CUTs; interacts with CTD of RNA pol II large subunit Rpo21p at phosphorylated Ser5 to direct transcription termination of non-polyadenylated transcripts | SCAF4 (ENSP00000382703) | 5 (7) |
| Prion predictions | ||||
| Yeast gene ID | Yeast gene name | Descriptiona | Human orthologs | # of species with orthologous prion predictionsb |
| YLR191W | PAS20 | Peroxisomal importomer complex component; integral peroxisomal membrane protein required for docking and translocation of peroxisomal matrix proteins | PEX13 (ENSP00000295030) | 15 (19) |
| YNL016W | PUB1 | Poly (A) + RNA-binding protein; abundant mRNP-component protein that binds mRNA and is required for stability of many mRNAs; component of glucose deprivation induced stress granules, involved in P-body-dependent granule assembly; protein abundance increases in response to DNA replication stress. The human ortholog associated with Welandar distal myopathy | TIA1 (ENSP00000404023) | 14 (23) |
| YLL013C | PUF3 |
| PUM1 (ENSP00000400141, ENSP00000257075) | 13 (30) |
aThese descriptions are adapted from text on the UniProt [37] and SGD databases [49]
bThe gene names are followed by the Ensembl protein names for the orthologous proteins
cTotal number of the species listed in Table 1 that contain orthologous prion predictions or NQPs, with the total number of orthologs (regardless of whether they have a prion-like domain or not) in brackets
Fig. 2Yeast-prion-like proteins are statistically enriched in human neurological/neurodegenerative diseases. Enrichments of yeast-prion-like proteins are depicted as flow charts. Four sets are shown: NQPs (counted for the LPS P-value threshold ≤1e-10); predictions by the program PAPA [29]; predictions by the program PLAAC [30] (using yeast background settings); the union of all yeast-prion-like proteins from all three sources. These programs were applied as described in Methods. The fraction of all genes, disease-linked genes, neurologically-linked genes and neurodegeneration-linked genes are listed in the boxes. P values for hypergeometric tests for enrichments are listed (all of the P values are enrichments, no depletions are discovered), relative to the previous set. Cases that might not be significant given a Holm-Bonferroni correction for multiple hypothesis testing are in orange type, otherwise significant cases are in red type. For enrichments in the neurodegenerative set, there are three P-values for comparison to disease-, neurological- and neurodegenerative-linked sets as background populations
Fig. 3Intrinsically disordered proteins are generally not enriched in neurological/neurodegenerative diseases. These are the same calculations as above for yeast-prion-like proteins, except for three sets of proteins that are intrinsically disordered in the human proteome. Three different thresholds for intrinsic disorder are examined, 30 % of the protein, 50 % and 70 %
Yeast-prion-like genes genetically linked to neurological diseases in humansa
| Ensembl gene ID | Name | NQP ?b | Prion prediction ?b | Neurodegenerative ?c | Diseases linked to gene (with OMIM numbers) |
|---|---|---|---|---|---|
| Nucleic-acid binding/Transcription factor (PC00171, PC00218) | |||||
| ENSG00000104973 | MED25, Mediator complex subunit 25 | X | -- | U | Charcot-Marie-Tooth disease, type 2B2, #605589 |
| ENSG00000112592 | TBP, TATA box binding protein | X | X | U | Spinocerebellar ataxia 17, #607136 |
| ENSG00000204842 | ATXN2, Ataxin 2 | X | -- | U | Amyotrophic lateral sclerosis, susceptibility to, 13, #183090; |
| Parkinson disease, late-onset, susceptibility to, #168600; | |||||
| Spinocerebellar ataxia 2, #183090 | |||||
| ENSG00000085224 | ATRX, Alpha thalassemia/mental retardtn. syndrome X-linked | X | X | -- | Mental retardation-hypotonic facies syndrome, X-linked, #309580 |
| ENSG00000089280 | FUS RNA binding protein | X | X | U | Amyotrophic lateral sclerosis 6, with or without frontotemporal dementia, #608030; |
| Tremor, hereditary essential, 4, #614782 | |||||
| ENSG00000100888 | CHD8, Chromodomain helicase DNA binding protein 8 | X | -- | -- | Autism, susceptibility to, 18, #615032 |
| ENSG00000160299 | PCNT, Pericentrin | X | -- | -- | Microcephalic osteodysplastic primordial dwarfism, type II, #210720 |
| ENSG00000182944 | EWSR1, EWS RNA-binding protein 1 | X | X | -- | Neuroepithelioma, #612219 |
| ENSG00000111752 | PHC1, Polyhomeotic homolog 1 | X | X | -- | Microcephaly 11, primary, autosomal recessive, #615414 |
| ENSG00000198026 | ZNF335, Zinc finger protein 335 | X | -- | -- | Microcephaly, #615095 |
| ENSG00000052850 | ALX4, ALX homeobox 4 | + | -- | -- | Craniosynostosis 5, #615529 |
| ENSG00000066427 | ATXN3, Ataxin 3 | X | -- | U | Machado-Joseph disease, #109150 |
| ENSG00000140521 | POLG, Polymerase (DNA directed), © | X | -- | U | Mitochondrial recessive ataxia syndrome (includes SANDO and SCAE), #607459 |
| ENSG00000156876 | SASS6, SAS-6 centriolar assembly protein | + | -- | -- | Microcephaly 14, primary, autosomal recessive, #616402 |
| ENSG00000169083 | AR, Androgen receptor | X | -- | U | Spinal and bulbar muscular atrophy of Kennedy, #313200 |
| Cytoskeletal protein (PC00085) | |||||
| ENSG00000066279 | ASPM, abnormal spindle microtubule assembly | X | X | -- | Microcephaly 5, #608716 |
| ENSG00000131018 | SYNE1, Spectrin repeat containing nuclear envelope 1 | X | -- | U | Emery-Dreifuss muscular dystrophy 4, autosomal dominant, #612998; |
| Spinocerebellar ataxia, autosomal recessive 8, #610743 | |||||
| ENSG00000151914 | DST, Dystonin | X | -- | -- | Neuropathy, hereditary sensory and autonomic, type VI, #614653 |
| ENSG00000008056 | SYN1, synapsin I | X | -- | -- | X-linked epilepsy, #300491 |
| ENSG00000133454 | MYO18B, Myosin XVIIIB | X | -- | -- | Klippel-Feil syndrome 4, autosomal recessive, with myopathy, #616549 |
| ENSG00000166813 | KIF7, Kinesin family member 7 | X | -- | -- | Joubert syndrome, etc., #200990 |
| ENSG00000178209 | PLEC, Plectin | X | -- | CM | various muscular dystrophy, #226670, #613723 |
| ENSG00000277586 | NEFL, Neurofilament light polypeptide | -- | X | U | Charcot-Marie-Tooth disease, #607734, #607684 |
| ENSG00000054654 | SYNE2, Spectrin repeat containing, nuclear envelope 2 | + | -- | CM | Emery-Dreifuss muscular dystrophy 5, #612999 |
| ENSG00000100345 | MYH9, Myosin, heavy chain 9, non-muscle | + | -- | -- | Macrothrombocytopenia and progressive sensorineural deafness, #600208 |
| ENSG00000138778 | CENPE, Centromere protein E, 312 kDa | + | -- | -- | Microcephaly 13, primary, autosomal recessive, #616051 |
| ENSG00000198947 | DMD, Dystrophin | + | -- | CM | Becker muscular dystrophy, #300376; Duchenne muscular dystrophy, #310200 |
| Transporter (PC00227) | |||||
| ENSG00000157388 | CACNA1D, Ca channel, voltage-depdt., L type, 〈 1D subunit | -- | X | -- | Primary aldosteronism, #615474 |
| Sinoatrial node dysfunction and deafness, #614896 | |||||
| ENSG00000198734 | F5, Coagulation factor V (proaccelerin, labile factor) | -- | X | -- | Suscept. to stroke, #601367 |
| ENSG00000007314 | SCN4A, Na channel, voltage gated, type IV 〈 subunit | -- | X | -- | Hyperkalemic periodic paralysis, type 2, #170500, #613345; |
| Myotonia congenita, atypical, acetazolamide-responsive, #608390; Paramyotonia congenita, #168300 | |||||
| ENSG00000036828 | CASR, Calcium-sensing receptor | X | -- | -- | Idiopathic generalized epilepsy, #612899 |
| ENSG00000141837 | CACNA1A, Ca channel, voltage-depdt., P/Q type, 〈 1A subunit | X | -- | U | Spinocerebellar ataxia 6, #183086; |
| Migraine, familial hemiplegic, 1, with progressive cerebellar ataxia, #141500; | |||||
| Episodic ataxia, type 2, #108500; | |||||
| ENSG00000164588 | HCN1, Hyperpolariztn. activated cyclic-nucleotide-gated K channel 1 | X | -- | -- | Epileptic encephalopathy, early infantile, 24, #615871 |
| Others | |||||
| ENSG00000116001 | TIA1, TIA1 cytotoxic granule-associated RNA binding protein | + | X | CM | Welander distal myopathy, #604454 |
| ENSG00000157212 | PAXIP1, PAX interacting protein 1 | X | X | CN | Alzheimer disease, susceptibility to, #104300 |
| ENSG00000162928 | PEX13, Peroxisomal biogenesis factor 13 | -- | X | -- | Peroxisome biogenesis disorder 11A (Zellweger), #614883 |
| ENSG00000197386 | HTT, Huntingtin | X | X | U | Huntington disease, #143100 |
| ENSG00000204120 | GIGYF2, GRB10 interacting GYF protein 2 | X | -- | U | Parkinson disease 11, #607688 |
| ENSG00000103995 | CEP152, Centrosomal protein 152 kDa | X | -- | -- | Microcephaly 9, primary, autosomal recessive, #614852 |
| ENSG00000114354 | TFG, TRK-fused gene | X | X | U | Hereditary motor and sensory neuropathy, Okinawa type, #604484; |
| Spastic paraplegia 57, autosomal recessive, #615658 | |||||
| ENSG00000120948 | TARDBP, TAR DNA binding protein | -- | X | U | Amyotrophic lateral sclerosis 10, with or without FTD, #612069; |
| Frontotemporal lobar degeneration, TARDBP-related, #612069 | |||||
| ENSG00000122566 | HNRNPA2B1, Heterogen. nuclear ribonucleoprotein A2B1d | -- | X | CNd | Inclusion body myopathy with early-onset Paget disease, #615422 |
| ENSG00000135486 | HNRNPA1, Heterogeneous nuclear ribonucleoprotein A1d | -- | X | Ud | Amyotrophic lateral sclerosis 20, #615426; |
| Inclusion body myopathy wtih early-onset Paget disease without frontotemporal dementia 3, #615424 | |||||
| ENSG00000136352 | NKX2-1, NK2 homeobox 1 | X | -- | -- | Chorea, hereditary benign, #118700; |
| Choreoathetosis, hypothyroidism, and neonatal respiratory distress, #610978 | |||||
| ENSG00000145868 | FBXO38, F-box protein 38 | X | -- | U | Neuronopathy, distal hereditary motor, type IID, #615575 |
| ENSG00000152795 | HNRNPDL, Heterogeneous nuclear ribonucleoprotein D-like | -- | X | CM | Limb-girdle muscular dystrophy, type 1G, #609115 |
| ENSG00000154118 | JPH3, Junctophilin 3 | -- | X | CN | Huntington disease-like 2, #606438 |
| ENSG00000168000 | BSCL2, Berardinelli-Seip congenital lipodystrophy 2 (seipin) | -- | X | U | Encephalopathy, progressive, with or without lipodystrophy, #615924; |
| Neuropathy, distal hereditary motor, type VA, #600794; | |||||
| Silver spastic paraplegia syndrome, #270685 | |||||
| ENSG00000186472 | PCLO, Piccolo presynaptic cytomatrix protein | X | -- | U | Pontocerebellar hypoplasia, type 3, #608027 |
| ENSG00000269335 | IKBKG, Inhibitor of | light polypeptide B-cell gene enhancer, kinase © | X | -- | -- | Incontinentia pigmenti, #308300 |
| ENSG00000027075 | PRKCH, Protein kinase C, eta | -- | X | -- | Cerebral infarction, susceptibility to, #601367 |
| ENSG00000060237 | WNK1, WNK lysine deficient protein kinase 1 | X | X | U | Neuropathy, hereditary sensory and autonomic, type II, #201300 |
| ENSG00000074047 | GLI2, GLI family zinc finger 2 | + | -- | -- | Holoprosencephaly-9, #610829 |
| ENSG00000111676 | ATN1, Atrophin 1 | X | X | U | Dentatorubro-pallidoluysian atrophy, #125370 |
| ENSG00000124788 | ATXN1, Ataxin 1 | X | X | U | Spinocerebellar ataxia 1, #164400 |
| ENSG00000127838 | PNKD, Paroxysmal nonkinesigenic dyskinesia | -- | X | -- | Paroxysmal nonkinesigenic dyskinesia, #118800 |
| ENSG00000148356 | LRSAM1, Leucine rich repeat and sterile motif containing 1 | X | -- | U | Charcot-Marie-Tooth disease, axonal, type 2P, #614436 |
| ENSG00000163635 | ATXN7, Ataxin7 | X | -- | U | Spinocerebellar ataxia 7, #164500 |
| ENSG00000164342 | TLR3, Toll-like receptor 3 | + | -- | -- | Herpes simplex encephalitis, susceptibility to, 2 #613002 |
| ENSG00000188021 | UBQLN2, Ubiquilin 2 | + | -- | U | Amyotrophic lateral sclerosis 15, with or without frontotemporal dementia, #300857 |
aThe genes are grouped according to the three most common PANTHER protein classes [48]. In making these listings, the four organisms with very high rates of yeast-prion-like proteins (see Table 1) are not considered
bNQPs are N/Q-rich proteins as defined. In the ‘NQP?’ column, genes which encode an NQP with LPS P-value <1e-10 are labelled with an ‘X’ , those with P-value otherwise <1e-08 are labelled with a ‘+’ sign. If the gene has an algorithmic prion prediction, it is labelled with an ‘X’ in the ‘Prion Prediction?’ column
cGenes that are in the UniProt neurodegenerative list are labelled ‘U’. To other labels arise from curation of the scientific literature: those that were determined as neurodegeneration-linked genes are labelled ‘CN’ , whereas those that are specifically linked to muscular degeneration are labelled ‘CM’
dThese two were identified as linked to neurodegenerative illnesses after a biased screen for proteins with prion-like domains. However, they have previously identified links to neurological illnesses. Removal of these two cases found in such screens (hnRNPA2B1 and hnRNPA1, which are linked to ALS and other muscular degeneration [25]) does not affect the calculations. Also, these two genes have previously discovered genetic linkages to neurodegenerative disorders [50, 51]
Tissue-specifically expressed subsets of genes in relevant human tissuesa
| Protein sets tested and expression sets used (columns) Tissue types (rows) | NQ-rich proteinsb | Prion predictions by PLAACb | Prion predictions by PAPAb | DISORDERED PROTEINS (>70 % set)b | ||||
|---|---|---|---|---|---|---|---|---|
| GXA Illumina body map expression library | Specond | GXA Illumina body map expression library | Specond | GXA Illumina body map expression library | Specond | GXA Illumina body map expression library | Specond | |
| Brain | 38/1966 (1.9 %, N.S.) | 9/503 (1.8 %, N.S.) | 10/1966 (0.5 %, N.S.) | 3/503 (0.6 %, N.S.) | 25/1966 (1.3 %, N.S.) | 8/503 (1.6 %, N.S.) | 127/1966 (6.5 %, N.S.) | 42/503 (8.4 %, N.S.) |
| Spinal Cord | N/A | 3/262 (1.1 %, N.S.) | N/A | 2/262 (1.1 %, N.S.) | N/A | 3/262 (1.1 %, N.S.) | N/A |
|
| Skeletal Muscle |
| 5/270 (1.9 %, N.S.) | 6/753 (0.8 %, N.S.) | 3/270 (1.1 %, N.S.) | 4/753 (0.5 %, N.S.) | 4/270 (1.5 %, N.S.) | 52/752 (7.0 %, N.S.) | 19/270 (7.0 %, N.S.) |
aEnrichments are in bold type. For each is listed the fraction, with the percentage and hypergeometric P-value in brackets (N.S. = not significant)
bFor NQ-rich proteins, the LPS threshold used is 1e-10 (overall fraction is 414/22699, 1.8 %). The PLAAC and PAPA predictions use default values (overall fractions: 136 / 22699 (0.6 %) PLAAC; 279 / 22699 (1.2 %) PAPA). In all cases, human protein composition is used for these annotations. The disordered proteins are annotated as described in Methods