Literature DB >> 25075304

Yeast Augmented Network Analysis (YANA): a new systems approach to identify therapeutic targets for human genetic diseases.

David J Wiley¹, Ilona Juan¹, Hao Le², Xiaodong Cai³, Lisa Baumbach⁴, Christine Beattie², Gennaro D'Urso¹.

Abstract

Genetic interaction networks that underlie most human diseases are highly complex and poorly defined. Better-defined networks will allow identification of a greater number of therapeutic targets. Here we introduce our Yeast Augmented Network Analysis (YANA) approach and test it with the X-linked spinal muscular atrophy (SMA) disease gene UBA1. First, we express UBA1 and a mutant variant in fission yeast and use high-throughput methods to identify fission yeast genetic modifiers of UBA1. Second, we analyze available protein-protein interaction network databases in both fission yeast and human to construct UBA1 genetic networks. Third, from these networks we identified potential therapeutic targets for SMA. Finally, we validate one of these targets in a vertebrate (zebrafish) SMA model. This study demonstrates the power of combining synthetic and chemical genetics with a simple model system to identify human disease gene networks that can be exploited for treating human diseases.

Entities: CellLine Chemical Disease Gene Mutation Species

Year: 2014 PMID： 25075304 PMCID： PMC4097366 DOI： 10.12688/f1000research.4188.1

Source DB: PubMed Journal: F1000Res ISSN： 2046-1402

Introduction

Many disease-associated genes have been identified, yet most genetic diseases remain untreatable. One path to treatment is to develop extensive genetic networks in which human disease genes function (or dysfunction) and then target therapies to the genes identified in those networks. Eventually, an individual’s own genotype for proteins in the network may also be considered in the therapeutic options. Developing gene networks is already widely recognized as a powerful approach to identify new drug targets [1– 6]. Gene networks are based on the principle that networks contain proteins that interact (physically and/or functionally) and that these interactions govern most, if not all, cellular functions. Importantly, gene network interactions are often conserved in different organisms even though the output of the networks may differ [7]. Candidate networks can be generated from studies in model organisms and then extrapolated to human cells. One can expect that, for disease gene networks, some of the interacting genes might modulate the disease phenotype thus representing potential therapeutic targets. The direct identification of disease networks from studies in human cells is, of course, challenging because of the large number of potential proteins and incomplete knowledge of how the activity of any one protein may affect the network output. Analysis of networks would benefit from studies in a tractable model organism amenable to high throughput methods. Given that network interactions may be conserved between humans and model organisms, even while the outputs may differ, we hypothesized that we can identify human disease networks in the fission yeast Schizosaccharomyces pombe, a simple, genetically tractable model organism, coupled to existing protein-protein interaction databases. We can then transfer that network knowledge to human cells with the goal of identifying therapeutic target genes. How valid is the approach of using fission yeast networks to identify human disease gene networks? First, the proteins that control most core cellular functions are in fact evolutionarily conserved, underscoring the “deep homology” that exists between all living organisms [8, 9]. For example, in S. pombe >65% of the genome is orthologous to human ( http://orthomcl.org/orthomcl/ and http://www.pombase.org). Second, many of the cellular processes implicated in human disease, e.g. vesicular transport, protein folding, metabolism, and RNA processing, are also evolutionarily conserved and highly interconnected [10]. Importantly, the value of a simple model organism for discovering “druggable” genetic pathways has recently been demonstrated for the budding yeast Saccharomyces cerevisiae [11], and thus extending the study of human disease genes to fission yeast seems promising. Our approach ( Figure 1) involves the expression of human disease associated genes in S. pombe and then the analysis of their effect on yeast fitness (growth). We performed high-throughput synthetic genetic array (SGA) screens to identify the fission yeast genetic modifiers that alter this effect, as measured by a simple yeast growth assay. The genetic modifiers are then assembled into human disease gene networks or clusters (using protein-protein interaction datasets from both S. pombe and humans). Any modifiers or genes in the networks or clusters represent potential therapeutic targets. This unbiased high-throughput approach could be widely and rapidly applied to many different disease-associated genes at relatively low cost.

Figure 1.

YANA analysis of UBA1 genes. Numbers illustrate various steps in our analysis.

YANA analysis of UBA1 genes. Numbers illustrate various steps in our analysis.

(1) Construction of query strains by integrating wildtype and mutant variants of the UBA1 gene into pombe. (2) Crossing the query strains (mutant or wildtype) to a complete non-essential deletion collection. (3) Bioinformatic analysis to generate the top ‘gene hits’. (4) Validation of the top hits in yeast. (5) Confirmation of the top hits in zebrafish. To demonstrate the power of YANA, we considered spinal muscular atrophy, a common neurodegenerative disease affecting approximately 1/6000 births worldwide and the number one genetic cause of infantile death in the United States [12]. Most cases (>90%) are caused by deletion of SMN1, a gene encoding the survival motor neuron (SMN), a protein involved in the assembly of spliceosomal small nuclear ribonuceoproteins (snRNPs) [13]. In contrast, X-linked spinal muscular atrophy is caused by mutations in UBA1, a gene encoding the ubiquitin activating (UBA) enzyme 1, an E1 ubiquitin ligase [14, 15]. Consistent with a role of ubiquitination in spinal muscular atrophy and SMN biology, it was recently shown that ubiquitin-mediated proteolysis regulates SMN stability [16, 17]. Therefore, we systematically investigated the genetic network for UBA1 to better understand the potential connections between UBA1, its modifiers and the spinal muscular atrophy phenotype. Using YANA we identified several potential therapeutic targets and validated one of these targets in a SMA vertebrate (zebrafish) model.

Methods

Query Strain Creation

UBA1 was cloned into a pENTR/D-TOPO vector (Life Technologies, Cat # K2400-20) from cDNA (Origene, Cat. # SC320329) following the manufacturer’s protocols. The following primers were used: forward primer: 5′- CACCATGTCCAGCTCGCCGC-3′; reverse primer: 5′- TCAGCGGATGGTGTATCGGAC-3′. Genetic insertion was confirmed by sequencing ( http://sylvester.org/shared-resources/oncogenomics). UBA1 (G1617T), mut UBA1, was created by site directed mutagenesis using a QuikChange Lightning Site-Directed Mutagenesis kit (Agilent Technologies, Cat. # 210518; the detailed protocol is available in the kit). The primers used were: forward primer: 5′-GCAGCTGTGCGCCAAATTAATCCACATATCCGG-3′; reverse primer: 5′-CCGGATATGTGGATTAATTTGGCGCACAGCTGC-3′. LR Gateway reactions (Gateway Cloning Protocols: http://www.lifetechnologies.com/us/en/home/life-science/cloning/gateway-cloning/protocols.html#lr) were then performed to flip the UBA1 genes into destination vectors, to create N-terminal HA-tagged UBA1 under the control of the nmt1 promoter (LR Clonase II from Life Technologies, Cat # 11791020). The newly generated expression vectors were then integrated into an h strain (PN572) to create a wt UBA1 query strain ( h ) and a mut UBA1 query strain ( h ). All media, growth conditions, and genetic manipulations were as previously described [18].

Western Blot and Growth Curves

Strains containing wt UBA1 and mut UBA1 were grown exponentially in PMG media (Sunrise Scientific Cat. #2060, keeping the OD 595 below 0.4) or eight generations at 32°C and then induced for expression of the HA-tagged wt UBA1 and mut UBA1 by washing the cells three times with sterile water to remove thiamine. Cells were then grown exponentially for 16 hours and then lysed using a FastPrep 120 bead beater (MP Biomedical), followed by boiling in sample buffer (2x Laemmli Sample Buffer, Bio-Rad Cat. #161-0737) and then clarified by centrifugation. The expression of UBA1 was confirmed by Western Blot analysis using an anti-HA antibody at 1:2000 dilution (Covance, Cat #MMS-101P, AB_10063488) and standard procedures [19]. Growth curve analysis was completed in the presence and absence of thiamine (final concentration, 15 µM; Sigma, Cat. #T4625). Removal of thiamine was achieved by washing the cells three times with sterile water. Cells were incubated in the absence of thiamine for 22 hours under exponential growth, diluted and 120 µl added to the Tecan Infinity F200 plate readers (starting OD 595 of 0.05) for growth curve analysis with an n=5 for each sample. For validation experiments, cul3 and gsk3 were not pre-induced, while pub1 was pre-induced for 22 hours. The deletions of cul3, pub1, and gsk3 were confirmed by PCR analysis of genomic DNA from each strain, as previously described [20].

SGA screening

Query strains were grown in liquid media and then pinned to agar in a 384-format using a RoToR HDA (Singer Instruments). The query was then crossed to the S. pombe haploid deletion library (Bioneer, Version 3.0 equivalent) on SPAS media (details can be found at http://www-bcf.usc.edu/~forsburg/media.html) using a modified SGA procedure [21]. For germination, four replicates of each cross were pinned to a 1536 format on selective PMG media containing thiamine, adenine (225 mg/L, Sigma Cat. #A8751), leucine (225 mg/L, Sigma Cat. #L8912), and the antibiotic G418 (150 µM, Gold Bio Cat. #G-1418) (PAUT+G418). After three days, the plates were then pinned to both PAUT+G418 (non-inducing) and PAU+G418 (inducing) plates. The colonies growing on the PAUT+G418 plates were documented on a flatbed scanner for the next three consecutive days. The PAU+G418 plates were grown for two days, and then re-pinned to fresh PAU+G418 plates and documented over the next three consecutive days. Based on the growth characteristics of the wt UBA1 strain, the plates were then pinned to fresh PAU+G418 plates and documented for an additional three days.

Hit analysis

The documented plates were analyzed for ‘Hits’ representing a growth defect (SL) or growth suppressor (SS) using ScreenMill software [22]. The software is used to quantify colony size for each individual cross and then to normalize the quantified plates with and without replicate exclusion for each quadruplicate of the query crossed to a specific deletion strain. Data was then compared (between non-induced versus induced) and ranked in Excel (P≤0.05) (Microsoft). The orthologs were then identified based on curated data from PomBase ( www.pombase.org; build 2013-11-11-v1), OrthoMCL ( http://orthomcl.org/orthomcl/; Version 5), InParanoid8 ( http://inparanoid.sbc.su.se/cgi-bin/index.cgi; Version 8.0), and Homologene ( http://www.ncbi.nlm.nih.gov/homologene; build 67).

Bio-Informatics

The S. pombe and human ‘Hits’ were analyzed in String ( www.string-db.org) to map protein-protein interactions limited to data from experiments at the highest confidence level (0.900), and named pombe primary and human primary. The S. pombe primary was then extended by adding first degree neighbors to the original pombe ‘Hits’, keeping all interactions based on data from experiments at the highest confidence level (0.900). From the extended pombe network interaction map, new ‘Hits’ were extracted that corresponded to at least two previous ‘orphans’ (‘Hits’ that were not previously mapped) that interact through a nearest neighbor. All ‘Hits’ were then analyzed through PubMed ( http://www.ncbi.nlm.nih.gov/pubmed/) for relevance in terms of SMA, with special focus on the ‘Hits’ in the interaction maps.

Morpholino (MO) injection and drug test

Transgenic Tg(mnx1:0.6hsp70:GFP)os26 embryos that express GFP in ventrally projecting motor axons, referred to as Tg(mnx1:GFP) embryos, were used for all zebrafish experiments. Embryos were staged according to Fritz et al. [23] All fish were grown and maintained in the Ohio State University (OSU) zebrafish facility following established protocols and OSU animal welfare guidelines as stated in Dr. Beattie's animal protocol (On file at OSU: 2009A0141-R1). Tg(mnx1:GFP) embryos were injected with 4 ng smn MO at the one- to 2-cell stage to knock down Smn as previously described [24]. At 10 hours post-fertilization (hpf) injected embryos (in their chorions) were placed in Petri dishes in fish water (60 µg/ml Instant Ocean ® sea salts) containing compounds in 0.75% dimethyl sulfoxide (DMSO) or 0.75% DMSO only and incubated at 28.5°C in incubator until 28 hpf. Tg(mnx1:GFP) embryos at 28 hpf were anesthetized with tricaine (250 µg/ml, Sigma A-5040) and fixed overnight at 4°C in 4% formaldehyde/PBS. After removing embryos from fix, they were mounted on glass coverslips for observation under a Zeiss Axioplan microscope. Motor axons innervating the mid-trunk (myotomes 6–15) on both sides of the fish were scored as described [25]. The control topped embryos were treated and fixed as described above for Tg(mnx1:GFP) embryos; the topped mutants, however, were processed for znp1 antibody (Hybridoma Bank Cat#znp-1, AB_531910) labeling as previously described [26] to visualize motor axons for scoring as they did not have the Tg(mnx1:GFP) in the background.

Zebrafish western blot

Samples of zebrafish embryos injected with 4 ng smn MO at the one- to two-cell stage and uninjected embryos were treated from 10 to 28 hpf with DMSO or with DMSO and compounds as described above were collected for western blot. Samples were generated by boiling 25 identically treated embryos in 75 µl blending buffer (63 mM Tris (pH 6.8), 5 mM EDTA, 10% SDS). 10 µl (equivalent to 3 embryos or 75 µg of protein) were added to 10 µl of sample buffer (100 mM Tris (pH 6.8), 0.2% bromophenol blue, 20% glycerol, 200 mM dithiothreitol) and run on a 10% polyacrylamide gel, blotted to nitrocellulose, probed with mouse anti-Smn (1/500; MANSMA12, a gift from Dr. G.E. Morris or anti HuD (1/1000; Santa Cruz Cat#sc-28299, AB_627765) and detected by chemiluminescence of bound HRP-conjugated mouse antibody. Blots were stripped and re-probed with mouse anti-β-actin (1/1000; Santa Cruz Cat #sc-47778, AB_626632).

Results

We expressed the human wildtype UBA1 (wt UBA1) and the disease-causing variant 1617 UBA1 (mutUBA1) in S. pombe under the control of a regulatable nmt1 promoter [27]. We identified strains with stable integrated human UBA1 genes expressing equivalent levels of UBA1 proteins ( Figure 2A). Under non-inducing conditions (+ thiamine) all UBA1-containing strains had a similar growth rate to the control non- UBA1-containing wildtype strain ( Figure 2B). We then found that cells expressing human wt UBA1 (fission yeast cells grown in media lacking thiamine to induce UBA1 expression), experienced a decrease in cell growth. No growth defect was observed for yeast cells expressing mut UBA1 when compared to the control ( Figure 2B).

Figure 2.

Construction of S. pombe strains integrated with human UBA1.

Construction of S. pombe strains integrated with human UBA1.

Wildtype and mutant forms of UBA1 were integrated into fission yeast under the control of the nmt1 thiamine-repressible promoter. ( A) Western blot analysis of protein expression levels shows that both forms of UBA1 (wildtype and mutant) are expressed to equivalent levels. ( B) Expression of wild-type UBA1 inhibits S. pombe cell growth. We set out to identify yeast gene mutations that either enhance or suppress cell growth dependent upon expression of either wt or mut UBA1 (i.e., epistatic modifiers). We used automated genetics to introduce the wt and mut UBA1 genes into a deletion strain collection corresponding to more than 90% of the non-essential genes in S. pombe (ca. 80% of the complete genome). We then tested each of the >7000 unique strains expressing either wt or mut UBA1 combined with a specific gene deletion, for growth properties. Each strain was scored for significance of growth difference compared to a control, and the top hits (P<0.05) were assembled into a database ( Table 1). The categories of growth phenotypes were those that were synthetic lethal for wildtype or mutant (wt-SL, or mut-SL) or synthetically suppressed for wildtype or mutant (wt-SS, or mut-SS). We identified 173 UBA1 or mutUBA1 modifiers. Notably, 145 of the modifiers (83.8%) were orthologous to human genes, and thus are potential drug targets to modify the SMA phenotype.

Table 1.

S. pombe SGA top hits.

The tops hits for wtUBA1 SL, wtUBA1 SS, mutUBA1 SL, and mutUBA1 SS were identified by bioinformatic analysis as described in the materials and methods section.

	wtUBA1 SL	wtUBA1 SS	mutUBA1 SL	mutUBA1 SS
1	SPAC24H6.03	SPBC365.06	SPBC8D2.01	SPAC11G7.02
2	SPAC1687.15	SPAC11G7.02	SPBC6B1.10	SPCC13B11.01
3	SPAC3G9.07c	SPCC31H12.05c	SPBC216.05	SPBC2G2.03c
4	SPBC6B1.10	SPBC31F10.13c	SPCC1795.01c	SPBC31F10.13c
5	SPAPYUG7.04c	SPCC1620.14c	SPBC24C6.11	SPCC1620.14c
6	SPBC24C6.11	SPAC57A10.02	SPBC1734.08	SPAC3G6.02
7	SPBC1734.08	SPAC1556.01c	SPAC9.13c	SPAC1556.01c
8	SPAC23D3.09	SPAC30D11.13	SPAC23D3.09	SPAC821.05
9	SPAC17A5.11	SPCC970.07c	SPAC57A10.02	SPBC6B1.04
10	SPAC16A10.05c	SPAC23H4.12	SPAC16A10.05c	SPCC1919.15
11	SPCP1E11.07c	SPCC11E10.08	SPCC895.07	SPBC23E6.09
12	SPBC146.13c	SPBC21D10.12	SPCP31B10.05	SPBC21D10.11c
13	SPAC821.09	SPCC1672.06c	SPCC550.12	SPAC22H10.07
14	SPBC23E6.09	SPAC3C7.03c	SPAC23H4.12	SPCC338.08
15	SPCC1682.16	SPBC32F12.02	SPBC11C11.02	SPBP35G2.08c
16	SPBC106.01	SPCC736.04c	SPBC1289.11	SPBC29A3.14c
17	SPCC622.16c	SPBC30D10.13c	SPCC126.02c	SPAC29A4.18
18	SPAC22H10.07	SPCC737.09c	SPBC13G1.13	SPBC1921.03c
19	SPBC13G1.13	SPBC56F2.01	SPCC188.13c	SPBC2F12.11c
20	SPBC16H5.06	SPBC2F12.11c	SPCC663.12	SPAPB1E7.02c
21	SPBC29A10.05	SPBC16A3.07c	SPCC338.05c	SPAC17A5.16
22	SPBC28F2.07	SPAC5D6.05	SPAC343.11c	SPAC20H4.03c
23	SPCC1827.08c	SPAC18G6.02c	SPCC1919.03c	SPCC1259.03
24	SPAC9G1.10c	SPAC19D5.01	SPAC343.18	SPAC18G6.02c
25	SPBC83.03c	SPAC688.11	SPCC737.09c	SPAPB1E7.06c
26	SPBC342.04	SPAC227.07c	SPBC106.16	SPAC1851.04c
27	SPCC4G3.19	SPAC1851.04c	SPAC3A11.14c	SPAC13G7.06
28	SPAC9G1.05	SPBC16H5.07c	SPAC589.02c	SPBC725.11c
29	SPCC4E9.02	SPAC17A5.18c	SPCC1450.05c	SPCC1739.12
30	SPCC794.11c	SPBC725.11c	SPAC8F11.03	SPCC663.01c
31	SPBC216.06c	SPAC23C11.08	SPAC13A11.04c	SPBC21C3.02c
32	SPCC1450.05c	SPBC3B8.02	SPAC1952.07	SPAC1F7.09c
33	SPAC22A12.16	SPBC17D1.06	SPAC14C4.03	SPBC365.11
34	SPAC12B10.07	SPAC11E3.06	SPAC19G12.06c	SPAC144.04c
35	SPAC4A8.05c	SPCC663.01c	SPAC2F7.04	SPCC4F11.03c
36	SPAC1B9.02c	SPAC167.04	SPAC1006.09	SPAC4G8.03c
37	SPAC16C9.05	SPAC14C4.12c	SPAC17G6.04c	SPBP8B7.06
38	SPAC23H3.08c	SPBC2A9.06c	SPAC22H12.02	SPCC1223.05c
39	SPAC2F7.04	SPAC24H6.13	SPBC725.02	SPAC1556.05c
40	SPAC22H12.02	SPAPB1A10.03	SPAC140.01	SPBC16C6.03c
41	SPBC725.02	SPAC12G12.03	SPAC1805.04	SPBC2G2.07c
42	SPAC21E11.03c	SPCC663.03	SPBC1709.10c	SPAC31G5.12c
43	SPBC12C2.02c	SPBC1539.10	SPBC577.15c	SPAC4G9.15
44	SPBC21B10.13c	SPCC1223.05c	SPAC1420.03	SPAC25B8.05
45	SPAC1687.13c	SPCC31H12.04c	SPBC29A10.03c	SPAC1071.07c
46	SPBC3F6.05	SPCC1739.08c	SPAC4F10.04
47	SPBC4B4.03	SPAC2C4.07c	SPAPB1E7.12
48	SPAC222.16c	SPAC1142.08	SPCC320.08
49	SPAC4F10.04		SPCC830.07c
50	SPAC19B12.10		SPAC4G9.05
51	SPBC2G2.02		SPBC13G1.14c
52	SPAC1834.04		SPCC24B10.08c
53	SPAPB1E7.12		SPAC4H3.03c
54	SPCC320.08		SPAC1805.05
55	SPCC830.07c
56	SPAC4G9.05
57	SPBC13G1.14c

S. pombe SGA top hits.

The tops hits for wtUBA1 SL, wtUBA1 SS, mutUBA1 SL, and mutUBA1 SS were identified by bioinformatic analysis as described in the materials and methods section. To generate our candidate human disease gene networks (that we shall call network clusters) we relied on two complementary approaches. Both approaches rely on networks constructed from published protein-protein interaction data from either human ( Figure 3 and Figure S3) or fission yeast ( Figure 4, Figure S1 and Figure S2). By including two types of genetic interactions, synthetic lethals or synthetic suppressors, yeast augmented network analysis (YANA), yields the most complete unbiased list of potential modifiers. In a first approach, we converted all 145 S. pombe modifier genes to their corresponding human orthologs. All the data were combined to create network cluster diagrams. Individual datasets were delineated using a color key. (Note that we designate each interaction as a synthetic lethal or synthetic suppressor a priori, and include them both as it is difficult to predict which of these classes will have potential therapeutic value). Using the highest confidence (0.9) experimentally confirmed protein-protein interaction data (from www.string-db.org), we used cytoscape to draw network cluster modules that include all modifier genes. We identified 22 clusters using this approach ( Figure S3). We then limited the number of networks by focusing on those modifiers that shared at least two interactions with other modifiers shrinking the number of clusters to 10 ( Figure 3). Within our human primary network clusters, we identified several genes that represent compelling therapeutic targets. These included GSK3 (Glycogen Synthase Kinase-3), CUL3 (Cullin-3) and NEDD4 (E3 ubiquitin ligase) along with SUMO1 (Small Ubiquitin Modifier-1) and HDAC (Histon deacetylase), a known chromatin modifier. Interestingly, GSK3 inhibitors increase SMN levels in spinal muscular atrophy patient-derived fibroblasts and mouse motor neurons [28]. The HDAC inhibitor Trichostatin A increases SMN expression and survival in a mouse model of spinal muscular atrophy [29]. Therefore, evidence already exists that genes within our clusters affect SMA biology.

Figure 3.

Human UBA1 primary gene interaction network.

Human orthologs of yeast genes identified from four independent SGA screens were used to map the corresponding protein-protein interaction network. Only proteins that have experimentally verified data (STRING) supporting a physical interaction with at least two additional genes/proteins identified in the UBA1-SGA screens are displayed. Of particular interest is the discovery of NEDD4, CUL3, GSK3, SUMO and HDAC in distinct network clusters. Proteins of interest are highlighted by a larger node size and indicated with an arrow.

Figure S3.

Complete human UBA1 primary gene interaction network representing 22 clusters.

Human orthologs of yeast genes identified from four independent SGA screens were used to map the corresponding protein-protein interaction network.

Figure 4.

S. pombe extended UBA1 gene interaction network (extracted from Figure S1 on the basis of identifying new primary hit interactions through a 1° neighbor).

Figure S1.

S. pombe extended UBA1 interaction network (complete).

The full protein-protein interaction network including first-degree neighbors for all fission yeast gene modifiers identified from our two independent SGA screens.

Figure S2.

S. pombe UBA1 primary gene interaction network.

Fission yeast epistatic modifiers from all four SGA datasets were combined and used to map a protein-protein interaction network using the highest confidence protein interaction data (STRING-db).

Human UBA1 primary gene interaction network.

S. pombe extended UBA1 gene interaction network (extracted from Figure S1 on the basis of identifying new primary hit interactions through a 1° neighbor).

Primary yeast gene hits from all four SGA screens were combined and mapped based on highest confidence experimentally confirmed protein-protein interaction data. For this extended analysis, we also included first-degree neighbors of modifiers that were not directly identified in the SGA screens. Only selected gene clusters are shown here; the complete extended network is shown in Figure S1. When extrapolated to human, the yeast extended network analysis identified Csn5 and Cul3, two modifiers also identified in our human UBA1 primary gene interaction network (boxed; see also Figure 3). In our second approach, we constructed networks from just the S. pombe modifier genes using high confidence S. pombe protein-protein interaction data from STRING. For this analysis, we identified network clusters that were expanded to include all the so-called “first-degree (1°) interacting proteins”, proteins not identified in our SGA screens but that physically interact with a genetically identified modifier ( Figure S1). The rationale underlying our approach to generate larger clusters comes from the facts that the 1° neighbor may be an essential gene (and therefore it is not present in our deletion library) or may fail to be detected in our screens for other, unknown reasons. This “Cluster with 1° neighbor” analysis reveals new interactions not observed in the yeast primary UBA1 gene interaction network ( Figure S2). A few examples of these types of clusters are shown in Figure 4. The clusters include several genes all of which were uniquely identified in the wildtype screen, including both synthetic lethals and synthetic suppressors. Of particular interest is the cluster of proteins surrounding Skp1(S-phase kinase associated protein-1), a protein that interacts and stablilizes F-box proteins, additional F-box proteins, and proteins involved in cullin deneddylation and neddylation. In addition, we found that the RNA-binding protein Cwf2 interacts directly with three of our modifiers, and the plastin ortholog Fim1, interacts with two. Interestingly, Plastin 3 (PLS3) has been identified as a disease modifier in animal models of spinal muscular atrophy [30– 32]. We then identified correlations between the extended S. pombe network clusters ( Figure 4) and the human primary network clusters ( Figure 3). In both networks, we identified components of the COPS complex, a known regulator of E3 cullins, and the E3 cullin Cul3. Note that Cul3 was not present in our primary S. pombe network ( Figure S2) yet was present in the more extended network clusters that incorporate 1° neighbors; this illustrates the importance of adding 1° neighbors to our analysis of the S. pombe data. Together, these data suggest a prominent role for E3 ubiquitin ligases in modulating UBA1 and potentially SMN1, making Cul3 the primary target for further analysis. The E3 ubiquitin ligase might inhibit SMN function, as ubiquitination often leads to protein degradation. If so, we reasoned that inhibition of E3 ubiquitin ligase might enhance SMN activity and thus suppress loss of function mutations in SMN1. To directly test the hypothesis that inhibition of E3 ubiquitin ligases can suppress a SMN1-spinal muscular atrophy phenotype, we turned to a vertebrate model of SMN deficiency previously reported in zebrafish [33]. Knockdown of Smn in zebrafish embryos causes developmental defects in motor neuron axonal outgrowth that include truncations and abnormal branching of neurons [24]. Motor neuron axon defects can be corrected by injection of mRNAs encoding wildtype human SMN [25]. We first confirmed that injection of smn morpholino (MO) into wildtype zebrafish causes severe motor axon abnormalities as compared to control uninjected embryos ( Figure 5A and 5B). We then tested whether an inhibitor of E3 ubiquitin ligases (MLN4294) would suppress the motor axon abnormalities. Addition of the Nedd8-E1 activating enzyme inhibitor MLN4294 (that blocks cullin-RING E3 ligases), at concentrations ranging from 10 µM to 15 µM, caused a concentration-dependent reduction in the degree of abnormal motor axon branching ( Figure 5A and 5B). At 15 µM inhibitor we found motor axon abnormalities that were completely suppressed: defects were not significantly different to those observed in uninjected embryos (P=0.379). At higher concentrations of drug (20 µM), the rescue was less pronounced ( Table 2) probably because higher levels of drug led to defects in development. Thus, E3 ligase inhibition can rescue neuronal defects caused by Smn protein depletion in zebrafish. In control experiments, MLN4294 failed to rescue the zebrafish mutant topped ( Figure 6), a mutant defective in neuronal axon guidance [34]. We also tested a drug that inhibits sumolyation corresponding to an unrelated target (SUMO) identified in our human primary network clusters ( Figure 3); addition of this compound had no effect on the spinal muscular atrophy model ( Figure S6) necessarily surprising since pmt3, which encodes the yeast ortholog of SUMO1, has the opposite effect of cul3 when deleted. Having been identified as a synthetic suppressor rather than a synthetic lethal, perhaps compounds that activate rather than inhibit sumolyation would be beneficial as a therapeutic.

Figure 5.

MLN4294 rescues the abnormal neuron outgrowth in Smn-depleted zebrafish.

( A) Representative lateral views of motor axons in Tg( mnx1:GFP) zebrafish embryos expressing GFP in motor neurons and injected with control MO and then grown in 10 or 15 µM MLN4294. ( B) Quantification of the effects of MLN4294 on motor axon development in zebrafish. Motor axons were scored in Tg( mnx1:GFP) embryos injected with control MO, and subsequently (10 hrs post-injection) incubated in 10 or 15 µM MLN4294. Embryos were classified as severe, moderate, mild, or no defects based on the severity of motor axon defects, and the percentage of each group is shown. Data in all graphs are represented as mean and SEM. ( C) Western blot analysis of Smn and HuD protein following treatment with MLN4294. Quantification of the results is shown in Figure S5.

Table 2.

Complete dataset for zebrafish experiments.

	Control MO	smn MO	smn MO + 10 µM	smn MO + 15 µM	smn MO + 20 µM
Severe	0 (± 0)	10.1 (± 6.3)	3.5 (± 2.3)	0 (± 0)	2.8 (± 2.4)
Moderate	4.4 (± 1.7)	39.7 (± 1.2)	19.6 (± 7.2)	16.4 (± 5.6)	11.3 (± 7.1)
Mild	44.7 (± 6.4)	39 (± 7.2)	53.1 (± 5)	40.6 (± 3.4)	46.8 (± 9.6)
No Defects	50.9 (± 7.2)	11.2 (± 3.2)	23.9 (± 5.5)	43 (± 7.9)	39.1 (± 19)
n (per exp)	23/19/27/25	23/20/28/27	21/23/27/26	19/17/26/26	22/25/26

Figure 6.

Treatment of zebrafish with MLN4294 fails to rescue defects in ventral motor axon guidance in the topped mutant.

Figure S6.

Zebrafish raw data.

A) MLN4924 treatment in smn MO zebrafish. B) MLN4924 treatment in topped mutant zebrafish. C) Anacardic acid treatment in smn MO zebrafish.

MLN4294 rescues the abnormal neuron outgrowth in Smn-depleted zebrafish.

Figure S5.

Expression of Smn and HuD in zebrafish embryos.

Quantification of western blot results shown in Figure 5C. Treatment of zebrafish embryos with MLN4294 results in approximately a 3-fold increase in both Smn and HuD levels 18 hrs following treatment.

Treatment of zebrafish with MLN4294 fails to rescue defects in ventral motor axon guidance in the topped mutant.

( A) Quantification of the effects of MLN4294 on motor axon guidance in the zebrafish topped mutant. Motor axons were scored in embryos injected with DMSO, 10 or 15 µM MLN4294. Embryos were classified as severe, moderate, mild, or no defects based on the severity of motor axon defects, and the percentage of each group is shown. Data in all graphs are represented as mean and SEM. ( B) Experimental data. To determine how E3 ubiquitin ligases might influence Smn1 function, we examined the protein levels for both Smn and HuD. HuD is a known SMN interacting protein that binds RNAs controlling their translation and stability and functions in neural development and plasticity [34– 36]. We observed greater than a 2-fold increase in both Smn and Hud protein levels following treatment with MLN4294, a NEDD8 activating enzyme (NAE) inhibitor that prevents activation of E3 ligases ( Figure 5C and Figure S5). It is therefore possible that rescue of the neuronal defects in our zebrafish model might involve stabilization of Smn as well as of additional proteins within the Smn network. UBA1 was expressed in S. pombe under inducing (- thiamine) and non-inducing conditions (+ thiamine) in wildtype strains (wtUBA1) and the disease-causing variant wildtype 1617UBA1 (mutUBA1), under the control of a regulatable nmt1 promoter. Growth rate data of expression of UBA1 under inducing (- thiamine) and non-inducing conditions (+ thiamine) in wtUBA1 and mutUBA1 are shown. For each condition there are six replicates. Click here for additional data file.

Discussion

In this paper we introduce YANA, a fission yeast genetic assay that that when applied to a specific human disease gene can leverage protein-protein interaction data to characterize human disease networks. From these networks, we can identify and prioritize genetic pathways likely to modify the disease-associated gene activity, and predict those genes that can be exploited as therapeutic targets. Using UBA1, a gene associated with X-linked spinal muscular atrophy (XL-SMA), we found a high degree of homology between our yeast modifiers and their corresponding human genes (>80%), underscoring the extensibility of the assay. Specifically, we identified several network clusters and high priority targets for therapeutic intervention, including GSK3 and HDAC both of which are potential therapeutic targets for SMA. In addition, we found two novel related targets Cul3 (a cullin required for E3 ubiquitin ligase activity) and NEDD4, an E3 ubiquitin ligase. Our yeast system, with its small but complex eukaryotic genome, and complete deletion library, is unique in allowing unbiased genome-wide screening of deletions that alter human disease gene activity. Moreover, YANA can be applied to any human gene, regardless of the phenotype or availability of endogenous mutations. The number of candidate genes identified by YANA for UBA1 represents a fraction of the approximately 30,000 genes in the human genome, providing a significant enrichment of potential modifiers. Therefore, YANA offers a simple, cost-effective, and relatively rapid technology that could be applied to all human genetic diseases.

Data availability

F1000Research: Dataset 1. Growth rate data of expression of UBA1 in S. pombe, 10.5256/f1000research.4188.d28505 [37]. The Wiley et al manuscript describes a beautiful synthesis of contemporary genetic approaches to, with astonishing efficiency, identify lead compounds for therapeutic approaches to a serious human disease. I believe the importance of this paper stems from the applicability of the approach to the several thousand of rare human disease genes that Next-Gen sequencing will uncover in the next few years and the challenge we will have in figuring out the function of these genes and their resulting defects. This work presents a paradigm that can be broadly and usefully applied. In detail, the authors begin with gene responsible for X-linked spinal muscular atrophy and express both the wild-type version of that human gene as well as a mutant form of that gene in S. pombe. The conceptual leap here is that progress in genetics is driven by phenotype, and this approach involving a yeast with no spine or muscles to atrophy is nevertheless and N-dimensional detector of phenotype. The study is not without a small measure of luck in that expression of the wild-type UBA1 gene caused a slow growth phenotype which the mutant did not. Hence there was something in S. pombe that could feel the impact of this protein. Given this phenotype, the authors then went to work and using the power of the synthetic genetic array approach pioneered by Boone and colleagues made a systematic set of double mutants combining the human expressed UBA1 gene with knockout alleles of a plurality of S. pombe genes. They found well over a hundred mutations that either enhanced or suppressed the growth defect of the cells expressing UBI1. Most of these have human orthologs. My hunch is that many human genes expressed in yeast will have some comparably exploitable phenotype, and time will tell. Building on the interaction networks of S. pombe genes already established, augmenting these networks by the protein interaction networks from yeast and from human proteome studies involving these genes, and from the structure of the emerging networks, the authors deduced that an E3 ligase modulated UBA1 and made the leap that it therefore might also impact X-linked Spinal Muscular Atrophy. Here, the awesome power of the model organism community comes into the picture as there is a zebrafish model of spinal muscular atrophy. The principle of phenologs articulated by the Marcotte group inspire the recognition of the transitive logic of how phenotypes in one organism relate to phenotypes in another. With this zebrafish model, they were able to confirm that an inhibitor of E3 ligases and of the Nedd8-E1 activating suppressed the motor axon anomalies, as predicted by the effect of mutations in S. pombe on the phenotypes of the UBA1 overexpression. I believe this is an important paper to teach in intro graduate courses as it illustrates beautifully how important it is to know about and embrace the many new sources of systematic genetic information and apply them broadly. I have read this submission. I believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard. This paper addresses the vexing problem of how to unravel the complexity of human diseases. Such diseases often involve interactions between genes, and the question is therefore well-suited to an automated genetic approach, such as Synthetic Genetic Array technology, which was pioneered in bakers yeast and more recently adopted to another well-characterized model system, namely S. pombe. In this study the authors examine conserved genetic interactions between a fission yeast genes and a human query gene (both wildtype and loss-of-function mutants) then combine their experimental data with bioinformatics to characterize the network of genetic interactions and protein-protein interactions. They then leverage this network to identify interacting partners that comprise novel therapeutic targets. The example presented in this paper is the human UBA1 gene which encodes a gene that, when mutated, results in human X-linked spinal muscular atrophy disease. They perform whole genome genetic interaction mapping for digenetic interactions with both wild type and mutant versions of the gene, build a network with the observed interactions, and then expand that primary network by adding the known partners that interact with members of the primary network. They then turned to the zebrafish vertebrate model system to validate their newly identified therapeutic targets. This study adds another tool to the armamentarium of systems biology approaches with which the complexity of human diseases can be dissected. Because the system is new in the sense that fission yeast has been largely unexplored (compared to its budding yeast relative from which it has diverged by several hundred million years) the observations on fission yeast, when combined with those from other model systems, provide an attractive complementary approach to understanding complex human biology. Given that this study builds on earlier approaches, the authors should cite the previously published studies on genetic interactions, chemical genetic interactions and protein-protein interactions using budding yeast, fly and other models. Similarly, the comprehensive gene expression profiling approaches used in cultured mammalian cells as well as more recent whole genome RNA interference and CRISPR-based disruption screens should be cited. From a methodological perspective, in the interaction maps presented in Figures 3 and 4 it would be useful to describe or cite the methods used to define the links between genes. In the Results, where the authors describe the results of the different screens which include wild type and mutant human protein interactions as well as primary and secondary interactions it would be instructive to provide a graphic or flow chart that depicts how all the key pieces of data from each independent screen in fission yeast (and their analysis) were collapsed and presented together. It would also be instructive to provide background information or appropriate citations on the two compounds used in the zebrafish experiments, specifically their known or suspected mechanisms of action as well as the data to support these mechanisms. In the Discussion, the authors have room to put the YANA screening platform into context, comparing it to other orthologous genome-wide approaches. A key innovation in this study arises from the unique approach used in analyzing the data. Specifically, the conserved yeast genes that modify the phenotype induced by expressing a human gene of interest are assembled into human disease gene networks which are then augmented by additional protein-protein interaction data. Augmenting traditional network analysis in this manner may permit the identification of new therapeutic targets and target pathways. In summary, this comprehensive, genome-wide work represents an important addition to the systems biology toolkit to understand the complexities of human biology and the impact of this and subsequent data sets will surely increase as additional screens are added to the compendium. Minor points Western blot and growth curve: "or eight generations" is a typo, also the type of media the cells were grown in “exponentially for 16 hours” should be listed. In Figure 2 it is not clear which band the tubulin arrow is pointing to- the loading control bands should be indicated directly on the blot/gel. Also, it is apparent from this loading control that the amount of UPA1 expressed is extremely high, the authors should comment on the fact that the overexpressing lanes are actually UNDERloaded relative to the non-expressing lanes. Third paragraph of the Results: the software programme cytoscape should include a reference. Third paragraph of the Results: histone is misspelled " histon". Sixth paragraph of the Results: the authors should clearly distinguish between the published data in zebrafish and their new data collected in this study Sixth paragraph of the Results: as mentioned be inhibitor’s mechanism of action should be described in more detail Second paragraph of the discussion: approximately 30,000 genes in the human genome should be cited and probably corrected to the conventional number of 21,000. I have read this submission. I believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

36 in total

1. Zebrafish topped is required for ventral motor axon guidance.

Authors: Louise R Rodino-Klapac; Christine E Beattie
Journal: Dev Biol Date: 2004-09-15 Impact factor: 3.582

2. Network-based target ranking for polypharmacological therapies.

Authors: Francesca Vitali; Francesca Mulas; Pietro Marini; Riccardo Bellazzi
Journal: AMIA Jt Summits Transl Sci Proc Date: 2013-03-18

3. Survival motor neuron function in motor axons is independent of functions required for small nuclear ribonucleoprotein biogenesis.

Authors: Tessa L Carrel; Michelle L McWhorter; Eileen Workman; Honglai Zhang; Elizabeth C Wolstencroft; Christian Lorson; Gary J Bassell; Arthur H M Burghes; Christine E Beattie
Journal: J Neurosci Date: 2006-10-25 Impact factor: 6.167

4. Cleavage of structural proteins during the assembly of the head of bacteriophage T4.

Authors: U K Laemmli
Journal: Nature Date: 1970-08-15 Impact factor: 49.962

Review 5. Spinal muscular atrophy: the role of SMN in axonal mRNA regulation.

Authors: Claudia Fallini; Gary J Bassell; Wilfried Rossoll
Journal: Brain Res Date: 2012-01-28 Impact factor: 3.252

6. Survival motor neuron affects plastin 3 protein levels leading to motor defects.

Authors: Le T Hao; Marc Wolman; Michael Granato; Christine E Beattie
Journal: J Neurosci Date: 2012-04-11 Impact factor: 6.167

7. Temporal requirement for SMN in motoneuron development.

Authors: Le T Hao; Phan Q Duy; James D Jontes; Marc Wolman; Michael Granato; Christine E Beattie
Journal: Hum Mol Genet Date: 2013-03-03 Impact factor: 6.150

8. Network modeling links breast cancer susceptibility and centrosome dysfunction.

Authors: Miguel Angel Pujana; Jing-Dong J Han; Lea M Starita; Kristen N Stevens; Muneesh Tewari; Jin Sook Ahn; Gad Rennert; Víctor Moreno; Tomas Kirchhoff; Bert Gold; Volker Assmann; Wael M Elshamy; Jean-François Rual; Douglas Levine; Laura S Rozek; Rebecca S Gelman; Kristin C Gunsalus; Roger A Greenberg; Bijan Sobhian; Nicolas Bertin; Kavitha Venkatesan; Nono Ayivi-Guedehoussou; Xavier Solé; Pilar Hernández; Conxi Lázaro; Katherine L Nathanson; Barbara L Weber; Michael E Cusick; David E Hill; Kenneth Offit; David M Livingston; Stephen B Gruber; Jeffrey D Parvin; Marc Vidal
Journal: Nat Genet Date: 2007-10-07 Impact factor: 38.330

9. Rare missense and synonymous variants in UBE1 are associated with X-linked infantile spinal muscular atrophy.

Authors: Juliane Ramser; Mary Ellen Ahearn; Claus Lenski; Kemal O Yariz; Heide Hellebrand; Michael von Rhein; Robin D Clark; Rita K Schmutzler; Peter Lichtner; Eric P Hoffman; Alfons Meindl; Lisa Baumbach-Reardon
Journal: Am J Hum Genet Date: 2008-01 Impact factor: 11.025

10. Trichostatin A increases SMN expression and survival in a mouse model of spinal muscular atrophy.

Authors: Amy M Avila; Barrington G Burnett; Addis A Taye; Francesca Gabanella; Melanie A Knight; Parvana Hartenstein; Ziga Cizman; Nicholas A Di Prospero; Livio Pellizzoni; Kenneth H Fischbeck; Charlotte J Sumner
Journal: J Clin Invest Date: 2007-02-22 Impact factor: 14.808

5 in total

Review 1. SCF ubiquitin ligase-targeted therapies.

Authors: Jeffrey R Skaar; Julia K Pagan; Michele Pagano
Journal: Nat Rev Drug Discov Date: 2014-11-14 Impact factor: 84.694

Review 2. Applications of comparative evolution to human disease genetics.

Authors: Claire D McWhite; Benjamin J Liebeskind; Edward M Marcotte
Journal: Curr Opin Genet Dev Date: 2015-09-04 Impact factor: 5.578

3. Posttranslational Arginylation Enzyme Arginyltransferase1 Shows Genetic Interactions With Specific Cellular Pathways in vivo.

Authors: David J Wiley; Gennaro D'Urso; Fangliang Zhang
Journal: Front Physiol Date: 2020-05-06 Impact factor: 4.566

Review 4. Function Over Form: Modeling Groups of Inherited Neurological Conditions in Zebrafish.

Authors: Robert A Kozol; Alexander J Abrams; David M James; Elena Buglo; Qing Yan; Julia E Dallman
Journal: Front Mol Neurosci Date: 2016-07-07 Impact factor: 5.639

5. Identification of an oncogenic network with prognostic and therapeutic value in prostate cancer.

Authors: Fiorella Magani; Eric R Bray; Maria J Martinez; Ning Zhao; Valeria A Copello; Laine Heidman; Stephanie O Peacock; David J Wiley; Gennaro D'Urso; Kerry L Burnstein
Journal: Mol Syst Biol Date: 2018-08-14 Impact factor: 11.429

5 in total