Literature DB >> 20802085

Construction of a large extracellular protein interaction network and its resolution by spatiotemporal expression profiling.

Stephen Martin1, Christian Söllner, Varodom Charoensawan, Boris Adryan, Bernard Thisse, Christine Thisse, Sarah Teichmann, Gavin J Wright.   

Abstract

Extracellular interactions involving both secreted and membrane-tethered receptor proteins are essential to initiate signaling pathways that orchestrate cellular behaviors within biological systems. Because of the biochemical properties of these proteins and their interactions, identifying novel extracellular interactions remains experimentally challenging. To address this, we have recently developed an assay, AVEXIS (avidity-based extracellular interaction screen) to detect low affinity extracellular interactions on a large scale and have begun to construct interaction networks between zebrafish receptors belonging to the immunoglobulin and leucine-rich repeat protein families to identify novel signaling pathways important for early development. Here, we expanded our zebrafish protein library to include other domain families and many more secreted proteins and performed our largest screen to date totaling 16,544 potential unique interactions. We report 111 interactions of which 96 are novel and include the first documented extracellular ligands for 15 proteins. By including 77 interactions from previous screens, we assembled an expanded network of 188 extracellular interactions between 92 proteins and used it to show that secreted proteins have twice as many interaction partners as membrane-tethered receptors and that the connectivity of the extracellular network behaves as a power law. To try to understand the functional role of these interactions, we determined new expression patterns for 164 genes within our clone library by using whole embryo in situ hybridization at five key stages of zebrafish embryonic development. These expression data were integrated with the binding network to reveal where each interaction was likely to function within the embryo and were used to resolve the static interaction network into dynamic tissue- and stage-specific subnetworks within the developing zebrafish embryo. All these data were organized into a freely accessible on-line database called ARNIE (AVEXIS Receptor Network with Integrated Expression; www.sanger.ac.uk/arnie) and provide a valuable resource of new extracellular signaling interactions for developmental biology.

Entities:  

Mesh:

Substances:

Year:  2010        PMID: 20802085      PMCID: PMC3101854          DOI: 10.1074/mcp.M110.004119

Source DB:  PubMed          Journal:  Mol Cell Proteomics        ISSN: 1535-9476            Impact factor:   5.911


The individual cells within a multicellular organism communicate with each other to provide coordinated and appropriate cellular responses that ensure the normal development and maintenance of the organism as a whole. Frequently, these intercellular dialogues are initiated by specific binding events mediated by cell surface receptor glycoproteins, the molecular bridges through which cells receive information from their immediate environment and subsequently relay it to cytoplasmic signaling networks. Despite their importance in many different biological contexts, identifying novel extracellular protein interactions remains technically challenging because membrane proteins are difficult to biochemically manipulate, and their interactions are typified by extremely low interaction strengths (1, 2). Several scalable methods to identify this class of interactions have been developed that account for some or all of these challenges: they include approaches based on protein complementation (3), phage display (4), mass spectrometry (5), surface plasmon resonance (6–9), and detection of direct binding between multivalent recombinant proteins (Refs. 10–12; for a review, see Ref. 1). Our own approach, called avidity-based extracellular interaction screen (AVEXIS) (see Fig. 1A), involves expression by mammalian cells of soluble recombinant ectodomain regions of cell surface receptor and secreted proteins as either a monobiotinylated bait or a pentamerized β-lactamase-tagged prey. The pentamerization is achieved through a peptide derived from the cartilage oligomeric matrix protein (COMP) (13) and is necessary to increase the local concentration of the ectodomain fragments to effect gains in the overall binding avidity so that even very transient interactions can be detected. We have determined the parameters of the assay and shown that it can reliably detect interactions that are very transient, having monomeric half-lives of ≤0.1 s when subsequently measured in their monomeric state by surface plasmon resonance using purified proteins (10). This assay now permits the systematic screening of thousands of binary interactions and can be used to identify novel receptor-ligand pairs that were previously difficult to detect.
Fig. 1.

Summary of AVEXIS method and overall approach to construct and resolve extracellular protein interaction networks. A, the entire ectodomains of endogenous cell surface receptors (pink) are expressed as both monomeric biotinylated baits (B) and pentamerized β-lactamase-tagged (β) preys (P). Bait proteins are immobilized in individual wells of streptavidin-coated microtiter plates and probed with a normalized prey, which, if the two proteins physically interact, is captured within the well. B, a flowchart presenting an overall summary of the work described here, including how interactions from two previous screens (Refs. 10 (Bushell 2008) and 27 (Söllner 2009)) were integrated into the larger network of interactions. C, positive interactions are detected by adding a colorimetric β-lactamase substrate, nitrocefin, which is converted from a yellow to red product. Two typical screening plates are shown illustrating a heterophilic interaction between Cadm3 and Cadm4 that is detected in both bait-prey orientations and a homophilic interaction involving Cadm3. The controls for each prey included a negative bait, the rat Cd4d3+4 protein tag alone (well G6), and a biotinylated anti-rat Cd4 monoclonal antibody (OX68), which captured the Cd4-tagged preys (well G7). Each plate also contained positive control interactions: the rat Cd200R prey was probed against rat Cd200 baits immobilized at the normalized screening threshold and at 1:500 and 1:1,000 dilutions (wells G8–G10, respectively) and against the negative Cd4d3+4 bait (well G12). An additional negative bait (Fgfr1b) was included for both preys (wells G5 and G11). SLRPs, small leucine-rich proteoglycans.

To identify novel extracellular receptor-ligand interactions that initiate signaling pathways important for vertebrate development, we have selected two protein families that constitute a significant proportion of the extracellular proteome: the immunoglobulin superfamily (IgSF) (14) and leucine-rich repeat (LRR) (15) families. These two families, which both contain membrane-tethered receptors and secreted proteins, are disproportionately expanded in vertebrates relative to invertebrates and are therefore likely to be involved in initiating vertebrate-specific signaling processes (16). IgSF proteins are used as cell surface recognition molecules in a diverse array of tissues, including the immune and nervous systems (17). They function by forming specific receptor-ligand pairs with other IgSF receptors but also other protein domain families, including the LRR (18). Extracellular LRR domain-containing proteins can be separated into two main families: the small leucine-rich proteoglycans, which are secreted and form a major component of the extracellular matrix (19), and cell surface receptors that are involved in the development and maintenance of the nervous system (20). To elucidate the functional role of novel interactions in vertebrate development, we have chosen to screen for novel receptor-ligand pairs from the zebrafish. This popular model organism has many experimental advantages to study early developmental processes, including the amenability to forward (21–23) and reverse (24, 25) genetics approaches. Importantly, the ready accessibility of large numbers of externally developing translucent embryos and the ability to determine large scale gene expression patterns at different stages of embryonic development make whole organism spatiotemporal gene expression profiling possible (26). Using this approach, we first reported a network of 43 interactions within the zebrafish IgSF proteins (10) and subsequently a neural network of 34 interactions between membrane-tethered receptors from both the IgSF and LRR families (27). Here, we expanded our zebrafish recombinant library by 87 proteins to include many more secreted factors and proteins from other families and discovered 111 new interactions. By combining them with interactions identified in our previous screens, we compiled an expanded static network containing 188 interactions that we analyzed to identify properties of extracellular interaction networks. To begin the functional validation of identified interactions, we determined the developmental expression patterns of all genes encoding proteins within our library and describe here 164 new gene expression patterns, many of which are highly dynamic and tissue-restricted. By integrating the spatiotemporal expression patterns with the interaction network, we were able to determine in which stages and tissues each interaction was likely to function and resolved the aggregate interaction network into stage- and tissue-specific subnetworks. The work described in this study is summarized schematically in Fig. 1B. To facilitate navigation of these data, they were organized into a freely accessible on-line database that provides a valuable and accessible resource of new extracellular signaling interactions for developmental biology.

EXPERIMENTAL PROCEDURES

Large Scale Extracellular Interaction Screening and Analysis

Full details for expression construct generation, protein production, and interaction screening are described elsewhere (10). Briefly, genes encoding IgSF or LRR domains were identified in the zebrafish genome sequence, amplified by RT-PCR, and cloned using the topoisomerase cloning system (Invitrogen) or, where available, were purchased as clones from the Integrated Molecular Analysis of Genomes (IMAGE) consortium. All clones were fully sequenced and submitted to GenBankTM (see supplemental Table 1). The predicted ectodomain regions, including the native signal peptide of cloned genes and of those provided by other researchers, were amplified by PCR using oligonucleotides that contain flanking NotI and AscI rare cutting restriction enzymes. The ectodomain fragments were fully sequenced and subcloned into both bait and prey expression vectors to produce rat Cd4d3+4-tagged soluble proteins. Baits contain a C-terminal BirA substrate and are monobiotinylated during expression by cotransfecting with a modified BirA expression plasmid. Preys contain a C-terminal COMP pentamerization domain N-terminal to the β-lactamase enzyme. All baits and preys were expressed by transient transfection of HEK293E cells, harvested after 6 days, filtered, dialyzed (for the baits only), and stored at 4 °C after addition of 10 mm NaN3. Bait and prey proteins were normalized to threshold levels as described previously (10, 27). Protein pairs producing positive binding results from the first round of screening were produced as fresh samples and retested in an independent validation screen. The protein interactions from this publication have been submitted to the International Molecular Exchange Consortium (IMEx) (http://imex.sf.net) through IntAct (28) and assigned the identifier IM-11669; they are fully detailed in Table I. The extracellular protein interaction network was visualized using Cytoscape (29). Networks were analyzed and compared using igraph (30) within the statistical application R (31).
Table I

Summary of all interactions detected in this study

Interactions are numbered and separated into homophilic and heterophilic interactions. They are listed with their official ZFIN protein nomenclature and IntAct accession numbers for both bait-prey orientations where applicable. Interactions were considered to be of high confidence if they were positive in the primary screen and could be detected in both bait-prey orientations in either the primary or validation screens (classes A, C, D, E, and F). Interactions were classified according to Bushell et al. (10).

High Throughput Spatiotemporal Expression and Analysis

Genes within our clone library (now containing 249 different proteins) for which there was no representative image in the ZFIN expression database were identified by using BLAST against this database; in total, 164 genes had no previously documented expression pattern. Antisense probe templates were made by amplifying the entire extracellular region used in the interaction screen. Probe synthesis, in situ hybridization, and image capture were performed as described (26). All images were annotated using Open Biomedical Ontology-compliant terms from the “zebrafish anatomy and development” ontology (Biomedical Ontologies Foundry; http://www.obofoundry.org) as of February 2008 and have been deposited in the ZFIN database (www.zfin.org). The anatomical gene expression pattern descriptions for the 92 genes encoding proteins within the interaction network were retrieved from ZFIN, yielding 1,433 partially redundant spatiotemporal descriptions. Expression was summarized by classifying gene expression into five periods (gastrula (5.25–10 h postfertilization (hpf)), segmentation (10–24 hpf), pharyngula (24–48 hpf), hatching (48–72 hpf), and larval (72–120 hpf)), and spatially into 10 major systems (nervous and sensory, endocrine, digestive, liver, renal, skeletal, musculature, cardiovascular, hematopoietic, and immune). Because expression was never observed in any reproductive or respiratory tissue, these terms were removed from further analysis; expression networks were visualized using Cytoscape (29). Zebrafish were handled in strict accordance with good animal practice as defined by the relevant national and local animal welfare bodies.

Protein Homology

Two or more proteins were grouped into the same paralogous family if identified by Ensembl Compara (32) or if they have sequence similarity greater than 50%. A neighbor-joining tree was built for the 92 proteins in the network using ClustalW2 (33) to illustrate the phylogenetic relationship between protein sequences. Orthologous proteins in other metazoan species were obtained from Ensembl Compara (species name, Ensembl release): Caenorhabditis elegans, 37.10; Drosophila melanogaster, 37.4; Danio rerio, 47.7; Oryzias latipes, 41.1; Mus musculus, 37.34; and Homo sapiens, 43.36. Additional orthologs were detected by running OrthoMCL 1.4 (34) using the default BLAST E-value cutoff of 10−5, but the MCL inflation index was raised to 5 to ensure that only tight orthologous clusters were obtained. A table describing orthologous gene clusters in other species and the best homologous matches for each of the 92 proteins is available in the AVEXIS Receptor Network with Integrated Expression (ARNIE) database (www.sanger.ac.uk/arnie).

Connectivity Bias Assessment

The expected number of interactions between the proteins from the same and different families was calculated from 1,000 random family-shuffling experiments of the interaction network while maintaining the network node connectivity. The significance of the enrichment of heterophilic interactions occurring within the same paralogous cluster and the fraction of interactions that arose via a gene duplication event were assessed by randomizing the assigned paralogous clusters of the nodes in the network.

RESULTS

Identification of Novel Signaling Pathways Using AVEXIS

To identify novel signaling pathways that are important for vertebrate development, we expanded a library of recombinant zebrafish receptor and secreted proteins by 87 to a total of 249. We estimate that our protein library now contains ∼40% of all non-rearranging IgSF and ∼85% of the LRR families in the zebrafish genome, which accounts for around a tenth of all type I/glycosylphosphatidylinositol-linked cell surface receptors in this organism. The expression constructs for the additional proteins came from three main sources: IMAGE clones, genes amplified by RT-PCR using gene predictions from the zebrafish genome sequence, or expression clones kindly provided by other researchers (supplemental Table 1). These additional proteins were mainly members of the IgSF, but other protein families such as the small leucine-rich proteoglycans (12 proteins), Ephrins (three proteins), amyloid plaque proteins (two proteins), Netrin (one protein), and a secreted frizzled-related protein (one protein) were also included. To more closely mimic in vivo conditions of the extracellular environment and ensure that proteins were glycosylated and that disulfide bonds were formed, all proteins were expressed as soluble ectodomain fragments (including their own signal peptide) by transient transfection of a mammalian cell line. Proteins were produced as both biotinylated monomeric baits and enzyme-tagged pentameric preys and screened for direct, binary interactions using AVEXIS as described (10). Importantly, because both baits and preys were expressed at levels spanning several orders of magnitude, we normalized them to a previously determined stringent activity threshold using a “gold standard” set of quantified positive and published negative interactions within the human CD2 family that we have previously shown result in low false positive rates (10). Biotinylated baits were arrayed on streptavidin-coated 96-well microtiter plates including positive and negative controls and screened against the prey library; examples of typical screening plates are shown in Fig. 1C. The expanded protein library described here enabled a significant increase in the size of the screen to 16,544 unique interactions tested (that is, reciprocal interactions are not counted twice), more than doubling the number of interactions tested in previous studies (6,105 (10) and 7,592 (27)). Protein pairs that showed positive interactions from the first round of screening were then retested in both bait-prey orientations using independently produced and normalized samples. Summary of AVEXIS method and overall approach to construct and resolve extracellular protein interaction networks. A, the entire ectodomains of endogenous cell surface receptors (pink) are expressed as both monomeric biotinylated baits (B) and pentamerized β-lactamase-tagged (β) preys (P). Bait proteins are immobilized in individual wells of streptavidin-coated microtiter plates and probed with a normalized prey, which, if the two proteins physically interact, is captured within the well. B, a flowchart presenting an overall summary of the work described here, including how interactions from two previous screens (Refs. 10 (Bushell 2008) and 27 (Söllner 2009)) were integrated into the larger network of interactions. C, positive interactions are detected by adding a colorimetric β-lactamase substrate, nitrocefin, which is converted from a yellow to red product. Two typical screening plates are shown illustrating a heterophilic interaction between Cadm3 and Cadm4 that is detected in both bait-prey orientations and a homophilic interaction involving Cadm3. The controls for each prey included a negative bait, the rat Cd4d3+4 protein tag alone (well G6), and a biotinylated anti-rat Cd4 monoclonal antibody (OX68), which captured the Cd4-tagged preys (well G7). Each plate also contained positive control interactions: the rat Cd200R prey was probed against rat Cd200 baits immobilized at the normalized screening threshold and at 1:500 and 1:1,000 dilutions (wells G8–G10, respectively) and against the negative Cd4d3+4 bait (well G12). An additional negative bait (Fgfr1b) was included for both preys (wells G5 and G11). SLRPs, small leucine-rich proteoglycans. Screening of the 87 new proteins for interactions using the AVEXIS assay against both themselves and our existing protein library identified a total of 111 interactions, corresponding to an interaction detection frequency of 0.7% within these proteins; 56% of heterophilic interactions were detected in both bait-prey orientations (Table I). Fifteen interactions were expected from previous studies of orthologous proteins, demonstrating that the recombinant ectodomain fragments were correctly folded and retained their binding properties. These included interactions between secreted ligands and receptors, for example, the Robo-Slit (18) and Netrin-Neogenin interactions (35); homophilic interactions of the Kirrel family members (36); and heterophilic interactions between membrane-tethered receptors belonging to the Negr and Cadm families (37, 38). The large majority (86%) of detected interactions, however, were entirely novel, including 15 proteins that had no previously documented ligand. These interactions are listed in supplemental Table 2 and include Lrrc17, a negative regulator of osteoclast differentiation (39), and Ncam3 (40), which interacted with Neuroplastin, a receptor previously only known to interact homophilically and involved in hippocampal long term potentiation (41). Our systematic approach also identified novel ligands for well known receptors. For example, one striking feature of the network was the large number of interactions (17) with the vascular receptor Kdrl (VEGFR-2/Flk-1), which was one of the most highly connected nodes of the network. We found that the extracellular region of Kdrl could reproducibly interact in both bait-prey orientations with the two Robo3 splice isoforms present in the screen but not with Robo1, -2, or -4. This network represents a resource of novel extracellular receptor-ligand interactions, which are difficult to identify using other approaches.

Summary of all interactions detected in this study

Interactions are numbered and separated into homophilic and heterophilic interactions. They are listed with their official ZFIN protein nomenclature and IntAct accession numbers for both bait-prey orientations where applicable. Interactions were considered to be of high confidence if they were positive in the primary screen and could be detected in both bait-prey orientations in either the primary or validation screens (classes A, C, D, E, and F). Interactions were classified according to Bushell et al. (10).

Properties of Extracellular Protein Interaction Networks

To gain a more global understanding of extracellular protein interaction networks and their role in signaling, we constructed a larger network by including 77 interactions identified using the same method from previous non-overlapping interaction screens that screened different bait and prey combinations (10, 27). This expanded network contained 188 interactions among 92 proteins: 157 heterophilic and 31 homophilic interactions involving 59 proteins with IgSF domains, 22 with LRR domains, seven containing both IgSF and LRR, and four “other” proteins that contained other domains including EGF, laminin, A4 extra, and β amyloid precursor protein (app) domains (Fig. 2).
Fig. 2.

Large extracellular protein interaction network systematically determined by AVEXIS. The interaction network consists of 188 interactions between 92 proteins. The family to which each protein belongs and its predicted subcellular localization are indicated: red, secreted; blue, membrane-tethered; circle, IgSF; triangle, LRR; diamond, IgSF + LRR; square, other domains.

Large extracellular protein interaction network systematically determined by AVEXIS. The interaction network consists of 188 interactions between 92 proteins. The family to which each protein belongs and its predicted subcellular localization are indicated: red, secreted; blue, membrane-tethered; circle, IgSF; triangle, LRR; diamond, IgSF + LRR; square, other domains. We first considered whether secreted ligands or membrane-tethered receptors had different connectivity behaviors within the network and found that the average number of interaction partners for secreted proteins was 5.9 compared with 3.3 for receptors (p < 0.01, Wilcoxon two-sample test). The large majority (80%) of interactions made by secreted proteins involved a membrane-bound receptor, showing that most secreted ligands have multiple receptors. The expanded network enabled us to determine the characteristics of extracellular signaling networks for the first time; for example, we found that the connectivity distribution within the network behaved as a power law (supplemental Fig. S1) (42). We were also able to strengthen our previous observations that extracellular protein interaction networks are enriched for interactions between paralogous proteins because 9.6% (15 of 157) of heterophilic interactions occurred within the same paralogous cluster compared with 1.09 ± 0.8% (S.D.; p < 0.001) observed within randomized paralogous clusters. We also determined that over half (51.6%) of the interactions within the network could be explained by gene duplication of one or both of the interacting partners, a figure that is much higher than expected by chance alone (7.7 ± 2.2% (S.D.; p < 0.001)), and is a known property of other protein interaction networks (43).

Spatiotemporal Gene Expression Profiling Reveals Dynamic Expression of IgSF and LRR Genes during Early Vertebrate Development

To obtain functional information for the interactions within the network, we determined the expression profiles of all the genes within our protein library, for which no expression pattern had been previously determined, using whole-mount in situ hybridization from gastrula to larval periods of zebrafish embryonic development. In total, we determined 164 new expression patterns, which have been deposited in the publically accessible ZFIN database (44). A complete description of the principal events within these developmental periods is detailed in Kimmel et al. (45): they cover the initiation and completion of primary organogenesis (segmentation to hatching) and the development of circulation (pharyngula) and simple behaviors (larval) (Fig. 3A). To enable a summarized, systematic analysis of the expression data, the gene expression patterns were annotated using a restricted vocabulary and subsequently categorized into 10 general biological systems using the zebrafish anatomical ontology.
Fig. 3.

Genes encoding proteins within extracellular interaction network are expressed in tissue-restricted manner during early development. A, the development of the zebrafish embryo. Drawings of representative stages within each of the five main periods of zebrafish development are shown above a brief description of the main morphogenetic landmarks within each period. Drawings are taken with permission from Kimmel et al. (45). B, a summary of the tissue expression patterns of the genes encoding proteins from the interaction network. The expression patterns were annotated and placed into all the appropriate non-exclusive tissue categories. C, a summary of the temporal expression patterns of the genes encoding proteins from the interaction network. The number of genes expressed at each period of development is plotted, indicating those genes whose expression was ubiquitous (green) or restricted to particular tissues (blue).

Genes encoding proteins within extracellular interaction network are expressed in tissue-restricted manner during early development. A, the development of the zebrafish embryo. Drawings of representative stages within each of the five main periods of zebrafish development are shown above a brief description of the main morphogenetic landmarks within each period. Drawings are taken with permission from Kimmel et al. (45). B, a summary of the tissue expression patterns of the genes encoding proteins from the interaction network. The expression patterns were annotated and placed into all the appropriate non-exclusive tissue categories. C, a summary of the temporal expression patterns of the genes encoding proteins from the interaction network. The number of genes expressed at each period of development is plotted, indicating those genes whose expression was ubiquitous (green) or restricted to particular tissues (blue). We first compared the number of genes expressed in each tissue across all periods of development. This revealed that the large majority of the genes in the network were expressed within the nervous and sensory system (Fig. 3B) with almost twice as many genes expressed in this tissue compared with the next highest, the skeletal system. We then compared the expression of the genes at different developmental periods, noting whether the expression of that gene was ubiquitous or had a tissue-restricted expression pattern. We observed that the number of expressed genes gradually increased during early development, peaking at the pharyngula period (Fig. 3C), and with the exception of the gastrula stage, their expression patterns were largely restricted to particular tissues. We also observed that closely related paralogs usually had very different expression patterns, suggesting that the control of gene expression can evolve rapidly after gene duplication. This analysis showed an increased need for IgSF and LRR receptor-mediated cell surface recognition functions in the nervous and sensory system relative to other tissues as development proceeds. Overall, the analysis showed that the IgSF and LRR gene families were dynamically expressed in all tissues during embryonic development: only one gene lacked detectable expression (nitr14b), and just three genes (sc:d0647, lgi1a, and zgc:77222) were expressed constitutively (all tissues at all tested periods).

Integration of the Gene Expression Patterns with the Interaction Network Identifies Tissue Specificity of Interactions

A prerequisite for initiating signals through membrane receptor proteins is that their genes must be compatibly expressed in overlapping or adjacent cells at the same time. To identify for each interaction the tissues and developmental stages for which this criterion was met, we integrated the expression data with the interaction network by highlighting where each gene pair encoding interacting proteins was spatially and temporally co-expressed (Fig. 4). Of the 157 heterophilic interactions identified, 76% were compatible based on spatiotemporal co-expression. Interactions were clustered into those that functioned in specific tissues or throughout the whole organism and revealed that over half (63%) of the interactions were restricted to a particular system. In the vast majority (81%) of these cases, productive interactions were localized within the nervous and sensory systems, highlighting the role of these receptor families in cellular recognition events during the development of these tissues. Thirteen interactions involved genes expressed throughout the entire organism (two were constitutive, and 11 were stage-specific) and are likely to have general roles in adhesion rather than regulated intercellular communication (Fig. 4). This analysis also revealed that 30 interactions had spatially and/or temporally incompatible gene expression patterns during embryonic development. Remarkably, in all cases where the genes were spatially incompatible, at least one member of the pair was predicted to be a secreted protein for which one might not expect to see precise spatial colocalization by in situ hybridization. Likely reasons for observing incompatible gene expression include the incomplete nature of the protein library, the perdurance of a protein product where the corresponding mRNA has since degraded, and simply non-functional “leaky” expression.
Fig. 4.

Integration of spatiotemporal gene expression profiles with extracellular protein interaction network. All 188 gene pairs that encode interacting proteins are listed vertically; those encoding a secreted protein are highlighted in green. Each gene of a pair is arbitrarily assigned 1 (left column, purple) or 2 (right column, gold), and their spatiotemporal expression patterns are indicated by shading an appropriate box in the matrix, which is organized into 10 organ systems (nervous and sensory to immune), each of which is subdivided in up to five developmental stages as appropriate (G, gastrula; S, segmentation; P, pharyngula; H, hatching; L, larval). Where both genes are compatibly expressed, the box is shaded red. Interactions are first organized into those whose gene pairs are compatibly or incompatibly expressed and then divided further into functional subcategories as indicated.

Integration of spatiotemporal gene expression profiles with extracellular protein interaction network. All 188 gene pairs that encode interacting proteins are listed vertically; those encoding a secreted protein are highlighted in green. Each gene of a pair is arbitrarily assigned 1 (left column, purple) or 2 (right column, gold), and their spatiotemporal expression patterns are indicated by shading an appropriate box in the matrix, which is organized into 10 organ systems (nervous and sensory to immune), each of which is subdivided in up to five developmental stages as appropriate (G, gastrula; S, segmentation; P, pharyngula; H, hatching; L, larval). Where both genes are compatibly expressed, the box is shaded red. Interactions are first organized into those whose gene pairs are compatibly or incompatibly expressed and then divided further into functional subcategories as indicated. We next asked whether genes encoding interacting proteins were spatiotemporally co-expressed more often than one would expect by chance alone. To do this, we excluded all homophilic interactions (which always have compatible expression) and those involving at least one secreted protein (where one might not expect to see compatible spatial expression), leaving 81 heterophilic interactions of which only three have incompatible gene expression patterns. The expression for each gene within the 81 heterophilic interactions was randomized within the annotated stages and tissues. We observed that within the randomized networks the average number of compatible interactions was 55.9 ± 3.3 (S.D.; empirical p value <0.001), significantly less than the observed 78 compatible interactions, demonstrating a significant enrichment in compatible gene expression.

Spatiotemporal Expression Profiling Resolves Stage- and Tissue-specific Receptor Signaling Networks

The interaction network shown in Fig. 2 is a static aggregate of all possible IgSF- and LRR-mediated signaling networks that might occur in vivo at different developmental stages and in different tissues. To show how the extracellular network changed in different tissues during development, we used the expression data to deconvolute the static composite network into more physiologically relevant stage- and tissue-specific signaling subnetworks. As an example, we show how the receptor interaction network changes at different stages during the development of the nervous and sensory system (Fig. 5A). To provide a simplified overview of how the interaction networks change in each tissue system during development, we calculated the percentage of all interactions within each developmental period and compared them by plotting a heat map (Fig. 5B). This showed that within half the tissues, including the nervous and sensory, skeletal, and endocrine systems, the complexity of the network paralleled the relative number of genes expressed at that stage. In the remaining tissues, however, including the renal, hematopoietic, and immune systems, the interaction networks were relatively more complex at the segmentation stage.
Fig. 5.

Resolution of extracellular interaction network into stage- and tissue-specific signaling networks. A, graphs show time-resolved extracellular interaction networks at each period of development within the nervous and sensory system. The node key is as follows: blue, gene whose expression is restricted to nervous system; green, ubiquitously expressed gene at the stage considered. B, heat map showing the percentage of extracellular interaction network edges at a given developmental period relative to the others within each tissue. For comparison, the percentage of genes expressed at each period is also shown. G, gastrula; S, segmentation; P, pharyngula; H, hatching; L, larval.

Resolution of extracellular interaction network into stage- and tissue-specific signaling networks. A, graphs show time-resolved extracellular interaction networks at each period of development within the nervous and sensory system. The node key is as follows: blue, gene whose expression is restricted to nervous system; green, ubiquitously expressed gene at the stage considered. B, heat map showing the percentage of extracellular interaction network edges at a given developmental period relative to the others within each tissue. For comparison, the percentage of genes expressed at each period is also shown. G, gastrula; S, segmentation; P, pharyngula; H, hatching; L, larval.

ARNIE, an On-line Database to Navigate Extracellular Interaction Networks and Expression Patterns

By comparing the detailed spatiotemporal expression patterns of genes that encode interacting cell surface receptor and secreted proteins, hypotheses regarding the function of the interaction can be formulated and tested. To facilitate this comparison and provide easy access to the interactions and expression patterns identified in the network, we created a publically accessible on-line database called ARNIE (http://www.sanger.ac.uk/arnie). This database allows users to perform rapid side-by-side comparisons of the detailed whole mount in situ expression images, which are organized into stage- and orientation-matched pairs. ARNIE also enables users to interactively resolve the overall network into individual stage- and/or tissue-specific signaling subnetworks using drop-down menus. To extend the application of the protein interaction network to other organisms, this database also includes orthology mapping of all the proteins within the network to orthologous proteins in other popular model organisms and humans.

DISCUSSION

Interactions made by membrane-embedded receptor signaling proteins have important roles in both genetic and infectious diseases and yet remain technically challenging to identify on a large scale due to transient (monomeric half-lives <1 s) interaction strengths and the requirement for structurally critical posttranslational modifications. By using a specialized assay, AVEXIS, and a protein library containing the ectodomain fragments of receptors and secreted proteins mainly from the zebrafish IgSF and LRR families, we have systematically constructed and analyzed a network of extracellular interactions. This extends our previous studies by screening at a much larger scale and including many more secreted proteins and other protein families. By combining these networks, our aim was to identify novel signaling pathways that are important for vertebrate developmental processes. To take the first steps toward elucidating the function of these interactions, we have determined the spatiotemporal expression pattern of each gene within our protein library using whole-mount in situ hybridization during early development. By integrating the expression with the interaction network, we were able to transform the static network into a dynamic signaling map and resolve the interactions into tissue- and stage-specific functions. In the vast majority (81%) of these cases, productive interactions were localized within the nervous and sensory systems, highlighting the role of these receptor families in cellular recognition events during the development of these tissues. Future work from our laboratory will be focused toward identifying the functional role of the interactions identified from this and our previous screens using the genetic advantages of the zebrafish. The integrated interaction network and expression patterns have been organized into an on-line database and together are a useful resource of novel signaling interactions for developmental biology. All our screens have been performed at a stringent prey threshold that was initially determined using a set of quantified positive and published negative interactions within the human CD2 family (10). This stringency threshold enabled the detection of 15 known orthologous heterophilic interactions. However, the screen did not identify some known homophilic interactions, including that involving Ncam and one known orthologous heterophilic interaction (Netrin1b-Unc5b). We have previously reported that homophilic interactions constitute a class of false negatives using the AVEXIS technique most likely due to prey-prey associations (10). The inability to detect the Unc5b-Netrin1b interaction is unlikely to be due to incorrect folding of the recombinant proteins because we detect known Unc5b interactions with Flrt receptors (27, 46) and the Netrin1b-Neogenin interaction (35). The zebrafish genome contains two netrin1 paralogues, 1a and 1b, and it remains possible that because of this redundancy Netrin1b no longer interacts with Unc5b. Because Netrin1a was not sufficiently expressed, we were unable to test whether Netrin1a interacted with Unc5b (supplemental Table 1). In comparison with cytoplasmic protein interaction networks, extracellular networks are less well characterized. Largely, this is due to the fact that current popular high throughput protein interaction mapping assays such as the yeast two-hybrid and tandem affinity purification tagging approaches, which have been very successful at identifying cytoplasmic protein interactions, are generally unsuitable to detect transient extracellular interactions (47). This study, together with our previous work, represents one of the few attempts to systematically build extracellular protein interaction networks and shows that our approach can be used to screen for interactions within recombinant protein libraries containing hundreds of different proteins. Related approaches from others have focused on identifying homophilic interactions between different alternative splice isoforms of the Drosophila Dscam neural recognition molecule (12) or between extracellular matrix proteins and their receptors centered around endostatin (6). The integration of expression data with protein interaction networks has been successfully used in unicellular organisms to reveal novel context-dependent functional complexes (48, 49) and to gain insight into how gene co-expression and network topologies relate to one another (50, 51). In multicellular organisms, high throughput spatiotemporal promoter activity profiling has been used to provide additional evidence that two proteins could indeed interact in vivo or conversely show that they were unlikely to interact (52), or to explain how ubiquitously expressed proteins can have tissue-specific functions (53). The integrated interaction and expression map, which has been organized into an on-line database, can also be used to resolve which interactions could function at a particular time and place during embryonic development. In the cases where genes encoding interacting proteins were not compatibly expressed, the interaction could have a function at later stages of development, or the perdurance of one of the protein products expressed earlier might enable a productive interaction to occur. The interaction and expression data can be used to form testable hypotheses regarding the function of interactions that we have identified. During the course of our analysis, we noticed that the expression patterns of even closely related paralogous proteins were often very different. By using our expression and interaction data sets, we were able to find support for a model where the evolution of signaling pathways after gene duplication is more likely to occur by changes in the spatiotemporal expression of receptors and then by divergence in cytoplasmic signaling sequences, whereas extracellular sequences and interactions are more conserved. This analysis is reported separately (54). We have demonstrated here that by using the AVEXIS technique it is now possible to begin constructing large systematic extracellular protein interaction networks involving membrane-embedded receptor and secreted proteins, which have so far been a challenge to identify experimentally on this scale. By integrating these extracellular networks with gene distribution data, maps of the intercellular recognition processes used by multicellular organisms to coordinate their cellular behaviors can be built. Critically, these extracellular interactions will provide the means to link up the currently more extensive intracellular protein interaction networks (55) and build models of how neighboring cells within a multicellular organism respond to external stimuli.
  53 in total

Review 1.  The roles of nectins in cell adhesions: cooperation with other cell adhesion molecules and growth factor receptors.

Authors:  Toshiaki Sakisaka; Wataru Ikeda; Hisakazu Ogita; Naoyuki Fujita; Yoshimi Takai
Journal:  Curr Opin Cell Biol       Date:  2007-10-17       Impact factor: 8.382

2.  Large-scale screening for novel low-affinity extracellular protein interactions.

Authors:  K Mark Bushell; Christian Söllner; Benjamin Schuster-Boeckler; Alex Bateman; Gavin J Wright
Journal:  Genome Res       Date:  2008-02-22       Impact factor: 9.043

3.  High-resolution in situ hybridization to whole-mount zebrafish embryos.

Authors:  Christine Thisse; Bernard Thisse
Journal:  Nat Protoc       Date:  2008       Impact factor: 13.491

4.  A new way to explore the world of extracellular protein interactions.

Authors:  Christopher M Sanderson
Journal:  Genome Res       Date:  2008-04       Impact factor: 9.043

5.  New assay to detect low-affinity interactions and characterization of leukocyte receptors for collagen including leukocyte-associated Ig-like receptor-1 (LAIR-1).

Authors:  Lei Jiang; A Neil Barclay
Journal:  Eur J Immunol       Date:  2009-04       Impact factor: 5.532

6.  A vast repertoire of Dscam binding specificities arises from modular interactions of variable Ig domains.

Authors:  Woj M Wojtowicz; Wei Wu; Ingemar Andre; Bin Qian; David Baker; S Lawrence Zipursky
Journal:  Cell       Date:  2007-09-21       Impact factor: 41.582

7.  An experimentally derived confidence score for binary protein-protein interactions.

Authors:  Pascal Braun; Murat Tasan; Matija Dreze; Miriam Barrios-Rodiles; Irma Lemmens; Haiyuan Yu; Julie M Sahalie; Ryan R Murray; Luba Roncari; Anne-Sophie de Smet; Kavitha Venkatesan; Jean-François Rual; Jean Vandenhaute; Michael E Cusick; Tony Pawson; David E Hill; Jan Tavernier; Jeffrey L Wrana; Frederick P Roth; Marc Vidal
Journal:  Nat Methods       Date:  2008-12-07       Impact factor: 28.547

8.  Tissue specificity and the human protein interaction network.

Authors:  Alice Bossi; Ben Lehner
Journal:  Mol Syst Biol       Date:  2009-04-07       Impact factor: 11.429

9.  The extracellular leucine-rich repeat superfamily; a comparative survey and analysis of evolutionary relationships and expression patterns.

Authors:  Jackie Dolan; Karen Walshe; Samantha Alsbury; Karsten Hokamp; Sean O'Keeffe; Tatsuya Okafuji; Suzanne F C Miller; Guy Tear; Kevin J Mitchell
Journal:  BMC Genomics       Date:  2007-09-14       Impact factor: 3.969

10.  Ensembl 2008.

Authors:  P Flicek; B L Aken; K Beal; B Ballester; M Caccamo; Y Chen; L Clarke; G Coates; F Cunningham; T Cutts; T Down; S C Dyer; T Eyre; S Fitzgerald; J Fernandez-Banet; S Gräf; S Haider; M Hammond; R Holland; K L Howe; K Howe; N Johnson; A Jenkinson; A Kähäri; D Keefe; F Kokocinski; E Kulesha; D Lawson; I Longden; K Megy; P Meidl; B Overduin; A Parker; B Pritchard; A Prlic; S Rice; D Rios; M Schuster; I Sealy; G Slater; D Smedley; G Spudich; S Trevanion; A J Vilella; J Vogel; S White; M Wood; E Birney; T Cox; V Curwen; R Durbin; X M Fernandez-Suarez; J Herrero; T J P Hubbard; A Kasprzyk; G Proctor; J Smith; A Ureta-Vidal; S Searle
Journal:  Nucleic Acids Res       Date:  2007-11-13       Impact factor: 16.971

View more
  26 in total

Review 1.  Defining the extracellular matrix using proteomics.

Authors:  Adam Byron; Jonathan D Humphries; Martin J Humphries
Journal:  Int J Exp Pathol       Date:  2013-02-19       Impact factor: 1.925

Review 2.  Specification of synaptic connectivity by cell surface interactions.

Authors:  Joris de Wit; Anirvan Ghosh
Journal:  Nat Rev Neurosci       Date:  2015-12-10       Impact factor: 34.870

3.  mPPI: a database extension to visualize structural interactome in a one-to-many manner.

Authors:  Yekai Zhou; Hongjun Chen; Sida Li; Ming Chen
Journal:  Database (Oxford)       Date:  2021-06-22       Impact factor: 3.451

Review 4.  Neurexins, neuroligins and LRRTMs: synaptic adhesion getting fishy.

Authors:  Gavin J Wright; Philip Washbourne
Journal:  J Neurochem       Date:  2011-01-19       Impact factor: 5.372

5.  Avidity-based extracellular interaction screening (AVEXIS) for the scalable detection of low-affinity extracellular receptor-ligand interactions.

Authors:  Jason S Kerr; Gavin J Wright
Journal:  J Vis Exp       Date:  2012-03-05       Impact factor: 1.355

6.  The impact of gene expression regulation on evolution of extracellular signaling pathways.

Authors:  Varodom Charoensawan; Boris Adryan; Stephen Martin; Christian Söllner; Bernard Thisse; Christine Thisse; Gavin J Wright; Sarah A Teichmann
Journal:  Mol Cell Proteomics       Date:  2010-10-08       Impact factor: 5.911

7.  Replication and explorations of high-order epistasis using a large advanced intercross line pedigree.

Authors:  Mats Pettersson; Francois Besnier; Paul B Siegel; Orjan Carlborg
Journal:  PLoS Genet       Date:  2011-07-21       Impact factor: 5.917

8.  Jamb and jamc are essential for vertebrate myocyte fusion.

Authors:  Gareth T Powell; Gavin J Wright
Journal:  PLoS Biol       Date:  2011-12-13       Impact factor: 8.029

9.  Basigin is a receptor essential for erythrocyte invasion by Plasmodium falciparum.

Authors:  Cécile Crosnier; Leyla Y Bustamante; S Josefin Bartholdson; Amy K Bei; Michel Theron; Makoto Uchikawa; Souleymane Mboup; Omar Ndir; Dominic P Kwiatkowski; Manoj T Duraisingh; Julian C Rayner; Gavin J Wright
Journal:  Nature       Date:  2011-11-09       Impact factor: 49.962

10.  A panel of recombinant monoclonal antibodies against zebrafish neural receptors and secreted proteins suitable for wholemount immunostaining.

Authors:  Nicole Staudt; Nicole Müller-Sienerth; Alla Fane-Dremucheva; Shahnaz P Yusaf; David Millrine; Gavin J Wright
Journal:  Biochem Biophys Res Commun       Date:  2014-12-06       Impact factor: 3.575

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.