| Literature DB >> 22913808 |
Matthew S Creamer1, Edward C Stites, Meraj Aziz, James A Cahill, Chin Wee Tan, Michael E Berens, Haiyong Han, Kimberley J Bussey, Daniel D Von Hoff, William S Hlavacek, Richard G Posner.
Abstract
BACKGROUND: Mathematical/computational models are needed to understand cell signaling networks, which are complex. Signaling proteins contain multiple functional components and multiple sites of post-translational modification. The multiplicity of components and sites of modification ensures that interactions among signaling proteins have the potential to generate myriad protein complexes and post-translational modification states. As a result, the number of chemical species that can be populated in a cell signaling network, and hence the number of equations in an ordinary differential equation model required to capture the dynamics of these species, is prohibitively large. To overcome this problem, the rule-based modeling approach has been developed for representing interactions within signaling networks efficiently and compactly through coarse-graining of the chemical kinetics of molecular interactions.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22913808 PMCID: PMC3485121 DOI: 10.1186/1752-0509-6-107
Source DB: PubMed Journal: BMC Syst Biol ISSN: 1752-0509
Figure 1Combinatorial complexity is a feature of ERBB receptor signaling. (A) Diagram depicting a selected subset of interactions of epidermal growth factor (EGF), EGF receptor (EGFR), heregulin (HRG), ERBB3, GRB2, SOS1, GAB1, SHC1, and PI3K considered in the model. These binding partners of EGFR and ERBB3 are documented in NetPath [50] and HPRD [51]. This diagram also depicts substrates of the EGFR kinase domain (six tyrosine residues in EGFR and seven tyrosine residues in ERBB3). These sites of phosphorylation in EGFR and ERBB3 are documented in HPRD [51] and Phospho. ELM [52]. Next to each component of EGFR and ERBB3, the number of possible component states is indicated. These counts are based only on the proteins, sites of phosphorylation, and interactions depicted in this diagram. Note that additional interactions are considered in the model (cf. Figure 2). For example, in this diagram, we do not consider phosphorylation of SOS1 and GAB1. Ligand binding sites have two possible states (see below). Docking sites for SHC1 (blue rectangles) have eight possible states (see below). Docking sites for GRB2 (green rectangles) have six possible states. In the model, Y1114 in EGFR (white rectangle) is a docking site for both GRB2 and SHC1. Thus, this docking site has 12 possible states. Docking sites for PI3K (cyan rectangles) have three possible states. Based on these counts of possible component states, the number of possible states for an EGFR monomer is 2·8·6·8·12·6·6 = 331,776, and the number of possible states for an ERBB3 monomer is 2·36·8 = 11,664. An EGFR: ERBB3 heterodimer has more than 3.8 × 109 states, and an EGFR homodimer has more than 5.5 × 1010 states. When we consider the additional interactions included in the model but not shown here, we find that the number of possible states for an EGFR homodimer is much greater than a googol (10100). (B) As illustrated in this panel, the ectodomain of EGFR has two possible ligand-bound states: free or bound to EGF. (C) As illustrated in this panel, a docking site in EGFR for SHC1 has eight possible states: unphosphorylated, phosphorylated, bound to unphosphorylated SHC1, bound to phosphorylated SHC1, bound to SHC1 in complex with GRB2, bound to SHC1 in complex with GRB2: SOS1, bound to SHC1 in complex with GRB2: GAB1, and bound to SHC1 in complex with a ternary complex of GRB2, SOS1, and GAB1.
Figure 2A rule-based model for ERBB receptor signaling. Boxes and nested boxes represent proteins and (sub)components of proteins. Only (sub)components considered in the model are shown. Boxes are decorated with post-translational modification flags, which are each labeled at one end and connected to a small box at the other. The prefix of the label indicates the modification (i.e., ‘p’ represents addition of a phosphate group), and the rest of the label indicates the site of modification. Compartmental locations of proteins are indicated in tabs at the lower left corners of boxes: M, membrane; C, cytoplasmic; Ex, extracellular; and En, endosomal. A line that begins and ends with an arrowhead represents a direct binding interaction. A line with a circle at one end identifies an enzyme-substrate relationship; the circle identifies the substrate. A line that ends with an open, triangular arrowhead represents activation. Numbers next to arrows refer to sets of rules, which are identified in an accompanying model guide. See Methods for additional details.
Ranges considered for six classes of model parameters
| Rate constant for a bimolecular association reaction | 10-7 – 10-5 | (molecules/cell)-1·s-1 |
| Rate constant for a unimolecular dissociation reaction | 10-2 – 100 | s-1 |
| Rate constant for a phosphatase-catalyzed reaction* | 10-3 – 10-1 | s-1 |
| Rate constant for a receptor trafficking step (internalization or recycling)* | 10-3 – 10-1 | s-1 |
| Rate constant for endocytic degradation* | 10-3 – 10-1 | s-1 |
| Protein copy number | 104 – 106 | molecules/cell |
Processes characterized by parameter classes marked by asterisks are taken to be first-order processes.
Figure 3System size and simulation performance. (A) Cost of network-free simulation vs. cost of on-the-fly simulation. The CPU time required to perform the simulation specified in the BioNetGen input file of Supplemental Archive File 1, but without equilibration, was determined for the on-the-fly stochastic simulation algorithm (SSA) implemented in BioNetGen and also for the network-free stochastic simulation algorithm implemented in NFsim. Equilibration was not performed so that the initial condition would encompass a minimal number of populated species. Thus, in these simulations, all proteins were free and unphosphorylated at time t = 0. As can be seen, the computational cost for on-the-fly simulation increases exponentially as a function of time, whereas the computational cost of network-free simulation increases linearly as a function of time. There are no data points for t ≥ 1.5 s for the on-the-fly algorithm because the cost of network generation made simulating the model beyond this time impractical. (B) On-the-fly stochastic simulation of the model with BioNetGen (see Methods). The simulation results demonstrate that a large number of chemical species are populated in the ERBB receptor signaling network.
Figure 4Simulation of ERBB receptor signaling in response to addition of epidermal growth factor and heregulin. Network-free stochastic simulation of the rule-based model illustrated in Figure 2 with NFsim (see Methods). The simulation results demonstrate the capability of the rule-based modeling approach to represent site-specific phosphorylation kinetics. For each time course, phosphorylation level is normalized to the maximum level. Time courses are clustered by similarity (see Methods).
Summary of selected temporal phosphoproteomic data
| ERBB1 (EGFR) | | | | |
| | Y845 | | | X |
| | Y992 | | X | |
| | Y998 | X | | |
| | Y1045 | X | | |
| | Y1068 | | X | X |
| | Y1086 | | | X |
| | Y1148 | X | X | |
| | Y1173 | X | X | X |
| ERBB2 (HER2) | | | | |
| | Y1221/1222 | | | X |
| | Y1248 | X | X | |
| ERBB3 (HER3) | | | | |
| | Y1289 | | | X |
| | Y1328 | | | |
| ERBB4 | | | | |
| | Y1284 | | | X |
| Shc1 | | | | |
| | Y239/240 | X | | X |
| | Y317 | X | X | X |
| Raf-1 | | | | |
| | S289/296/301 | | | X |
| | S338 | | | X |
| MEK1/2 | | | | |
| | S217/T221 | | | X |
| ERK1/2 | | | | |
| | T202/Y204 | X | X | X |
| Gab1 | | | | |
| | Y373 | X | | |
| | Y406 | X | | |
| | Y627 | X | X | X |
| | Y659 | X | | |
| Akt1 | | | | |
| | T308 | | X | X |
| S473 | X | X |
Time courses have been measured for many of the serine, threonine and tyrosine (S/T/Y) residues considered in the model presented here. Here, we focus on three studies that applied distinct experimental techniques to measure time courses of phosphorylation for specific S/T/Y sites. In the study of Wolf-Yadlin et al. [45], the technique of selected reaction monitoring and quantitative mass spectrometry was applied. In the study of VanMeter et al. [46], the technique of reverse phase protein array was applied. In the study of Ciaccio et al. [47], the technique of microwestern array was applied. An ‘X’ entry in this table signifies that a time course of phosphorylation for the indicated residue was measured in the indicated study.