Cell fate is governed by combinatorial actions of transcriptional regulators assembling into multiprotein complexes. However, the molecular details of how these complexes form are poorly understood. One such complex, which contains the basic-helix-loop-helix heterodimer SCL:E47 and bridging proteins LMO2:LDB1, critically regulates hematopoiesis and induces T cell leukemia. Here, we report the crystal structure of (SCL:E47)bHLH:LMO2:LDB1LID bound to DNA, providing a molecular account of the network of interactions assembling this complex. This reveals an unexpected role for LMO2. Upon binding to SCL, LMO2 induces new hydrogen bonds in SCL:E47, thereby strengthening heterodimer formation. This imposes a rotation movement onto E47 that weakens the heterodimer:DNA interaction, shifting the main DNA-binding activity onto additional protein partners. Along with biochemical analyses, this illustrates, at an atomic level, how hematopoietic-specific SCL sequesters ubiquitous E47 and associated cofactors and supports SCL's reported DNA-binding-independent functions. Importantly, this work will drive the design of small molecules inhibiting leukemogenic processes.
Cell fate is governed by combinatorial actions of transcriptional regulators assembling into multiprotein complexes. However, the molecular details of how these complexes form are poorly understood. One such complex, which contains the basic-helix-loop-helix heterodimer SCL:E47 and bridging proteins LMO2:LDB1, critically regulates hematopoiesis and induces T cell leukemia. Here, we report the crystal structure of (SCL:E47)bHLH:LMO2:LDB1LID bound to DNA, providing a molecular account of the network of interactions assembling this complex. This reveals an unexpected role for LMO2. Upon binding to SCL, LMO2 induces new hydrogen bonds in SCL:E47, thereby strengthening heterodimer formation. This imposes a rotation movement onto E47 that weakens the heterodimer:DNA interaction, shifting the main DNA-binding activity onto additional protein partners. Along with biochemical analyses, this illustrates, at an atomic level, how hematopoietic-specific SCL sequesters ubiquitous E47 and associated cofactors and supports SCL's reported DNA-binding-independent functions. Importantly, this work will drive the design of small molecules inhibiting leukemogenic processes.
Many, if not all, cellular processes are driven by multiprotein complexes. In the nucleus, control of gene expression is achieved through assembly of such molecular machines on genomic loci in a highly controlled manner. These complexes contain combinations of regulators such as transcription factors (TFs), cofactors, and chromatin-remodeling proteins (Goodrich and Tjian, 2010; Malik and Roeder, 2010) that assemble in a modular manner for rapid adaptation of gene expression programs. Although at the heart of transcription, little is known about the molecular mechanisms driving formation of multiprotein complexes and their interaction with DNA, the role of individual components in this process, and the relationship between TFs and chromatin-remodeling proteins. To address some of these questions, we have analyzed the associations between two evolutionarily conserved families of transcriptional regulators, namely basic-helix-loop-helix (bHLH) and LIM domain-containing proteins (German et al., 1992; Kong et al., 1997; Lee and Pfaff, 2003) at structural, functional, and biochemical levels.The hematopoietic system offers a well-characterized model to study bHLH/LIM protein interactions, such as those engaged by the tissue-specific class II bHLH TF SCL/TAL1 (hereafter called SCL) and the non-DNA-binding LIM-only protein LMO2. SCL and LMO2 were initially discovered through chromosomal translocations involved in T cell acute lymphoblastic leukemia (T-ALL) (Begley et al., 1989; Boehm et al., 1991). Indeed, their ectopic expression in T cell precursors occurs in up to 60% of childhood T-ALL cases of which 80% coexpresses SCL and LMO2 (or LMO1) (Ferrando et al., 2002, 2004). In normal hematopoiesis, SCL and LMO2 are absolutely required for hematopoietic specification and terminal differentiation of specific hematopoietic lineages (Hall et al., 2003; Kassouf et al., 2008; Lécuyer and Hoang, 2004; Patterson et al., 2007; Porcher et al., 1996).SCL forms obligate heterodimers through its HLH domain with ubiquitously expressed class I bHLH E proteins (such as the E2A gene products, E47 and E12) (Porcher et al., 1999). The SCL:E47 heterodimer binds through its basic regions to an E box DNA recognition sequence (CANNTG), each monomer recognizing one-half of the E box (Massari and Murre, 2000). The SCL:E47 heterodimer then nucleates a “core” multiprotein complex by binding to the adaptor protein LMO2 and its interacting partner LDB1 (LIM-binding domain 1) (Lécuyer and Hoang, 2004).The SCL core complex (SCL:E47:LMO2:LDB1) acquires further specificity through recruitment of additional protein partners. In erythroid cells, LMO2 recruits hematopoietic-specific TF GATA1, creating the so-called “pentameric” complex that binds a bipartite E box-GATA DNA sequence (Wadman et al., 1997; Kassouf et al., 2010). The complex regulates expression of important hematopoietic-specific genes (Kassouf et al., 2010) upon recruitment of various combinations of cofactors and chromatin remodelers, such as ETO2, mSin3A, P300, PCAF, and LSD1 (Li et al., 2012; Schuh et al., 2005).In T-ALL, the prevailing model suggests that ectopically expressed SCL and LMO2 synergize to prevent the activity of E protein homodimers, essential for normal progression of T cell differentiation. Specifically, through a sequestration mechanism that is yet to be characterized at the molecular level, E proteins are locked into ectopic SCL core complexes that act by repressing apoptotic pathways and preventing the normal T cell transcriptional program (Herblot et al., 2000; Palii et al., 2011; Sanda et al., 2012).Interestingly, for its functions in hematopoietic specification and leukemogenesis, SCL does not rely on direct DNA-binding activities (Draheim et al., 2011; Kassouf et al., 2008), suggesting that it may work off DNA or be tethered to DNA through other DNA-bound TFs. Additionally, we previously showed that one-fifth of SCL’s genomic targets in erythroid cells can recruit the factor independently of its DNA-binding activity (Kassouf et al., 2010), suggesting a critical network of additional protein:protein and protein:DNA interactions for the nucleation of such complexes.Despite a wealth of studies defining the SCL core complex as a key transcriptional regulator in normal and malignant hematopoiesis, its mechanism of action remains undefined at a molecular level. Specifically, the molecular interactions governing protein:protein and protein:DNA associations are not understood. Moreover, the molecular details of the E protein sequestration model, which confers to ectopically expressed SCL and LMO2 their oncogenic properties, are unclear. Although functional, biochemical, and biophysical studies have suggested potential pathways to complex assembly (Lécuyer et al., 2007; Ryan et al., 2008; Schlaeger et al., 2004; Wadman et al., 1997), complete dissection of the molecular rules driving complex formation has been hindered by the lack of a full structural characterization. No structure exists for SCL, and whereas other bHLH proteins and LIM proteins, including E47 (Ellenberger et al., 1994) and LMO2 (El Omari et al., 2011), have been structurally characterized, no molecular account exists of their association. The mechanistic role of each complex component and their contribution to cofactor recruitment also remain unclear.Here, we report the structure of the SCL core complex (SCL:E47)bHLH:LMO2:LDB1LID bound to DNA. This structure, which details a bHLH:LIM protein association, provides an atomic description of the interactions driving the association of the SCL:E47 heterodimer and its interface with DNA in the presence and in absence of LMO2. Analysis of the structure uncovers unexpected roles for SCL, E47, and LMO2. Functional and biochemical analyses further reveal complex synergies between components of the complex, their cofactors, and DNA targets. Together, these results deepen our understanding of how higher-order, hematopoietic-specific complexes form and how tissue-specific gene expression programs might be regulated. Importantly, the structure, which delivers insights into the oncogenic properties of the complex, will drive the design of small molecules for targeted inhibition of oncogenic processes in T-ALL.
Results
Overview of the Structure
The crystal structures of the (SCL:E47)bHLH heterodimer and that of (SCL:E47)bHLH complexed to LMO2:LDB1LID, both bound to a 11 bp blunt-ended E box DNA, were determined at 2.8 and 2.9 Å resolution, respectively (Table 1; Figures 1 and S1C). DNA-bound (SCL:E47)bHLH is topologically similar to previously reported bHLH structures, such as the (E47:E47)bHLH homodimer and (E47:NeuroD1)bHLH heterodimer (Ellenberger et al., 1994; Longo et al., 2008). The basic domains contact the major groove of the DNA (Figure S1A), and the HLH domains, comprising helices 1 and 2 (H1, H2) joined by a loop region, form a parallel left-handed four-helix bundle (Figure 1A). H1 in SCL is four amino acids shorter at its C terminus compared to E47. The large side chain of Y235 projecting from the side of H2 prevents formation of a longer H1 in SCL, whereas in E47, the longer helix is accommodated by residue V595 in the same position (Figure 1A). Because the valine and the tyrosine are conserved residues respectively among class I and class II bHLH proteins (Figure 1E), it is likely that the length of H1 is a characteristic of each bHLH class. The length of H1 impacts the length of the loop: the size and amino acid composition of the loop are poorly conserved across different classes. On the other hand, sequence conservation in this region between SCL, Tal2, Lyl1, Hen1 and Hen2 (all class II proteins binding to LMO proteins; Aoyama et al., 2005; Manetopoulos et al., 2003; Wadman et al., 1994; Figure 1E) suggests a role for the loop in directing binding preference to LMO partner proteins.
Table 1
Data Collection and Refinement Statistics
Data Collection Details
(SCL:E47)bHLH:LMO2:LDB1LID:DNA
(SCL:E47)bHLH:DNA
Data collection site
I04 Diamond
I02 Diamond
Space group
F222
C2221
Cell dimensions (Å, °)
a = 102.9, b = 141.0, c = 148.8
a = 73.2, b = 152.8, c = 55.4
α = β = γ = 90
α = β = γ = 90
Wavelength (Å)
1.2829
0.9795
Resolution range (Å)
50.0–2.8 (2.9–2.8)
50.0–2.9 (3.0–2.9)
Redundancy
3.8 (2.8)
3.8 (3.7)
Completeness (%)
97.8 (95.5)
97.8 (98.5)
Average I/σ (I)
11.0 (1.8)
15.6 (1.5)
Rmerge
0.086 (0.511)
0.079 (0.972)
Refinement Statistics
Resolution range (Å)
30.15–2.80 (2.87–2.80)
42.44–2.87 (2.95–2.87)
R factor (Rwork/Rfree)
0.224/0.269
0.254/0.288
Rmsd bond length (Å)
0.006
0.005
Rmsd bond angle (°)
0.9
0.8
Mean B factor (Å2)
91.7
117.1
Ramachandran plot (%) favored/allowed/outliers
100.0/95.5/0.0
100.0/99.3/0.0
Figure 1
Structures of the DNA-Bound SCL:E47bHLH and SCL:E47bHLH:LMO2:LDB1LID
(A and B) Ribbon diagrams of the DNA-bound (SCL:E47)bHLH (A) and (SCL:E47)bHLH:LMO2:LDB1LID (B) structures. SCLbHLH, E47bHLH, LMO2, and LDB1LID are colored in yellow, raspberry, green, and magenta, respectively, whereas the DNA strands are shown in blue and pink. basic, basic DNA-binding helix.
(C and D) Surface and ribbon diagram representations showing the residues involved in the HLH interface in the (SCL:E47)bHLH (C) and the (SCL:E47)bHLH:LMO2:LDB1LID (D) structures. Hydrophobic residues are colored in orange; residues involved in hydrogen bonding in dark blue.
(E) Sequence and structural alignment of bHLH domains of SCL, E47, and other class I and class II bHLH proteins. Conserved amino acids are drawn in red boxes; similar amino acids are in red type. Secondary structures of the bHLH domain of SCL, NeuroD1, MyoD, and E47 are depicted in yellow, light gray, dark gray, and red, respectively. UniProtKB sequence accession numbers are P17542 (SCL), Q13562 (NEUROD1), P15172 (MYOD1), P12980 (LYL1), Q16559 (TAL2), Q02575 (HEN1), Q02577 (HEN2), P15884 (E2-2), Q99081 (HEB), and P15923 (E12 and E47). PDB accession numbers are 2YPB (SCL and E47), 2QL2 (NeuroD1), and 1MDY (MyoD).
See also Figures S1 and S2 and Table S1.
Figure S1
Electron Density Maps and B Factor Distribution of the DNA-Bound (SCL:E47)bHLH and (SCL:E47)bHLH:LMO2:LDB1LID, Related to Figure 1
(A) 2Fo - Fc electron density maps contoured at 1 level (blue) of the SCL:DNA interface (left) and E47:DNA interface (right). Proteins and DNA are colored as in Figure 1A.
(B) Hydrogen bond contacts seen at the (SCL:E47)bHLH:LMO2 interface. The 2Fo - Fc map is colored in blue and contoured at 1 level. Proteins and DNA are colored as in Figure 1A.
(C) Ribbon representations of the (SCL:E47)bHLH:LMO2:LDB1LID:DNA complex on the left, and the (SCL:E47)bHLH:DNA on the right. Complexes are colored according to the distribution of their B-factors from blue to red (low to high B-factors).
The three-dimensional structure of DNA-bound (SCL:E47)bHLH:LMO2:LDB1LID reveals how LMO2 binds to the bHLH heterodimer (Figure 1B). The interactions occur almost exclusively through the first LIM domain of LMO2 (LIM1), H2 of SCL and E47, and the loop of SCL. Only two residues (R100, R102) of the second LIM domain (LIM2) interface with (SCL:E47)bHLH. Our previous work reporting the structure of LMO2:LDB1LID (El Omari et al., 2011) highlighted the functional importance of the LIM interdomain flexibility mediated through hinge residue F88. The structure of (SCL:E47)bHLH:LMO2:LDB1LID supports this conclusion because the LIM domains undergo a hinge rotation around F88 to bind to (SCL:E47)bHLH (Figure S2).
Figure S2
Flexibility of the Two LIM Domains of LMO2, Related to Figure 1B
Structural superimposition of LMO2:LDB1LID from the quaternary complex (yellow), on the two crystal forms of the LMO2:LDB1LID structure: C2 space group (cyan) and P21 space group (green). Superimpositions are done on the first LIM domain, and the LID domain is colored in magenta. PDB codes, 2YPA (LMO2:LDB1LID from quaternary complex, this study), 2xjz, and 2xjy (LMO2:LDB1LID; El Omari et al., 2011).
The (SCL:E47)bHLH Heterodimer Interface
Interactions between bHLH proteins are driven by the strength of their homodimeric versus heterodimeric association. The composition of the buried hydrophobic interface of (SCL:E47)bHLH (orange in Figure 1C; Table S1) is very similar to that of (E47:E47)bHLH (Ellenberger et al., 1994). Sequence conservation of these hydrophobic residues across class I and II bHLH proteins (Figure 1E) suggests structural conservation of the HLH interface. Hydrophilic amino acids (Table S1, light-blue residues) are also found at the bHLH interface but are less conserved and might reflect heterodimerization specificity between different bHLH proteins. One salt bridge and two hydrogen bonds are formed between H2 residues from SCL and E47 in the (SCL:E47)bHLH structure (dark-blue residues in Figure 1C and Table S1): E247:R606, Q246:N607, and Y235:E600, respectively. Because this latter bond is also seen in (NeuroD1:E47)bHLH (Longo et al., 2008), it might be a common feature of all E47 heterodimers because Y235 is conserved among class II bHLH factors (Figure 1E). The E47 homodimer interface contains two symmetrically positioned hydrogen bonds between E600 of H2 from one monomer and H576 in H1 from the other monomer (Ellenberger et al., 1994). No structure is available for the SCL homodimer, but it is very likely that, like in the MyoD homodimer structure (Ma et al., 1994), no hydrogen bonds are formed between monomers, a characteristic that may decrease the stability of class II homodimers. In our hands, SCLbHLH homodimers could not be purified in isolation, whereas stable (SCL:E47)bHLH heterodimers could be formed by coexpressing the two proteins; moreover, SCL homodimers are not described in vivo. These results are consistent with the view that E47:SCL is thermodynamically favored in solution because SCL homodimers are disfavored, as previously suggested for NeuroD1 (Longo et al., 2008). Moreover, association constants have been measured for (E12:E12)bHLH, (SCL:SCL)bHLH, and (SCL:E12)bHLH with a respective KΔ of 7.4 × 104, 5.3 × 106, and 3.5 × 107 M−1 (Ryan et al., 2008), in agreement with the fact that the heterodimer is more stable than individual homodimers. Note that E2A isoforms E47 and E12 are highly homologous in their bHLH region and show 100% conservation in the residues at the interface with SCL and DNA, suggesting very similar structural and biophysical properties (see Figure 1E and Table S1).Analysis of SCL:E47 structures in the presence of LMO2 reveals the structural impact of LMO2 on heterodimerization. Binding of LMO2 only slightly increases the overall surface buried in the (SCL:E47)bHLH interface (2,185 versus 2,149 Å) (Figure 1D). However, a substantial gain is seen in the network of hydrogen bonds. Two new hydrogen bonds (Q246:V603 and R230:D561) and one salt bridge (R230:E568) are now formed (dark blue, Figure 1D; Table S1). Interestingly, two of these new bonds connect H2 of SCL to H1 of E47. Salt bridges have been implicated in the dimerization strengths and preferences of bHLH proteins (Shirakata et al., 1993). By inducing the formation of new hydrogen bonds and salt bridges, LMO2 binding is likely to increase the heterodimerization affinity over E47 homodimerization. Moreover, the reported in vitro association constant of LMO2 to the heterodimer (∼1.8 108 M−1) (Ryan et al., 2008) is much stronger than either homo- or heterodimeric association constants (see above), suggesting an important function for LMO2 in “locking” the heterodimer structure.
Interaction of SCL:E47bHLH with DNA
(SCL:E47)bHLH is oriented on the E box motif (forward, 5′-C1A2G3A4T5G6-3′; reverse, 5′-C1∗A2∗T3∗C4∗T5∗G6∗-3′); SCL binds the CAG half-site and E47 the C∗A∗T∗ half-site (Figures 2A, 2B, and S1A). Compared to other bHLH structures, (SCL:E47)bHLH makes fewer overall contacts to DNA. The conserved glutamates E555 (E47) and E196 (SCL) form the only specific interactions with the DNA bases. Notably, E555 of E47 contacts the DNA loosely, making one putative hydrogen bond to N4 of C1∗ (3.6 Å) and sitting at a distance of 3.9 Å from T5. The orientations of the conserved glutamate side chains are stabilized by a salt bridge with conserved neighboring arginines (SCL R199, E47 R558). All other interactions with DNA are through the phosphate backbone. Four bonded contacts are formed by SCL and three by E47 (Figure 2C). The side chains of conserved lysines at the tip of H2 (SCL K225, E47 K585; see Figure 1A), capped by the phosphate of C1/1∗, are stabilized by a salt bridge to conserved asparagines on H1 (SCL N204, E47 N563; see Figure 1A), suggesting a bHLH canonical mechanism to anchor the position of H1 and H2 to each other and to the DNA. All the other contacts with the DNA backbone are nonbonded contacts.
Figure 2
The DNA Recognition Interfaces
(A and B) Detailed view of SCLbHLH (A) and E47bHLH (B) interactions in the major groove.
(C and D) Schematic diagram of detailed protein-DNA contacts of the (SCL:E47)bHLH structure (C) compared with the contacts of the (SCL:E47)bHLH:LMO2:LDB1LID complex (D) using NUCPLOT (Luscombe et al., 1997). Hydrogen bonds (within 3.5 Å) are represented as thick lines to the backbone (filled circles) or to the bases (bold letter). van der Waals contacts (within 3.9 Å) are represented as thin lines.
(E) Subdomain motions of the bHLH domains induced by LMO2 binding. Ribbon diagrams of the DNA-bound (SCL:E47)bHLH and (SCL:E47)bHLH:LMO2:LDB1LID structures superimposed based on H2 of E47. The angle of rotation required to superimpose the basic and H1 helices of E47 is 12°. Color coding is yellow and gold (SCL), raspberry and red (E47) for the (SCL:E47)bHLH and (SCL:E47)bHLH:LMO2:LDB1LID structures, respectively. The black rectangle corresponds to the region shown in (F) and (G).
(F and G) Close-up views of the area around E47 residues K585 and N563, showing 2Fo − Fc electron density maps contoured at 1 s level (gray) and distances between residues and DNA. (F) (SCL:E47)bHLH structure. (G) (SCL:E47)bHLH:LMO2:LDB1LID structure. Proteins and DNA are colored as in (E).
See also Figure S1A.
In our structure, the (SCL:E47)bHLH heterodimer does not directly contact the central bases of the E box motif, although water-mediated contacts cannot be excluded at the resolution of our structures. This is in contrast to the specific contacts observed in the (E47:E47)bHLH:DNA structure (Ellenberger et al., 1994). Indeed, the two E47 monomers, crystallized in the presence of the consensus sequence for E47 (CACCTG), contacted the DNA asymmetrically: the “nonspecific” monomer interacted with the C∗A∗G∗ half-site through binding of E555 to the base of C1∗; the “specific monomer” interacted with CAC through binding of R556 to the central base G4∗. In the (SCL:E47)bHLH structure, both the E47 and the SCL basic helices adopt the nonspecific conformation. Although a specific conformation is not observed in the crystal structure itself, this does not preclude the possibility of a rotation in solution of the E47 R556 side chain to contact the central base A4. The basic helix of SCL appears however to be less flexible and less likely to adopt the specific conformation (i.e., rotation of the R197 side chain to contact the base of C4∗). Indeed, Q201 in SCL contacts the phosphate backbone at T3∗, whereas the equivalent residue in E47 (R560) does not, giving more flexibility to the protein-backbone interaction (Figure 2C). Similarly, the conserved glutamate E196 in SCL contacts the bases of C1, A2, and T5∗ more closely than the equivalent residue E555 in E47 (Figures 2A–2C), seemingly locking SCL in its unspecific conformation. It seems therefore likely that if (SCL:E47)bHLH is able to bind E boxes specifically, its orientation on DNA is dictated by E47.Binding of LMO2 to SCL:E47 imposes unsuspected structural constraints affecting heterodimer:DNA interactions. As discussed above, binding of LMO2 tightens the heterodimerization strength by increasing the network of hydrogen bonds, including the creation of a salt bridge between H2 of SCL (R230) and H1 of E47 (E568). The creation of these new bonds following LMO2 binding is likely to be caused by subtle rearrangements occurring to H2 of SCL, the main interface of interaction of the heterodimer with LMO2 (see below). The creation of the new bonds and the subtle conformational changes at the heterodimer interface induce movements of the H1 helices and, consequently, of the basic domains. Specifically, the hinge angle between H1 and H2 of E47 undergoes a large rotation of about 12° (Figure 2E). This causes a shift in the bond between the conserved lysine K585 at the tip of E47 H2 and conserved asparagine N563 in E47 H1 (Figures 2F and 2G), a loosening of the contacts between K585 and backbone DNA (from 2.7 to 4.2 Å, Figures 2D, 2F, and 2G), and a decrease in the overall surface buried between E47 and DNA (∼120 Å2). The overall network of bonded and nonbonded contacts between the LMO2-bound heterodimer and DNA decreases in size compared to the nonbound form (from 73 to 56 contacts). As a result, binding of LMO2 is predicted to destabilize the (SCL:E47)bHLH:DNA interface. Of note, in vitro measurement of the DNA-binding affinities shows that the (SCL:E12)bHLH:LMO2:LDB1LID complex binds palindromic and nonpalindromic E box motifs with lower affinity (between 3.7- and 1.5-fold) than (SCL:E12)bHLH (KΔ = 4 × 105 M−1 versus KΔ = 1.5 × 106 M−1 for CAGCTG, and KΔ = 1.7 × 106 M−1 versus KΔ = 2.5 × 106 M−1 for CAGGTG) (Ryan et al., 2008).
Interaction of SCL:E47bHLH with LMO2:LDB1LID
Analysis of the (SCL:E47)bHLH:LMO2:LDB1LID structure defines the SCL:LMO2 interface (Figure 3A). Despite a relatively high reported association constant (Ryan et al., 2008), the SCL:LMO2 complex buries a surface area of only 1,232 Å2, which is at the lower end of the “standard size” area (1,200–2,000 Å2) buried in heterodimeric protein-protein complex (Lo Conte et al., 1999). The SCL:LMO2 interaction occurs through residues localized within a small region of their surfaces, mainly through H2 and loop of SCL and LIM1 of LMO2 (Figure 3A), involving an approximately equal number of hydrophobic and hydrophilic interactions. The surface-exposed hydrophobic residues of SCL (F238, L242, L213) and LMO2 (L59, L79, L87, F88) are the main stabilizers of the interface, whereas two hydrogen bonds (L213:R86, H217:C60) and a salt bridge (D245:R77) provide specificity.
Figure 3
The SCL:E47bHLH:LMO2 Interface
(A and B) Surface and ribbon representation of the interface between LMO2 and, respectively, SCLbHLH (A) and E47bHLH (B). Residues found at the interface are depicted in a stick representation. Interaction surfaces of SCL and E47 are colored in yellow and red, respectively, with overlaid electrostatic potential (blue indicates positive; red is negative) as calculated by the program PyMOL (DeLano, 2002). See also Figure S1C.
Importantly, LMO2 interacts with E47. However, the surface area buried in this interface is very small (622 Å2). Despite the limited number of hydrophobic and hydrophilic interactions centered around residues F88 (LMO2 LIM1) and R604 (E47), a robust salt bridge is formed between R102 (LMO2 LIM2) and E600 (E47), which in turn forms a hydrogen bond with Y235 (SCL) (Figures 3B and S1B). A second arginine in LMO2, R100, is also in close proximity of E600 (4.5 Å), creating a highly charged binding groove. Potentially, this interaction is the driving force for the change in the SCL:E47 interface packing.
Functional Identification of the Residues Involved in the SCL:LMO2 Interaction
To probe the relationship between the observed crystal structure interactions and their biological role, we generated a series of mutations predicted to perturb the interfaces between SCL, LMO2 and E47.We first tested the capacity of LMO2 interface mutations (LMO2-L59G, R77A, R86A, F88D, and R100A/R102A) to bind to wild-type (WT) SCL in two-hybrid luciferase reporter assays (Figures 4A and S3A). The L59G mutation increased the luciferase activity, suggesting that other exposed hydrophobic residues compensate for the absence of this residue. With R77A and R86A, luciferase activity was reduced by 60% and 80%, respectively, pointing at a specific function for the residues mediating the hydrogen-bonding network. No activation was observed with the F88D hinge mutant, confirming our previous conclusions that the SCL:LMO2 interaction is strongly affected when the rotation movement between the two LIM domains of LMO2 is restricted (El Omari et al., 2011). Finally, the LIM2 domain mutant R100A/R102A was tested for its interaction with SCL. R102 is the only residue of LMO2 in hydrogen bond contact with E47; moreover, both R100 and R102 are not conserved in other LMOs, suggesting an LMO2-specific function. Surprisingly, the mutation increased the luciferase activity, suggesting that the function of these residues is not to provide specificity for the interaction but, perhaps, to lock the relative positions of LIM1 and LIM2 when binding to SCL:E47.
Figure 4
Identification of Functionally Critical Residues at the SCL:E47:LMO2 Interface
(A and B) Luciferase-based mammalian two-hybrid assays were conducted in HEK293 cells with VP16-SCL, Gal4-LMO2, and Gal4-E47 fusion proteins. (A) WT and mutant LMO2 proteins are coexpressed with WT SCL. (B) WT and mutant SCL proteins are coexpressed with WT E47, WT LMO2, or LMO2-L59G. Luciferase activities resulting from interactions between WT proteins are set to 1. Error bars represent the SE from three biologically independent.experiments.
(C) Gain-of-function studies in zebrafish embryos. One-cell-stage embryos were coinjected with WT Scl mRNA and WT or mutant Lmo2 mRNA. Expansion of the field of expression of the hemangioblast marker Fli1 was monitored by reverse-transcription qPCR (top) or whole-mount in situ hybridization (bottom) on ten-somite-stage embryos (14 hpf). Error bars represent the SE from four biologically independent.experiments. Brackets indicate cardiac mesoderm; arrows point to head mesoderm. Anterior to the top.
(D) Phenotypic rescue of hematopoiesis from Scl− ES cells is detected using the BL-CFC assay (bottom). Blast colonies were generated upon in vitro differentiation of WT ES cells and Scl− ES cells untransfected (−) or rescued with WT or mutated versions of SCL, as indicated (top). Error bars represent the SE from three biologically independent.experiments.
∗p < 0.05; ∗∗p < 0.01.
See also Figure S3.
Figure S3
The SCL and LMO2 Mutant Proteins Are Expressed at Levels Similar to the Corresponding Wild-Type Proteins, Related to Figure 4
(A) Western Blot analysis showing expression of wild-type and mutated VP16-SCL and Gal4-LMO2 as well as GAL4-E47, cotransfected in HEK293 cells as indicated. GAL4 and VP16 antibodies were used to reveal E47, SCL and LMO2 fusion proteins. mSIN3A serves as loading control.
(B) Top: The fluorescence emitted by GFP (reflecting LMO2 expression) and RFP (for injection efficiency) is detected in embryos coinjected with wild-type Scl and wild-type or mutant Lmo2 mRNA at 10-somite stage. Bottom: Western Blot showing expression of HA-SCL in all injected embryos. Anti-beta tubulin serves as loading control.
(C) Western Blot analysis of exogenous SCL in day 4 EBs derived from Scl ES cells untransfected (-) or transfected with wild-type or mutated versions of SCL, as indicated. Anti-SCL antibodies were used; mSin3A serves as loading control.
To analyze the functional consequences of these mutations in vivo, we performed gain-of-function studies in zebrafish embryos (Gering et al., 2003). Coinjection of WT HA-Scl and Gfp-Lmo2 mRNA leads to an expansion of Fli1-expressing hemangioblasts (mesodermal precursors of endothelial and hematopoietic cells) in the head, cardiac territory, and posterior domain of the embryos (Figure 4C, bottom panel); quantitative PCR (qPCR) at the ten-somite stage shows a 2-fold increase in Fli1 expression levels when compared to WT embryos (Figure 4C, top panel). When expressed with WT SCL, LMO2 L59G and R100A/R102A did not affect Fli1 expression, whereas the mutants R77A, R86A, and F88D led to a significant reduction in Fli1 expression when compared to WT LMO2 (Figures 4C and S3B), in agreement with the two-hybrid assay (Figure 4A).We then assessed the role of three residues in SCL for interaction with E47 and LMO2 and for their function in hematopoietic development. Y235, in H2, lies at the interface between SCL and E47. H217 and F238, at the interface between SCL and LMO2, are conserved in hematopoietic bHLH proteins (LYL1, TAL2), but not in neuronal and muscle proteins (NeuroD1, HEN1, HEN2, and MyoD, Figure 1E), suggesting a hematopoietic-specific function. When tested in the two-hybrid assay, interaction of SCL Y235A with E47 and LMO2 was greatly impaired (Figures 4B and S3A), confirming (1) the importance of Y235 for heterodimerization, and (2) the necessity of heterodimer formation for SCL to interact with LMO2. Individual mutations H217A and F238A did not affect the SCL:LMO2 interaction; however, when combined (H217A/F238A), interaction was reduced by two-thirds. Combining SCL F238A and LMO2 L59G mutations totally abolished the interaction.To test these mutations functionally, we performed phenotypic rescue assays of Scl− ES cells. WT ES cells differentiate into cellular structures called embryoid bodies (EBs) that contain hematopoietic precursors giving rise to blood lineages. Using this assay, we have previously shown that the SCL:LMO2 interaction was required for the development of primitive erythropoiesis (Schlaeger et al., 2004). In contrast, here, we assessed the formation and development of blast colony-forming cells (BL-CFCs or hemangioblasts), the earliest clonal ES cell-derived mesodermal precursors of blood and endothelial cells (Choi et al., 1998) (Figure 4D). This process requires SCL activity (D’Souza et al., 2005); therefore, blast colonies were not observed from Scl− cells. Blast colony-forming potential was fully rescued upon expression of a WT Scl cDNA into Scl− ES cells (Figures 4D and S3C). The single (Y235A, F238A, H217A) and double (F238A/H217A) mutants all abolished SCL activity, highlighting the impact of key, single mutations on the function of the multiprotein complex in vivo.Altogether, these data define residues critical for formation of a stable SCL:E47:LMO2 interface and demonstrate the requirement for SCL heterodimerization and interaction with LMO2 for its earliest role in hematopoietic development.
Building Higher-Order Multiprotein Complexes
In erythroid cells, the SCL core complex interacts with GATA1 and cofactors that confer transcriptional activity. In order to understand how these higher-order complexes assemble, we evaluated the contribution of the components of the complex to the recruitment of known interacting coactivators (histone acetyltransferases PCAF and P300), corepressors (ETO2 and histone deacetylase mSIN3A), and histone demethylase LDS1. Guided by our structural and functional analyses, we selected point mutations that precisely disrupt heterodimerization (SCL Y235A) and interaction of SCL with LMO2 (SCL H217A/F238A and LMO2 R86A); we also tested mutation of the hinge residue in LMO2 (F88D). We expressed the corresponding mutant forms of SCL or LMO2 as biotinylated proteins (bio-SCL or -LMO2) in erythroid MEL cells (Schuh et al., 2005). Upon streptavidin-affinity pull-down, copurified proteins were identified.Members of the core complex, GATA1, and all the cofactors tested were efficiently pulled down with WT bio-SCL (Figure 5A, left panels, PD fraction). SCL Y235A prevented interaction with E47 and all core members and with mSIN3A, PCAF, ETO2, and P300, confirming that heterodimerization is essential for SCL activity. Interestingly, interaction between SCL Y235A and LSD1 was preserved, suggesting a different mechanism of recruitment. The SCL H217A/F238A mutations abolished interaction with LMO2 and, consequently, with LDB1 and GATA1, but not with the cofactors; their recruitment therefore does not require interaction with LMO2 and LDB1 and is likely to occur via the heterodimer.
Figure 5
Building a Network of Interactions between SCL:E47:LMO2:LDB1 and GATA1, Corepressors, Coactivators, and Chromatin-Remodeling Proteins
(A) Multiprotein complexes containing bio-SCL (WT, Y235A, or H217A/F238A) or bio-LMO2 (WT, F88D, or R86A) were affinity purified from MEL cell nuclear extracts and analyzed by western blotting for the presence of other members of the core complex and known cofactors. Antibodies used are indicated. Streptavidin-HRP was used to detect bio-SCL or LMO2 (red-framed western blots). MEL BirA, negative control. The asterisk (∗) indicates unspecific bands.
(B) Model representing the pentameric complex bound to its recognition motifs (E box-GATA or 1/2 E box-GATA; Kassouf et al., 2010) and its interactions with known cofactors as suggested by our biochemical study. The structure of the quaternary complex bound to DNA is depicted as in Figure 1B. The Nt portion of E47 and the activation domain AD1 are schematically represented as a red ribbon to be able to position ETO2, P300, PCAF, and mSin3A. See Discussion for details.
See also Figure S4.
WT bio-LMO2 (Figure 5A, right panels, PD fraction) efficiently interacted with the core complex, GATA1, and the cofactors. The R86A mutant, unable to interact with SCL and E47, retained binding to LDB1 and GATA1, but not to the corepressors mSin3A, ETO2, and LSD1. It could however still interact with the coactivators PCAF and P300. When the conformational flexibility of LMO2 was impaired (F88D), interaction with SCL and E47 was abolished but not with LDB1 nor GATA1, indicating that these latter interactions can occur independently of the hinge movement and, in contrast to recent suggestions by Dastmalchi et al. (2012), confirming that mutation of F88 does not overall affect the structure of LMO2. mSIN3A, PCAF, ETO2, and P300 were not able to interact with LMO2 F88D; however, we detected an interaction with LSD1. Altogether, this suggests that interaction of LMO2 with mSin3A and ETO2 relies on binding to the heterodimer, whereas interaction with the coactivators can occur independently, yet it requires the hinge movement. The LMO2:LSD1 interaction does not require SCL:E47 but only occurs when LMO2 conformation is locked, possibly through GATA1 and its partner FOG1 (Snow and Orkin, 2009).
Discussion
This report details at the atomic and functional level the exquisite interplay of interactions between the bHLH proteins SCL and E47 and the LIM protein LMO2, thereby delivering mechanistic insights into hematopoietic specification and leukemogenic transformation. Given the importance of bHLH:LIM associations in development and differentiation processes, this provides general principles that will apply to other tissue-specific complexes.
LMO2 Stabilizes the SCL:E47 Association but Weakens the Interaction of the Heterodimer with DNA
In addition to their association into heterodimers, SCL and E2A bHLH domains can exist in vitro as homodimers (although to date, only E2A homodimers have known biological functions). Our results provide a structural explanation for the hetero/homodimerization preferences and for the modulation of these affinities upon LMO2 binding. As detailed in the Results section, we suggest that the SCL:E47 heterodimerization interface is more stable than either individual homodimerization interfaces and that binding of LMO2 induces subtle conformational changes resulting in the creation of new hydrogen bonds.Furthermore, conserved hydrogen bonds, found in cis between H1 and H2 of both SCL and E47 and in trans between SCL H2 and E47 H1, suggest a clear link between the angle of the HLH dimerization domain and the DNA contacts available to the basic domains. Binding of LMO2 alters the dimerization angle and weakens the interactions between the basic helices of SCL:E47 and DNA.Therefore, our structural data reveal how LMO2 strengthens the SCL:E47 interaction while weakening its association with DNA. As detailed below, this provides a powerful level of regulation of complex formation and transcriptional activities.
Implications for Leukemogenesis and Normal Hematopoiesis: The Sequestration Model
Ectopic expression of oncogenic TFs in T cell precursors causes T-ALL by inhibition of E2A:E2A target gene expression and aberrant activation of novel targets (Palii et al., 2011; Sanda et al., 2012). Our work provides strong evidence for the E2A-sequestration model in SCL-expressing T-ALL through formation of SCL:E2A heterodimers. Simultaneous expression of LMO2 reinforces this effect and weakens the DNA:heterodimer interactions. Consequently, LMO2 binding is likely to alter expression of E2A:E2A targets by facilitating the displacement of SCL:E2A from DNA. The complex can then be relocated to alternative genomic sites by association with other DNA-binding factors such as RUNX1, ETS1, and GATA3, as recently suggested by the reported association of SCL-binding events with Runx, Ets, and Gata motifs in T-ALL cells (Palii et al., 2011; Sanda et al., 2012) (Figure 6). This is corroborated by transcriptome analyses showing expression of different gene sets in T-ALL cells upon knockdown of SCL (Palii et al., 2011). Therefore, our structure explains at the atomic level how ectopic expression of SCL and LMO2 overall alters the gene repertoire in T cells to ultimately lead to oncogenic transformation.
Figure 6
Model of SCL-Mediated Network of Interactions in Leukemogenesis and Early and Definitive Hematopoiesis
The structural and functional data integrated into this model are discussed in the text. The consensus E box motifs are from Kassouf et al. (2010), Palii et al. (2011), and Soler et al. (2010). Although individual motifs have been reported, composite motifs have not been described in T cells. The DNA motifs and additional TFs recruiting the complex in mesoderm/prehematopoietic cells have not been identified. The “kinked” representation of SCL symbolizes its reported DNA-binding independent activities. The different sizes of the complexes in erythroid cells reflect the proportion of SCL-containing binding events on composite motifs associated with repressed and activated target genes; for details, see Kassouf et al. (2010).
The sequestration model might also help explain the roles of SCL and LMO2 in hematopoietic specification. Coexpression of SCL and LMO2 in mesodermal precursors would favor heterodimerization of E2A with SCL as opposed to other bHLH regulators, thereby imposing a hematopoietic fate at the expense of alternative cell fates. This agrees with recent studies showing ectopic cardiac development in the absence of SCL (Schoenebeck et al., 2007; Simões et al., 2011; Van Handel et al., 2012) (Figure 6).In line with this, our phenotypic rescue of ES cell-derived BL-CFC complemented by gain-of-function studies in zebrafish provides a demonstration that specification of mesodermal precursors toward the hematopoietic lineage critically depends on the activity of a functional SCL:E47:LMO2 complex. Further characterization will reveal some of the mechanisms underlying the earliest stages of blood development.
Structural Basis for the DNA-Binding-Independent Functions of SCL
Dissection of the (SCL-E47)bHLH:DNA interface shows weak interactions with nucleic acids. This might have physiological relevance, allowing fine-tuning of the activities of SCL:E47-containing complexes in vivo through interactions with additional DNA-bound TFs. This is also relevant for the DNA-binding independent activities of SCL in hematopoietic specification, in leukemogenesis, and in interacting with some of its erythroid genomic targets. In red cells, LMO2 mediates interactions of SCL:E47 with GATA1, the complex binding a composite CANNTG-(N)9-GATA. A manually docked pentameric complex bound to E box-GATA DNA (Figure S4) shows that the complex can be assembled only if the 30 amino acid linker between the N- and C-terminal domains of GATA1 assumes a semiextended conformation. In this model, and as dictated by the interaction of the heterodimer with LMO2, SCL binds to the GATA distal half-E box and E47 to the proximal one. Remarkably, the occurrence of a novel composite motif missing the distal SCL-bound half-E box (CTG-(N)9-GATA) (Figure 5B) is increased by up to 3-fold in cells expressing a DNA-binding mutant form of SCL (Kassouf et al., 2010).
Figure S4
Model of the Pentameric Complex Bound to E Box-GATA DNA, Related to Figure 5B
The model was constructed by combining the structure of (SCL:E47)bHLH:LMO2:LDB1LID (PDB: 2YPA, this study) to the previously published structure of the C-terminal domain of GATA1 bound to DNA (PDB code 3GAT; Omichinski et al., 1993) and the haddock model of the N-terminal domain of GATA1 bound to LMO2LIM2:LDB1LID (PDB code 2L6Y; Wilkinson-White et al., 2011).
(A) Ribbon diagram representations showing the location on a E box-GATA DNA of SCLbHLH, E47bHLH, LMO2, LDB1LID and GATAZnf respectively colored in yellow, raspberry, green, magenta and blue.
(B and C) Side view (B) and top view (C) of pentameric complex showing the semi-extended conformation of the linker between the N- and C-terminal zinc finger domains of GATA1.
Therefore, a model emerges both in leukemia and hematopoiesis by which DNA-binding specificity is not driven by SCL:E47 but by its DNA-bound protein partners, the identity of which depends upon the cellular context (Figure 6). This model could explain the lack of interaction of SCL:E47 with the central bases of the E box motif, in contrast to what is seen in the (E47:E47)bHLH homodimer. Indeed, whereas we do not rule out the possibility that binding of the specific monomer to central bases could still be achieved in solution, the lack of interaction observed in our crystals could alternatively be a hallmark of bHLH heterodimers. Because recruitment of the SCL complex to DNA essentially relies on interaction with LMO2 and other DNA-binding proteins, a degenerated E box sequence (CANNTG) would be sufficient to anchor the heterodimer to DNA. In contrast, E47 homodimers do not bind to LMO2 and require additional contacts with DNA for specific genomic target recognition. Altogether, these observations refine our understanding of how specific binding in promoters/enhancers is achieved and reveals an additional level of the mechanistic diversity of TFs.
SCL Directs E47 and Main Cofactors to Hematopoietic-Specific Targets
Our mutational and biochemical analysis uncovers a major role for SCL in directing E47 and associated regulators to DNA (Figure 5B). The regulators ETO2, P300, PCAF, and mSin3A are recruited to the complex by the SCL:E47 heterodimer defective for its interaction with LMO2:LDB1, but not by the heterodimerization-defective form of SCL. This agrees with a previous report mapping the E protein:ETO-related proteins and E protein:P300 interactions to the AD1 domain of E proteins (Zhang et al., 2004). Similarly, PCAF, a P300-associated factor, likely interacts with the AD1 domain. Finally, given the structural similarity of the AD1-binding domain of ETO-related proteins (eTAFH) and the paired amphipathic helix domain (PAH2) of mSIN3A (Plevin et al., 2006), mSin3A also plausibly binds the AD1 domain. Therefore, through both tissue-specific expression and interaction with LMO2, SCL’s main function appears to direct E47 and associated cofactors/chromatin remodelers to lineage-restricted target genes.In contrast, SCL interacts with LSD1 independently of its dimerization to E47, in agreement with their described direct association (Li et al., 2012). In this context, SCL could dimerize with partners other than E proteins, such as HLH ID proteins (Condorelli et al., 1995). SCL:ID complexes might regulate transcription through formation of transient, non-DNA-binding structures sequestering cofactors, such as LSD1, before releasing SCL and interacting proteins for association with E47 into DNA-bound complexes.
LMO2: A Versatile Adaptor Protein
In erythroid cells, LMO2 associates with corepressors mSIN3A and ETO2 only when in a complex with SCL:E47; therefore, not through GATA1. This corroborates previous studies showing that SCL repressor functions are GATA1 independent (and likely to be mediated, at least in part, through interaction with GATA2; Fujiwara et al., 2009), whereas the SCL:GATA1 complex is an activator complex (Tripic et al., 2009) (Figure 6). Moreover, GATA1 repressive complexes do not contain LMO2 (Tripic et al., 2009) and are therefore not associated with SCL:E47. Interestingly, the coactivators P300 and PCAF, although directly bound to E47, can interact with LMO2 independently of its association with SCL:E47. Because most nuclear LMO2 is bound to SCL (this work; Lécuyer et al., 2007), this interaction is likely to be mediated by another TF associated with the complex, such as GATA1. We suggest that acetylation of GATA1 by P300 and PCAF, described as facilitating associations with protein partners and chromatin templates (Lamonica et al., 2006), plays a critical role in the activating functions of the pentameric complex.The hinge movement is differentially required for the LMO2 interactions with SCL:E47 and GATA1 or for its SCL-independent interaction with P300/PCAF and LSD1. This allows LMO2 to accommodate, directly or indirectly, different types of interactions simultaneously, providing an extra level of regulation in complex formation.Therefore, LMO2 appears as a versatile adaptor protein, which not only locks SCL:E47 structures and modulates their DNA-binding affinities but also, through interaction with additional regulators, plays a critical role in directing complexes to DNA. These interactions mediate additional exposure to cofactors, thereby modulating the transcriptional activities of the complex.
The SCL:LMO2 Interface: A Target for Small Inhibitory Molecules
The small size of the SCL:LMO2 interface area (620 Å2) together with the presence of a defined secondary structure element (SCL H2) make it a suitable target for small inhibitory molecules (Bourgeas et al., 2010). Importantly, our mutational analyses provide a detailed picture of key residues along the SCL:LMO2 interface required for stable and synergistic interaction. Given the prevalence of human T-ALL cases coexpressing SCL and LMO2, this interface emerges as a promising target for the development of small molecules aimed at disrupting protein:protein interactions.
Experimental Procedures
Purification, Crystallization, and Structure Determination
LMO2:LDB1-LID construct was cloned, expressed, and purified as described (El Omari et al., 2010). Histidine-tag SCLbHLH and untagged E47bHLH were coexpressed in E. coli and purified by affinity and size exclusion chromatography. The complex was obtained by mixing LMO2:LDB1LID and (SCL:E47)bHLH prior to size exclusion chromatography. DNA was added as a 1.2:1 DNA-protein ratio before crystallization. Two crystal forms were obtained; the first one was solved by experimental phasing to 2.8 Å resolution and contained the DNA-bound (SCL:E47)bHLH:LMO2:LDB1LID. The second one, at 2.9 Å resolution, was solved by molecular replacement and contained DNA-bound (SCL:E47)bHLH. For further details, see the Extended Experimental Procedures and Table 1.
Mammalian Two-Hybrid Assays
Plasmids encoding pVP16-SCL (VP16-transactivating domain fused to SCL) and pM-LMO2 (GAL4 DNA-binding domain fused to LMO2) have been described (El Omari et al., 2011; Schlaeger et al., 2004). E47 was fused to GAL4 DNA-binding domain. All mutants were generated with a QuikChange II Site-Directed Mutagenesis Kit (Stratagene). The CheckMate Mammalian Two-Hybrid System (Promega) was used to conduct two-hybrid experiments in HEK293 cells as described (El Omari et al., 2011).
LMO2-GFP Mutant Expression Constructs, RNA Injection, Zebrafish Embryos, and Imaging
The gfp-Lmo2 construct (El Omari et al., 2011; Gering et al., 2003) was mutated using the QuikChange Site-Directed Mutagenesis Kit (Stratagene). Full-length Lmo2 RNA was synthesized using the Sp6 mMessage mMachine kit (Ambion). HA-Scl/Tal1 and H2B-RFP mRNA was generated as described (El Omari et al., 2011; Megason and Fraser, 2003). Zebrafish embryos were obtained from natural matings and coinjected at the one-cell stage with 100 pg of each mRNA. Embryos were raised and staged as described by Westerfield (2007). Fluorescent signal was acquired on a Zeiss Lumar V12 fluorescence microscope and in situ analysis carried out as described (El Omari et al., 2011). RNA isolation and quantitative reverse-transcription PCR are described in the Extended Experimental Procedures. The fish studies conformed to the regulatory standards approved by UK Government Home Office.
Biotin-Tagged Constructs
WT Scl and Lmo2 cDNAs were tagged in their 5′ end with an oligonucleotide encoding avidin-TEV-FLAG (Driegen et al., 2005) and cloned downstream of the EF1α promoter in a vector bearing a neomycin- or zeocin-resistance marker (pEF1αbio-SCL and pEF1αbio-LMO2). Mutations in Scl and Lmo2 cDNAs were generated by mutagenesis as directed by the manufacturer (QuikChange Lightning Multi Site-Directed Mutagenesis Kit; Agilent Technologies).
Cell Lines and Transfections
WT and Scl− J1 ES cells were maintained on gelatinized plates as previously described (Porcher et al., 1996). Murine erythroleukemia (MEL) cells were maintained as described by Schuh et al. (2005). Transfection procedures are described in the Extended Experimental Procedures.
Streptavidin-Affinity Pull-Downs
Nuclear extracts were prepared as described by Schuh et al. (2005). Pull-downs were performed in 50 mM Tris-HCl (pH 7.5), 0.3% NP40, 150 mM NaCl, and protease inhibitors, with 500 μg to 1 mg of nuclear extracts and 50 μl of paramagnetic streptavidin beads (Dynabeads; Invitrogen), at room temperature for 30 min (bio-SCL) or at 4°C overnight (bio-LMO2). Proteins were eluted by boiling in Laemmli buffer. Crude nuclear extracts (input [IN] 20 μg), pull-down eluate (PD), and unbound fraction ([UN] 20 μg) were analyzed by western blot.
Cloning Expression and Purification of LMO2:LDB1LID, SCLbHLH, and E47bHLH
LMO2:LDB1LID construct was cloned, expressed and purified as described (El Omari et al., 2010). The sequences encoding SCLbHLH (residues 180-253) and E47bHLH (residues 535-613) were PCR-amplified using the following primers: 5′-CGGGATCCGCAGATTTCTGATGGTCCTCA-3′ and 5′-’GGAATTCTCAACGCTGGGTGCCTTCCTCC-3′ for SCLbHLH; and 5′-GAAGATCTCTCCCTGGAGGAGAAGGACCT-3′ and 5′-CCGCTCGAGTCAGGCTGCTTTGGGGTTCAGGT-3′ for E47bHLH. The PCR products were cloned into the pETDuet-1 coexpression vector (Novagen) using BamHI/EcoRI and BglII/XhoI restriction sites for SCLbHLH and E47bHLH, respectively. This cloning strategy provides SCLbHLH with an amino-terminal six-histidine tag whereas E47bHLH is untagged. The pETDuet-SCLbHLH:E47bHLH construct was transformed into Rosetta(DE3)pLysS (Stratagene) for protein expression. Cultures were grown at 37°C in Luria broth (LB) medium supplemented with 50 μg/ml ampicillin and 34 μg/ml chloramphenicol. When the optical density at 600 nm reached 0.8, the cells were induced with 1 mM isopropyl-1-thio-β-d-galactopyranoside (IPTG) and incubated overnight at 25°C. Cells were collected after centrifugation at 6000 rpm for 20 min and resuspended in 50 mM NaH2PO4/Na2HPO4, pH 7.4, 500 mM NaCl, 5 mM imidazole plus 0.5% Tween 20 prior disruption by sonication. The lysate was centrifuged for 30 min at 20000 rpm and the supernatant applied to cobalt beads (Talon beads, Clontech) pre-equilibrated with the lysis buffer without detergent. The complex was eluted with a buffer containing 50 mM NaH2PO4/Na2HPO4, pH 7.4, 500 mM NaCl and 500 mM imidazole. Fractions containing SCLbHLH and E47bHLH were applied to a S75 gel filtration column (GE Healthcare) pre-equilibrated with 20 mM Tris pH 7.4, 200 mM NaCl and 1 mM DTT.LMO2:LDB1LID, SCLbHLH and E47bHLH were mixed together in a 1:1 ratio and applied again to a S75 gel filtration column. Formation of the complex was confirmed by a shift of the elution peak to higher molecular weight and by Multi-Angle Laser Light Scattering (MALS, data not shown). The quaternary complex was concentrated to 14 mg/ml using Amicon columns.
Crystallization, Structure Determination, and Refinement
While no crystals could be obtained using the SCL-preferred CAGCTG motif (Kassouf et al., 2010), (SCL:E47)bHLH:LMO2:LDB1LID were solved by cocrystallization with a CATCTG motif, previously crystallized with E47:NeuroD1 (Longo et al., 2008). Oligonucleotides containing an E box motif 5′-gaacagatggt-3′ and 5′-accatctgttc-3′ were annealed and mixed with the protein complex according to a 1:1.2 protein:DNA ratio. The proteins:DNA complex was used at a concentration of 4 mg/ml to screen 768 conditions using a cartesian technologies pipetting robot setting up 100 + 100 nl drops in Greiner 96-well plates (Walter et al., 2005). Initial hits appeared within a week and were optimized with an additive screen (Hampton). The best (SCL:E47)bHLH:LMO2:LDB1LID crystals grew in 5% (v/v) 2-Methyl-2,4-Pentanediol (MPD), 40 mM magnesium chloride, 50 mM sodium cacodylate pH 6.0 and 2mM glutathione. A second crystal form, only containing the (SCL:E47)bHLH heterodimer, was obtained a month later, from the same condition. The two crystals therefore grew in the same buffer. Crystals were cryoprotected with 20% glycerol for flash cooling in nitrogen gas stream at 100 K.For the first crystal form, a single anomalous dispersion (SAD) X-ray diffraction data set was collected at Diamond IO4 (Didcot, UK) near the zinc absorption edge (λ = 1.2829 Å, with f’ = −6.51 and f’’ = 6.99). The data set was indexed, integrated and scaled with HKL2000 (Otwinowski and Minor, 1997) in the F222 space group with unit cell dimensions: a = 102.9, b = 141.0, c = 148.8 to a resolution of 2.8 Å (Table 1). The asymmetric unit was composed of a single complex according to Matthews coefficient calculations corresponding to a VM of 2.7 Å3 Da-1 and a solvent content of 59.4 %. Phasing was performed with PHENIX AUTOSOL (Adams et al., 2002) which located the four zinc atoms present in LMO2, with an overall figure of merit of 0.82 after phase improvement by solvent flattening. Resulting maps were of quality to fit the previously solved LMO2:LDB1LID structure (PDB code: 2XJY) (El Omari et al., 2011) and the bHLH heterodimer E47:NeuroD1 (PDB code: 2QL2) (Longo et al., 2008) as starting model for rebuilding.The second crystals form diffracted to 2.9 Å resolution and belong to the space group C2221 with unit cell dimensions a = 73.2, b = 152.8, c = 55.4 Å. The asymmetric unit only contains the bHLH heterodimer bound to its cognate DNA and was solved using the SCL:E47 heterodimer from the first crystal form by molecular replacement with MOLREP (Lebedev et al., 2008).Manual model correction of the structures was carried out using the program COOT (Emsley and Cowtan, 2004) and restrained refinement was performed with REFMAC5 (Murshudov et al., 1997) using TLS refinement (Translation/Libration/Screw) to obtain models that were refined to Rwork/Rfree of 22.5%/26.4% for the quaternary complex and 25.9%/29.4% for the heterodimer. The geometry of the structures was checked by using the molprobity server (Davis et al., 2007).
RNA Isolation, cDNA Synthesis, and qRT-PCR
Ten somite-stage embryos were homogenized and total RNA isolated with Trizol (Invitrogen) and purified through RNeasy Micro columns (QIAGEN). cDNA was synthesized from 250ng of total RNA with Superscript III (Invitrogen). The following primer pairs were used to assess gene expression: fli1a 5-GAGAAGTTCGAGAAGGAAGC-3′ and 5′-CGTAGTATTTGCTGGTCTCG −3′; βactin2 5′-GGACCTGTATGCCAACACTGTA-3′ and 5′-ATGTGATCTCCTTCTGCATCCT-3′; ef1α 5′-CCGAGGTCCTGCTCTCACAT-3′ and 5′-GGGACTGGTCAGGGTGAGAT-3′. Data were normalized to the geometric mean of α-actin2 and ef1α expression, and analyzed using the ΔΔCt method. Error bars represent the standard error from four biologically independent experiments. Significance was assessed by a two-tailed t test.
Cell Lines and Transfections
Wild-type and Scl J1 ES cells were maintained on gelatinized plates as previously described (Porcher et al., 1996). Electroporation of Scl ES cells with Scl cDNA constructs (wild-type and mutants) under control of EF1α promoter (pEF1α) and selection of stably transfected ES cells were as described (Schuh et al., 2005). Differentiation into embryoid bodies (EBs) and blast colony formation from day 3.5 EB cells in VEGF-containing medium have been described (Kassouf et al., 2010; Robertson et al., 2000). Murine erythroleukemia (MEL) cells were maintained as described (Schuh et al., 2005). MEL cells expressing BirA biotin ligase (Schuh et al., 2005) were transfected with pEF1α-bioSCL or pEF1α-bioLMO2 (biotin-tagged, wild-type and mutant constructs) using Transfast (Promega) and selected in Zeocin (200 μg/ml, Invitrogen) or G418 (750 μg/ml; GIBCO). Expression of the LMO2 mutant impaired in its interaction with SCL (R86A) was not detected by western Blot in MEL cells. The proteasome inhibitor MG-132 (Calbiochem) was added to the MEL:BirA:LMO2-R86A cells at 10μM final for 12 hr to stabilize LMO2-R86A (Lécuyer et al., 2007). This treatment did not affect the overall nuclear protein levels, as judged from the input lanes of the western blots shown in Figure 5A.
Western Blotting and Antibodies
Western Blot analyses were performed as described previously (Schuh et al., 2005) using precast NuPAGE gels (Invitrogen) according to manufacturers’ instructions. Primary antibodies used for detection are: α-LMO2 rabbit polyclonal antibody (generous gift from Stuart Orkin), α-ETO-2 (sc-9741), -E47 (sc-763), -GATA1 (sc-265), GFI1b (sc-8559), -LDB1 (sc-11198), -LSD1 (sc-271720), -mSIN3A (sc-994), -pCAF (sc-13124), -p300 (sc-585), -SCL/TAL1 (sc-12984), -Gal4DBD (sc-577) all from Santa Cruz and α-VP16 (ab4809-100, Abcam). Secondary antibodies were HRP-conjugated goat anti-rat (sc-2006), goat anti-mouse (sc-2031), donkey anti-rabbit (sc-2313) or donkey anti-goat (sc-2020) IgGs (Santa Cruz). Streptavidin-HRP was used to detect biotinylated SCL and LMO2. Detection was performed with the ECL kit (GE Healthcare).
Authors: Paul D Adams; Ralf W Grosse-Kunstleve; Li Wei Hung; Thomas R Ioerger; Airlie J McCoy; Nigel W Moriarty; Randy J Read; James C Sacchettini; Nicholas K Sauter; Thomas C Terwilliger Journal: Acta Crystallogr D Biol Crystallogr Date: 2002-10-21
Authors: Mark A Hall; David J Curtis; Donald Metcalf; Andrew G Elefanty; K Sourris; Lorraine Robb; Joachim R Gothert; Stephen M Jane; C Glenn Begley Journal: Proc Natl Acad Sci U S A Date: 2003-01-27 Impact factor: 11.205
Authors: Marie-Claude Sincennes; Magali Humbert; Benoît Grondin; Véronique Lisi; Diogo F T Veiga; André Haman; Christophe Cazaux; Nazar Mashtalir; El Bachir Affar; Alain Verreault; Trang Hoang Journal: Proc Natl Acad Sci U S A Date: 2016-01-13 Impact factor: 11.205
Authors: Tatsuya Morishima; Ann-Christin Krahl; Masoud Nasri; Yun Xu; Narges Aghaallaei; Betül Findik; Maksim Klimiankou; Malte Ritter; Marcus D Hartmann; Christian Johannes Gloeckner; Sylwia Stefanczyk; Christian Lindner; Benedikt Oswald; Regine Bernhard; Karin Hähnel; Ursula Hermanutz-Klein; Martin Ebinger; Rupert Handgretinger; Nicolas Casadei; Karl Welte; Maya Andre; Patrick Müller; Baubak Bajoghli; Julia Skokowa Journal: Blood Date: 2019-07-31 Impact factor: 22.113
Authors: Rebecca A Luchtel; Michael T Zimmermann; Guangzhen Hu; Surendra Dasari; Manli Jiang; Naoki Oishi; Hailey K Jacobs; Yu Zeng; Tanya Hundal; Karen L Rech; Rhett P Ketterling; Jeong-Heon Lee; Bruce W Eckloff; Huihuang Yan; Krutika S Gaonkar; Shulan Tian; Zhenqing Ye; Marshall E Kadin; Jagmohan Sidhu; Liuyan Jiang; Jesse Voss; Brian K Link; Sergei I Syrbu; Fabio Facchetti; N Nora Bennani; Susan L Slager; Tamas Ordog; Jean-Pierre Kocher; James R Cerhan; Stephen M Ansell; Andrew L Feldman Journal: Blood Date: 2019-05-17 Impact factor: 22.113
Authors: Phuong Cao Thi Ngoc; Shi Hao Tan; Tze King Tan; Min Min Chan; Zhenhua Li; Allen E J Yeoh; Daniel G Tenen; Takaomi Sanda Journal: Leukemia Date: 2018-03-26 Impact factor: 11.528
Authors: Charnise Goodings; Rati Tripathi; Susan M Cleveland; Natalina Elliott; Yan Guo; Yu Shyr; Utpal P Davé Journal: Leuk Res Date: 2014-11-29 Impact factor: 3.156
Authors: Benjamin N Ediger; Hee-Woong Lim; Christine Juliana; David N Groff; LaQueena T Williams; Giselle Dominguez; Jin-Hua Liu; Brandon L Taylor; Erik R Walp; Vasumathi Kameswaran; Juxiang Yang; Chengyang Liu; Chad S Hunter; Klaus H Kaestner; Ali Naji; Changhong Li; Maike Sander; Roland Stein; Lori Sussel; Kyoung-Jae Won; Catherine Lee May; Doris A Stoffers Journal: J Clin Invest Date: 2016-12-12 Impact factor: 14.808