Literature DB >> 29896522

Data set on G4 DNA interactions with human proteins.

M Vlasenok1, O Levchenko1, D Basmanov1, D Klinov1, A Varizhuk1, G Pozmogova1.   

Abstract

Guanine-rich DNA/RNA fragments can fold into G-quadruplexes (G4s) - non-canonical four-strand secondary structures. The article contains data on quadruplex interaction with human proteins. Binding of three topologically different G4 structures to more than 9000 human proteins was analyzed. Physicochemical methods were used to verify the results.The dataset was generated to identify the protein targets for DNA quadruplex structures for the purpose of better understanding the role of the structures in gene expression regulation. Presented data include functional interpretation of obtained gene lists, visualized with Cytoscape.

Entities:  

Year:  2018        PMID: 29896522      PMCID: PMC5996148          DOI: 10.1016/j.dib.2018.02.081

Source DB:  PubMed          Journal:  Data Brief        ISSN: 2352-3409


Specifications table Value of the data These data describe the use of human protein microarrays to identify binding partners of G4 DNA structures. Functional enrichment analysis of G4-binding proteins can be used to elucidate biological roles of genomic G4s and possible side effects of G4-based therapeutics. Data allow one to compare topologically different G4s in terms of their interactions with human proteins. Data can be added to previously published data on G4-binding proteins for a more integrated view of the G4 interactome.

Data

The dataset of this article provides information on interactions between G4-forming oligodeoxyribonucleotides (G4-1, G4-2 and G4-3) and human proteins (over 9000) immobilized on microarrays. The set of oligonucleotides (ONs) was designed to allow comparison of topologically different intramolecular G4 structures. According to published circular dichroism (CD) data, G4-1 ON (d(GGGAGGCTGAGGCAGG), designated previously as PQS2 [1]) adopts antiparallel topology in the presence of potassium ions [2], G4-2 ON (d(GGTGACAGGGGTATGGGGAGGGG), designated previously as Ct1 [3]) forms a parallel quadruplex structure, and G4-3 ON (d(GGGGACAGGGGTATGGGGAGGGG), designated previously as CtG [3]) adopts mixed topology (the “antipatrallel” CD signature dominates [4]). Terminal modification (biotinylation) required for the profiling experiments could alter ON secondary structures, therefore, CD spectra of the modified ONs were recorded to verify the maintenance of G4 folding. Substantial difference from the published spectra was observed only for G4-3, so we additionally performed a rotational relaxation time assay (RRT) to verify its intramolecular folding. Characterization of the biotinylated G4s by optical methods is summarized in Fig. 1. Fig. 2 is an overview of the protein array profiling procedure. Supplementary Figures S1–8 are high resolution scans of the control protein arrays and arrays after treatment with solutions of biotinylated G4s and visualization of the interactions using streptavidin-conjugated Alexa Fluor®647 (white spots in the array scans refer to G4-protein complexes). Fig. 3 shows intersection of the sets of significant protein hits identified in the profiling experiments with each G4. Table 1, Table 2, Table 3, Table 4, Table 5 contain lists of protein hits from the central diagram in Fig. 3 and corresponding significance parameter values (Z-score). Functional interpretations of the respective gene lists visualized with Cytoscape are provided as Supplementary networks 1–7. Supplementary files SF1 and SF2 contain data on all investigated proteins, their parameters and interactions with each G4. Fig. 4 shows verification of the profiling data for two selected proteins by electrophoretic method and a method based on photonic crystal surface waves (PCSW).
Fig. 1

Characterization of biotinylated G4s. (A) СD spectra (per mole of nucleotide) and schematic representations of previously characterized initial (non-biotinylated) G4 structures. (B) CD spectra of the 3’-biotinylated G4s. (C) Rotational relaxation time (RRT) of EtBr in complex with G4-3. Conditions: 20 mM Tris–HCl buffer (pH 7.5), 100 mM KCl. Points on the graph correspond to the average values of three measurements.

Fig. 2

Workflow of profiling G4-protein interactions using human protein microarrays.

Fig. 3

Three Venn diagrams for protein interactors of G4 ONs profiled at different concentrations.

Table 1

Top non-specific protein hits. Overlap of the three G4 interactor sets significant at both 2.5 μM and 25 μM G4 concentrations, ranked by average Z-score (G4-1 ∩ G4-2 ∩ G4-3). TV = transcript variant.

ProteinDatabase IDZ-score, 2.5 μM / 25 μM
G4-1G4-2G4-3
hypothetical protein HSPC111 (HSPC111)BC040106.17.99 / 6.306.78 / 4.859.11 / 6.46
additional sex combs like 1 (Drosophila) (ASXL1)BC064984.16.13 / 7.316.79 / 4.869.22 / 6.47
bromodomain containing 3 (BRD3)BC032124.18.82 / 9.286.79 / 4.854.51 / 6.47
chromosome 1 open reading frame 63 (C1orf63)NM_020317.25.41 / 10.516.17 / 4.865.93 / 6.47
cyclin B3 (CCNB3), transcript variant 2NM_033671.17.96 / 5.456.74 / 4.816.97 / 6.42
peptidyl arginine deiminase, type IV (PADI4)NM_012387.15.72 / 6.586.78 / 4.857.58 / 5.27
Coiled-coil domain-containing protein 28 ABC000758.14.95 / 4.886.78 / 4.858.15 / 6.45
ankyrin repeat and zinc finger domain containing 1 (ANKZF1)BC000238.16.13 / 4.466.68 / 4.866.34 / 6.15
GABA(A) receptor-associated protein (GABARAP)NM_007278.14.42 / 5.136.62 / 4.867.02 / 6.47
Cyclin-dependent kinase-like 3NM_016508.26.26 / 4.836.29 / 4.865.63 / 6.20
actin related protein 2/3 complex, subunit 1B, 41 kDa (ARPC1B)NM_005720.15.47 / 5.106.19 / 4.866.54 / 5.66
peripheral myelin protein 2 (PMP2)NM_002677.14.12 / 3.496.73 / 4.867.28 / 6.31
Peptidyl-tRNA hydrolase 2, mitochondrialNM_001015509.16.85 / 6.943.46 / 4.853.95 / 4.81
survival motor neuron domain containing 1 (SMNDC1)BC011234.14.49 / 3.606.40 / 4.864.79 / 6.46
small trans-membrane and glycosylated protein (LOC57228), transcript variant 2NM_020467.23.31 / 4.285.81 / 4.864.59 / 5.78
La ribonucleoprotein domain family, member 1 (LARP1)BC033856.14.49 / 4.625.37 / 4.854.71 / 4.09
cDNA FLJ42001 fis, clone SPLEN2029912 (LOC153684 protein)NM_194290.13.85 / 3.396.45 / 4.864.46 / 4.40
splicing factor, arginine/serine-rich 6 (SFRS6)NM_006275.24.01 / 4.493.80 / 3.505.18 / 3.18
Table 2

Top semi-specific protein hits. Pairwise overlaps of the G4 interactor sets significant at both G4 concentrations, ranked by average Z-score. TV = transcript variant.

ProteinDatabase IDZ-score, 2.5 μM / 25 μM
G4-1 ∩G4-2
G4-1G4-2
MAP kinase-activated protein kinase 3BC001662.16.79 / 4.866.60 / 5.98
RNA polymerase II-associated protein 3BC056415.16.32 / 4.624.70 / 6.19
Jun dimerization protein p21SNFT (SNFT)NM_018664.16.00 / 3.684.53 / 3.81
Band 4.1-like protein 4 ANM_022140.24.23 / 4.853.59 / 3.55
rRNA-processing protein FCF1 homologBC022361.14.28 / 4.863.18 / 3.88



G4-1 ∩G4-3
G4-1G4-3
Finkel-Biskis-Reilly murine sarcoma virus (FBR-MuSV) ubiquitously expressed (FAU)NM_001997.26.01 / 5.798.88 / 6.47
nucleolar protein 7, 27 kDa (NOL7)NM_016167.34.76 / 5.888.76 / 6.47
PHD finger protein 20-like 1 (PHF20L1), TV 3NM_198513.15.84 / 5.577.63 / 6.47
Ras-like without CAAX 1 (RIT1)NM_006912.36.00 / 3.836.28 / 5.58
casein kinase 2, alpha 1 polypeptide (CSNK2A1), TV 1NM_177559.25.06 / 4.015.58 /4.25



G4-2 ∩G4-3 (top 20)
G4-2G4-3
fibroblast growth factor 12 (FGF12)BC022524.16.79 / 4.858.31 / 6.47
DIM1 dimethyladenosine transferase 1-like (S. cerevisiae) (DIMT1L)NM_014473.26.79 / 4.868.31 / 4.84
Probable ATP-dependent RNA helicase DDX6NM_004397.36.79 / 4.867.51 / 5.57
fibroblast growth factor 12 (FGF12), TV 1NM_021032.25.82 / 4.858.02 / 5.89
Non-histone chromosomal protein HMG-14BC070154.16.03 / 4.777.64 / 6.04
FACT complex subunit SPT16BC021561.16.76 / 4.866.59 / 6.02
heterochromatin protein 1, binding protein 3 (HP1BP3)NM_016287.26.79 / 4.866.82 / 5.03
KRR1, small subunit (SSU) processome component, homolog (yeast) (KRR1)BC016778.16.50 / 4.856.33 / 5.50
Nuclear protein Hcc-1BC093051.14.43 / 4.857.27 / 6.41
RAD51 associated protein 1 (RAD51AP1)BC016330.16.79 / 4.864.69 / 6.18
Serine/threonine-protein kinase 12BC000442.16.15 / 4.865.47 / 5.82
serpin peptidase inhibitor, clade F (alpha-2 antiplasmin, pigment epithelium derived factor), member 1 (SERPINF1)BC000522.15.71 / 4.865.21 / 6.47
high-mobility group box 2 (HMGB2)NM_002129.26.19 / 4.867.26 / 3.68
high mobility group nucleosomal binding domain 3 (HMGN3), TV 2NM_138730.14.29 / 4.866.66 / 6.10
small proline-rich protein 4 (SPRR4)NM_173080.16.44 / 4.865.25 / 5.18
Regulator of G-protein signaling 3BC019039.25.68 / 4.855.25 / 5.59
RAB35, member RAS oncogene family (RAB35)NM_006861.25.92 / 4.545.81 / 5.09
G protein-coupled receptor kinase 6NM_002082.15.15 / 4.866.21 / 5.13
Ras-like without CAAX 2 (RIT2)BC018060.14.81 / 4.855.52 / 6.07
signal transducing adaptor family member 1 (STAP1)NM_012108.15.96 / 3.966.03 / 5.01
Table 3

G4-1-specific/semi-specific protein hits. Significant interactors of G4-1 at both concentrations classified as insignificant for G4-2 and G4-3 at one or both concentrations (the non-overlapping subset of the G4-1 interactor set, ranked by average Z-score). TV = transcript variant.

ProteinDatabase IDZ-score
2.5 μM G4-12.5 μM G4-1
Regulator of G-protein signaling 3 (RGS3), TV 4NM_134427.17.508.19
OTU domain containing 6B (OTUD6B)BC029760.18.137.08
Mitogen-activated protein kinase-activated protein kinase 5 (MAPKAPK5), TV 1NM_003668.26.587.71
Casein kinase 2, alpha 1 polypeptide (CSNK2A1), TV 2NM_001895.16.106.85
Chromatin modifying protein 6 (CHMP6)NM_024591.17.014.39
Chromosome 11 open reading frame 63 (C11orf63), TV 2NM_199124.14.936.17
Mitochondrial ribosomal protein L19 (MRPL19), nuclear gene encoding mitochondrial proteinNM_014763.24.584.49
Chondrosarcoma associated gene 1 (CSAG1)BC059947.14.933.83
La ribonucleoprotein domain family, member 6 (LARP6), TV 1NM_018357.23.994.70
Dihydrouridine synthase 1-like (S. cerevisiae) (DUS1L)NM_022156.33.493.75
Coiled-coil domain containing 23 (CCDC23)BC029427.13.603.52
PRKR interacting protein 1 (IL11 inducible) (PRKRIP1)BC014298.13.603.47
UBX domain containing 3 (UBXD3)NM_152376.23.523.24
LSM4 homolog, U6 small nuclear RNA associated (S. cerevisiae) (LSM4)NM_012321.13.243.26
Table 4

G4-2-specific/semi-specific protein hits (top 20). Significant interactors of G4-2 at both concentrations classified as insignificant for G4-1 and G4-3 at one or both concentrations (the non-overlapping subset of the G4-2 interactor set, ranked by average Z-score). TV = transcript variant.

ProteinDatabase IDZ-score
2.5 μM G4-22.5 μM G4-2
Nuclear protein Hcc-1NM_033082.16.784.85
PNMA-like 1, mRNA (cDNA clone MGC:45422 IMAGE:5246377), complete cdsBC032508.16.094.86
chromosome 11 open reading frame 52 (C11orf52)NM_080659.16.004.71
Ubiquitin specific peptidase 39 (USP39)NM_006590.25.864.83
Hypothetical protein MGC31957 (MGC31957)BC005043.15.594.86
Small inducible cytokine subfamily E, member 1 (endothelial monocyte-activating) (SCYE1)BC014051.15.444.85
Mitochondrial ribosomal protein S6 (MRPS6), nuclear gene encoding mitochondrial proteinNM_032476.15.374.60
Synaptotagmin I (SYT1)NM_005639.14.954.86
Cysteinyl-tRNA synthetase (CARS)BC002880.14.704.86
Regulator of G-protein signaling 8BC069677.14.594.85
Coiled-coil domain containing 43 (CCDC43)BC047776.24.184.85
Rho GTPase-activating protein 12BC094719.14.464.54
Ring finger protein 4 (RNF4)NM_002938.24.104.86
Within bgcn homolog (Drosophila) (WIBG)NM_032345.14.084.85
Proline/serine-rich coiled-coil 1 (PSRC1), TV 1NM_032636.24.054.86
Nucleophosmin (nucleolar phosphoprotein B23, numatrin) (NPM1)BC021983.13.734.85
Protein FAM76BNM_144664.34.144.39
Spermatogenesis associated, serine-rich 2 (SPATS2)BC048299.13.784.63
Synaptonemal complex protein 3 (SYCP3)NM_153694.33.544.86
EzrinBC068458.13.514.86
Table 5

G4-3-specific/semi-specific protein hits (top 20). Significant interactors of G4-2 at both concentrations classified as insignificant for G4-1 and G4-3 at one or both concentrations (the non-overlapping subset of the G4-2 interactor set, ranked by average Z-score). TV = transcript variant.

ProteinDatabase IDZ-score
2.5 μM G4-32.5 μM G4-3
Methionyl aminopeptidase 2 (METAP2)NM_006838.18.146.08
Potassium channel tetramerisation domain containing 18 (KCTD18)BC067755.13.356.17
GTPase activating protein (SH3 domain) binding protein 1 (G3BP1), transcript variant 2NM_198395.14.674.78
CAP-GLY domain containing linker protein family, member 4 (CLIP4)NM_024692.34.344.82
Polymerase (DNA directed), beta (POLB)NM_002690.15.024.00
Chromosome 6 open reading frame 130 (C6orf130)NM_145063.15.723.18
Transcription elongation factor A (SII)-like 2 (TCEAL2)NM_080390.34.594.10
Laminin, gamma 1 (formerly LAMB2) (LAMC1)BC015586.23.934.23
Three prime histone mRNA exonuclease 1 (THEX1)NM_153332.24.034.09
Ribosomal protein L35 (RPL35)BC010919.13.254.72
RNA (guanine-9-)-methyltransferase domain-containing protein 3BC057774.14.573.36
Angiogenic factor with G patch and FHA domains 1BC029382.14.003.80
Chromosome 8 open reading frame 59 (C8orf59)BC032347.13.454.32
Membrane protein, palmitoylated 7 (MAGUK p55 subfamily member 7) (MPP7)BC038105.23.334.39
Cyclin-dependent kinase-like 1NM_004196.24.293.32
Double-stranded RNA-binding protein Staufen homolog 1NM_004602.23.653.91
Transcription factor AP-2 beta (activating enhancer binding protein 2 beta) (TFAP2B)NM_003221.13.733.40
Methyltransferase like 1 (METTL1), transcript variant 1NM_005371.23.983.14
Cell division cycle 7-related protein kinaseNM_003503.23.463.56
Histone cluster 2, H2ac (HIST2H2AC)NM_003517.23.913.01
Fig. 4

Verification of the profiling data for selected proteins. (A) Data on BRD3 interactions with G4 and non-G4 (hairpin and (TG)7) ONs: electrophoretic mobility shift assay. (B) Data on SNFT interactions with G4 and non-G4 (N15) ONs: photonic crystal surface wave assay. The sensorgrams illustrate changes in the effective adlayer thickness (delta H) upon ON sorption on/desorption from the SNFT-coated photonic crystal (PC) slide surface.

Characterization of biotinylated G4s. (A) СD spectra (per mole of nucleotide) and schematic representations of previously characterized initial (non-biotinylated) G4 structures. (B) CD spectra of the 3’-biotinylated G4s. (C) Rotational relaxation time (RRT) of EtBr in complex with G4-3. Conditions: 20 mM Tris–HCl buffer (pH 7.5), 100 mM KCl. Points on the graph correspond to the average values of three measurements. Workflow of profiling G4-protein interactions using human protein microarrays. Three Venn diagrams for protein interactors of G4 ONs profiled at different concentrations. Verification of the profiling data for selected proteins. (A) Data on BRD3 interactions with G4 and non-G4 (hairpin and (TG)7) ONs: electrophoretic mobility shift assay. (B) Data on SNFT interactions with G4 and non-G4 (N15) ONs: photonic crystal surface wave assay. The sensorgrams illustrate changes in the effective adlayer thickness (delta H) upon ON sorption on/desorption from the SNFT-coated photonic crystal (PC) slide surface. Top non-specific protein hits. Overlap of the three G4 interactor sets significant at both 2.5 μM and 25 μM G4 concentrations, ranked by average Z-score (G4-1G4-2 ∩ G4-3). TV = transcript variant. Top semi-specific protein hits. Pairwise overlaps of the G4 interactor sets significant at both G4 concentrations, ranked by average Z-score. TV = transcript variant. G4-1-specific/semi-specific protein hits. Significant interactors of G4-1 at both concentrations classified as insignificant for G4-2 and G4-3 at one or both concentrations (the non-overlapping subset of the G4-1 interactor set, ranked by average Z-score). TV = transcript variant. G4-2-specific/semi-specific protein hits (top 20). Significant interactors of G4-2 at both concentrations classified as insignificant for G4-1 and G4-3 at one or both concentrations (the non-overlapping subset of the G4-2 interactor set, ranked by average Z-score). TV = transcript variant. G4-3-specific/semi-specific protein hits (top 20). Significant interactors of G4-2 at both concentrations classified as insignificant for G4-1 and G4-3 at one or both concentrations (the non-overlapping subset of the G4-2 interactor set, ranked by average Z-score). TV = transcript variant.

Experimental design, materials and methods

G4 design, synthesis and characterization by optical methods

Sequences of ONs G4-1 and G4-2 were chosen as well-characterized quadruplex motifs from the human genome. G4-1 is an Alu repeat fragment, the only potential quadruplex site in a conservative region of Alu elements (PQS2 in [1]). G4-2 is a fragment of CTIF gene intron (NCBI Reference Sequence: NC_000018.9, chr18: +46379322 to +46379344), it reportedly forms an “imperfect” quadruplex structure with a bulge between G-tetrads or a mismatch/vacancy in a tetrad (Ct1 in [3]). G4-3 is a G4-2 mutant, it forms a canonical (“perfect”) G4 structure (CtG in [3]). Standard solid-phase ON synthesis was performed using standard reagents and 3’-BiotinTEG CPG (Glen Research). HPLC purification and MALDITOF MS analysis were performed as previously described [4]. The purity of all oligonucleotides was determined to be ≥95% by HPLC. Circular dichroism spectra of the 3’-biotin-ONs in 20 mM Tris–HCl buffer (pH 7.5) containing 100 mM KCl were recorded using a V-550 spectrophotometer (JASCO). ON concentration was 3 μM. The samples were heated to 90 °C for 5 minutes and cooled rapidly prior to the experiments. Molar CD per nucleotide residue was calculated as follows: Δε = θ/(32982∙C∙n∙l), where θ is ellipticity (degree), C is ON concentration (M), l is optical path length (cm) and n is the number of nucleotide residues in the ON. Rotational relaxation times (RRT) of ethidium bromide (EtBr) in complexes with the ONs were estimated to clarify whether ON G4-3 folds into an inter- or intramolecular structure (in the first approximation RRT is proportional to the hydrodynamic volume of the complex). RRT was calculated based on EtBr fluorescence lifetime and polarization values as described in [4]. The fluorescence lifetime was evaluated using Easy Life V (fluorescence decay was registered through a RG610 long pass filter at excitation LED 525 nm). Fluorescence polarization was calculated based on vertical and horizontal components of fluorescence intensity at emission maximum (610 nm) that were measured with Cary Eclipse spectrofluorometer at 4 °C upon excitation at 540 nm by the vertically polarized light. Concentration of EtBr was 1 μM, and ON concentration was 5 μM. To correlate G4-3 RRT with inter/intramolecular folding, ‘monomolecular’ and ‘bimolecular’ calibration plots were obtained using previously described (control) ON structures. Monomolecular controls were d(GGGTGGGTGGGTGGG) (G3 in [3] and [4]) and d(GGGGGCCGTGGGGTGGGAGCTGGGG) (Bcl in [3] and [4]). Bimolecular controls were d(TCACCTCCCTCC/ GGAGGGAGGTGA) (duplex) and d(GGGGGCCGTGGGGTGAGAGCTGGGG) (BclA in [3] and [4]).

Small molecule - protein interaction assays

To identify G4-binding proteins, we used the commercially available profiling service “Biotinylated or fluorophore-labeled small molecule detection-identification of fluorescent or biotinylated drug target substrates”. The profiling experiments were performed on protein microarrays containing duplicate probes of more than 9000 human proteins and additional control proteins spotted on a modified glass slide by Invitrogen (ProtoArray® Human Protein Microarray Version 5.0; Invitrogen http://www.invitrogen.com). Three biotinylated ONs (G4-1, G4-2 and G4-3) were profiled at two concentrations (2.5 μM and 25 μM). The ON solutions in a working buffer (20 mM Tris (pH 7.5), 5 mM MgCl2, 100 mM KCl) were annealed rapidly prior to the experiments as described in the previous subsection to ensure intramolecular folding, and 0.1% Tween-20 was added to each sample after annealing to prevent aggregation. A mixture of non-biotinylated A/T-rich ONs was used to block non-specific interactions in the profiling experiments. The mixture consisted of random-sequence ONs (N6, N10 and N15) and AT-rich ONs (listed below) in the ratio 2:3. A/T-rich ONs in the mixture: d(gCTTCTTgCCgAgATTTCgATTACTACCATTTTTTTTTTCATgC); d(gggATACTgATgACTgCCAgC); d(AgAAATCTCAgTTCCTACAgA); d(ggCgCTggCTTgACAgTTTC); d(CTgCCTTCAggTTgTTgCTTg); d(TATCCAgCTTTCTgTAACTTC); d(TCCCTAggTATCTggAATAg); d(gCAgAgACATCTTAACTTCAC); d(TgAgTTTCACAAAgCTATCTg); d(TAATTgAgATggAggTATgTC); d(TTACATAgCTgggAggATATC); d(TTTATCACCAAgTgCATgAAg); d(AATTTACTATATAAATTATACTAATCCA). Two types of control assays (negative and positive) were performed on separate ProtoArray® Human Protein Microarrays in parallel with assays probed with G4s (detailed information about control assays is available from the Invitrogen web-page page [5]). The data on G4-protein interactions were processed by ProtoArray® Prospector (the manual is available to the general public). Significance call queries were performed by Prospector to identify hits on each protein array. Significance score (Z-score) was calculated as difference between signal minus background values for the protein of interest and mean signal minus background value for all non-control proteins on the array, divided by the standard deviation of the human protein features. The protein has positive significance call if corresponding Z-score is greater than 3.0. Interactors (hits) are defined as proteins having positive significance calls that are not observed for the appropriate negative control.

Verification of the microarray profiling data

To verify the results of the microarray experiments, two proteins (BRD3 and SNFT) were selected from the top ‘non-specific’ and ‘semi-specific’ hits (Table 1, Table 2, respectively; Italics), and their interactions with G4s were assessed by independent methods. Recombinant BRD3 and SNFT were purchased from and Abnova(Taiwan), #H00008019-P01 and MyBioSource(USA), #MBS953061, respectively. G4 interactions with BRD3 were assessed by electrophoretic mobility shift assay. Non-G4 ssDNA (TG)7 and hairpin DNA d(TCACATTCCTGAGAATGTGA) were used as controls to address the specificity factor. Preannealed ON solutions (200 nM) in 20 mM Tris–HCl buffer (pH 8) containing 100 mM KCl were incubated with recombinant protein BRD3 (concentrations: 0–200 nM) for 5 min at room temperature. The band shifts were resolved on a 15% non-denaturing polyacrylamide gel (29:1) in a standard TBE buffer (pH 8). A mixture of 10–100 nt single-stranded oligonucleotide fragments was used as molecular weight marker (Low MW Marker, Affymetrix). The gel was stained with SYBR Gold and analyzed using a GelDoc scanner (BioRad). G4 interactions with SNFT were assessed using biosensors based on photonic crystal surface waves (PCSW) [6]. A mixture of 15-mer ONs with randomized sequences (N15) was used as a control to address the specificity factor. The protein was immobilized on the functionalized photonic crystal (PC) slide surface. PC slide preparation and functionalization were performed as described in [7]. ON solutions (1.5 µM) in standard PBS buffer (10 mM sodium phosphate, pH 7.4, 140 mM NaCl, and 3 mM KCl) were pumped through the working chamber (over the slide surface), and binding was registered as an increment of the effective adlayer thickness (ΔH). After signal saturation, the chamber was rinsed with the working buffer to remove non-specifically bound ONs.

Gene ontology enrichment analysis

We determined which Gene Ontology (GO) categories are statistically overrepresented in the set of proteins, which interact with G4 according to microarray experiment results. Enrichment analysis of the molecular functions of quadruplex binding proteins was performed with BiNGO plugin [8] with custom ontology and annotation files derived from the Gene Ontology database, all proteins present on the Human Protein Microarray were used as a custom reference set. For assessing overrepresentation accurately Hypergeometric statistical test (test without replacement) was used with Benjamini&Hochberg False Discovery Rate (FDR) correction. The results were visualized with Cytoscape 3.6.0.
Subject areaMolecular Biology; Physical chemistry
More specific subject areaDNA secondary structures
Type of dataTables, high resolution images, figures, network diagrams
How data was acquiredSmall Molecule-Protein Interaction Profiling on ProtoArray® Human Protein Microarrays (Invitrogen); CD spectroscopy (JASCO V-550 spectrophotometer); fluorescence polarization measurements (Cary Eclipse fluorescence spectrophotometer, Agilent Technologies); fluorescence decay measurements (Easy Life V fluorescence lifetime fluorometer, Optical Building Blocks Corporation), electrophoretic mobility shift assay (results were visualized using Gel Doc XR+ system BIO RAD)
Data formatRaw, analyzed
Experimental factorsON solutions in the specified buffers were denatured at 95 °C for 5 min and snap cooled on ice prior to the experiments
Experimental featuresSmall Molecule-Protein Interaction Profiling was performed using ProtoArray® Human
Protein Microarrays, CD spectroscopy was recorded on a spectrophotometer equipped with temperature-controlled cuvette holder, fluorescence rotational relaxation times were calculated using fluorescence polarization and fluorescence lifetime values, the gel in electrophoretic mobility shift assay was stained with SYBR Gold.
Data source locationResearch and Clinical Center for Physical Chemical Medicine and Engelhardt Institute of Molecular Biology, Moscow, Russian Federation
Data accessibilityThe data is available within this article
  5 in total

1.  BiNGO: a Cytoscape plugin to assess overrepresentation of gene ontology categories in biological networks.

Authors:  Steven Maere; Karel Heymans; Martin Kuiper
Journal:  Bioinformatics       Date:  2005-06-21       Impact factor: 6.937

2.  The expanding repertoire of G4 DNA structures.

Authors:  Anna Varizhuk; Dmitry Ischenko; Vladimir Tsvetkov; Roman Novikov; Nikolay Kulemin; Dmitry Kaluzhny; Maria Vlasenok; Vladimir Naumov; Igor Smirnov; Galina Pozmogova
Journal:  Biochimie       Date:  2017-01-19       Impact factor: 4.079

3.  Data on secondary structures and ligand interactions of G-rich oligonucleotides that defy the classical formula for G4 motifs.

Authors:  Maria Vlasenok; Anna Varizhuk; Dmitry Kaluzhny; Igor Smirnov; Galina Pozmogova
Journal:  Data Brief       Date:  2017-02-12

4.  Photonic crystal biosensor based on optical surface waves.

Authors:  Valery N Konopsky; Tanya Karakouz; Elena V Alieva; Chiara Vicario; Sergey K Sekatskii; Giovanni Dietler
Journal:  Sensors (Basel)       Date:  2013-02-19       Impact factor: 3.576

5.  Comparison of the 'chemical' and 'structural' approaches to the optimization of the thrombin-binding aptamer.

Authors:  Olga Tatarinova; Vladimir Tsvetkov; Dmitry Basmanov; Nikolay Barinov; Igor Smirnov; Edward Timofeev; Dmitry Kaluzhny; Andrey Chuvilin; Dmitry Klinov; Anna Varizhuk; Galina Pozmogova
Journal:  PLoS One       Date:  2014-02-20       Impact factor: 3.240

  5 in total
  4 in total

Review 1.  Promise of G-Quadruplex Structure Binding Ligands as Epigenetic Modifiers with Anti-Cancer Effects.

Authors:  Antara Sengupta; Akansha Ganguly; Shantanu Chowdhury
Journal:  Molecules       Date:  2019-02-06       Impact factor: 4.411

Review 2.  Telomeres and Their Neighbors.

Authors:  Leon P Jenner; Vratislav Peska; Jana Fulnečková; Eva Sýkorová
Journal:  Genes (Basel)       Date:  2022-09-16       Impact factor: 4.141

3.  Constrained G4 structures unveil topology specificity of known and new G4 binding proteins.

Authors:  A Pipier; A Devaux; T Lavergne; A Adrait; Y Couté; S Britton; P Calsou; J F Riou; E Defrancq; D Gomez
Journal:  Sci Rep       Date:  2021-06-29       Impact factor: 4.379

Review 4.  Epigenetic Modulation of Chromatin States and Gene Expression by G-Quadruplex Structures.

Authors:  Chiara Reina; Vincenzo Cavalieri
Journal:  Int J Mol Sci       Date:  2020-06-11       Impact factor: 5.923

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.