| Literature DB >> 27978814 |
Svetlana Bulashevska1, Colin Priest2, Daniel Speicher3,4, Jörg Zimmermann3,4, Frank Westermann5, Armin B Cremers3,4.
Abstract
BACKGROUND: Biological systems and processes are highly dynamic. To gain insights into their functioning time-resolved measurements are necessary. Time-resolved gene expression data captures temporal behaviour of the genes genome-wide under various biological conditions: in response to stimuli, during cell cycle, differentiation or developmental programs. Dissecting dynamic gene expression patterns from this data may shed light on the functioning of the gene regulatory system. The present approach facilitates this discovery. The fundamental idea behind it is the following: there are change-points (switches) in the gene behaviour separating intervals of increasing and decreasing activity, whereas the intervals may have different durations. Elucidating the switch-points is important for the identification of biologically meanigfull features and patterns of the gene dynamics.Entities:
Keywords: ATRA-induced differentiation; Bayesian modeling; Change-point modeling; Change-point problem; Dynamic patterns of gene expression; Gibbs sampling; MCMC; Neuroblastoma; Time-series analysis
Mesh:
Substances:
Year: 2016 PMID: 27978814 PMCID: PMC5160026 DOI: 10.1186/s12859-016-1391-0
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1Illustration of the modeling. a Example of a model with four regimes and five switches. b When the switch locations are known, the data within each regime is fitted with a line - linear interpolation. c Design matrix for the model in A, calculated using linear factors, specifies the linear regression needed to determine the switch heights. d Roulette method for sampling of a switch location between the neighbouring switches, based on the calculated probabilities of four possible locations
Fig. 2Workflow of the algorithm. The algorithm presents the iterative sampling of the model parameters in the Gibbs procedure. Parameter values (switch locations, heights, standard deviation) and fitted data are stored for each current model. The generated switch locations are accepted only if they produce a valid model. The parameters of the fitted model (switch heights and covariance matrix) are used to generate new sample values for the switch heights. Then, upon model fit, the fitted standard deviation is used to produce a sample value for the standard deviation. The next iteration with the updated model proceeds, new switch locations will be generated
Fig. 3Features of the switch-points defined in SwitchFinder: Growth, Decay, Spike and Cleft. Features are assigned to the switch-points to capture meaningfull properties of the time-series. The feature Growth is assigned to the switches of types trough, if the corresponding Slope is greater than the Threshold with the default value threshold_growth. The user is able to adjust the Threshold for selecting more striking Growth-effects. The feature Decay is defined similarly. The higher-order feature Spike is assigned to the switch, designated with Decay, if its left-hand side neighbour has the feature Growth and the absolute difference Dif in the gene expression levels of the neighbouring switches is smaller than the Threshold with the default threshold_dif. The aim is to select a really spiking behaviour: after a rapid growth, rapid decay to almost the same level occurs. The feature Cleft designates the opposite behaviour: after a decay, growth to almost the same level occurs
Fig. 4SwitchFinder query interface. Snapshot of the user interface demonstrating the result of the query - gene profiles with the feature Growth at the switch-point at time t=5. The Slope of the Growth was greater than the Treshold. It is possible to download the result and to store the query
Results of the application of SwitchFinder to 10 simulated data sets
| Mean of | SD of | Precision | Recall | Functions |
|---|---|---|---|---|
| RSDs | RSDs | mismatches | ||
| 0.17 | 0.09 | 0.93 | 0.94 | 0.07 |
| 0.17 | 0.09 | 0.92 | 0.94 | 0.06 |
| 0.17 | 0.10 | 0.92 | 0.94 | 0.06 |
| 0.17 | 0.09 | 0.93 | 0.94 | 0.06 |
| 0.17 | 0.11 | 0.92 | 0.94 | 0.06 |
| 0.17 | 0.10 | 0.92 | 0.94 | 0.06 |
| 0.17 | 0.09 | 0.91 | 0.94 | 0.07 |
| 0.17 | 0.10 | 0.91 | 0.93 | 0.06 |
| 0.17 | 0.09 | 0.92 | 0.93 | 0.06 |
| 0.17 | 0.10 | 0.92 | 0.93 | 0.06 |
Fig. 5Examples of the fit. SwitchFinder was applied to the time-series of the cell cycle regulated genes from Whitfield et al. The fitted data is presented with dashed lines/curves, the switch-points are depicted in black. The time-course of the gene MCM6 is fitted with Model_Lin, the gene CDC20 showed better fit with Model_Logit
Fig. 6Dynamic patterns of the gene expression response in neuroblastoma cell line to treatment with ATRA. a INDUCED_IMMEDIATELY Genes in this group were induced immediately upon treatment with ATRA. b INDUCED_12 The activation of these genes by ATRA started at 12 hrs. c INDUCED_24 Genes in this group were induced in response to ATRA after 24 hrs. d INDUCED_LATE Genes in this group showed late induction: after 48 or 96 hrs. e REPRESSED These genes responded to ATRA immediately with the decrease of expression. f REPRESSED_CYCLIC These genes, involved in the cell-cycle, were repressed by ATRA. g SPIKED Genes in this group responded to ATRA with increase and then decrease of their activity, revealing a peak between 12 and 48 hrs. h CLEFTED This group summarizes the genes with a transient response to ATRA i.e their expression declined and then increased. The average gene expression profile for each group is depicted in black
A. INDUCED_IMMEDIATELY
| BACH2, BATF2, CREM, CSRNP3, DACH1, EBF1, EGR1/2/3, FOS, FOXC1, GATA6, HES1, HEY1, HIC1, HIF1A, HOXD1/3/8/9/10/13, KDM5B(JARID1B), KLF12, LEF1, MAFB, NCOA3/7, NKX3-1, NR0B1, PBX1, PPARG/D, RARA, SMAD3, SOX4/8/9, TBX2/3, TEAD2, TLE3, TLX2, TULP4, ZFP2, ZNF71/135/436/606/641 | GO:000 3700 sequence-specific DNA binding transcription factor activity; GO:0006355 regulation of transcription, DNA-templated; GO:0030154 cell differentiation |
| AKR1C1/3, BCDO2, CRABP2, CYP26A1/B1, DHRS3, RARA, RBP1, RDH10, SDC4, SP100, STRA6, PPARD/G | GO:0001523 retinoid metabolic process; GO:0042573 retinoic acid metabolic process; GO:0001972 retinoic acid binding; GO:0032526 response to retinoic acid |
| BMP4, EGR1, GREM2, LEF1 | GO:0030509 BMP signaling pathway |
| DACT3, LEF1, PSEN1, SOX4 | GO:0016055 Wnt signaling pathway |
| FOXC1, HES1, HEY1, HIF1A, MDK, NCOR2, PSEN1, TLE3 | GO:0007219 Notch signaling pathway; GO:0005112 Notch binding |
| ERBB2, IRS2, KITLG, PDGFRA/B, SPRY2/4 | GO:0007173 epidermal growth factor receptor signaling |
| PDGFRA/B, PLAT | GO:0048008 platelet-derived growth factor receptor signaling pathway |
| NGFR, NTRK1, PCSK5, PLEKHG2, RALB, RIT1 | GO:0048011 neurotrophin TRK receptor signaling pathway; GO:0038180 nerve growth factor signaling pathway |
| DISP1 | GO:0007224 smoothened signaling pathway; GO:0008158 hedgehog receptor activity; GO:0009880 embryonic pattern specification |
| APC2, EML4, KIFAP3, LYST, NEIL2, SPTAN1 | GO:0015630 microtubule cytoskeleton |
| AHNAK, ARPC1B, AVIL, CORO2A, CTTNBP2NL, FAM129B, FGD4/6, FHL2, FLNB, KALRN, LCP1, MYRIP, PDLIM5/7, PPP1R12B, SYNPO/2, TRIOBP, VCL | GO:0015629 actin cytoskeleton |
| ARHGDIB, CLASP2, CNN2, LIMK1, NUAK2, PAK1, PALM, PFN2, PLK2, RND3, SDCBP, SOX9 | GO:0007010 cytoskeleton organization |
| CEACAM1, GAB2, ITGA1, ITGB8; ADD3, LIMK1, MYADM, MRCL3(MYL12A), TRIO | GO:0007229 integrin-mediated signaling pathway; GO:0005911 cell-cell junction; GO:0040011 locomotion; GO:0016477 cell migration |
| ANTXR1, ATP1B1, BVES, CALCA, CDH23, CEACAM1, CLSTN3, COL12A1, COMP, FBLIM1, KITLG, NCAM2, NEO1, PCDHB2/4/6/9-11/13/14, PPFIBP1, PSEN1, PVRL2, RET, RND3, SPP1, TGFB1I1, TPBG, TRO,VTN | GO:0007155 cell adhesion; GO:0007411 axon guidance |
| HIF1A, HTR2B, KITLG, LEF1, RET, SOX8 | GO:0001755 neural crest cell migration |
| EGR2/EGR3, ERBB2, SOX8 | GO:0007422 peripheral nervous system development |
| JARID1B, JARID2 | GO:0016568 chromatin modification; GO:0048863 stem cell differentiation |
| SLIT2, SLITRK6, FLOT1 | GO:0035385 Roundabout signaling pathway; GO:0050772 positive regulation of axonogenesis |
| EPHA2, EPHB3; SEMA6C, SEMA6D | GO:0048013 ephrin receptor signaling pathway; GO:0030215 semaphorin receptor binding; GO:0007411 axon guidance |
| DCX, DPYSL3, ERBB2, KCNQ2, PSEN1, PTPRO, RRAS, SPTAN1, ST8SIA4; STMN2, TEAD2 | GO:0007411 axon guidance; GO:0030426 growth cone; GO:0048666 neuron development |
| LAMB2, LAMC1 | GO:0005605 basal lamina; GO:0031175 neuron projection development |
| DLG2, GLS, GNG2/8, HCN1, KCNQ2, PANX, RRAS, SDCBP, SST, SYNJ2, SYT2; STX7, STXBP5/6 | GO:0007268 synaptic transmission; GO:0019905 syntaxin binding; GO:0045202 synapse |
| HTR2B, FOS,KALRN, NAB2, NAV2, DCX, RGS9, RTN4, VCL | GO:0007399 nervous system development; neurite branching; GO:0030334 regulation of cell migration |
| CDKL5 | GO:0001764 neuron migration; GO:0050773 regulation of dendrite development; GO:0051726 regulation of cell cycle |
| BCL2, BOK, CASP4/9, CTSB, NLRP1, SKIL; ANGPT1, CPEB4, CRLF1, F2R, HIF1A, MDK, NTRK1, PSEN1 | GO:0006915 apoptotic process; GO:0043524 negative regulation of neuron apoptotic process |
| ADAM12, ADAMTS9, MMP2/11 | GO:0008237 metallopeptidase activity |
| F2R, GALR1, GPR161, HTR2B, IGF2R, P2RY2, PTGER2, PTGIR | GO:0004930 G-protein coupled receptor activity; GO:0004966 galanin receptor activity; GO:0007218 neuropeptide signaling pathway; GO:0007189 adenylate cyclase-activating G-protein coupled receptor signaling pathway |
| CRLF1 | GO:0005127 ciliary neurotrophic factor receptor binding |
The table displays exemplary the genes from the group A and their functional annotations. The group A contains genes that demonstrated immediate increase of expression in response to ATRA