Literature DB >> 23544450

Cancer bioinformatics: detection of chromatin states, SNP-containing motifs, and functional enrichment modules.

Xiaobo Zhou.   

Abstract

In this editorial preface, I briefly review cancer bioinformatics and introduce the four articles in this special issue highlighting important applications of the field: detection of chromatin states; detection of SNP-containing motifs and association with transcription factor-binding sites; improvements in functional enrichment modules; and gene association studies on aging and cancer. We expect this issue to provide bioinformatics scientists, cancer biologists, and clinical doctors with a better understanding of how cancer bioinformatics can be used to identify candidate biomarkers and targets and to conduct functional analysis.

Entities:  

Mesh:

Substances:

Year:  2013        PMID: 23544450      PMCID: PMC3845569          DOI: 10.5732/cjc.013.10045

Source DB:  PubMed          Journal:  Chin J Cancer        ISSN: 1944-446X


Cancer bioinformatics is a field that has recently emerged from traditional bioinformatics and clinical informatics. The general purpose of cancer bioinformatics is to discover biomarkers, therapeutic targets, novel pathways, and relationships among diseases or individuals, and to identify candidate causal factors of cancer initiation or progression. Topics in cancer bioinformatics include genomics, Proteomics and metabolic data analysis, image data analysis, survival data analysis, data integration, and database building. Genomics data can be collected from arrays at the DNA and RNA levels, including single nucleotide polymorphism (SNP) array, gene array, microRNA array, axon array, tile array, and many others, and from association studies between any two types of array data. Genomics data can also include DNA and transcriptome sequencing data for the human genome. Proteomics data include measures at total protein expression level, such as mass spectrometry data, and measures of protein activity, such as phosphoproteomic data. Imaging data include data from microscopic cellular imaging, three-dimensional (3D) confocal microscopy, flow Cytometry, 3D computed tomography (CT), magnetic resonance imaging (MRI), diffusion tensor imaging (DTI), and ultrasound. Survival data include information on patient survival, risk analysis, and stratification. Each type of data requires distinct bioinformatic technologies, for example, hierarchical clustering, K-means, and Fuzzy c-means for disease or individual clustering and correlation analyses, Bayesian network inference for signaling network studies, classification of patients and diseases, eQTL mapping, and others. Biomedical research is becoming an information science. However, there are many challenges to developing advanced bioinformatics approaches. First, biological questions and events must be understood in depth, a challenging task for bioinformatics scientists who were not trained in biology. Second, bioinformatics scientists develop sophisticated algorithms to solve a biological question, and biologists must understand how and why the algorithms can answer the question. Thus, bioinformatics scientists and biologists must “speak each other's language” and communicate effectively. Third, one major obstacle in modeling biological data is the limited sample size in almost all applications, forcing the use of power analysis in each project. However, power analysis can only be performed based on preliminary data. This again highlights the need for seamless collaboration between bioinformatics scientists and biologists. In this April issue of Chinese Journal of Cancer, four articles from the international leaders in cancer bioinformatics are published to describe their works with state-of-the-art technologies and methodologies and to share their insights on the trends and outlook for the field. Wang et al.[1] reports the detection and characterization of regulatory elements using probabilistic conditional random field and hidden Markov models. These models, which were developed to mark gene regions with different states based on recurrent and spatially coherent characters, revealed chromatin states that may correspond to enhancers and promoters, transcribed regions, transcriptional elongation, and low-signal regions. Fan et al.[2] identified SNP-containing regulatory motifs in the myelodysplastic syndromes model using data from SNP and gene expression arrays. More specifically, they developed an approach to associate SNP-containing motifs introduced by genetic variation and mutation with transcription factor-binding sites. A set of SNP-containing motifs were discovered as candidate transcription factor-binding sites when using regression model-based LARS-EN algorithm. The identified motifs and their location genes can be considered potential biomarkers. In the third article, Huang et al.[3] introduce GOMA, a functional enrichment tool based on GO modules. Unlike many pathway enrichment tools, which output a long list of significantly enriched terms that are often redundant, GOMA systematically reveals GO modules based on an optimization model. This method simplifies enrichment analysis results and provides more biologically meaningful results. Finally, Wang[4] reviews the association studies on aging and cancer, of which there is currently limited knowledge. Using gene expression profiling data for aging cells, stem cells, and cancer cells, Wang determined that genes related to aging or to human embryonic stem cells are often highly expressed in cancer. Notably, the product of these genes are involved in critical cancer processes, including cell cycle regulation, metabolism, DNA damage response, apoptosis, p53 signaling, immune/inflammatory response, and others. Cancer bioinformatics is becoming well integrated into biomedical research. Indeed, the National Cancer Institute (NCI) issued a call for proposals for cancer bioinformatics technology[5]. Some common themes include automation in data collection, data processing and analysis, data quality assessment, data integration, data presentation and visualization, text mining and natural language processing, archiving, organization, retrieval, and dissemination of data and knowledge, and environment for interactive modeling and simulation. Data integration and modeling is an interesting topic to many cancer researchers. For example, there are some discussions[6] on the integration of array data (e.g., SNP array, gene expression array, microRNA array, axon array, and tile array). Although it is beyond the scope of this special issue, we also recommend reference[7], which presents the advantages and disadvantages of different platforms used for image analyses—another area of interest in the field. Ultimately, we expect that advancements in cancer bioinformatics will contribute significantly to our understanding of the complex mechanisms of cancer and lead to better therapeutic strategies.
  5 in total

1.  Identification of functional elements and regulatory circuits by Drosophila modENCODE.

Authors:  Sushmita Roy; Jason Ernst; Peter V Kharchenko; Pouya Kheradpour; Nicolas Negre; Matthew L Eaton; Jane M Landolin; Christopher A Bristow; Lijia Ma; Michael F Lin; Stefan Washietl; Bradley I Arshinoff; Ferhat Ay; Patrick E Meyer; Nicolas Robine; Nicole L Washington; Luisa Di Stefano; Eugene Berezikov; Christopher D Brown; Rogerio Candeias; Joseph W Carlson; Adrian Carr; Irwin Jungreis; Daniel Marbach; Rachel Sealfon; Michael Y Tolstorukov; Sebastian Will; Artyom A Alekseyenko; Carlo Artieri; Benjamin W Booth; Angela N Brooks; Qi Dai; Carrie A Davis; Michael O Duff; Xin Feng; Andrey A Gorchakov; Tingting Gu; Jorja G Henikoff; Philipp Kapranov; Renhua Li; Heather K MacAlpine; John Malone; Aki Minoda; Jared Nordman; Katsutomo Okamura; Marc Perry; Sara K Powell; Nicole C Riddle; Akiko Sakai; Anastasia Samsonova; Jeremy E Sandler; Yuri B Schwartz; Noa Sher; Rebecca Spokony; David Sturgill; Marijke van Baren; Kenneth H Wan; Li Yang; Charles Yu; Elise Feingold; Peter Good; Mark Guyer; Rebecca Lowdon; Kami Ahmad; Justen Andrews; Bonnie Berger; Steven E Brenner; Michael R Brent; Lucy Cherbas; Sarah C R Elgin; Thomas R Gingeras; Robert Grossman; Roger A Hoskins; Thomas C Kaufman; William Kent; Mitzi I Kuroda; Terry Orr-Weaver; Norbert Perrimon; Vincenzo Pirrotta; James W Posakony; Bing Ren; Steven Russell; Peter Cherbas; Brenton R Graveley; Suzanna Lewis; Gos Micklem; Brian Oliver; Peter J Park; Susan E Celniker; Steven Henikoff; Gary H Karpen; Eric C Lai; David M MacAlpine; Lincoln D Stein; Kevin P White; Manolis Kellis
Journal:  Science       Date:  2010-12-22       Impact factor: 47.728

Review 2.  Discovery of molecular associations among aging, stem cells, and cancer based on gene expression profiling.

Authors:  Xiaosheng Wang
Journal:  Chin J Cancer       Date:  2013-01-09

3.  Detection and characterization of regulatory elements using probabilistic conditional random field and hidden Markov models.

Authors:  Hongyan Wang; Xiaobo Zhou
Journal:  Chin J Cancer       Date:  2012-12-07

4.  GOMA: functional enrichment analysis tool based on GO modules.

Authors:  Qiang Huang; Ling-Yun Wu; Yong Wang; Xiang-Sun Zhang
Journal:  Chin J Cancer       Date:  2012-12-07

5.  Identification of SNP-containing regulatory motifs in the myelodysplastic syndromes model using SNP arrays and gene expression arrays.

Authors:  Jing Fan; Jennifer G Dy; Chung-Che Chang; Xiaobo Zhou
Journal:  Chin J Cancer       Date:  2013-01-18
  5 in total
  1 in total

1.  Cancer research in the era of next-generation sequencing and big data calls for intelligent modeling.

Authors:  Jari Yli-Hietanen; Antti Ylipää; Olli Yli-Harja
Journal:  Chin J Cancer       Date:  2015-04-11
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.