Literature DB >> 17027692

Functional genomics and proteomics in the clinical neurosciences: data mining and bioinformatics.

John H Phan1, Chang-Feng Quo, May D Wang.   

Abstract

The goal of this chapter is to introduce some of the available computational methods for expression analysis. Genomic and proteomic experimental techniques are briefly discussed to help the reader understand these methods and results better in context with the biological significance. Furthermore, a case study is presented that will illustrate the use of these analytical methods to extract significant biomarkers from high-throughput microarray data. Genomic and proteomic data analysis is essential for understanding the underlying factors that are involved in human disease. Currently, such experimental data are generally obtained by high-throughput microarray or mass spectrometry technologies among others. The sheer amount of raw data obtained using these methods warrants specialized computational methods for data analysis. Biomarker discovery for neurological diagnosis and prognosis is one such example. By extracting significant genomic and proteomic biomarkers in controlled experiments, we come closer to understanding how biological mechanisms contribute to neural degenerative diseases such as Alzheimers' and how drug treatments interact with the nervous system. In the biomarker discovery process, there are several computational methods that must be carefully considered to accurately analyze genomic or proteomic data. These methods include quality control, clustering, classification, feature ranking, and validation. Data quality control and normalization methods reduce technical variability and ensure that discovered biomarkers are statistically significant. Preprocessing steps must be carefully selected since they may adversely affect the results of the following expression analysis steps, which generally fall into two categories: unsupervised and supervised. Unsupervised or clustering methods can be used to group similar genomic or proteomic profiles and therefore can elucidate relationships within sample groups. These methods can also assign biomarkers to sub-groups based on their expression profiles across patient samples. Although clustering is useful for exploratory analysis, it is limited due to its inability to incorporate expert knowledge. On the other hand, classification and feature ranking are supervised, knowledge-based machine learning methods that estimate the distribution of biological expression data and, in doing so, can extract important information about these experiments. Classification is closely coupled with feature ranking, which is essentially a data reduction method that uses classification error estimation or other statistical tests to score features. Biomarkers can subsequently be extracted by eliminating insignificantly ranked features. These analytical methods may be equally applied to genetic and proteomic data. However, because of both biological differences between the data sources and technical differences between the experimental methods used to obtain these data, it is important to have a firm understanding of the data sources and experimental methods. At the same time, regardless of the data quality, it is inevitable that some discovered biomarkers are false positives. Thus, it is important to validate discovered biomarkers. The validation process may be slow; yet, the overall biomarker discovery process is significantly accelerated due to initial feature ranking and data reduction steps. Information obtained from the validation process may also be used to refine data analysis procedures for future iteration. Biomarker validation may be performed in a number of ways - bench-side in traditional labs, web-based electronic resources such as gene ontology and literature databases, and clinical trials.

Entities:  

Mesh:

Year:  2006        PMID: 17027692     DOI: 10.1016/S0079-6123(06)58004-5

Source DB:  PubMed          Journal:  Prog Brain Res        ISSN: 0079-6123            Impact factor:   2.453


  10 in total

1.  Imaging-based observational databases for clinical problem solving: the role of informatics.

Authors:  Alex A T Bui; William Hsu; Corey Arnold; Suzie El-Saden; Denise R Aberle; Ricky K Taira
Journal:  J Am Med Inform Assoc       Date:  2013-06-17       Impact factor: 4.497

2.  Object-oriented regression for building predictive models with high dimensional omics data from translational studies.

Authors:  Lue Ping Zhao; Hamid Bolouri
Journal:  J Biomed Inform       Date:  2016-03-10       Impact factor: 6.317

3.  The grand challenge: use of a new approach in developing policies in the area of radiation and health.

Authors:  Dariusz Leszczynski
Journal:  Front Public Health       Date:  2014-05-21

4.  Cluster and Principal Component Analysis of Human Glioblastoma Multiforme (GBM) Tumor Proteome.

Authors:  Mehdi Pooladi; Mostafa Rezaei-Tavirani; Mehrdad Hashemi; Saeed Hesami-Tackallou; Solmaz Khaghani-Razi-Abad; Afshin Moradi; Ali Reza Zali; Masoumeh Mousavi; Leila Firozi-Dalvand; Azadeh Rakhshan; Mona Zamanian Azodi
Journal:  Iran J Cancer Prev       Date:  2014

5.  A multivariate predictive modeling approach reveals a novel CSF peptide signature for both Alzheimer's Disease state classification and for predicting future disease progression.

Authors:  Daniel A Llano; Saurabh Bundela; Raksha A Mudar; Viswanath Devanarayan
Journal:  PLoS One       Date:  2017-08-03       Impact factor: 3.240

6.  Multimodal deep learning models for early detection of Alzheimer's disease stage.

Authors:  Janani Venugopalan; Li Tong; Hamid Reza Hassanzadeh; May D Wang
Journal:  Sci Rep       Date:  2021-02-05       Impact factor: 4.379

Review 7.  Zebrafish Is a Powerful Tool for Precision Medicine Approaches to Neurological Disorders.

Authors:  Katarzyna Ochenkowska; Aveeva Herold; Éric Samarut
Journal:  Front Mol Neurosci       Date:  2022-07-06       Impact factor: 6.261

8.  A repository based on a dynamically extensible data model supporting multidisciplinary research in neuroscience.

Authors:  Luca Corradi; Ivan Porro; Andrea Schenone; Parastoo Momeni; Raffaele Ferrari; Flavio Nobili; Michela Ferrara; Gabriele Arnulfo; Marco M Fato
Journal:  BMC Med Inform Decis Mak       Date:  2012-10-08       Impact factor: 2.796

9.  Comparison of statistical data models for identifying differentially expressed genes using a generalized likelihood ratio test.

Authors:  Kok-Yong Seng; Robb W Glenny; David K Madtes; Mary E Spilker; Paolo Vicini; Sina A Gharib
Journal:  Gene Regul Syst Bio       Date:  2008

10.  Toxicity prediction from toxicogenomic data based on class association rule mining.

Authors:  Keisuke Nagata; Takashi Washio; Yoshinobu Kawahara; Akira Unami
Journal:  Toxicol Rep       Date:  2014-11-07
  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.