Literature DB >> 22081063

Data mining approaches for genome-wide association of mood disorders.

Mehdi Pirooznia1, Fayaz Seifuddin, Jennifer Judy, Pamela B Mahon, James B Potash, Peter P Zandi.   

Abstract

BACKGROUND: Mood disorders are highly heritable forms of major mental illness. A major breakthrough in elucidating the genetic architecture of mood disorders was anticipated with the advent of genome-wide association studies (GWAS). However, to date few susceptibility loci have been conclusively identified. The genetic etiology of mood disorders appears to be quite complex, and as a result, alternative approaches for analyzing GWAS data are needed. Recently, a polygenic scoring approach that captures the effects of alleles across multiple loci was successfully applied to the analysis of GWAS data in schizophrenia and bipolar disorder (BP). However, this method may be overly simplistic in its approach to the complexity of genetic effects. Data mining methods are available that may be applied to analyze the high dimensional data generated by GWAS of complex psychiatric disorders.
RESULTS: We sought to compare the performance of five data mining methods, namely, Bayesian networks, support vector machine, random forest, radial basis function network, and logistic regression, against the polygenic scoring approach in the analysis of GWAS data on BP. The different classification methods were trained on GWAS datasets from the Bipolar Genome Study (2191 cases with BP and 1434 controls) and their ability to accurately classify case/control status was tested on a GWAS dataset from the Wellcome Trust Case Control Consortium.
CONCLUSION: The performance of the classifiers in the test dataset was evaluated by comparing area under the receiver operating characteristic curves. Bayesian networks performed the best of all the data mining classifiers, but none of these did significantly better than the polygenic score approach. We further examined a subset of single-nucleotide polymorphisms (SNPs) in genes that are expressed in the brain, under the hypothesis that these might be most relevant to BP susceptibility, but all the classifiers performed worse with this reduced set of SNPs. The discriminative accuracy of all of these methods is unlikely to be of diagnostic or clinical utility at the present time. Further research is needed to develop strategies for selecting sets of SNPs likely to be relevant to disease susceptibility and to determine if other data mining classifiers that utilize other algorithms for inferring relationships among the sets of SNPs may perform better.

Entities:  

Mesh:

Year:  2012        PMID: 22081063      PMCID: PMC3306768          DOI: 10.1097/YPG.0b013e32834dc40d

Source DB:  PubMed          Journal:  Psychiatr Genet        ISSN: 0955-8829            Impact factor:   2.458


  40 in total

Review 1.  Bayesian networks for knowledge discovery in large datasets: basics for nurse researchers.

Authors:  Sun-Mi Lee; Patricia A Abbott
Journal:  J Biomed Inform       Date:  2003 Aug-Oct       Impact factor: 6.317

Review 2.  Tools for intelligent control: fuzzy controllers, neural networks and genetic algorithms.

Authors:  Mo Jamshidi
Journal:  Philos Trans A Math Phys Eng Sci       Date:  2003-08-15       Impact factor: 4.226

3.  Genetic dissection and prognostic modeling of overt stroke in sickle cell anemia.

Authors:  Paola Sebastiani; Marco F Ramoni; Vikki Nolan; Clinton T Baldwin; Martin H Steinberg
Journal:  Nat Genet       Date:  2005-03-20       Impact factor: 38.330

4.  Multiple SVM-RFE for gene selection in cancer classification with expression data.

Authors:  Kai-Bo Duan; Jagath C Rajapakse; Haiying Wang; Francisco Azuaje
Journal:  IEEE Trans Nanobioscience       Date:  2005-09       Impact factor: 2.935

Review 5.  Machine learning to boost the next generation of visualization technology.

Authors:  Kwan-Liu Ma
Journal:  IEEE Comput Graph Appl       Date:  2007 Sep-Oct       Impact factor: 2.088

6.  Combining multi-species genomic data for microRNA identification using a Naive Bayes classifier.

Authors:  Malik Yousef; Michael Nebozhyn; Hagit Shatkay; Stathis Kanterakis; Louise C Showe; Michael K Showe
Journal:  Bioinformatics       Date:  2006-03-16       Impact factor: 6.937

7.  New models of collaboration in genome-wide association studies: the Genetic Association Information Network.

Authors:  Teri A Manolio; Laura Lyman Rodriguez; Lisa Brooks; Gonçalo Abecasis; Dennis Ballinger; Mark Daly; Peter Donnelly; Stephen V Faraone; Kelly Frazer; Stacey Gabriel; Pablo Gejman; Alan Guttmacher; Emily L Harris; Thomas Insel; John R Kelsoe; Eric Lander; Norma McCowin; Matthew D Mailman; Elizabeth Nabel; James Ostell; Elizabeth Pugh; Stephen Sherry; Patrick F Sullivan; John F Thompson; James Warram; David Wholley; Patrice M Milos; Francis S Collins
Journal:  Nat Genet       Date:  2007-09       Impact factor: 38.330

8.  Simultaneous genotype calling and haplotype phasing improves genotype accuracy and reduces false-positive associations for genome-wide association studies.

Authors:  Brian L Browning; Zhaoxia Yu
Journal:  Am J Hum Genet       Date:  2009-12       Impact factor: 11.025

Review 9.  Decision tree for the treatment of bipolar disorder.

Authors:  Gary S Sachs
Journal:  J Clin Psychiatry       Date:  2003       Impact factor: 4.384

Review 10.  Machine learning and its applications to biology.

Authors:  Adi L Tarca; Vincent J Carey; Xue-wen Chen; Roberto Romero; Sorin Drăghici
Journal:  PLoS Comput Biol       Date:  2007-06       Impact factor: 4.475

View more
  7 in total

Review 1.  Data Mining Algorithms and Techniques in Mental Health: A Systematic Review.

Authors:  Susel Góngora Alonso; Isabel de la Torre-Díez; Sofiane Hamrioui; Miguel López-Coronado; Diego Calvo Barreno; Lola Morón Nozaleda; Manuel Franco
Journal:  J Med Syst       Date:  2018-07-21       Impact factor: 4.460

2.  Machine learning for genetic prediction of psychiatric disorders: a systematic review.

Authors:  Matthew Bracher-Smith; Karen Crawford; Valentina Escott-Price
Journal:  Mol Psychiatry       Date:  2020-06-26       Impact factor: 15.992

3.  Predicting disease risk using bootstrap ranking and classification algorithms.

Authors:  Ohad Manor; Eran Segal
Journal:  PLoS Comput Biol       Date:  2013-08-22       Impact factor: 4.475

4.  The genetic interacting landscape of 63 candidate genes in Major Depressive Disorder: an explorative study.

Authors:  Magnus Lekman; Ola Hössjer; Peter Andrews; Henrik Källberg; Daniel Uvehag; Dennis Charney; Husseini Manji; John A Rush; Francis J McMahon; Jason H Moore; Ingrid Kockum
Journal:  BioData Min       Date:  2014-09-09       Impact factor: 2.522

5.  Evaluation of potential novel variations and their interactions related to bipolar disorders: analysis of genome-wide association study data.

Authors:  Cengizhan Acikel; Yesim Aydin Son; Cemil Celik; Husamettin Gul
Journal:  Neuropsychiatr Dis Treat       Date:  2016-11-24       Impact factor: 2.570

6.  Machine learning algorithm performance evaluation in structural magnetic resonance imaging-based classification of pediatric bipolar disorders type I patients.

Authors:  Ruhai Dou; Weijia Gao; Qingmin Meng; Xiaotong Zhang; Weifang Cao; Liangfeng Kuang; Jinpeng Niu; Yongxin Guo; Dong Cui; Qing Jiao; Jianfeng Qiu; Linyan Su; Guangming Lu
Journal:  Front Comput Neurosci       Date:  2022-08-23       Impact factor: 3.387

7.  Influence of Feature Encoding and Choice of Classifier on Disease Risk Prediction in Genome-Wide Association Studies.

Authors:  Florian Mittag; Michael Römer; Andreas Zell
Journal:  PLoS One       Date:  2015-08-18       Impact factor: 3.240

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.