Literature DB >> 31025483

Improving the classification of neuropsychiatric conditions using gene ontology terms as features.

Thomas P Quinn1,2,3, Samuel C Lee1, Svetha Venkatesh1, Thin Nguyen1.   

Abstract

Although neuropsychiatric disorders have an established genetic background, their molecular foundations remain elusive. This has prompted many investigators to search for explanatory biomarkers that can predict clinical outcomes. One approach uses machine learning to classify patients based on blood mRNA expression. However, these endeavors typically fail to achieve the high level of performance, stability, and generalizability required for clinical translation. Moreover, these classifiers can lack interpretability because not all genes have relevance to researchers. For this study, we hypothesized that annotation-based classifiers can improve classification performance, stability, generalizability, and interpretability. To this end, we evaluated the models of four classification algorithms on six neuropsychiatric data sets using four annotation databases. Our results suggest that the Gene Ontology Biological Process database can transform gene expression into an annotation-based feature space that is accurate and stable. We also show how annotation features can improve the interpretability of classifiers: as annotations are used to assign biological importance to genes, the biological importance of annotation-based features are the features themselves. In evaluating the annotation features, we find that top ranked annotations tend contain top ranked genes, suggesting that the most predictive annotations are a superset of the most predictive genes. Based on this, and the fact that annotations are used routinely to assign biological importance to genetic data, we recommend transforming gene-level expression into annotation-level expression prior to the classification of neuropsychiatric conditions.
© 2019 Wiley Periodicals, Inc.

Entities:  

Keywords:  biomarkers; classification; gene expression; machine learning; prediction

Mesh:

Year:  2019        PMID: 31025483     DOI: 10.1002/ajmg.b.32727

Source DB:  PubMed          Journal:  Am J Med Genet B Neuropsychiatr Genet        ISSN: 1552-4841            Impact factor:   3.568


  2 in total

1.  Solving for X: Evidence for sex-specific autism biomarkers across multiple transcriptomic studies.

Authors:  Samuel C Lee; Thomas P Quinn; Jerry Lai; Sek Won Kong; Irva Hertz-Picciotto; Stephen J Glatt; Tamsyn M Crowley; Svetha Venkatesh; Thin Nguyen
Journal:  Am J Med Genet B Neuropsychiatr Genet       Date:  2018-12-06       Impact factor: 3.568

2.  DeepTRIAGE: interpretable and individualised biomarker scores using attention mechanism for the classification of breast cancer sub-types.

Authors:  Adham Beykikhoshk; Thomas P Quinn; Samuel C Lee; Truyen Tran; Svetha Venkatesh
Journal:  BMC Med Genomics       Date:  2020-02-24       Impact factor: 3.063

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.