Literature DB >> 25862765

Subgraph augmented non-negative tensor factorization (SANTF) for modeling clinical narrative text.

Yuan Luo1, Yu Xin2, Ephraim Hochberg3, Rohit Joshi2, Ozlem Uzuner4, Peter Szolovits2.   

Abstract

OBJECTIVE: Extracting medical knowledge from electronic medical records requires automated approaches to combat scalability limitations and selection biases. However, existing machine learning approaches are often regarded by clinicians as black boxes. Moreover, training data for these automated approaches at often sparsely annotated at best. The authors target unsupervised learning for modeling clinical narrative text, aiming at improving both accuracy and interpretability.
METHODS: The authors introduce a novel framework named subgraph augmented non-negative tensor factorization (SANTF). In addition to relying on atomic features (e.g., words in clinical narrative text), SANTF automatically mines higher-order features (e.g., relations of lymphoid cells expressing antigens) from clinical narrative text by converting sentences into a graph representation and identifying important subgraphs. The authors compose a tensor using patients, higher-order features, and atomic features as its respective modes. We then apply non-negative tensor factorization to cluster patients, and simultaneously identify latent groups of higher-order features that link to patient clusters, as in clinical guidelines where a panel of immunophenotypic features and laboratory results are used to specify diagnostic criteria. RESULTS AND
CONCLUSION: SANTF demonstrated over 10% improvement in averaged F-measure on patient clustering compared to widely used non-negative matrix factorization (NMF) and k-means clustering methods. Multiple baselines were established by modeling patient data using patient-by-features matrices with different feature configurations and then performing NMF or k-means to cluster patients. Feature analysis identified latent groups of higher-order features that lead to medical insights. We also found that the latent groups of atomic features help to better correlate the latent groups of higher-order features.
© The Author 2015. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Entities:  

Keywords:  natural language processing; non-negative tensor factorization; subgraph mining; unsupervised learning

Mesh:

Year:  2015        PMID: 25862765      PMCID: PMC4986663          DOI: 10.1093/jamia/ocv016

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


  28 in total

1.  Learning the parts of objects by non-negative matrix factorization.

Authors:  D D Lee; H S Seung
Journal:  Nature       Date:  1999-10-21       Impact factor: 49.962

2.  Diffuse large B-cell lymphoma outcome prediction by gene-expression profiling and supervised machine learning.

Authors:  Margaret A Shipp; Ken N Ross; Pablo Tamayo; Andrew P Weng; Jeffery L Kutok; Ricardo C T Aguiar; Michelle Gaasenbeek; Michael Angelo; Michael Reich; Geraldine S Pinkus; Tane S Ray; Margaret A Koval; Kim W Last; Andrew Norton; T Andrew Lister; Jill Mesirov; Donna S Neuberg; Eric S Lander; Jon C Aster; Todd R Golub
Journal:  Nat Med       Date:  2002-01       Impact factor: 53.440

3.  Metagenes and molecular pattern discovery using matrix factorization.

Authors:  Jean-Philippe Brunet; Pablo Tamayo; Todd R Golub; Jill P Mesirov
Journal:  Proc Natl Acad Sci U S A       Date:  2004-03-11       Impact factor: 11.205

4.  Some mathematical notes on three-mode factor analysis.

Authors:  L R Tucker
Journal:  Psychometrika       Date:  1966-09       Impact factor: 2.500

5.  Integration of early physiological responses predicts later illness severity in preterm infants.

Authors:  Suchi Saria; Anand K Rajani; Jeffrey Gould; Daphne Koller; Anna A Penn
Journal:  Sci Transl Med       Date:  2010-09-08       Impact factor: 17.956

6.  Cluster analysis and display of genome-wide expression patterns.

Authors:  M B Eisen; P T Spellman; P O Brown; D Botstein
Journal:  Proc Natl Acad Sci U S A       Date:  1998-12-08       Impact factor: 11.205

7.  Computational medicine: translating models to clinical care.

Authors:  Raimond L Winslow; Natalia Trayanova; Donald Geman; Michael I Miller
Journal:  Sci Transl Med       Date:  2012-10-31       Impact factor: 17.956

8.  Regulatory networks define phenotypic classes of human stem cell lines.

Authors:  Franz-Josef Müller; Louise C Laurent; Dennis Kostka; Igor Ulitsky; Roy Williams; Christina Lu; In-Hyun Park; Mahendra S Rao; Ron Shamir; Philip H Schwartz; Nils O Schmidt; Jeanne F Loring
Journal:  Nature       Date:  2008-08-24       Impact factor: 49.962

9.  Unsupervised analysis of classical biomedical markers: robustness and medical relevance of patient clustering using bioinformatics tools.

Authors:  Michal Markovich Gordon; Asher M Moser; Eitan Rubin
Journal:  PLoS One       Date:  2012-03-05       Impact factor: 3.240

10.  Subtypes of pancreatic ductal adenocarcinoma and their differing responses to therapy.

Authors:  Eric A Collisson; Anguraj Sadanandam; Peter Olson; William J Gibb; Morgan Truitt; Shenda Gu; Janine Cooc; Jennifer Weinkle; Grace E Kim; Lakshmi Jakkula; Heidi S Feiler; Andrew H Ko; Adam B Olshen; Kathleen L Danenberg; Margaret A Tempero; Paul T Spellman; Douglas Hanahan; Joe W Gray
Journal:  Nat Med       Date:  2011-04-03       Impact factor: 53.440

View more
  21 in total

1.  Trends in biomedical informatics: automated topic analysis of JAMIA articles.

Authors:  Dong Han; Shuang Wang; Chao Jiang; Xiaoqian Jiang; Hyeon-Eui Kim; Jimeng Sun; Lucila Ohno-Machado
Journal:  J Am Med Inform Assoc       Date:  2015-11       Impact factor: 4.497

2.  Bridging semantics and syntax with graph algorithms-state-of-the-art of extracting biomedical relations.

Authors:  Yuan Luo; Özlem Uzuner; Peter Szolovits
Journal:  Brief Bioinform       Date:  2016-02-05       Impact factor: 11.622

3.  Contralateral Breast Cancer Event Detection Using Nature Language Processing.

Authors:  Zexian Zeng; Xiaoyu Li; Sasa Espino; Ankita Roy; Kristen Kitsch; Susan Clare; Seema Khan; Yuan Luo
Journal:  AMIA Annu Symp Proc       Date:  2018-04-16

4.  Recurrent neural networks for classifying relations in clinical notes.

Authors:  Yuan Luo
Journal:  J Biomed Inform       Date:  2017-07-08       Impact factor: 6.317

5.  Clinical Natural Language Processing in 2015: Leveraging the Variety of Texts of Clinical Interest.

Authors:  A Névéol; P Zweigenbaum
Journal:  Yearb Med Inform       Date:  2016-11-10

6.  Tensor factorization toward precision medicine.

Authors:  Yuan Luo; Fei Wang; Peter Szolovits
Journal:  Brief Bioinform       Date:  2017-05-01       Impact factor: 11.622

7.  Segment convolutional neural networks (Seg-CNNs) for classifying relations in clinical notes.

Authors:  Yuan Luo; Yu Cheng; Özlem Uzuner; Peter Szolovits; Justin Starren
Journal:  J Am Med Inform Assoc       Date:  2018-01-01       Impact factor: 4.497

8.  Phenotyping Multiple Organ Dysfunction Syndrome Using Temporal Trends in Critically Ill Children.

Authors:  Emily Kunce Stroup; Yuan Luo; L Nelson Sanchez-Pinto
Journal:  Proceedings (IEEE Int Conf Bioinformatics Biomed)       Date:  2020-02-06

Review 9.  Tensor Factorization for Precision Medicine in Heart Failure with Preserved Ejection Fraction.

Authors:  Yuan Luo; Faraz S Ahmad; Sanjiv J Shah
Journal:  J Cardiovasc Transl Res       Date:  2017-01-23       Impact factor: 4.132

10.  Rich Text Formatted EHR Narratives: A Hidden and Ignored Trove.

Authors:  Zexian Zeng; Yuan Zhao; Mengxin Sun; Andy H Vo; Justin Starren; Yuan Luo
Journal:  Stud Health Technol Inform       Date:  2019-08-21
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.