UNLABELLED: Current research on high throughput identification of patients with a specific phenotype is in its infancy. There is an urgent need to develop a general automatic approach for patient identification. OBJECTIVE: We took advantage of Mayo Clinic electronic clinical notes and proposed a novel method of combining NLP, machine learning, and ontology for automatic patient identification. We also investigated the benefits of involving existing SNOMED semantic knowledge in a patient identification task. METHODS: the SVM algorithm was applied on SNOMED concept units extracted from T2DM case/control clinical notes. Precision, recall, and F-score were calculated to evaluate the performance. RESULTS: This approach achieved an F-score of above 0.950 for both groups when using all identified concept units as features. Concept units from semantic type-Disease or Syndrome contain the most important information for patient identification. Our results also implied that the coarse level concepts contain enough information to classify T2DM cases/controls.
UNLABELLED: Current research on high throughput identification of patients with a specific phenotype is in its infancy. There is an urgent need to develop a general automatic approach for patient identification. OBJECTIVE: We took advantage of Mayo Clinic electronic clinical notes and proposed a novel method of combining NLP, machine learning, and ontology for automatic patient identification. We also investigated the benefits of involving existing SNOMED semantic knowledge in a patient identification task. METHODS: the SVM algorithm was applied on SNOMED concept units extracted from T2DM case/control clinical notes. Precision, recall, and F-score were calculated to evaluate the performance. RESULTS: This approach achieved an F-score of above 0.950 for both groups when using all identified concept units as features. Concept units from semantic type-Disease or Syndrome contain the most important information for patient identification. Our results also implied that the coarse level concepts contain enough information to classify T2DM cases/controls.
Authors: Elena Birman-Deych; Amy D Waterman; Yan Yan; David S Nilasena; Martha J Radford; Brian F Gage Journal: Med Care Date: 2005-05 Impact factor: 2.983
Authors: Serguei S V Pakhomov; Harry Hemingway; Susan A Weston; Steven J Jacobsen; Richard Rodeheffer; Véronique L Roger Journal: Am Heart J Date: 2007-04 Impact factor: 4.749
Authors: Elizabeth F O Kern; Miriam Maney; Donald R Miller; Chin-Lin Tseng; Anjali Tiwari; Mangala Rajan; David Aron; Leonard Pogach Journal: Health Serv Res Date: 2006-04 Impact factor: 3.402
Authors: Wei-Qi Wei; Pedro L Teixeira; Huan Mo; Robert M Cronin; Jeremy L Warner; Joshua C Denny Journal: J Am Med Inform Assoc Date: 2015-09-02 Impact factor: 4.497
Authors: Juan Zhao; Yun Zhang; David J Schlueter; Patrick Wu; Vern Eric Kerchberger; S Trent Rosenbloom; Quinn S Wells; QiPing Feng; Joshua C Denny; Wei-Qi Wei Journal: J Biomed Inform Date: 2019-08-22 Impact factor: 6.317
Authors: Wei-Qi Wei; Cynthia L Leibson; Jeanine E Ransom; Abel N Kho; Pedro J Caraballo; High Seng Chai; Barbara P Yawn; Jennifer A Pacheco; Christopher G Chute Journal: J Am Med Inform Assoc Date: 2012-01-16 Impact factor: 4.497
Authors: Yukun Chen; Robert J Carroll; Eugenia R McPeek Hinz; Anushi Shah; Anne E Eyler; Joshua C Denny; Hua Xu Journal: J Am Med Inform Assoc Date: 2013-07-13 Impact factor: 4.497