S Velupillai1, D Mowery, B R South, M Kvist, H Dalianis. 1. Sumithra Velupillai, Department of Computer and Systems Sciences, Stockholm University, Postbox 7003, 164 07 Kista, Sweden, Tel: +46 8 161 174, Fax: +46 8 703 9025, E-mail: sumithra@dsv.su.se.
Abstract
OBJECTIVES: We present a review of recent advances in clinical Natural Language Processing (NLP), with a focus on semantic analysis and key subtasks that support such analysis. METHODS: We conducted a literature review of clinical NLP research from 2008 to 2014, emphasizing recent publications (2012-2014), based on PubMed and ACL proceedings as well as relevant referenced publications from the included papers. RESULTS: Significant articles published within this time-span were included and are discussed from the perspective of semantic analysis. Three key clinical NLP subtasks that enable such analysis were identified: 1) developing more efficient methods for corpus creation (annotation and de-identification), 2) generating building blocks for extracting meaning (morphological, syntactic, and semantic subtasks), and 3) leveraging NLP for clinical utility (NLP applications and infrastructure for clinical use cases). Finally, we provide a reflection upon most recent developments and potential areas of future NLP development and applications. CONCLUSIONS: There has been an increase of advances within key NLP subtasks that support semantic analysis. Performance of NLP semantic analysis is, in many cases, close to that of agreement between humans. The creation and release of corpora annotated with complex semantic information models has greatly supported the development of new tools and approaches. Research on non-English languages is continuously growing. NLP methods have sometimes been successfully employed in real-world clinical tasks. However, there is still a gap between the development of advanced resources and their utilization in clinical settings. A plethora of new clinical use cases are emerging due to established health care initiatives and additional patient-generated sources through the extensive use of social media and other devices.
OBJECTIVES: We present a review of recent advances in clinical Natural Language Processing (NLP), with a focus on semantic analysis and key subtasks that support such analysis. METHODS: We conducted a literature review of clinical NLP research from 2008 to 2014, emphasizing recent publications (2012-2014), based on PubMed and ACL proceedings as well as relevant referenced publications from the included papers. RESULTS: Significant articles published within this time-span were included and are discussed from the perspective of semantic analysis. Three key clinical NLP subtasks that enable such analysis were identified: 1) developing more efficient methods for corpus creation (annotation and de-identification), 2) generating building blocks for extracting meaning (morphological, syntactic, and semantic subtasks), and 3) leveraging NLP for clinical utility (NLP applications and infrastructure for clinical use cases). Finally, we provide a reflection upon most recent developments and potential areas of future NLP development and applications. CONCLUSIONS: There has been an increase of advances within key NLP subtasks that support semantic analysis. Performance of NLP semantic analysis is, in many cases, close to that of agreement between humans. The creation and release of corpora annotated with complex semantic information models has greatly supported the development of new tools and approaches. Research on non-English languages is continuously growing. NLP methods have sometimes been successfully employed in real-world clinical tasks. However, there is still a gap between the development of advanced resources and their utilization in clinical settings. A plethora of new clinical use cases are emerging due to established health care initiatives and additional patient-generated sources through the extensive use of social media and other devices.
Entities:
Keywords:
Annotation; Clinical Natural Language Processing; Domain Adaptation; Information Extraction; Review; Semantics
Authors: R H Perlis; D V Iosifescu; V M Castro; S N Murphy; V S Gainer; J Minnier; T Cai; S Goryachev; Q Zeng; P J Gallagher; M Fava; J B Weilburg; S E Churchill; I S Kohane; J W Smoller Journal: Psychol Med Date: 2011-06-20 Impact factor: 7.723
Authors: Wendy W Chapman; Prakash M Nadkarni; Lynette Hirschman; Leonard W D'Avolio; Guergana K Savova; Ozlem Uzuner Journal: J Am Med Inform Assoc Date: 2011 Sep-Oct Impact factor: 4.497
Authors: Veronika Laippala; Timo Viljanen; Antti Airola; Jenna Kanerva; Sanna Salanterä; Tapio Salakoski; Filip Ginter Journal: Artif Intell Med Date: 2014-03-05 Impact factor: 5.326
Authors: John P Pestian; Pawel Matykiewicz; Michelle Linn-Gust; Brett South; Ozlem Uzuner; Jan Wiebe; K Bretonnel Cohen; John Hurdle; Christopher Brew Journal: Biomed Inform Insights Date: 2012-01-30
Authors: Stephen T Wu; Vinod C Kaggal; Dmitriy Dligach; James J Masanz; Pei Chen; Lee Becker; Wendy W Chapman; Guergana K Savova; Hongfang Liu; Christopher G Chute Journal: J Biomed Semantics Date: 2013-01-03
Authors: Dmitriy Dligach; Steven Bethard; Lee Becker; Timothy Miller; Guergana K Savova Journal: J Am Med Inform Assoc Date: 2013-10-03 Impact factor: 4.497
Authors: Gabrielle Gurdin; Jorge A Vargas; Luke G Maffey; Amy L Olex; Nastassja A Lewinski; Bridget T McInnes Journal: AMIA Jt Summits Transl Sci Proc Date: 2020-05-30
Authors: Christoph P Hornik; Andrew M Atz; Catherine Bendel; Francis Chan; Kevin Downes; Robert Grundmeier; Ben Fogel; Debbie Gipson; Matthew Laughon; Michael Miller; Michael Smith; Chad Livingston; Cindy Kluchar; Anne Heath; Chanda Jarrett; Brian McKerlie; Hetalkumar Patel; Christina Hunter Journal: Appl Clin Inform Date: 2019-05-08 Impact factor: 2.342
Authors: Jason H Moore; Mary Regina Boland; Pablo G Camara; Hannah Chervitz; Graciela Gonzalez; Blanca E Himes; Dokyoon Kim; Danielle L Mowery; Marylyn D Ritchie; Li Shen; Ryan J Urbanowicz; John H Holmes Journal: Per Med Date: 2019-02-14 Impact factor: 2.512
Authors: Alistair E W Johnson; Tom J Pollard; Lu Shen; Li-Wei H Lehman; Mengling Feng; Mohammad Ghassemi; Benjamin Moody; Peter Szolovits; Leo Anthony Celi; Roger G Mark Journal: Sci Data Date: 2016-05-24 Impact factor: 6.444
Authors: J Bouaziz; R Mashiach; S Cohen; A Kedem; A Baron; M Zajicek; I Feldman; D Seidman; D Soriano Journal: Biomed Res Int Date: 2018-03-20 Impact factor: 3.411