Timothy Miller1, Dmitriy Dligach2, Steven Bethard3, Chen Lin4, Guergana Savova5. 1. Boston Children's Hospital, Boston, MA, United States; Harvard Medical School, Boston, MA, United States. Electronic address: timothy.miller@childrens.harvard.edu. 2. Loyola University Chicago, Chicago, IL, United States. 3. University of Arizona, Tucson, AZ, United States. 4. Boston Children's Hospital, Boston, MA, United States. 5. Boston Children's Hospital, Boston, MA, United States; Harvard Medical School, Boston, MA, United States.
Abstract
OBJECTIVE: This work investigates the problem of clinical coreference resolution in a model that explicitly tracks entities, and aims to measure the performance of that model in both traditional in-domain train/test splits and cross-domain experiments that measure the generalizability of learned models. METHODS: The two methods we compare are a baseline mention-pair coreference system that operates over pairs of mentions with best-first conflict resolution and a mention-synchronous system that incrementally builds coreference chains. We develop new features that incorporate distributional semantics, discourse features, and entity attributes. We use two new coreference datasets with similar annotation guidelines - the THYME colon cancer dataset and the DeepPhe breast cancer dataset. RESULTS: The mention-synchronous system performs similarly on in-domain data but performs much better on new data. Part of speech tag features prove superior in feature generalizability experiments over other word representations. Our methods show generalization improvement but there is still a performance gap when testing in new domains. DISCUSSION: Generalizability of clinical NLP systems is important and under-studied, so future work should attempt to perform cross-domain and cross-institution evaluations and explicitly develop features and training regimens that favor generalizability. A performance-optimized version of the mention-synchronous system will be included in the open source Apache cTAKES software.
OBJECTIVE: This work investigates the problem of clinical coreference resolution in a model that explicitly tracks entities, and aims to measure the performance of that model in both traditional in-domain train/test splits and cross-domain experiments that measure the generalizability of learned models. METHODS: The two methods we compare are a baseline mention-pair coreference system that operates over pairs of mentions with best-first conflict resolution and a mention-synchronous system that incrementally builds coreference chains. We develop new features that incorporate distributional semantics, discourse features, and entity attributes. We use two new coreference datasets with similar annotation guidelines - the THYMEcolon cancer dataset and the DeepPhe breast cancer dataset. RESULTS: The mention-synchronous system performs similarly on in-domain data but performs much better on new data. Part of speech tag features prove superior in feature generalizability experiments over other word representations. Our methods show generalization improvement but there is still a performance gap when testing in new domains. DISCUSSION: Generalizability of clinical NLP systems is important and under-studied, so future work should attempt to perform cross-domain and cross-institution evaluations and explicitly develop features and training regimens that favor generalizability. A performance-optimized version of the mention-synchronous system will be included in the open source Apache cTAKES software.
Authors: Jiaping Zheng; Wendy W Chapman; Timothy A Miller; Chen Lin; Rebecca S Crowley; Guergana K Savova Journal: J Am Med Inform Assoc Date: 2012-01-31 Impact factor: 4.497
Authors: Ozlem Uzuner; Andreea Bodnari; Shuying Shen; Tyler Forbush; John Pestian; Brett R South Journal: J Am Med Inform Assoc Date: 2012-02-24 Impact factor: 4.497
Authors: Stephen Wu; Timothy Miller; James Masanz; Matt Coarr; Scott Halgrim; David Carrell; Cheryl Clark Journal: PLoS One Date: 2014-11-13 Impact factor: 3.240
Authors: Dmitriy Dligach; Steven Bethard; Lee Becker; Timothy Miller; Guergana K Savova Journal: J Am Med Inform Assoc Date: 2013-10-03 Impact factor: 4.497
Authors: Daniel Albright; Arrick Lanfranchi; Anwen Fredriksen; William F Styler; Colin Warner; Jena D Hwang; Jinho D Choi; Dmitriy Dligach; Rodney D Nielsen; James Martin; Wayne Ward; Martha Palmer; Guergana K Savova Journal: J Am Med Inform Assoc Date: 2013-01-25 Impact factor: 4.497
Authors: Seyedmostafa Sheikhalishahi; Riccardo Miotto; Joel T Dudley; Alberto Lavelli; Fabio Rinaldi; Venet Osmani Journal: JMIR Med Inform Date: 2019-04-27