| Literature DB >> 33125616 |
Emma Richard1, Bhargava Reddy2.
Abstract
The ability to detect patterns and trends across protocol deviations (PDs) is key to ensure high data quality and sufficient oversight of patient safety. In clinical trial operations, some business processes and work instructions limit efficient protocol deviation trending because a majority of protocol deviations are left unclassified. When this occurs, it restricts clinical teams from determining systemic issues or signals in the data. The unstructured text in protocol deviation descriptions is an important component of trial operation knowledge. Natural language processing (NLP) can make protocol deviation descriptions more accessible and can support information extraction and trending analysis. This paper reviews how the natural language processing techniques of Term-Frequency Inverse-Document-Frequency (TF-IDF) combined with the supervised machine learning model of Support Vector Machines (SVM) and word embedding approaches such as word2vec can be used to categorize/label protocol deviations across multiple therapeutic areas. NLP is a key tool that will lead to more data driven decisions in clinical trial operations.Entities:
Keywords: Clinical trial operations; Natural language processing; Protocol deviations; Term frequency inverse document frequency; Word2vec
Mesh:
Year: 2020 PMID: 33125616 DOI: 10.1007/s43441-020-00236-x
Source DB: PubMed Journal: Ther Innov Regul Sci ISSN: 2168-4790 Impact factor: 1.778