Literature DB >> 36264941

A natural language processing approach towards harmonisation of European medicinal product information.

Erik Bergman1, Kim Sherwood1, Markus Forslund1, Peter Arlett2, Gabriel Westman1,3.   

Abstract

Product information (PI) is a vital part of any medicinal product approved for use within the European Union and consists of a summary of products characteristics (SmPC) for healthcare professionals and package leaflet (PL) for patients, together with the product packaging. In this study, based on the English corpus of the EMA product information documents for all centrally approved medicinal products within the EU, a BERT sentence embedding model was used together with clustering and dimensional reduction techniques to identify sentence similarity clusters that could be candidates for standardization. A total of 1258 medicinal products were included in the study. From these, a total of 783 K sentences were extracted from SmPC and PL documents which were aggregated into a total of 284 and 129 semantic similarity clusters, respectively. The spread distribution among clusters shows separation into different cluster types. Examples of clusters with low spread include those with identical word embeddings due to current standardization, such as section headings and standard phrases. Others show minor linguistic variations, while the group with the largest variability contains variable wording but with significant semantic overlap. The sentence clusters identified could serve as candidates for further standardization of the PI. Moving from free text human wording to auto-generated text elements based on multiple-choice input for appropriate parts of the package leaflet and summary of product characteristics, could reduce both time and complexity for applicants as well as regulators, and ultimately provide patients and prescribers with documents that are easier to understand and better adapted for search availabilities.

Entities:  

Mesh:

Year:  2022        PMID: 36264941      PMCID: PMC9584511          DOI: 10.1371/journal.pone.0275386

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.752


  5 in total

1.  Analysis of German package inserts.

Authors:  J Fuchs; M Hippius; M Schaefer
Journal:  Int J Clin Pharmacol Ther       Date:  2006-01       Impact factor: 1.366

Review 2.  Natural language processing systems for capturing and standardizing unstructured clinical information: A systematic review.

Authors:  Kory Kreimeyer; Matthew Foster; Abhishek Pandey; Nina Arya; Gwendolyn Halford; Sandra F Jones; Richard Forshee; Mark Walderhaug; Taxiarchis Botsis
Journal:  J Biomed Inform       Date:  2017-07-17       Impact factor: 6.317

3.  Information Extraction From FDA Drug Labeling to Enhance Product-Specific Guidance Assessment Using Natural Language Processing.

Authors:  Yiwen Shi; Ping Ren; Yi Zhang; Xiajing Gong; Meng Hu; Hualou Liang
Journal:  Front Res Metr Anal       Date:  2021-06-10

4.  Normalization and standardization of electronic health records for high-throughput phenotyping: the SHARPn consortium.

Authors:  Jyotishman Pathak; Kent R Bailey; Calvin E Beebe; Steven Bethard; David C Carrell; Pei J Chen; Dmitriy Dligach; Cory M Endle; Lacey A Hart; Peter J Haug; Stanley M Huff; Vinod C Kaggal; Dingcheng Li; Hongfang Liu; Kyle Marchant; James Masanz; Timothy Miller; Thomas A Oniki; Martha Palmer; Kevin J Peterson; Susan Rea; Guergana K Savova; Craig R Stancl; Sunghwan Sohn; Harold R Solbrig; Dale B Suesse; Cui Tao; David P Taylor; Les Westberg; Stephen Wu; Ning Zhuo; Christopher G Chute
Journal:  J Am Med Inform Assoc       Date:  2013-11-04       Impact factor: 4.497

Review 5.  Medication Errors: New EU Good Practice Guide on Risk Minimisation and Error Prevention.

Authors:  Thomas Goedecke; Kathryn Ord; Victoria Newbould; Sabine Brosch; Peter Arlett
Journal:  Drug Saf       Date:  2016-06       Impact factor: 5.606

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.