Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Generating real-world evidence from unstructured clinical notes to examine clinical utility of genetic tests: use case in BRCAness.

Literature DB >> 33407429

Generating real-world evidence from unstructured clinical notes to examine clinical utility of genetic tests: use case in BRCAness.

Yiqing Zhao¹, Saravut J Weroha², Ellen L Goode³, Hongfang Liu¹, Chen Wang⁴.

Abstract

BACKGROUND: Next-generation sequencing provides comprehensive information about individuals' genetic makeup and is commonplace in oncology clinical practice. However, the utility of genetic information in the clinical decision-making process has not been examined extensively from a real-world, data-driven perspective. Through mining real-world data (RWD) from clinical notes, we could extract patients' genetic information and further associate treatment decisions with genetic information.
METHODS: We proposed a real-world evidence (RWE) study framework that incorporates context-based natural language processing (NLP) methods and data quality examination before final association analysis. The framework was demonstrated in a Foundation-tested women cancer cohort (N = 196). Upon retrieval of patients' genetic information using NLP system, we assessed the completeness of genetic data captured in unstructured clinical notes according to a genetic data-model. We examined the distribution of different topics regarding BRCA1/2 throughout patients' treatment process, and then analyzed the association between BRCA1/2 mutation status and the discussion/prescription of targeted therapy.
RESULTS: We identified seven topics in the clinical context of genetic mentions including: Information, Evaluation, Insurance, Order, Negative, Positive, and Variants of unknown significance. Our rule-based system achieved a precision of 0.87, recall of 0.93 and F-measure of 0.91. Our machine learning system achieved a precision of 0.901, recall of 0.899 and F-measure of 0.9 for four-topic classification and a precision of 0.833, recall of 0.823 and F-measure of 0.82 for seven-topic classification. We found in result-containing sentences, the capture of BRCA1/2 mutation information was 75%, but detailed variant information (e.g. variant types) is largely missing. Using cleaned RWD, significant associations were found between BRCA1/2 positive mutation and targeted therapies.
CONCLUSIONS: In conclusion, we demonstrated a framework to generate RWE using RWD from different clinical sources. Rule-based NLP system achieved the best performance for resolving contextual variability when extracting RWD from unstructured clinical notes. Data quality issues such as incompleteness and discrepancies exist thus manual data cleaning is needed before further analysis can be performed. Finally, we were able to use cleaned RWD to evaluate the real-world utility of genetic information to initiate a prescription of targeted therapy.

Entities: CellLine Chemical Disease Gene Mutation Species

Keywords: BRCA1/2; Electronic health records; Natural language processing; PARP inhibitor; Precision medicine; Real-world evidence

Mesh：

Year: 2021 PMID： 33407429 PMCID： PMC7789545 DOI： 10.1186/s12911-020-01364-y

Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN： 1472-6947 Impact factor: 2.796

42 in total

1. CSER and eMERGE: current and potential state of the display of genetic information in the electronic health record.

Authors: Brian H Shirts; Joseph S Salama; Samuel J Aronson; Wendy K Chung; Stacy W Gray; Lucia A Hindorff; Gail P Jarvik; Sharon E Plon; Elena M Stoffel; Peter Z Tarczy-Hornoch; Eliezer M Van Allen; Karen E Weck; Christopher G Chute; Robert R Freimuth; Robert W Grundmeier; Andrea L Hartzler; Rongling Li; Peggy L Peissig; Josh F Peterson; Luke V Rasmussen; Justin B Starren; Marc S Williams; Casey L Overby
Journal: J Am Med Inform Assoc Date: 2015-07-03 Impact factor: 4.497

2. Correlating Lab Test Results in Clinical Notes with Structured Lab Data: A Case Study in HbA1c and Glucose.

Authors: Sijia Liu; Liwei Wang; Donna Ihrke; Vipin Chaudhary; Cui Tao; Chunhua Weng; Hongfang Liu
Journal: AMIA Jt Summits Transl Sci Proc Date: 2017-07-26

3. Data-driven Sublanguage Analysis for Cancer Genomics Knowledge Modeling: Applications in Mining Oncological Genetics Information from Patients' Genetic Reports.

Authors: Yiqing Zhao; Hanzhong Yu; Sunyang Fu; Feichen Shen; Jaime I Davila; Hongfang Liu; Chen Wang
Journal: AMIA Jt Summits Transl Sci Proc Date: 2020-05-30

4. BRCA1 testing in families with hereditary breast-ovarian cancer. A prospective study of patient decision making and outcomes.

Authors: C Lerman; S Narod; K Schulman; C Hughes; A Gomez-Caminero; G Bonney; K Gold; B Trock; D Main; J Lynch; C Fulmore; C Snyder; S J Lemon; T Conway; P Tonin; G Lenoir; H Lynch
Journal: JAMA Date: 1996-06-26 Impact factor: 56.272

5. AACR Project GENIE: Powering Precision Medicine through an International Consortium.

Authors:
Journal: Cancer Discov Date: 2017-06-01 Impact factor: 39.397

6. Deep Phenotyping on Electronic Health Records Facilitates Genetic Diagnosis by Clinical Exomes.

Authors: Jung Hoon Son; Gangcai Xie; Chi Yuan; Lyudmila Ena; Ziran Li; Andrew Goldstein; Lulin Huang; Liwei Wang; Feichen Shen; Hongfang Liu; Karla Mehl; Emily E Groopman; Maddalena Marasa; Krzysztof Kiryluk; Ali G Gharavi; Wendy K Chung; George Hripcsak; Carol Friedman; Chunhua Weng; Kai Wang
Journal: Am J Hum Genet Date: 2018-06-28 Impact factor: 11.025

7. The HUGO Gene Nomenclature Database, 2006 updates.

Authors: Tina A Eyre; Fabrice Ducluzeau; Tam P Sneddon; Sue Povey; Elspeth A Bruford; Michael J Lush
Journal: Nucleic Acids Res Date: 2006-01-01 Impact factor: 16.971

Review 8. The Cancer Genome Atlas (TCGA): an immeasurable source of knowledge.

Authors: Katarzyna Tomczak; Patrycja Czerwińska; Maciej Wiznerowicz
Journal: Contemp Oncol (Pozn) Date: 2015

9. An information extraction framework for cohort identification using electronic health records.

Authors: Hongfang Liu; Suzette J Bielinski; Sunghwan Sohn; Sean Murphy; Kavishwar B Wagholikar; Siddhartha R Jonnalagadda; K E Ravikumar; Stephen T Wu; Iftikhar J Kullo; Christopher G Chute
Journal: AMIA Jt Summits Transl Sci Proc Date: 2013-03-18

10. Toward a Learning Health-care System - Knowledge Delivery at the Point of Care Empowered by Big Data and NLP.

Authors: Vinod C Kaggal; Ravikumar Komandur Elayavilli; Saeed Mehrabi; Joshua J Pankratz; Sunghwan Sohn; Yanshan Wang; Dingcheng Li; Majid Mojarad Rastegar; Sean P Murphy; Jason L Ross; Rajeev Chaudhry; James D Buntrock; Hongfang Liu
Journal: Biomed Inform Insights Date: 2016-06-23

1 in total

1. Automating Access to Real-World Evidence.

Authors: Marie-Pier Gauthier; Jennifer H Law; Lisa W Le; Janice J N Li; Sajda Zahir; Sharon Nirmalakumar; Mike Sung; Christopher Pettengell; Steven Aviv; Ryan Chu; Adrian Sacher; Geoffrey Liu; Penelope Bradbury; Frances A Shepherd; Natasha B Leighl
Journal: JTO Clin Res Rep Date: 2022-05-17

1 in total