Literature DB >> 31419182

Validation of a Semiautomated Natural Language Processing-Based Procedure for Meta-Analysis of Cancer Susceptibility Gene Penetrance.

Zhengyi Deng1, Kanhua Yin1, Yujia Bao2, Victor Diego Armengol1, Cathy Wang3,4, Ankur Tiwari1, Regina Barzilay2, Giovanni Parmigiani3,4, Danielle Braun3,4, Kevin S Hughes1,5.   

Abstract

PURPOSE: Quantifying the risk of cancer associated with pathogenic mutations in germline cancer susceptibility genes-that is, penetrance-enables the personalization of preventive management strategies. Conducting a meta-analysis is the best way to obtain robust risk estimates. We have previously developed a natural language processing (NLP) -based abstract classifier which classifies abstracts as relevant to penetrance, prevalence of mutations, both, or neither. In this work, we evaluate the performance of this NLP-based procedure.
MATERIALS AND METHODS: We compared the semiautomated NLP-based procedure, which involves automated abstract classification and text mining, followed by human review of identified studies, with the traditional procedure that requires human review of all studies. Ten high-quality gene-cancer penetrance meta-analyses spanning 16 gene-cancer associations were used as the gold standard by which to evaluate the performance of our procedure. For each meta-analysis, we evaluated the number of abstracts that required human review (workload) and the ability to identify the studies that were included by the authors in their quantitative analysis (coverage).
RESULTS: Compared with the traditional procedure, the semiautomated NLP-based procedure led to a lower workload across all 10 meta-analyses, with an overall 84% reduction (2,774 abstracts v 16,941 abstracts) in the amount of human review required. Overall coverage was 93%-we are able to identify 132 of 142 studies-before reviewing references of identified studies. Reasons for the 10 missed studies included blank and poorly written abstracts. After reviewing references, nine of the previously missed studies were identified and coverage improved to 99% (141 of 142 studies).
CONCLUSION: We demonstrated that an NLP-based procedure can significantly reduce the review workload without compromising the ability to identify relevant studies. NLP algorithms have promising potential for reducing human efforts in the literature review process.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 31419182      PMCID: PMC6873944          DOI: 10.1200/CCI.19.00043

Source DB:  PubMed          Journal:  JCO Clin Cancer Inform        ISSN: 2473-4276


  33 in total

1.  Reducing workload in systematic review preparation using automated citation classification.

Authors:  A M Cohen; W R Hersh; K Peterson; Po-Yin Yen
Journal:  J Am Med Inform Assoc       Date:  2005-12-15       Impact factor: 4.497

2.  Use of natural language processing in electronic medical records to identify pregnant women with suicidal behavior: towards a solution to the complex classification problem.

Authors:  Qiu-Yue Zhong; Leena P Mittal; Margo D Nathan; Kara M Brown; Deborah Knudson González; Tianrun Cai; Sean Finan; Bizu Gelaye; Paul Avillach; Jordan W Smoller; Elizabeth W Karlson; Tianxi Cai; Michelle A Williams
Journal:  Eur J Epidemiol       Date:  2018-12-10       Impact factor: 8.082

3.  CHEK2 mutation and risk of prostate cancer: a systematic review and meta-analysis.

Authors:  Yue Wang; Bo Dai; Dingwei Ye
Journal:  Int J Clin Exp Med       Date:  2015-09-15

4.  Autonomous detection, grading, and reporting of postoperative complications using natural language processing.

Authors:  Luke V Selby; Wazim R Narain; Ashley Russo; Vivian E Strong; Peter Stetson
Journal:  Surgery       Date:  2018-07-26       Impact factor: 3.982

5.  Meta-analysis of BRCA1 and BRCA2 penetrance.

Authors:  Sining Chen; Giovanni Parmigiani
Journal:  J Clin Oncol       Date:  2007-04-10       Impact factor: 44.544

Review 6.  Cancer genetics, precision prevention and a call to action.

Authors:  Clare Turnbull; Amit Sud; Richard S Houlston
Journal:  Nat Genet       Date:  2018-08-29       Impact factor: 38.330

7.  A large-scale meta-analysis to refine colorectal cancer risk estimates associated with MUTYH variants.

Authors:  E Theodoratou; H Campbell; A Tenesa; R Houlston; E Webb; S Lubbe; P Broderick; S Gallinger; E M Croitoru; M A Jenkins; A K Win; S P Cleary; T Koessler; P D Pharoah; S Küry; S Bézieau; B Buecher; N A Ellis; P Peterlongo; K Offit; L A Aaltonen; S Enholm; A Lindblom; X-L Zhou; I P Tomlinson; V Moreno; I Blanco; G Capellà; R Barnetson; M E Porteous; M G Dunlop; S M Farrington
Journal:  Br J Cancer       Date:  2010-11-09       Impact factor: 7.640

8.  Reducing systematic review workload through certainty-based screening.

Authors:  Makoto Miwa; James Thomas; Alison O'Mara-Eves; Sophia Ananiadou
Journal:  J Biomed Inform       Date:  2014-06-19       Impact factor: 6.317

9.  Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement.

Authors:  David Moher; Alessandro Liberati; Jennifer Tetzlaff; Douglas G Altman
Journal:  PLoS Med       Date:  2009-07-21       Impact factor: 11.069

10.  Automated screening of research studies for systematic reviews using study characteristics.

Authors:  Guy Tsafnat; Paul Glasziou; George Karystianis; Enrico Coiera
Journal:  Syst Rev       Date:  2018-04-25
View more
  7 in total

1.  Using Machine Learning and Natural Language Processing to Review and Classify the Medical Literature on Cancer Susceptibility Genes.

Authors:  Yujia Bao; Zhengyi Deng; Yan Wang; Heeyoon Kim; Victor Diego Armengol; Francisco Acevedo; Nofal Ouardaoui; Cathy Wang; Giovanni Parmigiani; Regina Barzilay; Danielle Braun; Kevin S Hughes
Journal:  JCO Clin Cancer Inform       Date:  2019-09

2.  Predictive article recommendation using natural language processing and machine learning to support evidence updates in domain-specific knowledge graphs.

Authors:  Bhuvan Sharma; Van C Willis; Claudia S Huettner; Kirk Beaty; Jane L Snowdon; Shang Xue; Brett R South; Gretchen P Jackson; Dilhan Weeraratne; Vanessa Michelini
Journal:  JAMIA Open       Date:  2020-09-29

3.  Intelligent, Autonomous Machines in Surgery.

Authors:  Tyler J Loftus; Amanda C Filiberto; Jeremy Balch; Alexander L Ayzengart; Patrick J Tighe; Parisa Rashidi; Azra Bihorac; Gilbert R Upchurch
Journal:  J Surg Res       Date:  2020-04-24       Impact factor: 2.192

4.  Translational NLP: A New Paradigm and General Principles for Natural Language Processing Research.

Authors:  Denis Newman-Griffis; Jill Fain Lehman; Carolyn Rosé; Harry Hochheiser
Journal:  Proc Conf       Date:  2021-06

5.  Disease Spectrum of Breast Cancer Susceptibility Genes.

Authors:  Jin Wang; Preeti Singh; Kanhua Yin; Jingan Zhou; Yujia Bao; Menghua Wu; Kush Pathak; Sophia K McKinley; Danielle Braun; Kevin S Hughes
Journal:  Front Oncol       Date:  2021-04-20       Impact factor: 6.244

Review 6.  Artificial Intelligence and Cardiovascular Genetics.

Authors:  Chayakrit Krittanawong; Kipp W Johnson; Edward Choi; Scott Kaplin; Eric Venner; Mullai Murugan; Zhen Wang; Benjamin S Glicksberg; Christopher I Amos; Michael C Schatz; W H Wilson Tang
Journal:  Life (Basel)       Date:  2022-02-14

7.  Applications of Model-Based Meta-Analysis in Drug Development.

Authors:  Phyllis Chan; Kirill Peskov; Xuyang Song
Journal:  Pharm Res       Date:  2022-02-16       Impact factor: 4.580

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.