Literature DB >> 30476175

Automated and flexible identification of complex disease: building a model for systemic lupus erythematosus using noisy labeling.

Sara G Murray1, Anand Avati2, Gabriela Schmajuk1,3, Jinoos Yazdany1.   

Abstract

Accurate and efficient identification of complex chronic conditions in the electronic health record (EHR) is an important but challenging task that has historically relied on tedious clinician review and oversimplification of the disease. Here we adapt methods that allow for automated "noisy labeling" of positive and negative controls to create a "silver standard" for machine learning to automate identification of systemic lupus erythematosus (SLE). Our final model, which includes both structured data as well as text processing of clinical notes, outperformed all existing algorithms for SLE (AUC 0.97). In addition, we demonstrate how the probabilistic outputs of this model can be adapted to various clinical needs, selecting high thresholds when specificity is the priority and lower thresholds when a more inclusive patient population is desired. Deploying a similar methodology to other complex diseases has the potential to dramatically simplify the landscape of population identification in the EHR. MeSH terms: Electronic Health Records, Machine Learning, Lupus Erythematosus, Phenotype, Algorithms.

Entities:  

Mesh:

Year:  2019        PMID: 30476175      PMCID: PMC6308013          DOI: 10.1093/jamia/ocy154

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


  9 in total

1.  Exploratory undersampling for class-imbalance learning.

Authors:  Xu-Ying Liu; Jianxin Wu; Zhi-Hua Zhou
Journal:  IEEE Trans Syst Man Cybern B Cybern       Date:  2008-12-16

2.  Updating the American College of Rheumatology revised criteria for the classification of systemic lupus erythematosus.

Authors:  M C Hochberg
Journal:  Arthritis Rheum       Date:  1997-09

3.  Portability of an algorithm to identify rheumatoid arthritis in electronic health records.

Authors:  Robert J Carroll; Will K Thompson; Anne E Eyler; Arthur M Mandelin; Tianxi Cai; Raquel M Zink; Jennifer A Pacheco; Chad S Boomershine; Thomas A Lasko; Hua Xu; Elizabeth W Karlson; Raul G Perez; Vivian S Gainer; Shawn N Murphy; Eric M Ruderman; Richard M Pope; Robert M Plenge; Abel Ngo Kho; Katherine P Liao; Joshua C Denny
Journal:  J Am Med Inform Assoc       Date:  2012-02-28       Impact factor: 4.497

4.  Developing Electronic Health Record Algorithms That Accurately Identify Patients With Systemic Lupus Erythematosus.

Authors:  April Barnado; Carolyn Casey; Robert J Carroll; Lee Wheless; Joshua C Denny; Leslie J Crofford
Journal:  Arthritis Care Res (Hoboken)       Date:  2017-04-10       Impact factor: 4.794

5.  Learning statistical models of phenotypes using noisy labeled training data.

Authors:  Vibhu Agarwal; Tanya Podchiyska; Juan M Banda; Veena Goel; Tiffany I Leung; Evan P Minty; Timothy E Sweeney; Elsie Gyang; Nigam H Shah
Journal:  J Am Med Inform Assoc       Date:  2016-05-12       Impact factor: 4.497

Review 6.  A systematic review of validated methods for identifying systemic lupus erythematosus (SLE) using administrative or claims data.

Authors:  Kevin G Moores; Nila A Sathe
Journal:  Vaccine       Date:  2013-12-30       Impact factor: 3.641

7.  Electronic medical records for discovery research in rheumatoid arthritis.

Authors:  Katherine P Liao; Tianxi Cai; Vivian Gainer; Sergey Goryachev; Qing Zeng-treitler; Soumya Raychaudhuri; Peter Szolovits; Susanne Churchill; Shawn Murphy; Isaac Kohane; Elizabeth W Karlson; Robert M Plenge
Journal:  Arthritis Care Res (Hoboken)       Date:  2010-08       Impact factor: 4.794

8.  The accuracy of administrative data diagnoses of systemic autoimmune rheumatic diseases.

Authors:  Sasha Bernatsky; Tina Linehan; John G Hanly
Journal:  J Rheumatol       Date:  2011-05-01       Impact factor: 4.666

9.  Development of phenotype algorithms using electronic medical records and incorporating natural language processing.

Authors:  Katherine P Liao; Tianxi Cai; Guergana K Savova; Shawn N Murphy; Elizabeth W Karlson; Ashwin N Ananthakrishnan; Vivian S Gainer; Stanley Y Shaw; Zongqi Xia; Peter Szolovits; Susanne Churchill; Isaac Kohane
Journal:  BMJ       Date:  2015-04-24
  9 in total
  11 in total

Review 1.  Machine Learning in Rheumatic Diseases.

Authors:  Mengdi Jiang; Yueting Li; Chendan Jiang; Lidan Zhao; Xuan Zhang; Peter E Lipsky
Journal:  Clin Rev Allergy Immunol       Date:  2021-02       Impact factor: 8.667

2.  Using a Multi-Institutional Pediatric Learning Health System to Identify Systemic Lupus Erythematosus and Lupus Nephritis: Development and Validation of Computable Phenotypes.

Authors:  Scott E Wenderfer; Joyce C Chang; Amy Goodwin Davies; Ingrid Y Luna; Rebecca Scobell; Cora Sears; Bliss Magella; Mark Mitsnefes; Brian R Stotter; Vikas R Dharnidharka; Katherine D Nowicki; Bradley P Dixon; Megan Kelton; Joseph T Flynn; Caroline Gluck; Mahmoud Kallash; William E Smoyer; Andrea Knight; Sangeeta Sule; Hanieh Razzaghi; L Charles Bailey; Susan L Furth; Christopher B Forrest; Michelle R Denburg; Meredith A Atkinson
Journal:  Clin J Am Soc Nephrol       Date:  2021-11-03       Impact factor: 8.237

Review 3.  Machine Learning Techniques for Personalised Medicine Approaches in Immune-Mediated Chronic Inflammatory Diseases: Applications and Challenges.

Authors:  Junjie Peng; Elizabeth C Jury; Pierre Dönnes; Coziana Ciurtin
Journal:  Front Pharmacol       Date:  2021-09-30       Impact factor: 5.810

4.  Diversity in Machine Learning: A Systematic Review of Text-Based Diagnostic Applications.

Authors:  Lane Fitzsimmons; Maya Dewan; Judith W Dexheimer
Journal:  Appl Clin Inform       Date:  2022-05-25       Impact factor: 2.762

Review 5.  An introduction to machine learning and analysis of its use in rheumatic diseases.

Authors:  Kathryn M Kingsmore; Christopher E Puglisi; Amrie C Grammer; Peter E Lipsky
Journal:  Nat Rev Rheumatol       Date:  2021-11-02       Impact factor: 20.543

6.  Comparative analysis, applications, and interpretation of electronic health record-based stroke phenotyping methods.

Authors:  Phyllis M Thangaraj; Benjamin R Kummer; Tal Lorberbaum; Mitchell S V Elkind; Nicholas P Tatonetti
Journal:  BioData Min       Date:  2020-12-07       Impact factor: 2.522

7.  Clinical Text Data Categorization and Feature Extraction Using Medical-Fissure Algorithm and Neg-Seq Algorithm.

Authors:  Naveen S Pagad; Pradeep N; Khalid K Almuzaini; Manish Maheshwari; Durgaprasad Gangodkar; Piyush Shukla; Musah Alhassan
Journal:  Comput Intell Neurosci       Date:  2022-03-07

8.  Identifying lupus patients in electronic health records: Development and validation of machine learning algorithms and application of rule-based algorithms.

Authors:  April Jorge; Victor M Castro; April Barnado; Vivian Gainer; Chuan Hong; Tianxi Cai; Tianrun Cai; Robert Carroll; Joshua C Denny; Leslie Crofford; Karen H Costenbader; Katherine P Liao; Elizabeth W Karlson; Candace H Feldman
Journal:  Semin Arthritis Rheum       Date:  2019-01-04       Impact factor: 5.532

9.  Development and validation of phenotype classifiers across multiple sites in the observational health data sciences and informatics network.

Authors:  Mehr Kashyap; Martin Seneviratne; Juan M Banda; Thomas Falconer; Borim Ryu; Sooyoung Yoo; George Hripcsak; Nigam H Shah
Journal:  J Am Med Inform Assoc       Date:  2020-06-01       Impact factor: 4.497

Review 10.  A systematic review of the applications of artificial intelligence and machine learning in autoimmune diseases.

Authors:  I S Stafford; M Kellermann; E Mossotto; R M Beattie; B D MacArthur; S Ennis
Journal:  NPJ Digit Med       Date:  2020-03-09
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.