Literature DB >> 29431517

Informatics and machine learning to define the phenotype.

Anna Okula Basile1, Marylyn DeRiggi Ritchie1,2.   

Abstract

INTRODUCTION: For the past decade, the focus of complex disease research has been the genotype. From technological advancements to the development of analysis methods, great progress has been made. However, advances in our definition of the phenotype have remained stagnant. Phenotype characterization has recently emerged as an exciting area of informatics and machine learning. The copious amounts of diverse biomedical data that have been collected may be leveraged with data-driven approaches to elucidate trait-related features and patterns. Areas covered: In this review, the authors discuss the phenotype in traditional genetic associations and the challenges this has imposed.Approaches for phenotype refinement that can aid in more accurate characterization of traits are also discussed. Further, the authors highlight promising machine learning approaches for establishing a phenotype and the challenges of electronic health record (EHR)-derived data. Expert commentary: The authors hypothesize that through unsupervised machine learning, data-driven approaches can be used to define phenotypes rather than relying on expert clinician knowledge. Through the use of machine learning and an unbiased set of features extracted from clinical repositories, researchers will have the potential to further understand complex traits and identify patient subgroups. This knowledge may lead to more preventative and precise clinical care.

Entities:  

Keywords:  Cluster analysis; complex traits; dimensionality reduction; electronic health records (EHRs); heterogeneity; machine learning; missing data; phenotype; topological analysis; unsupervised analysis

Mesh:

Year:  2018        PMID: 29431517      PMCID: PMC6080627          DOI: 10.1080/14737159.2018.1439380

Source DB:  PubMed          Journal:  Expert Rev Mol Diagn        ISSN: 1473-7159            Impact factor:   5.225


  63 in total

1.  A comparison of phenotype definitions for diabetes mellitus.

Authors:  Rachel L Richesson; Shelley A Rusincovitch; Douglas Wixted; Bryan C Batch; Mark N Feinglos; Marie Lynn Miranda; W Ed Hammond; Robert M Califf; Susan E Spratt
Journal:  J Am Med Inform Assoc       Date:  2013-09-11       Impact factor: 4.497

2.  Personal genomes: The case of the missing heritability.

Authors:  Brendan Maher
Journal:  Nature       Date:  2008-11-06       Impact factor: 49.962

3.  Using electronic medical records to enable large-scale studies in psychiatry: treatment resistant depression as a model.

Authors:  R H Perlis; D V Iosifescu; V M Castro; S N Murphy; V S Gainer; J Minnier; T Cai; S Goryachev; Q Zeng; P J Gallagher; M Fava; J B Weilburg; S E Churchill; I S Kohane; J W Smoller
Journal:  Psychol Med       Date:  2011-06-20       Impact factor: 7.723

4.  Improving case definition of Crohn's disease and ulcerative colitis in electronic medical records using natural language processing: a novel informatics approach.

Authors:  Ashwin N Ananthakrishnan; Tianxi Cai; Guergana Savova; Su-Chun Cheng; Pei Chen; Raul Guzman Perez; Vivian S Gainer; Shawn N Murphy; Peter Szolovits; Zongqi Xia; Stanley Shaw; Susanne Churchill; Elizabeth W Karlson; Isaac Kohane; Robert M Plenge; Katherine P Liao
Journal:  Inflamm Bowel Dis       Date:  2013-06       Impact factor: 5.325

5.  The eMERGE Network: a consortium of biorepositories linked to electronic medical records data for conducting genomic studies.

Authors:  Catherine A McCarty; Rex L Chisholm; Christopher G Chute; Iftikhar J Kullo; Gail P Jarvik; Eric B Larson; Rongling Li; Daniel R Masys; Marylyn D Ritchie; Dan M Roden; Jeffery P Struewing; Wendy A Wolf
Journal:  BMC Med Genomics       Date:  2011-01-26       Impact factor: 3.063

6.  Using electronic health records data to assess comorbidities of substance use and psychiatric diagnoses and treatment settings among adults.

Authors:  Li-Tzy Wu; Kenneth R Gersing; Marvin S Swartz; Bruce Burchett; Ting-Kai Li; Dan G Blazer
Journal:  J Psychiatr Res       Date:  2013-01-19       Impact factor: 4.791

7.  Detection of temporal lobe epilepsy using support vector machines in multi-parametric quantitative MR imaging.

Authors:  Diego Cantor-Rivera; Ali R Khan; Maged Goubran; Seyed M Mirsattari; Terry M Peters
Journal:  Comput Med Imaging Graph       Date:  2014-07-21       Impact factor: 4.790

8.  Secondary Use of EHR: Data Quality Issues and Informatics Opportunities.

Authors:  Taxiarchis Botsis; Gunnar Hartvigsen; Fei Chen; Chunhua Weng
Journal:  Summit Transl Bioinform       Date:  2010-03-01

9.  UK biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age.

Authors:  Cathie Sudlow; John Gallacher; Naomi Allen; Valerie Beral; Paul Burton; John Danesh; Paul Downey; Paul Elliott; Jane Green; Martin Landray; Bette Liu; Paul Matthews; Giok Ong; Jill Pell; Alan Silman; Alan Young; Tim Sprosen; Tim Peakman; Rory Collins
Journal:  PLoS Med       Date:  2015-03-31       Impact factor: 11.069

Review 10.  Exome sequencing and complex disease: practical aspects of rare variant association studies.

Authors:  Ron Do; Sekar Kathiresan; Gonçalo R Abecasis
Journal:  Hum Mol Genet       Date:  2012-09-13       Impact factor: 6.150

View more
  16 in total

Review 1.  Dissecting the Genetics of Osteoporosis using Systems Approaches.

Authors:  Basel M Al-Barghouthi; Charles R Farber
Journal:  Trends Genet       Date:  2018-11-20       Impact factor: 11.639

Review 2.  Artificial intelligence and machine learning in precision and genomic medicine.

Authors:  Sameer Quazi
Journal:  Med Oncol       Date:  2022-06-15       Impact factor: 3.738

3.  EPIMUTESTR: a nearest neighbor machine learning approach to predict cancer driver genes from the evolutionary action of coding variants.

Authors:  Saeid Parvandeh; Lawrence A Donehower; Katsonis Panagiotis; Teng-Kuei Hsu; Jennifer K Asmussen; Kwanghyuk Lee; Olivier Lichtarge
Journal:  Nucleic Acids Res       Date:  2022-07-08       Impact factor: 19.160

4.  High-throughput phenotyping with electronic medical record data using a common semi-supervised approach (PheCAP).

Authors:  Yichi Zhang; Tianrun Cai; Sheng Yu; Kelly Cho; Chuan Hong; Jiehuan Sun; Jie Huang; Yuk-Lam Ho; Ashwin N Ananthakrishnan; Zongqi Xia; Stanley Y Shaw; Vivian Gainer; Victor Castro; Nicholas Link; Jacqueline Honerlaw; Sicong Huang; David Gagnon; Elizabeth W Karlson; Robert M Plenge; Peter Szolovits; Guergana Savova; Susanne Churchill; Christopher O'Donnell; Shawn N Murphy; J Michael Gaziano; Isaac Kohane; Tianxi Cai; Katherine P Liao
Journal:  Nat Protoc       Date:  2019-11-20       Impact factor: 13.491

5.  Accuracy of identifying hospital acquired venous thromboembolism by administrative coding: implications for big data and machine learning research.

Authors:  Tiffany Pellathy; Melissa Saul; Gilles Clermont; Artur W Dubrawski; Michael R Pinsky; Marilyn Hravnak
Journal:  J Clin Monit Comput       Date:  2021-02-08       Impact factor: 1.977

6.  COVID-19 Clinical Phenotypes: Presentation and Temporal Progression of Disease in a Cohort of Hospitalized Adults in Georgia, United States.

Authors:  Juliana F da Silva; Alfonso C Hernandez-Romieu; Sean D Browning; Beau B Bruce; Pavithra Natarajan; Sapna B Morris; Jeremy A W Gold; Robyn Neblett Fanfair; Jessica Rogers-Brown; John Rossow; Christine M Szablewski; Nadine Oosmanally; Melissa Tobin D'Angelo; Cherie Drenzek; David J Murphy; Julie Hollberg; James M Blum; Robert Jansen; David W Wright; William Sewell; Jack Owens; Benjamin Lefkove; Frank W Brown; Deron C Burton; Timothy M Uyeki; Priti R Patel; Brendan R Jackson; Karen K Wong
Journal:  Open Forum Infect Dis       Date:  2020-12-07       Impact factor: 3.835

7.  Predictive Analytics for Care and Management of Patients With Acute Diseases: Deep Learning-Based Method to Predict Crucial Complication Phenotypes.

Authors:  Jessica Qiuhua Sheng; Paul Jen-Hwa Hu; Xiao Liu; Ting-Shuo Huang; Yu Hsien Chen
Journal:  J Med Internet Res       Date:  2021-02-12       Impact factor: 5.428

8.  Unsupervised machine learning and prognostic factors of survival in chronic lymphocytic leukemia.

Authors:  Caitlin E Coombes; Zachary B Abrams; Suli Li; Lynne V Abruzzo; Kevin R Coombes
Journal:  J Am Med Inform Assoc       Date:  2020-07-01       Impact factor: 4.497

9.  Characterisation, identification, clustering, and classification of disease.

Authors:  A J Webster; K Gaitskell; I Turnbull; B J Cairns; R Clarke
Journal:  Sci Rep       Date:  2021-03-08       Impact factor: 4.379

10.  Data-Driven Patient Clustering and Differential Clinical Outcomes in the Brigham and Women's Rheumatoid Arthritis Sequential Study Registry.

Authors:  Jeffrey R Curtis; Michael Weinblatt; Kenneth Saag; Vivian P Bykerk; Daniel E Furst; Stefano Fiore; Gregory St John; Toshio Kimura; Shen Zheng; Clifton O Bingham; Grace Wright; Martin Bergman; Kamala Nola; Christina Charles-Schoeman; Nancy Shadick
Journal:  Arthritis Care Res (Hoboken)       Date:  2021-03-13       Impact factor: 4.794

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.