Literature DB >> 34669035

Interpretable machine learning for genomics.

David S Watson1.   

Abstract

High-throughput technologies such as next-generation sequencing allow biologists to observe cell function with unprecedented resolution, but the resulting datasets are too large and complicated for humans to understand without the aid of advanced statistical methods. Machine learning (ML) algorithms, which are designed to automatically find patterns in data, are well suited to this task. Yet these models are often so complex as to be opaque, leaving researchers with few clues about underlying mechanisms. Interpretable machine learning (iML) is a burgeoning subdiscipline of computational statistics devoted to making the predictions of ML models more intelligible to end users. This article is a gentle and critical introduction to iML, with an emphasis on genomic applications. I define relevant concepts, motivate leading methodologies, and provide a simple typology of existing approaches. I survey recent examples of iML in genomics, demonstrating how such techniques are increasingly integrated into research workflows. I argue that iML solutions are required to realize the promise of precision medicine. However, several open challenges remain. I examine the limitations of current state-of-the-art tools and propose a number of directions for future research. While the horizon for iML in genomics is wide and bright, continued progress requires close collaboration across disciplines.
© 2021. The Author(s).

Entities:  

Mesh:

Year:  2021        PMID: 34669035      PMCID: PMC8527313          DOI: 10.1007/s00439-021-02387-9

Source DB:  PubMed          Journal:  Hum Genet        ISSN: 0340-6717            Impact factor:   5.881


  39 in total

1.  Permutation importance: a corrected feature importance measure.

Authors:  André Altmann; Laura Toloşi; Oliver Sander; Thomas Lengauer
Journal:  Bioinformatics       Date:  2010-04-12       Impact factor: 6.937

2.  Causal inference and the data-fusion problem.

Authors:  Elias Bareinboim; Judea Pearl
Journal:  Proc Natl Acad Sci U S A       Date:  2016-07-05       Impact factor: 11.205

Review 3.  From Real-World Patient Data to Individualized Treatment Effects Using Machine Learning: Current and Future Methods to Address Underlying Challenges.

Authors:  Ioana Bica; Ahmed M Alaa; Craig Lambert; Mihaela van der Schaar
Journal:  Clin Pharmacol Ther       Date:  2020-06-28       Impact factor: 6.875

4.  Interpretable genotype-to-phenotype classifiers with performance guarantees.

Authors:  Alexandre Drouin; Gaël Letarte; Frédéric Raymond; Mario Marchand; Jacques Corbeil; François Laviolette
Journal:  Sci Rep       Date:  2019-03-11       Impact factor: 4.379

Review 5.  Opening the Black Box: Interpretable Machine Learning for Geneticists.

Authors:  Christina B Azodi; Jiliang Tang; Shin-Han Shiu
Journal:  Trends Genet       Date:  2020-04-17       Impact factor: 11.639

6.  Structural Racism, Social Risk Factors, and Covid-19 - A Dangerous Convergence for Black Americans.

Authors:  Leonard E Egede; Rebekah J Walker
Journal:  N Engl J Med       Date:  2020-07-22       Impact factor: 91.245

Review 7.  Random forests for genomic data analysis.

Authors:  Xi Chen; Hemant Ishwaran
Journal:  Genomics       Date:  2012-04-21       Impact factor: 5.736

8.  Machine learning for RNA sequencing-based intrinsic subtyping of breast cancer.

Authors:  Silvia Cascianelli; Ivan Molineris; Claudio Isella; Marco Masseroli; Enzo Medico
Journal:  Sci Rep       Date:  2020-08-21       Impact factor: 4.379

9.  eXplainable Artificial Intelligence (XAI) for the identification of biologically relevant gene expression patterns in longitudinal human studies, insights from obesity research.

Authors:  Augusto Anguita-Ruiz; Alberto Segura-Delgado; Rafael Alcalá; Concepción M Aguilera; Jesús Alcalá-Fdez
Journal:  PLoS Comput Biol       Date:  2020-04-10       Impact factor: 4.475

10.  Explainable AI reveals changes in skin microbiome composition linked to phenotypic differences.

Authors:  Anna Paola Carrieri; Niina Haiminen; Sean Maudsley-Barton; Laura-Jayne Gardiner; Barry Murphy; Andrew E Mayes; Sarah Paterson; Sally Grimshaw; Martyn Winn; Cameron Shand; Panagiotis Hadjidoukas; Will P M Rowe; Stacy Hawkins; Ashley MacGuire-Flanagan; Jane Tazzioli; John G Kenny; Laxmi Parida; Michael Hoptroff; Edward O Pyzer-Knapp
Journal:  Sci Rep       Date:  2021-02-25       Impact factor: 4.379

View more
  2 in total

1.  Mapping Data to Deep Understanding: Making the Most of the Deluge of SARS-CoV-2 Genome Sequences.

Authors:  Bahrad A Sokhansanj; Gail L Rosen
Journal:  mSystems       Date:  2022-03-21       Impact factor: 7.324

2.  Access to online learning: Machine learning analysis from a social justice perspective.

Authors:  Nora A McIntyre
Journal:  Educ Inf Technol (Dordr)       Date:  2022-10-04
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.