Literature DB >> 31121115

MultiPLIER: A Transfer Learning Framework for Transcriptomics Reveals Systemic Features of Rare Disease.

Jaclyn N Taroni1, Peter C Grayson2, Qiwen Hu3, Sean Eddy4, Matthias Kretzler5, Peter A Merkel6, Casey S Greene7.   

Abstract

Most gene expression datasets generated by individual researchers are too small to fully benefit from unsupervised machine-learning methods. In the case of rare diseases, there may be too few cases available, even when multiple studies are combined. To address this challenge, we utilize transfer learning to extract coordinated expression patterns and use learned patterns to analyze small rare disease datasets. We trained a pathway-level information extractor (PLIER) model on a large public data compendium comprising multiple experiments, tissues, and biological conditions and then transferred the model to small datasets in an approach we call MultiPLIER. Models constructed from the public data compendium included features that aligned well to known biological factors and were more comprehensive than those constructed from individual datasets or conditions. When transferred to rare disease datasets, the models describe biological processes related to disease severity more effectively than models trained only on a given dataset.
Copyright © 2019 The Author(s). Published by Elsevier Inc. All rights reserved.

Entities:  

Keywords:  genomics; machine learning; medulloblastoma; rare diseases; transcriptomics; transfer learning; unsupervised learning; vasculitis

Mesh:

Year:  2019        PMID: 31121115      PMCID: PMC6538307          DOI: 10.1016/j.cels.2019.04.003

Source DB:  PubMed          Journal:  Cell Syst        ISSN: 2405-4712            Impact factor:   10.304


  39 in total

1.  Gene Expression Omnibus: NCBI gene expression and hybridization array data repository.

Authors:  Ron Edgar; Michael Domrachev; Alex E Lash
Journal:  Nucleic Acids Res       Date:  2002-01-01       Impact factor: 16.971

2.  Biclustering via sparse singular value decomposition.

Authors:  Mihee Lee; Haipeng Shen; Jianhua Z Huang; J S Marron
Journal:  Biometrics       Date:  2010-12       Impact factor: 2.571

Review 3.  Neutrophils, from marrow to microbes.

Authors:  Niels Borregaard
Journal:  Immunity       Date:  2010-11-24       Impact factor: 31.745

4.  Immune response in silico (IRIS): immune-specific genes identified from a compendium of microarray expression data.

Authors:  A R Abbas; D Baldwin; Y Ma; W Ouyang; A Gurney; F Martin; S Fong; M van Lookeren Campagne; P Godowski; P M Williams; A C Chan; H F Clark
Journal:  Genes Immun       Date:  2005-06       Impact factor: 2.676

5.  Adjusting batch effects in microarray expression data using empirical Bayes methods.

Authors:  W Evan Johnson; Cheng Li; Ariel Rabinovic
Journal:  Biostatistics       Date:  2006-04-21       Impact factor: 5.899

6.  A penalized matrix decomposition, with applications to sparse principal components and canonical correlation analysis.

Authors:  Daniela M Witten; Robert Tibshirani; Trevor Hastie
Journal:  Biostatistics       Date:  2009-04-17       Impact factor: 5.899

7.  Modeling sample variables with an Experimental Factor Ontology.

Authors:  James Malone; Ele Holloway; Tomasz Adamusiak; Misha Kapushesky; Jie Zheng; Nikolay Kolesnikov; Anna Zhukova; Alvis Brazma; Helen Parkinson
Journal:  Bioinformatics       Date:  2010-03-03       Impact factor: 6.937

8.  Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles.

Authors:  Aravind Subramanian; Pablo Tamayo; Vamsi K Mootha; Sayan Mukherjee; Benjamin L Ebert; Michael A Gillette; Amanda Paulovich; Scott L Pomeroy; Todd R Golub; Eric S Lander; Jill P Mesirov
Journal:  Proc Natl Acad Sci U S A       Date:  2005-09-30       Impact factor: 11.205

9.  Transcription of proteinase 3 and related myelopoiesis genes in peripheral blood mononuclear cells of patients with active Wegener's granulomatosis.

Authors:  Chris Cheadle; Alan E Berger; Felipe Andrade; Regina James; Kristen Johnson; Tonya Watkins; Jin Kyun Park; Yu-Chi Chen; Eva Ehrlich; Marissa Mullins; Francis Chrest; Kathleen C Barnes; Stuart M Levine
Journal:  Arthritis Rheum       Date:  2010-06

10.  A modular analysis framework for blood genomics studies: application to systemic lupus erythematosus.

Authors:  Damien Chaussabel; Charles Quinn; Jing Shen; Pinakeen Patel; Casey Glaser; Nicole Baldwin; Dorothee Stichweh; Derek Blankenship; Lei Li; Indira Munagala; Lynda Bennett; Florence Allantaz; Asuncion Mejias; Monica Ardura; Ellen Kaizer; Laurence Monnet; Windy Allman; Henry Randall; Diane Johnson; Aimee Lanier; Marilynn Punaro; Knut M Wittkowski; Perrin White; Joseph Fay; Goran Klintmalm; Octavio Ramilo; A Karolina Palucka; Jacques Banchereau; Virginia Pascual
Journal:  Immunity       Date:  2008-07-18       Impact factor: 31.745

View more
  24 in total

1.  Decomposing Cell Identity for Transfer Learning across Cellular Measurements, Platforms, Tissues, and Species.

Authors:  Genevieve L Stein-O'Brien; Brian S Clark; Thomas Sherman; Cristina Zibetti; Qiwen Hu; Rachel Sealfon; Sheng Liu; Jiang Qian; Carlo Colantuoni; Seth Blackshaw; Loyal A Goff; Elana J Fertig
Journal:  Cell Syst       Date:  2019-05-22       Impact factor: 10.304

2.  projectR: an R/Bioconductor package for transfer learning via PCA, NMF, correlation and clustering.

Authors:  Gaurav Sharma; Carlo Colantuoni; Loyal A Goff; Elana J Fertig; Genevieve Stein-O'Brien
Journal:  Bioinformatics       Date:  2020-06-01       Impact factor: 6.937

3.  Parameterized algorithms for identifying gene co-expression modules via weighted clique decomposition.

Authors:  Madison Cooley; Casey S Greene; Davis Issac; Milton Pividori; Blair D Sullivan
Journal:  Proc 2021 SIAM Conf Appl Comput Discret Algorithms (2021)       Date:  2021

4.  Proteogenomics reveals sex-biased aging genes and coordinated splicing in cardiac aging.

Authors:  Yu Han; Sara A Wennersten; Julianna M Wright; R W Ludwig; Edward Lau; Maggie P Y Lam
Journal:  Am J Physiol Heart Circ Physiol       Date:  2022-08-05       Impact factor: 5.125

5.  KLFDAPC: a supervised machine learning approach for spatial genetic structure analysis.

Authors:  Xinghu Qin; Charleston W K Chiang; Oscar E Gaggiotti
Journal:  Brief Bioinform       Date:  2022-07-18       Impact factor: 13.994

6.  GenomicSuperSignature facilitates interpretation of RNA-seq experiments through robust, efficient comparison to public databases.

Authors:  Sehyun Oh; Ludwig Geistlinger; Marcel Ramos; Daniel Blankenberg; Marius van den Beek; Jaclyn N Taroni; Vincent J Carey; Casey S Greene; Levi Waldron; Sean Davis
Journal:  Nat Commun       Date:  2022-06-27       Impact factor: 17.694

7.  Deep learning of a bacterial and archaeal universal language of life enables transfer learning and illuminates microbial dark matter.

Authors:  A Hoarfrost; A Aptekmann; G Farfañuk; Y Bromberg
Journal:  Nat Commun       Date:  2022-05-11       Impact factor: 17.694

Review 8.  New Developments and Possibilities in Reanalysis and Reinterpretation of Whole Exome Sequencing Datasets for Unsolved Rare Diseases Using Machine Learning Approaches.

Authors:  Samarth Thonta Setty; Marie-Pier Scott-Boyer; Tania Cuppens; Arnaud Droit
Journal:  Int J Mol Sci       Date:  2022-06-18       Impact factor: 6.208

9.  Topological network measures for drug repositioning.

Authors:  Apurva Badkas; Sébastien De Landtsheer; Thomas Sauter
Journal:  Brief Bioinform       Date:  2021-07-20       Impact factor: 11.622

Review 10.  Responsible, practical genomic data sharing that accelerates research.

Authors:  James Brian Byrd; Anna C Greene; Deepashree Venkatesh Prasad; Xiaoqian Jiang; Casey S Greene
Journal:  Nat Rev Genet       Date:  2020-07-21       Impact factor: 53.242

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.