Literature DB >> 34805988

Transfer learning compensates limited data, batch effects and technological heterogeneity in single-cell sequencing.

Youngjun Park1, Anne-Christin Hauschild1, Dominik Heider1.   

Abstract

Tremendous advances in next-generation sequencing technology have enabled the accumulation of large amounts of omics data in various research areas over the past decade. However, study limitations due to small sample sizes, especially in rare disease clinical research, technological heterogeneity and batch effects limit the applicability of traditional statistics and machine learning analysis. Here, we present a meta-transfer learning approach to transfer knowledge from big data and reduce the search space in data with small sample sizes. Few-shot learning algorithms integrate meta-learning to overcome data scarcity and data heterogeneity by transferring molecular pattern recognition models from datasets of unrelated domains. We explore few-shot learning models with large scale public dataset, TCGA (The Cancer Genome Atlas) and GTEx dataset, and demonstrate their potential as pre-training dataset in other molecular pattern recognition tasks. Our results show that meta-transfer learning is very effective for datasets with a limited sample size. Furthermore, we show that our approach can transfer knowledge across technological heterogeneity, for example, from bulk cell to single-cell data. Our approach can overcome study size constraints, batch effects and technical limitations in analyzing single-cell data by leveraging existing bulk-cell sequencing data.
© The Author(s) 2021. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics.

Entities:  

Year:  2021        PMID: 34805988      PMCID: PMC8598306          DOI: 10.1093/nargab/lqab104

Source DB:  PubMed          Journal:  NAR Genom Bioinform        ISSN: 2631-9268


  23 in total

Review 1.  The technology and biology of single-cell RNA sequencing.

Authors:  Aleksandra A Kolodziejczyk; Jong Kyoung Kim; Valentine Svensson; John C Marioni; Sarah A Teichmann
Journal:  Mol Cell       Date:  2015-05-21       Impact factor: 17.970

2.  MARS: discovering novel cell types across heterogeneous single-cell experiments.

Authors:  Maria Brbić; Marinka Zitnik; Sheng Wang; Angela O Pisco; Russ B Altman; Spyros Darmanis; Jure Leskovec
Journal:  Nat Methods       Date:  2020-10-19       Impact factor: 28.547

3.  scVAE: variational auto-encoders for single-cell gene expression data.

Authors:  Christopher Heje Grønbech; Maximillian Fornitz Vording; Pascal N Timshel; Casper Kaae Sønderby; Tune H Pers; Ole Winther
Journal:  Bioinformatics       Date:  2020-08-15       Impact factor: 6.937

4.  Efficient integration of heterogeneous single-cell transcriptomes using Scanorama.

Authors:  Brian Hie; Bryan Bryson; Bonnie Berger
Journal:  Nat Biotechnol       Date:  2019-05-06       Impact factor: 54.908

5.  Meta-learning reduces the amount of data needed to build AI models in oncology.

Authors:  Olivier Gevaert
Journal:  Br J Cancer       Date:  2021-03-29       Impact factor: 7.640

Review 6.  A practical guide to single-cell RNA-sequencing for biomedical research and clinical applications.

Authors:  Ashraful Haque; Jessica Engel; Sarah A Teichmann; Tapio Lönnberg
Journal:  Genome Med       Date:  2017-08-18       Impact factor: 11.117

7.  Deep generative modeling for single-cell transcriptomics.

Authors:  Romain Lopez; Jeffrey Regier; Michael B Cole; Michael I Jordan; Nir Yosef
Journal:  Nat Methods       Date:  2018-11-30       Impact factor: 28.547

8.  Fast and precise single-cell data analysis using a hierarchical autoencoder.

Authors:  Duc Tran; Hung Nguyen; Bang Tran; Carlo La Vecchia; Hung N Luu; Tin Nguyen
Journal:  Nat Commun       Date:  2021-02-15       Impact factor: 14.919

9.  Transfer learning efficiently maps bone marrow cell types from mouse to human using single-cell RNA sequencing.

Authors:  Patrick S Stumpf; Xin Du; Haruka Imanishi; Yuya Kunisaki; Yuichiro Semba; Timothy Noble; Rosanna C G Smith; Matthew Rose-Zerili; Jonathan J West; Richard O C Oreffo; Katayoun Farrahi; Mahesan Niranjan; Koichi Akashi; Fumio Arai; Ben D MacArthur
Journal:  Commun Biol       Date:  2020-12-04

10.  SCENIC: single-cell regulatory network inference and clustering.

Authors:  Sara Aibar; Carmen Bravo González-Blas; Thomas Moerman; Vân Anh Huynh-Thu; Hana Imrichova; Gert Hulselmans; Florian Rambow; Jean-Christophe Marine; Pierre Geurts; Jan Aerts; Joost van den Oord; Zeynep Kalender Atak; Jasper Wouters; Stein Aerts
Journal:  Nat Methods       Date:  2017-10-09       Impact factor: 28.547

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.