Literature DB >> 33658382

Integration and transfer learning of single-cell transcriptomes via cFIT.

Minshi Peng1, Yue Li1, Brie Wamsley2, Yuting Wei1, Kathryn Roeder3,4.   

Abstract

Large, comprehensive collections of single-cell RNA sequencing (scRNA-seq) datasets have been generated that allow for the full transcriptional characterization of cell types across a wide variety of biological and clinical conditions. As new methods arise to measure distinct cellular modalities, a key analytical challenge is to integrate these datasets or transfer knowledge from one to the other to better understand cellular identity and functions. Here, we present a simple yet surprisingly effective method named common factor integration and transfer learning (cFIT) for capturing various batch effects across experiments, technologies, subjects, and even species. The proposed method models the shared information between various datasets by a common factor space while allowing for unique distortions and shifts in genewise expression in each batch. The model parameters are learned under an iterative nonnegative matrix factorization (NMF) framework and then used for synchronized integration from across-domain assays. In addition, the model enables transferring via low-rank matrix from more informative data to allow for precise identification in data of lower quality. Compared with existing approaches, our method imposes weaker assumptions on the cell composition of each individual dataset; however, it is shown to be more reliable in preserving biological variations. We apply cFIT to multiple scRNA-seq datasets of developing brain from human and mouse, varying by technologies and developmental stages. The successful integration and transfer uncover the transcriptional resemblance across systems. The study helps establish a comprehensive landscape of brain cell-type diversity and provides insights into brain development.
Copyright © 2021 the Author(s). Published by PNAS.

Entities:  

Keywords:  brain cells; data integration; single-cell RNA-seq; transfer learning

Year:  2021        PMID: 33658382     DOI: 10.1073/pnas.2024383118

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  5 in total

1.  Rare coding variation provides insight into the genetic architecture and phenotypic context of autism.

Authors:  Jack M Fu; F Kyle Satterstrom; Minshi Peng; Harrison Brand; Ryan L Collins; Shan Dong; Brie Wamsley; Lambertus Klei; Lily Wang; Stephanie P Hao; Christine R Stevens; Caroline Cusick; Mehrtash Babadi; Eric Banks; Brett Collins; Sheila Dodge; Stacey B Gabriel; Laura Gauthier; Samuel K Lee; Lindsay Liang; Alicia Ljungdahl; Behrang Mahjani; Laura Sloofman; Andrey N Smirnov; Mafalda Barbosa; Catalina Betancur; Alfredo Brusco; Brian H Y Chung; Edwin H Cook; Michael L Cuccaro; Enrico Domenici; Giovanni Battista Ferrero; J Jay Gargus; Gail E Herman; Irva Hertz-Picciotto; Patricia Maciel; Dara S Manoach; Maria Rita Passos-Bueno; Antonio M Persico; Alessandra Renieri; James S Sutcliffe; Flora Tassone; Elisabetta Trabetti; Gabriele Campos; Simona Cardaropoli; Diana Carli; Marcus C Y Chan; Chiara Fallerini; Elisa Giorgio; Ana Cristina Girardi; Emily Hansen-Kiss; So Lun Lee; Carla Lintas; Yunin Ludena; Rachel Nguyen; Lisa Pavinato; Margaret Pericak-Vance; Isaac N Pessah; Rebecca J Schmidt; Moyra Smith; Claudia I S Costa; Slavica Trajkova; Jaqueline Y T Wang; Mullin H C Yu; David J Cutler; Silvia De Rubeis; Joseph D Buxbaum; Mark J Daly; Bernie Devlin; Kathryn Roeder; Stephan J Sanders; Michael E Talkowski
Journal:  Nat Genet       Date:  2022-08-18       Impact factor: 41.307

2.  GenomicSuperSignature facilitates interpretation of RNA-seq experiments through robust, efficient comparison to public databases.

Authors:  Sehyun Oh; Ludwig Geistlinger; Marcel Ramos; Daniel Blankenberg; Marius van den Beek; Jaclyn N Taroni; Vincent J Carey; Casey S Greene; Levi Waldron; Sean Davis
Journal:  Nat Commun       Date:  2022-06-27       Impact factor: 17.694

3.  Transfer learning between preclinical models and human tumors identifies a conserved NK cell activation signature in anti-CTLA-4 responsive tumors.

Authors:  Emily F Davis-Marcisak; Allison A Fitzgerald; Michael D Kessler; Ludmila Danilova; Elizabeth M Jaffee; Neeha Zaidi; Louis M Weiner; Elana J Fertig
Journal:  Genome Med       Date:  2021-08-11       Impact factor: 15.266

4.  Cell type hierarchy reconstruction via reconciliation of multi-resolution cluster tree.

Authors:  Minshi Peng; Brie Wamsley; Andrew G Elkins; Daniel H Geschwind; Yuting Wei; Kathryn Roeder
Journal:  Nucleic Acids Res       Date:  2021-09-20       Impact factor: 16.971

5.  Biologically relevant transfer learning improves transcription factor binding prediction.

Authors:  Gherman Novakovsky; Manu Saraswat; Oriol Fornes; Sara Mostafavi; Wyeth W Wasserman
Journal:  Genome Biol       Date:  2021-09-27       Impact factor: 13.583

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.