Literature DB >> 31038689

LAmbDA: label ambiguous domain adaptation dataset integration reduces batch effects and improves subtype detection.

Travis S Johnson1,2, Tongxin Wang2,3, Zhi Huang2,4, Christina Y Yu1,2, Yi Wu2, Yatong Han5, Yan Zhang1,6, Kun Huang2,7, Jie Zhang8.   

Abstract

MOTIVATION: Rapid advances in single cell RNA sequencing (scRNA-seq) have produced higher-resolution cellular subtypes in multiple tissues and species. Methods are increasingly needed across datasets and species to (i) remove systematic biases, (ii) model multiple datasets with ambiguous labels and (iii) classify cells and map cell type labels. However, most methods only address one of these problems on broad cell types or simulated data using a single model type. It is also important to address higher-resolution cellular subtypes, subtype labels from multiple datasets, models trained on multiple datasets simultaneously and generalizability beyond a single model type.
RESULTS: We developed a species- and dataset-independent transfer learning framework (LAmbDA) to train models on multiple datasets (even from different species) and applied our framework on simulated, pancreas and brain scRNA-seq experiments. These models mapped corresponding cell types between datasets with inconsistent cell subtype labels while simultaneously reducing batch effects. We achieved high accuracy in labeling cellular subtypes (weighted accuracy simulated 1 datasets: 90%; simulated 2 datasets: 94%; pancreas datasets: 88% and brain datasets: 66%) using LAmbDA Feedforward 1 Layer Neural Network with bagging. This method achieved higher weighted accuracy in labeling cellular subtypes than two other state-of-the-art methods, scmap and CaSTLe in brain (66% versus 60% and 32%). Furthermore, it achieved better performance in correctly predicting ambiguous cellular subtype labels across datasets in 88% of test cases compared with CaSTLe (63%), scmap (50%) and MetaNeighbor (50%). LAmbDA is model- and dataset-independent and generalizable to diverse data types representing an advance in biocomputing.
AVAILABILITY AND IMPLEMENTATION: github.com/tsteelejohnson91/LAmbDA. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2019. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Year:  2019        PMID: 31038689      PMCID: PMC6853662          DOI: 10.1093/bioinformatics/btz295

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  37 in total

1.  Brain structure. Cell types in the mouse cortex and hippocampus revealed by single-cell RNA-seq.

Authors:  Amit Zeisel; Ana B Muñoz-Manchado; Simone Codeluppi; Peter Lönnerberg; Gioele La Manno; Anna Juréus; Sueli Marques; Hermany Munguba; Liqun He; Christer Betsholtz; Charlotte Rolny; Gonçalo Castelo-Branco; Jens Hjerling-Leffler; Sten Linnarsson
Journal:  Science       Date:  2015-02-19       Impact factor: 47.728

2.  ClusterMap: compare multiple single cell RNA-Seq datasets across different experimental conditions.

Authors:  Xin Gao; Deqing Hu; Madelaine Gogol; Hua Li
Journal:  Bioinformatics       Date:  2019-09-01       Impact factor: 6.937

3.  An RNA-sequencing transcriptome and splicing database of glia, neurons, and vascular cells of the cerebral cortex.

Authors:  Ye Zhang; Kenian Chen; Steven A Sloan; Mariko L Bennett; Anja R Scholze; Sean O'Keeffe; Hemali P Phatnani; Paolo Guarnieri; Christine Caneda; Nadine Ruderisch; Shuyun Deng; Shane A Liddelow; Chaolin Zhang; Richard Daneman; Tom Maniatis; Ben A Barres; Jian Qian Wu
Journal:  J Neurosci       Date:  2014-09-03       Impact factor: 6.167

Review 4.  Pancreatic islet cell hormones distribution of cell types in the islet and evidence for the presence of somatostatin and gastrin within the D cell.

Authors:  S L Erlandsen; O D Hegre; J A Parsons; R C McEvoy; R P Elde
Journal:  J Histochem Cytochem       Date:  1976-07       Impact factor: 2.479

5.  A Single-Cell Transcriptomic Map of the Human and Mouse Pancreas Reveals Inter- and Intra-cell Population Structure.

Authors:  Maayan Baron; Adrian Veres; Samuel L Wolock; Aubrey L Faust; Renaud Gaujoux; Amedeo Vetere; Jennifer Hyoje Ryu; Bridget K Wagner; Shai S Shen-Orr; Allon M Klein; Douglas A Melton; Itai Yanai
Journal:  Cell Syst       Date:  2016-09-22       Impact factor: 10.304

6.  Mapping identifiers for the integration of genomic datasets with the R/Bioconductor package biomaRt.

Authors:  Steffen Durinck; Paul T Spellman; Ewan Birney; Wolfgang Huber
Journal:  Nat Protoc       Date:  2009-07-23       Impact factor: 13.491

7.  Using neural networks for reducing the dimensions of single-cell RNA-Seq data.

Authors:  Chieh Lin; Siddhartha Jain; Hannah Kim; Ziv Bar-Joseph
Journal:  Nucleic Acids Res       Date:  2017-09-29       Impact factor: 16.971

8.  Splatter: simulation of single-cell RNA sequencing data.

Authors:  Luke Zappia; Belinda Phipson; Alicia Oshlack
Journal:  Genome Biol       Date:  2017-09-12       Impact factor: 13.583

9.  CaSTLe - Classification of single cells by transfer learning: Harnessing the power of publicly available single cell RNA sequencing experiments to annotate new experiments.

Authors:  Yuval Lieberman; Lior Rokach; Tal Shay
Journal:  PLoS One       Date:  2018-10-10       Impact factor: 3.240

10.  cellHarmony: cell-level matching and holistic comparison of single-cell transcriptomes.

Authors:  Erica A K DePasquale; Daniel Schnell; Phillip Dexheimer; Kyle Ferchen; Stuart Hay; Kashish Chetal; Íñigo Valiente-Alandí; Burns C Blaxall; H Leighton Grimes; Nathan Salomonis
Journal:  Nucleic Acids Res       Date:  2019-12-02       Impact factor: 16.971

View more
  10 in total

1.  BERMUDA: a novel deep transfer learning method for single-cell RNA sequencing batch correction reveals hidden high-resolution cellular subtypes.

Authors:  Tongxin Wang; Travis S Johnson; Wei Shao; Zixiao Lu; Bryan R Helm; Jie Zhang; Kun Huang
Journal:  Genome Biol       Date:  2019-08-12       Impact factor: 13.583

Review 2.  Eleven grand challenges in single-cell data science.

Authors:  David Lähnemann; Johannes Köster; Ewa Szczurek; Davis J McCarthy; Stephanie C Hicks; Mark D Robinson; Catalina A Vallejos; Kieran R Campbell; Niko Beerenwinkel; Ahmed Mahfouz; Luca Pinello; Pavel Skums; Alexandros Stamatakis; Camille Stephan-Otto Attolini; Samuel Aparicio; Jasmijn Baaijens; Marleen Balvert; Buys de Barbanson; Antonio Cappuccio; Giacomo Corleone; Bas E Dutilh; Maria Florescu; Victor Guryev; Rens Holmer; Katharina Jahn; Thamar Jessurun Lobo; Emma M Keizer; Indu Khatri; Szymon M Kielbasa; Jan O Korbel; Alexey M Kozlov; Tzu-Hao Kuo; Boudewijn P F Lelieveldt; Ion I Mandoiu; John C Marioni; Tobias Marschall; Felix Mölder; Amir Niknejad; Lukasz Raczkowski; Marcel Reinders; Jeroen de Ridder; Antoine-Emmanuel Saliba; Antonios Somarakis; Oliver Stegle; Fabian J Theis; Huan Yang; Alex Zelikovsky; Alice C McHardy; Benjamin J Raphael; Sohrab P Shah; Alexander Schönhuth
Journal:  Genome Biol       Date:  2020-02-07       Impact factor: 13.583

Review 3.  Computational methods for the integrative analysis of single-cell data.

Authors:  Mattia Forcato; Oriana Romano; Silvio Bicciato
Journal:  Brief Bioinform       Date:  2021-01-18       Impact factor: 11.622

4.  A comparison of automatic cell identification methods for single-cell RNA sequencing data.

Authors:  Tamim Abdelaal; Lieke Michielsen; Davy Cats; Dylan Hoogduin; Hailiang Mei; Marcel J T Reinders; Ahmed Mahfouz
Journal:  Genome Biol       Date:  2019-09-09       Impact factor: 13.583

5.  Acidic leucine-rich nuclear phosphoprotein-32A expression contributes to adverse outcome in acute myeloid leukemia.

Authors:  Sai Huang; Zhi Huang; Chao Ma; Lan Luo; Yan-Fen Li; Yong-Li Wu; Yuan Ren; Cong Feng
Journal:  Ann Transl Med       Date:  2020-03

6.  Using transfer learning from prior reference knowledge to improve the clustering of single-cell RNA-Seq data.

Authors:  Bettina Mieth; James R F Hockley; Nico Görnitz; Marina M-C Vidovic; Klaus-Robert Müller; Alex Gutteridge; Daniel Ziemek
Journal:  Sci Rep       Date:  2019-12-30       Impact factor: 4.379

7.  Spatial cell type composition in normal and Alzheimers human brains is revealed using integrated mouse and human single cell RNA sequencing.

Authors:  Travis S Johnson; Shunian Xiang; Bryan R Helm; Zachary B Abrams; Peter Neidecker; Raghu Machiraju; Yan Zhang; Kun Huang; Jie Zhang
Journal:  Sci Rep       Date:  2020-10-22       Impact factor: 4.379

8.  scSorter: assigning cells to known cell types according to marker genes.

Authors:  Hongyu Guo; Jun Li
Journal:  Genome Biol       Date:  2021-02-22       Impact factor: 13.583

Review 9.  Automated methods for cell type annotation on scRNA-seq data.

Authors:  Giovanni Pasquini; Jesus Eduardo Rojo Arias; Patrick Schäfer; Volker Busskamp
Journal:  Comput Struct Biotechnol J       Date:  2021-01-19       Impact factor: 7.271

10.  Diagnostic Evidence GAuge of Single cells (DEGAS): a flexible deep transfer learning framework for prioritizing cells in relation to disease.

Authors:  Travis S Johnson; Christina Y Yu; Zhi Huang; Siwen Xu; Tongxin Wang; Chuanpeng Dong; Wei Shao; Mohammad Abu Zaid; Xiaoqing Huang; Yijie Wang; Christopher Bartlett; Yan Zhang; Brian A Walker; Yunlong Liu; Kun Huang; Jie Zhang
Journal:  Genome Med       Date:  2022-02-01       Impact factor: 11.117

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.