Vinay S Swamy1, Temesgen D Fufa2, Robert B Hufnagel2, David M McGaughey1. 1. Bioinformatics Group, Ophthalmic Genetics & Visual Function Branch, National Eye Institute, National Institutes of Health, 20892, Bethesda, Maryland, USA. 2. Medical Genetics and Ophthalmic Genomics Unit, National Eye Institute, National Institutes of Health, 20892, Bethesda, Maryland, USA.
Abstract
BACKGROUND: The development of highly scalable single-cell transcriptome technology has resulted in the creation of thousands of datasets, >30 in the retina alone. Analyzing the transcriptomes between different projects is highly desirable because this would allow for better assessment of which biological effects are consistent across independent studies. However it is difficult to compare and contrast data across different projects because there are substantial batch effects from computational processing, single-cell technology utilized, and the natural biological variation. While many single-cell transcriptome-specific batch correction methods purport to remove the technical noise, it is difficult to ascertain which method functions best. RESULTS: We developed a lightweight R package (scPOP, single-cell Pick Optimal Parameters) that brings in batch integration methods and uses a simple heuristic to balance batch merging and cell type/cluster purity. We use this package along with a Snakefile-based workflow system to demonstrate how to optimally merge 766,615 cells from 33 retina datsets and 3 species to create a massive ocular single-cell transcriptome meta-atlas. CONCLUSIONS: This provides a model for how to efficiently create meta-atlases for tissues and cells of interest. Published by Oxford University Press on behalf of GigaScience 2021.
BACKGROUND: The development of highly scalable single-cell transcriptome technology has resulted in the creation of thousands of datasets, >30 in the retina alone. Analyzing the transcriptomes between different projects is highly desirable because this would allow for better assessment of which biological effects are consistent across independent studies. However it is difficult to compare and contrast data across different projects because there are substantial batch effects from computational processing, single-cell technology utilized, and the natural biological variation. While many single-cell transcriptome-specific batch correction methods purport to remove the technical noise, it is difficult to ascertain which method functions best. RESULTS: We developed a lightweight R package (scPOP, single-cell Pick Optimal Parameters) that brings in batch integration methods and uses a simple heuristic to balance batch merging and cell type/cluster purity. We use this package along with a Snakefile-based workflow system to demonstrate how to optimally merge 766,615 cells from 33 retina datsets and 3 species to create a massive ocular single-cell transcriptome meta-atlas. CONCLUSIONS: This provides a model for how to efficiently create meta-atlases for tissues and cells of interest. Published by Oxford University Press on behalf of GigaScience 2021.
Authors: Karl J Wahlin; Lynette Lim; Elizabeth A Grice; Peter A Campochiaro; Donald J Zack; Ruben Adler Journal: Mol Vis Date: 2004-06-03 Impact factor: 2.367
Authors: Evan Z Macosko; Anindita Basu; Rahul Satija; James Nemesh; Karthik Shekhar; Melissa Goldman; Itay Tirosh; Allison R Bialas; Nolan Kamitaki; Emily M Martersteck; John J Trombetta; David A Weitz; Joshua R Sanes; Alex K Shalek; Aviv Regev; Steven A McCarroll Journal: Cell Date: 2015-05-21 Impact factor: 41.582
Authors: Nicholas M Tran; Karthik Shekhar; Irene E Whitney; Anne Jacobi; Inbal Benhar; Guosong Hong; Wenjun Yan; Xian Adiconis; McKinzie E Arnold; Jung Min Lee; Joshua Z Levin; Dingchang Lin; Chen Wang; Charles M Lieber; Aviv Regev; Zhigang He; Joshua R Sanes Journal: Neuron Date: 2019-11-26 Impact factor: 17.173
Authors: Andrew P Voigt; Elaine Binkley; Miles J Flamme-Wiese; Shemin Zeng; Adam P DeLuca; Todd E Scheetz; Budd A Tucker; Robert F Mullins; Edwin M Stone Journal: Cells Date: 2020-02-13 Impact factor: 6.600
Authors: Guillermo L Lehmann; Christin Hanke-Gogokhia; Yang Hu; Rohan Bareja; Zelda Salfati; Michael Ginsberg; Daniel J Nolan; Santiago P Mendez-Huergo; Tomas Dalotto-Moreno; Alexandre Wojcinski; Francisca Ochoa; Shemin Zeng; Juan P Cerliani; Lampros Panagis; Patrick J Zager; Robert F Mullins; Shuntaro Ogura; Gerard A Lutty; Jakyung Bang; Jonathan H Zippin; Carmelo Romano; Gabriel A Rabinovich; Olivier Elemento; Alexandra L Joyner; Shahin Rafii; Enrique Rodriguez-Boulan; Ignacio Benedicto Journal: J Exp Med Date: 2020-06-01 Impact factor: 14.307