Matthew D Young1, Sam Behjati1,2,3. 1. Wellcome Trust Sanger Institute, Cellular Genetics, Wellcome Genome Campus, Hinxton, CB10 1SA, UK. 2. Cambridge University Hospitals NHS Foundation Trust, Hills Road, Cambridge, CB2 0QQ, UK. 3. University of Cambridge, Department of Paediatrics, Cambridge Biomedical Campus, Cambridge, CB2 0QQ, UK.
Abstract
BACKGROUND: Droplet-based single-cell RNA sequence analyses assume that all acquired RNAs are endogenous to cells. However, any cell-free RNAs contained within the input solution are also captured by these assays. This sequencing of cell-free RNA constitutes a background contamination that confounds the biological interpretation of single-cell transcriptomic data. RESULTS: We demonstrate that contamination from this "soup" of cell-free RNAs is ubiquitous, with experiment-specific variations in composition and magnitude. We present a method, SoupX, for quantifying the extent of the contamination and estimating "background-corrected" cell expression profiles that seamlessly integrate with existing downstream analysis tools. Applying this method to several datasets using multiple droplet sequencing technologies, we demonstrate that its application improves biological interpretation of otherwise misleading data, as well as improving quality control metrics. CONCLUSIONS: We present SoupX, a tool for removing ambient RNA contamination from droplet-based single-cell RNA sequencing experiments. This tool has broad applicability, and its application can improve the biological utility of existing and future datasets.
BACKGROUND: Droplet-based single-cell RNA sequence analyses assume that all acquired RNAs are endogenous to cells. However, any cell-free RNAs contained within the input solution are also captured by these assays. This sequencing of cell-free RNA constitutes a background contamination that confounds the biological interpretation of single-cell transcriptomic data. RESULTS: We demonstrate that contamination from this "soup" of cell-free RNAs is ubiquitous, with experiment-specific variations in composition and magnitude. We present a method, SoupX, for quantifying the extent of the contamination and estimating "background-corrected" cell expression profiles that seamlessly integrate with existing downstream analysis tools. Applying this method to several datasets using multiple droplet sequencing technologies, we demonstrate that its application improves biological interpretation of otherwise misleading data, as well as improving quality control metrics. CONCLUSIONS: We present SoupX, a tool for removing ambient RNA contamination from droplet-based single-cell RNA sequencing experiments. This tool has broad applicability, and its application can improve the biological utility of existing and future datasets.
Authors: Evan Z Macosko; Anindita Basu; Rahul Satija; James Nemesh; Karthik Shekhar; Melissa Goldman; Itay Tirosh; Allison R Bialas; Nolan Kamitaki; Emily M Martersteck; John J Trombetta; David A Weitz; Joshua R Sanes; Alex K Shalek; Aviv Regev; Steven A McCarroll Journal: Cell Date: 2015-05-21 Impact factor: 41.582
Authors: Rapolas Zilionis; Juozas Nainys; Adrian Veres; Virginia Savova; David Zemmour; Allon M Klein; Linas Mazutis Journal: Nat Protoc Date: 2016-12-08 Impact factor: 13.491
Authors: Haynes Heaton; Arthur M Talman; Andrew Knights; Maria Imaz; Daniel J Gaffney; Richard Durbin; Martin Hemberg; Mara K N Lawniczak Journal: Nat Methods Date: 2020-05-04 Impact factor: 28.547
Authors: Grace X Y Zheng; Jessica M Terry; Phillip Belgrader; Paul Ryvkin; Zachary W Bent; Ryan Wilson; Solongo B Ziraldo; Tobias D Wheeler; Geoff P McDermott; Junjie Zhu; Mark T Gregory; Joe Shuga; Luz Montesclaros; Jason G Underwood; Donald A Masquelier; Stefanie Y Nishimura; Michael Schnall-Levin; Paul W Wyatt; Christopher M Hindson; Rajiv Bharadwaj; Alexander Wong; Kevin D Ness; Lan W Beppu; H Joachim Deeg; Christopher McFarland; Keith R Loeb; William J Valente; Nolan G Ericson; Emily A Stevens; Jerald P Radich; Tarjei S Mikkelsen; Benjamin J Hindson; Jason H Bielas Journal: Nat Commun Date: 2017-01-16 Impact factor: 14.919
Authors: Karsten Bach; Sara Pensa; Marta Grzelak; James Hadfield; David J Adams; John C Marioni; Walid T Khaled Journal: Nat Commun Date: 2017-12-11 Impact factor: 14.919
Authors: Dorin-Mirel Popescu; Rachel A Botting; Emily Stephenson; Kile Green; Simone Webb; Laura Jardine; Emily F Calderbank; Krzysztof Polanski; Issac Goh; Mirjana Efremova; Meghan Acres; Daniel Maunder; Peter Vegh; Yorick Gitton; Jong-Eun Park; Roser Vento-Tormo; Zhichao Miao; David Dixon; Rachel Rowell; David McDonald; James Fletcher; Elizabeth Poyner; Gary Reynolds; Michael Mather; Corina Moldovan; Lira Mamanova; Frankie Greig; Matthew D Young; Kerstin B Meyer; Steven Lisgo; Jaume Bacardit; Andrew Fuller; Ben Millar; Barbara Innes; Susan Lindsay; Michael J T Stubbington; Monika S Kowalczyk; Bo Li; Orr Ashenberg; Marcin Tabaka; Danielle Dionne; Timothy L Tickle; Michal Slyper; Orit Rozenblatt-Rosen; Andrew Filby; Peter Carey; Alexandra-Chloé Villani; Anindita Roy; Aviv Regev; Alain Chédotal; Irene Roberts; Berthold Göttgens; Sam Behjati; Elisa Laurenti; Sarah A Teichmann; Muzlifah Haniffa Journal: Nature Date: 2019-10-09 Impact factor: 69.504
Authors: Tomislav Ilicic; Jong Kyoung Kim; Aleksandra A Kolodziejczyk; Frederik Otzen Bagger; Davis James McCarthy; John C Marioni; Sarah A Teichmann Journal: Genome Biol Date: 2016-02-17 Impact factor: 13.583
Authors: Zoe A Clarke; Tallulah S Andrews; Jawairia Atif; Delaram Pouyabahar; Brendan T Innes; Sonya A MacParland; Gary D Bader Journal: Nat Protoc Date: 2021-05-24 Impact factor: 13.491
Authors: Adrienne Niederriter Shami; Xianing Zheng; Sarah K Munyoki; Qianyi Ma; Gabriel L Manske; Christopher D Green; Meena Sukhwani; Kyle E Orwig; Jun Z Li; Saher Sue Hammoud Journal: Dev Cell Date: 2020-06-05 Impact factor: 12.270
Authors: Andrew C Yang; Fabian Kern; Patricia M Losada; Maayan R Agam; Christina A Maat; Georges P Schmartz; Tobias Fehlmann; Julian A Stein; Nicholas Schaum; Davis P Lee; Kruti Calcuttawala; Ryan T Vest; Daniela Berdnik; Nannan Lu; Oliver Hahn; David Gate; M Windy McNerney; Divya Channappa; Inma Cobos; Nicole Ludwig; Walter J Schulz-Schaeffer; Andreas Keller; Tony Wyss-Coray Journal: Nature Date: 2021-06-21 Impact factor: 49.962
Authors: Duncan M Morgan; Bert Ruiter; Neal P Smith; Ang A Tu; Brinda Monian; Brandon E Stone; Navneet Virk-Hundal; Qian Yuan; Wayne G Shreffler; J Christopher Love Journal: Sci Immunol Date: 2021-08-13
Authors: Sonia Malaiya; Marcia Cortes-Gutierrez; Brian R Herb; Sydney R Coffey; Samuel R W Legg; Jeffrey P Cantle; Carlo Colantuoni; Jeffrey B Carroll; Seth A Ament Journal: J Neurosci Date: 2021-05-19 Impact factor: 6.167
Authors: Amin Abedini; Yuan O Zhu; Shatakshee Chatterjee; Gabor Halasz; Kishor Devalaraja-Narashimha; Rojesh Shrestha; Michael S Balzer; Jihwan Park; Tong Zhou; Ziyuan Ma; Katie Marie Sullivan; Hailong Hu; Xin Sheng; Hongbo Liu; Yi Wei; Carine M Boustany-Kari; Uptal Patel; Salem Almaani; Matthew Palmer; Raymond Townsend; Shira Blady; Jonathan Hogan; Lori Morton; Katalin Susztak Journal: J Am Soc Nephrol Date: 2021-02-02 Impact factor: 10.121