Chuanqi Wang1, Jun Li1. 1. Department of Applied and Computational Mathematics and Statistics, University of Notre Dame, Notre Dame, IN 46556, USA.
Abstract
MOTIVATION: Scaling by sequencing depth is usually the first step of analysis of bulk or single-cell RNA-seq data, but estimating sequencing depth accurately can be difficult, especially for single-cell data, risking the validity of downstream analysis. It is thus of interest to eliminate the use of sequencing depth and analyze the original count data directly. RESULTS: We call an analysis method 'scale-invariant' (SI) if it gives the same result under different estimates of sequencing depth and hence can use the original count data without scaling. For the problem of classifying samples into pre-specified classes, such as normal versus cancerous, we develop a deep-neural-network based SI classifier named scale-invariant deep neural-network classifier (SINC). On nine bulk and single-cell datasets, the classification accuracy of SINC is better than or competitive to the best of eight other classifiers. SINC is easier to use and more reliable on data where proper sequencing depth is hard to determine. AVAILABILITY AND IMPLEMENTATION: This source code of SINC is available at https://www.nd.edu/∼jli9/SINC.zip. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
MOTIVATION: Scaling by sequencing depth is usually the first step of analysis of bulk or single-cell RNA-seq data, but estimating sequencing depth accurately can be difficult, especially for single-cell data, risking the validity of downstream analysis. It is thus of interest to eliminate the use of sequencing depth and analyze the original count data directly. RESULTS: We call an analysis method 'scale-invariant' (SI) if it gives the same result under different estimates of sequencing depth and hence can use the original count data without scaling. For the problem of classifying samples into pre-specified classes, such as normal versus cancerous, we develop a deep-neural-network based SI classifier named scale-invariant deep neural-network classifier (SINC). On nine bulk and single-cell datasets, the classification accuracy of SINC is better than or competitive to the best of eight other classifiers. SINC is easier to use and more reliable on data where proper sequencing depth is hard to determine. AVAILABILITY AND IMPLEMENTATION: This source code of SINC is available at https://www.nd.edu/∼jli9/SINC.zip. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Authors: Florian Buettner; Kedar N Natarajan; F Paolo Casale; Valentina Proserpio; Antonio Scialdone; Fabian J Theis; Sarah A Teichmann; John C Marioni; Oliver Stegle Journal: Nat Biotechnol Date: 2015-01-19 Impact factor: 54.908
Authors: Lauren Fishbein; Ignaty Leshchiner; Vonn Walter; Ludmila Danilova; A Gordon Robertson; Amy R Johnson; Tara M Lichtenberg; Bradley A Murray; Hans K Ghayee; Tobias Else; Shiyun Ling; Stuart R Jefferys; Aguirre A de Cubas; Brandon Wenz; Esther Korpershoek; Antonio L Amelio; Liza Makowski; W Kimryn Rathmell; Anne-Paule Gimenez-Roqueplo; Thomas J Giordano; Sylvia L Asa; Arthur S Tischler; Karel Pacak; Katherine L Nathanson; Matthew D Wilkerson Journal: Cancer Cell Date: 2017-02-02 Impact factor: 31.743
Authors: Maayan Baron; Adrian Veres; Samuel L Wolock; Aubrey L Faust; Renaud Gaujoux; Amedeo Vetere; Jennifer Hyoje Ryu; Bridget K Wagner; Shai S Shen-Orr; Allon M Klein; Douglas A Melton; Itai Yanai Journal: Cell Syst Date: 2016-09-22 Impact factor: 10.304
Authors: Rhonda Bacher; Li-Fang Chu; Ning Leng; Audrey P Gasch; James A Thomson; Ron M Stewart; Michael Newton; Christina Kendziorski Journal: Nat Methods Date: 2017-04-17 Impact factor: 28.547