Literature DB >> 33560368

Per-sample standardization and asymmetric winsorization lead to accurate clustering of RNA-seq expression profiles.

Davide Risso1, Stefano Maria Pagnotta2.   

Abstract

MOTIVATION: Data transformations are an important step in the analysis of RNA-seq data. Nonetheless, the impact of transformation on the outcome of unsupervised clustering procedures is still unclear.
RESULTS: Here, we present an Asymmetric Winsorization per Sample Transformation (AWST), which is robust to data perturbations and removes the need for selecting the most informative genes prior to sample clustering. Our procedure leads to robust and biologically meaningful clusters both in bulk and in single-cell applications. AVAILABILITY: The AWST method is available at https://github.com/drisso/awst. The code to reproduce the analyses is available at https://github.com/drisso/awst\_analysis. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) (2021). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Entities:  

Year:  2021        PMID: 33560368      PMCID: PMC8388024          DOI: 10.1093/bioinformatics/btab091

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  34 in total

1.  Moderated statistical tests for assessing differences in tag abundance.

Authors:  Mark D Robinson; Gordon K Smyth
Journal:  Bioinformatics       Date:  2007-09-19       Impact factor: 6.937

2.  Benchmarking single cell RNA-sequencing analysis pipelines using mixture control experiments.

Authors:  Luyi Tian; Xueyi Dong; Saskia Freytag; Kim-Anh Lê Cao; Shian Su; Abolfazl JalalAbadi; Daniela Amann-Zalcenstein; Tom S Weber; Azadeh Seidi; Jafar S Jabbari; Shalin H Naik; Matthew E Ritchie
Journal:  Nat Methods       Date:  2019-05-27       Impact factor: 28.547

3.  Reference component analysis of single-cell transcriptomes elucidates cellular heterogeneity in human colorectal tumors.

Authors:  Huipeng Li; Elise T Courtois; Debarka Sengupta; Yuliana Tan; Kok Hao Chen; Jolene Jie Lin Goh; Say Li Kong; Clarinda Chua; Lim Kiat Hon; Wah Siew Tan; Mark Wong; Paul Jongjoon Choi; Lawrence J K Wee; Axel M Hillmer; Iain Beehuat Tan; Paul Robson; Shyam Prabhakar
Journal:  Nat Genet       Date:  2017-03-20       Impact factor: 38.330

4.  A step-by-step workflow for low-level analysis of single-cell RNA-seq data with Bioconductor.

Authors:  Aaron T L Lun; Davis J McCarthy; John C Marioni
Journal:  F1000Res       Date:  2016-08-31

5.  Molecular Profiling Reveals Biologically Discrete Subsets and Pathways of Progression in Diffuse Glioma.

Authors:  Michele Ceccarelli; Floris P Barthel; Tathiane M Malta; Thais S Sabedot; Sofie R Salama; Bradley A Murray; Olena Morozova; Yulia Newton; Amie Radenbaugh; Stefano M Pagnotta; Samreen Anjum; Jiguang Wang; Ganiraju Manyam; Pietro Zoppoli; Shiyun Ling; Arjun A Rao; Mia Grifford; Andrew D Cherniack; Hailei Zhang; Laila Poisson; Carlos Gilberto Carlotti; Daniela Pretti da Cunha Tirapelli; Arvind Rao; Tom Mikkelsen; Ching C Lau; W K Alfred Yung; Raul Rabadan; Jason Huse; Daniel J Brat; Norman L Lehman; Jill S Barnholtz-Sloan; Siyuan Zheng; Kenneth Hess; Ganesh Rao; Matthew Meyerson; Rameen Beroukhim; Lee Cooper; Rehan Akbani; Margaret Wrensch; David Haussler; Kenneth D Aldape; Peter W Laird; David H Gutmann; Houtan Noushmehr; Antonio Iavarone; Roel G W Verhaak
Journal:  Cell       Date:  2016-01-28       Impact factor: 41.582

6.  A comprehensive assessment of RNA-seq accuracy, reproducibility and information content by the Sequencing Quality Control Consortium.

Authors: 
Journal:  Nat Biotechnol       Date:  2014-08-24       Impact factor: 54.908

7.  Statistical models for RNA-seq data derived from a two-condition 48-replicate experiment.

Authors:  Marek Gierliński; Christian Cole; Pietà Schofield; Nicholas J Schurch; Alexander Sherstnev; Vijender Singh; Nicola Wrobel; Karim Gharbi; Gordon Simpson; Tom Owen-Hughes; Mark Blaxter; Geoffrey J Barton
Journal:  Bioinformatics       Date:  2015-07-23       Impact factor: 6.937

8.  Simultaneous epitope and transcriptome measurement in single cells.

Authors:  Marlon Stoeckius; Christoph Hafemeister; William Stephenson; Brian Houck-Loomis; Pratip K Chattopadhyay; Harold Swerdlow; Rahul Satija; Peter Smibert
Journal:  Nat Methods       Date:  2017-07-31       Impact factor: 28.547

9.  EmptyDrops: distinguishing cells from empty droplets in droplet-based single-cell RNA sequencing data.

Authors:  Aaron T L Lun; Samantha Riesenfeld; Tallulah Andrews; The Phuong Dao; Tomas Gomes; John C Marioni
Journal:  Genome Biol       Date:  2019-03-22       Impact factor: 13.583

10.  Cluster analysis on high dimensional RNA-seq data with applications to cancer research - An evaluation study.

Authors:  Linda Vidman; David Källberg; Patrik Rydén
Journal:  PLoS One       Date:  2019-12-05       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.