Pan Tong1, Yong Chen, Xiao Su, Kevin R Coombes. 1. Department of Bioinformatics and Computational Biology, The University of Texas M. D. Anderson Cancer Center, Houston, TX, USA.
Abstract
MOTIVATION: Identification of bimodally expressed genes is an important task, as genes with bimodal expression play important roles in cell differentiation, signalling and disease progression. Several useful algorithms have been developed to identify bimodal genes from microarray data. Currently, no method can deal with data from next-generation sequencing, which is emerging as a replacement technology for microarrays. RESULTS: We present SIBER (systematic identification of bimodally expressed genes using RNAseq data) for effectively identifying bimodally expressed genes from next-generation RNAseq data. We evaluate several candidate methods for modelling RNAseq count data and compare their performance in identifying bimodal genes through both simulation and real data analysis. We show that the lognormal mixture model performs best in terms of power and robustness under various scenarios. We also compare our method with alternative approaches, including profile analysis using clustering and kurtosis (PACK) and cancer outlier profile analysis (COPA). Our method is robust, powerful, invariant to shifting and scaling, has no blind spots and has a sample-size-free interpretation. AVAILABILITY: The R package SIBER is available at the website http://bioinformatics.mdanderson.org/main/OOMPA:Overview.
MOTIVATION: Identification of bimodally expressed genes is an important task, as genes with bimodal expression play important roles in cell differentiation, signalling and disease progression. Several useful algorithms have been developed to identify bimodal genes from microarray data. Currently, no method can deal with data from next-generation sequencing, which is emerging as a replacement technology for microarrays. RESULTS: We present SIBER (systematic identification of bimodally expressed genes using RNAseq data) for effectively identifying bimodally expressed genes from next-generation RNAseq data. We evaluate several candidate methods for modelling RNAseq count data and compare their performance in identifying bimodal genes through both simulation and real data analysis. We show that the lognormal mixture model performs best in terms of power and robustness under various scenarios. We also compare our method with alternative approaches, including profile analysis using clustering and kurtosis (PACK) and cancer outlier profile analysis (COPA). Our method is robust, powerful, invariant to shifting and scaling, has no blind spots and has a sample-size-free interpretation. AVAILABILITY: The R package SIBER is available at the website http://bioinformatics.mdanderson.org/main/OOMPA:Overview.
Authors: Nicole Cloonan; Alistair R R Forrest; Gabriel Kolle; Brooke B A Gardiner; Geoffrey J Faulkner; Mellissa K Brown; Darrin F Taylor; Anita L Steptoe; Shivangi Wani; Graeme Bethel; Alan J Robertson; Andrew C Perkins; Stephen J Bruce; Clarence C Lee; Swati S Ranade; Heather E Peckham; Jonathan M Manning; Kevin J McKernan; Sean M Grimmond Journal: Nat Methods Date: 2008-05-30 Impact factor: 28.547
Authors: Stephen R Piccolo; Michelle R Withers; Owen E Francis; Andrea H Bild; W Evan Johnson Journal: Proc Natl Acad Sci U S A Date: 2013-10-15 Impact factor: 11.205
Authors: Mianen Sun; Pan Tong; Wen Kong; Baijun Dong; Yiran Huang; In Young Park; Lijun Zhou; Xian-De Liu; Zhiyong Ding; Xuesong Zhang; Shanshan Bai; Peter German; Reid Powell; Quan Wang; Xuefei Tong; Nizar M Tannir; Surena F Matin; W Kimryn Rathmell; Gregory N Fuller; Ian E McCutcheon; Cheryl L Walker; Jing Wang; Eric Jonasch Journal: Cancer Res Date: 2017-08-14 Impact factor: 12.701
Authors: Triparna Sen; Pan Tong; C Allison Stewart; Sandra Cristea; Aly Valliani; David S Shames; Abena B Redwood; You Hong Fan; Lerong Li; Bonnie S Glisson; John D Minna; Julien Sage; Don L Gibbons; Helen Piwnica-Worms; John V Heymach; Jing Wang; Lauren Averett Byers Journal: Cancer Res Date: 2017-05-10 Impact factor: 12.701
Authors: Patrick K Kimes; Christopher R Cabanski; Matthew D Wilkerson; Ni Zhao; Amy R Johnson; Charles M Perou; Liza Makowski; Christopher A Maher; Yufeng Liu; J S Marron; D Neil Hayes Journal: Nucleic Acids Res Date: 2014-07-16 Impact factor: 16.971
Authors: C Allison Stewart; Carl M Gay; Yuanxin Xi; Santhosh Sivajothi; V Sivakamasundari; Junya Fujimoto; Mohan Bolisetty; Patrice M Hartsfield; Veerakumar Balasubramaniyan; Milind D Chalishazar; Cesar Moran; Neda Kalhor; John Stewart; Hai Tran; Stephen G Swisher; Jack A Roth; Jianjun Zhang; John de Groot; Bonnie Glisson; Trudy G Oliver; John V Heymach; Ignacio Wistuba; Paul Robson; Jing Wang; Lauren Averett Byers Journal: Nat Cancer Date: 2020-02-17
Authors: Pan Tong; Lixia Diao; Li Shen; Lerong Li; John Victor Heymach; Luc Girard; John D Minna; Kevin R Coombes; Lauren Averett Byers; Jing Wang Journal: Cancer Inform Date: 2016-05-10