Yufei Xiao1, Tzu-Hung Hsiao, Uthra Suresh, Hung-I Harry Chen, Xiaowu Wu, Steven E Wolf, Yidong Chen. 1. Department of Epidemiology and Biostatistics, Computational Biology and Bioinformatics Division, Greehey Children's Cancer Research Institute, Department of Surgery, University of Texas Health Science Center at San Antonio, San Antonio, TX 78229, United States Army Institute of Surgical Research, Fort Sam Houston, TX 78234 and Cancer Therapy and Research Center, University of Texas Health Science Center at San Antonio, San Antonio, TX 78229, USA.
Abstract
MOTIVATION: When identifying differentially expressed (DE) genes from high-throughput gene expression measurements, we would like to take both statistical significance (such as P-value) and biological relevance (such as fold change) into consideration. In gene set enrichment analysis (GSEA), a score that can combine fold change and P-value together is needed for better gene ranking. RESULTS: We defined a gene significance score π-value by combining expression fold change and statistical significance (P-value), and explored its statistical properties. When compared to various existing methods, π-value based approach is more robust in selecting DE genes, with the largest area under curve in its receiver operating characteristic curve. We applied π-value to GSEA and found it comparable to P-value and t-statistic based methods, with added protection against false discovery in certain situations. Finally, in a gene functional study of breast cancer profiles, we showed that using π-value helps elucidating otherwise overlooked important biological functions. AVAILABILITY: http://gccri.uthscsa.edu/Pi_Value_Supplementary.asp CONTACT: xy@ieee.org, cheny8@uthscsa.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
MOTIVATION: When identifying differentially expressed (DE) genes from high-throughput gene expression measurements, we would like to take both statistical significance (such as P-value) and biological relevance (such as fold change) into consideration. In gene set enrichment analysis (GSEA), a score that can combine fold change and P-value together is needed for better gene ranking. RESULTS: We defined a gene significance score π-value by combining expression fold change and statistical significance (P-value), and explored its statistical properties. When compared to various existing methods, π-value based approach is more robust in selecting DE genes, with the largest area under curve in its receiver operating characteristic curve. We applied π-value to GSEA and found it comparable to P-value and t-statistic based methods, with added protection against false discovery in certain situations. Finally, in a gene functional study of breast cancer profiles, we showed that using π-value helps elucidating otherwise overlooked important biological functions. AVAILABILITY: http://gccri.uthscsa.edu/Pi_Value_Supplementary.asp CONTACT: xy@ieee.org, cheny8@uthscsa.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Authors: Yixin Wang; Jan G M Klijn; Yi Zhang; Anieta M Sieuwerts; Maxime P Look; Fei Yang; Dmitri Talantov; Mieke Timmermans; Marion E Meijer-van Gelder; Jack Yu; Tim Jatkoe; Els M J J Berns; David Atkins; John A Foekens Journal: Lancet Date: 2005 Feb 19-25 Impact factor: 79.321
Authors: Aravind Subramanian; Pablo Tamayo; Vamsi K Mootha; Sayan Mukherjee; Benjamin L Ebert; Michael A Gillette; Amanda Paulovich; Scott L Pomeroy; Todd R Golub; Eric S Lander; Jill P Mesirov Journal: Proc Natl Acad Sci U S A Date: 2005-09-30 Impact factor: 11.205
Authors: Yuliang Wang; Diana G Eng; Natalya V Kaverina; Carol J Loretz; Abbal Koirala; Shreeram Akilesh; Jeffrey W Pippin; Stuart J Shankland Journal: Kidney Int Date: 2020-06-25 Impact factor: 10.612
Authors: Ying Lin; Jacob M Wozniak; Neil J Grimsey; Sravan Girada; Anand Patwardhan; Olivia Molinar-Inglis; Thomas H Smith; John D Lapek; David J Gonzalez; JoAnn Trejo Journal: Proc Natl Acad Sci U S A Date: 2020-02-18 Impact factor: 11.205
Authors: Sarah A Comerford; Elizabeth A Hinnant; Yidong Chen; Hima Bansal; Shawn Klapproth; Dinesh Rakheja; Milton J Finegold; Dolores Lopez-Terrada; Kathryn A O'Donnell; Gail E Tomlinson; Robert E Hammer Journal: JCI Insight Date: 2016-10-06
Authors: Katherine I Zhou; Hailing Shi; Ruitu Lyu; Adam C Wylder; Żaneta Matuszek; Jessica N Pan; Chuan He; Marc Parisien; Tao Pan Journal: Mol Cell Date: 2019-08-21 Impact factor: 17.970
Authors: Ryan M Layer; Brent S Pedersen; Tonya DiSera; Gabor T Marth; Jason Gertz; Aaron R Quinlan Journal: Nat Methods Date: 2018-01-08 Impact factor: 28.547
Authors: Young-Mi Go; Roy L Sutliff; Joshua D Chandler; Rahman Khalidur; Bum-Yong Kang; Frank A Anania; Michael Orr; Li Hao; Bruce A Fowler; Dean P Jones Journal: Toxicol Sci Date: 2015-07-16 Impact factor: 4.849