Literature DB >> 34354339

Supervised t-distributed stochastic neighbor embedding for data visualization and classification.

Yichen Cheng1, Xinlei Wang2, Yusen Xia1.   

Abstract

We propose a novel supervised dimension reduction method, called supervised t-distributed stochastic neighbor embedding (St-SNE), which achieves dimension reduction by preserving the similarities of data points in both feature and outcome spaces. The proposed method can be used for both prediction and visualization tasks, with the ability to handle high-dimensional data. We show through a variety of datasets that when compared with a comprehensive list of existing methods, St-SNE has superior prediction performance in the ultra-high dimensional setting where the number of features p exceeds the sample size n, and has competitive performance in the p ≤ n setting. We also show that St-SNE is a competitive visualization tool that is capable of capturing within cluster variations. In addition, we propose a penalized Kullback-Leibler divergence criterion to automatically select the reduced dimension size k for St-SNE.

Entities:  

Keywords:  classification; dimension size estimation; supervised dimension reduction; ultra-high dimension; visualization

Year:  2020        PMID: 34354339      PMCID: PMC8330414          DOI: 10.1287/ijoc.2020.0961

Source DB:  PubMed          Journal:  INFORMS J Comput        ISSN: 1091-9856            Impact factor:   2.276


  7 in total

1.  Suitability of dysphonia measurements for telemonitoring of Parkinson's disease.

Authors:  Max A Little; Patrick E McSharry; Eric J Hunter; Jennifer Spielman; Lorraine O Ramig
Journal:  IEEE Trans Biomed Eng       Date:  2009-04       Impact factor: 4.538

2.  viSNE enables visualization of high dimensional single-cell data and reveals phenotypic heterogeneity of leukemia.

Authors:  El-ad David Amir; Kara L Davis; Michelle D Tadmor; Erin F Simonds; Jacob H Levine; Sean C Bendall; Daniel K Shenfeld; Smita Krishnaswamy; Garry P Nolan; Dana Pe'er
Journal:  Nat Biotechnol       Date:  2013-05-19       Impact factor: 54.908

3.  International application of a new probability algorithm for the diagnosis of coronary artery disease.

Authors:  R Detrano; A Janosi; W Steinbrunn; M Pfisterer; J J Schmid; S Sandhu; K H Guppy; S Lee; V Froelicher
Journal:  Am J Cardiol       Date:  1989-08-01       Impact factor: 2.778

4.  Genomic and epigenomic landscapes of adult de novo acute myeloid leukemia.

Authors:  Timothy J Ley; Christopher Miller; Li Ding; Benjamin J Raphael; Andrew J Mungall; A Gordon Robertson; Katherine Hoadley; Timothy J Triche; Peter W Laird; Jack D Baty; Lucinda L Fulton; Robert Fulton; Sharon E Heath; Joelle Kalicki-Veizer; Cyriac Kandoth; Jeffery M Klco; Daniel C Koboldt; Krishna-Latha Kanchi; Shashikant Kulkarni; Tamara L Lamprecht; David E Larson; Ling Lin; Charles Lu; Michael D McLellan; Joshua F McMichael; Jacqueline Payton; Heather Schmidt; David H Spencer; Michael H Tomasson; John W Wallis; Lukas D Wartman; Mark A Watson; John Welch; Michael C Wendl; Adrian Ally; Miruna Balasundaram; Inanc Birol; Yaron Butterfield; Readman Chiu; Andy Chu; Eric Chuah; Hye-Jung Chun; Richard Corbett; Noreen Dhalla; Ranabir Guin; An He; Carrie Hirst; Martin Hirst; Robert A Holt; Steven Jones; Aly Karsan; Darlene Lee; Haiyan I Li; Marco A Marra; Michael Mayo; Richard A Moore; Karen Mungall; Jeremy Parker; Erin Pleasance; Patrick Plettner; Jacquie Schein; Dominik Stoll; Lucas Swanson; Angela Tam; Nina Thiessen; Richard Varhol; Natasja Wye; Yongjun Zhao; Stacey Gabriel; Gad Getz; Carrie Sougnez; Lihua Zou; Mark D M Leiserson; Fabio Vandin; Hsin-Ta Wu; Frederick Applebaum; Stephen B Baylin; Rehan Akbani; Bradley M Broom; Ken Chen; Thomas C Motter; Khanh Nguyen; John N Weinstein; Nianziang Zhang; Martin L Ferguson; Christopher Adams; Aaron Black; Jay Bowen; Julie Gastier-Foster; Thomas Grossman; Tara Lichtenberg; Lisa Wise; Tanja Davidsen; John A Demchok; Kenna R Mills Shaw; Margi Sheth; Heidi J Sofia; Liming Yang; James R Downing; Greg Eley
Journal:  N Engl J Med       Date:  2013-05-01       Impact factor: 91.245

5.  Identifying disease-associated copy number variations by a doubly penalized regression model.

Authors:  Yichen Cheng; James Y Dai; Xiaoyu Wang; Charles Kooperberg
Journal:  Biometrics       Date:  2018-06-12       Impact factor: 2.571

6.  A shared transcriptional program in early breast neoplasias despite genetic and clinical distinctions.

Authors:  Alayne L Brunner; Jun Li; Xiangqian Guo; Robert T Sweeney; Sushama Varma; Shirley X Zhu; Rui Li; Robert Tibshirani; Robert B West
Journal:  Genome Biol       Date:  2014-05-23       Impact factor: 13.583

7.  An integrated model of the transcriptome of HER2-positive breast cancer.

Authors:  Krishna R Kalari; Brian M Necela; Xiaojia Tang; Kevin J Thompson; Melissa Lau; Jeanette E Eckel-Passow; Jennifer M Kachergus; S Keith Anderson; Zhifu Sun; Saurabh Baheti; Jennifer M Carr; Tiffany R Baker; Poulami Barman; Derek C Radisky; Richard W Joseph; Sarah A McLaughlin; High-seng Chai; Stephan Camille; David Rossell; Yan W Asmann; E Aubrey Thompson; Edith A Perez
Journal:  PLoS One       Date:  2013-11-01       Impact factor: 3.240

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.